SlideShare una empresa de Scribd logo
1 de 19
Community content building for evolutionary biology Lessons learned from LepTree and Encyclopedia of Life Cynthia Parr Smithsonian Institution University of Maryland
Today’s story LepTree and Encyclopedia of Life built a couple of websites LepTree: slow for social content-building but highly useful content EOL: quick for content aggregation, but now need to atomize and semanticize Conclusion: Best of both worlds
LepTreehttp://leptree.net
Community features Blog Commenting Forum Working Groups
Complex LepTreetaxontemplate
LepTree built semantic tools, then invited data entry Export
http://www.eol.org ,[object Object]
Freely accessible: open access, open source
Available from a single portal in a common format
Quality
Always growing as new species are discovered and new knowledge is generated,[object Object]
http://www.eol.org/content_partner Objects can come from many partners Objects are sorted by topic Each partner gets credit
EOL aggregates, then annotates Catalogue of Life IUCN Content providers Databases LifeDesks 	Public contribution Curating Commenting Tagging GBIF Biodiversity Heritage Library http://www.eol.org/content_partner
LepTree’s data approach is more complex and customized  LepTree ,[object Object]
Big S semantics (OWL, RDF triple store). Tied to people and project ontologies
Custom data entry: required new workflowEOL ,[object Object]
XML schema
Variety of data paths: avoid changes in workflow,[object Object]
1750 pages (107 rich pages + ~450 fossils) + ~1600 images)

Más contenido relacionado

Similar a Community content building for evolutionary biology: Lessons learned from LepTree and Encyclopedia of Life

5 steps to using open access in the classroom 11 9 2011
5 steps to using open access in the classroom 11 9 2011 5 steps to using open access in the classroom 11 9 2011
5 steps to using open access in the classroom 11 9 2011
Elizabeth Brown
 
Introduction to EOL v2 for Crossroads
Introduction to EOL v2 for Crossroads Introduction to EOL v2 for Crossroads
Introduction to EOL v2 for Crossroads
Cyndy Parr
 
Using Online Natural History Databases to Support Innovation in Undergraduate...
Using Online Natural History Databases to Support Innovation in Undergraduate...Using Online Natural History Databases to Support Innovation in Undergraduate...
Using Online Natural History Databases to Support Innovation in Undergraduate...
Encyclopedia of Life Learning + Education
 
Libraries,librarians,social media
Libraries,librarians,social mediaLibraries,librarians,social media
Libraries,librarians,social media
Anne Peoples
 

Similar a Community content building for evolutionary biology: Lessons learned from LepTree and Encyclopedia of Life (20)

Introduction to EOL.org for scientists
Introduction to EOL.org for scientistsIntroduction to EOL.org for scientists
Introduction to EOL.org for scientists
 
The OER in COERLL: Defining Open Education
The OER in COERLL: Defining Open EducationThe OER in COERLL: Defining Open Education
The OER in COERLL: Defining Open Education
 
5 steps to using open access in the classroom 11 9 2011
5 steps to using open access in the classroom 11 9 2011 5 steps to using open access in the classroom 11 9 2011
5 steps to using open access in the classroom 11 9 2011
 
Introduction to Open Educational Resources (OER)
 Introduction to Open Educational Resources (OER) Introduction to Open Educational Resources (OER)
Introduction to Open Educational Resources (OER)
 
Introduction to EOL v2 for Crossroads
Introduction to EOL v2 for Crossroads Introduction to EOL v2 for Crossroads
Introduction to EOL v2 for Crossroads
 
Introducing Encyclopedia of Life version 2
Introducing Encyclopedia of Life version 2Introducing Encyclopedia of Life version 2
Introducing Encyclopedia of Life version 2
 
Using Online Natural History Databases to Support Innovation in Undergraduate...
Using Online Natural History Databases to Support Innovation in Undergraduate...Using Online Natural History Databases to Support Innovation in Undergraduate...
Using Online Natural History Databases to Support Innovation in Undergraduate...
 
One Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific PublishersOne Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific Publishers
 
Open Educational Resources (OER) for Enhancing Teaching and Learning
Open Educational Resources (OER) for Enhancing Teaching and LearningOpen Educational Resources (OER) for Enhancing Teaching and Learning
Open Educational Resources (OER) for Enhancing Teaching and Learning
 
UCT Opencontent 1 Year Anniversary
UCT Opencontent 1 Year AnniversaryUCT Opencontent 1 Year Anniversary
UCT Opencontent 1 Year Anniversary
 
Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?
 
Competitive & Saleable E-Content for Philippine Libraries
Competitive & Saleable E-Content for Philippine LibrariesCompetitive & Saleable E-Content for Philippine Libraries
Competitive & Saleable E-Content for Philippine Libraries
 
Using OA Content
Using OA ContentUsing OA Content
Using OA Content
 
If They Build It They Will Come
If They Build It They Will ComeIf They Build It They Will Come
If They Build It They Will Come
 
CRE Resource Creation and Discovery
CRE Resource Creation and DiscoveryCRE Resource Creation and Discovery
CRE Resource Creation and Discovery
 
The OERs: Transforming Education for Sustainable Future by Dr. Sarita Anand
The OERs: Transforming Education for Sustainable Future by Dr. Sarita AnandThe OERs: Transforming Education for Sustainable Future by Dr. Sarita Anand
The OERs: Transforming Education for Sustainable Future by Dr. Sarita Anand
 
The repository ecology: an approach to understanding repository and service i...
The repository ecology: an approach to understanding repository and service i...The repository ecology: an approach to understanding repository and service i...
The repository ecology: an approach to understanding repository and service i...
 
NORFest 2023 Lightning Talks Session One
NORFest 2023 Lightning Talks Session OneNORFest 2023 Lightning Talks Session One
NORFest 2023 Lightning Talks Session One
 
Libraries,librarians,social media
Libraries,librarians,social mediaLibraries,librarians,social media
Libraries,librarians,social media
 
OER: JTCC
OER: JTCCOER: JTCC
OER: JTCC
 

Más de Cyndy Parr

Parr ag datacommonsnal_brownbag
Parr ag datacommonsnal_brownbagParr ag datacommonsnal_brownbag
Parr ag datacommonsnal_brownbag
Cyndy Parr
 
Encyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypesEncyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypes
Cyndy Parr
 

Más de Cyndy Parr (20)

Open data and the ag data commons
Open data and the ag data commonsOpen data and the ag data commons
Open data and the ag data commons
 
Ag Data Commons for AgBioData
Ag Data Commons for AgBioDataAg Data Commons for AgBioData
Ag Data Commons for AgBioData
 
Biodiversity informatics and the agricultural data landscape
Biodiversity informatics and the agricultural data landscapeBiodiversity informatics and the agricultural data landscape
Biodiversity informatics and the agricultural data landscape
 
Public access to research results at USDA
Public access to research results at USDAPublic access to research results at USDA
Public access to research results at USDA
 
Ag Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and dataAg Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and data
 
Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Ag Data Commons: A new USDA catalog and repository for agricultural research ...Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Ag Data Commons: A new USDA catalog and repository for agricultural research ...
 
Preparing for data-intensive science across domains.
Preparing for data-intensive science across domains.Preparing for data-intensive science across domains.
Preparing for data-intensive science across domains.
 
Parr ag datacommonsnal_brownbag
Parr ag datacommonsnal_brownbagParr ag datacommonsnal_brownbag
Parr ag datacommonsnal_brownbag
 
Ag Data Commons: Adding Value to open agricultural research data
Ag Data Commons: Adding Value to open agricultural research dataAg Data Commons: Adding Value to open agricultural research data
Ag Data Commons: Adding Value to open agricultural research data
 
Big Data Initiatives for Agroecosystems
Big Data Initiatives for AgroecosystemsBig Data Initiatives for Agroecosystems
Big Data Initiatives for Agroecosystems
 
TDWG 2014 opening talk: Chair's Welcome
TDWG 2014 opening talk: Chair's WelcomeTDWG 2014 opening talk: Chair's Welcome
TDWG 2014 opening talk: Chair's Welcome
 
Behavior ontology workshop princeton
Behavior ontology workshop princetonBehavior ontology workshop princeton
Behavior ontology workshop princeton
 
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
 
Frontiers of discovery with Encyclopedia of Life
Frontiers of discovery with Encyclopedia of LifeFrontiers of discovery with Encyclopedia of Life
Frontiers of discovery with Encyclopedia of Life
 
Practical interoperability across semantic stores of data for ecological, tax...
Practical interoperability across semantic stores of data for ecological, tax...Practical interoperability across semantic stores of data for ecological, tax...
Practical interoperability across semantic stores of data for ecological, tax...
 
Using and extending Darwin Core for structured attribute data
Using and extending Darwin Core for structured attribute dataUsing and extending Darwin Core for structured attribute data
Using and extending Darwin Core for structured attribute data
 
How the Encyclopedia of Life is wrangling organismal attribute data
How the Encyclopedia of Life is wrangling organismal attribute dataHow the Encyclopedia of Life is wrangling organismal attribute data
How the Encyclopedia of Life is wrangling organismal attribute data
 
The Road to TraitBank: What's Next for the Encyclopedia of Life
The Road to TraitBank: What's Next for the Encyclopedia of LifeThe Road to TraitBank: What's Next for the Encyclopedia of Life
The Road to TraitBank: What's Next for the Encyclopedia of Life
 
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...
 
Encyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypesEncyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypes
 

Último

Último (20)

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 

Community content building for evolutionary biology: Lessons learned from LepTree and Encyclopedia of Life

Notas del editor

  1. I’m going to do a compare and contrast talk, so I have two projects to introduce you to. I apologize in advance if I go a bit quickly. Please feel free to catch me anytime in the next two days to get a demonstration of either of these projects
  2. Conclusion is that these are complementary approaches – can pursue in parallel. Focus on community driven databases that can be customized for the needs of the users of the data – result in highly atomized specialist data. Then alllow that information to be aggregated on EOL where it might find broader reuse and reinterpretation.
  3. LepTree is an Assembling the tree of life project whose major goal is to use nuclear genetic sequence to resolve deep nodes at the family and superfamily level in the Lepidoptera. This tree on the left shows our initial published findings which are not the point of this talk. I’ll just note that our analysis suggests that macrolepidoptera, shown by these orange bars, the very large moths and all butterflies, are clearly not a monophyletic group.The subject of today’s talk is the website tools we’ve created at leptree.net that include some features such as an interactive matrix visualizion of the sequencing status for the project of the where columns are each of the genes being sequenced and the rows show the hundreds of samples being used by the project, colors show our progress for each gene.We also have a fossil project and a morphology project that also have representation on our pages.
  4. The leptree website is built on a core of the open source drupal platform, and includes a number of the out-of-the-box community features, blog, discussion forum, commenting, the ability to create private working areas.In addition we have added new modules to allow community members to add information about their own projects, to post protocols that they are using so that they can link to them and other people can use the same protocols. Finally, we have a references module that lists about 800 articles on lepidopteransystematics. Rather than using the relational database that is the backend of drupal, these are actually storing data semantically – as RDF triples linked to rich ontologies.
  5. And finally, we also set up a custom module that presents a user with a complex temlpate for describing taxa. The checkboxes and data fields are the result of months of consultation with lepidopterists and are intended to cover the kinds of morphological and ecological variation across the group. Like the projects, protocols, and references modules, the data are stored in a sesame triple store repository. We can use this semantic representation to link our knowledge to that generated by other projects and use machine reasoning to come up with new results. This is the kind of data that would be appropriate to “decorate” a phylogenetic tree to look for patterns.The goal is to produce about 150 of these taxon pages but we designed the system to be expandable.
  6. So to summarize, LepTree built some semantics-enabled tools, combine this with data and links from a couple of other projects to create the taxonomic information pages you can see on LepTree.net under “Knowledge project”In addition, the taxon information is now being exported as text objects and also appears on the Encyclopedia of Life taxon pages.
  7. Objects such as these are essentially chunks of text sorted by topic.Each of these credits the source, and can receive comments or ratings, or can be trusted or untrusted by curators.
  8. So, the approach of EOL is rather different. EOL is a giant mashup that creates pages, that are then available for curators to assess and rate, or for anybody to provide comments or tags.LepTree has foccuseed on data entry tools while EOL has not – though I should note that we have also developed a Drupal-based system called LifeDesks, which are one of the many ways that data flows to the central EOL.
  9. On LepTree, burden on users to learn a new systemOn EOL, burden on programming staff, not on users
  10. The effort we went to in Leptree to add semantics to the tools likely just slowed us down, and distracted us from the effort of developing a community effort. But once we had tools with lots of checkboxes we have been able to accumulate a lot of potentially useful atomized data.By divide and conquer I mean that it should be possible to continue to promote community databases – these can be tailored to the specific needs of a scientific community and its audiences, with data as structured as possible. And then The data from these projects can be aggregated, essentially cross-indexed, so that they are accesssible from a common portal, EOL. If EOL had tried to structure or semanticize from the beginning we never would have achieved the growth we have.
  11. Build contentExpose triplesShare data