SlideShare a Scribd company logo
1 of 14
Linking Biodiversity Data for
Ecology: Case Studies in what we
can do with existing tools
Anne E. Thessen
http://www.slideshare.net/athessen
Acknowledgements
• David Patterson
• Dima Mozzherin
• David Shorthouse
• Cyndy Parr
• Paula Mabee
• Wasila Dahdul
• Sami Domisch
• Global Names Project
• Phenoscape
• Encyclopedia of Life
• Map of Life
• RDA/US Scholars
Program
• US National Science
Foundation
Case Studies
• Capturing species interactions using the
Encyclopedia of Life (EOL) and Global Names
Recognition and Discovery (GNRD)
• Linking traits and habitats using the
Phenoscape knowledgebase, Global
Biodiversity Information Facility (GBIF), and
Map of Life
Species Interactions
• Mine data about interactions from text
objects in EOL
• Create a “digital ecosystem”
• PLoS ONE
10.1371/journal.pone.
0089550
Workflow and Methods
• EOL provided us with a list of ID numbers for
their species
• Use the EOL API to access text objects under
the “Associations”, “Trophic Strategy”, etc.
Headings (http://eol.org/api)
• Use the GNRD API to find names in the text
objects (http://gnrd.globalnames.org/)
• Use Resolver to get the EOL ID corresponding
to each name found in the text
Workflow and Methods
EOL GNRD Results
EOL
TraitBank
Interaction mediated
between EOL and
GNRD APIs
GNRD API returns
results resolved to
EOL IDs
Machine-readable
results are added to
GloBI and visible in
TraitBank
GNRD
• Tool for finding taxonomic names in text
• Combination of TaxonFinder and Neti Neti
• Capable of some name reconciliation
• Recent overall performance evaluation on
published manuscripts and data files
– Precision = 0.880
– Recall = 0.642
– F1 Score = 0.742
• Largest sources of error were caused by table and
figure formatting and unusual punctuation
Resolver
• Tool for resolving taxonomic names in text
against an authority
• User can turn resolver on or off
• User can choose all or one of eight authorities
including CoL, IPNI, GBIF, etc.
• We resolved against EOL to get the
corresponding taxon IDs
Results
• Association detection performance
– Precision = 0.844
– Recall = 0.930
– F1 Score = 0.885
• Information extracted from entirety of EOL
and data set is part of GloBI
Linking Phenotypes to Environment
• Phenoscape and TraitBank link phenotypes to
taxa
• GBIF and Map of Life link taxa to locations and
habitats
• Can we take the extra step and link phenotypes
to environments?
Workflow and Methods
• Phenoscape provided us with a list of fish with
the miniaturization phenotype and a list of
sister taxa that are not miniaturized
• A search of these taxa in GBIF returned
geographic coordinates
• Those coordinates were given to Map of Life
who provided environmental data
• Used double-tailed t test to analyze result
Workflow and Methods
Phenoscape GBIF
Map of
Life
Interaction mediated
manually
Coordinates
Results
• We got back temperature, precipitation, slope,
land cover, and geology data
• Miniature fishes occur in wetter, warmer
environments
Variable Type Mean p value
Temperature mini 24.8 0.002
non mini 22.6
Precipitation mini 6.9 X 107 0.008
non mini 1.8 X 107
Relevant References
• Midford, P., P. Mabee, T. Vision, H. Lapp, J. Balhoff, W. Dahdul, C. Kothari, J.
Lundberg, and M. Westerfield. 2009. Phenoscape: Ontologies for large
multi-species phenotype datasets. Zoological Journal of the Linnean
Society 151: 691-757.
• Parr, C.S., N. Wilson, P. Leary, K.S. Schulz, K. Lans, L. Walley, J.A. Hammock,
A. Goddard, J. Rice, M. Studer, J.T.G. Holmes, and R.J. Corrigan Jr. 2014. The
Encyclopedia of Life v 2: Providing global access to knowledge about life
on Earth. Biodiversity Data Journal 2:e1079
• Thessen, A.E. and C.S. Parr. 2014. Knowledge extraction and semantic
annotation of text from the Encyclopedia of Life. PLoS ONE
http://dx.plos.org/10.1371/journal.pone.0089550
• Thessen, A.E., D.P. Shorthouse, D. Mozzherin, and D.J. Patterson. in prep.
Taxonomic names discovery to improve data discoverability.
• Thessen, A.E. et al. in prep. Linking phenotypes to environments
• Tuanmu, M.N. and W. Jetz. 2014. A global 1-km consensus land-cover
product for biodiversity and ecosystem modeling. Global Ecology and
Biogeography 23:1031-1045

More Related Content

What's hot

Summary of topic 2.3
Summary of topic 2.3Summary of topic 2.3
Summary of topic 2.3
Michael Smith
 

What's hot (20)

Potential for Community Photography to Help Evaluate Molt Phenology in Mounta...
Potential for Community Photography to Help Evaluate Molt Phenology in Mounta...Potential for Community Photography to Help Evaluate Molt Phenology in Mounta...
Potential for Community Photography to Help Evaluate Molt Phenology in Mounta...
 
High-throughput sequencing and latent variable modelling of within-host paras...
High-throughput sequencing and latent variable modelling of within-host paras...High-throughput sequencing and latent variable modelling of within-host paras...
High-throughput sequencing and latent variable modelling of within-host paras...
 
Earth Sciences / Geography Graduate Student Workshop November 11 2015
Earth Sciences / Geography Graduate Student Workshop November 11 2015Earth Sciences / Geography Graduate Student Workshop November 11 2015
Earth Sciences / Geography Graduate Student Workshop November 11 2015
 
Knowledge Base by Jerry Mead, Ph.D., Assistant Scientist & Section Leader, Ac...
Knowledge Base by Jerry Mead, Ph.D., Assistant Scientist & Section Leader, Ac...Knowledge Base by Jerry Mead, Ph.D., Assistant Scientist & Section Leader, Ac...
Knowledge Base by Jerry Mead, Ph.D., Assistant Scientist & Section Leader, Ac...
 
Gabriel laporta: Biodiversity can help prevent malaria outbreaks in tropical ...
Gabriel laporta: Biodiversity can help prevent malaria outbreaks in tropical ...Gabriel laporta: Biodiversity can help prevent malaria outbreaks in tropical ...
Gabriel laporta: Biodiversity can help prevent malaria outbreaks in tropical ...
 
Species Diversity Concepts
Species Diversity ConceptsSpecies Diversity Concepts
Species Diversity Concepts
 
Ecological sampling
Ecological samplingEcological sampling
Ecological sampling
 
Discovery Day Mertes
Discovery Day MertesDiscovery Day Mertes
Discovery Day Mertes
 
AER & FAME Review May 2014
AER & FAME Review May 2014AER & FAME Review May 2014
AER & FAME Review May 2014
 
Karl kjer : Passionate about entomology
Karl kjer : Passionate about entomologyKarl kjer : Passionate about entomology
Karl kjer : Passionate about entomology
 
Open Virus Indian Presentation
Open Virus Indian PresentationOpen Virus Indian Presentation
Open Virus Indian Presentation
 
EOL and Biotrackers HCIL Symposium Talk 5.26.2011
EOL and Biotrackers HCIL Symposium Talk 5.26.2011EOL and Biotrackers HCIL Symposium Talk 5.26.2011
EOL and Biotrackers HCIL Symposium Talk 5.26.2011
 
GloBI @ Berkeley Institute for Data Science Feb 5, 2015
GloBI @ Berkeley Institute for Data Science Feb 5, 2015GloBI @ Berkeley Institute for Data Science Feb 5, 2015
GloBI @ Berkeley Institute for Data Science Feb 5, 2015
 
Ecological Monitoring Techniques
Ecological Monitoring TechniquesEcological Monitoring Techniques
Ecological Monitoring Techniques
 
Protection goals, assessment endpoints, ecosystem services and biodiversity
Protection goals, assessment endpoints, ecosystem services and biodiversityProtection goals, assessment endpoints, ecosystem services and biodiversity
Protection goals, assessment endpoints, ecosystem services and biodiversity
 
Exploring the effects of climate change on marine species using linked data
Exploring the effects of climate change on marine species using linked dataExploring the effects of climate change on marine species using linked data
Exploring the effects of climate change on marine species using linked data
 
2.3.5
2.3.52.3.5
2.3.5
 
Alternate animal use in research
Alternate animal use in researchAlternate animal use in research
Alternate animal use in research
 
SRS Final Presentation
SRS Final Presentation SRS Final Presentation
SRS Final Presentation
 
Summary of topic 2.3
Summary of topic 2.3Summary of topic 2.3
Summary of topic 2.3
 

Viewers also liked

Disturbance ecology- UM
Disturbance ecology- UMDisturbance ecology- UM
Disturbance ecology- UM
Mark McGinley
 
Hotspots of biodiversity
Hotspots of biodiversityHotspots of biodiversity
Hotspots of biodiversity
Somya Bagai
 
Bridging discrepancies across North American butterfly naming authorities: Su...
Bridging discrepancies across North American butterfly naming authorities: Su...Bridging discrepancies across North American butterfly naming authorities: Su...
Bridging discrepancies across North American butterfly naming authorities: Su...
Anne Thessen
 
Biodiversity conservation
Biodiversity conservationBiodiversity conservation
Biodiversity conservation
rajeshap
 

Viewers also liked (16)

Ontological Support of Data Discovery and Synthesis in Estuarine and Coastal ...
Ontological Support of Data Discovery and Synthesis in Estuarine and Coastal ...Ontological Support of Data Discovery and Synthesis in Estuarine and Coastal ...
Ontological Support of Data Discovery and Synthesis in Estuarine and Coastal ...
 
Next-Gen Taxonomic Descriptions for Microbial Eukaryotes
Next-Gen Taxonomic Descriptions for Microbial EukaryotesNext-Gen Taxonomic Descriptions for Microbial Eukaryotes
Next-Gen Taxonomic Descriptions for Microbial Eukaryotes
 
Knowledge Extraction and Semantic Linking in the Encyclopedia of Life
Knowledge Extraction and Semantic Linking in the Encyclopedia of LifeKnowledge Extraction and Semantic Linking in the Encyclopedia of Life
Knowledge Extraction and Semantic Linking in the Encyclopedia of Life
 
The Future of Microalgal Taxonomy
The Future of Microalgal TaxonomyThe Future of Microalgal Taxonomy
The Future of Microalgal Taxonomy
 
Data Infrastructure for Coastal and Estuarine Science
Data Infrastructure for Coastal and Estuarine ScienceData Infrastructure for Coastal and Estuarine Science
Data Infrastructure for Coastal and Estuarine Science
 
Tokyo Green Space Presentation
Tokyo Green Space PresentationTokyo Green Space Presentation
Tokyo Green Space Presentation
 
An introduction to biodiversity conservation
An introduction to biodiversity conservationAn introduction to biodiversity conservation
An introduction to biodiversity conservation
 
Gulf of Mexico Hydrocarbon Database: Integrating Heterogeneous Data for Impro...
Gulf of Mexico Hydrocarbon Database: Integrating Heterogeneous Data for Impro...Gulf of Mexico Hydrocarbon Database: Integrating Heterogeneous Data for Impro...
Gulf of Mexico Hydrocarbon Database: Integrating Heterogeneous Data for Impro...
 
Nathan Phillips - The Ecology of the City
Nathan Phillips - The Ecology of the CityNathan Phillips - The Ecology of the City
Nathan Phillips - The Ecology of the City
 
Urban ecology: will we act before its too late?
Urban ecology: will we act before its too late?Urban ecology: will we act before its too late?
Urban ecology: will we act before its too late?
 
Disturbance ecology- UM
Disturbance ecology- UMDisturbance ecology- UM
Disturbance ecology- UM
 
Hotspots of biodiversity
Hotspots of biodiversityHotspots of biodiversity
Hotspots of biodiversity
 
FUTURE CITIES: All-Sustainable (Innovative/Smart/Digital/Technological//Green...
FUTURE CITIES: All-Sustainable (Innovative/Smart/Digital/Technological//Green...FUTURE CITIES: All-Sustainable (Innovative/Smart/Digital/Technological//Green...
FUTURE CITIES: All-Sustainable (Innovative/Smart/Digital/Technological//Green...
 
Top 8 global megatrends
Top 8 global megatrendsTop 8 global megatrends
Top 8 global megatrends
 
Bridging discrepancies across North American butterfly naming authorities: Su...
Bridging discrepancies across North American butterfly naming authorities: Su...Bridging discrepancies across North American butterfly naming authorities: Su...
Bridging discrepancies across North American butterfly naming authorities: Su...
 
Biodiversity conservation
Biodiversity conservationBiodiversity conservation
Biodiversity conservation
 

Similar to Linking biodiversity data for ecology

Keynote Speaker 1 - Data Intensive Challenges in Biodiversity Conservation: a...
Keynote Speaker 1 - Data Intensive Challenges in Biodiversity Conservation: a...Keynote Speaker 1 - Data Intensive Challenges in Biodiversity Conservation: a...
Keynote Speaker 1 - Data Intensive Challenges in Biodiversity Conservation: a...
TERN Australia
 
Encyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypesEncyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypes
Cyndy Parr
 

Similar to Linking biodiversity data for ecology (20)

iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK
 
Frontiers of discovery with Encyclopedia of Life
Frontiers of discovery with Encyclopedia of LifeFrontiers of discovery with Encyclopedia of Life
Frontiers of discovery with Encyclopedia of Life
 
The emerging biodiversity data ecosystem
The emerging biodiversity data ecosystemThe emerging biodiversity data ecosystem
The emerging biodiversity data ecosystem
 
Keynote Speaker 1 - Data Intensive Challenges in Biodiversity Conservation: a...
Keynote Speaker 1 - Data Intensive Challenges in Biodiversity Conservation: a...Keynote Speaker 1 - Data Intensive Challenges in Biodiversity Conservation: a...
Keynote Speaker 1 - Data Intensive Challenges in Biodiversity Conservation: a...
 
Global Biodiversity Information Facility - 2013
Global Biodiversity Information Facility - 2013Global Biodiversity Information Facility - 2013
Global Biodiversity Information Facility - 2013
 
Research data and scholarly publications: going from casual acquaintances to ...
Research data and scholarly publications: going from casual acquaintances to ...Research data and scholarly publications: going from casual acquaintances to ...
Research data and scholarly publications: going from casual acquaintances to ...
 
Global patterns of insect diiversity, distribution and evolutionary distinctness
Global patterns of insect diiversity, distribution and evolutionary distinctnessGlobal patterns of insect diiversity, distribution and evolutionary distinctness
Global patterns of insect diiversity, distribution and evolutionary distinctness
 
Encyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypesEncyclopedia of Life: Use cases for phenotypes
Encyclopedia of Life: Use cases for phenotypes
 
Cranston Evolution 2013
Cranston Evolution 2013Cranston Evolution 2013
Cranston Evolution 2013
 
Shorthouse
ShorthouseShorthouse
Shorthouse
 
ANL Soil Metagenomics 2014 Soil Reference Database - Let's do this
ANL Soil Metagenomics 2014 Soil Reference Database - Let's do thisANL Soil Metagenomics 2014 Soil Reference Database - Let's do this
ANL Soil Metagenomics 2014 Soil Reference Database - Let's do this
 
Open Science and Ecological meta-anlaysis
Open Science and Ecological meta-anlaysisOpen Science and Ecological meta-anlaysis
Open Science and Ecological meta-anlaysis
 
vERSO General
vERSO GeneralvERSO General
vERSO General
 
Digital tools and training for environmental sciences in Australia
Digital tools and training for environmental sciences in AustraliaDigital tools and training for environmental sciences in Australia
Digital tools and training for environmental sciences in Australia
 
Rapid Impact Assessment of Climatic and Physio-graphic Changes on Flagship G...
Rapid Impact Assessment of Climatic and Physio-graphic Changes  on Flagship G...Rapid Impact Assessment of Climatic and Physio-graphic Changes  on Flagship G...
Rapid Impact Assessment of Climatic and Physio-graphic Changes on Flagship G...
 
Knowledge Exchange, Nov 2011, Bonn
Knowledge Exchange, Nov 2011, BonnKnowledge Exchange, Nov 2011, Bonn
Knowledge Exchange, Nov 2011, Bonn
 
Scratchpads introductory presentation 45mins
Scratchpads introductory presentation   45minsScratchpads introductory presentation   45mins
Scratchpads introductory presentation 45mins
 
pro-iBiosphere Towards Open Biodiversity Knowledge COOPEUS 2013
pro-iBiosphere Towards Open Biodiversity Knowledge COOPEUS 2013pro-iBiosphere Towards Open Biodiversity Knowledge COOPEUS 2013
pro-iBiosphere Towards Open Biodiversity Knowledge COOPEUS 2013
 
Introduction to EOL.org for scientists
Introduction to EOL.org for scientistsIntroduction to EOL.org for scientists
Introduction to EOL.org for scientists
 
Data sharing archiving discovery, Bill Michener
Data sharing archiving discovery, Bill MichenerData sharing archiving discovery, Bill Michener
Data sharing archiving discovery, Bill Michener
 

More from Anne Thessen

Predicting Phenotype from Multi-Scale Genomic and Environment Data using Neur...
Predicting Phenotype from Multi-Scale Genomic and Environment Data using Neur...Predicting Phenotype from Multi-Scale Genomic and Environment Data using Neur...
Predicting Phenotype from Multi-Scale Genomic and Environment Data using Neur...
Anne Thessen
 

More from Anne Thessen (6)

Predicting Phenotype from Multi-Scale Genomic and Environment Data using Neur...
Predicting Phenotype from Multi-Scale Genomic and Environment Data using Neur...Predicting Phenotype from Multi-Scale Genomic and Environment Data using Neur...
Predicting Phenotype from Multi-Scale Genomic and Environment Data using Neur...
 
Unifying Genomics, Phenomics, and Environments
Unifying Genomics, Phenomics, and EnvironmentsUnifying Genomics, Phenomics, and Environments
Unifying Genomics, Phenomics, and Environments
 
Combining Phenomes and Genomes to Fill Analytical Gaps: Data Management in Ph...
Combining Phenomes and Genomes to Fill Analytical Gaps: Data Management in Ph...Combining Phenomes and Genomes to Fill Analytical Gaps: Data Management in Ph...
Combining Phenomes and Genomes to Fill Analytical Gaps: Data Management in Ph...
 
Knowledge extraction from the Encyclopedia of Life using Python NLTK
Knowledge extraction from the Encyclopedia of Life using Python NLTKKnowledge extraction from the Encyclopedia of Life using Python NLTK
Knowledge extraction from the Encyclopedia of Life using Python NLTK
 
Marrying models and data: Adventures in Modeling, Data Wrangling and Software...
Marrying models and data: Adventures in Modeling, Data Wrangling and Software...Marrying models and data: Adventures in Modeling, Data Wrangling and Software...
Marrying models and data: Adventures in Modeling, Data Wrangling and Software...
 
Visualizing Evolution
Visualizing EvolutionVisualizing Evolution
Visualizing Evolution
 

Recently uploaded

Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Sérgio Sacani
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Lokesh Kothari
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
RizalinePalanog2
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Sérgio Sacani
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
RohitNehra6
 

Recently uploaded (20)

Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Creating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening DesignsCreating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening Designs
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 

Linking biodiversity data for ecology

  • 1. Linking Biodiversity Data for Ecology: Case Studies in what we can do with existing tools Anne E. Thessen http://www.slideshare.net/athessen
  • 2. Acknowledgements • David Patterson • Dima Mozzherin • David Shorthouse • Cyndy Parr • Paula Mabee • Wasila Dahdul • Sami Domisch • Global Names Project • Phenoscape • Encyclopedia of Life • Map of Life • RDA/US Scholars Program • US National Science Foundation
  • 3. Case Studies • Capturing species interactions using the Encyclopedia of Life (EOL) and Global Names Recognition and Discovery (GNRD) • Linking traits and habitats using the Phenoscape knowledgebase, Global Biodiversity Information Facility (GBIF), and Map of Life
  • 4. Species Interactions • Mine data about interactions from text objects in EOL • Create a “digital ecosystem” • PLoS ONE 10.1371/journal.pone. 0089550
  • 5. Workflow and Methods • EOL provided us with a list of ID numbers for their species • Use the EOL API to access text objects under the “Associations”, “Trophic Strategy”, etc. Headings (http://eol.org/api) • Use the GNRD API to find names in the text objects (http://gnrd.globalnames.org/) • Use Resolver to get the EOL ID corresponding to each name found in the text
  • 6. Workflow and Methods EOL GNRD Results EOL TraitBank Interaction mediated between EOL and GNRD APIs GNRD API returns results resolved to EOL IDs Machine-readable results are added to GloBI and visible in TraitBank
  • 7. GNRD • Tool for finding taxonomic names in text • Combination of TaxonFinder and Neti Neti • Capable of some name reconciliation • Recent overall performance evaluation on published manuscripts and data files – Precision = 0.880 – Recall = 0.642 – F1 Score = 0.742 • Largest sources of error were caused by table and figure formatting and unusual punctuation
  • 8. Resolver • Tool for resolving taxonomic names in text against an authority • User can turn resolver on or off • User can choose all or one of eight authorities including CoL, IPNI, GBIF, etc. • We resolved against EOL to get the corresponding taxon IDs
  • 9. Results • Association detection performance – Precision = 0.844 – Recall = 0.930 – F1 Score = 0.885 • Information extracted from entirety of EOL and data set is part of GloBI
  • 10. Linking Phenotypes to Environment • Phenoscape and TraitBank link phenotypes to taxa • GBIF and Map of Life link taxa to locations and habitats • Can we take the extra step and link phenotypes to environments?
  • 11. Workflow and Methods • Phenoscape provided us with a list of fish with the miniaturization phenotype and a list of sister taxa that are not miniaturized • A search of these taxa in GBIF returned geographic coordinates • Those coordinates were given to Map of Life who provided environmental data • Used double-tailed t test to analyze result
  • 12. Workflow and Methods Phenoscape GBIF Map of Life Interaction mediated manually Coordinates
  • 13. Results • We got back temperature, precipitation, slope, land cover, and geology data • Miniature fishes occur in wetter, warmer environments Variable Type Mean p value Temperature mini 24.8 0.002 non mini 22.6 Precipitation mini 6.9 X 107 0.008 non mini 1.8 X 107
  • 14. Relevant References • Midford, P., P. Mabee, T. Vision, H. Lapp, J. Balhoff, W. Dahdul, C. Kothari, J. Lundberg, and M. Westerfield. 2009. Phenoscape: Ontologies for large multi-species phenotype datasets. Zoological Journal of the Linnean Society 151: 691-757. • Parr, C.S., N. Wilson, P. Leary, K.S. Schulz, K. Lans, L. Walley, J.A. Hammock, A. Goddard, J. Rice, M. Studer, J.T.G. Holmes, and R.J. Corrigan Jr. 2014. The Encyclopedia of Life v 2: Providing global access to knowledge about life on Earth. Biodiversity Data Journal 2:e1079 • Thessen, A.E. and C.S. Parr. 2014. Knowledge extraction and semantic annotation of text from the Encyclopedia of Life. PLoS ONE http://dx.plos.org/10.1371/journal.pone.0089550 • Thessen, A.E., D.P. Shorthouse, D. Mozzherin, and D.J. Patterson. in prep. Taxonomic names discovery to improve data discoverability. • Thessen, A.E. et al. in prep. Linking phenotypes to environments • Tuanmu, M.N. and W. Jetz. 2014. A global 1-km consensus land-cover product for biodiversity and ecosystem modeling. Global Ecology and Biogeography 23:1031-1045