BiSciCol: Biological Science
Collections Tracker

Tracking Biodiversity
Objects to Brokering Standards

Brian Stucky, University of Colorado, Boulder
John Deck, University of California, Berkeley
Lukasz Ziemba, University of Florida, Gainesville
Nico Cellinese, University of Florida, Gainesville
Rob Guralnick, University of Colorado, Boulder
BiSciCol Team:
Reed Beaman, Nico Cellinese, Jonathan Coddington, Neil Davies, John Deck, Rob
Guralnick, Bryan P. Heidorn, Chris Meyer, Tom Orrell, Rich Pyle, Kate Rachwal, Brian
Stucky, Rob Whitton, Lukasz Ziemba




Univ. Hawai’i
Univ. Arizona
Smithsonian
•   National Science Foundation funded 2010 – 2014
•   Infrastructure to tag & track specimens & derivatives in cyberspace
•   Relies on globally unique identifiers (GUIDs) to track objects
•   Implements a Linked Data approach
QUANTITY OF DATA IS FIRST LINK IN A LARGER CHAIN OF ISSUES
Here is the problem:

Lots of data … generates …

Data stores / Standards:
•   Taxonomic concepts: Catalog of Life, WoRMS, ITIS, EOL, GNA
•   Geography: GBIF, IUCN ranges, Map of Life, WDPA
•   Genes/genomes: GenBank, TreeBase, ToL Web, AVATOL, BOLD
•   Phenotypes and traits: MorphBank, TRY, Phenoscape
A Growing Constellation of Biodiversity Data and Knowledge
[Figure; labeled nodes: EOL, GBIF, NCBI]
How do we link all these data together?
Borrowing from Facebook and social media…
Can we track relationships for Biological Objects as well?
A Biological Relationship Graph …

Taxonomic Type Filter

Class Filter
  [X] Specimens
  [ ] Tissues
  [X] Sequences

Functions
  [X] Infer Relationships Across providers
Moorea Biocode Example: From field collection through analysis, across multiple systems

[Diagram; object nodes: (Biocode Event), (Essig Museum Specimen), (Smithsonian Tissue), (CAMERA Gut Sample Event), (metagenomic Sequencing), (Genbank Sequence)]
[Diagram; annotation labels: Taxon / Key, (Taxon) / (Key), Taxon*n / Blast*n, Taxon / Blast]
Globally unique identifiers:
•   Globally unique (mandatory)
•   Persistent (not mandatory, but very helpful)
•   Resolvable (not mandatory, but very helpful)

Examples:
      http://example.org/urn:lsid:example.org:specimen/7217D220-836A-11DF-8395-0800200C9A66
      http://mycollection.org/specimen/JDeckSpecimen1
      http://mycollection.org/specimen/uuid=7217D220-836A-11DF-8395-0800200C9A66
      http://dx.doi.org/10.5072/FK2JW8GKM
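
One way to mint an identifier with the properties listed above is to append a random UUID to an HTTP prefix. The sketch below is only an illustration, not the BiSciCol implementation: the base URL is borrowed from the slide's example host, and the function names `mint_identifier` and `looks_resolvable` are hypothetical. Persistence is a policy commitment by the provider and cannot be checked in code.

```python
import uuid
from urllib.parse import urlparse

# Assumed base URL, borrowed from the example host used on the slide.
BASE = "http://mycollection.org/specimen/"

def mint_identifier(base: str = BASE) -> str:
    """Return a new identifier: an HTTP prefix plus a random (version 4) UUID."""
    return f"{base}{str(uuid.uuid4()).upper()}"

def looks_resolvable(identifier: str) -> bool:
    """Syntactic check only: an http(s) scheme and a host name are present."""
    parts = urlparse(identifier)
    return parts.scheme in ("http", "https") and bool(parts.netloc)

new_id = mint_identifier()
print(new_id)                              # different on every run
print(looks_resolvable(new_id))            # True
print(looks_resolvable("JDeckSpecimen1"))  # False: a bare local name
```

A bare catalog number such as JDeckSpecimen1 is unique only within its collection; the HTTP prefix is what gives it a global scope and a place to resolve.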
Simple relationship terms:

Graph relationships:
ONE FINAL PIECE OF THE PUZZLE: GIVING BIRTH TO DATA IN THE RIGHT FORMAT FOR LINKING
“Triplifier” - creating the format for linking biological objects

[Diagram: native data sources (Mysql, KEMU, Darwin Core Archive) feed the Triplifier]

Triplifier: Create links from native data formats
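
The idea of creating links from native data formats can be sketched as turning each table row into a subject, predicate, object triple. This is a minimal illustration and not the Triplifier's actual code: the rows, column names, and identifier prefix below are invented, and the `:relatedTo` predicate is taken from the example search slide.

```python
# Hypothetical rows standing in for a table in a native source such as MySQL.
ROWS = [
    {"specimen": "S1", "tissue": "T1"},
    {"specimen": "S1", "tissue": "T2"},
    {"specimen": "S2", "tissue": None},  # no tissue recorded for this specimen
]

PREFIX = "http://mycollection.org/"  # assumed identifier prefix

def triplify(rows, subject_col, object_col, predicate=":relatedTo"):
    """Yield (subject, predicate, object) triples linking two ID columns."""
    for row in rows:
        subj, obj = row.get(subject_col), row.get(object_col)
        if subj is None or obj is None:
            continue  # an incomplete row states no link
        yield (
            f"{PREFIX}{subject_col}/{subj}",
            predicate,
            f"{PREFIX}{object_col}/{obj}",
        )

for triple in triplify(ROWS, "specimen", "tissue"):
    print(triple)
```

Only the identifier columns are needed to state a link; descriptive columns can stay in the native source.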
QUERY AND RESULTS ACROSS LINKED DATA

[Diagram; labels: Query, Response]
BISCICOL – EXAMPLE SEARCH


Client Interface:
Search Scientific Name: Aedes increpitus               Run


 Results:
 OccurrenceID1 (Aedes increpitus Dyar, 1916)
 OccurrenceID3 (Aedes vittata Theobald, 1903)



                    Taxon SERVICE (ITIS / GNUB)
                    http://lsid.itis.gov/urn:lsid:itis.gov:itis_tsn:126314
                    http://lsid.itis.gov/urn:lsid:itis.gov:itis_tsn:126317
                    http://gnub.org/8E19F1DC-74BA-47D4-A505-6498414B4CCE


                    BISCICOL SERVICE LOOKUP:
                    dwc:IdentificationID1 :relatedTo http://lsid.itis.gov/urn:lsid:itis.gov:itis_tsn:126314
                    dwc:IdentificationID1 :relatedTo dwc:OccurrenceID1
                    dwc:IdentificationID2 :relatedTo http://lsid.itis.gov/urn:lsid:itis.gov:itis_tsn:126317
                    dwc:IdentificationID2 :relatedTo dwc:OccurrenceID3
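
The lookup above can be read as a two-step walk over the triples: find the identifications related to the taxon URIs returned by the taxon service, then collect the occurrences related to those identifications. The sketch below replays that walk over the four triples from the slide. The function name `occurrences_for_taxa` is hypothetical, and an in-memory list stands in for whatever store the service actually uses.

```python
# Triples copied from the slide; ":relatedTo" is the slide's own predicate.
ITIS = "http://lsid.itis.gov/urn:lsid:itis.gov:itis_tsn:"
TRIPLES = [
    ("dwc:IdentificationID1", ":relatedTo", ITIS + "126314"),
    ("dwc:IdentificationID1", ":relatedTo", "dwc:OccurrenceID1"),
    ("dwc:IdentificationID2", ":relatedTo", ITIS + "126317"),
    ("dwc:IdentificationID2", ":relatedTo", "dwc:OccurrenceID3"),
]

def occurrences_for_taxa(taxon_uris, triples=TRIPLES):
    """Return objects that share an identification with any given taxon URI."""
    taxa = set(taxon_uris)
    # Step 1: identifications that point at one of the taxon URIs.
    identifications = {s for s, p, o in triples if p == ":relatedTo" and o in taxa}
    # Step 2: everything else those identifications point at.
    return sorted(
        o for s, p, o in triples
        if p == ":relatedTo" and s in identifications and o not in taxa
    )

print(occurrences_for_taxa([ITIS + "126314", ITIS + "126317"]))
# ['dwc:OccurrenceID1', 'dwc:OccurrenceID3']
```

The result matches the two occurrences shown in the client interface results.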
Working with Locations: Tracking location in space of a moving individual (whales)

IndividualID1   EventID1   GeoreferenceID1
                EventID2   GeoreferenceID2
                EventID3   GeoreferenceID3
Data Impact Factor – Graph Metrics

Graphs
  [ ] GBIF Relations Graph
  [X] Moorea Biocode
  [X] SI MSNGR System
  [+] Add New Graph

Collectors
  Gustav Paulay (102,000 direct children)
  Christopher Meyer (83,000 direct children)
  Craig Moritz (523 direct children)

Occurrences
  MBIO99999 (1024 total descendants)
  IMBL8888888 (723 total descendants)

Events
  Biocode10234 (4234 direct children)
  Expedition21234 (1023 direct children)

Cited occurrences over time
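
The two metrics named on this slide, direct children and total descendants, can be computed from a set of parent-to-child links. The sketch below is one plausible reading of those metrics, not the BiSciCol implementation; the edges and identifiers are invented for illustration.

```python
from collections import defaultdict

# Hypothetical parent -> child links; identifiers are invented for illustration.
EDGES = [
    ("Event1", "Specimen1"),
    ("Event1", "Specimen2"),
    ("Specimen1", "Tissue1"),
    ("Tissue1", "Sequence1"),
    ("Specimen2", "Tissue1"),  # a node may be reachable by more than one path
]

def build_children(edges):
    """Map each parent to the set of its direct children."""
    children = defaultdict(set)
    for parent, child in edges:
        children[parent].add(child)
    return children

def direct_children(children, node):
    """Number of objects linked directly below `node`."""
    return len(children.get(node, ()))

def total_descendants(children, node):
    """Number of distinct objects reachable from `node` (iterative walk)."""
    seen, stack = set(), [node]
    while stack:
        for child in children.get(stack.pop(), ()):
            if child not in seen:
                seen.add(child)
                stack.append(child)
    return len(seen)

graph = build_children(EDGES)
print(direct_children(graph, "Event1"))    # 2
print(total_descendants(graph, "Event1"))  # 4
```

Counting distinct nodes rather than paths keeps an object that is reachable by two routes from being counted twice.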
Why BiSciCol and Why SPNHC and Why Collaborations?

• New era of collections digitization
    •   new & derived data objects created, replicated, annotated
• BiSciCol tackles a natural history collections preservation challenge:
    •   How to follow these digital objects
    •   How to link together objects and derivatives back to specimens
• BiSciCol is about community, collaborative practice
    •   Commitment to standards, ontologies
    •   Agreement on permanent, resolvable identifiers
    •   Triplification of data sources to enhance linked data

Biological Science Collections Tagging and Tracking presented at SPNHC
