SlideShare una empresa de Scribd logo
1 de 21
From Standards to Practice and Back Again.
News from TDWG*:
The Biodiversity Information Standards (TDWG) Conference 2013

Deborah L. Paul
Institute for Digital Information (iDigInfo)
Integrated Digitized Biocollections (iDigBio) at
Entomological Collections Network (ECN) Meeting
Austin, Texas 9 – 10 November 2013
iDigBio is funded by a grant from the National Science Foundation’s Advancing Digitization of Biodiversity Collections Program
(Cooperative Agreement EF-1115210). Any opinions, findings, and conclusions or recommendations expressed in this material
are those of the author(s) and do not necessarily reflect the views of the National Science Foundation. Images used are copyright
free or used with permission.
goals
 build an accessible

aggregated, integrated,
scalable,
vouchered-specimen
database (USA collections)

 facilitate and increase participation in digitization
 enable researchers’ access to and use of the data
 build partnerships to expand and enhance
iDigBio and the TCNs
Up for discussion – TDWG 2013 Topics
 Virtual Communities for Biodiversity
 eCollaboration for Sustainability











Data Quality (whose job is this anyway)?
Semantics (who needs these)?
Big Data
Names-Based Architecture for Linking Data
Global Observation Networks
Data and Metadata Standards: Beyond Darwin Core
Scholarly Publishing
Sharing and Re-using Phylogenetic Knowledge
Interest Groups / Working Groups / TAG

 What does the work of TDWG offer to the collections community?

How is it relevant to ECN?
 http://www.tdwg.org

 https://mbgserv18.mobot.org/ocs/index.php/tdwg/2013/schedConf/presentations
Why standards?

http://www.britishmuseum.org/images/rosettawriting384.jpg
Biodiversity Information Standards
 formerly known as
 Taxonomic Databases Working Group (TDWG)

 began 1985

 Our Mission
 Develop, adopt and promote standards and guidelines

for the recording and exchange of data about organisms
 Promote the use of standards through the most
appropriate and effective means and
 Act as a forum for discussion through holding meetings
and through publications
Overlap
Biodiversity
Information
Standards

Collections
• Physical
• Digital

GBIF
VertNet
iDigBio
TCNs
…
Biodiversity Information Standards (TDWG)
 TDWG warmly welcomes all newcomers, regardless of

background. We are always seeking input from…
http://imgs.xkcd.com/comics/duty_calls.png
The data is born (digital)?
 researcher collects data
 organizes it for their purpose
 or not

 non-standard metadata
 non-standard file formats, file-naming, packaging

 user file system
 unique
 sometimes enigmatic?
Data use, data re-use
 need rich/er metadata
 “good” (standard?) field notes
 will be increasingly shared / distributed / linked with

specimen data and flora / fauna data
 using standard terminology
 dwc, other standards, and ontologies
 data management skills
 data / dataset reuse, data citation – data
discovery, reproducibility
From the researcher into a database (eventually)
 has standard metadata
 in standard formats
 standard packaging
 storage

 Who bridges the transition from data collected in the

field to transform it, standardize it for
sharing, publication, storage?
Coming to a database near you?
What’s your title?
Research Information Manager

Technology Liaison to Science

Biodiversity Informatics Manager

Biodiversity Informatics & GIS Lab Manager

Collections Database Architect

Information Manager

Data Curator

Bioinformatics manager

Manager of Biodiversity Informatics

Research Specialist

Research Project Manager

Biodiversity Informatics Manager

Biodiversity Informatics Manager

Data Manager

Information Manager

Biodiversity Information

Assistant Botanist / Assistant Curator

Head of Nomenclature and Taxonomy
(Biodiversity Informatics)

Head, Computer Systems Office

Sr. Database Manager

Collection Manager

Database Admin/Programmer

Assistant Curator and Virtual Herbarium
Coordinator

Biological Informatician
For the (digital) collection manager
 tools for cleaning data
 open refine

 Specify Workbench
 Darwin Core Test validation tools

 data feedback from tools like Filtered PUSH, …
 TDWG offers tools, standards and methodologies
 enables GBIF (and others) to effectively share data
 and makes possible data discovery from other

collections
 what Texas knows…
 the Digital Collection is a tool for everyone
Data Quality – GBIF priorities
 metadata completeness

aids discovery and citation
data quality and fitness-for-use reports
 dataset and by species
possible approaches to endorsement of datasets
fitness-for-use working groups
all datasets and records have stable identifiers,
 allows annotation, correction, curation and citation
collaborate with other major players
 e.g., in developing a common global taxonomic
framework to underpin taxonomic quality







Data Quality - Southwest Collection of
Arthropods (SCAN) Thematic Collection Network
 Filtered Push (FP) based service
 http://wiki.filteredpush.org/wiki/
 primary purpose is to connect high-quality imaged of
yet insufficiently identified specimens with suitable
experts who can provide identifications remotely

 “IDs Needed” System
Data quality
 Beyond Barriers: Exporting data quality assessments from

Spain Arturo H. Ariño, Francisco Pando, Javier Otegui
 Data Quality Assessment tool - Darwin Test (DT)
 validates Darwin Core Archive files
 checks common errors arising from digitization
 checks for errors from migration

 enforces data standards on records,

records not conforming are sent back
 allows for calculation of the Apparent Quality Index (AQI) of the
dataset.
 reduces noise in the data published,
 allows data to be iteratively corrected before indexing.

Other bits of News from TDWG
 New standard ratified: Audubon Core
 for sharing media data and metadata

 iDigBio, Morphbank,

 Darwin Core definitions work – ongoing
 Darwin Core Archive Files +
 Semantic web
 Host relationships, for example

 Crowd-sourcing
 Collaboration
 trend / funding constraint / challenge / help

 Facilitating African Biodiversity
 next year’s meeting in Nairobi, Kenya
You and Biodiversity Information Standards?
 Join TDWG (it’s free)!
 Data Quality Interest Group?

 Find out what your peers are up to
 Avoid wheel re-invention and N-I-H too!
 Join the tdwg-content listserve

 North American TDWG representatives
 Bryan Heidorn
 James Macklin

 Inspiration, New Tools, New Ideas, Potential – all at TDWG
Acknowledgement
and Thanks to
 Gail Kampmeier, INHS

 Katja Seltmann, ECN, AMNH
 ECN 2013 Organizers and Attendees
 TDWG 2013 Organizers

Más contenido relacionado

La actualidad más candente

The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...ManjulaPatel
 
Curation and Preservation of Crystallography Data
Curation and Preservation of Crystallography DataCuration and Preservation of Crystallography Data
Curation and Preservation of Crystallography DataManjulaPatel
 
SEAD slide set (October 2011)
SEAD slide set (October 2011)SEAD slide set (October 2011)
SEAD slide set (October 2011)SEAD
 
Where is the opportunity for libraries in the collaborative data infrastructure?
Where is the opportunity for libraries in the collaborative data infrastructure?Where is the opportunity for libraries in the collaborative data infrastructure?
Where is the opportunity for libraries in the collaborative data infrastructure?LIBER Europe
 
Integrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesIntegrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesManjulaPatel
 
Knowledge Discovery in an Agents Environment
Knowledge Discovery in an Agents EnvironmentKnowledge Discovery in an Agents Environment
Knowledge Discovery in an Agents EnvironmentManjulaPatel
 
Research Data Services Best Practices by Dalal Rahme
Research Data Services Best Practices by Dalal RahmeResearch Data Services Best Practices by Dalal Rahme
Research Data Services Best Practices by Dalal RahmeDalal Rahme
 
D4Science Data Infrastructure - Facilitator for a FAIR Data Management
D4Science Data Infrastructure - Facilitator for a FAIR Data ManagementD4Science Data Infrastructure - Facilitator for a FAIR Data Management
D4Science Data Infrastructure - Facilitator for a FAIR Data ManagementBlue BRIDGE
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identificationguest453b14
 
Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Jian Qin
 
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)SEAD
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals FederationManjulaPatel
 
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012SEAD
 
Building the FAIR Research Commons: A Data Driven Society of Scientists
Building the FAIR Research Commons: A Data Driven Society of ScientistsBuilding the FAIR Research Commons: A Data Driven Society of Scientists
Building the FAIR Research Commons: A Data Driven Society of ScientistsCarole Goble
 
Research Data Management and Librarians
Research Data Management and LibrariansResearch Data Management and Librarians
Research Data Management and LibrariansJohann van Wyk
 

La actualidad más candente (17)

The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...
 
Curation and Preservation of Crystallography Data
Curation and Preservation of Crystallography DataCuration and Preservation of Crystallography Data
Curation and Preservation of Crystallography Data
 
SEAD slide set (October 2011)
SEAD slide set (October 2011)SEAD slide set (October 2011)
SEAD slide set (October 2011)
 
Where is the opportunity for libraries in the collaborative data infrastructure?
Where is the opportunity for libraries in the collaborative data infrastructure?Where is the opportunity for libraries in the collaborative data infrastructure?
Where is the opportunity for libraries in the collaborative data infrastructure?
 
Integrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesIntegrated research data management in the Structural Sciences
Integrated research data management in the Structural Sciences
 
Knowledge Discovery in an Agents Environment
Knowledge Discovery in an Agents EnvironmentKnowledge Discovery in an Agents Environment
Knowledge Discovery in an Agents Environment
 
Research Data Services Best Practices by Dalal Rahme
Research Data Services Best Practices by Dalal RahmeResearch Data Services Best Practices by Dalal Rahme
Research Data Services Best Practices by Dalal Rahme
 
D4Science Data Infrastructure - Facilitator for a FAIR Data Management
D4Science Data Infrastructure - Facilitator for a FAIR Data ManagementD4Science Data Infrastructure - Facilitator for a FAIR Data Management
D4Science Data Infrastructure - Facilitator for a FAIR Data Management
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identification
 
Digital Curation 101 - Taster
Digital Curation 101 - TasterDigital Curation 101 - Taster
Digital Curation 101 - Taster
 
Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08
 
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
Digital Library Federation - DataNets Panel presentation (Nov. 1st, 2011)
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals Federation
 
OpenData Public Research, University of Toronto, Open Access Week, 25/11/2011
OpenData Public Research, University of Toronto, Open Access Week, 25/11/2011OpenData Public Research, University of Toronto, Open Access Week, 25/11/2011
OpenData Public Research, University of Toronto, Open Access Week, 25/11/2011
 
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012
Data 2012 -- Presentation by Margaret Hedstrom (Jan 2012
 
Building the FAIR Research Commons: A Data Driven Society of Scientists
Building the FAIR Research Commons: A Data Driven Society of ScientistsBuilding the FAIR Research Commons: A Data Driven Society of Scientists
Building the FAIR Research Commons: A Data Driven Society of Scientists
 
Research Data Management and Librarians
Research Data Management and LibrariansResearch Data Management and Librarians
Research Data Management and Librarians
 

Similar a D paul ecn2013

Research Data Management: What is it and why is the Library & Archives Servic...
Research Data Management: What is it and why is the Library & Archives Servic...Research Data Management: What is it and why is the Library & Archives Servic...
Research Data Management: What is it and why is the Library & Archives Servic...GarethKnight
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data CommonsVivien Bonazzi
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identificationguest453b14
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identificationguest453b14
 
Dataset citation and identification
Dataset citation and identificationDataset citation and identification
Dataset citation and identificationAdam Farquhar
 
Komatsoulis internet2 executive track
Komatsoulis internet2 executive trackKomatsoulis internet2 executive track
Komatsoulis internet2 executive trackGeorge Komatsoulis
 
Data commons bonazzi bd2 k fundamentals of science feb 2017
Data commons bonazzi   bd2 k fundamentals of science feb 2017Data commons bonazzi   bd2 k fundamentals of science feb 2017
Data commons bonazzi bd2 k fundamentals of science feb 2017Vivien Bonazzi
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceCarole Goble
 
EIA Biodiversity Data Mobilisation
EIA Biodiversity Data MobilisationEIA Biodiversity Data Mobilisation
EIA Biodiversity Data MobilisationVishwas Chavan
 
BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands Vivien Bonazzi
 
Open Data and Institutional Repositories
Open Data and Institutional RepositoriesOpen Data and Institutional Repositories
Open Data and Institutional RepositoriesRobin Rice
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsVivien Bonazzi
 
Laurie Goodman at NDIC: Big Data Publishing, Handling & Reuse
Laurie Goodman at NDIC: Big Data Publishing, Handling & ReuseLaurie Goodman at NDIC: Big Data Publishing, Handling & Reuse
Laurie Goodman at NDIC: Big Data Publishing, Handling & ReuseGigaScience, BGI Hong Kong
 
e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and LibariesRob Grim
 
Managing, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital EnvironmentManaging, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital Environmentphilipdurbin
 
Management of Data Collections
Management of Data CollectionsManagement of Data Collections
Management of Data Collectionsabedejesus
 
Gettingstartedwithdigitalcollectionsweb[1]
Gettingstartedwithdigitalcollectionsweb[1]Gettingstartedwithdigitalcollectionsweb[1]
Gettingstartedwithdigitalcollectionsweb[1]guest410707c
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data ManagementCarole Goble
 
Managing the research life cycle
Managing the research life cycleManaging the research life cycle
Managing the research life cycleSherry Lake
 

Similar a D paul ecn2013 (20)

Research Data Management: What is it and why is the Library & Archives Servic...
Research Data Management: What is it and why is the Library & Archives Servic...Research Data Management: What is it and why is the Library & Archives Servic...
Research Data Management: What is it and why is the Library & Archives Servic...
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data Commons
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identification
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identification
 
Dataset citation and identification
Dataset citation and identificationDataset citation and identification
Dataset citation and identification
 
Komatsoulis internet2 executive track
Komatsoulis internet2 executive trackKomatsoulis internet2 executive track
Komatsoulis internet2 executive track
 
Data commons bonazzi bd2 k fundamentals of science feb 2017
Data commons bonazzi   bd2 k fundamentals of science feb 2017Data commons bonazzi   bd2 k fundamentals of science feb 2017
Data commons bonazzi bd2 k fundamentals of science feb 2017
 
FAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
 
EIA Biodiversity Data Mobilisation
EIA Biodiversity Data MobilisationEIA Biodiversity Data Mobilisation
EIA Biodiversity Data Mobilisation
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands BD2K and the Commons : ELIXR All Hands
BD2K and the Commons : ELIXR All Hands
 
Open Data and Institutional Repositories
Open Data and Institutional RepositoriesOpen Data and Institutional Repositories
Open Data and Institutional Repositories
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
 
Laurie Goodman at NDIC: Big Data Publishing, Handling & Reuse
Laurie Goodman at NDIC: Big Data Publishing, Handling & ReuseLaurie Goodman at NDIC: Big Data Publishing, Handling & Reuse
Laurie Goodman at NDIC: Big Data Publishing, Handling & Reuse
 
e-Science, Research Data and Libaries
e-Science, Research Data and Libariese-Science, Research Data and Libaries
e-Science, Research Data and Libaries
 
Managing, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital EnvironmentManaging, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital Environment
 
Management of Data Collections
Management of Data CollectionsManagement of Data Collections
Management of Data Collections
 
Gettingstartedwithdigitalcollectionsweb[1]
Gettingstartedwithdigitalcollectionsweb[1]Gettingstartedwithdigitalcollectionsweb[1]
Gettingstartedwithdigitalcollectionsweb[1]
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
 
Managing the research life cycle
Managing the research life cycleManaging the research life cycle
Managing the research life cycle
 

Más de ECNOfficer

Price2 ecn2013
Price2 ecn2013Price2 ecn2013
Price2 ecn2013ECNOfficer
 
Sikes ecn2013 dn_ab
Sikes ecn2013 dn_abSikes ecn2013 dn_ab
Sikes ecn2013 dn_abECNOfficer
 
Janzen ecn2013
Janzen ecn2013Janzen ecn2013
Janzen ecn2013ECNOfficer
 
Nearns ecn2013
Nearns ecn2013Nearns ecn2013
Nearns ecn2013ECNOfficer
 
Giddens ecn2013
Giddens ecn2013Giddens ecn2013
Giddens ecn2013ECNOfficer
 
Rubinoff ecn2013 uhim
Rubinoff ecn2013 uhimRubinoff ecn2013 uhim
Rubinoff ecn2013 uhimECNOfficer
 
Mc alister ecn2013
Mc alister ecn2013Mc alister ecn2013
Mc alister ecn2013ECNOfficer
 
Dombroskie ecn2013
Dombroskie ecn2013Dombroskie ecn2013
Dombroskie ecn2013ECNOfficer
 
Dmitriev ecn2013
Dmitriev ecn2013Dmitriev ecn2013
Dmitriev ecn2013ECNOfficer
 
Oboyski ecn2013
Oboyski ecn2013Oboyski ecn2013
Oboyski ecn2013ECNOfficer
 
Thomas ecn2013
Thomas ecn2013Thomas ecn2013
Thomas ecn2013ECNOfficer
 
Jones ecn2013 the_goodbadugly conabio
Jones ecn2013 the_goodbadugly conabioJones ecn2013 the_goodbadugly conabio
Jones ecn2013 the_goodbadugly conabioECNOfficer
 
Austin ecn2013
Austin ecn2013Austin ecn2013
Austin ecn2013ECNOfficer
 
Yu ecn2013 cnc_databasing
Yu ecn2013 cnc_databasingYu ecn2013 cnc_databasing
Yu ecn2013 cnc_databasingECNOfficer
 
Solis ecn2013 usfws
Solis ecn2013 usfwsSolis ecn2013 usfws
Solis ecn2013 usfwsECNOfficer
 
Schuh ecn2013 tcn_data_structure
Schuh ecn2013 tcn_data_structureSchuh ecn2013 tcn_data_structure
Schuh ecn2013 tcn_data_structureECNOfficer
 
Gil ecn2013 ppt
Gil ecn2013 pptGil ecn2013 ppt
Gil ecn2013 pptECNOfficer
 
Dm smith ecn2013
Dm smith ecn2013Dm smith ecn2013
Dm smith ecn2013ECNOfficer
 

Más de ECNOfficer (20)

Price2 ecn2013
Price2 ecn2013Price2 ecn2013
Price2 ecn2013
 
Sikes ecn2013 dn_ab
Sikes ecn2013 dn_abSikes ecn2013 dn_ab
Sikes ecn2013 dn_ab
 
Ryder ecn2013
Ryder ecn2013Ryder ecn2013
Ryder ecn2013
 
Janzen ecn2013
Janzen ecn2013Janzen ecn2013
Janzen ecn2013
 
Nearns ecn2013
Nearns ecn2013Nearns ecn2013
Nearns ecn2013
 
Krell ecn2013
Krell ecn2013Krell ecn2013
Krell ecn2013
 
Giddens ecn2013
Giddens ecn2013Giddens ecn2013
Giddens ecn2013
 
Rubinoff ecn2013 uhim
Rubinoff ecn2013 uhimRubinoff ecn2013 uhim
Rubinoff ecn2013 uhim
 
Mc alister ecn2013
Mc alister ecn2013Mc alister ecn2013
Mc alister ecn2013
 
Dombroskie ecn2013
Dombroskie ecn2013Dombroskie ecn2013
Dombroskie ecn2013
 
Dmitriev ecn2013
Dmitriev ecn2013Dmitriev ecn2013
Dmitriev ecn2013
 
Oboyski ecn2013
Oboyski ecn2013Oboyski ecn2013
Oboyski ecn2013
 
Thomas ecn2013
Thomas ecn2013Thomas ecn2013
Thomas ecn2013
 
Jones ecn2013 the_goodbadugly conabio
Jones ecn2013 the_goodbadugly conabioJones ecn2013 the_goodbadugly conabio
Jones ecn2013 the_goodbadugly conabio
 
Austin ecn2013
Austin ecn2013Austin ecn2013
Austin ecn2013
 
Yu ecn2013 cnc_databasing
Yu ecn2013 cnc_databasingYu ecn2013 cnc_databasing
Yu ecn2013 cnc_databasing
 
Solis ecn2013 usfws
Solis ecn2013 usfwsSolis ecn2013 usfws
Solis ecn2013 usfws
 
Schuh ecn2013 tcn_data_structure
Schuh ecn2013 tcn_data_structureSchuh ecn2013 tcn_data_structure
Schuh ecn2013 tcn_data_structure
 
Gil ecn2013 ppt
Gil ecn2013 pptGil ecn2013 ppt
Gil ecn2013 ppt
 
Dm smith ecn2013
Dm smith ecn2013Dm smith ecn2013
Dm smith ecn2013
 

Último

A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 

Último (20)

A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 

D paul ecn2013

  • 1. From Standards to Practice and Back Again. News from TDWG*: The Biodiversity Information Standards (TDWG) Conference 2013 Deborah L. Paul Institute for Digital Information (iDigInfo) Integrated Digitized Biocollections (iDigBio) at Entomological Collections Network (ECN) Meeting Austin, Texas 9 – 10 November 2013 iDigBio is funded by a grant from the National Science Foundation’s Advancing Digitization of Biodiversity Collections Program (Cooperative Agreement EF-1115210). Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation. Images used are copyright free or used with permission.
  • 2. goals  build an accessible aggregated, integrated, scalable, vouchered-specimen database (USA collections)  facilitate and increase participation in digitization  enable researchers’ access to and use of the data  build partnerships to expand and enhance
  • 4. Up for discussion – TDWG 2013 Topics  Virtual Communities for Biodiversity  eCollaboration for Sustainability          Data Quality (whose job is this anyway)? Semantics (who needs these)? Big Data Names-Based Architecture for Linking Data Global Observation Networks Data and Metadata Standards: Beyond Darwin Core Scholarly Publishing Sharing and Re-using Phylogenetic Knowledge Interest Groups / Working Groups / TAG  What does the work of TDWG offer to the collections community? How is it relevant to ECN?  http://www.tdwg.org  https://mbgserv18.mobot.org/ocs/index.php/tdwg/2013/schedConf/presentations
  • 6.
  • 7. Biodiversity Information Standards  formerly known as  Taxonomic Databases Working Group (TDWG)  began 1985  Our Mission  Develop, adopt and promote standards and guidelines for the recording and exchange of data about organisms  Promote the use of standards through the most appropriate and effective means and  Act as a forum for discussion through holding meetings and through publications
  • 9. Biodiversity Information Standards (TDWG)  TDWG warmly welcomes all newcomers, regardless of background. We are always seeking input from…
  • 11. The data is born (digital)?  researcher collects data  organizes it for their purpose  or not  non-standard metadata  non-standard file formats, file-naming, packaging  user file system  unique  sometimes enigmatic?
  • 12. Data use, data re-use  need rich/er metadata  “good” (standard?) field notes  will be increasingly shared / distributed / linked with specimen data and flora / fauna data  using standard terminology  dwc, other standards, and ontologies  data management skills  data / dataset reuse, data citation – data discovery, reproducibility
  • 13. From the researcher into a database (eventually)  has standard metadata  in standard formats  standard packaging  storage  Who bridges the transition from data collected in the field to transform it, standardize it for sharing, publication, storage?
  • 14. Coming to a database near you? What’s your title? Research Information Manager Technology Liaison to Science Biodiversity Informatics Manager Biodiversity Informatics & GIS Lab Manager Collections Database Architect Information Manager Data Curator Bioinformatics manager Manager of Biodiversity Informatics Research Specialist Research Project Manager Biodiversity Informatics Manager Biodiversity Informatics Manager Data Manager Information Manager Biodiversity Information Assistant Botanist / Assistant Curator Head of Nomenclature and Taxonomy (Biodiversity Informatics) Head, Computer Systems Office Sr. Database Manager Collection Manager Database Admin/Programmer Assistant Curator and Virtual Herbarium Coordinator Biological Informatician
  • 15. For the (digital) collection manager  tools for cleaning data  open refine  Specify Workbench  Darwin Core Test validation tools  data feedback from tools like Filtered PUSH, …  TDWG offers tools, standards and methodologies  enables GBIF (and others) to effectively share data  and makes possible data discovery from other collections  what Texas knows…  the Digital Collection is a tool for everyone
  • 16. Data Quality – GBIF priorities  metadata completeness aids discovery and citation data quality and fitness-for-use reports  dataset and by species possible approaches to endorsement of datasets fitness-for-use working groups all datasets and records have stable identifiers,  allows annotation, correction, curation and citation collaborate with other major players  e.g., in developing a common global taxonomic framework to underpin taxonomic quality      
  • 17. Data Quality - Southwest Collection of Arthropods (SCAN) Thematic Collection Network  Filtered Push (FP) based service  http://wiki.filteredpush.org/wiki/  primary purpose is to connect high-quality imaged of yet insufficiently identified specimens with suitable experts who can provide identifications remotely  “IDs Needed” System
  • 18. Data quality  Beyond Barriers: Exporting data quality assessments from Spain Arturo H. Ariño, Francisco Pando, Javier Otegui  Data Quality Assessment tool - Darwin Test (DT)  validates Darwin Core Archive files  checks common errors arising from digitization  checks for errors from migration  enforces data standards on records, records not conforming are sent back  allows for calculation of the Apparent Quality Index (AQI) of the dataset.  reduces noise in the data published,  allows data to be iteratively corrected before indexing. 
  • 19. Other bits of News from TDWG  New standard ratified: Audubon Core  for sharing media data and metadata  iDigBio, Morphbank,  Darwin Core definitions work – ongoing  Darwin Core Archive Files +  Semantic web  Host relationships, for example  Crowd-sourcing  Collaboration  trend / funding constraint / challenge / help  Facilitating African Biodiversity  next year’s meeting in Nairobi, Kenya
  • 20. You and Biodiversity Information Standards?  Join TDWG (it’s free)!  Data Quality Interest Group?  Find out what your peers are up to  Avoid wheel re-invention and N-I-H too!  Join the tdwg-content listserve  North American TDWG representatives  Bryan Heidorn  James Macklin  Inspiration, New Tools, New Ideas, Potential – all at TDWG
  • 21. Acknowledgement and Thanks to  Gail Kampmeier, INHS  Katja Seltmann, ECN, AMNH  ECN 2013 Organizers and Attendees  TDWG 2013 Organizers

Notas del editor

  1. Deborah Paul (iDigInfo, iDigBio)From Standards to Practice and Back Again. News from TDWG*: The Biodiversity Information Standards (TDWG) 2013 Conference - Virtual Communities for Biodiversity Science.AbstractFrom their website: "Biodiversity Information Standards (TDWG), also known as the Taxonomic Databases Working Group, is a not for profit, volunteer organization,…formed to establish international collaboration among biological database projects." Currently, TDWG focuses on the development of standards for the exchange of biological/biodiversity data. Whether you already know about BIS (TDWG) or have never heard them, this is your opportunity to find out what TDWG is working on now. Come find out about the recent symposiums and workshops (October 2013), some of which are: Biodiversity Data Quality, Crowd-sourcing Websites and their Communities, Biodiversity informatics services and workflows, Beyond Darwin Core, Biodiversity Observation Networks, Documenting the Darwin Core, e-Collaboration for Sustainability, Mobilizing African Biodiversity, and Sharing and Delivery of Reusable Phylogenetic Knowledge. What does the work of TDWG offer to the collections community? How is it relevant to ECN? How can the collections community work with TDWG? Please join in the conversation.
  2. From NIBA create a national database of vouchered specimen records from US institutions using existing national and international specimen data aggregation projects as models, specify the functional requirements of an aggregated US specimen data store.
  3. Image Interest Group Multimedia Resources Task Group Audubon Core (AC) - convener: Bob MorrisBiological Descriptions and Identification - convener: GregorHagedornGenomic Biodiversity Working Interest Group - convener: John DeckSemantics4Biodiversity - convener: Elizabeth ArnaudTechnical Architecture Group - convener: Greg WhitbreadSpecies Information Interest Group - convener: Paco PandoEconomic Botany - convener Nicola NicolsonEmpowering International e-Collaboration for Sustainability Biodiversity informatics services and workflows Global Earth Observation, Biodiversity Observation networks Biodiversity Data QualityCrafting the future of a Global Biodiversity Heritage Library for diverse community’s needsDocumenting the Darwin CoreMinimum Information Standards for Biological Collections: Beyond Darwin Core Building and maintaining crowd-sourcing Websites and their CommunitiesSemantics for Biodiversity Workshop:Mobilizing African Biodiversity - convener: Hank BartDarwin Archives: beyond star Developing a Names-based Architecture for Linking Biodiversity Sharing and delivery of reusable phylogenetic knowledge Biodiversity vocabulary management Use of Semantic MediaWiki for vocabulary managementDarwin Core DNA and Tissue Data Standard for the Global Genome Biodiversity Network Scholarly Data Publishing in Biodiversity: Challenges and Potentials - Convener: VishwasChavan
  4. http://prezi.com/iib3pqk-kyd-/curators-workbench/Why care about standards?What do they have the potential to accomplish?Collection Managers doing what they need to do – for themselves.When we share, we need standards.Data becomes useful for others / other purposes.a common vocab is requiredFeedback and Attribution become possible.The collection gets used, more, increasing the value of the collection. indirect, subtlePutting identifiers on specimens --- makes more useful to others. consistency is important!
  5. Scene from the Arno River, Florence Italy. TDWG 2013 meeting.
  6. http://www.tdwg.org/activities/https://mbgserv18.mobot.org/ocs/index.php/tdwg/2013/schedConf/presentations
  7. TDWG looks at the range of issues from higher level e-collaborationto nitty-gritty of interpretation of dwc termsto data trends in biodiversity information sharingTDWG warmly welcomes all newcomers, regardless of background. We are always seeking input from biologists, taxonomists, library and information scientists, zoologists, entomologists, ecologists, geneticists, information technologists...TDWG interested indiscussing higher level topics like e-collaborationto nitty-gritty of interpretation of dwc termsto data trends in biodiversity information sharingfostering standards development to support interoperability and data exchange andencouraging standards adoption / useparticipation of those who are developing standards and tools but also those who want to learnregister on the TDWG sitehttp://www.tdwg.org/activities/some of the interest / working groups
  8. https://mbgserv18.mobot.org/ocs/index.php/tdwg/2013/paper/view/502