SlideShare una empresa de Scribd logo
1 de 53
Descargar para leer sin conexión
On community-standards, FAIR data and
scholarly communication
Susanna-Assunta Sansone, PhD
ORCID: 0000-0001-5306-5690
INSERM Workshop 246 “Management and reuse of health data: methodological issues”, Bordeaux, 14-17 May 2017
Data Consultant,
Founding Academic Editor
Associate Director,
Principal Investigator
www.slideshare.net/SusannaSansone
Source: https://www.dataone.org/best-practices
Simplified research data life cycle
• Available in a public repository
• Findable through some sort of search facility
• Retrievable in a standard format
• Self-describing so that third parties can make sense of it
• The product of careful planning, organization and stewardship
• Intended to outlive the experiment for which they were
collected
To do better science, more efficiently
we need data that are…
Key problem: low findability and understandability
• Not always well cited and stored
o True for data as well as for any other digital asset
• Poorly described for third party reuse
o Different level of details and annotation
• Reporting and annotation activities are perceived as time
consuming
o Often rushed and minimally done
We need content or reporting standards
• To harmonized the datasets with respect to the structure
and level or annotation of their:
§ experimental components (e.g., design, conditions, parameters),
§ fundamental biological entities (e.g., samples, genes, cells),
§ complex concepts (such as bioprocesses, tissues, diseases),
§ analytical process and the mathematical models, and
§ their instantiation in computational simulations (from the
molecular level through to whole populations of individuals)
Minimum information reporting
requirements, checklists
o Report the same core, essential
information
o e.g. MIAME guidelines
Controlled vocabularies, taxonomies, thesauri,
ontologies etc.
o Unambiguous identification and definition of
concepts
o e.g. Gene Ontology
Conceptual model, schema,
exchange formats etc
o Define the structure and
interrelation of information, and
the transmission format
o e.g. FASTA
Formats Terminologies Guidelines
Types of content standards
de jure de facto
grass-roots
groups
standard
organizations
Nanotechnology Working Group
Formats Terminologies Guidelines
Community-driven efforts, just few examples
Formats Terminologies Guidelines
224
115
500+
source source
source
MIAME
MIRIAM
MIQAS
MIX
MIGEN
ARRIVE
MIAPE
MIASE
MIQE
MISFISHIE….
REMARK
CONSORT
SRAxml
SOFT FASTA
DICOM
MzML
SBRML
SEDML…
GELML
ISA
CML
MITAB
AAO
CHEBIOBI
PATO ENVO
MOD
BTO
IDO…
TEDDY
PRO
XAO
DO
VO
Content standards in numbers
How to discover the ‘right’ standards for your data?
A	web-based,	curated	and	searchable	portal	that monitors	the	development and	
evolution of	standards,	their	use in	databases and	the	adoption	of	both	in	data	
policies,	to	inform and	educate the	user	community
Data policies by
funders, journals and
other organizations
Content standards
Formats Terminologies Guidelines
Map this complex and evolving landscape
Databases
All	records	are	manually	curated	in-house	
and	verified	by	the	community	behind	each	resource
Data policies by
funders, journals and
other organizations
Databases
Content standards
Formats Terminologies Guidelines
Using indicators to describe ‘status’
Ready	for	use,	implementation,	or	recommendation
In	development
Status	uncertain
Deprecated	as	subsumed	or	superseded
Understanding how standards are used
Understanding how standards are used
Guideline
Understanding how standards are used
Formats
Guideline
Understanding how standards are used
Formats
Guideline
Formats
Understanding how standards are used
Formats
Guideline
Formats
Terminology
Data policies by
funders, journals and
other organizations
Databases
Content standards
Formats Terminologies Guidelines
Using indicators to indicate ‘adoption’
Standard developing groups:Journal, publishers:
Cross-links, data exchange:
Societies and organisations: Institutional RDM services:
Projects, programmes:
Technologically-delineated
views of the world
Biologically-delineated
views of the world
Generic features (‘common core’)
- description of source biomaterial
- experimental design components
Arrays
Scanning Arrays &
Scanning
Columns
Gels
MS MS
FTIR
NMR
Columns
transcriptomics
proteomics
metabolomics
plant biology
epidemiology
microbiology
Duplications & lack of interoperability among standards
Arrays
Scanning Arrays &
Scanning
Columns
Gels
MS MS
FTIR
NMR
Columns
transcriptomics
proteomics
metabolomics
plant biology
epidemiology
microbiology
Hard to use them in combinations, e.g. to represent:
Proteomics-based gut microbiota profiling
Proteomics and metabolomics based gut
microbiota profiling
Arrays
Scanning Arrays &
Scanning
Columns
Gels
MS MS
FTIR
NMR
Columns
transcriptomics
proteomics
metabolomics
plant biology
epidemiology
microbiology
Enhancing modularization
Proteomics-based gut microbiota profiling
Proteomics and metabolomics based gut
microbiota profiling
Arrays
Scanning Arrays &
Scanning
Columns
Gels
MS MS
FTIR
NMR
Columns
transcriptomics
proteomics
metabolomics
plant biology
epidemiology
microbiology
Enhancing modularization
Proteomics-based gut microbiota profiling
Proteomics and metabolomics based gut
microbiota profiling
bsg-000174
biosharing:
ReportingGuideline
bsg-000161
MINSEQE
MIMARKS
sample
information
sample
identifier
taxonomy
identifier
sequence
read
geo location
High-level information about
the metadata standards
Representations
of the standards elements
Template elements
for
el-000001
el-000002
el-000003
provenance:
MINSEQE
provenance:
MINSEQE
and
MIMARKS
provenance:
MIMARKS
Serve machine-readable content metadata standards, providing provenance for
their elements, rendering standards invisible to the researchers
Inform the creation of metadata templates
How to discover the datasets relevant to your work?
OmicsDI: Nature Biotechnology 35, 406–409 (2017) doi:10.1038/nbt.3790
omicsdi.org
datamed.org
DataMed: bioRxiv 094888; https://doi.org/10.1101/094888 Nature Genetics (in press)
DATS: bioRxiv 103143; https://doi.org/10.1101/103143 Scientific Data (in press)
• Discoverability and reusability
o Complementing community
databases
• Incentive, credit for sharing
o Big and small data
o Unpublished data
o Long tail of data
o Curated aggregation
• Peer review of data
• Value of data vs. analysis
Growing number of data papers and data journals, e.g:
nature.com/scientificdataHonorary Academic Editor
Susanna-Assunta Sansone, PhD
Managing Editor
Andrew L Hufton, PhD
Editorial Curator
Varsha Khodiyar
Publisher
Iain Hrynaszkiewicz
A new open-access, online-only publication for
descriptions of scientifically valuable datasets
Supported by
• A peer reviewed description of data, to maximize usage
• Citable publications that give credit for reusable data
• It requires data deposition to the appropriate repository(s)
• Is complementary and can be associated or not to traditional article(s)
New article type
Research
papers
Data
records
Data
Descriptors
Value added component – complementing
articles and repositories
• Title
• Abstract
• Background & Summary
• Methods
• Data Records
• Technical Validation
• Usage Notes
• Figures & Tables
• References
• Data Citations
• following the Joint Declaration of Data Citation Principles
Detailed description of the methods and
technical analyses supporting the
quality of the measurements;
no scientific hypotheses
Article structure
Focus on data peer review
• Completeness = can others reproduce?
• Consistency = were community standards followed?
• Integrity = are data in the best repository?
• Experimental rigour, technical quality = were the methods sound?
Does not focus on perceived impact, importance, size, complexity of data
Credit for data producers, data managers/curators etc.
Credit to: Varsha Khodiyar
“The Data Descriptor made it easier to use
the data, for me it was critical that everything
was there…all the technical details like voxel
size.”
Professor Daniele Marinazzo
Credit to: Varsha Khodiyar
Data (re)use made easier
Decades
old dataset
Aggregated or
curated data
resources
Computationally
produced data
products
Large
consortium
dataset
Data from a
single
experiment
Data that YOU
find valuable
and that others
might find
useful too
Data associated
with a high impact
analysis article
What makes a good ?
Experimental metadata or
structured component
(in-house curated, machine-
readable formats)
Article or
narrative component
(PDF and HTML)
Data Descriptors has two components
The Data Curation Editor is responsible for creating and
curating the machine-readable structured component
• Enables browsing and searching the articles
• Facilitates links to related journal articles and repository
records
Curation and discoverability
Created with the input of the
authors, includes value-added
semantic annotation of the
experimental metadata
analysis
method
script
Data file or
record in a
database
Data Descriptors: structured component
Complementary roles of ISA and
nanopublications
From Peer-Reviewed to Peer-Reproduced in Scholarly Publishing: The Complementary Roles
of Data Models and Workflows in Bioinformatics. https://doi.org/10.1371/journal.pone.0127612
PloS ONE (2015)
The (long) road to FAIR
Responsibilities lie across several stakeholder groups
Understand the benefits of sharing
FAIR datasets and enact them
Engage and assist researchers to
enable them to share FAIR datasets
Release or endorse practices
and polices, but also incentive
and credit mechanisms for
researchers, curators and
developers
“As Data Science culture grows,
digital research outputs (such as
data, computational analysis and
software) are being established as
first-class citizens.
This cultural shift is required to go
one step further: to recognize
interoperability standards as digital
objects in their own right, with their
associated research, development
and educational activities”.
Sansone, Susanna-Assunta; Rocca-Serra, Philippe (2016).
Interoperability Standards - Digital Objects in Their Own
Right. Wellcome Trust”
https://dx.doi.org/10.6084/m9.figshare.4055496.v1
Philippe
Rocca-Serra, PhD
Senior Research Lecturer
Alejandra
Gonzalez-Beltran, PhD
Research Lecturer
Milo
Thurston, DPhD
Research Software Engineer
Massimiliano
Izzo, PhD
Research Software Engineer
Peter
McQuilton, PhD
Knowledge Engineer
Allyson
Lister, PhD
Knowledge Engineer
Eamonn
Maguire, Dphil
Contractor
David
Johnson, PhD
Research Software Engineer
Melanie
Adekale, PhD
Biocurator Contractor
Delphine
Dauga, PhD
Biocurator Contractor
We work with and for
to make data and other
digital research assets
Susanna-Assunta Sansone, PhD
Principal Investigator, Associate Director
and Data Consultant for Springer Nature
enabling open science,
driving science and discoveries

Más contenido relacionado

La actualidad más candente

FAIRsharing - focus on standards and new features
FAIRsharing - focus on standards and new features FAIRsharing - focus on standards and new features
FAIRsharing - focus on standards and new features Susanna-Assunta Sansone
 
CrossRef at SciELO15 Conference 2013
CrossRef at SciELO15 Conference 2013CrossRef at SciELO15 Conference 2013
CrossRef at SciELO15 Conference 2013Crossref
 
BioSharing at Internatiomnal Data Week - NIH BD2K session, Denver 2016
BioSharing at Internatiomnal Data Week - NIH BD2K session, Denver 2016BioSharing at Internatiomnal Data Week - NIH BD2K session, Denver 2016
BioSharing at Internatiomnal Data Week - NIH BD2K session, Denver 2016Susanna-Assunta Sansone
 
Poster: Very Open Data Project
Poster: Very Open Data ProjectPoster: Very Open Data Project
Poster: Very Open Data ProjectEdward Blurock
 
FAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology AgencyFAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology AgencyPeter McQuilton
 
FAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research CommonsFAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research CommonsCarole Goble
 
FAIRsharing, FAIR principles and metrics - Working with/for the Agro domain
FAIRsharing, FAIR principles and metrics - Working with/for the Agro domainFAIRsharing, FAIR principles and metrics - Working with/for the Agro domain
FAIRsharing, FAIR principles and metrics - Working with/for the Agro domainSusanna-Assunta Sansone
 
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...Merce Crosas
 
DataCite: the Perfect Complement to CrossRef
DataCite: the Perfect Complement to CrossRefDataCite: the Perfect Complement to CrossRef
DataCite: the Perfect Complement to CrossRefCrossref
 
The Dataverse Commons
The Dataverse CommonsThe Dataverse Commons
The Dataverse CommonsMerce Crosas
 
RDAP 16 Poster: Diving into Data: Implementing a Data Repository at the Texas...
RDAP 16 Poster: Diving into Data: Implementing a Data Repository at the Texas...RDAP 16 Poster: Diving into Data: Implementing a Data Repository at the Texas...
RDAP 16 Poster: Diving into Data: Implementing a Data Repository at the Texas...ASIS&T
 
Research Networking to Enhance Multi-institutional Expertise Discovery and Co...
Research Networking to Enhance Multi-institutional Expertise Discovery and Co...Research Networking to Enhance Multi-institutional Expertise Discovery and Co...
Research Networking to Enhance Multi-institutional Expertise Discovery and Co...Holly Falk-Krzesinski
 
How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?Jian Qin
 

La actualidad más candente (20)

FAIRsharing - focus on standards and new features
FAIRsharing - focus on standards and new features FAIRsharing - focus on standards and new features
FAIRsharing - focus on standards and new features
 
CrossRef at SciELO15 Conference 2013
CrossRef at SciELO15 Conference 2013CrossRef at SciELO15 Conference 2013
CrossRef at SciELO15 Conference 2013
 
Burton - Security, Privacy and Trust
Burton - Security, Privacy and TrustBurton - Security, Privacy and Trust
Burton - Security, Privacy and Trust
 
BioSharing at Internatiomnal Data Week - NIH BD2K session, Denver 2016
BioSharing at Internatiomnal Data Week - NIH BD2K session, Denver 2016BioSharing at Internatiomnal Data Week - NIH BD2K session, Denver 2016
BioSharing at Internatiomnal Data Week - NIH BD2K session, Denver 2016
 
Poster: Very Open Data Project
Poster: Very Open Data ProjectPoster: Very Open Data Project
Poster: Very Open Data Project
 
FAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology AgencyFAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology Agency
 
FAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research CommonsFAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research Commons
 
FAIRsharing, FAIR principles and metrics - Working with/for the Agro domain
FAIRsharing, FAIR principles and metrics - Working with/for the Agro domainFAIRsharing, FAIR principles and metrics - Working with/for the Agro domain
FAIRsharing, FAIR principles and metrics - Working with/for the Agro domain
 
Sansone mibbi-intro
Sansone mibbi-introSansone mibbi-intro
Sansone mibbi-intro
 
Borgman - Privacy, Policy and Data Governance in the University
Borgman - Privacy, Policy and Data Governance in the UniversityBorgman - Privacy, Policy and Data Governance in the University
Borgman - Privacy, Policy and Data Governance in the University
 
Hansen Metadata for Institutional Repositories
Hansen Metadata for Institutional RepositoriesHansen Metadata for Institutional Repositories
Hansen Metadata for Institutional Repositories
 
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
Addressing the New Challenges in Data Sharing: Large-Scale Data and Sensitive...
 
DataCite: the Perfect Complement to CrossRef
DataCite: the Perfect Complement to CrossRefDataCite: the Perfect Complement to CrossRef
DataCite: the Perfect Complement to CrossRef
 
Bracke may4-1
Bracke may4-1Bracke may4-1
Bracke may4-1
 
Levine - Data Curation; Ethics and Legal Considerations
Levine - Data Curation; Ethics and Legal ConsiderationsLevine - Data Curation; Ethics and Legal Considerations
Levine - Data Curation; Ethics and Legal Considerations
 
The Dataverse Commons
The Dataverse CommonsThe Dataverse Commons
The Dataverse Commons
 
RDAP 16 Poster: Diving into Data: Implementing a Data Repository at the Texas...
RDAP 16 Poster: Diving into Data: Implementing a Data Repository at the Texas...RDAP 16 Poster: Diving into Data: Implementing a Data Repository at the Texas...
RDAP 16 Poster: Diving into Data: Implementing a Data Repository at the Texas...
 
Research Networking to Enhance Multi-institutional Expertise Discovery and Co...
Research Networking to Enhance Multi-institutional Expertise Discovery and Co...Research Networking to Enhance Multi-institutional Expertise Discovery and Co...
Research Networking to Enhance Multi-institutional Expertise Discovery and Co...
 
The FAIR Cookbook in a nutshell
The FAIR Cookbook in a nutshellThe FAIR Cookbook in a nutshell
The FAIR Cookbook in a nutshell
 
How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?
 

Similar a INSERM - Data Management & Reuse of Health Data - May 2017

NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataSusanna-Assunta Sansone
 
Behind the FAIR brand: Thinkers, Doers and Dreamers
Behind the FAIR brand: Thinkers, Doers and DreamersBehind the FAIR brand: Thinkers, Doers and Dreamers
Behind the FAIR brand: Thinkers, Doers and DreamersSusanna-Assunta Sansone
 
My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018Susanna-Assunta Sansone
 
Metadata 2020 Vivo Conference 2018
Metadata 2020 Vivo Conference 2018 Metadata 2020 Vivo Conference 2018
Metadata 2020 Vivo Conference 2018 Clare Dean
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things dataARDC
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
Managing, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital EnvironmentManaging, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital Environmentphilipdurbin
 
What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...heila1
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data ChallengesPhilip Bourne
 
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...SC CTSI at USC and CHLA
 
Data curation issues for repositories
Data curation issues for repositoriesData curation issues for repositories
Data curation issues for repositoriesChris Rusbridge
 
Data sharing as part of the research ecosystem
Data sharing as part of the research ecosystemData sharing as part of the research ecosystem
Data sharing as part of the research ecosystemVarsha Khodiyar
 
Open Data and Institutional Repositories
Open Data and Institutional RepositoriesOpen Data and Institutional Repositories
Open Data and Institutional RepositoriesRobin Rice
 
FAIR, standards and FAIRsharing - MAQC Society 2019
FAIR, standards and FAIRsharing - MAQC Society 2019FAIR, standards and FAIRsharing - MAQC Society 2019
FAIR, standards and FAIRsharing - MAQC Society 2019Susanna-Assunta Sansone
 

Similar a INSERM - Data Management & Reuse of Health Data - May 2017 (20)

NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific DataNIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
NIH iDASH meeting on data sharing - BioSharing, ISA and Scientific Data
 
Behind the FAIR brand: Thinkers, Doers and Dreamers
Behind the FAIR brand: Thinkers, Doers and DreamersBehind the FAIR brand: Thinkers, Doers and Dreamers
Behind the FAIR brand: Thinkers, Doers and Dreamers
 
My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018
 
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at ScaleFull Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
 
The FAIR Principles and FAIRsharing
The FAIR Principles and FAIRsharingThe FAIR Principles and FAIRsharing
The FAIR Principles and FAIRsharing
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
 
Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis Critical infrastructure to promote data synthesis
Critical infrastructure to promote data synthesis
 
Biosharing sansone-dryad-may13
Biosharing sansone-dryad-may13Biosharing sansone-dryad-may13
Biosharing sansone-dryad-may13
 
FAIR: standards and services
FAIR: standards and servicesFAIR: standards and services
FAIR: standards and services
 
Metadata 2020 Vivo Conference 2018
Metadata 2020 Vivo Conference 2018 Metadata 2020 Vivo Conference 2018
Metadata 2020 Vivo Conference 2018
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things data
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Managing, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital EnvironmentManaging, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital Environment
 
What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data Challenges
 
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
 
Data curation issues for repositories
Data curation issues for repositoriesData curation issues for repositories
Data curation issues for repositories
 
Data sharing as part of the research ecosystem
Data sharing as part of the research ecosystemData sharing as part of the research ecosystem
Data sharing as part of the research ecosystem
 
Open Data and Institutional Repositories
Open Data and Institutional RepositoriesOpen Data and Institutional Repositories
Open Data and Institutional Repositories
 
FAIR, standards and FAIRsharing - MAQC Society 2019
FAIR, standards and FAIRsharing - MAQC Society 2019FAIR, standards and FAIRsharing - MAQC Society 2019
FAIR, standards and FAIRsharing - MAQC Society 2019
 

Más de Susanna-Assunta Sansone

FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
NFDI Physical Sciences Colloquium - FAIR
NFDI Physical Sciences Colloquium - FAIRNFDI Physical Sciences Colloquium - FAIR
NFDI Physical Sciences Colloquium - FAIRSusanna-Assunta Sansone
 
FAIR, community standards and data FAIRification: components and recipes
FAIR, community standards and data FAIRification: components and recipesFAIR, community standards and data FAIRification: components and recipes
FAIR, community standards and data FAIRification: components and recipesSusanna-Assunta Sansone
 
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR CookbookFAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR CookbookSusanna-Assunta Sansone
 
FAIRsharing: how we assist with FAIRness
FAIRsharing: how we assist with FAIRnessFAIRsharing: how we assist with FAIRness
FAIRsharing: how we assist with FAIRnessSusanna-Assunta Sansone
 
FAIR data and standards for a coordinated COVID-19 response
FAIR data and standards for a coordinated COVID-19 responseFAIR data and standards for a coordinated COVID-19 response
FAIR data and standards for a coordinated COVID-19 responseSusanna-Assunta Sansone
 
Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook
Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook
Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook Susanna-Assunta Sansone
 
FAIRsharing COVID-19 Collection for The Global Health Network
FAIRsharing COVID-19 Collection for The Global Health NetworkFAIRsharing COVID-19 Collection for The Global Health Network
FAIRsharing COVID-19 Collection for The Global Health NetworkSusanna-Assunta Sansone
 

Más de Susanna-Assunta Sansone (20)

FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
FAIRsharing-Standards-4-GSC-Aug23.pdf
FAIRsharing-Standards-4-GSC-Aug23.pdfFAIRsharing-Standards-4-GSC-Aug23.pdf
FAIRsharing-Standards-4-GSC-Aug23.pdf
 
FAIR-4-GSC-Sansone-Aug23.pdf
FAIR-4-GSC-Sansone-Aug23.pdfFAIR-4-GSC-Sansone-Aug23.pdf
FAIR-4-GSC-Sansone-Aug23.pdf
 
FAIRsharing & FAIRcookbook at RDA 2023
FAIRsharing & FAIRcookbook at RDA 2023FAIRsharing & FAIRcookbook at RDA 2023
FAIRsharing & FAIRcookbook at RDA 2023
 
NFDI Physical Sciences Colloquium - FAIR
NFDI Physical Sciences Colloquium - FAIRNFDI Physical Sciences Colloquium - FAIR
NFDI Physical Sciences Colloquium - FAIR
 
Metadata Standards
Metadata StandardsMetadata Standards
Metadata Standards
 
FAIRcookbook: GSRS22-Singapore
FAIRcookbook: GSRS22-SingaporeFAIRcookbook: GSRS22-Singapore
FAIRcookbook: GSRS22-Singapore
 
FAIR Cookbook
FAIR Cookbook FAIR Cookbook
FAIR Cookbook
 
FAIR, community standards and data FAIRification: components and recipes
FAIR, community standards and data FAIRification: components and recipesFAIR, community standards and data FAIRification: components and recipes
FAIR, community standards and data FAIRification: components and recipes
 
FAIRsharing and the FAIR Cookbook
FAIRsharing and the FAIR Cookbook FAIRsharing and the FAIR Cookbook
FAIRsharing and the FAIR Cookbook
 
FAIRsharing for EOSC
FAIRsharing for EOSC FAIRsharing for EOSC
FAIRsharing for EOSC
 
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR CookbookFAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
 
FAIRsharing: what we do for policies
FAIRsharing: what we do for policiesFAIRsharing: what we do for policies
FAIRsharing: what we do for policies
 
FAIRsharing: how we assist with FAIRness
FAIRsharing: how we assist with FAIRnessFAIRsharing: how we assist with FAIRness
FAIRsharing: how we assist with FAIRness
 
ELIXIR FAIR Activities - Examplars
ELIXIR FAIR Activities - ExamplarsELIXIR FAIR Activities - Examplars
ELIXIR FAIR Activities - Examplars
 
FAIR data and standards for a coordinated COVID-19 response
FAIR data and standards for a coordinated COVID-19 responseFAIR data and standards for a coordinated COVID-19 response
FAIR data and standards for a coordinated COVID-19 response
 
FAIRsharing poster
FAIRsharing posterFAIRsharing poster
FAIRsharing poster
 
The FAIR Cookbook poster
The FAIR Cookbook posterThe FAIR Cookbook poster
The FAIR Cookbook poster
 
Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook
Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook
Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook
 
FAIRsharing COVID-19 Collection for The Global Health Network
FAIRsharing COVID-19 Collection for The Global Health NetworkFAIRsharing COVID-19 Collection for The Global Health Network
FAIRsharing COVID-19 Collection for The Global Health Network
 

Último

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...shivangimorya083
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 

Último (20)

Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔️Body to body massage wit...
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 

INSERM - Data Management & Reuse of Health Data - May 2017

  • 1. On community-standards, FAIR data and scholarly communication Susanna-Assunta Sansone, PhD ORCID: 0000-0001-5306-5690 INSERM Workshop 246 “Management and reuse of health data: methodological issues”, Bordeaux, 14-17 May 2017 Data Consultant, Founding Academic Editor Associate Director, Principal Investigator www.slideshare.net/SusannaSansone
  • 2.
  • 4. • Available in a public repository • Findable through some sort of search facility • Retrievable in a standard format • Self-describing so that third parties can make sense of it • The product of careful planning, organization and stewardship • Intended to outlive the experiment for which they were collected To do better science, more efficiently we need data that are…
  • 5. Key problem: low findability and understandability • Not always well cited and stored o True for data as well as for any other digital asset • Poorly described for third party reuse o Different level of details and annotation • Reporting and annotation activities are perceived as time consuming o Often rushed and minimally done
  • 6. We need content or reporting standards • To harmonized the datasets with respect to the structure and level or annotation of their: § experimental components (e.g., design, conditions, parameters), § fundamental biological entities (e.g., samples, genes, cells), § complex concepts (such as bioprocesses, tissues, diseases), § analytical process and the mathematical models, and § their instantiation in computational simulations (from the molecular level through to whole populations of individuals)
  • 7. Minimum information reporting requirements, checklists o Report the same core, essential information o e.g. MIAME guidelines Controlled vocabularies, taxonomies, thesauri, ontologies etc. o Unambiguous identification and definition of concepts o e.g. Gene Ontology Conceptual model, schema, exchange formats etc o Define the structure and interrelation of information, and the transmission format o e.g. FASTA Formats Terminologies Guidelines Types of content standards
  • 8. de jure de facto grass-roots groups standard organizations Nanotechnology Working Group Formats Terminologies Guidelines Community-driven efforts, just few examples
  • 9. Formats Terminologies Guidelines 224 115 500+ source source source MIAME MIRIAM MIQAS MIX MIGEN ARRIVE MIAPE MIASE MIQE MISFISHIE…. REMARK CONSORT SRAxml SOFT FASTA DICOM MzML SBRML SEDML… GELML ISA CML MITAB AAO CHEBIOBI PATO ENVO MOD BTO IDO… TEDDY PRO XAO DO VO Content standards in numbers
  • 10.
  • 11. How to discover the ‘right’ standards for your data?
  • 12.
  • 13. A web-based, curated and searchable portal that monitors the development and evolution of standards, their use in databases and the adoption of both in data policies, to inform and educate the user community
  • 14. Data policies by funders, journals and other organizations Content standards Formats Terminologies Guidelines Map this complex and evolving landscape Databases All records are manually curated in-house and verified by the community behind each resource
  • 15. Data policies by funders, journals and other organizations Databases Content standards Formats Terminologies Guidelines Using indicators to describe ‘status’ Ready for use, implementation, or recommendation In development Status uncertain Deprecated as subsumed or superseded
  • 17. Understanding how standards are used Guideline
  • 18. Understanding how standards are used Formats Guideline
  • 19. Understanding how standards are used Formats Guideline Formats
  • 20. Understanding how standards are used Formats Guideline Formats Terminology
  • 21. Data policies by funders, journals and other organizations Databases Content standards Formats Terminologies Guidelines Using indicators to indicate ‘adoption’
  • 22.
  • 23.
  • 24.
  • 25. Standard developing groups:Journal, publishers: Cross-links, data exchange: Societies and organisations: Institutional RDM services: Projects, programmes:
  • 26. Technologically-delineated views of the world Biologically-delineated views of the world Generic features (‘common core’) - description of source biomaterial - experimental design components Arrays Scanning Arrays & Scanning Columns Gels MS MS FTIR NMR Columns transcriptomics proteomics metabolomics plant biology epidemiology microbiology Duplications & lack of interoperability among standards
  • 27. Arrays Scanning Arrays & Scanning Columns Gels MS MS FTIR NMR Columns transcriptomics proteomics metabolomics plant biology epidemiology microbiology Hard to use them in combinations, e.g. to represent: Proteomics-based gut microbiota profiling Proteomics and metabolomics based gut microbiota profiling
  • 28. Arrays Scanning Arrays & Scanning Columns Gels MS MS FTIR NMR Columns transcriptomics proteomics metabolomics plant biology epidemiology microbiology Enhancing modularization Proteomics-based gut microbiota profiling Proteomics and metabolomics based gut microbiota profiling
  • 29. Arrays Scanning Arrays & Scanning Columns Gels MS MS FTIR NMR Columns transcriptomics proteomics metabolomics plant biology epidemiology microbiology Enhancing modularization Proteomics-based gut microbiota profiling Proteomics and metabolomics based gut microbiota profiling
  • 30. bsg-000174 biosharing: ReportingGuideline bsg-000161 MINSEQE MIMARKS sample information sample identifier taxonomy identifier sequence read geo location High-level information about the metadata standards Representations of the standards elements Template elements for el-000001 el-000002 el-000003 provenance: MINSEQE provenance: MINSEQE and MIMARKS provenance: MIMARKS Serve machine-readable content metadata standards, providing provenance for their elements, rendering standards invisible to the researchers Inform the creation of metadata templates
  • 31. How to discover the datasets relevant to your work?
  • 32. OmicsDI: Nature Biotechnology 35, 406–409 (2017) doi:10.1038/nbt.3790 omicsdi.org
  • 33. datamed.org DataMed: bioRxiv 094888; https://doi.org/10.1101/094888 Nature Genetics (in press) DATS: bioRxiv 103143; https://doi.org/10.1101/103143 Scientific Data (in press)
  • 34. • Discoverability and reusability o Complementing community databases • Incentive, credit for sharing o Big and small data o Unpublished data o Long tail of data o Curated aggregation • Peer review of data • Value of data vs. analysis Growing number of data papers and data journals, e.g:
  • 35. nature.com/scientificdataHonorary Academic Editor Susanna-Assunta Sansone, PhD Managing Editor Andrew L Hufton, PhD Editorial Curator Varsha Khodiyar Publisher Iain Hrynaszkiewicz A new open-access, online-only publication for descriptions of scientifically valuable datasets Supported by
  • 36. • A peer reviewed description of data, to maximize usage • Citable publications that give credit for reusable data • It requires data deposition to the appropriate repository(s) • Is complementary and can be associated or not to traditional article(s) New article type
  • 37. Research papers Data records Data Descriptors Value added component – complementing articles and repositories
  • 38. • Title • Abstract • Background & Summary • Methods • Data Records • Technical Validation • Usage Notes • Figures & Tables • References • Data Citations • following the Joint Declaration of Data Citation Principles Detailed description of the methods and technical analyses supporting the quality of the measurements; no scientific hypotheses Article structure
  • 39. Focus on data peer review • Completeness = can others reproduce? • Consistency = were community standards followed? • Integrity = are data in the best repository? • Experimental rigour, technical quality = were the methods sound? Does not focus on perceived impact, importance, size, complexity of data
  • 40. Credit for data producers, data managers/curators etc. Credit to: Varsha Khodiyar
  • 41. “The Data Descriptor made it easier to use the data, for me it was critical that everything was there…all the technical details like voxel size.” Professor Daniele Marinazzo Credit to: Varsha Khodiyar Data (re)use made easier
  • 42. Decades old dataset Aggregated or curated data resources Computationally produced data products Large consortium dataset Data from a single experiment Data that YOU find valuable and that others might find useful too Data associated with a high impact analysis article What makes a good ?
  • 43. Experimental metadata or structured component (in-house curated, machine- readable formats) Article or narrative component (PDF and HTML) Data Descriptors has two components
  • 44. The Data Curation Editor is responsible for creating and curating the machine-readable structured component • Enables browsing and searching the articles • Facilitates links to related journal articles and repository records Curation and discoverability
  • 45. Created with the input of the authors, includes value-added semantic annotation of the experimental metadata analysis method script Data file or record in a database Data Descriptors: structured component
  • 46.
  • 47.
  • 48.
  • 49. Complementary roles of ISA and nanopublications From Peer-Reviewed to Peer-Reproduced in Scholarly Publishing: The Complementary Roles of Data Models and Workflows in Bioinformatics. https://doi.org/10.1371/journal.pone.0127612 PloS ONE (2015)
  • 50. The (long) road to FAIR
  • 51. Responsibilities lie across several stakeholder groups Understand the benefits of sharing FAIR datasets and enact them Engage and assist researchers to enable them to share FAIR datasets Release or endorse practices and polices, but also incentive and credit mechanisms for researchers, curators and developers
  • 52. “As Data Science culture grows, digital research outputs (such as data, computational analysis and software) are being established as first-class citizens. This cultural shift is required to go one step further: to recognize interoperability standards as digital objects in their own right, with their associated research, development and educational activities”. Sansone, Susanna-Assunta; Rocca-Serra, Philippe (2016). Interoperability Standards - Digital Objects in Their Own Right. Wellcome Trust” https://dx.doi.org/10.6084/m9.figshare.4055496.v1
  • 53. Philippe Rocca-Serra, PhD Senior Research Lecturer Alejandra Gonzalez-Beltran, PhD Research Lecturer Milo Thurston, DPhD Research Software Engineer Massimiliano Izzo, PhD Research Software Engineer Peter McQuilton, PhD Knowledge Engineer Allyson Lister, PhD Knowledge Engineer Eamonn Maguire, Dphil Contractor David Johnson, PhD Research Software Engineer Melanie Adekale, PhD Biocurator Contractor Delphine Dauga, PhD Biocurator Contractor We work with and for to make data and other digital research assets Susanna-Assunta Sansone, PhD Principal Investigator, Associate Director and Data Consultant for Springer Nature enabling open science, driving science and discoveries