Data Consultant, 
Honorary Academic Editor 
Associate Director, 
Principal Investigator 
iDASH meeting, San Diego, Sept 15-16, 2014 
The rise of the data-centric research and publication enterprises
Susanna-Assunta Sansone, PhD 
@biosharing 
@isatools 
@scientificdata 
Board of Directors; Technical Advisory Board; 
Coordinating Editors; Sector Lead
Credit to: 
https://projects.ac/blog/five-top-reasons-to-protect-your-data-and-practise-safe-science/
Worldwide movement for FAIR data 
Credit: Barend Mons
http://bd2k.nih.gov/workshops.html#ADDS
Doing my fair share of work 
Increase the level of annotation at the source, tracking provenance and using community standards 
Notes and narrative Spreadsheets and tables Linked data and nanopublications 
Notes in Lab Books 
(information for humans) 
Spreadsheets and Tables 
( the compromise) 
Facts as RDF statements 
(information for machines) 
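As a rough illustration of the step from "the compromise" to "information for machines", the sketch below turns one spreadsheet row into subject-predicate-object statements and prints them as N-Triples lines. The `http://example.org/` namespace, the column names and the helper functions are all hypothetical, chosen only for this example; a real project would use agreed identifiers and ontology terms.

```python
from urllib.parse import quote

BASE = "http://example.org/"  # hypothetical namespace, for illustration only


def row_to_triples(row, id_column):
    """Turn one spreadsheet row (a dict) into subject-predicate-object triples."""
    subject = f"<{BASE}sample/{quote(str(row[id_column]))}>"
    triples = []
    for column, value in row.items():
        if column == id_column or value in ("", None):
            continue  # the identifier becomes the subject; empty cells state nothing
        name = column.strip().lower().replace(" ", "_")
        predicate = f"<{BASE}property/{quote(name)}>"
        # Escape backslashes and double quotes so the literal stays valid
        escaped = str(value).replace("\\", "\\\\").replace('"', '\\"')
        triples.append((subject, predicate, f'"{escaped}"'))
    return triples


def to_ntriples(triples):
    """Serialise triples as N-Triples lines (one statement per line)."""
    return "\n".join(f"{s} {p} {o} ." for s, p, o in triples)


# One row as it might appear in a sample spreadsheet (made-up values)
row = {"Sample Name": "liver-01", "Organism": "Mus musculus", "Age": "8 weeks", "Notes": ""}
statements = row_to_triples(row, id_column="Sample Name")
print(to_ntriples(statements))
```

Each non-empty cell becomes one statement about the sample, which is what makes the content addressable by software rather than only readable by people.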
Working with and for:
The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta 
Sansone www.ebi.ac.uk/net-project 
To make any dataset ‘FAIR’, one must have standards, tools and best practices to:
• report sufficient details
• capture all salient features of the experimental workflow
• make annotation explicit and discoverable
• structure the descriptions for consistency
• ensure/regulate access
• deposit and publish
• etc.
…breadth and depth 
of the experimental context 
…is pivotal
sample characteristic(s) 
experimental design 
experimental variable(s) 
technology(s) 
measurement(s) 
protocol(s) 
data file(s) 
......
The role of reporting or content standards 
Community-developed “norms” set to structure and enrich the 
description of datasets, facilitating understanding, sharing and reuse 
• Including minimum information reporting requirements, or checklists, to report the same core, essential information
• Including controlled vocabularies, taxonomies, thesauri, ontologies etc. to use the same word and refer to the same ‘thing’
• Including a conceptual model or conceptual schema from which an exchange format is derived, to allow data to flow from one system to another
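A minimal sketch of how the first two kinds of content standard might be applied to a single record: a checklist says which fields must be present, and a controlled vocabulary maps different words to the same ‘thing’. The field names, vocabulary terms and `EX:` identifiers below are invented for illustration and are not taken from any published guideline or ontology.

```python
# Hypothetical minimum-information checklist: field names are illustrative only.
REQUIRED_FIELDS = ("organism", "sample_type", "protocol", "data_file")

# Hypothetical controlled vocabulary: several accepted words, one identifier each.
ORGANISM_TERMS = {
    "mus musculus": "EX:0001",
    "mouse": "EX:0001",
    "homo sapiens": "EX:0002",
    "human": "EX:0002",
}


def check_record(record):
    """Return a list of problems; an empty list means the record passes."""
    problems = []
    for field in REQUIRED_FIELDS:
        value = record.get(field)
        if value is None or not str(value).strip():
            problems.append(f"missing required field: {field}")
    organism = str(record.get("organism") or "").strip().lower()
    if organism and organism not in ORGANISM_TERMS:
        problems.append(f"organism not in controlled vocabulary: {record['organism']}")
    return problems


def normalise(record):
    """Return a copy with the vocabulary identifier added for a known organism."""
    out = dict(record)
    key = str(record.get("organism") or "").strip().lower()
    if key in ORGANISM_TERMS:
        out["organism_id"] = ORGANISM_TERMS[key]
    return out


good = {"organism": "Mouse", "sample_type": "liver",
        "protocol": "RNA extraction", "data_file": "run1.txt"}
bad = {"organism": "mice", "sample_type": "liver", "protocol": ""}

print(check_record(good))
print(check_record(bad))
print(normalise(good)["organism_id"])
```

Here "Mouse" and "Mus musculus" resolve to the same identifier, while "mice" is flagged because it is not a term the (made-up) vocabulary recognises.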
A community mobilization - some examples 
de jure: standard organizations 
de facto: grass-roots groups 
Nanotechnology Working Group
Organizational and operational structures - quite diverse 
Fragmentation, duplications and gaps 
Technologically-delineated views of the world: transcriptomics, proteomics, metabolomics (arrays, scanning, MS, gels, columns, NMR, FTIR) 
Biologically-delineated views of the world: plant biology, epidemiology, microbiology 
Generic features (‘common core’) 
- description of source biomaterial 
- experimental design components 
To compare and integrate data we need interoperable standards
Growing number of reporting standards 
~ 156 
~ 70 
~ 334 
Source: BioPortal 
Databases, annotation and curation tools implementing standards 
Minimum information checklists: MIAME, MIAPA, MIRIAM, MIQAS, MIX, MIGEN, CIMR, MIAPE, MIASE, REMARK, MIQE, CONSORT, MISFISHIE… 
Exchange formats: MAGE-Tab, GCDML, SRAxml, SOFT, FASTA, DICOM, SBRML, MzML, GELML, SEDML, ISA-Tab, CML, MITAB… 
Terminologies: AAO, CHEBI, OBI, PATO, ENVO, MOD, TEDDY, BTO, IDO, XAO, PRO, DO, VO…
Which standards and databases can we use/recommend?
BioSharing works to map the landscape of content standards in the life sciences, broadly covering biological, natural and biomedical sciences. 
The web-based, curated and searchable registry works to ensure the standards are informative and discoverable, monitoring their development and evolution, as well as their use in databases and adoption in data policies.
BioSharing’s goal is to assist stakeholders to make informed decisions: 
• researchers, developers and curators who lack support and guidance on how best to navigate and select the various content standards and understand their maturity, or find databases that implement them; 
• funders, journals and librarians who do not have enough information to make informed decisions on which content standards or databases should be recommended in their policies, funded or implemented.
Operational Team 
Advisory Board and RDA Working Group
Core functionalities: 
• search and filtering 
• submission forms to add new records 
• “claim” functionality for existing records 
• personal profile (as maintainer of records) linked to the ORCID profile 
• visualization and views of content 
Current content: 
• Over 500 
• Over 600
Registering and cataloging is just step one; the next steps include: 
• Develop assessment criteria for usability and popularity of standards 
• Associate standards to data policies and databases 
• Assemble journal and funder policies re data storage 
• Make fully cross-searchable 
• Continue to embed it in the ecosystem of complementary registries 
CTSA Omics Data Standards Working Group
General-purpose, configurable format for the description of experimental metadata 
Designed to support: 
• provenance tracking 
• use of community minimal reporting guidelines and terminologies 
- reference system to link to (CDISC) SDTM files; further connections explored via 
Designed to be converted to: 
• a growing number of other metadata formats, e.g. used by EBI repositories 
• RDF representation with mapping to several ontologies, incl. PROV-O to deliver 
Figure labels: analysis, method, script, data file or record in a database
ISA powers data collection, curation resources and repositories, e.g.: 
Embedding and in activities 
CEDAR: 
Center for Expanded Data Annotation and Retrieval 
(PI: Musen; pending final decision and notification of award) 
The centre will take advantage of the recent growth in community-driven metadata standards to develop innovative methods to facilitate the annotation, cataloguing, and retrieval of dataset collections.
Role of publishers as “agents of change” 
• Data have to become an integral part of scholarly communication 
• Responsibilities lie across several stakeholder groups: researchers, data centers, librarians, funding agencies and publishers 
• Publishers occupy a leverage point in this process
Launched on May 27th, 2014 
• Credit for sharing your data 
• Focused on reuse and reproducibility 
• Peer reviewed, curated 
• Promoting Community Data Repositories 
• Open Access 
A new online-only publication for descriptions of scientifically valuable datasets 
in the life, environmental and biomedical sciences, but not limited to these 
Supported by:
Data Descriptor: narrative and structure 
Experimental metadata or structured component (in-house curated, machine-readable formats) 
Article or narrative component (PDF and HTML)
Data Descriptor - focus on reuse 
Detailed descriptions of methods and technical analyses supporting quality 
of the measurements; does not contain tests of new scientific hypotheses 
Sections: 
• Title 
• Abstract 
• Background & Summary 
• Methods 
• Technical Validation 
• Data Records 
• Usage Notes 
• Figures & Tables 
• References 
• Data Citations 
In traditional publications this information is not provided in a sufficiently detailed manner. 
However, this information is essential for understanding, reusing, and reproducing datasets.
Relation with traditional articles - content 
Scientific hypotheses: 
Synthesis 
Analysis 
Conclusions 
Methods and technical analyses supporting the quality 
of the measurements: 
What did I do to generate the data? 
How was the data processed? 
Where is the data? 
Who did what, and when?
Relation with traditional articles - time 
BEFORE: get your data to the community as soon as possible (see NPG pre-publication policy) 
AT THE SAME TIME: publish your Data Descriptor(s) alongside research article(s) 
AFTER: expand on your research articles, adding further information for reuse of the data
Citations of and links to data files - databases 
Joint Declaration of Data Citation Principles by 
the Data Citation Synthesis Group
Value added component integrated in a 
growing ecosystem 
We currently recognize over 
50 public data repositories 
Figure labels: Research papers, Data Descriptors, Data records
Peer review process focused on quality and reuse 
Evaluation is not based on the perceived impact or novelty of the findings 
• Experimental rigour and technical data quality 
o Methodologically sound 
o Technical validation experiments and statistical analyses 
o Depth, coverage, size, and/or completeness of data sufficient for the types 
of applications 
• Completeness of the description 
o Sufficient details to allow others to reproduce the results, reuse or 
integrate it with other data 
o Compliance with relevant minimum information or reporting standards 
• Integrity of the data files and repository record 
o Data files match the descriptions in the Data Descriptor 
o Deposited in the most appropriate available data repository
Current content is diverse - bimonthly releases 
• Neuroscience, ecology, epidemiology, environmental science, functional genomics, metabolomics, toxicology etc. 
• New and previously published individual datasets, curated aggregations and citizen science: 
o a fuller, more in-depth look at the data processing steps, supported by additional data files and code from each step 
o additional tutorial-like information for scientists interested in reusing the data or integrating it with their own 
• Datasets in figshare, Dryad and domain-specific databases 
• Code deposited in figshare and GitHub 
• First collection: 
Acknowledgements 
Advisory Boards and Collaborators 
Philippe Rocca-Serra, PhD 
Alejandra Gonzalez-Beltran, PhD 
Eamonn Maguire 
Milo Thurston, PhD 
Visit: nature.com/scientificdata 
Email: scientificdata@nature.com 
Tweet: @ScientificData 
Honorary Academic Editor 
Susanna-Assunta Sansone, PhD 
Managing Editor 
Andrew L Hufton, PhD 
Editorial Curator 
Victoria Newman 
Advisory Panel and Editorial Board including 
senior researchers, funders, librarians and curators

Más contenido relacionado

La actualidad más candente

RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8 RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8 Peter McQuilton
 
From metadata to data curation: the role of libraries in data exchange
From metadata to data curation: the role of libraries in data exchangeFrom metadata to data curation: the role of libraries in data exchange
From metadata to data curation: the role of libraries in data exchangeLIBER Europe
 
Publishing perspectives on data management & future directions
Publishing perspectives on data management & future directionsPublishing perspectives on data management & future directions
Publishing perspectives on data management & future directionsARDC
 
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017ARDC
 
Presentation to the UM Library Emergent Research Series
Presentation to the UM Library Emergent Research SeriesPresentation to the UM Library Emergent Research Series
Presentation to the UM Library Emergent Research SeriesSEAD
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...SEAD
 
Poster RDAP13: Research Data in eCommons @ Cornell: Present and Future
Poster RDAP13: Research Data in eCommons @ Cornell: Present and FuturePoster RDAP13: Research Data in eCommons @ Cornell: Present and Future
Poster RDAP13: Research Data in eCommons @ Cornell: Present and FutureASIS&T
 
The UC Curation Center (UC3): Developing Tools & Services for Managing Research
The UC Curation Center (UC3): Developing Tools & Services for Managing ResearchThe UC Curation Center (UC3): Developing Tools & Services for Managing Research
The UC Curation Center (UC3): Developing Tools & Services for Managing ResearchUniversity of California Curation Center
 
Author identifiers & research impact: A role for libraries
Author identifiers & research impact: A role for librariesAuthor identifiers & research impact: A role for libraries
Author identifiers & research impact: A role for librariesKristi Holmes
 
Evolving Roles in Scholarly Communications
Evolving Roles in Scholarly Communications Evolving Roles in Scholarly Communications
Evolving Roles in Scholarly Communications LIBER Europe
 
Alain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersAlain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersIncisive_Events
 
RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot ProjectRDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot ProjectASIS&T
 
A Data Curation Framework: Data Curation and Research Support Services
A Data Curation Framework: Data Curation and Research Support ServicesA Data Curation Framework: Data Curation and Research Support Services
A Data Curation Framework: Data Curation and Research Support ServicesSusanMRob
 
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...ARDC
 
Scholarly Information Practices In The Online Environment
Scholarly Information Practices In The Online EnvironmentScholarly Information Practices In The Online Environment
Scholarly Information Practices In The Online EnvironmentOCLC Research
 

La actualidad más candente (20)

RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8 RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
RDA P16 - Repository Selection Criteria - Funders IG Breakout 8
 
From metadata to data curation: the role of libraries in data exchange
From metadata to data curation: the role of libraries in data exchangeFrom metadata to data curation: the role of libraries in data exchange
From metadata to data curation: the role of libraries in data exchange
 
Publishing perspectives on data management & future directions
Publishing perspectives on data management & future directionsPublishing perspectives on data management & future directions
Publishing perspectives on data management & future directions
 
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
Research Data Management in practice, RIA Data Management Workshop Brisbane 2017
 
Presentation to the UM Library Emergent Research Series
Presentation to the UM Library Emergent Research SeriesPresentation to the UM Library Emergent Research Series
Presentation to the UM Library Emergent Research Series
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
 
Poster RDAP13: Research Data in eCommons @ Cornell: Present and Future
Poster RDAP13: Research Data in eCommons @ Cornell: Present and FuturePoster RDAP13: Research Data in eCommons @ Cornell: Present and Future
Poster RDAP13: Research Data in eCommons @ Cornell: Present and Future
 
Baker - Evolution of Data Products and Designated Audiences
Baker - Evolution of Data Products and Designated AudiencesBaker - Evolution of Data Products and Designated Audiences
Baker - Evolution of Data Products and Designated Audiences
 
The UC Curation Center (UC3): Developing Tools & Services for Managing Research
The UC Curation Center (UC3): Developing Tools & Services for Managing ResearchThe UC Curation Center (UC3): Developing Tools & Services for Managing Research
The UC Curation Center (UC3): Developing Tools & Services for Managing Research
 
Jonathan Breeze, Symplectic
Jonathan Breeze, SymplecticJonathan Breeze, Symplectic
Jonathan Breeze, Symplectic
 
Author identifiers & research impact: A role for libraries
Author identifiers & research impact: A role for librariesAuthor identifiers & research impact: A role for libraries
Author identifiers & research impact: A role for libraries
 
Evolving Roles in Scholarly Communications
Evolving Roles in Scholarly Communications Evolving Roles in Scholarly Communications
Evolving Roles in Scholarly Communications
 
Alain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersAlain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producers
 
RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot ProjectRDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
A Data Curation Framework: Data Curation and Research Support Services
A Data Curation Framework: Data Curation and Research Support ServicesA Data Curation Framework: Data Curation and Research Support Services
A Data Curation Framework: Data Curation and Research Support Services
 
Zucca "Technology & Systems"
Zucca "Technology & Systems"Zucca "Technology & Systems"
Zucca "Technology & Systems"
 
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
Introduction to the Research Integrity Advisor Data Management Workshop, Bris...
 
Scholarly Information Practices In The Online Environment
Scholarly Information Practices In The Online EnvironmentScholarly Information Practices In The Online Environment
Scholarly Information Practices In The Online Environment
 
Johnston - How to Curate Research Data
Johnston - How to Curate Research DataJohnston - How to Curate Research Data
Johnston - How to Curate Research Data
 

Similar a Data Consultant and Researcher's Presentation on FAIR Data Standards

INSERM - Data Management & Reuse of Health Data - May 2017
INSERM - Data Management & Reuse of Health Data - May 2017INSERM - Data Management & Reuse of Health Data - May 2017
INSERM - Data Management & Reuse of Health Data - May 2017Susanna-Assunta Sansone
 
Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Susanna-Assunta Sansone
 
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014Susanna-Assunta Sansone
 
ELIXIR Webinar: BioSharing
ELIXIR Webinar: BioSharingELIXIR Webinar: BioSharing
ELIXIR Webinar: BioSharingPeter McQuilton
 
Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Overview of standards/stakeholders in life science (RDA Engagement Interest G...Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Overview of standards/stakeholders in life science (RDA Engagement Interest G...Susanna-Assunta Sansone
 
Cross-linked metadata standards, repositories and the data policies - The Bio...
Cross-linked metadata standards, repositories and the data policies - The Bio...Cross-linked metadata standards, repositories and the data policies - The Bio...
Cross-linked metadata standards, repositories and the data policies - The Bio...Peter McQuilton
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 Scott Edmunds
 
RDA BioSharing WG + RDA Metabolomics IG OVERVIEWS
RDA BioSharing WG + RDA Metabolomics IG OVERVIEWSRDA BioSharing WG + RDA Metabolomics IG OVERVIEWS
RDA BioSharing WG + RDA Metabolomics IG OVERVIEWSSusanna-Assunta Sansone
 
Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...Susanna-Assunta Sansone
 
GARNet workshop on Integrating Large Data into Plant Science
GARNet workshop on Integrating Large Data into Plant ScienceGARNet workshop on Integrating Large Data into Plant Science
GARNet workshop on Integrating Large Data into Plant ScienceDavid Johnson
 
BioSharing, an ELIXIR Interoperability Platform resource
BioSharing, an ELIXIR Interoperability Platform resourceBioSharing, an ELIXIR Interoperability Platform resource
BioSharing, an ELIXIR Interoperability Platform resourcePeter McQuilton
 
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014Susanna-Assunta Sansone
 
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...Peter McQuilton
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
Scientific Data and peer review session at Dryad event, May 2015
Scientific Data and peer review session at Dryad event, May 2015 Scientific Data and peer review session at Dryad event, May 2015
Scientific Data and peer review session at Dryad event, May 2015 Susanna-Assunta Sansone
 
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014Susanna-Assunta Sansone
 
FAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology AgencyFAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology AgencyPeter McQuilton
 

Similar a Data Consultant and Researcher's Presentation on FAIR Data Standards (20)

INSERM - Data Management & Reuse of Health Data - May 2017
INSERM - Data Management & Reuse of Health Data - May 2017INSERM - Data Management & Reuse of Health Data - May 2017
INSERM - Data Management & Reuse of Health Data - May 2017
 
Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.
 
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
Overview to: BBSRC Oxford Doctoral Training Partnership - Dr Sansone - July 2014
 
ELIXIR Webinar: BioSharing
ELIXIR Webinar: BioSharingELIXIR Webinar: BioSharing
ELIXIR Webinar: BioSharing
 
Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Overview of standards/stakeholders in life science (RDA Engagement Interest G...Overview of standards/stakeholders in life science (RDA Engagement Interest G...
Overview of standards/stakeholders in life science (RDA Engagement Interest G...
 
Sansone mibbi-intro
Sansone mibbi-introSansone mibbi-intro
Sansone mibbi-intro
 
Cross-linked metadata standards, repositories and the data policies - The Bio...
Cross-linked metadata standards, repositories and the data policies - The Bio...Cross-linked metadata standards, repositories and the data policies - The Bio...
Cross-linked metadata standards, repositories and the data policies - The Bio...
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
 
RDA BioSharing WG + RDA Metabolomics IG OVERVIEWS
RDA BioSharing WG + RDA Metabolomics IG OVERVIEWSRDA BioSharing WG + RDA Metabolomics IG OVERVIEWS
RDA BioSharing WG + RDA Metabolomics IG OVERVIEWS
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
 
Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...Scientific Data overview of Data Descriptors - WT Data-Literature integration...
Scientific Data overview of Data Descriptors - WT Data-Literature integration...
 
GARNet workshop on Integrating Large Data into Plant Science
GARNet workshop on Integrating Large Data into Plant ScienceGARNet workshop on Integrating Large Data into Plant Science
GARNet workshop on Integrating Large Data into Plant Science
 
BioSharing, an ELIXIR Interoperability Platform resource
BioSharing, an ELIXIR Interoperability Platform resourceBioSharing, an ELIXIR Interoperability Platform resource
BioSharing, an ELIXIR Interoperability Platform resource
 
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
FAIR data and NPG Scientific Data: RIKEN Yokohama, 25 June, 2014
 
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
The Diversity of Biomedical Data, Databases and Standards (Research Data Alli...
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Scientific Data and peer review session at Dryad event, May 2015
Scientific Data and peer review session at Dryad event, May 2015 Scientific Data and peer review session at Dryad event, May 2015
Scientific Data and peer review session at Dryad event, May 2015
 
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
NPG Scientific Data - Metabolomics Society meeting, Tsuruola, Japan, 2014
 
FAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology AgencyFAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology Agency
 
BioSharing - Update - Feb2016
BioSharing - Update - Feb2016BioSharing - Update - Feb2016
BioSharing - Update - Feb2016
 

Data Consultant and Researcher's Presentation on FAIR Data Standards

  • 1. Data Consultant, Honorary Academic Editor Associate Director, Principal Investigator iDASH meeting, San Diego, Sept 15-16, 2014 The rise of the data-centric research and publication enterprises Susanna-Assunta Sansone, PhD @biosharing @isatools @scientificdata Board of Directors; Technical Advisory Board; Coordinating Editors; Sector Lead
  • 3. Worldwide movement for FAIR data Credit: Barend Mons
  • 4. Worldwide movement for FAIR data Credit: Barend Mons http://bd2k.nih.gov/workshops.html#ADDS
  • 5. Doing my fair share of work: increase the level of annotation at the source, tracking provenance and using community standards. Notes and narrative: notes in lab books (information for humans). Spreadsheets and tables: the compromise. Linked data and nanopublications: facts as RDF statements (information for machines). Working with and for:
  • 6. The International Conference on Systems Biology (ICSB), 22-28 August, 2008 Susanna-Assunta Sansone www.ebi.ac.uk/net-project. To make any dataset ‘FAIR’, one must have standards, tools and best practices to: • report sufficient details • capture all salient features of the experimental workflow • make annotation explicit and discoverable • structure the descriptions for consistency • ensure/regulate access • deposit and publish • etc.
  • 7. …breadth and depth of the experimental context …is pivotal
  • 8. sample characteristic(s), experimental design, experimental variable(s), technology(s), measurement(s), protocol(s), data file(s), ...
  • 9. The role of reporting or content standards: community-developed “norms” set to structure and enrich the description of datasets, facilitating understanding, sharing and reuse. • Including minimum information reporting requirements, or checklists, to report the same core, essential information • Including controlled vocabularies, taxonomies, thesauri, ontologies etc. to use the same word and refer to the same ‘thing’ • Including conceptual model, conceptual schema from which an exchange format is derived, to allow data to flow from one system to another
  • 10. A community mobilization - some examples de jure de facto grass-roots groups standard organizations Nanotechnology Working Group
  • 11. Organizational and operational structures - quite diverse de jure de facto grass-roots groups standard organizations Nanotechnology Working Group
  • 12. Fragmentation, duplications and gaps. Technologically-delineated views of the world: Arrays, MS, Gels, Columns, Scanning, Arrays & Scanning, NMR, FTIR. Biologically-delineated views of the world: transcriptomics, proteomics, metabolomics, plant biology, epidemiology, microbiology. Generic features (‘common core’): - description of source biomaterial - experimental design components. To compare and integrate data we need interoperable standards
  • 13. Growing number of reporting standards: ~ 156, ~ 70, ~ 334. Source: BioPortal. Databases, annotation, curation tools implementing standards. MIAME, MIAPA, MIRIAM, MIQAS, MIX, MIGEN, CIMR, MIAPE, MIASE, REMARK, MIQE, CONSORT, MISFISHIE…; MAGE-Tab, GCDML, SRAxml, SOFT, FASTA, DICOM, SBRML, MzML, GELML, SEDML…, ISA-Tab, CML, MITAB; AAO, CHEBI, OBI, PATO, ENVO, MOD, TEDDY, BTO, IDO…, XAO, PRO, DO, VO
  • 14. Which standards and databases can we use/recommend?
  • 15. BioSharing works to map the landscape of content standards in the life sciences, broadly covering biological, natural and biomedical sciences. The web-based, curated and searchable registry works to ensure the standards are informative and discoverable, monitoring their development and evolution, as well as their use in databases and adoption in data policies.
  • 16. BioSharing’s goal is to assist stakeholders to make informed decisions: • researchers, developers and curators, who lack support and guidance on how to best navigate and select the various content standards and understand their maturity, or find databases that implement them; • funders, journals and librarians, who do not have enough information to make informed decisions on which content standards or databases should be recommended in their policies, or funded or implemented.
  • 17. Operational Team Advisory Board and RDA Working Group
  • 18. Core functionalities: • search and filtering • submission forms to add new records • “claim” functionality for existing records • person’s profile (as maintainer of records) associated with the ORCID profile • visualization and views of content. Current content: • Over 500 • Over 600
  • 19. Registering and cataloging is just step one; the next steps include: • Develop assessment criteria for usability and popularity of standards. CTSA Omics Data Standards Working Group
  • 20. Registering and cataloging is just step one; the next steps include: • Develop assessment criteria for usability and popularity of standards • Associate standards to data policies and databases • Assemble journal and funder policies re data storage • Make fully cross-searchable • Continue to embed it in the ecosystem of complementary registries
  • 25. General-purpose, configurable format for the description of experimental metadata. Designed to support: • provenance tracking • use of community minimal reporting guidelines and terminologies - reference system to link to (CDISC) SDTM files; further connections explored via. Designed to be converted to: • a growing number of other metadata formats, e.g. used by EBI repositories • RDF representation with mapping to several ontologies, incl. PROV-O, to deliver analysis method script. Data file or record in a database
  • 27. ISA powers data collection, curation resources and repositories, e.g.:
  • 28. Embedding and in activities. CEDAR: Center for Expanded Data Annotation and Retrieval (PI: Musen; pending final decision and notification of award). The centre will take advantage of the recent growth in community-driven metadata standards to develop innovative methods to facilitate the annotation, cataloguing, and retrieval of dataset collections.
  • 29. Role of publishers as “agents of change” • Data has to become an integral part of scholarly communication • Responsibilities lie across several stakeholder groups: researchers, data centers, librarians, funding agencies and publishers • Publishers occupy a leverage point in this process
  • 30. A new online-only publication for descriptions of scientifically valuable datasets in the life, environmental and biomedical sciences, but not limited to these. Launched on May 27th, 2014. • Credit for sharing your data • Focused on reuse and reproducibility • Peer reviewed, curated • Promoting Community Data Repositories • Open Access. Supported by:
  • 31. Data Descriptor: narrative and structure Experimental metadata or structured component (in-house curated, machine-readable formats) Article or narrative component (PDF and HTML)
  • 33. Data Descriptor - focus on reuse. Detailed descriptions of methods and technical analyses supporting quality of the measurements; does not contain tests of new scientific hypotheses. Sections: • Title • Abstract • Background & Summary • Methods • Technical Validation • Data Records • Usage Notes • Figures & Tables • References • Data Citations. In traditional publications this information is not provided in a sufficiently detailed manner. However, this information is essential for understanding, reusing, and reproducing datasets
  • 34. Relation with traditional articles - content. Scientific hypotheses: synthesis, analysis, conclusions. Methods and technical analyses supporting the quality of the measurements: What did I do to generate the data? How was the data processed? Where is the data? Who did what, when?
  • 35. Relation with traditional articles - time BEFORE: get your data to the community as soon as possible (see NPG pre-publication policy) AT THE SAME TIME: publish your Data Descriptor(s) alongside research article(s) AFTER: expand on your research articles, adding further information for reuse of the data
  • 36. Citations of and links to data files - databases Joint Declaration of Data Citation Principles by the Data Citation Synthesis Group
  • 37. Value added component integrated in a growing ecosystem: research papers, Data Descriptors, data records. We currently recognize over 50 public data repositories
  • 38. Peer review process focused on quality and reuse. Evaluation is not based on the perceived impact or novelty of the findings • Experimental rigour and technical data quality o Methodologically sound o Technical validation experiments and statistical analyses o Depth, coverage, size, and/or completeness of data sufficient for the types of applications • Completeness of the description o Sufficient details to allow others to reproduce the results, reuse or integrate it with other data o Compliance with relevant minimum information or reporting standards • Integrity of the data files and repository record o Data files match the descriptions in the Data Descriptor o Deposited in the most appropriate available data repository
  • 39. Current content is diverse - bimonthly releases • Neuroscience, ecology, epidemiology, environmental science, functional genomics, metabolomics, toxicology etc. • New and previously published individual datasets, curated aggregations and citizen science: o a fuller, more in-depth look at the data processing steps, supported by additional data files and code from each step o additional tutorial-like information for scientists interested in reusing or integrating the data with their own • Datasets in figshare, Dryad and domain-specific databases • Code deposited in figshare and GitHub • First collection:
  • 41. Acknowledgements. Advisory Boards and Collaborators: Philippe Rocca-Serra, PhD; Alejandra Gonzalez-Beltran, PhD; Eamonn Maguire; Milo Thurston, PhD. Visit nature.com/scientificdata. Email scientificdata@nature.com. Tweet @ScientificData. Honorary Academic Editor: Susanna-Assunta Sansone, PhD. Managing Editor: Andrew L Hufton, PhD. Editorial Curator: Victoria Newman. Advisory Panel and Editorial Board including senior researchers, funders, librarians and curators
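
The section list on slide 33 can be read as a simple completeness checklist for a Data Descriptor draft. The sketch below is only an illustration of that idea: the section names are copied from the slide, but the dictionary layout, the function name `missing_sections`, and the choice to treat every listed section as required are assumptions, not the journal's actual submission tooling.

```python
# Section names as listed on slide 33 of the transcript.
# Treating all of them as mandatory is an assumption made for this sketch.
LISTED_SECTIONS = [
    "Title",
    "Abstract",
    "Background & Summary",
    "Methods",
    "Technical Validation",
    "Data Records",
    "Usage Notes",
    "Figures & Tables",
    "References",
    "Data Citations",
]


def missing_sections(descriptor: dict) -> list:
    """Return the listed sections that are absent or blank, in slide order.

    `descriptor` is a hypothetical mapping of section name -> section text.
    """
    return [
        name
        for name in LISTED_SECTIONS
        if not str(descriptor.get(name, "")).strip()
    ]


# A draft with two filled sections and one blank one (hypothetical content).
draft = {
    "Title": "Example dataset",
    "Abstract": "Short summary of the dataset.",
    "Methods": "   ",
}

for name in missing_sections(draft):
    print("missing:", name)
```

A check like this only covers the narrative component; the structured, machine-readable metadata described on slide 31 would need its own validation.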