SlideShare una empresa de Scribd logo
1 de 62
Descargar para leer sin conexión
The international FAIR movement
Behind the brand:
thinkers, doers and dreamers
Susanna-Assunta Sansone
ORCiD: 0000-0001-5306-5690 | Twitter: @SusannaASansone
Oxford Open Access Week, 10 March 2020
Slides: https://www.slideshare.net/SusannaSansone
datareadiness.eng.ox.ac.uk
Associate Professor, Engineering Science
Associate Director, Oxford e-Research Centre
A set of principles to enhance the
value of all digital resources
2014
2016
Developed and endorsed by
researchers, service
providers, publishers, funding
agencies, industry partners
And the FAIR brand is born
Findable
Accessible
Interoperable
Reusable
• Globally unique, resolvable, and persistent identifiers
▪ To retrieve and connect data
• Community defined descriptive metadata
▪ To enhance discoverability
• Common terminologies
▪ To use the same term mean the same thing
• Detailed provenance
▪ To contextualize the data and facilitate reproducibility
• Terms of access
▪ Open as possible, closed as necessary
• Terms of use
▪ Clear licences, ideally to enable innovation and reuse
Data for humans and for machines
Everybody needs data that are
• Discoverable by humans and machines
• Retrievable and structured in standard format(s)
• Self-described so that third parties can make sense of it
Better data = better science and more efficiently
Datasets SOPs Figures, Photos Workflows Slides Codes Tools DatabasesAlgorithmsDocument
FAIR mythology summarized
Just principles not practice
Slide credit: Carole Goble
“Accessible, assessable,
intelligible, reusable…anyone,
anything, anytime…”
https://royalsociety.org/topics-policy/projects/science-public-enterprise/report/?gclid=EAIaIQobChMI5vSav9qP6AIVzLTtCh0J7wE4EAAYASAAEgIPx_D_BwE
FAIR is nothing new…
FAIR-driven
agendas, policies and
programmes
doi.org/10.2777/02999doi.org/10.2777/1524
www.gov.uk/government/publications/open-
research-data-task-force-final-report
doi.org/10.5281/zenodo.1245568
www.ukri.org/research/infrastructure
…but it has aligned the broad community around
a common guidelines; some examples
“…we estimate to be 10.2
billion euros per year…”
doi.org/10.7486/DRI.tq582c863
The scholarly publishing
ecosystem is changing
Data-relates mandates by funders
and institutions are growing
Researchers must be responsibile
and accountable for their research, but
they also need recognition and credit
theconversation.com/how-robots-can-help-us-embrace-a-more-human-view-of-disability-76815
Human-machine collaboration is the
future, and AI-ready data is essential
Responding to needs and crisis
o 21% pharmacology data (doi.org/10.1038/nrd3439-c1)
o 11% cancer data (doi.org/10.1038/483531a)
o unsatisfactory in ML (openreview.net/pdf?id=By4l2PbQ-)
towardsdatascience.com/scientific-data-analysis-pipelines-and-reproducibility-75ff9df5b4c5
Reproducibility of published studies is still problematic
We research and develop methods and tools to
improve data reuse;
we work for data transparency, research integrity
and the evolution of scholarly publishing
Recognition and credit for ‘building, training and
driving changes’
www.gov.uk/government/publications/open-research-data-task-force-final-report
Prof. Pam Thomas,
Open Research Data Task Force Chair
https://twitter.com/SusannaASansone/status/1227885895426154497/photo/1
• Dialogue, engagement and commitment among
stakeholders
• Infrastructure and services as integral part of the
research data cycle
• Cooperation with activities outside UK, also with
industries and business including SMEs
• Keep under continual review
2018
www.ukri.org/research/infrastructure
Depends upon several stakeholders in the research ecosystem
actively playing their parts to:
• deliver research infrastructures, tools and standards,
policies, education and training
• address technical, social and cultural challenges
It is not simple and it requires long term investment
Making FAIR a reality
Findable
Accessible
Interoperable
20%
identifiers
80%
metadata
https://doi.org/10.2777/1524
Reusable
Two pillars of FAIR
“Most metadata field names and their values
are not standardized or controlled”
“Even simple binary or numeric fields are
often populated with inadequate values of
different data types”
Better metadata for better data
https://doi.org/10.1038/sdata.2019.21
The importance of knowledge representation and
interoperability
The ability of computer systems or software to understand information
with sufficient accuracy to utilise (e.g. represent and exchange)
that information for an intelligent purpose
Formats Terminologies Guidelines Identifiers
ID
Conceptual model, conceptual
schema, exchange formats Controlled vocabularies,
thesauri, ontologies
Minimum information reporting
requirements, checklists
Unambiguous, persistent and context-
independent identifier schema
metadata
Findable
Accessible
Interoperable
20%
identifiers
80%
metadata
Reusable
FAIR is not one size fits all
FAIR is not about harmonizing all
metadata, or publishing all in RDF
Findable
Accessible
Interoperable
Reusable
https://www.natureindex.com/news-blog/what-scientists-need-to-know-about-fair-data
doi.org/10.5281/zenodo.1245568
The road to FAIRness
Before FAIR
http://blogs.nature.com/scientificdata/2019/10/22/the-layered-cake
The road to FAIRness
Before FAIR
After FAIR
http://blogs.nature.com/scientificdata/2019/10/22/the-layered-cake
The road to FAIRness
Before FAIR
After FAIR
….from chaos,
comes order?
The EOSC layered cake of FAIR
http://blogs.nature.com/scientificdata/2019/10/22/the-layered-cake
https://www.eoscsecretariat.eu
And a growing number of FAIR-related EOSC-funded projects in all disciplines; for example:
Growing number of reports, surveys and
recommendations
http://blogs.nature.com/scientificdata/2019/10/22/the-layered-cake
https://www.eoscsecretariat.eu/events/webinar-coming-together-map-national-landscape-analysis-eosc-between-5-relevant-projects
My fair share
of the work
€3.3 billion
programme
2014 - 2020
€300 million
programme
2018 - 2020
European
intergovernmental
organisation
23 member
countries and
over 180 research
organisations
Since 2014
1
2
3 Started in 2019
FAIR-enabling EU and USA biomedical infrastructure
programmes and projects, e.g.
Since in 2014, several programs:
2014-2017
2017-2018
Organization and structure
• Hub and (national) Nodes
• Community-driven and rooted
• Strong focus on interoperability
• SMEs and Industry links
• Cross-nodes funded activities
model and related formats
Initiated in
2003
Open source tools and formats to help researchers to:
describe multi-modal experiments
follow community-developed standards
curate, analyze, release, share and publish
Nowadays ISA (format and/or tools)
powers over 30 public resources, e.g.,
Funded by
Part of the
ISA-InterMine project
Reproducibility – FAIR at the first mile
From curated, structured metadata to data paper
datascriptor.org
Academics from several ELIXIR Nodes, with Janssen, AstraZeneca, Eli Lilly,
GSK, Novartis, Bayer, Boehringer Ingelheim
Define, document and implement a data FAIRification process:
Rocca-Serra and Sansone.
Experiment design driven FAIRification of
omics data matrices, an exemplar
Scientific Data 2019.
https://doi.org/10.1038/s41597-019-0286-0
Practical examples: data FAIRifications recipe
https://fairplus.github.io/the-fair-cookbook
https://www.turing.ac.uk/research/impact-stories/changing-culture-data-science
Based on the successful example
of the ATI’s Turing Way book
https://fairplus.github.io/the-fair-cookbook
Participatory and open
FAIR-driven biopharma R&D, e.g.
Implementation of FAIR by a company
Requirements
• Financial investment for technical infrastructure, training & education
• Make use of existing FAIR tools & best practices
• Show business value through transformation to digital drug discovery
A three-pronged approach
1. Build a data catalogue for Findability of all data assets in the company
Tip: Prioritise for datasets which will quickly demonstrate benefits
2. Bring data together into a virtual, federated, infrastructure, so that data with the
right credentials become instantaneously Accessible for human and machine
Tip: Build on previous efforts to move to cloud services
3. Make selected datasets Interoperable and Reusable on the metadata as well at
the individual data point level
Tip: Most costly, so important to determine likely return on business value
390+
162+
729+
~1300
13
MIAME
MIRIAM
MIQAS
MIX
MIGEN
ARRIVE
MIAPE
MIASE
MIQE
MISFISHIE
….
REMARK
CONSORT
SRAxml
SOFT FASTA
DICOM
MzML
SBRML
SEDML
…
GELML
ISA
CML
MITAB
…
AAO
CHEBIOBI
PATO ENVO
MOD
BTO
IDO
…
TEDDY
PRO
XAO
DO
VO
EC number
URL
PURLLSID
HandleORCID
RRID
InChI
…
IVOA ID
DOI
standard
organizations
grass-roots
groups
Formats Terminologies Guidelines Identifiers
ID
COMMUNITY STANDARDS
for metadata and identifiers
Domain-specific standards for datasets, e.g.
https://doi.org/10.6084/m9.figshare.3795816.v2
https://doi.org/10.6084/m9.figshare.4055496.v1
Analysis of the standards landscape in the life and
biomedical sciences
Fragmentation, duplication and gaps:
• Perspective and focus vary:
• Motivation is diverse
• Governance and participation vary
2013
2016
doi: 10.1126/science.1180598
doi:10.1038/nbt1346
OBO Portal and Foundry Portal and Foundrydoi: 10.1038/nbt.1411
Over 10+ years of community engagement
69 authors: adopters, collaborators and users
Open Access CC-BY
2007 20082009
2011 2019
doi.org/10.1038/s41587-019-0080-8
Since
2011
Currently primarily funded by
Formats Terminologies Guidelines Identifiers
ID
REPOSITORIES,
databases and
knowledgebases
DATA POLICIES
by journals, funders, and
other organizations
COMMUNITY STANDARDS
for metadata and identifiers
informative and educational resource
Curated inter-linked
descriptions
Formats Terminologies Guidelines Identifiers
ID
informative and educational resource
Curated inter-linked
descriptions
All records are manually curated
in-house, verified and claimed by the
community behind each resource
Ready for use, implementation, or recommendation
In development
Status uncertain
Deprecated as subsumed or superseded
REPOSITORIES,
databases and
knowledgebases
DATA POLICIES
by journals, funders, and
other organizations
COMMUNITY STANDARDS
for metadata and identifiers
Formats Terminologies Guidelines Identifiers
ID
REPOSITORIES,
databases and
knowledgebases
DATA POLICIES
by journals, funders, and
other organizations
COMMUNITY STANDARDS
for metadata and identifiers
informative and educational resource
Curated inter-linked
descriptions
We guide consumers to discover, select and use these
resources with confidence
We help producers to make their resources more visible, more
widely adopted and cited
Tracking evolution of (meta)data standards
Describing access and use of databases and
repositories
Describing the standards these databases and
repositories implement
Tracking which databases and standards are
recommended by (publishers) data policies
Maintaining these descriptions by working with the
owners of these resources, citing and crediting them
“The interactive browser will allow us to discover which databases and standards
are not currently included in our author guidelines, enabling us to regularly
monitor and refine our policies as appropriate, in support of our mission to help
our authors enhance the reproducibility of their work.”
H. Murray. Publishing Editor, F1000Research
Researchers in
academia, industry,
government
Developers and
curators of resources
Journal publishers or
organizations with data
policy
Research data
facilitators, librarians,
trainers
Learned societies,
unions and
associations
Funders and data
policy makers
A flagship output of and a WG in:
Recommended by funders, e.g.:
https://www.rd-alliance.org/enabling-open-research-publishing
Analysed the data policies by
journals/publishers, and the standards and
repositories they recommend
Working with journal editors and publishers
Participating
publishers:
Discrepancy in recommendation across the data policies
• some repositories are named, but very few standards are
• cautious approach due to the wealth of existing resources
Recommendations are often driven by
• the editor’s familiarity with one or more standards, notably for
journals or publishers focusing on specific disciplines
• the engagement with learned societies and researchers
actively supporting and using certain resources
Ø Consensus: FAIRsharing plays a key role in helping
editors to discover and recommend appropriate resources
What have we learned and are doing now
Participating
publishers:
Working on a set of criteria that journals and
publishers believe are important for the
identification and selection of data
repositories, which can be recommended to
researchers when they are preparing to
publish the data underlying their findings
Participating
publishers:
Data Repository Selection: Criteria That Matter
Pre-print:
https://osf.io/m2bce
Collaboration:
Participating
publishers:
Data Repository Selection: Criteria That Matter
Pre-print:
https://osf.io/m2bce
Harmonize journals and publishers’ data deposition
guidelines by defining a common set of criteria for
repository selection
Criteria include:
• Access conditions
• Reuse condition
• Deposition conditions
• Unique ID schema
• User support
• Curation
• …….
Collaboration:
Increase the number and the clarity of journals and funders
data policies by classifying the recommendations these policies contain
to improve their definition and guidance to researchers
Collaboration:
Workplan – phase 1:
Curate and assess their compliance to the Transparency and Openness Promotion
(TOP) guidelines and display the level in FAIRsharing
Assessments and
evaluations
A growing numbers of metrics, indicators, models and
certifications for FAIRness and trustworthiness
https://fairassist.org
Keeping track of these manual, semi-manual or automatic
tools the community is developing at:
infrastructures
standards
tools
policies
education
training
cultural normalization
incentives
long term investment
A FAIRy tale needs some magic…
It is not simple, but it is no longer optional
and our collaborators
datareadiness.eng.ox.ac.uk
FAIR synchrotron
data project
DPhil student
starting
Oct 2020

Más contenido relacionado

La actualidad más candente

EnablingFAIR - Open research data in the UK
EnablingFAIR - Open research data in the UKEnablingFAIR - Open research data in the UK
EnablingFAIR - Open research data in the UKSusanna-Assunta Sansone
 
Behind the FAIR brand: Thinkers, Doers and Dreamers
Behind the FAIR brand: Thinkers, Doers and DreamersBehind the FAIR brand: Thinkers, Doers and Dreamers
Behind the FAIR brand: Thinkers, Doers and DreamersSusanna-Assunta Sansone
 
FAIRsharing COVID-19 Collection for The Global Health Network
FAIRsharing COVID-19 Collection for The Global Health NetworkFAIRsharing COVID-19 Collection for The Global Health Network
FAIRsharing COVID-19 Collection for The Global Health NetworkSusanna-Assunta Sansone
 
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR CookbookFAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR CookbookSusanna-Assunta Sansone
 
FAIR data and standards for a coordinated COVID-19 response
FAIR data and standards for a coordinated COVID-19 responseFAIR data and standards for a coordinated COVID-19 response
FAIR data and standards for a coordinated COVID-19 responseSusanna-Assunta Sansone
 
Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Susanna-Assunta Sansone
 
2021 04 Introduction to FAIRsharing - cineca
2021 04 Introduction to FAIRsharing - cineca2021 04 Introduction to FAIRsharing - cineca
2021 04 Introduction to FAIRsharing - cinecaAllyson Lister
 
FAIRsharing - focus on standards and new features
FAIRsharing - focus on standards and new features FAIRsharing - focus on standards and new features
FAIRsharing - focus on standards and new features Susanna-Assunta Sansone
 
FAIRsharing: how we assist with FAIRness
FAIRsharing: how we assist with FAIRnessFAIRsharing: how we assist with FAIRness
FAIRsharing: how we assist with FAIRnessSusanna-Assunta Sansone
 
FAIRsharing: curating an ecosystem of research standards and databases
FAIRsharing: curating an ecosystem of research standards and databasesFAIRsharing: curating an ecosystem of research standards and databases
FAIRsharing: curating an ecosystem of research standards and databasesAllyson Lister
 
OeRC_BioNatMedSciences_TeamOverview_Dec2013
OeRC_BioNatMedSciences_TeamOverview_Dec2013OeRC_BioNatMedSciences_TeamOverview_Dec2013
OeRC_BioNatMedSciences_TeamOverview_Dec2013Susanna-Assunta Sansone
 
RDA BioSharing WG/ELIXIR Session Montreal 2017
RDA BioSharing WG/ELIXIR Session Montreal 2017RDA BioSharing WG/ELIXIR Session Montreal 2017
RDA BioSharing WG/ELIXIR Session Montreal 2017Peter McQuilton
 
NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013Susanna-Assunta Sansone
 

La actualidad más candente (20)

EnablingFAIR - Open research data in the UK
EnablingFAIR - Open research data in the UKEnablingFAIR - Open research data in the UK
EnablingFAIR - Open research data in the UK
 
All Things Biocuration
All Things BiocurationAll Things Biocuration
All Things Biocuration
 
Behind the FAIR brand: Thinkers, Doers and Dreamers
Behind the FAIR brand: Thinkers, Doers and DreamersBehind the FAIR brand: Thinkers, Doers and Dreamers
Behind the FAIR brand: Thinkers, Doers and Dreamers
 
FAIRsharing COVID-19 Collection for The Global Health Network
FAIRsharing COVID-19 Collection for The Global Health NetworkFAIRsharing COVID-19 Collection for The Global Health Network
FAIRsharing COVID-19 Collection for The Global Health Network
 
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR CookbookFAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
 
The FAIR Cookbook in a nutshell
The FAIR Cookbook in a nutshellThe FAIR Cookbook in a nutshell
The FAIR Cookbook in a nutshell
 
FAIR overview - MAQC Society, Feb 2018
FAIR overview - MAQC Society, Feb 2018FAIR overview - MAQC Society, Feb 2018
FAIR overview - MAQC Society, Feb 2018
 
FAIR data and standards for a coordinated COVID-19 response
FAIR data and standards for a coordinated COVID-19 responseFAIR data and standards for a coordinated COVID-19 response
FAIR data and standards for a coordinated COVID-19 response
 
FAIRsharing for RDA Funders Forum
FAIRsharing for RDA Funders ForumFAIRsharing for RDA Funders Forum
FAIRsharing for RDA Funders Forum
 
Metadata for Interoperable Bioscience
Metadata for Interoperable BioscienceMetadata for Interoperable Bioscience
Metadata for Interoperable Bioscience
 
Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.
 
2021 04 Introduction to FAIRsharing - cineca
2021 04 Introduction to FAIRsharing - cineca2021 04 Introduction to FAIRsharing - cineca
2021 04 Introduction to FAIRsharing - cineca
 
FAIRsharing - focus on standards and new features
FAIRsharing - focus on standards and new features FAIRsharing - focus on standards and new features
FAIRsharing - focus on standards and new features
 
FAIRsharing: what we do for policies
FAIRsharing: what we do for policiesFAIRsharing: what we do for policies
FAIRsharing: what we do for policies
 
FAIR: standards and services
FAIR: standards and servicesFAIR: standards and services
FAIR: standards and services
 
FAIRsharing: how we assist with FAIRness
FAIRsharing: how we assist with FAIRnessFAIRsharing: how we assist with FAIRness
FAIRsharing: how we assist with FAIRness
 
FAIRsharing: curating an ecosystem of research standards and databases
FAIRsharing: curating an ecosystem of research standards and databasesFAIRsharing: curating an ecosystem of research standards and databases
FAIRsharing: curating an ecosystem of research standards and databases
 
OeRC_BioNatMedSciences_TeamOverview_Dec2013
OeRC_BioNatMedSciences_TeamOverview_Dec2013OeRC_BioNatMedSciences_TeamOverview_Dec2013
OeRC_BioNatMedSciences_TeamOverview_Dec2013
 
RDA BioSharing WG/ELIXIR Session Montreal 2017
RDA BioSharing WG/ELIXIR Session Montreal 2017RDA BioSharing WG/ELIXIR Session Montreal 2017
RDA BioSharing WG/ELIXIR Session Montreal 2017
 
NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
NPG Scientific Data Overview for GBIF - TDWG meeting Oct 2013
 

Similar a The international FAIR movement: thinkers, doers and dreamers

My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018Susanna-Assunta Sansone
 
How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)Carole Goble
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
 
Global Research Data Initiatives
Global Research Data InitiativesGlobal Research Data Initiatives
Global Research Data InitiativesSarah Jones
 
LIBER Webinar: Turning FAIR Data Into Reality
LIBER Webinar: Turning FAIR Data Into RealityLIBER Webinar: Turning FAIR Data Into Reality
LIBER Webinar: Turning FAIR Data Into RealityLIBER Europe
 
What it means to be FAIR
What it means to be FAIRWhat it means to be FAIR
What it means to be FAIRSarah Jones
 
#ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love #ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love Kristi Holmes
 
Linked Open Data_mlanet13
Linked Open Data_mlanet13Linked Open Data_mlanet13
Linked Open Data_mlanet13Kristi Holmes
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonAfrican Open Science Platform
 
Data management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.euData management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.euEUDAT
 
Open data-for-innovation-smart-and-sustainable
Open data-for-innovation-smart-and-sustainableOpen data-for-innovation-smart-and-sustainable
Open data-for-innovation-smart-and-sustainablegyleodhis
 
Open data for innovation, smart and sustainable prof muliaro
Open data for innovation, smart and sustainable prof muliaroOpen data for innovation, smart and sustainable prof muliaro
Open data for innovation, smart and sustainable prof muliarogyleodhis
 
The importance of FAIR and the Community of Data Driven Insights - the road t...
The importance of FAIR and the Community of Data Driven Insights - the road t...The importance of FAIR and the Community of Data Driven Insights - the road t...
The importance of FAIR and the Community of Data Driven Insights - the road t...Carlos Utrilla Guerrero
 
The FAIR Principles and the IMI FAIRplus project
The FAIR Principles and the IMI FAIRplus projectThe FAIR Principles and the IMI FAIRplus project
The FAIR Principles and the IMI FAIRplus projectSusanna-Assunta Sansone
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AlonePhilip Bourne
 
FAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDAFAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDASarah Jones
 
Turning FAIR data into reality
Turning FAIR data into realityTurning FAIR data into reality
Turning FAIR data into realitySarah Jones
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangePhilip Bourne
 

Similar a The international FAIR movement: thinkers, doers and dreamers (20)

My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018
 
How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)
 
FAIR and biopharma
FAIR and biopharmaFAIR and biopharma
FAIR and biopharma
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Global Research Data Initiatives
Global Research Data InitiativesGlobal Research Data Initiatives
Global Research Data Initiatives
 
LIBER Webinar: Turning FAIR Data Into Reality
LIBER Webinar: Turning FAIR Data Into RealityLIBER Webinar: Turning FAIR Data Into Reality
LIBER Webinar: Turning FAIR Data Into Reality
 
What it means to be FAIR
What it means to be FAIRWhat it means to be FAIR
What it means to be FAIR
 
#ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love #ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love
 
Linked Open Data_mlanet13
Linked Open Data_mlanet13Linked Open Data_mlanet13
Linked Open Data_mlanet13
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
 
Data management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.euData management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.eu
 
Open data-for-innovation-smart-and-sustainable
Open data-for-innovation-smart-and-sustainableOpen data-for-innovation-smart-and-sustainable
Open data-for-innovation-smart-and-sustainable
 
Open data for innovation, smart and sustainable prof muliaro
Open data for innovation, smart and sustainable prof muliaroOpen data for innovation, smart and sustainable prof muliaro
Open data for innovation, smart and sustainable prof muliaro
 
The importance of FAIR and the Community of Data Driven Insights - the road t...
The importance of FAIR and the Community of Data Driven Insights - the road t...The importance of FAIR and the Community of Data Driven Insights - the road t...
The importance of FAIR and the Community of Data Driven Insights - the road t...
 
The FAIR Principles and the IMI FAIRplus project
The FAIR Principles and the IMI FAIRplus projectThe FAIR Principles and the IMI FAIRplus project
The FAIR Principles and the IMI FAIRplus project
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not Alone
 
FAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDAFAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDA
 
Turning FAIR data into reality
Turning FAIR data into realityTurning FAIR data into reality
Turning FAIR data into reality
 
FAIR-4-GSC-Sansone-Aug23.pdf
FAIR-4-GSC-Sansone-Aug23.pdfFAIR-4-GSC-Sansone-Aug23.pdf
FAIR-4-GSC-Sansone-Aug23.pdf
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
 

Más de Susanna-Assunta Sansone

Más de Susanna-Assunta Sansone (12)

FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
FAIRsharing-Standards-4-GSC-Aug23.pdf
FAIRsharing-Standards-4-GSC-Aug23.pdfFAIRsharing-Standards-4-GSC-Aug23.pdf
FAIRsharing-Standards-4-GSC-Aug23.pdf
 
FAIRsharing & FAIRcookbook at RDA 2023
FAIRsharing & FAIRcookbook at RDA 2023FAIRsharing & FAIRcookbook at RDA 2023
FAIRsharing & FAIRcookbook at RDA 2023
 
NFDI Physical Sciences Colloquium - FAIR
NFDI Physical Sciences Colloquium - FAIRNFDI Physical Sciences Colloquium - FAIR
NFDI Physical Sciences Colloquium - FAIR
 
Metadata Standards
Metadata StandardsMetadata Standards
Metadata Standards
 
FAIRcookbook: GSRS22-Singapore
FAIRcookbook: GSRS22-SingaporeFAIRcookbook: GSRS22-Singapore
FAIRcookbook: GSRS22-Singapore
 
FAIR Cookbook
FAIR Cookbook FAIR Cookbook
FAIR Cookbook
 
FAIR, community standards and data FAIRification: components and recipes
FAIR, community standards and data FAIRification: components and recipesFAIR, community standards and data FAIRification: components and recipes
FAIR, community standards and data FAIRification: components and recipes
 
FAIRsharing and the FAIR Cookbook
FAIRsharing and the FAIR Cookbook FAIRsharing and the FAIR Cookbook
FAIRsharing and the FAIR Cookbook
 
FAIRsharing for EOSC
FAIRsharing for EOSC FAIRsharing for EOSC
FAIRsharing for EOSC
 
ELIXIR FAIR Activities - Examplars
ELIXIR FAIR Activities - ExamplarsELIXIR FAIR Activities - Examplars
ELIXIR FAIR Activities - Examplars
 
Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook
Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook
Open Science FAIR 2021: FAIRsharing and the FAIR Cookbook
 

Último

Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 

Último (20)

Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 

The international FAIR movement: thinkers, doers and dreamers

  • 1. The international FAIR movement Behind the brand: thinkers, doers and dreamers Susanna-Assunta Sansone ORCiD: 0000-0001-5306-5690 | Twitter: @SusannaASansone Oxford Open Access Week, 10 March 2020 Slides: https://www.slideshare.net/SusannaSansone datareadiness.eng.ox.ac.uk Associate Professor, Engineering Science Associate Director, Oxford e-Research Centre
  • 2. A set of principles to enhance the value of all digital resources 2014 2016 Developed and endorsed by researchers, service providers, publishers, funding agencies, industry partners
  • 3. And the FAIR brand is born
  • 4. Findable Accessible Interoperable Reusable • Globally unique, resolvable, and persistent identifiers ▪ To retrieve and connect data • Community defined descriptive metadata ▪ To enhance discoverability • Common terminologies ▪ To use the same term mean the same thing • Detailed provenance ▪ To contextualize the data and facilitate reproducibility • Terms of access ▪ Open as possible, closed as necessary • Terms of use ▪ Clear licences, ideally to enable innovation and reuse Data for humans and for machines
  • 5. Everybody needs data that are • Discoverable by humans and machines • Retrievable and structured in standard format(s) • Self-described so that third parties can make sense of it Better data = better science and more efficiently Datasets SOPs Figures, Photos Workflows Slides Codes Tools DatabasesAlgorithmsDocument
  • 6. FAIR mythology summarized Just principles not practice Slide credit: Carole Goble
  • 7. “Accessible, assessable, intelligible, reusable…anyone, anything, anytime…” https://royalsociety.org/topics-policy/projects/science-public-enterprise/report/?gclid=EAIaIQobChMI5vSav9qP6AIVzLTtCh0J7wE4EAAYASAAEgIPx_D_BwE FAIR is nothing new…
  • 9. doi.org/10.2777/02999doi.org/10.2777/1524 www.gov.uk/government/publications/open- research-data-task-force-final-report doi.org/10.5281/zenodo.1245568 www.ukri.org/research/infrastructure …but it has aligned the broad community around a common guidelines; some examples “…we estimate to be 10.2 billion euros per year…” doi.org/10.7486/DRI.tq582c863
  • 10. The scholarly publishing ecosystem is changing Data-relates mandates by funders and institutions are growing Researchers must be responsibile and accountable for their research, but they also need recognition and credit theconversation.com/how-robots-can-help-us-embrace-a-more-human-view-of-disability-76815 Human-machine collaboration is the future, and AI-ready data is essential Responding to needs and crisis o 21% pharmacology data (doi.org/10.1038/nrd3439-c1) o 11% cancer data (doi.org/10.1038/483531a) o unsatisfactory in ML (openreview.net/pdf?id=By4l2PbQ-) towardsdatascience.com/scientific-data-analysis-pipelines-and-reproducibility-75ff9df5b4c5 Reproducibility of published studies is still problematic
  • 11. We research and develop methods and tools to improve data reuse; we work for data transparency, research integrity and the evolution of scholarly publishing Recognition and credit for ‘building, training and driving changes’
  • 12. www.gov.uk/government/publications/open-research-data-task-force-final-report Prof. Pam Thomas, Open Research Data Task Force Chair https://twitter.com/SusannaASansone/status/1227885895426154497/photo/1 • Dialogue, engagement and commitment among stakeholders • Infrastructure and services as integral part of the research data cycle • Cooperation with activities outside UK, also with industries and business including SMEs • Keep under continual review 2018
  • 14. Depends upon several stakeholders in the research ecosystem actively playing their parts to: • deliver research infrastructures, tools and standards, policies, education and training • address technical, social and cultural challenges It is not simple and it requires long term investment Making FAIR a reality
  • 16. “Most metadata field names and their values are not standardized or controlled” “Even simple binary or numeric fields are often populated with inadequate values of different data types” Better metadata for better data https://doi.org/10.1038/sdata.2019.21
  • 17. The importance of knowledge representation and interoperability The ability of computer systems or software to understand information with sufficient accuracy to utilise (e.g. represent and exchange) that information for an intelligent purpose Formats Terminologies Guidelines Identifiers ID Conceptual model, conceptual schema, exchange formats Controlled vocabularies, thesauri, ontologies Minimum information reporting requirements, checklists Unambiguous, persistent and context- independent identifier schema metadata
  • 18. Findable Accessible Interoperable 20% identifiers 80% metadata Reusable FAIR is not one size fits all FAIR is not about harmonizing all metadata, or publishing all in RDF
  • 20. The road to FAIRness Before FAIR http://blogs.nature.com/scientificdata/2019/10/22/the-layered-cake
  • 21. The road to FAIRness Before FAIR After FAIR http://blogs.nature.com/scientificdata/2019/10/22/the-layered-cake
  • 22. The road to FAIRness Before FAIR After FAIR ….from chaos, comes order?
  • 23. The EOSC layered cake of FAIR http://blogs.nature.com/scientificdata/2019/10/22/the-layered-cake https://www.eoscsecretariat.eu And a growing number of FAIR-related EOSC-funded projects in all disciplines; for example:
  • 24. Growing number of reports, surveys and recommendations http://blogs.nature.com/scientificdata/2019/10/22/the-layered-cake https://www.eoscsecretariat.eu/events/webinar-coming-together-map-national-landscape-analysis-eosc-between-5-relevant-projects
  • 25. My fair share of the work
  • 26. €3.3 billion programme 2014 - 2020 €300 million programme 2018 - 2020 European intergovernmental organisation 23 member countries and over 180 research organisations Since 2014 1 2 3 Started in 2019 FAIR-enabling EU and USA biomedical infrastructure programmes and projects, e.g. Since in 2014, several programs: 2014-2017 2017-2018
  • 27. Organization and structure • Hub and (national) Nodes • Community-driven and rooted • Strong focus on interoperability • SMEs and Industry links • Cross-nodes funded activities
  • 28. model and related formats Initiated in 2003 Open source tools and formats to help researchers to: describe multi-modal experiments follow community-developed standards curate, analyze, release, share and publish
  • 29. Nowadays ISA (format and/or tools) powers over 30 public resources, e.g.,
  • 30.
  • 31. Funded by Part of the ISA-InterMine project Reproducibility – FAIR at the first mile From curated, structured metadata to data paper datascriptor.org
  • 32. Academics from several ELIXIR Nodes, with Janssen, AstraZeneca, Eli Lilly, GSK, Novartis, Bayer, Boehringer Ingelheim Define, document and implement a data FAIRification process:
  • 33. Rocca-Serra and Sansone. Experiment design driven FAIRification of omics data matrices, an exemplar Scientific Data 2019. https://doi.org/10.1038/s41597-019-0286-0 Practical examples: data FAIRifications recipe https://fairplus.github.io/the-fair-cookbook
  • 34. https://www.turing.ac.uk/research/impact-stories/changing-culture-data-science Based on the successful example of the ATI’s Turing Way book https://fairplus.github.io/the-fair-cookbook Participatory and open
  • 36. Implementation of FAIR by a company Requirements • Financial investment for technical infrastructure, training & education • Make use of existing FAIR tools & best practices • Show business value through transformation to digital drug discovery A three-pronged approach 1. Build a data catalogue for Findability of all data assets in the company Tip: Prioritise for datasets which will quickly demonstrate benefits 2. Bring data together into a virtual, federated, infrastructure, so that data with the right credentials become instantaneously Accessible for human and machine Tip: Build on previous efforts to move to cloud services 3. Make selected datasets Interoperable and Reusable on the metadata as well at the individual data point level Tip: Most costly, so important to determine likely return on business value
  • 37. 390+ 162+ 729+ ~1300 13 MIAME MIRIAM MIQAS MIX MIGEN ARRIVE MIAPE MIASE MIQE MISFISHIE …. REMARK CONSORT SRAxml SOFT FASTA DICOM MzML SBRML SEDML … GELML ISA CML MITAB … AAO CHEBIOBI PATO ENVO MOD BTO IDO … TEDDY PRO XAO DO VO EC number URL PURLLSID HandleORCID RRID InChI … IVOA ID DOI standard organizations grass-roots groups Formats Terminologies Guidelines Identifiers ID COMMUNITY STANDARDS for metadata and identifiers Domain-specific standards for datasets, e.g.
  • 38. https://doi.org/10.6084/m9.figshare.3795816.v2 https://doi.org/10.6084/m9.figshare.4055496.v1 Analysis of the standards landscape in the life and biomedical sciences Fragmentation, duplication and gaps: • Perspective and focus vary: • Motivation is diverse • Governance and participation vary 2013 2016
  • 39. doi: 10.1126/science.1180598 doi:10.1038/nbt1346 OBO Portal and Foundry Portal and Foundrydoi: 10.1038/nbt.1411 Over 10+ years of community engagement 69 authors: adopters, collaborators and users Open Access CC-BY 2007 20082009 2011 2019 doi.org/10.1038/s41587-019-0080-8
  • 41. Formats Terminologies Guidelines Identifiers ID REPOSITORIES, databases and knowledgebases DATA POLICIES by journals, funders, and other organizations COMMUNITY STANDARDS for metadata and identifiers informative and educational resource Curated inter-linked descriptions
  • 42. Formats Terminologies Guidelines Identifiers ID informative and educational resource Curated inter-linked descriptions All records are manually curated in-house, verified and claimed by the community behind each resource Ready for use, implementation, or recommendation In development Status uncertain Deprecated as subsumed or superseded REPOSITORIES, databases and knowledgebases DATA POLICIES by journals, funders, and other organizations COMMUNITY STANDARDS for metadata and identifiers
  • 43. Formats Terminologies Guidelines Identifiers ID REPOSITORIES, databases and knowledgebases DATA POLICIES by journals, funders, and other organizations COMMUNITY STANDARDS for metadata and identifiers informative and educational resource Curated inter-linked descriptions We guide consumers to discover, select and use these resources with confidence We help producers to make their resources more visible, more widely adopted and cited
  • 44. Tracking evolution of (meta)data standards
  • 45. Describing access and use of databases and repositories
  • 46. Describing the standards these databases and repositories implement
  • 47. Tracking which databases and standards are recommended by (publishers) data policies
  • 48. Maintaining these descriptions by working with the owners of these resources, citing and crediting them
  • 49.
  • 50. “The interactive browser will allow us to discover which databases and standards are not currently included in our author guidelines, enabling us to regularly monitor and refine our policies as appropriate, in support of our mission to help our authors enhance the reproducibility of their work.” H. Murray. Publishing Editor, F1000Research
  • 51. Researchers in academia, industry, government Developers and curators of resources Journal publishers or organizations with data policy Research data facilitators, librarians, trainers Learned societies, unions and associations Funders and data policy makers A flagship output of and a WG in: Recommended by funders, e.g.:
  • 53. Analysed the data policies by journals/publishers, and the standards and repositories they recommend Working with journal editors and publishers Participating publishers:
  • 54. Discrepancy in recommendation across the data policies • some repositories are named, but very few standards are • cautious approach due to the wealth of existing resources Recommendations are often driven by • the editor’s familiarity with one or more standards, notably for journals or publishers focusing on specific disciplines • the engagement with learned societies and researchers actively supporting and using certain resources Ø Consensus: FAIRsharing plays a key role in helping editors to discover and recommend appropriate resources What have we learned and are doing now Participating publishers:
  • 55. Working on a set of criteria that journals and publishers believe are important for the identification and selection of data repositories, which can be recommended to researchers when they are preparing to publish the data underlying their findings Participating publishers: Data Repository Selection: Criteria That Matter Pre-print: https://osf.io/m2bce Collaboration:
  • 56. Participating publishers: Data Repository Selection: Criteria That Matter Pre-print: https://osf.io/m2bce Harmonize journals and publishers’ data deposition guidelines by defining a common set of criteria for repository selection Criteria include: • Access conditions • Reuse condition • Deposition conditions • Unique ID schema • User support • Curation • ……. Collaboration:
  • 57. Increase the number and the clarity of journals and funders data policies by classifying the recommendations these policies contain to improve their definition and guidance to researchers Collaboration: Workplan – phase 1: Curate and assess their compliance to the Transparency and Openness Promotion (TOP) guidelines and display the level in FAIRsharing
  • 59. A growing numbers of metrics, indicators, models and certifications for FAIRness and trustworthiness
  • 60. https://fairassist.org Keeping track of these manual, semi-manual or automatic tools the community is developing at:
  • 61. infrastructures standards tools policies education training cultural normalization incentives long term investment A FAIRy tale needs some magic… It is not simple, but it is no longer optional
  • 62. and our collaborators datareadiness.eng.ox.ac.uk FAIR synchrotron data project DPhil student starting Oct 2020