FAIRy stories: the FAIR Data
principles in theory and in practice
Carole Goble
The University of Manchester, UK
carole.goble@manchester.ac.uk
The views expressed in this talk are my own
NSF Convergence Accelerator Series Tracks A&B webinar, 19th May 2021
March 18, 2021
http://spatial.ucsb.edu/2021/Natasha-Noy
Why do we need FAIR data in Research?
“there must be loads of legacy data. We’re desperately trying to go
back and look at what we knew from SARS 10 years ago”
https://www.covid19dataportal.org/
https://www.rd-alliance.org/group/rda-covid19-rda-covid19-omics-rda-covid19-epidemiology-rda-covid19-clinical-rda-covid19-1
https://doi.org/10.15497/rda00052
Why do we need FAIR data in Research?
COVID Data sharing boost – mobilising people, infrastructure & initiatives
Spotlighted technical, territorial & practices
Provider: collection, upload and governance bottlenecks
User: find and access to datasets, licenses, data and metadata quality
Access to data for processing at scale, common standards
Behaviour inertia and relapse
Long term sustainability
“global pandemic is not sufficient to radically modify
scientific practices”*
* Larregue et al https://blogs.lse.ac.uk/impactofsocialsciences/2020/11/30/covid-19-where-is-the-data/
https://www.nature.com/articles/d41586-021-00305-7
https://www.nature.com/articles/s41597-020-0524-5
Why do we need FAIR data in Research?
information flows, secondary use
Figure: KnowledgeTurning, Information Flow Josh Sommer, Chordoma Foundation, 2011
Community domain enclaves
Resource fragmentation
Flow across platforms/ sovereignties
Pan-discipline drivers
Knowledge churn, loss and cost
2016
A set of GUIDING PRINCIPLES to
enhance the value of all digital
resources and their reuse by PEOPLE
and by MACHINES
ALIGNING a COMMUNITY around
common data guidelines
FAIR Research Data
branding a trend
(re)-stimulating a
movement
What ARE the FAIR principles?
Aspirational guardrails
Not a standard, nor metrics
A contract between data
provider and user
In the original paper
https://www.go-fair.org/fair-principles/
Relaunch a dialogue - research and policy communities.
Reboot a journey - wider accessibility and reusability of data.
compare &
combine data
https://doi.org/10.1038/sdata.2016.18
“enhancing the ability of machines to
automatically find and use data or any digital
object, and support its reuse by individuals”
INCF Statement
Persistent identifiers
Globally unique, resolvable for
data and always for metadata
Structured metadata
Community defined descriptive
metadata using common
terminologies and standards
Linked Data
Vocabularies are FAIR, (meta)data
reference (meta)data, provenance
Automation-readiness
Access protocols
Open, free and universally
implementable comms protocols
Semantic Web ->
Linked Data ->
Knowledge Graphs.
Machine-processable
metadata.
[Icons: FAIRsharing]
Open as possible, Closed as necessary
Clear licences for innovation and reuse
Sensitive data, GDPR, IPR, jumpy Deans.
Crossing sovereignty boundaries
• Data sharing becomes data visiting &
federated analysis
An industry in controlled secure access….
• Data Usage Ontology, Beacon Passports,
Trusted Research Environments etc….
Terms of access and use: FAIR ≠ OPEN
FAIR OPEN
SAFE
Privacy preservation
Regulatory rigour
FAIR Implicit Assumptions & Implications
Data are first class objects
Primarily aimed at data creators
and providers for benefit of
consumers.
Operating in an (Open) Data
Ecosystem.
Adoption at scale in legacy
settings.
Data sharing
The Life Sciences & pan-European scale data infrastructure
The Life Sciences Infrastructure Zoo
Flows around a Federated & Diverse System
1466 data repositories
(100+ in EOSC-Life)
916 data format and metadata
standards*
from compounds to clinical trials
https://fairsharing.org/ accessed May 2021
Common standards & agreements
mappings of PIDs and metadata
moving metadata around
accountability and responsibility
FAIR players simplified
Researchers and
company
scientists who
generate and use
the data
Service providers
who manage data
and infrastructure
Local -> Global level
Public -> Commercial
Authorities who
drive policy, practice
& resources
Funders, Policy makers,
Publishers, Professional
societies, Standards
organisations, Institutions
Global and national initiatives
Dedicated projects
Community Orgs
Funders
Policy
Publishers
FAIR
first
stage
Dedicated Services
Where we are going
Where we are
[Susanna Sansone]
FAIR
first
stage
FAIR first stage :
Policymakers, Data service providers
How to define, measure compliance and certify FAIR data?
What is a dataset?
General repos vs Curated authoritative archives?
Principles for Data Repositories
https://www.rd-alliance.org/trust-principles-rda-community-effort
https://fairassist.org/
https://www.natureindex.com/news-blog/what-scientists-need-to-know-about-fair-data
Open Data Survey, 2019
81% of researcher
respondents
unfamiliar with FAIR
1. A common mechanism for metadata
Respect and work with the huge legacy
resources: repositories, registries, tools …
community standards
Find, register, index, search resources
Move metadata between services
without APIs
Repositories -> Tools, Aggregators (e.g. licenses)
-> Registries (upload, auto-curation)
Registries -> Registries (across disciplines)
Contribute to Knowledge Graphs
a little bit of semantics at scale
semantic underware
invisible to users
visible to developers & services
Picture: Carole Goble, Turing Lecture 2018
Schema.org: Semantic Mark up for the Web
Cartel of commercial search engines
Wide web use, web infrastructure
Web pages and sitemaps
Types (830+) IceCreamShop
Properties (1300+) hasMenu
Not targeted at science - too much / too little
Dataset type – 120 properties
(Google Data Profile requires 2 properties)
No type for Protein, Gene, Taxon
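As a rough illustration of the kind of mark-up discussed here, the sketch below builds a minimal Schema.org Dataset description as JSON-LD and wraps it in the script element used to embed it in a web page. The values (name, URL, licence) are placeholders invented for the example, and the chosen properties are an assumption, not a statement of what any search engine requires.

```python
import json


def dataset_markup(name, description, url, license_url):
    """Return a Schema.org Dataset description as a JSON-LD dictionary."""
    return {
        "@context": "https://schema.org",
        "@type": "Dataset",
        "name": name,
        "description": description,
        "url": url,
        "license": license_url,
    }


def as_script_tag(markup):
    """Wrap JSON-LD in the script element that embeds it in an HTML page."""
    body = json.dumps(markup, indent=2)
    return '<script type="application/ld+json">\n' + body + "\n</script>"


# Placeholder values, for illustration only.
example = dataset_markup(
    name="Example expression dataset",
    description="A made-up dataset used to illustrate Dataset mark-up.",
    url="https://example.org/datasets/1",
    license_url="https://creativecommons.org/licenses/by/4.0/",
)
print(as_script_tag(example))
```

The mark-up sits inside the page itself, which is what keeps it invisible to users while remaining visible to developers and services.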
Harnessing Schema.org for Bioscience
Profile
Data model
Marginality information
Controlled vocabularies
Cardinality
Documentation
Examples
New (properties | types)
definition & consensus
deployment and use
tools & support
Opinionated conventions
Profiles & Link to domain ontologies
Add Bioscience properties & types if necessary
Examples & Usage Guidelines
Community
Harnessing Schema.org for Bioscience
ChemicalSubstance
definition & consensus
deployment and use
tools & support
Opinionated conventions
Profiles & Link to domain ontologies
Add Bioscience properties & types if necessary
Examples & Usage Guidelines
Community
Bioschemas metadata stratification
broad & shallow / deepish & narrowish
Generic
Subject
specific
MolecularEntity,
Protein,
Sample, Taxon,
ChemicalSubstance…
DataCatalog
Dataset
dataset 5 minimum, 8
recommended properties
license & provenance
https://bioschemas.org/profiles/
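A profile can be read as a checklist layered over a Schema.org type. The sketch below checks a piece of mark-up against a profile with "minimum" and "recommended" property levels. The property lists are hypothetical placeholders chosen for the example; they are not the actual Bioschemas Dataset profile, which should be taken from the profiles page above.

```python
# Hypothetical profile: the property names below are placeholders,
# not the real Bioschemas Dataset profile.
EXAMPLE_PROFILE = {
    "minimum": ["name", "description", "identifier", "license", "url"],
    "recommended": ["creator", "keywords", "version"],
}


def missing_properties(markup, profile):
    """List, per marginality level, the properties absent or empty in markup."""
    report = {}
    for level, properties in profile.items():
        report[level] = [
            prop
            for prop in properties
            if prop not in markup or markup[prop] in ("", None, [])
        ]
    return report


# Placeholder mark-up with no licence and no recommended properties except keywords.
markup = {
    "@type": "Dataset",
    "name": "Example dataset",
    "description": "Made-up record for illustration.",
    "identifier": "https://example.org/datasets/1",
    "url": "https://example.org/datasets/1",
    "keywords": ["example"],
}
report = missing_properties(markup, EXAMPLE_PROFILE)
print(report)
```

A report like this supports decisions about what to fix first, rather than a pass or fail verdict.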
Crosswalks to metadata schemas *
• DCAT, DataCite, CrossRef, OpenAIRE, DDI
• DCT:issued <-> Schema:datePublished
What is a dataset?
Include community ontologies
• Type: ChemicalSubstance
• Property: biologicalRole
• ExpectedType: ChEBI ontology
* https://zenodo.org/record/4420116#.YKFOpaHTX18
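A crosswalk of this kind can be sketched as a simple term-to-term mapping. In the example below only the dct:issued pairing (with the Schema.org property datePublished) comes from the slide; the other pairings and the sample record are assumptions added for illustration. Terms with no mapping are returned separately rather than silently dropped, so that nothing is lost when moving metadata between services.

```python
CROSSWALK = {
    "dct:issued": "schema:datePublished",     # pairing from the slide
    "dct:title": "schema:name",               # illustrative assumption
    "dct:description": "schema:description",  # illustrative assumption
    "dct:license": "schema:license",          # illustrative assumption
}


def apply_crosswalk(record, mapping=CROSSWALK):
    """Translate record keys via mapping; return (mapped, unmapped) dicts."""
    mapped, unmapped = {}, {}
    for term, value in record.items():
        if term in mapping:
            mapped[mapping[term]] = value
        else:
            unmapped[term] = value
    return mapped, unmapped


# Placeholder record, for illustration only.
source = {
    "dct:title": "Example dataset",
    "dct:issued": "2021-05-19",
    "dct:spatial": "Manchester",  # no mapping defined in this sketch
}
mapped, unmapped = apply_crosswalk(source)
print(mapped)
print(unmapped)
```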
400+
People
22
Types
32
Profiles
65
Sites
60M+
Pages
bioschemas.org/liveDeploys
20+
Countries
120
Profile deployments
Bioschemas Village
MolecularEntity ChemicalSubstance
Toxicology
Data Aggregator
[with thanks: Egon Willighagen]
MolecularEntity
Gene
Protein
Taxon
Dataset
Lessons: Putting FAIR into Practice
A little bit of semantics at scale -> build critical mass
Profiles
• Schema.org culture – Catch 22
• Consensus building, retention & Ontology-itis
Provider mark-up
• Developer friendly in house tools & wacky web implementations
• Adoption incentives & costs of adapting database processes
Consumer services
• Adoption incentives – Catch 22 & tipping points
• DataCatalog & Dataset popular -> Google Dataset search
Consumer-provider readiness
• Tools and training community take-up….
2. Packaging Research Objects
Gather together into a “crate” files,
unbounded references, & other
crates.
FAIR content: metadata,
identifiers, provenance, citation
about the content
FAIR crates: metadata, PIDs,
provenance, citation about the
crate.
more FAIR middleware -> towards FAIR Digital Objects*
*FAIR Digital Objects for Science: From Data Pieces to Actionable Knowledge Units:
https://doi.org/10.3390/publications8020021
Why “crate up” objects? FAIR+R
Flows:
Researchers work with multiple and
different objects using multiple
infrastructures over periods of time
exchange between platforms and people
Parts:
Research has associated objects
linked together by context
metadata files with files
datasets, scripts, SOPs, articles …
held in different places
made at different times by
different people & processes
publish, report, reuse, cite, reproduce
register, deposit, archive, port
point to big, sensitive & active content
Aggregate files and/or any URI-addressable
content with structured metadata
Web and Linked Data Native
machine and human readable PIDs + JSON-LD +
Schema.org, search engine & developer friendly
Flex for open ended content, respect legacy
typed by a profile + add more schema.org and
domain ontologies
http://www.researchobject.org/ro-crate/
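To make the packaging idea concrete, the sketch below assembles the JSON-LD metadata for a small crate: a metadata descriptor, a root dataset, and entries for one local file and one URI-addressable resource. The layout follows the RO-Crate 1.1 conventions as understood here and should be checked against the specification; all names, dates and URLs are placeholders.

```python
import json

CONTEXT = "https://w3id.org/ro/crate/1.1/context"
SPEC = "https://w3id.org/ro/crate/1.1"


def make_crate_metadata(name, description, date_published, license_url, parts):
    """Build crate metadata as a dict.

    parts maps an @id (relative path or absolute URL) to a readable name.
    """
    descriptor = {
        "@id": "ro-crate-metadata.json",
        "@type": "CreativeWork",
        "conformsTo": {"@id": SPEC},
        "about": {"@id": "./"},
    }
    root = {
        "@id": "./",
        "@type": "Dataset",
        "name": name,
        "description": description,
        "datePublished": date_published,
        "license": {"@id": license_url},
        "hasPart": [{"@id": part_id} for part_id in parts],
    }
    entities = [
        {"@id": part_id, "@type": "File", "name": part_name}
        for part_id, part_name in parts.items()
    ]
    return {"@context": CONTEXT, "@graph": [descriptor, root] + entities}


# Placeholder content: one packaged file and one reference to remote content.
crate = make_crate_metadata(
    name="Example crate",
    description="A made-up crate for illustration.",
    date_published="2021-05-19",
    license_url="https://creativecommons.org/licenses/by/4.0/",
    parts={
        "data/measurements.csv": "Measurements table",
        "https://example.org/big-dataset": "Large remote dataset",
    },
)
print(json.dumps(crate, indent=2))
```

Referencing the remote dataset by URL rather than copying it is what lets a crate point to big, sensitive or active content.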
Archive file
format
FAIR Object Middleware
FAIR Middleware
metadata carrying interchange format
Knowledge
Graph of
Research
Objects
It’s FAIR metadata middleware, stupid
• smart use of wheels already invented
• get tools, services on board
• developer friendly, firm best practice
Known and Unknown unknowns
One size does not fit all
• contextual interpretation
• descriptive open-endedness, multi-interpretation
Analogous to FAIR Software
• RDA/ReSA FAIR for Research Software (FAIR4RS) WG
Lessons: Putting FAIR into Practice
3. Making (legacy) datasets FAIR: FAIRification
[Picture credit: Egon Willighagen]
Credit to: Ian Harrow, FAIR & OM projects
FAIR as enabler for the digital transformation
● Biopharma R&D productivity can be
improved by implementing the FAIR Data
Principles.
● FAIR enables powerful new AI analytics access
to data for machine learning and prediction
● Fairly AI Ready
● Challenges
○ change the culture, show business value,
achieve the ‘FAIR enough’
○ Sustain FAIR solutions and activities
Slide credit: Susanna Sansone
Making (legacy) datasets FAIR: FAIRification
> 100 Public-Private partnerships of
European Commission, universities SMEs
and Big Pharma translational projects
Pharma’s own datasets
*https://www.go-fair.org/how-to-go-fair/fair-data-point/
Data visiting through a
FAIR Data Point*
Linked Data / RDF tech
Dataset transformation
Methodology
Linkset services
RDF Warehouse (Knowledge Graph)
- API not SPARQL
- Sustainability & maintenance
- Linksets PID mapping services
FAIRification of legacy datasets
Practical
advice
Assessment
processes
FAIR levels of
projects / data
Selection of
datasets
Cost/Benefit
analysis
Methodology
Steps for 1 or
more datasets
Cultural change
Legal templates
Squads & BYODs
Maturity models
Interlinking data from different sources
The lessons of good
global and persistent
identifiers.
Mapping identifiers
and services for
mapping ids to ids and
concepts to concepts.
https://fairplus.github.io/the-fair-cookbook/content/recipes/interoperability/identifier-mapping.html
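A minimal sketch of an identifier mapping service, assuming a linkset is just a list of "same as" pairs between identifiers from different sources. The prefixes and identifiers (dbA, dbB, dbC) are hypothetical; a real service would also need to deal with link provenance and with links that are not strict equivalences.

```python
from collections import defaultdict


def build_index(linkset):
    """Group identifiers joined by 'same as' links into equivalence sets."""
    parent = {}

    def find(ident):
        parent.setdefault(ident, ident)
        while parent[ident] != ident:
            parent[ident] = parent[parent[ident]]  # path halving
            ident = parent[ident]
        return ident

    for left, right in linkset:
        parent[find(left)] = find(right)

    groups = defaultdict(set)
    for ident in list(parent):
        groups[find(ident)].add(ident)
    return {ident: members for members in groups.values() for ident in members}


def translate(ident, prefix, index):
    """Return the equivalents of ident within the namespace prefix, sorted."""
    candidates = index.get(ident, {ident})
    return sorted(i for i in candidates if i.startswith(prefix + ":"))


# Hypothetical linkset across three made-up sources.
LINKS = [
    ("dbA:0001", "dbB:X17"),
    ("dbB:X17", "dbC:q9"),
    ("dbA:0002", "dbC:q3"),
]
index = build_index(LINKS)
print(translate("dbA:0001", "dbC", index))
print(translate("dbA:9999", "dbC", index))
```

Because links are followed transitively, dbA:0001 reaches dbC:q9 through dbB:X17 even though no direct link between them was supplied.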
FAIR by Design
At the start of a collection, built in throughout the life cycle
change management, capacity building
FAIRifying Retrospectively
Legacy datasets, build a cohort,
cost benefit and FAIR readiness over a collection of datasets
Reality
FA(I)R
New FAIR Variants
FAIR++
Legal > Organisational >
Semantic > Technical*
Business and change analysis.
Cost Benefit Analysis.
Scientific / Business Value
Sustainability
“…make a decision that
these data are valuable
enough to invest in the work
required for FAIRification.”
interoperability
*EOSC Interoperability Framework
What does FAIRifying a dataset mean?
A database? A PDF? Depositing to a public archive?
Identifier and ontology selecting, assigning,
mapping between and to existing vocabs, and knowing
about ontology services.
High-fidelity ETL loss-less moving (meta)data
from one system to another
Lessons: Putting FAIR into Practice
Lessons: Putting FAIR into Practice
FAIR enough.
Repository manager
Admin monitoring
Bioscientist
Scientific analysis
“Fairness does not mean everyone
gets the same. Fairness means
everyone gets what they need”
(Rick Riordan).
Maturity and importance spectrum
It's not all worth it.
FAIR gardens + FAIR scrub
How to assess FAIR maturity
levels, not to be certified but
to make decisions.
FAIR ≠ FREE - an expensive, expert team sport
Mostly manual,
mostly specific
“It is a truth
universally
acknowledged
that a
Knowledge
Graph must be
in want of FAIR
data.
And FAIR data
is in want of
Knowledge
Graphs.”
harvesting
added value
DataCite PID Graph
Bottlenecks:
identifiers and ontologies
curating and ingest pipelines of data providers
4. FAIR Data by Design at Source
Data management platform for Project Hubs
organising, cataloguing, sharing and publishing
multiple kinds of research objects in multiple
repositories for multi-partner projects.
Community developed Knowledge Hub
for guides, examples, tools, and pointers.
Assembled and written by Life Science
researchers and data stewards for their peers.
https://rdmkit.elixir-europe.org
https://fair-dom.org
Lessons: Putting FAIR into Practice
Data creators
• Retention not sharing, act local not global
• Advantage*: intimate knowledge, data
flirting, credits & incentives
Process change and values
• Access to infrastructure with seamless
information flows, Values
• Time & resources to embed into practice
FAIR Stewardship skills
• Professionalisation & know-how
*Pasquetto, I. V., Borgman, C. L., & Wofford, M. F. (2019). Uses and Reuses of Scientific Data: The Data Creators’
Advantage. Harvard Data Science Review, 1(2). https://doi.org/10.1162/99608f92.fc14bf2d
Summary: FAIRy stories
Theory -> mobilised some
Practice -> marathon that takes a village
Move the story from data providers to
enabling creators & consumers prepare to
share FAIR -> Research on Research
Authorities Change Mgt
Stewardship
Service Providers
Sustained infrastructure
Acknowledgements
Special thanks to
• Stian Soiland-Reyes (Uni of Manchester/Uni of Amsterdam)
• Nick Juty & Ebtisam Alharbi (University of Manchester)
• Susanna Sansone (University of Oxford)
• Tony Burdett (EMBL-EBI)
• Ibrahim Emam (Imperial College)
• Egon Willighagen (Maastricht University)
• Alasdair Gray (Heriot-Watt University)
Manchester, Research Object, RDMkit, FAIRDOM, FAIRplus, Bioschemas colleagues
(about 130 people)
Icons from the noun project
(https://thenounproject.com/)

FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
Velocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.pptVelocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.ppt
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Introduction of DNA analysis in Forensic's .pptx
Introduction of DNA analysis in Forensic's .pptxIntroduction of DNA analysis in Forensic's .pptx
Introduction of DNA analysis in Forensic's .pptx
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 

FAIRy stories: the FAIR Data principles in theory and in practice

  • 1. FAIRy stories: the FAIR Data principles in theory and in practice Carole Goble The University of Manchester, UK carole.goble@manchester.ac.uk The views expressed in this talk are my own NSF Convergence Accelerator Series Tracks A&B webinar, 19th May 2021
  • 3. Why do we need FAIR data in Research? “there must be loads of legacy data. We’re desperately trying to go back and look at what we knew from SARS 10 years ago” https://www.covid19dataportal.org/ https://www.rd-alliance.org/group/rda-covid19-rda-covid19-omics-rda-covid19-epidemiology-rda-covid19-clinical-rda-covid19-1 https://doi.org/10.15497/rda00052
  • 4. Why do we need FAIR data in Research? COVID Data sharing boost – mobilising people, infrastructure & initiatives Spotlighted technical, territorial & practices Provider: collection, upload and governance bottlenecks User: find and access to datasets, licenses, data and metadata quality Access to data for processing at scale, common standards Behaviour inertia and relapse Long term sustainability “global pandemic is not sufficient to radically modify scientific practices”* * Larregue et al https://blogs.lse.ac.uk/impactofsocialsciences/2020/11/30/covid-19-where-is-the-data/
  • 6. Why do we need FAIR data in Research? information flows, secondary use Figure: KnowledgeTurning, Information Flow Josh Sommer, Chordoma Foundation, 2011 Community domain enclaves Resource fragmentation Flow across platforms/ sovereignties Pan-discipline drivers Knowledge churn, loss and cost
  • 7. 2016 A set of GUIDING PRINCIPLES to enhance the value of all digital resources and their reuse by PEOPLE and by MACHINES ALIGNING a COMMUNITY around common data guidelines FAIR Research Data
  • 9. What ARE the FAIR principles? Aspirational guardrails Not a standard, nor metrics A contract between data provider and user In the original paper https://www.go-fair.org/fair-principles/ Relaunch a dialogue - research and policy communities. Reboot a journey - wider accessibility and reusability of data.
  • 11. “enhancing the ability of machines to automatically find and use data or any digital object, and support its reuse by individuals” INCF Statement
  • 12. Persistent identifiers: globally unique, resolvable for data and always for metadata. Structured metadata: community-defined descriptive metadata using common terminologies and standards. Linked Data: vocabularies are FAIR, (meta)data reference (meta)data, provenance. Automation-readiness. Access protocols: open, free and universally implementable comms protocols. Semantic Web -> Linked Data -> Knowledge Graphs. Machine-processable metadata. [Icons: FAIRsharing]
  • 13. Open as possible, Closed as necessary Clear licences for innovation and reuse Sensitive data, GDPR, IPR, jumpy Deans. Crossing sovereignty boundaries • Data sharing becomes data visiting & federated analysis An industry in controlled secure access…. • Data Usage Ontology, Beacon Passports, Trusted Research Environments etc…. Terms of access and use: FAIR ≠ OPEN FAIR OPEN SAFE Privacy preservation Regulatory rigour
  • 14. FAIR Implicit Assumptions & Implications Data are first class objects Primarily aimed at data creators and providers for benefit of consumers. Operating in an (Open) Data Ecosystem. Adoption at scale in legacy settings. Data sharing
  • 15. The Life Sciences & pan-European scale data infrastructure
  • 16. The Life Sciences Infrastructure Zoo Flows around a Federated & Diverse System 1466 data repositories (100+ in EOSC-Life) 916 data format and metadata standards* from compounds to clinical trials https://fairsharing.org/ accessed May 2021 Common standards & agreements mappings of PIDs and metadata moving metadata around accountability and responsibility
  • 17. FAIR players simplified Researchers and company scientists who generate and use the data Service providers who manage data and infrastructure Local -> Global level Public -> Commercial Authorities who drive policy, practice & resources Funders, Policy makers, Publishers, Professional societies, Standards organisations, Institutions
  • 18. Global and national initiatives Dedicated projects Community Orgs Funders Policy Publishers FAIR first stage Dedicated Services
  • 19. Where we are going Where we are [Susanna Sansone] FAIR first stage
  • 20. FAIR first stage : Policymakers, Data service providers How to define, measure compliance and certify FAIR data? What is a dataset? General repos vs Curated authoritative archives? Principles for Data Repositories https://www.rd-alliance.org/trust-principles-rda-community-effort https://fairassist.org/
  • 23. 1. A common mechanism for metadata Respect and work with the huge legacy resources: repositories, registries, tools … community standards Find, register, index, search resources Move metadata between services without APIs Repositories -> Tools, Aggregators (e.g. licenses) -> Registries (upload, auto-curation) Registries -> Registries (across disciplines) Contribute to Knowledge Graphs a little bit of semantics at scale semantic underware invisible to users visible to developers & services
  • 24. Picture: Carole Goble, Turing Lecture 2018 Schema.org: Semantic Mark up for the Web Cartel of commercial search engines Wide web use, web infrastructure Web pages and sitemaps Types (830+) IceCreamShop Properties (1300+) hasMenu Not targeted at science - too much / too little Dataset type – 120 properties (Google Data Profile requires 2 properties) No type for Protein, Gene, Taxon
  • 25. Harnessing Schema.org for Bioscience Profile Data model Marginality information Controlled vocabularies Cardinality Documentation Examples New (properties | types) definition & consensus deployment and use tools & support Opinionated conventions Profiles & Link to domain ontologies Add Bioscience properties & types if necessary Examples & Usage Guidelines Community
  • 26. Harnessing Schema.org for Bioscience ChemicalSubstance definition & consensus deployment and use tools & support Opinionated conventions Profiles & Link to domain ontologies Add Bioscience properties & types if necessary Examples & Usage Guidelines Community
  • 27. Bioschemas metadata stratification broad & shallow / deepish & narrowish Generic Subject specific MolecularEntity, Protein, Sample, Taxon, ChemicalSubstance… DataCatalog Dataset dataset 5 minimum, 8 recommended properties license & provenance https://bioschemas.org/profiles/ Crosswalks to metadata schemas * • DCAT, DataCite, CrossRef, OpenAIRE, DDI • DCT:issued <-> schema:datePublished What is a dataset? Include community ontologies • Type: ChemicalSubstance • Property: biologicalRole • ExpectedType: ChEBI ontology * https://zenodo.org/record/4420116#.YKFOpaHTX18
  • 29. MolecularEntity ChemicalSubstance Toxicology Data Aggregator [with thanks: Egon Willighagen] MolecularEntity Gene Protein Taxon Dataset
  • 30. Lessons: Putting FAIR into Practice A little bit of semantics at scale -> build critical mass Profiles • Schema.org culture – Catch 22 • Consensus building, retention & Ontology-itis Provider mark-up • Developer friendly in house tools & wacky web implementations • Adoption incentives & costs of adapting database processes Consumer services • Adoption incentives – Catch 22 & tipping points • DataCatalog & Dataset popular -> Google Dataset search Consumer-provider readiness • Tools and training community take-up….
  • 31. 2. Packaging Research Objects Gather together into a “crate” files, unbounded references, & other crates. FAIR content: metadata, identifiers, provenance, citation about the content FAIR crates: metadata, PIDs, provenance, citation about the crate. more FAIR middleware -> towards FAIR Digital Objects* *FAIR Digital Objects for Science: From Data Pieces to Actionable Knowledge Units: https://doi.org/10.3390/publications8020021
  • 32. Why “crate up” objects? FAIR+R Flows: Researchers work with multiple and different objects using multiple infrastructures over periods of time exchange between platforms and people Parts: Research has associated objects linked together by context metadata files with files datasets, scripts, SOPs, articles … held in different places made at different times by different people & processes publish, report, reuse, cite, reproduce register, deposit, archive, port point to big, sensitive & active content
  • 33. Aggregate files and/or any URI-addressable content with structured metadata Web and Linked Data Native machine and human readable PIDs + JSON-LD + Schema.org, search engine & developer friendly Flex for open ended content, respect legacy typed by a profile + add more schema.org and domain ontologies http://www.researchobject.org/ro-crate/ Archive file format FAIR Object Middleware
  • 34. FAIR Middleware metadata carrying interchange format Knowledge Graph of Research Objects
  • 35. It’s FAIR metadata middleware, stupid • smart use of wheels already invented • get tools, services on board • developer friendly, firm best practice Known and Unknown unknowns One size does not fit all • contextual interpretation • descriptive open-endedness, multi-interpretation Analogous to FAIR Software • RDA/ReSA FAIR4Research Software WG Lessons: Putting FAIR into Practice
  • 36. 3. Making (legacy) datasets FAIR: FAIRification [Picture credit: Egon Willighagen]
  • 37. Credit to: Ian Harrow, FAIR & OM projects FAIR as enabler for the digital transformation ● Biopharma R&D productivity can be improved by implementing the FAIR Data Principles. ● FAIR enables powerful new AI analytics access to data for machine learning and prediction ● Fairly AI Ready ● Challenges ○ change the culture, show business value, achieve the ‘FAIR enough’ ○ Sustain FAIR solutions and activities Slide credit: Susanna Sansone
  • 38. Making (legacy) datasets FAIR: FAIRification > 100 Public-Private partnerships of European Commission, universities SMEs and Big Pharma translational projects Pharma’s own datasets
  • 39. *https://www.go-fair.org/how-to-go-fair/fair-data-point/ Data visiting through a FAIR Data Point* Linked Data / RDF tech Dataset transformation Methodology Linkset services RDF Warehouse (Knowledge Graph) - API not SPARQL - Sustainability & maintenance - Linksets PID mapping services
  • 40. FAIRification of legacy datasets Practical advice Assessment processes FAIR levels of projects / data Selection of datasets Cost/Benefit analysis Methodology Steps for 1 or more datasets Cultural change Legal templates Squads & BYODs Maturity models
  • 41. Interlinking data from different sources The lessons of good global and persistent identifiers. Mapping identifiers and services for mapping ids to ids and concepts to concepts. https://fairplus.github.io/the-fair-cookbook/content/recipes/interoperability/identifier-mapping.html
  • 42. FAIR by Design At the start of a collection, built in throughout the life cycle change management, capacity building FAIRifying Retrospectively Legacy datasets, build a cohort, cost benefit and FAIR readiness over a collection of datasets
  • 44. FA(I)R New FAIR Variants FAIR++ Legal > Organisational > Semantic > Technical* Business and change analysis. Cost Benefit Analysis. Scientific / Business Value Sustainability “…make a decision that these data are valuable enough to invest in the work required for FAIRification.” interoperability *EOSC Interoperability Framework
  • 45. What does FAIRifying a dataset mean? A database? A pdf? Depositing to a public archive? Identifier and ontology selecting, assigning, mapping between and to existing vocabs, and knowing about ontology services. High-fidelity ETL loss-less moving (meta)data from one system to another Lessons: Putting FAIR into Practice
  • 46. Lessons: Putting FAIR into Practice FAIR enough. Repository manager Admin monitoring Bioscientist Scientific analysis “Fairness does not mean everyone gets the same. Fairness means everyone gets what they need” (Rick Riordan). Maturity and importance spectrum It's not all worth it. FAIR gardens + FAIR scrub How to assess FAIR maturity levels, not to be certified but to make decisions.
  • 47. FAIR ≠ FREE - an expensive, expert team sport Mostly manual, mostly specific
  • 48. “It is a truth universally acknowledged that a Knowledge Graph must be in want of FAIR data. And FAIR data is in want of Knowledge Graphs.” harvesting added value DataCite PID Graph Bottlenecks: identifiers and ontologies curating and ingest pipelines of data providers
  • 49. 4. FAIR Data by Design at Source Data management platform for Project Hubs organising, cataloguing, sharing and publishing multiple kinds of research objects in multiple repositories for multi-partner projects. Community developed Knowledge Hub for guides, examples, tools, and pointers. Assembled and written by Life Science researchers and data stewards for their peers. https://rdmkit.elixir-europe.org https://fair-dom.org
  • 50. Lessons: Putting FAIR into Practice Data creators • Retention not sharing, act local not global • Advantage*: intimate knowledge, data flirting, credits & incentives Process change and values • Access to infrastructure with seamless information flows, Values • Time & resources to embed into practice FAIR Stewardship skills • Professionalisation & know-how *Pasquetto, I. V., Borgman, C. L., & Wofford, M. F. (2019). Uses and Reuses of Scientific Data: The Data Creators’ Advantage. Harvard Data Science Review, 1(2). https://doi.org/10.1162/99608f92.fc14bf2d
  • 51. Summary: FAIRy stories Theory -> mobilised some Practice -> marathon that takes a village Move the story from data providers to enabling creators & consumers prepare to share FAIR -> Research on Research Authorities Change Mgt Stewardship Service Providers Sustained infrastructure
  • 52. Acknowledgements Special thanks to • Stian Soiland-Reyes (Uni of Manchester/Uni of Amsterdam) • Nick Juty & Ebtisam Alharbi (University of Manchester) • Susanna Sansone (University of Oxford) • Tony Burdett (EMBL-EBI) • Ibrahim Emam (Imperial College) • Egon Willighagen (Maastricht University) • Alasdair Gray (Heriot-Watt University) Manchester, Research Object, RDMkit, FAIRDOM, FAIRplus, Bioschemas colleagues (about 130 people) Icons from the noun project (https://thenounproject.com/)
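
Illustration for slides 27 and 33: a minimal Python sketch, not taken from the talk, of how a small metadata crosswalk and an RO-Crate-style package might fit together. Only the DCT:issued <-> schema:datePublished mapping comes from slide 27; the other crosswalk entries, the function names `crosswalk` and `build_crate`, and the sample record are assumptions made for illustration. The JSON-LD layout (a metadata descriptor, a root Dataset and File entities) follows the RO-Crate 1.1 conventions as understood here; the specification at http://www.researchobject.org/ro-crate/ is the authority.

```python
import json

# Illustrative crosswalk from Dublin Core Terms to Schema.org property names.
# Only "dct:issued" -> "datePublished" is taken from slide 27; the remaining
# entries are assumptions added for this sketch.
DCT_TO_SCHEMA = {
    "dct:issued": "datePublished",
    "dct:title": "name",
    "dct:description": "description",
    "dct:license": "license",
}


def crosswalk(dct_record):
    """Rename known Dublin Core keys to Schema.org names; drop unmapped keys."""
    return {
        DCT_TO_SCHEMA[key]: value
        for key, value in dct_record.items()
        if key in DCT_TO_SCHEMA
    }


def build_crate(dataset_props, file_names):
    """Assemble an RO-Crate-style JSON-LD document (hypothetical helper)."""
    root = {"@id": "./", "@type": "Dataset", **dataset_props}
    # Reference the licence by identifier rather than as a plain string.
    if isinstance(root.get("license"), str):
        root["license"] = {"@id": root["license"]}
    root["hasPart"] = [{"@id": name} for name in file_names]

    # The descriptor entity states which spec version the crate follows
    # and points at the root Dataset it describes.
    descriptor = {
        "@id": "ro-crate-metadata.json",
        "@type": "CreativeWork",
        "conformsTo": {"@id": "https://w3id.org/ro/crate/1.1"},
        "about": {"@id": "./"},
    }
    files = [{"@id": name, "@type": "File"} for name in file_names]
    return {
        "@context": "https://w3id.org/ro/crate/1.1/context",
        "@graph": [descriptor, root, *files],
    }


if __name__ == "__main__":
    # Hypothetical legacy record; all values are made up for the example.
    legacy_record = {
        "dct:title": "Example assay results",
        "dct:issued": "2021-05-19",
        "dct:license": "https://creativecommons.org/licenses/by/4.0/",
        "dct:creator": "unmapped in this sketch",
    }
    crate = build_crate(crosswalk(legacy_record), ["measurements.csv"])
    print(json.dumps(crate, indent=2))
```

Saving the result as ro-crate-metadata.json next to the listed files would give a directory laid out like a crate. Unmapped keys are dropped silently in this sketch, whereas slide 45 asks for loss-less movement of (meta)data, so a real crosswalk would need to report or preserve them.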