SlideShare una empresa de Scribd logo
1 de 28
Descargar para leer sin conexión
Digital Infrastructure for Reproducibility
in Science
RDA – Paris 2015
@mrgunn
https://orcid.org/0000-0002-3555-2054
Old Model: Single type of content;
single mode of distribution
Scholar
Library
Scholar
Publisher
FORCE11.org: Future of research communications and e-scholarship
Credit: Maryann Martone, hypothes.is
Scholar
Consumer
Libraries
Data Repositories
Code Repositories
Community
databases/platforms
OA
Curators
Social
Networks
Social
NetworksSocial
Networks
Peer Reviewers
Narrative
Workflows
Data
Models
Multimedia
Nanopublications
Code
“Hidden web”
FORCE11.org: Future of research
communications and e-scholarship
Credit: Maryann Martone, hypothes.is
Scholar
Consumer
Libraries
Data Repositories
Code Repositories
Community
databases/platforms
OA
Curators
Social
Networks
Social
NetworksSocial
Networks
Peer Reviewers
Narrative
Workflows
Data
Models
Multimedia
Nanopublications
Code
“Hidden web”
FORCE11.org: Future of research
communications and e-scholarship
Credit: Maryann Martone, hypothes.is
Make your research data citable
 DOIs for your data
 Force11 compliant citations
 Unique DOI for each version
 Integrated into article
submission
 data linked to article
 See metrics of reuse
Built for collaboration
Private
Full control over who
can see and download
Add and remove
collaborators
Local hosting option
ELN integration
Built for mobile
Mendeley Data has been
created to work on a wide
range of devices - from your
lab workstation to your mobile
phone. Browse datasets on the
go, or take a picture in the lab
and upload it instantly to your
research data.
Scholar
Consumer
Libraries
Data Repositories
Code Repositories
Community
databases/platforms
OA
Curators
Social
Networks
Social
NetworksSocial
Networks
Peer Reviewers
Narrative
Workflows
Data
Models
Multimedia
Nanopublications
Code
“Hidden web”
FORCE11.org: Future of research
communications and e-scholarship
Credit: Maryann Martone, hypothes.is
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
Studies looking at the prevalence of irreproducibility estimate a rate of 50% or more
Vasilevsky et alBayer Healthcare
(Prinz et al.)
Amgen
(Begley and Ellis)
Glaziou et al.
51%
(n=80)
Hartshorne
and Schachner
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
51%
(n=257)
54%
(n=238)
78%
(n=67)
89%
(n=53)
64%
Source: GBSI 2015
Economic impact of irreproducibility
• A 50% irreproducibility rate within preclinical research implies that more
$28B/year is spent on research that is irreproducible
Reproducible
Irreproducible
Preclinical Research
$56.4B
$28.2B
(50%)
$28.2B
(50%)
US Preclinical Spend
2014E ($B)
$56B
(49%)
$58B
(51%)
Clinical
& Other
US Life Science Spend ($B)
Preclinical
R&D
$114B
Total US Life Science
Spend 2014E ($B)
Source: GBSI 2015
Retraction?
Only 0.2% of the literature (vs 50%+ irreproducibility)
Negative findings?
Less than 30% of researchers who could not reproduce published findings
published their failure1
Only 14% of the literature reports any negative results2
Additional publications from other academic
labs?
“We didn’t see that a target is more likely to be validated if it was reported in ten
publications or in two publications”3
Example: Retraction of PLOS4 and Science5 papers by Pamela Ronald at UC
Davis
• Self retraction due to reagent error
• Results had been ‘confirmed’ independently by three other groups6-8
1. Mobley et al. PLOS ONE. 8, e63221 (2013)
2. Fanelli. Scientometrics. 90, 891 (2012)
3. Prinz et al. Nat Rev Drug Discov. 10, 712 (2011)
4. Han et al. PLOS ONE. 6, e29192 (2011)
5. Lee et al. Science. 326, 850 (2009)
6. McCarthy et al. J Bacteriology. 193, 6375 (2011)
7. Shuguo et al. Appl Biochem Biotechnol. 166, 1368 (2012)
8. Qian et al. J Proteome Res. 12, 3327 (2013)
What about citations?
None of the replication studies reported
have found any correlation with citations (or
journal impact factor):
• NINDS - No significant difference1
• Bayer - No significant difference2
• Amgen - “We saw no significant difference in citation rates
between papers that were reproducible versus non-
reproducible”3
1. Stuart et al. Experimental Neurology 233, 597–605 (2012).
2. Prinz et al. Nat Rev Drug Discov. 10, 712 (2011).
3. Begley and Ellis. Nature. 483, 531-3 (2012)
Reproducibility Project: Cancer Biology
Reproducibility Project: Cancer Biology
Independently replicating key experimental results from
the top 50 cancer biology studies from 2010-2012
Learn more at http://cos.io/cancerbiology
Science Exchange
Validation Projects: lessons to date
Validation Projects: lessons to date
Validation Projects: lessons to date
Project 1. Cancer Biology Reproducibility Project
Resource identification:
Suggestions for improvements
Reduce reliance on contacting original authors who may
have moved on or forgotten details by the time of deposit
• Build tools to help capture research workflow and data
• (semi)-automated e.g. Riffyn, GenomeStudio
• Update journal formats to capture more information and
data e.g. GigaScience, Nature
• Uniquely identify commercial reagents e.g. Resource
Identification Initiative
• Repositories for materials e.g. Addgene, JAX, ATCC
Push over pull
DATA!
How and why we fail
at infrastructure
Definition – that which you only
notice when it breaks
We’re terrible at it, because it’s very
bureaucratic and boring
Funders want to talk about
infrastructure (meetings like
this!), but researchers don't.
How we do infrastructure now
1. A funder offers money to build infrastructure.
2. Researchers propose to do what they’re already doing
but generate infrastructure as a “side effect”.
3. A project gets funded using words such as “open,
distributed, lightweight, framework, etc”.
4. The distributed system gets co-opted by a centralized
one (or it fades away)
Examples: Handles & URIs-> DOIs, profiles -> ORCID
Bilder/Lin/Neylon org principles
• Trust can’t be solved by technology.
• A stakeholder-governed organization needs to manage
infrastructure.
• Transparency and openness is necessary but not
sufficient
• Org should aim for sustainability, but must not lobby and
needs formal incentives to replace itself
• Revenue should be based on services, not data
– builds trust, as community can go elsewhere if needed
www.mendeley.com
william.gunn@mendeley.com
@mrgunn

Más contenido relacionado

La actualidad más candente

Leveraging VIVO data: visualizations, queries, and reports
Leveraging VIVO data: visualizations, queries, and reportsLeveraging VIVO data: visualizations, queries, and reports
Leveraging VIVO data: visualizations, queries, and reports
Paul Albert
 

La actualidad más candente (20)

Laurie Goodman: Overcoming Hurdles to Data Publication
Laurie Goodman: Overcoming Hurdles to Data PublicationLaurie Goodman: Overcoming Hurdles to Data Publication
Laurie Goodman: Overcoming Hurdles to Data Publication
 
SEEK for Science: A Data and Model Management Platform to support Open and Re...
SEEK for Science: A Data and Model Management Platform to support Open and Re...SEEK for Science: A Data and Model Management Platform to support Open and Re...
SEEK for Science: A Data and Model Management Platform to support Open and Re...
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015
 
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
 
Third-Party PubMed Tools
Third-Party PubMed ToolsThird-Party PubMed Tools
Third-Party PubMed Tools
 
MLA CE Course: Third-Party PubMed Tools
MLA CE Course: Third-Party PubMed ToolsMLA CE Course: Third-Party PubMed Tools
MLA CE Course: Third-Party PubMed Tools
 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
 
NITLE Open Notebook Science Talk
NITLE Open Notebook Science TalkNITLE Open Notebook Science Talk
NITLE Open Notebook Science Talk
 
Leveraging publication metadata to help overcome the data ingest bottleneck
Leveraging publication metadata to help overcome the data ingest bottleneck Leveraging publication metadata to help overcome the data ingest bottleneck
Leveraging publication metadata to help overcome the data ingest bottleneck
 
W3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description GuidelinesW3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description Guidelines
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
 
Public Data Archiving in Ecology and Evolution: How well are we doing?
Public Data Archiving in Ecology and Evolution: How well are we doing?Public Data Archiving in Ecology and Evolution: How well are we doing?
Public Data Archiving in Ecology and Evolution: How well are we doing?
 
Data reuse and scholarly reward: understanding practice and building infrastr...
Data reuse and scholarly reward: understanding practice and building infrastr...Data reuse and scholarly reward: understanding practice and building infrastr...
Data reuse and scholarly reward: understanding practice and building infrastr...
 
Laurie Goodman at #aibsdata: Beyond Data Release Mandates - Helping Authors M...
Laurie Goodman at #aibsdata: Beyond Data Release Mandates - Helping Authors M...Laurie Goodman at #aibsdata: Beyond Data Release Mandates - Helping Authors M...
Laurie Goodman at #aibsdata: Beyond Data Release Mandates - Helping Authors M...
 
Reproducible research: theory
Reproducible research: theoryReproducible research: theory
Reproducible research: theory
 
Open Notebook Science in Drug Discovery
Open Notebook Science in Drug DiscoveryOpen Notebook Science in Drug Discovery
Open Notebook Science in Drug Discovery
 
Research data and scholarly publications: going from casual acquaintances to ...
Research data and scholarly publications: going from casual acquaintances to ...Research data and scholarly publications: going from casual acquaintances to ...
Research data and scholarly publications: going from casual acquaintances to ...
 
Leveraging VIVO data: visualizations, queries, and reports
Leveraging VIVO data: visualizations, queries, and reportsLeveraging VIVO data: visualizations, queries, and reports
Leveraging VIVO data: visualizations, queries, and reports
 
The Dryad Digital Repository: Published evolutionary data as part of the gre...
The Dryad Digital Repository: Published evolutionary data as part of the gre...The Dryad Digital Repository: Published evolutionary data as part of the gre...
The Dryad Digital Repository: Published evolutionary data as part of the gre...
 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
 

Destacado

How to Use the Average Function
How to Use the Average FunctionHow to Use the Average Function
How to Use the Average Function
murphyk0209
 
Panoramicas IV
Panoramicas IVPanoramicas IV
Panoramicas IV
F. Ovies
 
Definition of Mesoscale
Definition of MesoscaleDefinition of Mesoscale
Definition of Mesoscale
Ben Dewell
 
Redifice Arlington Promenade
Redifice Arlington PromenadeRedifice Arlington Promenade
Redifice Arlington Promenade
Bangalore Prj
 
ISHWERYA QUARTZ 2BHK APARTMENTS FOR SALE
ISHWERYA QUARTZ 2BHK APARTMENTS FOR SALEISHWERYA QUARTZ 2BHK APARTMENTS FOR SALE
ISHWERYA QUARTZ 2BHK APARTMENTS FOR SALE
Bangalore Prj
 
7. Stress Management Physiological
7. Stress Management Physiological7. Stress Management Physiological
7. Stress Management Physiological
rossbiology
 

Destacado (20)

How to Use the Average Function
How to Use the Average FunctionHow to Use the Average Function
How to Use the Average Function
 
Franciaország
FranciaországFranciaország
Franciaország
 
Panoramicas IV
Panoramicas IVPanoramicas IV
Panoramicas IV
 
Charles Porch: Facebook for Nonprofits
Charles Porch: Facebook for NonprofitsCharles Porch: Facebook for Nonprofits
Charles Porch: Facebook for Nonprofits
 
Gala NAZARENA 2.012
Gala NAZARENA 2.012Gala NAZARENA 2.012
Gala NAZARENA 2.012
 
Definition of Mesoscale
Definition of MesoscaleDefinition of Mesoscale
Definition of Mesoscale
 
Redifice Arlington Promenade
Redifice Arlington PromenadeRedifice Arlington Promenade
Redifice Arlington Promenade
 
Internet e' Cambiato
Internet e' CambiatoInternet e' Cambiato
Internet e' Cambiato
 
CIRCUITOS
CIRCUITOSCIRCUITOS
CIRCUITOS
 
Nnv
NnvNnv
Nnv
 
d8ooa.odp
d8ooa.odpd8ooa.odp
d8ooa.odp
 
Alexis Ohanian: The Benevolent Web
Alexis Ohanian: The Benevolent Web  Alexis Ohanian: The Benevolent Web
Alexis Ohanian: The Benevolent Web
 
ISHWERYA QUARTZ 2BHK APARTMENTS FOR SALE
ISHWERYA QUARTZ 2BHK APARTMENTS FOR SALEISHWERYA QUARTZ 2BHK APARTMENTS FOR SALE
ISHWERYA QUARTZ 2BHK APARTMENTS FOR SALE
 
Rory Green - "Building a Major Gift Pipeline From Scratch"
Rory Green - "Building a Major Gift Pipeline From Scratch"Rory Green - "Building a Major Gift Pipeline From Scratch"
Rory Green - "Building a Major Gift Pipeline From Scratch"
 
Translanguaging as pedagogic strategy and as resource for identity performanc...
Translanguaging as pedagogic strategy and as resource for identity performanc...Translanguaging as pedagogic strategy and as resource for identity performanc...
Translanguaging as pedagogic strategy and as resource for identity performanc...
 
7. Stress Management Physiological
7. Stress Management Physiological7. Stress Management Physiological
7. Stress Management Physiological
 
Proyecto final_flipped classroom
Proyecto final_flipped classroomProyecto final_flipped classroom
Proyecto final_flipped classroom
 
Análise estática de código Python
Análise estática de código PythonAnálise estática de código Python
Análise estática de código Python
 
Empreendedorismo e as Oportunidades Disfarçadas
Empreendedorismo e as Oportunidades DisfarçadasEmpreendedorismo e as Oportunidades Disfarçadas
Empreendedorismo e as Oportunidades Disfarçadas
 
Aula de Captação de Recursos - parte 2
Aula de Captação de Recursos - parte 2Aula de Captação de Recursos - parte 2
Aula de Captação de Recursos - parte 2
 

Similar a RDA Scholarly Infrastructure 2015

Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynote
Carole Goble
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Carole Goble
 

Similar a RDA Scholarly Infrastructure 2015 (20)

Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015
 
RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research Objects
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decade
 
Force11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscapeForce11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscape
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
One Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific PublishersOne Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific Publishers
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynote
 
2015 12 ebi_ganley_final
2015 12 ebi_ganley_final2015 12 ebi_ganley_final
2015 12 ebi_ganley_final
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Data at the NIH: Some Early Thoughts
Data at the NIH: Some Early ThoughtsData at the NIH: Some Early Thoughts
Data at the NIH: Some Early Thoughts
 
Minimal viable data reuse
Minimal viable data reuseMinimal viable data reuse
Minimal viable data reuse
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data Challenges
 
Elsevier - Labs on Line
Elsevier - Labs on Line Elsevier - Labs on Line
Elsevier - Labs on Line
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
 
Biomedical Research as Part of the Digital Enterprise
Biomedical Research as Part of the Digital EnterpriseBiomedical Research as Part of the Digital Enterprise
Biomedical Research as Part of the Digital Enterprise
 
Looking for Data: Finding New Science
Looking for Data: Finding New ScienceLooking for Data: Finding New Science
Looking for Data: Finding New Science
 
On community-standards, data curation and scholarly communication - BITS, Ita...
On community-standards, data curation and scholarly communication - BITS, Ita...On community-standards, data curation and scholarly communication - BITS, Ita...
On community-standards, data curation and scholarly communication - BITS, Ita...
 
Cartegena051811
Cartegena051811Cartegena051811
Cartegena051811
 
The OpenCon Intro to Open Data
The OpenCon Intro to Open DataThe OpenCon Intro to Open Data
The OpenCon Intro to Open Data
 
The FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyThe FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems Biology
 

Último

Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
PirithiRaju
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
gindu3009
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Lokesh Kothari
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
Areesha Ahmad
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
Lokesh Kothari
 

Último (20)

High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
American Type Culture Collection (ATCC).pptx
American Type Culture Collection (ATCC).pptxAmerican Type Culture Collection (ATCC).pptx
American Type Culture Collection (ATCC).pptx
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
GUIDELINES ON SIMILAR BIOLOGICS Regulatory Requirements for Marketing Authori...
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 

RDA Scholarly Infrastructure 2015

  • 1. Digital Infrastructure for Reproducibility in Science RDA – Paris 2015 @mrgunn https://orcid.org/0000-0002-3555-2054
  • 2. Old Model: Single type of content; single mode of distribution Scholar Library Scholar Publisher FORCE11.org: Future of research communications and e-scholarship Credit: Maryann Martone, hypothes.is
  • 3. Scholar Consumer Libraries Data Repositories Code Repositories Community databases/platforms OA Curators Social Networks Social NetworksSocial Networks Peer Reviewers Narrative Workflows Data Models Multimedia Nanopublications Code “Hidden web” FORCE11.org: Future of research communications and e-scholarship Credit: Maryann Martone, hypothes.is
  • 4.
  • 5. Scholar Consumer Libraries Data Repositories Code Repositories Community databases/platforms OA Curators Social Networks Social NetworksSocial Networks Peer Reviewers Narrative Workflows Data Models Multimedia Nanopublications Code “Hidden web” FORCE11.org: Future of research communications and e-scholarship Credit: Maryann Martone, hypothes.is
  • 6. Make your research data citable  DOIs for your data  Force11 compliant citations  Unique DOI for each version  Integrated into article submission  data linked to article  See metrics of reuse
  • 7. Built for collaboration Private Full control over who can see and download Add and remove collaborators Local hosting option ELN integration
  • 8. Built for mobile Mendeley Data has been created to work on a wide range of devices - from your lab workstation to your mobile phone. Browse datasets on the go, or take a picture in the lab and upload it instantly to your research data.
  • 9.
  • 10. Scholar Consumer Libraries Data Repositories Code Repositories Community databases/platforms OA Curators Social Networks Social NetworksSocial Networks Peer Reviewers Narrative Workflows Data Models Multimedia Nanopublications Code “Hidden web” FORCE11.org: Future of research communications and e-scholarship Credit: Maryann Martone, hypothes.is
  • 11. 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% Studies looking at the prevalence of irreproducibility estimate a rate of 50% or more Vasilevsky et alBayer Healthcare (Prinz et al.) Amgen (Begley and Ellis) Glaziou et al. 51% (n=80) Hartshorne and Schachner 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% 51% (n=257) 54% (n=238) 78% (n=67) 89% (n=53) 64% Source: GBSI 2015
  • 12. Economic impact of irreproducibility • A 50% irreproducibility rate within preclinical research implies that more $28B/year is spent on research that is irreproducible Reproducible Irreproducible Preclinical Research $56.4B $28.2B (50%) $28.2B (50%) US Preclinical Spend 2014E ($B) $56B (49%) $58B (51%) Clinical & Other US Life Science Spend ($B) Preclinical R&D $114B Total US Life Science Spend 2014E ($B) Source: GBSI 2015
  • 13. Retraction? Only 0.2% of the literature (vs 50%+ irreproducibility) Negative findings? Less than 30% of researchers who could not reproduce published findings published their failure1 Only 14% of the literature reports any negative results2 Additional publications from other academic labs? “We didn’t see that a target is more likely to be validated if it was reported in ten publications or in two publications”3 Example: Retraction of PLOS4 and Science5 papers by Pamela Ronald at UC Davis • Self retraction due to reagent error • Results had been ‘confirmed’ independently by three other groups6-8 1. Mobley et al. PLOS ONE. 8, e63221 (2013) 2. Fanelli. Scientometrics. 90, 891 (2012) 3. Prinz et al. Nat Rev Drug Discov. 10, 712 (2011) 4. Han et al. PLOS ONE. 6, e29192 (2011) 5. Lee et al. Science. 326, 850 (2009) 6. McCarthy et al. J Bacteriology. 193, 6375 (2011) 7. Shuguo et al. Appl Biochem Biotechnol. 166, 1368 (2012) 8. Qian et al. J Proteome Res. 12, 3327 (2013)
  • 14. What about citations? None of the replication studies reported have found any correlation with citations (or journal impact factor): • NINDS - No significant difference1 • Bayer - No significant difference2 • Amgen - “We saw no significant difference in citation rates between papers that were reproducible versus non- reproducible”3 1. Stuart et al. Experimental Neurology 233, 597–605 (2012). 2. Prinz et al. Nat Rev Drug Discov. 10, 712 (2011). 3. Begley and Ellis. Nature. 483, 531-3 (2012)
  • 16. Reproducibility Project: Cancer Biology Independently replicating key experimental results from the top 50 cancer biology studies from 2010-2012 Learn more at http://cos.io/cancerbiology
  • 20. Validation Projects: lessons to date Project 1. Cancer Biology Reproducibility Project Resource identification:
  • 21. Suggestions for improvements Reduce reliance on contacting original authors who may have moved on or forgotten details by the time of deposit • Build tools to help capture research workflow and data • (semi)-automated e.g. Riffyn, GenomeStudio • Update journal formats to capture more information and data e.g. GigaScience, Nature • Uniquely identify commercial reagents e.g. Resource Identification Initiative • Repositories for materials e.g. Addgene, JAX, ATCC
  • 23. How and why we fail at infrastructure
  • 24. Definition – that which you only notice when it breaks We’re terrible at it, because it’s very bureaucratic and boring
  • 25. Funders want to talk about infrastructure (meetings like this!), but researchers don't.
  • 26. How we do infrastructure now 1. A funder offers money to build infrastructure. 2. Researchers propose to do what they’re already doing but generate infrastructure as a “side effect”. 3. A project gets funded using words such as “open, distributed, lightweight, framework, etc”. 4. The distributed system gets co-opted by a centralized one (or it fades away) Examples: Handles & URIs-> DOIs, profiles -> ORCID
  • 27. Bilder/Lin/Neylon org principles • Trust can’t be solved by technology. • A stakeholder-governed organization needs to manage infrastructure. • Transparency and openness is necessary but not sufficient • Org should aim for sustainability, but must not lobby and needs formal incentives to replace itself • Revenue should be based on services, not data – builds trust, as community can go elsewhere if needed