SlideShare a Scribd company logo
1 of 15
On the reproducibility
of science
Melissa Haendel
Beyond the PDF2
20 March 2013
@ontowonka
haendel@ohsu.edu
Do we know if the infrastructure is
actually broken?
Slide	
  from	
  Gully	
  Burns	
  
The	
  science	
  cycle	
  
This is a broken data story.
The	
  science	
  cycle	
  
Image:	
  h6p://www.joinchangena=on.org/blog/post/roadblocks-­‐on-­‐the-­‐pathway-­‐to-­‐ci=zenship	
  
Journal guidelines for methods are
often poor and space is limited
“All	
  companies	
  from	
  which	
  materials	
  were	
  obtained	
  should	
  
be	
  listed.”	
   -­‐	
  A	
  well-­‐known	
  journal	
  
Reproducibility	
  is	
  dependent	
  at	
  a	
  minimum,	
  on	
  using	
  the	
  
same	
  resources.	
  But…	
  
Hypothesis:	
  AnAbodies	
  in	
  the	
  published	
  literature	
  
are	
  not	
  uniquely	
  idenAfiable	
  	
  
An experiment in reproducibility
Gather	
  journal	
  
ar=cles	
  
5	
  domains:	
  
Immunology	
  
Cell	
  biology	
  
Neuroscience	
  
Developmental	
  biology	
  
General	
  biology	
  
3	
  impact	
  factors:	
  
High	
  
Medium	
  
Low	
  
28	
  Journals	
  
119	
  papers	
  
454	
  an=bodies	
  
408	
  commercial	
  
an=bodies	
  
46	
  non-­‐commercial	
  
an=bodies	
  
Iden=fying	
  ques=ons:	
  
Is	
  the	
  an=body	
  iden=fiable	
  
in	
  the	
  vendor	
  site?	
  
Is	
  the	
  catalog	
  number	
  
reported?	
  
Is	
  the	
  source	
  organism	
  
reported?	
  
Is	
  the	
  an=body	
  target	
  
iden=fiable?	
  
The data shows…
Approximately	
  half	
  of	
  anAbodies	
  are	
  not	
  uniquely	
  idenAfiable	
  in	
  
119	
  publicaAons	
  Percent	
  idenAfiable	
  
0%	
  
10%	
  
20%	
  
30%	
  
40%	
  
50%	
  
60%	
  
Commercial	
  an=body	
   Non-­‐commerical	
  an=body	
  
n=408	
  
n=46	
  
0%	
  
10%	
  
20%	
  
30%	
  
40%	
  
50%	
  
60%	
  
70%	
  
80%	
  
90%	
  
100%	
  
Immunology	
  Neuroscience	
   Dev	
  Bio	
   Cell	
  Bio	
   General	
  Bio	
  
High	
  
Medium	
  
Low	
  
Percent	
  iden=fiable	
  
n=124	
   n=94	
  
n=87	
  
n=95	
  
n=56	
  
Unique	
  idenAficaAon	
  of	
  commercial	
  anAbodies	
  varies	
  across	
  discipline	
  and	
  
impact	
  factor	
  
In some domains high impact journals have worse
reporting, and in others it is the opposite
Maybe labs are just disorganized?
Meet the Urban Lab
Meet the Urban Lab
Image:	
  Gourami	
  Watcher	
  
A+ organization!
The	
  Urban	
  lab	
  anAbodies	
  
Of 14 antibodies published in 45 articles,
only 38% were identifiable 
0%	
  
10%	
  
20%	
  
30%	
  
40%	
  
50%	
  
60%	
  
70%	
  
80%	
  
90%	
  
Commerical	
  Ab	
  
iden=fiable	
  
Non-­‐commercial	
  
Ab	
  iden=fiable	
  	
  
Catalog	
  number	
  
reported	
  
Source	
  organism	
  
reported	
  
Target	
  uniquely	
  
iden=fiable	
  
Percent	
  idenAfiable	
  
What does this tell us?
Scientists really do put their
data in cardboard boxes.
Ø Promote	
  beJer	
  reporAng	
  guidelines	
  in	
  journals	
  
Ø Include	
  reviewing	
  guidelines	
  
Ø Provide	
  tools	
  to	
  reference	
  research	
  resources	
  
with	
  unique	
  and	
  persistent	
  IDs/URIs	
  	
  
Ø Train	
  librarians	
  and	
  other	
  data	
  stewards	
  to	
  
apply	
  data	
  standards	
  
What are we going to do about it?

More Related Content

Viewers also liked (8)

Doc1
Doc1Doc1
Doc1
 
чсс
чссчсс
чсс
 
NIF Data Federation
NIF Data FederationNIF Data Federation
NIF Data Federation
 
Calderón Airlines
Calderón AirlinesCalderón Airlines
Calderón Airlines
 
A Deep Survey of the Digital Resource Landscape
A Deep Survey of the Digital Resource LandscapeA Deep Survey of the Digital Resource Landscape
A Deep Survey of the Digital Resource Landscape
 
Google Drive Presentaciones
Google Drive PresentacionesGoogle Drive Presentaciones
Google Drive Presentaciones
 
NIFSTD: A Comprehensive Ontology for Neuroscience
NIFSTD: A Comprehensive Ontology for NeuroscienceNIFSTD: A Comprehensive Ontology for Neuroscience
NIFSTD: A Comprehensive Ontology for Neuroscience
 
Ikusentzutezko aholkuak tele lauro apunteak
Ikusentzutezko aholkuak tele lauro apunteakIkusentzutezko aholkuak tele lauro apunteak
Ikusentzutezko aholkuak tele lauro apunteak
 

Similar to On the Reproducibility of Science

On the reproducibility of science
On the reproducibility of scienceOn the reproducibility of science
On the reproducibility of sciencemhaendel
 
The Role of Libraries in Data Management and Curation
The Role of Libraries in Data Management and CurationThe Role of Libraries in Data Management and Curation
The Role of Libraries in Data Management and CurationNicole Vasilevsky
 
eScience Institute presentation on eagle-i
eScience Institute presentation on eagle-ieScience Institute presentation on eagle-i
eScience Institute presentation on eagle-imhaendel
 
Biocuration 2014 - The Resource Identification Initiative
Biocuration 2014 - The Resource Identification InitiativeBiocuration 2014 - The Resource Identification Initiative
Biocuration 2014 - The Resource Identification Initiativemhaendel
 
Excursions into the garden of the forking paths
Excursions into the garden of the forking paths Excursions into the garden of the forking paths
Excursions into the garden of the forking paths Ulrich Dirnagl
 
On the Reproducibility of Science: Unique Identification of Research Resourc...
On the Reproducibility of Science: Unique Identification of  Research Resourc...On the Reproducibility of Science: Unique Identification of  Research Resourc...
On the Reproducibility of Science: Unique Identification of Research Resourc...Nicole Vasilevsky
 
C&E news talk sept 16
C&E news talk sept 16C&E news talk sept 16
C&E news talk sept 16Sean Ekins
 
FAIRness and Accountability BioIT 2019 FAIR track
FAIRness and Accountability BioIT 2019 FAIR trackFAIRness and Accountability BioIT 2019 FAIR track
FAIRness and Accountability BioIT 2019 FAIR trackHelena Deus
 
Reprodutibilidade em resultados de pesquisa (Olavo Bohrer Amaral)
Reprodutibilidade em resultados de pesquisa (Olavo Bohrer Amaral)Reprodutibilidade em resultados de pesquisa (Olavo Bohrer Amaral)
Reprodutibilidade em resultados de pesquisa (Olavo Bohrer Amaral)http://bvsalud.org/
 
DRUGS New agreement to tackle pharmaceutical pollution p.1
DRUGS New agreement to tackle pharmaceutical pollution p.1DRUGS New agreement to tackle pharmaceutical pollution p.1
DRUGS New agreement to tackle pharmaceutical pollution p.1AlyciaGold776
 
Indications discovery and drug repurposing
Indications discovery and drug repurposingIndications discovery and drug repurposing
Indications discovery and drug repurposingSean Ekins
 
Scientific Method Lecture+.ppt
Scientific Method Lecture+.pptScientific Method Lecture+.ppt
Scientific Method Lecture+.pptRajNetkar
 
Scientific inv and nature of science
Scientific inv and nature of scienceScientific inv and nature of science
Scientific inv and nature of scienceKarl Pointer
 
Comparing Research Designs
Comparing Research DesignsComparing Research Designs
Comparing Research DesignsPat Barlow
 
Atul Butte's presentation at the From Data to Discovery symposium at Westat
Atul Butte's presentation at the From Data to Discovery symposium at WestatAtul Butte's presentation at the From Data to Discovery symposium at Westat
Atul Butte's presentation at the From Data to Discovery symposium at WestatUniversity of California, San Francisco
 
CEPLAS Cologne June 2017: Research misconduct; science‘s self administered ...
CEPLAS Cologne June 2017:  Research misconduct; science‘s self  administered ...CEPLAS Cologne June 2017:  Research misconduct; science‘s self  administered ...
CEPLAS Cologne June 2017: Research misconduct; science‘s self administered ...Leonid Schneider
 
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...DeVonne Parks, CEM
 

Similar to On the Reproducibility of Science (20)

On the reproducibility of science
On the reproducibility of scienceOn the reproducibility of science
On the reproducibility of science
 
The Role of Libraries in Data Management and Curation
The Role of Libraries in Data Management and CurationThe Role of Libraries in Data Management and Curation
The Role of Libraries in Data Management and Curation
 
eScience Institute presentation on eagle-i
eScience Institute presentation on eagle-ieScience Institute presentation on eagle-i
eScience Institute presentation on eagle-i
 
Biocuration 2014 - The Resource Identification Initiative
Biocuration 2014 - The Resource Identification InitiativeBiocuration 2014 - The Resource Identification Initiative
Biocuration 2014 - The Resource Identification Initiative
 
Excursions into the garden of the forking paths
Excursions into the garden of the forking paths Excursions into the garden of the forking paths
Excursions into the garden of the forking paths
 
On the Reproducibility of Science: Unique Identification of Research Resourc...
On the Reproducibility of Science: Unique Identification of  Research Resourc...On the Reproducibility of Science: Unique Identification of  Research Resourc...
On the Reproducibility of Science: Unique Identification of Research Resourc...
 
C&E news talk sept 16
C&E news talk sept 16C&E news talk sept 16
C&E news talk sept 16
 
FAIRness and Accountability BioIT 2019 FAIR track
FAIRness and Accountability BioIT 2019 FAIR trackFAIRness and Accountability BioIT 2019 FAIR track
FAIRness and Accountability BioIT 2019 FAIR track
 
Reprodutibilidade em resultados de pesquisa (Olavo Bohrer Amaral)
Reprodutibilidade em resultados de pesquisa (Olavo Bohrer Amaral)Reprodutibilidade em resultados de pesquisa (Olavo Bohrer Amaral)
Reprodutibilidade em resultados de pesquisa (Olavo Bohrer Amaral)
 
DRUGS New agreement to tackle pharmaceutical pollution p.1
DRUGS New agreement to tackle pharmaceutical pollution p.1DRUGS New agreement to tackle pharmaceutical pollution p.1
DRUGS New agreement to tackle pharmaceutical pollution p.1
 
Eric G. Campbell, "IRB Oversight of PCOR: A National Survey of IRB Chairs"
Eric G. Campbell, "IRB Oversight of PCOR: A National Survey of IRB Chairs"Eric G. Campbell, "IRB Oversight of PCOR: A National Survey of IRB Chairs"
Eric G. Campbell, "IRB Oversight of PCOR: A National Survey of IRB Chairs"
 
Indications discovery and drug repurposing
Indications discovery and drug repurposingIndications discovery and drug repurposing
Indications discovery and drug repurposing
 
Scientific Method Lecture+.ppt
Scientific Method Lecture+.pptScientific Method Lecture+.ppt
Scientific Method Lecture+.ppt
 
Scientific inv and nature of science
Scientific inv and nature of scienceScientific inv and nature of science
Scientific inv and nature of science
 
Comparing Research Designs
Comparing Research DesignsComparing Research Designs
Comparing Research Designs
 
Atul Butte's presentation at the From Data to Discovery symposium at Westat
Atul Butte's presentation at the From Data to Discovery symposium at WestatAtul Butte's presentation at the From Data to Discovery symposium at Westat
Atul Butte's presentation at the From Data to Discovery symposium at Westat
 
Grief Responses
Grief ResponsesGrief Responses
Grief Responses
 
Epidemiological Studies
Epidemiological StudiesEpidemiological Studies
Epidemiological Studies
 
CEPLAS Cologne June 2017: Research misconduct; science‘s self administered ...
CEPLAS Cologne June 2017:  Research misconduct; science‘s self  administered ...CEPLAS Cologne June 2017:  Research misconduct; science‘s self  administered ...
CEPLAS Cologne June 2017: Research misconduct; science‘s self administered ...
 
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
January 13, 2016 NISO Webinar: Ensuring the Scholarly Record: Scholarly Retra...
 

More from Neuroscience Information Framework

Neurosciences Information Framework (NIF): An example of community Cyberinfra...
Neurosciences Information Framework (NIF): An example of community Cyberinfra...Neurosciences Information Framework (NIF): An example of community Cyberinfra...
Neurosciences Information Framework (NIF): An example of community Cyberinfra...Neuroscience Information Framework
 
The Neuroscience Information Framework: A Scalable Platform for Information E...
The Neuroscience Information Framework: A Scalable Platform for Information E...The Neuroscience Information Framework: A Scalable Platform for Information E...
The Neuroscience Information Framework: A Scalable Platform for Information E...Neuroscience Information Framework
 
The possibility and probability of a global Neuroscience Information Framework
The possibility and probability of a global Neuroscience Information Framework The possibility and probability of a global Neuroscience Information Framework
The possibility and probability of a global Neuroscience Information Framework Neuroscience Information Framework
 
Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...
Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...
Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...Neuroscience Information Framework
 

More from Neuroscience Information Framework (20)

Why should my institution support RRIDs?
Why should my institution support RRIDs?Why should my institution support RRIDs?
Why should my institution support RRIDs?
 
Why should Journals ask fo RRIDs?
Why should Journals ask fo RRIDs?Why should Journals ask fo RRIDs?
Why should Journals ask fo RRIDs?
 
Funders and RRIDs
Funders and RRIDsFunders and RRIDs
Funders and RRIDs
 
Neuroscience as networked science
Neuroscience as networked scienceNeuroscience as networked science
Neuroscience as networked science
 
Martone acs presentation
Martone acs presentationMartone acs presentation
Martone acs presentation
 
Data Landscapes - Addiction
Data Landscapes - AddictionData Landscapes - Addiction
Data Landscapes - Addiction
 
INCF 2013 - Uniform Resource Layer
INCF 2013 - Uniform Resource LayerINCF 2013 - Uniform Resource Layer
INCF 2013 - Uniform Resource Layer
 
Neurosciences Information Framework (NIF): An example of community Cyberinfra...
Neurosciences Information Framework (NIF): An example of community Cyberinfra...Neurosciences Information Framework (NIF): An example of community Cyberinfra...
Neurosciences Information Framework (NIF): An example of community Cyberinfra...
 
The Neuroscience Information Framework: A Scalable Platform for Information E...
The Neuroscience Information Framework: A Scalable Platform for Information E...The Neuroscience Information Framework: A Scalable Platform for Information E...
The Neuroscience Information Framework: A Scalable Platform for Information E...
 
The Uniform Resource Layer
The Uniform Resource LayerThe Uniform Resource Layer
The Uniform Resource Layer
 
NIF services overview
NIF services overviewNIF services overview
NIF services overview
 
NIF Lexical Overview
NIF Lexical OverviewNIF Lexical Overview
NIF Lexical Overview
 
NIF Services
NIF ServicesNIF Services
NIF Services
 
NIF Data Registration
NIF Data RegistrationNIF Data Registration
NIF Data Registration
 
NIF Data Ingest
NIF Data IngestNIF Data Ingest
NIF Data Ingest
 
NIF Overview
NIF Overview NIF Overview
NIF Overview
 
The possibility and probability of a global Neuroscience Information Framework
The possibility and probability of a global Neuroscience Information Framework The possibility and probability of a global Neuroscience Information Framework
The possibility and probability of a global Neuroscience Information Framework
 
NIF: A vision for a uniform resource layer
NIF: A vision for a uniform resource layerNIF: A vision for a uniform resource layer
NIF: A vision for a uniform resource layer
 
Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...
Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...
Publishing for the 21st Century: Experiences from the NEUROSCIENCE INFORMATIO...
 
Navigating the Neuroscience Data Landscape
Navigating the Neuroscience Data LandscapeNavigating the Neuroscience Data Landscape
Navigating the Neuroscience Data Landscape
 

Recently uploaded

Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 

Recently uploaded (20)

Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 

On the Reproducibility of Science

  • 1. On the reproducibility of science Melissa Haendel Beyond the PDF2 20 March 2013 @ontowonka haendel@ohsu.edu
  • 2. Do we know if the infrastructure is actually broken? Slide  from  Gully  Burns   The  science  cycle  
  • 3. This is a broken data story. The  science  cycle   Image:  h6p://www.joinchangena=on.org/blog/post/roadblocks-­‐on-­‐the-­‐pathway-­‐to-­‐ci=zenship  
  • 4. Journal guidelines for methods are often poor and space is limited “All  companies  from  which  materials  were  obtained  should   be  listed.”   -­‐  A  well-­‐known  journal   Reproducibility  is  dependent  at  a  minimum,  on  using  the   same  resources.  But…  
  • 5. Hypothesis:  AnAbodies  in  the  published  literature   are  not  uniquely  idenAfiable     An experiment in reproducibility Gather  journal   ar=cles   5  domains:   Immunology   Cell  biology   Neuroscience   Developmental  biology   General  biology   3  impact  factors:   High   Medium   Low   28  Journals   119  papers   454  an=bodies   408  commercial   an=bodies   46  non-­‐commercial   an=bodies   Iden=fying  ques=ons:   Is  the  an=body  iden=fiable   in  the  vendor  site?   Is  the  catalog  number   reported?   Is  the  source  organism   reported?   Is  the  an=body  target   iden=fiable?  
  • 6. The data shows… Approximately  half  of  anAbodies  are  not  uniquely  idenAfiable  in   119  publicaAons  Percent  idenAfiable   0%   10%   20%   30%   40%   50%   60%   Commercial  an=body   Non-­‐commerical  an=body   n=408   n=46  
  • 7. 0%   10%   20%   30%   40%   50%   60%   70%   80%   90%   100%   Immunology  Neuroscience   Dev  Bio   Cell  Bio   General  Bio   High   Medium   Low   Percent  iden=fiable   n=124   n=94   n=87   n=95   n=56   Unique  idenAficaAon  of  commercial  anAbodies  varies  across  discipline  and   impact  factor   In some domains high impact journals have worse reporting, and in others it is the opposite
  • 8. Maybe labs are just disorganized?
  • 10. Meet the Urban Lab Image:  Gourami  Watcher  
  • 11. A+ organization! The  Urban  lab  anAbodies  
  • 12. Of 14 antibodies published in 45 articles, only 38% were identifiable 0%   10%   20%   30%   40%   50%   60%   70%   80%   90%   Commerical  Ab   iden=fiable   Non-­‐commercial   Ab  iden=fiable     Catalog  number   reported   Source  organism   reported   Target  uniquely   iden=fiable   Percent  idenAfiable  
  • 13. What does this tell us?
  • 14. Scientists really do put their data in cardboard boxes.
  • 15. Ø Promote  beJer  reporAng  guidelines  in  journals   Ø Include  reviewing  guidelines   Ø Provide  tools  to  reference  research  resources   with  unique  and  persistent  IDs/URIs     Ø Train  librarians  and  other  data  stewards  to   apply  data  standards   What are we going to do about it?