13. the status quo tolerates poor communication of findings
[Pie chart: repeatability of published analyses]
‣ cannot reproduce: 54%
‣ can reproduce partially: 21%
‣ can reproduce in principle: 11%
‣ can reproduce w/ discrepancies: 8%
‣ can reproduce from processed data w/ discrepancies: 6%
Ioannidis J. P. A. et al. Repeatability of published microarray gene expression analyses. Nature Genetics 41, 149-155 (2009) | doi:10.1038/ng.295
14. often what is in principle reproducible is not practically reproducible
208,294,724 datapoints
124 pages of supplemental material
?? lines of unobtainable source code
?? version or architecture of statistical analysis program (R)
enumerable R packages and package dependencies
key R package “ClaNC” no longer available
442 citations
unidentified publication
‣ from journal with 5 year impact factor of 28
‣ article freely available for download
‣ data freely available for download
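The gaps listed on this slide (unknown program version, unknown architecture, unrecorded package versions) are the kind of detail that can be captured automatically when an analysis runs. The following is a minimal sketch of that idea, not the method used by any publication mentioned here. It is written in Python rather than R purely for illustration, and the function names `sha256_of` and `provenance_record` are hypothetical.

```python
import hashlib
import platform
from importlib import metadata


def sha256_of(data: bytes) -> str:
    """Return a hex checksum so the exact input data can be identified later."""
    return hashlib.sha256(data).hexdigest()


def provenance_record(data: bytes, packages: list[str]) -> dict:
    """Collect the facts a reader would need to rebuild the analysis setup.

    `packages` is a list of installed distribution names to look up.
    A package that is not installed is recorded as None instead of
    raising, so the gap is visible in the record.
    """
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None

    return {
        "data_sha256": sha256_of(data),
        "interpreter": platform.python_implementation(),
        "interpreter_version": platform.python_version(),
        "architecture": platform.machine(),
        "packages": versions,
    }
```

Calling `provenance_record(raw_bytes, ["numpy"])` returns a plain dictionary that could be serialized and published next to the data, assuming the named packages are the ones the analysis actually imports.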
15. how are we to move science forward if we cannot understand what was done previously?
18. scientific method
1. define a question
2. gather information and resources (background research)
3. form a hypothesis
4. test hypothesis experimentally
5. analyze experimental data
6. draw conclusions based on data
7. publish results
8. retest (frequently done by other scientists)
21. [Diagram: path from data generation to publication]
experimentally generate data @ the bench or from a clinical cohort
store on local server
analyze on local machine
write a document as pdf
submit to journal
sent to reviewers
accepted & digitally typeset
static pdf representation / static html representation / printed on paper
26. clearScience
re-imagining scientific communication
allow consumption of content at a variety of levels of complexity and abstraction
leverage (open) RESTful APIs
27. clearScience
RESTful APIs
allow users to reassemble an entire analysis environment
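One way to picture "reassembling an analysis environment" over REST is a small manifest that names each component of an analysis, plus a function that maps it to resource URLs a client could fetch. The sketch below assumes a made-up API layout (`/datasets`, `/code`, `/runtimes`, `/packages`) and a placeholder base URL; none of these are clearScience endpoints, and `AnalysisManifest` and `resource_urls` are hypothetical names used only for illustration. No network requests are made.

```python
from dataclasses import dataclass, field
from urllib.parse import quote


@dataclass(frozen=True)
class AnalysisManifest:
    """Hypothetical description of everything one analysis depends on."""
    dataset_id: str
    code_id: str
    runtime: str
    packages: dict = field(default_factory=dict)  # package name -> version


def resource_urls(base_url: str, manifest: AnalysisManifest) -> list:
    """Map a manifest to the REST resources needed to rebuild the analysis.

    The path layout is an assumption made for this sketch.
    """
    base = base_url.rstrip("/")
    urls = [
        f"{base}/datasets/{quote(manifest.dataset_id, safe='')}",
        f"{base}/code/{quote(manifest.code_id, safe='')}",
        f"{base}/runtimes/{quote(manifest.runtime, safe='')}",
    ]
    # Sort packages so the same manifest always yields the same URL list.
    for name, version in sorted(manifest.packages.items()):
        urls.append(
            f"{base}/packages/{quote(name, safe='')}/{quote(version, safe='')}"
        )
    return urls
```

Because every component is addressed by its own URL, a reader could fetch only the data, only the code, or the full set, which matches the goal of consuming content at different levels of complexity.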
34. Acknowledgements
Sage Bionetworks
‣ David Burdick - Rockstar Engineer
‣ Stephen Friend - President and CEO
‣ Erich S. Huang - Director of Cancer Research
‣ Mike Kellen - Director of Technology
External Partners
‣ Myles Axton - Nature Genetics
‣ Phil Bourne - PLoS Computational Biology
‣ Josh Greenberg - Alfred P. Sloan Foundation
‣ Kelly LaMarco - Science Translational Medicine
‣ Ian Mulvaney - eLife Sciences
‣ Eric Schadt - Open Network Biology