SlideShare a Scribd company logo
1 of 73
Software Sustainability: 
Better Software, Better Research 
Professor Carole Goble CBE FREng FBCS 
The University of Manchester, UK 
The Software Sustainability Institute UK 
carole.goble@manchester.ac.uk @caroleannegoble 
2nd Annual Netherlands eScience Symposium, 6th Nov 2014, Almere, The Netherlands 
Supported by Project funding from
Carole Goble , 
Better Software, Better Research 
Issue No.05 - Sept.-Oct. (2014 vol.18) 
pp: 4-8 
IEEE Computer Society 
http://www.computer.org/csdl/mags/ic/2014/ 
05/mic2014050004.pdf 
http://doi.ieeecomputersociety.org/10.1109/ 
MIC.2014.88 
http://www.software.ac.uk/resources/publications/better-software-better-research
Acknowledgements 
Neil Chue Hong 
Simon Hettrick 
Steve Crouch 
Aleks Pawlik 
Shoaib Sufi 
and many other SSI and 
other colleagues over the 
years
We’ve written lots of software 
Some sustained… 
Some not…
28/10/2014, 20.38 GMT
The research community 
relies on software 
Do you use research 
software? 
What would happen to your 
research without software 
Survey of researchers from 15 UK Russell Group universities conducted by SSI between August - October 
2014. 406 respondents covering representative range of funders, discipline and seniority.
The Research community 
produces software 
scientific software is important 
for their own research 91% 
developing scientific software is important 
for their own research 84% 
claimed to spend more time developing 
scientific software than they did 10 years ago 53% 
spend at least one fifth of their time 
developing software 38% 
2000 scientists. J.E. Hannay et al., “How Do Scientists Develop and Use Scientific Software?” Proc. 
ICSE Workshop Software Eng. for Computational Science and Eng., 2009, pp. 1–8.
The cost of UK research that 
relies on software 
£840m 
Investment in 2013-2014 financial 
year, an amount that has risen by 
3% on average over last four years 
30% 
Of total research investment has 
been spent on research which 
relies on software over the last four 
financial years 
How much is wasted? 
Analysis of data from 49,650 grant titles and abstracts published on Gateway to Research covering 2010-2014.
Software is the infrastructure 
“Software is the infrastructure, and hardware the 
consumable. 
Maintenance is required when there are hardware, 
operating system and software library changes.” 
Scientific Infrastructure, 
UK HOUSE OF LORDS, 
Select Committee on Science and Technology, 
2nd Report of Session 2013–14
Scientific software can live a long time 
Computational Chemistry - CASTEP 
1990s. From the first implementation of a DFT algorithm to a 
completely new code to community supported software 
• Individual 
• Group 
• Consortium 
• W/ industry 
• Community 
• Active 
Software advances 
< hardware speedup 
http://www.castep.org/
“An article about computational 
science in a scientific publication 
is not the scholarship itself, it is 
merely advertising of the 
scholarship. The actual 
scholarship is the complete 
software development 
environment, [the complete 
data] and the complete set of 
instructions which generated the 
figures.” 
David Donoho, “Wavelab and Reproducible 
Research,” 1995 
datasets 
data collections 
algorithms 
configurations 
tools and apps 
codes 
workflows 
scripts 
code libraries 
services, 
system software 
infrastructure, 
compilers 
hardware 
Morin et al Shining Light into Black Boxes 
Science 13 April 2012: 336(6078) 159-160 
Ince et al The case for open computer programs, 
Nature 482, 2012
Broken software, broken science 
• Geoffrey Chang, Scripps 
Institute 
• Homemade data-analysis 
program inherited from 
another lab 
• Flipped two columns of data, 
inverting the electron-density 
map used to derive protein 
structure 
• Retract 3 Science papers and 2 
papers in other journals 
• One paper cited by 364 
The structures of MsbA (purple) and 
Sav1866 (green) overlap little (left) 
until MsbA is inverted (right). 
Miller A Scientist's Nightmare: Software Problem Leads to Five Retractions Science 22 December 2006: 
vol. 314 no. 5807 1856-1857
recomputation.org 
sciencecodemanifesto.org
Campaigns 
Advice 
Guides 
Courses 
Workshops 
Fellowship 
Software 
Research 
Policy 
Training 
Community 
Consultancy 
Coordination 
Knowledge Transfer
What Software? 
What are we sustaining? 
Software context 
Software 
Sustainability 
Institute 
www.software.ac.uk
Not all software 
is worth sustaining 
Is it key to recomputing results? 
Could it be reused? 
Is it more than a one run shot? 
Is it obsolete? 
Does anybody care about it?
Novel reuse of public sector data 
What do we sustain: 
- Map? 
- Software that creates map so we can create it and other maps? 
http://www.mysociety.org
Do we sustain: 
-Workflow? 
- Software that runs workflow? 
- Software referenced by workflow?
Components & 
Dependencies 
• 35 kinds of annotations 
• 5 Main Workflows 
• 14 Nested Workflows 
• 11 Configuration files 
• 25 Scripts 
• 10 Software dependencies 
• 1 Workflow management 
system 
• 1 Web Service 
• Dataset: 90 galaxies observed 
in 3 bands 
Galaxy 
Luminosity 
Profiling 
José Enrique Ruiz (IAA-CSIC)
Sustaining the software ecosystem 
Tools 
Platforms 
Libraries 
Infrastructure 
Frameworks 
Visible 
Invisible 
Domain specific 
Domain generic 
CRAN
Computational Instrument Entropy 
form or function? preserve or sustain? 
prepare to repair 
Document Instrument 
Reproducibility by Inspection 
Read It, Repair It 
Reproducibility by Invocation 
Run It, Repair It
[Adapted Freire, 2013] 
preservation 
portability 
packaging 
provenance 
gather dependencies 
capture steps 
track & keep results 
tolerance 
versioning 
open 
accessible 
available 
machine actionable 
description 
intelligible 
machine-readable
preservation 
portability 
packaging 
provenance 
gather dependencies 
capture steps 
track & keep results 
tolerance 
Portable Package ReproZip 
Virtual Machines 
Open Source/Store 
versioning host 
ProvStore 
[Adapted Freire, 2013] 
Sci as a Service 
Integrative fws 
service 
Workflows, 
makefiles
Virtual 
Machines 
Open 
Source 
Recompute Redevelop 
Ian Gent http://www.recomputation.org/blog/2014/09/14/why-isnt-the-answer-43/
Research Objects 
A Framework to Bundle and Relatemulti-hosted 
(digital) resources of a scientific experiment or 
investigation using standard mechanisms & uniform 
access protocols. Carriers of Research Context 
Outputs are first class 
citizens to be managed, 
credited and tracked: 
data, software 
http://www.researchobject.org/ 
http://www.slideshare.net/carolegoble/goble-keynote-vivoscits2014
Research Objects – Method + Materials 
Bechhofer 2013, Why linked data is not enough for 
scientists, DOI: 10.1016/j.future.2011.08.004
Software Production 
Unsustainable software 
Research Software written by non-software engineers. 
Software 
Sustainability 
Institute 
www.software.ac.uk
Research software is used by 
90% of researchers and 
accounts for at least 30% of 
RCUK investment… 
http://www.phdcomics.com/comics/archive/phd031214s.gif 
Reusability 
Trust 
Building on prior work
through each SNP of interest 
= 0; $x < scalar @pos; $x++) 
then each downstream SNP of interest 
my $y = $x+1; $y < scalar @pos; $y++) 
Behind every great piece of science… 
if SNPs within our chosen distance (500kb) and both present in the haplotypes file 
if((!($trait[$x] eq $trait[$y])) && (abs($pos[$x] - $pos[$y]) <= 500000) && (exists($legArrayPos{$my $snp1ArrayPos = "”; 
my $snp2ArrayPos = "”; 
my $snp1All = "”; 
my $snp2All = "”; 
#create output file for this SNP pair 
my $filename = "ConditionedResults2/$chr[$x].$pos[$x]-$pos[$y].EHH.GBR.2.txt”; 
print "$filenamen”; 
unless (-e $filename) { 
open(OUT, ">$filename"); 
#####################CHANGE THESE IF NOT FOCUSING ON SECOND SNP######################### 
my $start = $pos[$y]-500000; 
if ($start < 1) { 
$start = 1; 
} 
my $end = $pos[$y]+500000; 
if ($end > $chrLengths{$chr[$x]}) { 
$end = $chrLengths{$chr[$x]}; 
}
Maintenance = Evolution 
all code becomes legacy, 
prepare to repair 
code that is used evolves 
• Testing, infrastructure, 
documentation 
• Corrective: 
– fixing faults 
• Preventative: 
– increasing maintainability 
• Adaptive: 
– adapting to changes in 
environment 
• Perfective: 
– meeting new/different user 
requirements
[Norman Morrison] 
Software entropy 
Time, resource and capacity to tackle
http://xkcd.com/974 
Long-term sustainable codes for wider adoption vs 
Fast-return codes for me. 
Working to working, jam today more jam tomorrow, 
just enough just in time, tailoring 
Time, resource & capacity to tackle sustaining
Rule 1: Don't Reinvent the Wheel 
Rule 2: Code Well 
Rule 3: Be Your Own User 
Rule 4: Be Transparent 
Rule 5: Be Simple 
Rule 6: Don't Be a Perfectionist 
Rule 7: Nurture and Grow 
Your Community 
Rule 8: Promote Your Project 
Rule 9: Find Sponsors 
Rule 10: Science Counts 
doi:10.1371/journal.pcbi.1002802
Software making practices 
Zeeya Merali , Nature 467, 775-777 (2010) | doi:10.1038/467775a 
Computational science: ...Error…why scientific programming does not compute. 
“As a general rule, 
researchers do not 
test or document 
their programs 
rigorously, and they 
rarely release their 
codes, making it 
almost impossible 
to reproduce and 
verify published 
results generated 
by scientific 
software” 
2000 scientists. J.E. Hannay et al., “How Do Scientists Develop and Use Scientific Software?” Proc. ICSE Workshop Software 
Eng. for Computational Science and Eng., 2009, pp. 1–8.
The UK research community 
making software 
56% 
Of UK researchers develop their 
own research software or scripts 
73% 
Of UK researchers have had no 
formal software engineering training 
140,000 
UK researchers rely on their own coding skills 
Survey of researchers from 15 Russell Group universities conducted by SSI between August - October 2014. 
406 respondents covering representative range of funders, discipline and seniority.
Training Unappreciated 
• Basic training for 
kitchen chef: 3-4 years 
• Head chef: 10 years 
• Basic training for s/w 
engineer: 3-4 years 
• Architect: 10 years 
Photo by ZagatBuzz 
Training in S/W Dev in UG Physics: 140 hours 
Training in S/W Dev in UG Geography: 0 hours 
Software 
Sustainability 
Institute
software-carpentry.org 
Software engineering life skills. 
Teach the “95% researchers” basic lab skills for scientific computing: the tools and 
techniques that will help them get more done in less time, and with less pain. 
Volunteer instructors / Bootcamps / Train the trainers / Free lesson materials
A one-man, small-scale 
software project -> multi-developer 
programme, 
transforming research into 
molecular binding. 
Christopher Woods, biochemist from the University of Bristol
Findable, Accessible, 
Interoperable, Reusable 
Software 
Software 
Sustainability 
Institute 
www.software.ac.uk
http://softwarediscoveryindex.org/
Required as condition of publication 
Code Sharing Policy 
170 journals, 2011-2012 
StoddenV, Guo P, Ma Z (2013) Toward Reproducible Computational Research: An 
Empirical Analysis of Data and Code Policy Adoption by Journals. PLoSONE 8(6): 
e67111. doi:10.1371/journal.pone.0067111 
Implied 
No mention 
Required but may not affect decisions 
Explicitly encouraged may be reviewed and/or hosted
The accessibility of software in articles 
87% software could be found 
20% mentions provided version data 
5% mentions provided actual version 
20% inaccessible in any form 
37% source code access 
20% permission to modify 
90 biology journal articles 
Howison and Bullard 2014 The visibility of software in the scientific literature: how do scientists mention software and 
how effective are those mentions? J Assoc fo Info Science and Technology In review
Barriers to Sharing 
Victoria Stodden, AMP 2011 http://www.stodden.net/AMP2011/, 
Special Issue Reproducible Research Computing in Science and Engineering July/August 2012, 14(4) 
Howison and Herbsleb (2013) "Incentives and Integration In Scientific Software Production" CSCW 2013.
http://matt.might.net/articles/crapl/ http://sciencecodemanifesto.org/
I can publish every new monolithic 
thing I make and I can’t publish if I 
reuse someone else’s thing. 
Novelty > sustainability 
Reward for Reusing as well 
as Reusability
Recognition 
Software 
Software developers 
Software 
Sustainability 
Institute 
www.software.ac.uk
The invisibility of software 
esp software that is widely used, 
infrastructural, components or cross-discipline 
[1960s Boeing 747-100 Software Configuration]
Credit is like Love not $$$$$ 
• Where do I get credit for my software? 
• Where is my name on your 5* journal paper? 
• Did you cite my software? 
• Niche credit and credit in private  
• Novel disposable code vs reliable software 
• Subsidized/spare time software development
The visibility of software in articles 
37% formal citation 
78% some form of credit Articles 
87% software could be found 
Journals 24% had citation policy 
Requests 18% offered preferred citation 
90 biology journal articles 
32% who cited ignored it 
Howison and Bullard 2014 The visibility of software in the scientific literature: how do scientists mention software and 
how effective are those mentions? J Assoc fo Info Science and Technology In review
Original Software Publication
http://www.rse.ac.uk/ 
“Skilled software developers and technicians are 
crucial to modern scientific endeavour so 
developing appropriate and appealing career paths 
and incentives and allocating appropriate 
operational budgets, taking these into account at the 
investment decision stage, and then committing to 
deliver them, are essential.” 
Scientific Infrastructure, 
UK HOUSE OF LORDS, 
Select Committee on Science and Technology, 
2nd Report of Session 2013–14
regulation of science 
institutions 
universities 
libraries 
publishers 
public services 
core facilities 
republic of science* 
*Merton’s four norms of scientific behaviour (1942) 
NSF Scientific 
Software 
Innovation 
Institutes (S2I2)
Sustainability models 
Cultivating Communities 
Contributory citizenship 
Better Funding, Better Software…. 
Software 
Sustainability 
Institute 
www.software.ac.uk
Technical 
Preservation 
Emulation 
Migration 
Cultivation 
Hiberation 
Depreciation 
Sustainability 
Approaches 
http://www.software.ac.uk/resources/approaches-software-sustainability
Cultivating Sustainable Communities 
Ladder Model of OSS Adoption 
(adapted from Carbone P., Value Derived from 
Open Source is a Function of Maturity Levels) 
[FLOSS@Sycracuse]
Developer Community: to enable 
Critical Mass 
User Community: to drive 
Resources & Time
Cultivate Contributors – R project 
• Basics: Website, mailing list, code repository, issue 
resolution, meet ups 
• Remove barriers to participation, increase efficiency 
• 1993: First public release; 2 devs 
• 1995: Code open sourced; 3 devs 
• 1996: r-testers list set up 
• 1997: lists split: r-announce, r-help, 
r-devel; public CVS; 11 devs 
• 2000: CRAN split and mirror 
• 2001: BioConductor 
• 2003: Namespaces 
• 2005: I8n, L8n 
• 2007: R-Forge 
• Today: BioConductor (33 core devs), 
R-Forge (532 projects, 1562 devs), 
CRAN (1400+ packages) 
http://cran.r-project.org/doc/html/interface98-paper/paper_2.html
People depend on 
my software?! 
But I’m a researcher… 
Software is free, 
like free puppies. 
s/w engineers are cute too. 
I didn’t intend to be a long 
term service provider.. 
[Scott McNealy, 2005] http://www.zdnet.com/open-source-is-free-like-a-puppy-is-free-says-sun-boss-3039202713/
Shared Responsibility 
Incidental 
Adoption 
intentions 
Familial 
Fundamental 
Autocratic 
Control 
intentions 
Adoptive 
community 
Selfish 
Contribute 
Contribution 
intentions 
Me 
Collaborate / 
Sponsor 
Family & Friends 
Acquaintances 
Strangers, Rivals 
Incidental 
Cooperative
Shared Responsibility 
Incidental 
Adoption 
intentions 
Familial 
Fundamental 
Autocratic 
Control 
intentions 
Adoptive 
community 
Selfish 
Contribute 
Contribution 
intentions 
Me 
Collaborate / 
Sponsor 
Family & Friends 
Acquaintances 
Strangers, Rivals 
Incidental 
Cooperative
Shared Responsibility 
Incidental 
Adoption 
intentions 
Familial 
Fundamental 
Autocratic 
Control 
intentions 
Adoptive 
community 
Selfish 
Contribute 
Contribution 
intentions 
Me 
Collaborate / 
Sponsor 
Family & Friends 
Acquaintances 
Strangers, Rivals 
Incidental 
Cooperative
Scale 
Community Interest 
Software Maturity 
Core funding 
Time 
Mind the gap
Sustainability Matrix/Mosaic/Mayhem 
Grant 
Portfolio 
Foundational 
Grants 
Fees 
Community 
Foundation 
Related 
Grants 
Commercial 
PPP 
Contracts 
Hobby 
Open development 
In Kind 
Students 
Licensing 
Donations 
Co-operative development 
Forked extensions
Sustainability through Subsidy 
The Core becomes Skunkworks
What can we do? 
Software 
Sustainability 
Institute 
www.software.ac.uk
Drivers 
Activities 
Outcomes 
Researcher 
Skills 
Reliability of 
Results 
Motivating 
Software 
Protecting 
Investment 
Improved 
researcher 
capability 
Greater reuse 
of software 
World leading 
science 
Recognition 
for developers 
Increased RSE 
capacity 
Research Software Engineers 
Research 
Software 
Training Community Policy
http://www.force11.org

More Related Content

What's hot

Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
The DataTags System: Sharing Sensitive Data with Confidence
The DataTags System: Sharing Sensitive Data with ConfidenceThe DataTags System: Sharing Sensitive Data with Confidence
The DataTags System: Sharing Sensitive Data with ConfidenceMerce Crosas
 
Trustworthy AI and Open Science
Trustworthy AI and Open ScienceTrustworthy AI and Open Science
Trustworthy AI and Open ScienceBeth Plale
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout Carole Goble
 
Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Carole Goble
 
Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...
Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...
Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...Merce Crosas
 
Introduction to FAIRDOM
Introduction to FAIRDOMIntroduction to FAIRDOM
Introduction to FAIRDOMCarole Goble
 
RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsCarole Goble
 
Research Shared: researchobject.org
Research Shared: researchobject.orgResearch Shared: researchobject.org
Research Shared: researchobject.orgNorman Morrison
 
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)Carole Goble
 
Linked Data for Biopharma
Linked Data for BiopharmaLinked Data for Biopharma
Linked Data for BiopharmaTom Plasterer
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better ResearchCarole Goble
 
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...Tom Plasterer
 
Advances in Scientific Workflow Environments
Advances in Scientific Workflow EnvironmentsAdvances in Scientific Workflow Environments
Advances in Scientific Workflow EnvironmentsCarole Goble
 
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Tom Plasterer
 
The Research Object Initiative: Frameworks and Use Cases
The Research Object Initiative:Frameworks and Use CasesThe Research Object Initiative:Frameworks and Use Cases
The Research Object Initiative: Frameworks and Use CasesCarole Goble
 

What's hot (20)

Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
The DataTags System: Sharing Sensitive Data with Confidence
The DataTags System: Sharing Sensitive Data with ConfidenceThe DataTags System: Sharing Sensitive Data with Confidence
The DataTags System: Sharing Sensitive Data with Confidence
 
Trustworthy AI and Open Science
Trustworthy AI and Open ScienceTrustworthy AI and Open Science
Trustworthy AI and Open Science
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
 
Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Let’s go on a FAIR safari!
Let’s go on a FAIR safari!
 
Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...
Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...
Open Source Tools Facilitating Sharing/Protecting Privacy: Dataverse and Data...
 
Introduction to FAIRDOM
Introduction to FAIRDOMIntroduction to FAIRDOM
Introduction to FAIRDOM
 
RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research Objects
 
Research Shared: researchobject.org
Research Shared: researchobject.orgResearch Shared: researchobject.org
Research Shared: researchobject.org
 
FAIRer Research
FAIRer ResearchFAIRer Research
FAIRer Research
 
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
 
Linked Data for Biopharma
Linked Data for BiopharmaLinked Data for Biopharma
Linked Data for Biopharma
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
 
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...
Harnessing Edge Informatics to Accelerate Collaboration in BioPharma (Bio-IT ...
 
Advances in Scientific Workflow Environments
Advances in Scientific Workflow EnvironmentsAdvances in Scientific Workflow Environments
Advances in Scientific Workflow Environments
 
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at ScaleFull Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
 
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
 
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...NISO/NFAIS Joint Virtual Conference:  Connecting the Library to the Wider Wor...
NISO/NFAIS Joint Virtual Conference: Connecting the Library to the Wider Wor...
 
The Research Object Initiative: Frameworks and Use Cases
The Research Object Initiative:Frameworks and Use CasesThe Research Object Initiative:Frameworks and Use Cases
The Research Object Initiative: Frameworks and Use Cases
 

Similar to Software Sustainability: Better Software Better Science

Better software, better service, better research: The Software Sustainabilit...
Better software, better service, better research: The Software Sustainabilit...Better software, better service, better research: The Software Sustainabilit...
Better software, better service, better research: The Software Sustainabilit...Carole Goble
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Jisc
 
RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsCarole Goble
 
Software Sustainability Institute
Software Sustainability InstituteSoftware Sustainability Institute
Software Sustainability InstituteNeil Chue Hong
 
OntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific SoftwareOntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific Softwaredgarijo
 
Scientific Software: Sustainability, Skills & Sociology
Scientific Software: Sustainability, Skills & SociologyScientific Software: Sustainability, Skills & Sociology
Scientific Software: Sustainability, Skills & SociologyNeil Chue Hong
 
Introduction to Biological Network Analysis and Visualization with Cytoscape ...
Introduction to Biological Network Analysis and Visualization with Cytoscape ...Introduction to Biological Network Analysis and Visualization with Cytoscape ...
Introduction to Biological Network Analysis and Visualization with Cytoscape ...Keiichiro Ono
 
Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402vrij
 
Results may vary: Collaborations Workshop, Oxford 2014
Results may vary: Collaborations Workshop, Oxford 2014Results may vary: Collaborations Workshop, Oxford 2014
Results may vary: Collaborations Workshop, Oxford 2014Carole Goble
 
Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Ola Spjuth
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...Bonnie Hurwitz
 
Sharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reportsSharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reportsGaignard Alban
 
Doing Science Properly In The Digital Age - Rutgers Seminar
Doing Science Properly In The Digital Age - Rutgers SeminarDoing Science Properly In The Digital Age - Rutgers Seminar
Doing Science Properly In The Digital Age - Rutgers SeminarNeil Chue Hong
 
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven ScienceCapturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven Sciencedgarijo
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynoteCarole Goble
 
myExperiment - Defining the Social Virtual Research Environment
myExperiment - Defining the Social Virtual Research EnvironmentmyExperiment - Defining the Social Virtual Research Environment
myExperiment - Defining the Social Virtual Research EnvironmentDavid De Roure
 
Digital Science: Towards the executable paper
Digital Science: Towards the executable paperDigital Science: Towards the executable paper
Digital Science: Towards the executable paperJose Enrique Ruiz
 

Similar to Software Sustainability: Better Software Better Science (20)

2015 genome-center
2015 genome-center2015 genome-center
2015 genome-center
 
Better software, better service, better research: The Software Sustainabilit...
Better software, better service, better research: The Software Sustainabilit...Better software, better service, better research: The Software Sustainabilit...
Better software, better service, better research: The Software Sustainabilit...
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015
 
RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research Objects
 
Software Sustainability Institute
Software Sustainability InstituteSoftware Sustainability Institute
Software Sustainability Institute
 
OntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific SoftwareOntoSoft: A Distributed Semantic Registry for Scientific Software
OntoSoft: A Distributed Semantic Registry for Scientific Software
 
Scientific Software: Sustainability, Skills & Sociology
Scientific Software: Sustainability, Skills & SociologyScientific Software: Sustainability, Skills & Sociology
Scientific Software: Sustainability, Skills & Sociology
 
Introduction to Biological Network Analysis and Visualization with Cytoscape ...
Introduction to Biological Network Analysis and Visualization with Cytoscape ...Introduction to Biological Network Analysis and Visualization with Cytoscape ...
Introduction to Biological Network Analysis and Visualization with Cytoscape ...
 
Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402
 
Results may vary: Collaborations Workshop, Oxford 2014
Results may vary: Collaborations Workshop, Oxford 2014Results may vary: Collaborations Workshop, Oxford 2014
Results may vary: Collaborations Workshop, Oxford 2014
 
Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...
 
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...
 
Sharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reportsSharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reports
 
20171003 lancaster data conversations Chue-Hong
20171003 lancaster data conversations Chue-Hong20171003 lancaster data conversations Chue-Hong
20171003 lancaster data conversations Chue-Hong
 
Reproducible Research and the Cloud
Reproducible Research and the CloudReproducible Research and the Cloud
Reproducible Research and the Cloud
 
Doing Science Properly In The Digital Age - Rutgers Seminar
Doing Science Properly In The Digital Age - Rutgers SeminarDoing Science Properly In The Digital Age - Rutgers Seminar
Doing Science Properly In The Digital Age - Rutgers Seminar
 
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven ScienceCapturing Context in Scientific Experiments: Towards Computer-Driven Science
Capturing Context in Scientific Experiments: Towards Computer-Driven Science
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynote
 
myExperiment - Defining the Social Virtual Research Environment
myExperiment - Defining the Social Virtual Research EnvironmentmyExperiment - Defining the Social Virtual Research Environment
myExperiment - Defining the Social Virtual Research Environment
 
Digital Science: Towards the executable paper
Digital Science: Towards the executable paperDigital Science: Towards the executable paper
Digital Science: Towards the executable paper
 

More from Carole Goble

The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...Carole Goble
 
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...Carole Goble
 
RO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsRO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsCarole Goble
 
Research Software Sustainability takes a Village
Research Software Sustainability takes a VillageResearch Software Sustainability takes a Village
Research Software Sustainability takes a VillageCarole Goble
 
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...Carole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
Open Research: Manchester leading and learning
Open Research: Manchester leading and learningOpen Research: Manchester leading and learning
Open Research: Manchester leading and learningCarole Goble
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...Carole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryCarole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...Carole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows Carole Goble
 
The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects Carole Goble
 
How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)Carole Goble
 
What is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can helpWhat is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can helpCarole Goble
 
ELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR BoardELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR BoardCarole Goble
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpCarole Goble
 
Reflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerCarole Goble
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsCarole Goble
 

More from Carole Goble (20)

The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
 
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
 
RO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital ObjectsRO-Crate: packaging metadata love notes into FAIR Digital Objects
RO-Crate: packaging metadata love notes into FAIR Digital Objects
 
Research Software Sustainability takes a Village
Research Software Sustainability takes a VillageResearch Software Sustainability takes a Village
Research Software Sustainability takes a Village
 
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Open Research: Manchester leading and learning
Open Research: Manchester leading and learningOpen Research: Manchester leading and learning
Open Research: Manchester leading and learning
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow Collaboratory
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects
 
How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)
 
What is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can helpWhat is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can help
 
ELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR BoardELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR Board
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects help
 
Reflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic career
 
Reproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trends
 

Recently uploaded

COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxFarihaAbdulRasheed
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLkantirani197
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...chandars293
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY1301aanya
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)AkefAfaneh2
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxRizalinePalanog2
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...ssuser79fe74
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformationAreesha Ahmad
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedDelhi Call girls
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Joonhun Lee
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Monika Rani
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPirithiRaju
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)Areesha Ahmad
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 

Recently uploaded (20)

COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 

Software Sustainability: Better Software Better Science

  • 1. Software Sustainability: Better Software, Better Research Professor Carole Goble CBE FREng FBCS The University of Manchester, UK The Software Sustainability Institute UK carole.goble@manchester.ac.uk @caroleannegoble 2nd Annual Netherlands eScience Symposium, 6th Nov 2014, Almere, The Netherlands Supported by Project funding from
  • 2. Carole Goble , Better Software, Better Research Issue No.05 - Sept.-Oct. (2014 vol.18) pp: 4-8 IEEE Computer Society http://www.computer.org/csdl/mags/ic/2014/ 05/mic2014050004.pdf http://doi.ieeecomputersociety.org/10.1109/ MIC.2014.88 http://www.software.ac.uk/resources/publications/better-software-better-research
  • 3. Acknowledgements Neil Chue Hong Simon Hettrick Steve Crouch Aleks Pawlik Shoaib Sufi and many other SSI and other colleagues over the years
  • 4. We’ve written lots of software Some sustained… Some not…
  • 6. The research community relies on software Do you use research software? What would happen to your research without software Survey of researchers from 15 UK Russell Group universities conducted by SSI between August - October 2014. 406 respondents covering representative range of funders, discipline and seniority.
  • 7. The Research community produces software scientific software is important for their own research 91% developing scientific software is important for their own research 84% claimed to spend more time developing scientific software than they did 10 years ago 53% spend at least one fifth of their time developing software 38% 2000 scientists. J.E. Hannay et al., “How Do Scientists Develop and Use Scientific Software?” Proc. ICSE Workshop Software Eng. for Computational Science and Eng., 2009, pp. 1–8.
  • 8. The cost of UK research that relies on software £840m Investment in 2013-2014 financial year, an amount that has risen by 3% on average over last four years 30% Of total research investment has been spent on research which relies on software over the last four financial years How much is wasted? Analysis of data from 49,650 grant titles and abstracts published on Gateway to Research covering 2010-2014.
  • 9. Software is the infrastructure “Software is the infrastructure, and hardware the consumable. Maintenance is required when there are hardware, operating system and software library changes.” Scientific Infrastructure, UK HOUSE OF LORDS, Select Committee on Science and Technology, 2nd Report of Session 2013–14
  • 10. Scientific software can live a long time Computational Chemistry - CASTEP 1990s. From the first implementation of a DFT algorithm to a completely new code to community supported software • Individual • Group • Consortium • W/ industry • Community • Active Software advances < hardware speedup http://www.castep.org/
  • 11. “An article about computational science in a scientific publication is not the scholarship itself, it is merely advertising of the scholarship. The actual scholarship is the complete software development environment, [the complete data] and the complete set of instructions which generated the figures.” David Donoho, “Wavelab and Reproducible Research,” 1995 datasets data collections algorithms configurations tools and apps codes workflows scripts code libraries services, system software infrastructure, compilers hardware Morin et al Shining Light into Black Boxes Science 13 April 2012: 336(6078) 159-160 Ince et al The case for open computer programs, Nature 482, 2012
  • 12. Broken software, broken science • Geoffrey Chang, Scripps Institute • Homemade data-analysis program inherited from another lab • Flipped two columns of data, inverting the electron-density map used to derive protein structure • Retract 3 Science papers and 2 papers in other journals • One paper cited by 364 The structures of MsbA (purple) and Sav1866 (green) overlap little (left) until MsbA is inverted (right). Miller A Scientist's Nightmare: Software Problem Leads to Five Retractions Science 22 December 2006: vol. 314 no. 5807 1856-1857
  • 14.
  • 15. Campaigns Advice Guides Courses Workshops Fellowship Software Research Policy Training Community Consultancy Coordination Knowledge Transfer
  • 16. What Software? What are we sustaining? Software context Software Sustainability Institute www.software.ac.uk
  • 17. Not all software is worth sustaining Is it key to recomputing results? Could it be reused? Is it more than a one run shot? Is it obsolete? Does anybody care about it?
  • 18. Novel reuse of public sector data What do we sustain: - Map? - Software that creates map so we can create it and other maps? http://www.mysociety.org
  • 19. Do we sustain: -Workflow? - Software that runs workflow? - Software referenced by workflow?
  • 20. Components & Dependencies • 35 kinds of annotations • 5 Main Workflows • 14 Nested Workflows • 11 Configuration files • 25 Scripts • 10 Software dependencies • 1 Workflow management system • 1 Web Service • Dataset: 90 galaxies observed in 3 bands Galaxy Luminosity Profiling José Enrique Ruiz (IAA-CSIC)
  • 21. Sustaining the software ecosystem Tools Platforms Libraries Infrastructure Frameworks Visible Invisible Domain specific Domain generic CRAN
  • 22. Computational Instrument Entropy form or function? preserve or sustain? prepare to repair Document Instrument Reproducibility by Inspection Read It, Repair It Reproducibility by Invocation Run It, Repair It
  • 23. [Adapted Freire, 2013] preservation portability packaging provenance gather dependencies capture steps track & keep results tolerance versioning open accessible available machine actionable description intelligible machine-readable
  • 24. preservation portability packaging provenance gather dependencies capture steps track & keep results tolerance Portable Package ReproZip Virtual Machines Open Source/Store versioning host ProvStore [Adapted Freire, 2013] Sci as a Service Integrative fws service Workflows, makefiles
  • 25. Virtual Machines Open Source Recompute Redevelop Ian Gent http://www.recomputation.org/blog/2014/09/14/why-isnt-the-answer-43/
  • 26. Research Objects A Framework to Bundle and Relatemulti-hosted (digital) resources of a scientific experiment or investigation using standard mechanisms & uniform access protocols. Carriers of Research Context Outputs are first class citizens to be managed, credited and tracked: data, software http://www.researchobject.org/ http://www.slideshare.net/carolegoble/goble-keynote-vivoscits2014
  • 27. Research Objects – Method + Materials Bechhofer 2013, Why linked data is not enough for scientists, DOI: 10.1016/j.future.2011.08.004
  • 28.
  • 29. Software Production Unsustainable software Research Software written by non-software engineers. Software Sustainability Institute www.software.ac.uk
  • 30. Research software is used by 90% of researchers and accounts for at least 30% of RCUK investment… http://www.phdcomics.com/comics/archive/phd031214s.gif Reusability Trust Building on prior work
  • 31. through each SNP of interest = 0; $x < scalar @pos; $x++) then each downstream SNP of interest my $y = $x+1; $y < scalar @pos; $y++) Behind every great piece of science… if SNPs within our chosen distance (500kb) and both present in the haplotypes file if((!($trait[$x] eq $trait[$y])) && (abs($pos[$x] - $pos[$y]) <= 500000) && (exists($legArrayPos{$my $snp1ArrayPos = "”; my $snp2ArrayPos = "”; my $snp1All = "”; my $snp2All = "”; #create output file for this SNP pair my $filename = "ConditionedResults2/$chr[$x].$pos[$x]-$pos[$y].EHH.GBR.2.txt”; print "$filenamen”; unless (-e $filename) { open(OUT, ">$filename"); #####################CHANGE THESE IF NOT FOCUSING ON SECOND SNP######################### my $start = $pos[$y]-500000; if ($start < 1) { $start = 1; } my $end = $pos[$y]+500000; if ($end > $chrLengths{$chr[$x]}) { $end = $chrLengths{$chr[$x]}; }
  • 32. Maintenance = Evolution all code becomes legacy, prepare to repair code that is used evolves • Testing, infrastructure, documentation • Corrective: – fixing faults • Preventative: – increasing maintainability • Adaptive: – adapting to changes in environment • Perfective: – meeting new/different user requirements
  • 33. [Norman Morrison] Software entropy Time, resource and capacity to tackle
  • 34. http://xkcd.com/974 Long-term sustainable codes for wider adoption vs Fast-return codes for me. Working to working, jam today more jam tomorrow, just enough just in time, tailoring Time, resource & capacity to tackle sustaining
  • 35. Rule 1: Don't Reinvent the Wheel Rule 2: Code Well Rule 3: Be Your Own User Rule 4: Be Transparent Rule 5: Be Simple Rule 6: Don't Be a Perfectionist Rule 7: Nurture and Grow Your Community Rule 8: Promote Your Project Rule 9: Find Sponsors Rule 10: Science Counts doi:10.1371/journal.pcbi.1002802
  • 36. Software making practices Zeeya Merali , Nature 467, 775-777 (2010) | doi:10.1038/467775a Computational science: ...Error…why scientific programming does not compute. “As a general rule, researchers do not test or document their programs rigorously, and they rarely release their codes, making it almost impossible to reproduce and verify published results generated by scientific software” 2000 scientists. J.E. Hannay et al., “How Do Scientists Develop and Use Scientific Software?” Proc. ICSE Workshop Software Eng. for Computational Science and Eng., 2009, pp. 1–8.
  • 37. The UK research community making software 56% Of UK researchers develop their own research software or scripts 73% Of UK researchers have had no formal software engineering training 140,000 UK researchers rely on their own coding skills Survey of researchers from 15 Russell Group universities conducted by SSI between August - October 2014. 406 respondents covering representative range of funders, discipline and seniority.
  • 38. Training Unappreciated • Basic training for kitchen chef: 3-4 years • Head chef: 10 years • Basic training for s/w engineer: 3-4 years • Architect: 10 years Photo by ZagatBuzz Training in S/W Dev in UG Physics: 140 hours Training in S/W Dev in UG Geography: 0 hours Software Sustainability Institute
  • 39. software-carpentry.org Software engineering life skills. Teach the “95% researchers” basic lab skills for scientific computing: the tools and techniques that will help them get more done in less time, and with less pain. Volunteer instructors / Bootcamps / Train the trainers / Free lesson materials
  • 40. A one-man, small-scale software project -> multi-developer programme, transforming research into molecular binding. Christopher Woods, biochemist from the University of Bristol
  • 41. Findable, Accessible, Interoperable, Reusable Software Software Sustainability Institute www.software.ac.uk
  • 42.
  • 44. Required as condition of publication Code Sharing Policy 170 journals, 2011-2012 StoddenV, Guo P, Ma Z (2013) Toward Reproducible Computational Research: An Empirical Analysis of Data and Code Policy Adoption by Journals. PLoSONE 8(6): e67111. doi:10.1371/journal.pone.0067111 Implied No mention Required but may not affect decisions Explicitly encouraged may be reviewed and/or hosted
  • 45. The accessibility of software in articles 87% software could be found 20% mentions provided version data 5% mentions provided actual version 20% inaccessible in any form 37% source code access 20% permission to modify 90 biology journal articles Howison and Bullard 2014 The visibility of software in the scientific literature: how do scientists mention software and how effective are those mentions? J Assoc fo Info Science and Technology In review
  • 46. Barriers to Sharing Victoria Stodden, AMP 2011 http://www.stodden.net/AMP2011/, Special Issue Reproducible Research Computing in Science and Engineering July/August 2012, 14(4) Howison and Herbsleb (2013) "Incentives and Integration In Scientific Software Production" CSCW 2013.
  • 48. I can publish every new monolithic thing I make and I can’t publish if I reuse someone else’s thing. Novelty > sustainability Reward for Reusing as well as Reusability
  • 49. Recognition Software Software developers Software Sustainability Institute www.software.ac.uk
  • 50. The invisibility of software esp software that is widely used, infrastructural, components or cross-discipline [1960s Boeing 747-100 Software Configuration]
  • 51. Credit is like Love not $$$$$ • Where do I get credit for my software? • Where is my name on your 5* journal paper? • Did you cite my software? • Niche credit and credit in private  • Novel disposable code vs reliable software • Subsidized/spare time software development
  • 52. The visibility of software in articles 37% formal citation 78% some form of credit Articles 87% software could be found Journals 24% had citation policy Requests 18% offered preferred citation 90 biology journal articles 32% who cited ignored it Howison and Bullard 2014 The visibility of software in the scientific literature: how do scientists mention software and how effective are those mentions? J Assoc fo Info Science and Technology In review
  • 54. http://www.rse.ac.uk/ “Skilled software developers and technicians are crucial to modern scientific endeavour so developing appropriate and appealing career paths and incentives and allocating appropriate operational budgets, taking these into account at the investment decision stage, and then committing to deliver them, are essential.” Scientific Infrastructure, UK HOUSE OF LORDS, Select Committee on Science and Technology, 2nd Report of Session 2013–14
  • 55. regulation of science institutions universities libraries publishers public services core facilities republic of science* *Merton’s four norms of scientific behaviour (1942) NSF Scientific Software Innovation Institutes (S2I2)
  • 56. Sustainability models Cultivating Communities Contributory citizenship Better Funding, Better Software…. Software Sustainability Institute www.software.ac.uk
  • 57.
  • 58. Technical Preservation Emulation Migration Cultivation Hiberation Depreciation Sustainability Approaches http://www.software.ac.uk/resources/approaches-software-sustainability
  • 59. Cultivating Sustainable Communities Ladder Model of OSS Adoption (adapted from Carbone P., Value Derived from Open Source is a Function of Maturity Levels) [FLOSS@Sycracuse]
  • 60. Developer Community: to enable Critical Mass User Community: to drive Resources & Time
  • 61. Cultivate Contributors – R project • Basics: Website, mailing list, code repository, issue resolution, meet ups • Remove barriers to participation, increase efficiency • 1993: First public release; 2 devs • 1995: Code open sourced; 3 devs • 1996: r-testers list set up • 1997: lists split: r-announce, r-help, r-devel; public CVS; 11 devs • 2000: CRAN split and mirror • 2001: BioConductor • 2003: Namespaces • 2005: I8n, L8n • 2007: R-Forge • Today: BioConductor (33 core devs), R-Forge (532 projects, 1562 devs), CRAN (1400+ packages) http://cran.r-project.org/doc/html/interface98-paper/paper_2.html
  • 62. People depend on my software?! But I’m a researcher… Software is free, like free puppies. s/w engineers are cute too. I didn’t intend to be a long term service provider.. [Scott McNealy, 2005] http://www.zdnet.com/open-source-is-free-like-a-puppy-is-free-says-sun-boss-3039202713/
  • 63. Shared Responsibility Incidental Adoption intentions Familial Fundamental Autocratic Control intentions Adoptive community Selfish Contribute Contribution intentions Me Collaborate / Sponsor Family & Friends Acquaintances Strangers, Rivals Incidental Cooperative
  • 64. Shared Responsibility Incidental Adoption intentions Familial Fundamental Autocratic Control intentions Adoptive community Selfish Contribute Contribution intentions Me Collaborate / Sponsor Family & Friends Acquaintances Strangers, Rivals Incidental Cooperative
  • 65. Shared Responsibility Incidental Adoption intentions Familial Fundamental Autocratic Control intentions Adoptive community Selfish Contribute Contribution intentions Me Collaborate / Sponsor Family & Friends Acquaintances Strangers, Rivals Incidental Cooperative
  • 66. Scale Community Interest Software Maturity Core funding Time Mind the gap
  • 67. Sustainability Matrix/Mosaic/Mayhem Grant Portfolio Foundational Grants Fees Community Foundation Related Grants Commercial PPP Contracts Hobby Open development In Kind Students Licensing Donations Co-operative development Forked extensions
  • 68. Sustainability through Subsidy The Core becomes Skunkworks
  • 69.
  • 70. What can we do? Software Sustainability Institute www.software.ac.uk
  • 71. Drivers Activities Outcomes Researcher Skills Reliability of Results Motivating Software Protecting Investment Improved researcher capability Greater reuse of software World leading science Recognition for developers Increased RSE capacity Research Software Engineers Research Software Training Community Policy
  • 72.