SlideShare una empresa de Scribd logo
1 de 25
Sustaining scientific infrastructures:
transitioning from grants to peer production
James Howison
School of Information
University of Texas at Austin
2 September 2016
@jameshowison
(slides on slideshare, see twitter for link)
This material is based upon work supported by the US National Science Foundation under
Grant Nos. SMA- 1064209 (SciSIP), OCI-0943168 (VOSS) and ACI-145348 (CAREER).
Supporting Scientific software
after grants run out
• What happens when the grant ends?
– It’s hard, hard work to keep the code from
inevitable “bit-rot”
@jameshowison
Extension needs up-to-date code
@jameshowison
Just open source it!
(How hard can it be???)
@jameshowison
Open projects are not like grants
1. Governance
2. Collaboration infrastructures
3. Contribution processes
4. Service center vs. Base for community
“open sourcing” means full-on
sociotechnical change
@jameshowison
A literature on transfer to open?
• Copious literature on commercialization,
“Technology Transfer” but not communities
• Happily there are promising literatures
– Studies of open source and online communities
(Resnick, Crowston, Wiggins, Kittur, Kraut, Lampe, Ellison, …)
– Studies of scientific practice
(Palmer, Borgman, Vertesi, Edwards, Olsons, Finholt, Lee/Bietz,
Østerlund, Sawyer, Tapia, Ludders, …)
– Studies of infrastructural work
(Bowker, Jackson, Vertesi, Ribes, …)
@jameshowison
How can scientific software projects successfully
transition from grant support to thriving peer
production communities?
Research Design:
1. Theoretically sampled case studies
1. Longitudinal panel study
@jameshowison
Questions for each case:
How did they succeed or fail in building peer
production?
– What actions were taken to change the project?
– How did routines in the project change as a
result?
– What conditions are relevant to the success of
those actions in causing change?
@jameshowison
Sampling success and failure
• Very hard to have people talk about failures
– Records are often unavailable
– Constant problem in studies of open source
• Panel study offers help here
– Enroll early, before outcome clear
– Build trust, chart course, keep records
– Selected the NSF SI2 funding program
(program officer support)
@jameshowison
Panel Study setup
• SI2 program contributed to over 350 grants
• Three step qualitative content analysis:
1. Did the grant intend to create software
2. What documents (URLs, Workshop reports, or
Publications) are available?
3. Read these, apply coding scheme
@jameshowison
Content analysis categories
Code Description
Project Presents
Separate From Grant
Does the grant support the project (e.g., pre-existing), Or is the
project only there because of the grant
inviteToContribute
contributionProcess
Is there an explicit invitation for outsiders to contribute? Is
there a process for taking contributions?
highlightsPublication e.g., Does the project have a “publications tab”
creditsNonPI
Contributors
Are only the PIs credited “the PIs and their teams” or a wider
group?
associatedRepository
CodeAvailable
license
Is code available? Is it openly hosted? Where? Under what
license?
Collaborative setup
(wiki, bugtracker)
Online meetings?
What set of collaborative tools are they using?
Offline meetings Does the project organize offline meetings, what kinds (user
workshop, hackathon).
Build dataset over time
• Training new graduate student on scheme
– May involve additional students over time
• Intend to code ~5 projects a weekday for two
years
– 300 projects, 250 weekdays in year, 5 projects a
day, 2 coders, assume some missed days!
– ~5-10 observations of each project a year
• Also analyze repositories, where available.
• Adding content analysis codes over time
@jameshowison
Case Method: Sampling
@jameshowison
Use- context diversity
Users
Science project
Generally unreachable area
Domain
Platform
General purpose
Unlikely region
Individual
low high
Few
Many
ytENZO
Eclipse
PTP
OODT/
Airavata
Case Method: analysis
• Identify work episodes
– Ground interviews in specific production work.
– Source-code repositories help immensely
– “Digital trace ethnography” (Ribes and Geiger)
• Identify socio-technical changes that divide
project into stages
– Investigate actions that precipitated changes
• Project narratives with illustrative vignettes
@jameshowison
ENZO
@jameshowison
ENZO pilot study
Data:
• 5 interviews, so far (thanks Eunyoung Moon!)
• Publications, websites, workshop websites,
source code repositories
• Analysis:
– Creation of timeline
– Identification of episodes and 4 project phases
(with their precipitating events)
@jameshowison
@jameshowison
• No central base to which changes are coming and going
• Copy and pasting features across personal branches
• Single lab
@jameshowison
• ENZO lab reforms as “Service Center” (grant)
• Mainline branch internally, releases externally
• Little expectation of contributions coming back in
• “Friendly user” labs internally functioning like “early days”
The “Week of Code”
• Director of external lab (former post-doc) has
new job at Stanford (with startup funds!)
• Learns of various versions through
conversations at conferences and reviewing(!)
• Focus is on collaboration infrastructure, not
governance.
• Begin with the code of those not present
@jameshowison
@jameshowison
• Central branch to which both core and outsiders contribute
• Development continues separately in external labs
• Called “Wild West” by participants, autonomy concerns.
@jameshowison
• Introduction of “code revision” (pull requests)
• External lab members on similar footing to Core members
• Review helps members not “step on each other’s work”
Change
• What hasn’t changed:
– Motivations (code is side-effect of scientific
inquiry, papers first, code second), no commercial
value
• Challenges to change
– Leadership’s emotional connection, difficulty of
passing on leadership.
– Giving up autonomy (being “blocked” in one’s
work)
@jameshowison
What worked
• Always: collaboration technology before
governance (contra “Collaboration Readiness”
(Olson et al.) TORSC?).
• Social proof: visible action in public
• Inspiration from open source
• Working alongside, rather than with.
Superposition rather than Teamwork.
@jameshowison
Additional CAREER elements
• Teaching course on online communities
– Incorporating more on managing software
projects in science
• Contributing modules to Software Carpentry
– 2-3 day workshops with graduate students
– Enough command line, python, SQL to get them
working
– I’m going to contribute module on contributing to
and running software projects in science
@jameshowison
Conclusions
• Software engineering, but in a very specific
context
• Organization of software work but different to
design and testing of methodologies
• Can also link in resource and motivation
situations
• Learning from open source, building
alternative paths alongside commercialization.
@jameshowison

Más contenido relacionado

La actualidad más candente

Strategies-Developing-Deploying-FOSS
Strategies-Developing-Deploying-FOSSStrategies-Developing-Deploying-FOSS
Strategies-Developing-Deploying-FOSSwebuploader
 
Towards Mining Software Repositories Research that Matters
Towards Mining Software Repositories Research that MattersTowards Mining Software Repositories Research that Matters
Towards Mining Software Repositories Research that MattersTao Xie
 
The Scientific and Technical Foundation for Altmetrics in the United States
The Scientific and Technical Foundation for Altmetrics in the United StatesThe Scientific and Technical Foundation for Altmetrics in the United States
The Scientific and Technical Foundation for Altmetrics in the United StatesWilliam Gunn
 
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...William Gunn
 
AAAS 2014: How the Web Changes Collaboration
AAAS 2014: How the Web Changes CollaborationAAAS 2014: How the Web Changes Collaboration
AAAS 2014: How the Web Changes CollaborationWilliam Gunn
 
Frameworks for studies of information behaviour and use
Frameworks for studies of information behaviour and useFrameworks for studies of information behaviour and use
Frameworks for studies of information behaviour and useDiane Rasmussen Pennington
 
CHASE 2014 - The hard of newcomers to OSS projects
CHASE 2014 - The hard of newcomers to OSS projectsCHASE 2014 - The hard of newcomers to OSS projects
CHASE 2014 - The hard of newcomers to OSS projectsIgor Steinmacher
 
Social metrics for Research: Quantity and Quality
Social metrics for Research: Quantity and QualitySocial metrics for Research: Quantity and Quality
Social metrics for Research: Quantity and QualityWilliam Gunn
 
User centred design and students' library search behaviours
User centred design and students' library search behavioursUser centred design and students' library search behaviours
User centred design and students' library search behavioursVernon Fowler
 
The Role of Open Access & Social Media in Knowledge Mobilization and Discovery
The Role of Open Access & Social Media in Knowledge Mobilization and DiscoveryThe Role of Open Access & Social Media in Knowledge Mobilization and Discovery
The Role of Open Access & Social Media in Knowledge Mobilization and DiscoveryToronto Metropolitan University
 
From Theory to Practice: Can Opennesss Improve the Quality of OER Research?
From Theory to Practice: Can Opennesss Improve the Quality of OER Research? From Theory to Practice: Can Opennesss Improve the Quality of OER Research?
From Theory to Practice: Can Opennesss Improve the Quality of OER Research? Beck Pitt
 
Ifla203 archambault
Ifla203 archambaultIfla203 archambault
Ifla203 archambaultsusangar
 
AGU2012: Creating a Collaborative Network for Scientists
AGU2012: Creating a Collaborative Network for ScientistsAGU2012: Creating a Collaborative Network for Scientists
AGU2012: Creating a Collaborative Network for ScientistsWilliam Gunn
 
ICSTI TACC 2014: How Mendeley Illuminates a Broader Definition of Impact
ICSTI TACC 2014: How Mendeley Illuminates a Broader Definition of ImpactICSTI TACC 2014: How Mendeley Illuminates a Broader Definition of Impact
ICSTI TACC 2014: How Mendeley Illuminates a Broader Definition of ImpactWilliam Gunn
 
Improving Library Resource Discovery
Improving Library Resource DiscoveryImproving Library Resource Discovery
Improving Library Resource DiscoveryDanya Leebaw
 
Developing a research Library position statement on Text and Data Mining in t...
Developing a research Library position statement on Text and Data Mining in t...Developing a research Library position statement on Text and Data Mining in t...
Developing a research Library position statement on Text and Data Mining in t...Danny Kingsley
 
From Bioinformatics Scientist to Entrepreneur
From Bioinformatics Scientist to EntrepreneurFrom Bioinformatics Scientist to Entrepreneur
From Bioinformatics Scientist to EntrepreneurFiona Nielsen
 

La actualidad más candente (20)

Strategies-Developing-Deploying-FOSS
Strategies-Developing-Deploying-FOSSStrategies-Developing-Deploying-FOSS
Strategies-Developing-Deploying-FOSS
 
Towards Mining Software Repositories Research that Matters
Towards Mining Software Repositories Research that MattersTowards Mining Software Repositories Research that Matters
Towards Mining Software Repositories Research that Matters
 
The Scientific and Technical Foundation for Altmetrics in the United States
The Scientific and Technical Foundation for Altmetrics in the United StatesThe Scientific and Technical Foundation for Altmetrics in the United States
The Scientific and Technical Foundation for Altmetrics in the United States
 
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...
LISA VII: The Scientific and Technical Foundation for Altmetrics in the Unite...
 
Building an Online Profile Using Social Networking and Amplification Tools fo...
Building an Online Profile Using Social Networking and Amplification Tools fo...Building an Online Profile Using Social Networking and Amplification Tools fo...
Building an Online Profile Using Social Networking and Amplification Tools fo...
 
AAAS 2014: How the Web Changes Collaboration
AAAS 2014: How the Web Changes CollaborationAAAS 2014: How the Web Changes Collaboration
AAAS 2014: How the Web Changes Collaboration
 
Frameworks for studies of information behaviour and use
Frameworks for studies of information behaviour and useFrameworks for studies of information behaviour and use
Frameworks for studies of information behaviour and use
 
CHASE 2014 - The hard of newcomers to OSS projects
CHASE 2014 - The hard of newcomers to OSS projectsCHASE 2014 - The hard of newcomers to OSS projects
CHASE 2014 - The hard of newcomers to OSS projects
 
Social metrics for Research: Quantity and Quality
Social metrics for Research: Quantity and QualitySocial metrics for Research: Quantity and Quality
Social metrics for Research: Quantity and Quality
 
User centred design and students' library search behaviours
User centred design and students' library search behavioursUser centred design and students' library search behaviours
User centred design and students' library search behaviours
 
Give me kudos for taking responsibility for self-marketing my scientific publ...
Give me kudos for taking responsibility for self-marketing my scientific publ...Give me kudos for taking responsibility for self-marketing my scientific publ...
Give me kudos for taking responsibility for self-marketing my scientific publ...
 
The Role of Open Access & Social Media in Knowledge Mobilization and Discovery
The Role of Open Access & Social Media in Knowledge Mobilization and DiscoveryThe Role of Open Access & Social Media in Knowledge Mobilization and Discovery
The Role of Open Access & Social Media in Knowledge Mobilization and Discovery
 
From Theory to Practice: Can Opennesss Improve the Quality of OER Research?
From Theory to Practice: Can Opennesss Improve the Quality of OER Research? From Theory to Practice: Can Opennesss Improve the Quality of OER Research?
From Theory to Practice: Can Opennesss Improve the Quality of OER Research?
 
Mike Thelwall: Introduction to Webometrics
Mike Thelwall: Introduction to WebometricsMike Thelwall: Introduction to Webometrics
Mike Thelwall: Introduction to Webometrics
 
Ifla203 archambault
Ifla203 archambaultIfla203 archambault
Ifla203 archambault
 
AGU2012: Creating a Collaborative Network for Scientists
AGU2012: Creating a Collaborative Network for ScientistsAGU2012: Creating a Collaborative Network for Scientists
AGU2012: Creating a Collaborative Network for Scientists
 
ICSTI TACC 2014: How Mendeley Illuminates a Broader Definition of Impact
ICSTI TACC 2014: How Mendeley Illuminates a Broader Definition of ImpactICSTI TACC 2014: How Mendeley Illuminates a Broader Definition of Impact
ICSTI TACC 2014: How Mendeley Illuminates a Broader Definition of Impact
 
Improving Library Resource Discovery
Improving Library Resource DiscoveryImproving Library Resource Discovery
Improving Library Resource Discovery
 
Developing a research Library position statement on Text and Data Mining in t...
Developing a research Library position statement on Text and Data Mining in t...Developing a research Library position statement on Text and Data Mining in t...
Developing a research Library position statement on Text and Data Mining in t...
 
From Bioinformatics Scientist to Entrepreneur
From Bioinformatics Scientist to EntrepreneurFrom Bioinformatics Scientist to Entrepreneur
From Bioinformatics Scientist to Entrepreneur
 

Destacado

Open Superposition and lessons for scientific software development
Open Superposition and lessons for scientific software developmentOpen Superposition and lessons for scientific software development
Open Superposition and lessons for scientific software developmentJames Howison
 
नाम नहीं
नाम नहींनाम नहीं
नाम नहींBalaji Sharma
 
Acta de acuerdo unatsbo
Acta de acuerdo unatsboActa de acuerdo unatsbo
Acta de acuerdo unatsboErbol Digital
 
Areas protegidas de honduras
Areas protegidas de hondurasAreas protegidas de honduras
Areas protegidas de hondurasXavi Maravilla
 

Destacado (6)

Open Superposition and lessons for scientific software development
Open Superposition and lessons for scientific software developmentOpen Superposition and lessons for scientific software development
Open Superposition and lessons for scientific software development
 
Light RU
Light RULight RU
Light RU
 
नाम नहीं
नाम नहींनाम नहीं
नाम नहीं
 
Acta de acuerdo unatsbo
Acta de acuerdo unatsboActa de acuerdo unatsbo
Acta de acuerdo unatsbo
 
Adhd handbook11
Adhd handbook11Adhd handbook11
Adhd handbook11
 
Areas protegidas de honduras
Areas protegidas de hondurasAreas protegidas de honduras
Areas protegidas de honduras
 

Similar a Scientific Software - what happens after the grant?

SciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystemsSciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystemsJames Howison
 
Software Professionals (RSEs) at NCSA
Software Professionals (RSEs) at NCSASoftware Professionals (RSEs) at NCSA
Software Professionals (RSEs) at NCSADaniel S. Katz
 
Big Data Curricula at the UW eScience Institute, JSM 2013
Big Data Curricula at the UW eScience Institute, JSM 2013Big Data Curricula at the UW eScience Institute, JSM 2013
Big Data Curricula at the UW eScience Institute, JSM 2013University of Washington
 
Scientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesScientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesDaniel S. Katz
 
Sustainability in Scientific Software: Ecosystem complexity and Software Vis...
Sustainability in Scientific Software:Ecosystem complexityandSoftware Vis...Sustainability in Scientific Software:Ecosystem complexityandSoftware Vis...
Sustainability in Scientific Software: Ecosystem complexity and Software Vis...James Howison
 
ISEC'18 Tutorial: Research Methodology on Pursuing Impact-Driven Research
ISEC'18 Tutorial: Research Methodology on Pursuing Impact-Driven ResearchISEC'18 Tutorial: Research Methodology on Pursuing Impact-Driven Research
ISEC'18 Tutorial: Research Methodology on Pursuing Impact-Driven ResearchTao Xie
 
What is eScience, and where does it go from here?
What is eScience, and where does it go from here?What is eScience, and where does it go from here?
What is eScience, and where does it go from here?Daniel S. Katz
 
Research Software Engineering Inside and Outside the Library
Research Software Engineering Inside and Outside the LibraryResearch Software Engineering Inside and Outside the Library
Research Software Engineering Inside and Outside the LibraryPatrick McCann
 
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Digital Methods Initiative
 
Funding Software in Academia
Funding Software in AcademiaFunding Software in Academia
Funding Software in AcademiaDaniel S. Katz
 
OpenSocial in Practice - presented at VIVO14
OpenSocial in Practice - presented at VIVO14OpenSocial in Practice - presented at VIVO14
OpenSocial in Practice - presented at VIVO14Brian Turner
 
Leveraging Exhibitions as a Needs-Based Skill Development Program in Librarie...
Leveraging Exhibitions as a Needs-Based Skill Development Program in Librarie...Leveraging Exhibitions as a Needs-Based Skill Development Program in Librarie...
Leveraging Exhibitions as a Needs-Based Skill Development Program in Librarie...Sara Sterkenburg
 
Software Citation and a Proposal (NSF workshop at Havard Medical School)
Software Citation and a Proposal (NSF workshop at Havard Medical School)Software Citation and a Proposal (NSF workshop at Havard Medical School)
Software Citation and a Proposal (NSF workshop at Havard Medical School)James Howison
 
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...EDINA, University of Edinburgh
 
Pathways to Technology Transfer and Adoption: Achievements and Challenges
Pathways to Technology Transfer and Adoption: Achievements and ChallengesPathways to Technology Transfer and Adoption: Achievements and Challenges
Pathways to Technology Transfer and Adoption: Achievements and ChallengesTao Xie
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...SEAD
 

Similar a Scientific Software - what happens after the grant? (20)

SciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystemsSciSoftDays Talk - Howison: Spreading the work in software ecosystems
SciSoftDays Talk - Howison: Spreading the work in software ecosystems
 
Howison si2 keynote
Howison si2 keynoteHowison si2 keynote
Howison si2 keynote
 
Software Professionals (RSEs) at NCSA
Software Professionals (RSEs) at NCSASoftware Professionals (RSEs) at NCSA
Software Professionals (RSEs) at NCSA
 
Big Data Curricula at the UW eScience Institute, JSM 2013
Big Data Curricula at the UW eScience Institute, JSM 2013Big Data Curricula at the UW eScience Institute, JSM 2013
Big Data Curricula at the UW eScience Institute, JSM 2013
 
Scientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesScientific Software Challenges and Community Responses
Scientific Software Challenges and Community Responses
 
Kno.e.sis Review: late 2012 to mid 2013
Kno.e.sis Review: late 2012 to mid 2013Kno.e.sis Review: late 2012 to mid 2013
Kno.e.sis Review: late 2012 to mid 2013
 
Sgci at-two-years-7-26-18
Sgci at-two-years-7-26-18Sgci at-two-years-7-26-18
Sgci at-two-years-7-26-18
 
Sustainability in Scientific Software: Ecosystem complexity and Software Vis...
Sustainability in Scientific Software:Ecosystem complexityandSoftware Vis...Sustainability in Scientific Software:Ecosystem complexityandSoftware Vis...
Sustainability in Scientific Software: Ecosystem complexity and Software Vis...
 
ISEC'18 Tutorial: Research Methodology on Pursuing Impact-Driven Research
ISEC'18 Tutorial: Research Methodology on Pursuing Impact-Driven ResearchISEC'18 Tutorial: Research Methodology on Pursuing Impact-Driven Research
ISEC'18 Tutorial: Research Methodology on Pursuing Impact-Driven Research
 
What is eScience, and where does it go from here?
What is eScience, and where does it go from here?What is eScience, and where does it go from here?
What is eScience, and where does it go from here?
 
Research Software Engineering Inside and Outside the Library
Research Software Engineering Inside and Outside the LibraryResearch Software Engineering Inside and Outside the Library
Research Software Engineering Inside and Outside the Library
 
Sgci nsf-si2-2-21-17
Sgci nsf-si2-2-21-17Sgci nsf-si2-2-21-17
Sgci nsf-si2-2-21-17
 
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
 
Funding Software in Academia
Funding Software in AcademiaFunding Software in Academia
Funding Software in Academia
 
OpenSocial in Practice - presented at VIVO14
OpenSocial in Practice - presented at VIVO14OpenSocial in Practice - presented at VIVO14
OpenSocial in Practice - presented at VIVO14
 
Leveraging Exhibitions as a Needs-Based Skill Development Program in Librarie...
Leveraging Exhibitions as a Needs-Based Skill Development Program in Librarie...Leveraging Exhibitions as a Needs-Based Skill Development Program in Librarie...
Leveraging Exhibitions as a Needs-Based Skill Development Program in Librarie...
 
Software Citation and a Proposal (NSF workshop at Havard Medical School)
Software Citation and a Proposal (NSF workshop at Havard Medical School)Software Citation and a Proposal (NSF workshop at Havard Medical School)
Software Citation and a Proposal (NSF workshop at Havard Medical School)
 
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
 
Pathways to Technology Transfer and Adoption: Achievements and Challenges
Pathways to Technology Transfer and Adoption: Achievements and ChallengesPathways to Technology Transfer and Adoption: Achievements and Challenges
Pathways to Technology Transfer and Adoption: Achievements and Challenges
 
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
Changing the Curation Equation: A Data Lifecycle Approach to Lowering Costs a...
 

Más de James Howison

How to Ask questions people want to answer
How to Ask questions people want to answerHow to Ask questions people want to answer
How to Ask questions people want to answerJames Howison
 
Heartbeat: measuring installed base by analyzing downloads and Scientific S...
Heartbeat: measuring installed base by analyzing downloads and Scientific S...Heartbeat: measuring installed base by analyzing downloads and Scientific S...
Heartbeat: measuring installed base by analyzing downloads and Scientific S...James Howison
 
Howison CAREER journey - NSF CISE CAREER Workshop 2016
Howison CAREER journey - NSF CISE CAREER Workshop 2016Howison CAREER journey - NSF CISE CAREER Workshop 2016
Howison CAREER journey - NSF CISE CAREER Workshop 2016James Howison
 
Studying archives of online behavior
Studying archives of online behaviorStudying archives of online behavior
Studying archives of online behaviorJames Howison
 
Howison rutgers-open superposition
Howison rutgers-open superpositionHowison rutgers-open superposition
Howison rutgers-open superpositionJames Howison
 
Software in the scientific literature: Problems with seeing, finding, and usi...
Software in the scientific literature: Problems with seeing, finding, and usi...Software in the scientific literature: Problems with seeing, finding, and usi...
Software in the scientific literature: Problems with seeing, finding, and usi...James Howison
 
Sharing the spoils: the trouble with reputation as a motivation and reward fo...
Sharing the spoils: the trouble with reputation as a motivation and reward fo...Sharing the spoils: the trouble with reputation as a motivation and reward fo...
Sharing the spoils: the trouble with reputation as a motivation and reward fo...James Howison
 
Scientific software sustainability and ecosystem complexity
Scientific software sustainability and ecosystem complexityScientific software sustainability and ecosystem complexity
Scientific software sustainability and ecosystem complexityJames Howison
 
Five Models for Interaction Between Science Enterprises and Organization Scie...
Five Models for Interaction Between Science Enterprises and Organization Scie...Five Models for Interaction Between Science Enterprises and Organization Scie...
Five Models for Interaction Between Science Enterprises and Organization Scie...James Howison
 
Alone Together: Patterns of collaboration in free and open source software de...
Alone Together: Patterns of collaboration in free and open source software de...Alone Together: Patterns of collaboration in free and open source software de...
Alone Together: Patterns of collaboration in free and open source software de...James Howison
 

Más de James Howison (11)

How to Ask questions people want to answer
How to Ask questions people want to answerHow to Ask questions people want to answer
How to Ask questions people want to answer
 
Heartbeat: measuring installed base by analyzing downloads and Scientific S...
Heartbeat: measuring installed base by analyzing downloads and Scientific S...Heartbeat: measuring installed base by analyzing downloads and Scientific S...
Heartbeat: measuring installed base by analyzing downloads and Scientific S...
 
Howison CAREER journey - NSF CISE CAREER Workshop 2016
Howison CAREER journey - NSF CISE CAREER Workshop 2016Howison CAREER journey - NSF CISE CAREER Workshop 2016
Howison CAREER journey - NSF CISE CAREER Workshop 2016
 
Studying archives of online behavior
Studying archives of online behaviorStudying archives of online behavior
Studying archives of online behavior
 
Howison rutgers-open superposition
Howison rutgers-open superpositionHowison rutgers-open superposition
Howison rutgers-open superposition
 
Howison traces
Howison tracesHowison traces
Howison traces
 
Software in the scientific literature: Problems with seeing, finding, and usi...
Software in the scientific literature: Problems with seeing, finding, and usi...Software in the scientific literature: Problems with seeing, finding, and usi...
Software in the scientific literature: Problems with seeing, finding, and usi...
 
Sharing the spoils: the trouble with reputation as a motivation and reward fo...
Sharing the spoils: the trouble with reputation as a motivation and reward fo...Sharing the spoils: the trouble with reputation as a motivation and reward fo...
Sharing the spoils: the trouble with reputation as a motivation and reward fo...
 
Scientific software sustainability and ecosystem complexity
Scientific software sustainability and ecosystem complexityScientific software sustainability and ecosystem complexity
Scientific software sustainability and ecosystem complexity
 
Five Models for Interaction Between Science Enterprises and Organization Scie...
Five Models for Interaction Between Science Enterprises and Organization Scie...Five Models for Interaction Between Science Enterprises and Organization Scie...
Five Models for Interaction Between Science Enterprises and Organization Scie...
 
Alone Together: Patterns of collaboration in free and open source software de...
Alone Together: Patterns of collaboration in free and open source software de...Alone Together: Patterns of collaboration in free and open source software de...
Alone Together: Patterns of collaboration in free and open source software de...
 

Último

Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000Sapana Sha
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPirithiRaju
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 

Último (20)

Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 

Scientific Software - what happens after the grant?

  • 1. Sustaining scientific infrastructures: transitioning from grants to peer production James Howison School of Information University of Texas at Austin 2 September 2016 @jameshowison (slides on slideshare, see twitter for link) This material is based upon work supported by the US National Science Foundation under Grant Nos. SMA- 1064209 (SciSIP), OCI-0943168 (VOSS) and ACI-145348 (CAREER).
  • 2. Supporting Scientific software after grants run out • What happens when the grant ends? – It’s hard, hard work to keep the code from inevitable “bit-rot” @jameshowison
  • 3. Extension needs up-to-date code @jameshowison
  • 4. Just open source it! (How hard can it be???) @jameshowison
  • 5. Open projects are not like grants 1. Governance 2. Collaboration infrastructures 3. Contribution processes 4. Service center vs. Base for community “open sourcing” means full-on sociotechnical change @jameshowison
  • 6. A literature on transfer to open? • Copious literature on commercialization, “Technology Transfer” but not communities • Happily there are promising literatures – Studies of open source and online communities (Resnick, Crowston, Wiggins, Kittur, Kraut, Lampe, Ellison, …) – Studies of scientific practice (Palmer, Borgman, Vertesi, Edwards, Olsons, Finholt, Lee/Bietz, Østerlund, Sawyer, Tapia, Ludders, …) – Studies of infrastructural work (Bowker, Jackson, Vertesi, Ribes, …) @jameshowison
  • 7. How can scientific software projects successfully transition from grant support to thriving peer production communities? Research Design: 1. Theoretically sampled case studies 1. Longitudinal panel study @jameshowison
  • 8. Questions for each case: How did they succeed or fail in building peer production? – What actions were taken to change the project? – How did routines in the project change as a result? – What conditions are relevant to the success of those actions in causing change? @jameshowison
  • 9. Sampling success and failure • Very hard to have people talk about failures – Records are often unavailable – Constant problem in studies of open source • Panel study offers help here – Enroll early, before outcome clear – Build trust, chart course, keep records – Selected the NSF SI2 funding program (program officer support) @jameshowison
  • 10. Panel Study setup • SI2 program contributed to over 350 grants • Three step qualitative content analysis: 1. Did the grant intend to create software 2. What documents (URLs, Workshop reports, or Publications) are available? 3. Read these, apply coding scheme @jameshowison
  • 11. Content analysis categories Code Description Project Presents Separate From Grant Does the grant support the project (e.g., pre-existing), Or is the project only there because of the grant inviteToContribute contributionProcess Is there an explicit invitation for outsiders to contribute? Is there a process for taking contributions? highlightsPublication e.g., Does the project have a “publications tab” creditsNonPI Contributors Are only the PIs credited “the PIs and their teams” or a wider group? associatedRepository CodeAvailable license Is code available? Is it openly hosted? Where? Under what license? Collaborative setup (wiki, bugtracker) Online meetings? What set of collaborative tools are they using? Offline meetings Does the project organize offline meetings, what kinds (user workshop, hackathon).
  • 12. Build dataset over time • Training new graduate student on scheme – May involve additional students over time • Intend to code ~5 projects a weekday for two years – 300 projects, 250 weekdays in year, 5 projects a day, 2 coders, assume some missed days! – ~5-10 observations of each project a year • Also analyze repositories, where available. • Adding content analysis codes over time @jameshowison
  • 13. Case Method: Sampling @jameshowison Use- context diversity Users Science project Generally unreachable area Domain Platform General purpose Unlikely region Individual low high Few Many ytENZO Eclipse PTP OODT/ Airavata
  • 14. Case Method: analysis • Identify work episodes – Ground interviews in specific production work. – Source-code repositories help immensely – “Digital trace ethnography” (Ribes and Geiger) • Identify socio-technical changes that divide project into stages – Investigate actions that precipitated changes • Project narratives with illustrative vignettes @jameshowison
  • 16. ENZO pilot study Data: • 5 interviews, so far (thanks Eunyoung Moon!) • Publications, websites, workshop websites, source code repositories • Analysis: – Creation of timeline – Identification of episodes and 4 project phases (with their precipitating events) @jameshowison
  • 17. @jameshowison • No central base to which changes are coming and going • Copy and pasting features across personal branches • Single lab
  • 18. @jameshowison • ENZO lab reforms as “Service Center” (grant) • Mainline branch internally, releases externally • Little expectation of contributions coming back in • “Friendly user” labs internally functioning like “early days”
  • 19. The “Week of Code” • Director of external lab (former post-doc) has new job at Stanford (with startup funds!) • Learns of various versions through conversations at conferences and reviewing(!) • Focus is on collaboration infrastructure, not governance. • Begin with the code of those not present @jameshowison
  • 20. @jameshowison • Central branch to which both core and outsiders contribute • Development continues separately in external labs • Called “Wild West” by participants, autonomy concerns.
  • 21. @jameshowison • Introduction of “code revision” (pull requests) • External lab members on similar footing to Core members • Review helps members not “step on each other’s work”
  • 22. Change • What hasn’t changed: – Motivations (code is side-effect of scientific inquiry, papers first, code second), no commercial value • Challenges to change – Leadership’s emotional connection, difficulty of passing on leadership. – Giving up autonomy (being “blocked” in one’s work) @jameshowison
  • 23. What worked • Always: collaboration technology before governance (contra “Collaboration Readiness” (Olson et al.) TORSC?). • Social proof: visible action in public • Inspiration from open source • Working alongside, rather than with. Superposition rather than Teamwork. @jameshowison
  • 24. Additional CAREER elements • Teaching course on online communities – Incorporating more on managing software projects in science • Contributing modules to Software Carpentry – 2-3 day workshops with graduate students – Enough command line, python, SQL to get them working – I’m going to contribute module on contributing to and running software projects in science @jameshowison
  • 25. Conclusions • Software engineering, but in a very specific context • Organization of software work but different to design and testing of methodologies • Can also link in resource and motivation situations • Learning from open source, building alternative paths alongside commercialization. @jameshowison

Notas del editor

  1. Software is important, but many other examples as well.
  2. “Peer production takeaway, change is substantial, not natural, and not easy.
  3. Goal is to adapt and extend these literatures, building theory and actionable knowledge for practitioners
  4. So, as an experiment to start playing around algorithm involved in AMR … I started creating this, and I also wanted to learn C++. I started creating this code. And that was eventually grown up as ENZO. PI stitched together support, scrounging from grants, startup funds and the somewhat “fictional” 20 hours a week of graduate students