SlideShare a Scribd company logo
1 of 18
What is eScience, and where does it go
from here?
eScience 2019, 25 September 2019
Daniel S. Katz
(d.katz@ieee.org, http://danielskatz.org, @danielskatz)
Assistant Director for Scientific
Software & Applications, NCSA
Research Associate Professor,
CS, ECE, iSchool
e-Science in 2000
• “In November 2000 the Director General of UK Research Councils, Dr John Taylor,
announced £98M funding for a new UK e-Science programme.
• “In the future, e-Science will refer to the large scale science that will increasingly be carried
out through distributed global collaborations enabled by the Internet. Typically, a feature of
such collaborative scientific enterprises is that they will require access to very large data
collections, very large scale computing resources and high performance visualisation back
to the individual user scientists.
• “The World Wide Web gave us access to information on Web pages written in html
anywhere on the Internet. A much more powerful infrastructure is needed to support e-
Science. Besides information stored in Web pages, scientists will need easy access to
expensive remote facilities, to computing resources - either as dedicated Teraflop computers
or cheap collections of PCs - and to information stored in dedicated databases.
• “The Grid is an architecture proposed to bring all these issues together and make a reality of
such a vision for e-Science.”
https://web.archive.org/web/20040818222850/http://www.rcuk.ac.uk/escience_old/firstphase.shtml
“e-Science is about global collaboration in key
areas of science and the next generation of
infrastructure that will enable it” -- Dr John Taylor
e-Science in 2005
• Next generation of scientific research and experiments will be carried out by
communities of researchers from organizations that span national boundaries
• Activities will involve geographically distributed and heterogeneous resources such
as computational systems, scientific instruments, databases, sensors, software
components, networks, and people
• Such large-scale and enhanced scientific endeavors, popularly termed as e-
Science, are carried out via collaborations on a global scale
• “Grid computing has emerged as one of the key computing paradigms that enable
the creation and management of Internet-based utility computing infrastructure,
called Cyberinfrastructure, for realization of e-Science and e-Business at the global
level
https://web.archive.org/web/20160911013712/http://www.cloudbus.org/escience/cfp.html
eScience in 2012
• When planning the 2012 IEEE eScience conference:
• We decided that the key distinguishing element of eScience was
joint use of advances in infrastructure and advances in the use of
that infrastructure to make advances in scholarly research (which
we called science)
• We used this to organize the call for papers and the sessions
• Some papers focused more on
advances in the infrastructure
• Some more on advances in using it
• All papers had to have some
combination of these advances that
advanced research
Naming ourselves
• We called this eScience because of its tie to eInfrastructure, sometimes called
cyberinfrastructure
Digressing to defining cyberinfrastructure
• For an expansive view of what is infrastructure, we can use Craig
Stewart’s definition of cyberinfrastructure:
• “Computing systems, data storage systems, advanced
instruments and data repositories, visualization environments,
and people, all linked together by software and high performance
networks to improve research productivity and enable
breakthroughs not otherwise possible”
• Note that this contains the same goal of advancing research as
eScience.
https://doi.org/10.1145/1878335.1878347
Naming ourselves
• We called this eScience because of its tie to eInfrastructure, sometimes called
cyberinfrastructure
• But as with other e* and i* and cyber* things, the eScience name feels a little dated
• We need a new name
• Is eScience just Science now?
• Probably not
• Coupling of advances in digital infrastructure with advances in how the infrastructure is used
is still different than much of research
• Plus, science in English is more limited than wissenschaft in German
• Use research or scholarship or scholarly research instead?
• Thinking of the inherent research & infrastructure symbiosis (cf. lichen), maybe:
• Research and Infrastructure Development Symbiosis (RaIDS)
Coupling advances in infrastructure and research
• Sometimes a collaboration
• Though not always a single team
• Or even at a single time; can be a gap between paired advances
• Useful to examine these collaborations and interactions
• Specifically, the human aspects
Coupled advances
• For this type of progress to be made:
• Infrastructure developers have to be aware of infrastructure users’
potential needs
• Infrastructure users have to be aware of infrastructure developers’
possible offerings
• This requires communication
• After some advance in both that leads to progress in research, a
match may be declared, where the infrastructure advance needs to be
given credit for the research progress
• I think we as a community need to talk about these aspects,
communication and credit
• Due to limited time, I will mostly focus on communications
• Overall challenge: how can we improve them to improve eScience?
Push & Pull
• Developers make advances in infrastructure, intended to enable
better/more advanced scholarship
• Need to decide what advances to make
• Based on what they think will be used
• Based on what they think is possible
• Need to communicate this to researchers
• Advances in scholarship depend on advances in infrastructure
• Researchers need to decide what infrastructure advances to try/invest in
• Based on which are robust
• Based on which are sustainable
• Based on what others are using
• Based on what they think is possible
• Need to communicate their needs to infrastructure developers
• Idea: is our role to work between these two groups?
Problem 1
Problem 2: Thinking of what is possible is hard
https://www.youtube.com/watch?v=jWTGsUyv8IE
https://www.youtube.com/watch?v=jWTGsUyv8IE
https://www.chron.com/neighborhood/bayarea/news/article/When-Boris-Yeltsin-went-
grocery-shopping-in-Clear-5759129
“’Even the Politburo doesn't have this choice. Not even Mr. Gorbachev,’ he
said. When he was told through his interpreter that there were thousands of
items in the store for sale he didn't believe it. He had even thought that the
store was staged, a show for him.”
https://www.pinterest.com/pin/439593613604653533/
Problem 3: Too much …
https://www.etsy.com/listing/99710612/monkey-see-monkey-do-grown-up-t-shirt
Push and pull today
• Today, eScience (RaIDS) highlights collaborations that have
succeeded, e.g.
• Pegasus & Montage: https://pegasus.isi.edu/application-showcase/montage/
• Charm++ & NAMD: “Chapter 5: NAMD: Scalable Molecular Dynamics Based
on the Charm++ Parallel Runtime System”
• through papers and workshops
• Idea: Can we improve these papers by requiring them to state the
advances on which they are based?
• How can RaIDS promote communication that leads to encourage
additional successful collaborations?
Push and pull solutions though communication
• Idea: Can RaIDS bring in elements of matchmaking?
• Facilitated discussions, aka charrettes, ideas labs, sandpits
• Community road map and white paper processes (e.g. NASA Earth Science,
HEP software)
• Decadal surveys (e.g., astronomy)
• Idea: Can RaIDS promote/improve/standardize catalogs?
• XSEDE services: https://www.xsede.org/ecosystem/services
• ELIXIR’s bio.tools
• Idea: Can RaIDS promote infrastructure likely to be useful, e.g., a
technology showcase, and research needs, e.g., a research
challenges showcase
Quick foray into credit
• I don’t want to take this bus
• We need to better tie research advances to
the infrastructure (computing, data, software,
etc.) advances that enable them
• In our current system, best way is joint authorship
for synchronous collaborations
• eScience is a good platform for this now
• And via citation for asynchronous collaborations
• New standards emerging for data and software citation, ideas for computing
(instrument) citations
• Idea: RaIDS should encourage these via appropriate guidance to authors
and reviewers
Recap
• We aren’t a grid conference nor one on global-scale work, nor do we
focus only on science  we need a new name
• I suggest Research and Infrastructure Development Symbiosis
(RaIDS), but other ideas are equally welcome
• Ideas for future years of RaIDS
• Focus on role of attendees as people between infrastructure & applications
• Improve papers by requiring them to state advances on which they are based
• Bring in elements of matchmaking intended to lead to new collaborations
• Promote/improve/standardize catalogs
• Promote infrastructure likely to be useful, e.g., a technology showcase, and
research needs, e.g., a research challenges showcase
• Encourage infrastructure citation via appropriate guidance to authors and
reviewers
What is eScience, and where does it go from here?

More Related Content

What's hot

Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...Jisc
 
Big Data Talent in Academic and Industry R&D
Big Data Talent in Academic and Industry R&DBig Data Talent in Academic and Industry R&D
Big Data Talent in Academic and Industry R&DUniversity of Washington
 
SPARC Repositories conference in Baltimore - Nov 2010
SPARC Repositories conference in Baltimore - Nov 2010SPARC Repositories conference in Baltimore - Nov 2010
SPARC Repositories conference in Baltimore - Nov 2010Jisc
 
Do & don't of supporting Open Science
Do & don't of supporting Open ScienceDo & don't of supporting Open Science
Do & don't of supporting Open ScienceSarah Jones
 
Free as in Puppies: Compensating for ICT Constraints in Citizen Science
Free as in Puppies: Compensating for ICT Constraints in Citizen ScienceFree as in Puppies: Compensating for ICT Constraints in Citizen Science
Free as in Puppies: Compensating for ICT Constraints in Citizen ScienceAndrea Wiggins
 
Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014Jisc
 
Social Machines of Science and Scholarship
Social Machines of Science and ScholarshipSocial Machines of Science and Scholarship
Social Machines of Science and ScholarshipDavid De Roure
 
E research attachment survey
E research attachment surveyE research attachment survey
E research attachment surveyRiri Kusumarani
 
Emerging Forms of Data and Analytics
Emerging Forms of Data and AnalyticsEmerging Forms of Data and Analytics
Emerging Forms of Data and AnalyticsDavid De Roure
 
Citizen Science Phenotypes
Citizen Science PhenotypesCitizen Science Phenotypes
Citizen Science PhenotypesAndrea Wiggins
 
Software and Education at NSF/ACI
Software and Education at NSF/ACISoftware and Education at NSF/ACI
Software and Education at NSF/ACIDaniel S. Katz
 
From Open Data to Open Science, by Geoffrey Boulton
 From Open Data to Open Science, by Geoffrey Boulton From Open Data to Open Science, by Geoffrey Boulton
From Open Data to Open Science, by Geoffrey BoultonLEARN Project
 
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...EarthCube
 
Jisc's new shared data centre
Jisc's new shared data centreJisc's new shared data centre
Jisc's new shared data centreJisc
 

What's hot (20)

Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
 
Big Data Talent in Academic and Industry R&D
Big Data Talent in Academic and Industry R&DBig Data Talent in Academic and Industry R&D
Big Data Talent in Academic and Industry R&D
 
Knoesis Student Achievement
Knoesis Student AchievementKnoesis Student Achievement
Knoesis Student Achievement
 
SPARC Repositories conference in Baltimore - Nov 2010
SPARC Repositories conference in Baltimore - Nov 2010SPARC Repositories conference in Baltimore - Nov 2010
SPARC Repositories conference in Baltimore - Nov 2010
 
Zarneger "Supporting AI: Best Practices for Content Delivery Platforms"
Zarneger "Supporting AI: Best Practices for Content Delivery Platforms"Zarneger "Supporting AI: Best Practices for Content Delivery Platforms"
Zarneger "Supporting AI: Best Practices for Content Delivery Platforms"
 
Crowdsourcing Science
Crowdsourcing ScienceCrowdsourcing Science
Crowdsourcing Science
 
Do & don't of supporting Open Science
Do & don't of supporting Open ScienceDo & don't of supporting Open Science
Do & don't of supporting Open Science
 
Domain-specific Knowledge Extraction from the Web of Data
Domain-specific Knowledge Extraction from the Web of DataDomain-specific Knowledge Extraction from the Web of Data
Domain-specific Knowledge Extraction from the Web of Data
 
Little eScience
Little eScienceLittle eScience
Little eScience
 
Kno.e.sis Review: late 2012 to mid 2013
Kno.e.sis Review: late 2012 to mid 2013Kno.e.sis Review: late 2012 to mid 2013
Kno.e.sis Review: late 2012 to mid 2013
 
Free as in Puppies: Compensating for ICT Constraints in Citizen Science
Free as in Puppies: Compensating for ICT Constraints in Citizen ScienceFree as in Puppies: Compensating for ICT Constraints in Citizen Science
Free as in Puppies: Compensating for ICT Constraints in Citizen Science
 
Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014
 
Social Machines of Science and Scholarship
Social Machines of Science and ScholarshipSocial Machines of Science and Scholarship
Social Machines of Science and Scholarship
 
E research attachment survey
E research attachment surveyE research attachment survey
E research attachment survey
 
Emerging Forms of Data and Analytics
Emerging Forms of Data and AnalyticsEmerging Forms of Data and Analytics
Emerging Forms of Data and Analytics
 
Citizen Science Phenotypes
Citizen Science PhenotypesCitizen Science Phenotypes
Citizen Science Phenotypes
 
Software and Education at NSF/ACI
Software and Education at NSF/ACISoftware and Education at NSF/ACI
Software and Education at NSF/ACI
 
From Open Data to Open Science, by Geoffrey Boulton
 From Open Data to Open Science, by Geoffrey Boulton From Open Data to Open Science, by Geoffrey Boulton
From Open Data to Open Science, by Geoffrey Boulton
 
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...
AHM 2014: The iPlant Collaborative, Community Cyberinfrastructure for Life Sc...
 
Jisc's new shared data centre
Jisc's new shared data centreJisc's new shared data centre
Jisc's new shared data centre
 

Similar to What is eScience, and where does it go from here?

INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎Libcorpio
 
Scientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesScientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesDaniel S. Katz
 
A Method to Select e-Infrastructure Components to Sustain
A Method to Select e-Infrastructure Components to SustainA Method to Select e-Infrastructure Components to Sustain
A Method to Select e-Infrastructure Components to SustainDaniel S. Katz
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Anita de Waard
 
#ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love #ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love Kristi Holmes
 
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...datacite
 
XSEDE and National Cyberinfrastructure
XSEDE and National CyberinfrastructureXSEDE and National Cyberinfrastructure
XSEDE and National CyberinfrastructureJohn Towns
 
From Open Access to Open Data
From Open Access to Open DataFrom Open Access to Open Data
From Open Access to Open DataBrian Hole
 
XSEDE Overview (March 2014)
XSEDE Overview (March 2014)XSEDE Overview (March 2014)
XSEDE Overview (March 2014)John Towns
 
Why manage research data?
Why manage research data?Why manage research data?
Why manage research data?Graham Pryor
 
Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Peter Löwe
 
06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014VinothkumaR Ramu
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosOCLC
 
EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube
 
Asteroid Observations - Real Time Operational Intelligence Series
Asteroid Observations - Real Time Operational Intelligence SeriesAsteroid Observations - Real Time Operational Intelligence Series
Asteroid Observations - Real Time Operational Intelligence SeriesStormBourne, LLC
 
NSF SI2 program discussion at 2014 SI2 PI meeting
NSF SI2 program discussion at 2014 SI2 PI meetingNSF SI2 program discussion at 2014 SI2 PI meeting
NSF SI2 program discussion at 2014 SI2 PI meetingDaniel S. Katz
 

Similar to What is eScience, and where does it go from here? (20)

INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
INNOVATION AND ‎RESEARCH (Digital Library ‎Information Access)‎
 
Scientific Software Challenges and Community Responses
Scientific Software Challenges and Community ResponsesScientific Software Challenges and Community Responses
Scientific Software Challenges and Community Responses
 
A Method to Select e-Infrastructure Components to Sustain
A Method to Select e-Infrastructure Components to SustainA Method to Select e-Infrastructure Components to Sustain
A Method to Select e-Infrastructure Components to Sustain
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
 
#ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love #ALAAC15 Linked Data Love
#ALAAC15 Linked Data Love
 
Sgci esip-7-20-18
Sgci esip-7-20-18Sgci esip-7-20-18
Sgci esip-7-20-18
 
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
 
XSEDE and National Cyberinfrastructure
XSEDE and National CyberinfrastructureXSEDE and National Cyberinfrastructure
XSEDE and National Cyberinfrastructure
 
From Open Access to Open Data
From Open Access to Open DataFrom Open Access to Open Data
From Open Access to Open Data
 
FAIR play?
FAIR play? FAIR play?
FAIR play?
 
XSEDE Overview (March 2014)
XSEDE Overview (March 2014)XSEDE Overview (March 2014)
XSEDE Overview (March 2014)
 
Sgci nsf-si2-2-21-17
Sgci nsf-si2-2-21-17Sgci nsf-si2-2-21-17
Sgci nsf-si2-2-21-17
 
Final Johnson Research Libraries and Computational Research
Final Johnson Research Libraries and Computational ResearchFinal Johnson Research Libraries and Computational Research
Final Johnson Research Libraries and Computational Research
 
Why manage research data?
Why manage research data?Why manage research data?
Why manage research data?
 
Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...Data Science: History repeated? – The heritage of the Free and Open Source GI...
Data Science: History repeated? – The heritage of the Free and Open Source GI...
 
06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
 
EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013EarthCube Monthly Community Webinar- Nov. 22, 2013
EarthCube Monthly Community Webinar- Nov. 22, 2013
 
Asteroid Observations - Real Time Operational Intelligence Series
Asteroid Observations - Real Time Operational Intelligence SeriesAsteroid Observations - Real Time Operational Intelligence Series
Asteroid Observations - Real Time Operational Intelligence Series
 
NSF SI2 program discussion at 2014 SI2 PI meeting
NSF SI2 program discussion at 2014 SI2 PI meetingNSF SI2 program discussion at 2014 SI2 PI meeting
NSF SI2 program discussion at 2014 SI2 PI meeting
 

More from Daniel S. Katz

Research software susainability
Research software susainabilityResearch software susainability
Research software susainabilityDaniel S. Katz
 
Software Professionals (RSEs) at NCSA
Software Professionals (RSEs) at NCSASoftware Professionals (RSEs) at NCSA
Software Professionals (RSEs) at NCSADaniel S. Katz
 
Parsl: Pervasive Parallel Programming in Python
Parsl: Pervasive Parallel Programming in PythonParsl: Pervasive Parallel Programming in Python
Parsl: Pervasive Parallel Programming in PythonDaniel S. Katz
 
Requiring Publicly-Funded Software, Algorithms, and Workflows to be Made Publ...
Requiring Publicly-Funded Software, Algorithms, and Workflows to be Made Publ...Requiring Publicly-Funded Software, Algorithms, and Workflows to be Made Publ...
Requiring Publicly-Funded Software, Algorithms, and Workflows to be Made Publ...Daniel S. Katz
 
Citation and Research Objects: Toward Active Research Objects
Citation and Research Objects: Toward Active Research ObjectsCitation and Research Objects: Toward Active Research Objects
Citation and Research Objects: Toward Active Research ObjectsDaniel S. Katz
 
FAIR is not Fair Enough, Particularly for Software Citation, Availability, or...
FAIR is not Fair Enough, Particularly for Software Citation, Availability, or...FAIR is not Fair Enough, Particularly for Software Citation, Availability, or...
FAIR is not Fair Enough, Particularly for Software Citation, Availability, or...Daniel S. Katz
 
Fundamentals of software sustainability
Fundamentals of software sustainabilityFundamentals of software sustainability
Fundamentals of software sustainabilityDaniel S. Katz
 
Software Citation in Theory and Practice
Software Citation in Theory and PracticeSoftware Citation in Theory and Practice
Software Citation in Theory and PracticeDaniel S. Katz
 
Research Software Sustainability: WSSSPE & URSSI
Research Software Sustainability: WSSSPE & URSSIResearch Software Sustainability: WSSSPE & URSSI
Research Software Sustainability: WSSSPE & URSSIDaniel S. Katz
 
Expressing and sharing workflows
Expressing and sharing workflowsExpressing and sharing workflows
Expressing and sharing workflowsDaniel S. Katz
 
Citation and reproducibility in software
Citation and reproducibility in softwareCitation and reproducibility in software
Citation and reproducibility in softwareDaniel S. Katz
 
Software Citation: Principles, Implementation, and Impact
Software Citation:  Principles, Implementation, and ImpactSoftware Citation:  Principles, Implementation, and Impact
Software Citation: Principles, Implementation, and ImpactDaniel S. Katz
 
Summary of WSSSPE and its working groups
Summary of WSSSPE and its working groupsSummary of WSSSPE and its working groups
Summary of WSSSPE and its working groupsDaniel S. Katz
 
Working towards Sustainable Software for Science: Practice and Experience (WS...
Working towards Sustainable Software for Science: Practice and Experience (WS...Working towards Sustainable Software for Science: Practice and Experience (WS...
Working towards Sustainable Software for Science: Practice and Experience (WS...Daniel S. Katz
 
20160607 citation4software panel
20160607 citation4software panel20160607 citation4software panel
20160607 citation4software panelDaniel S. Katz
 
20160607 citation4software opening
20160607 citation4software opening20160607 citation4software opening
20160607 citation4software openingDaniel S. Katz
 
What do we need beyond a DOI?
What do we need beyond a DOI?What do we need beyond a DOI?
What do we need beyond a DOI?Daniel S. Katz
 
Looking at Software Sustainability and Productivity Challenges from NSF
Looking at Software Sustainability and Productivity Challenges from NSFLooking at Software Sustainability and Productivity Challenges from NSF
Looking at Software Sustainability and Productivity Challenges from NSFDaniel S. Katz
 

More from Daniel S. Katz (20)

Research software susainability
Research software susainabilityResearch software susainability
Research software susainability
 
Software Professionals (RSEs) at NCSA
Software Professionals (RSEs) at NCSASoftware Professionals (RSEs) at NCSA
Software Professionals (RSEs) at NCSA
 
Parsl: Pervasive Parallel Programming in Python
Parsl: Pervasive Parallel Programming in PythonParsl: Pervasive Parallel Programming in Python
Parsl: Pervasive Parallel Programming in Python
 
Requiring Publicly-Funded Software, Algorithms, and Workflows to be Made Publ...
Requiring Publicly-Funded Software, Algorithms, and Workflows to be Made Publ...Requiring Publicly-Funded Software, Algorithms, and Workflows to be Made Publ...
Requiring Publicly-Funded Software, Algorithms, and Workflows to be Made Publ...
 
Citation and Research Objects: Toward Active Research Objects
Citation and Research Objects: Toward Active Research ObjectsCitation and Research Objects: Toward Active Research Objects
Citation and Research Objects: Toward Active Research Objects
 
FAIR is not Fair Enough, Particularly for Software Citation, Availability, or...
FAIR is not Fair Enough, Particularly for Software Citation, Availability, or...FAIR is not Fair Enough, Particularly for Software Citation, Availability, or...
FAIR is not Fair Enough, Particularly for Software Citation, Availability, or...
 
Fundamentals of software sustainability
Fundamentals of software sustainabilityFundamentals of software sustainability
Fundamentals of software sustainability
 
Software Citation in Theory and Practice
Software Citation in Theory and PracticeSoftware Citation in Theory and Practice
Software Citation in Theory and Practice
 
URSSI
URSSIURSSI
URSSI
 
Research Software Sustainability: WSSSPE & URSSI
Research Software Sustainability: WSSSPE & URSSIResearch Software Sustainability: WSSSPE & URSSI
Research Software Sustainability: WSSSPE & URSSI
 
Software citation
Software citationSoftware citation
Software citation
 
Expressing and sharing workflows
Expressing and sharing workflowsExpressing and sharing workflows
Expressing and sharing workflows
 
Citation and reproducibility in software
Citation and reproducibility in softwareCitation and reproducibility in software
Citation and reproducibility in software
 
Software Citation: Principles, Implementation, and Impact
Software Citation:  Principles, Implementation, and ImpactSoftware Citation:  Principles, Implementation, and Impact
Software Citation: Principles, Implementation, and Impact
 
Summary of WSSSPE and its working groups
Summary of WSSSPE and its working groupsSummary of WSSSPE and its working groups
Summary of WSSSPE and its working groups
 
Working towards Sustainable Software for Science: Practice and Experience (WS...
Working towards Sustainable Software for Science: Practice and Experience (WS...Working towards Sustainable Software for Science: Practice and Experience (WS...
Working towards Sustainable Software for Science: Practice and Experience (WS...
 
20160607 citation4software panel
20160607 citation4software panel20160607 citation4software panel
20160607 citation4software panel
 
20160607 citation4software opening
20160607 citation4software opening20160607 citation4software opening
20160607 citation4software opening
 
What do we need beyond a DOI?
What do we need beyond a DOI?What do we need beyond a DOI?
What do we need beyond a DOI?
 
Looking at Software Sustainability and Productivity Challenges from NSF
Looking at Software Sustainability and Productivity Challenges from NSFLooking at Software Sustainability and Productivity Challenges from NSF
Looking at Software Sustainability and Productivity Challenges from NSF
 

Recently uploaded

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 

Recently uploaded (20)

Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 

What is eScience, and where does it go from here?

  • 1. What is eScience, and where does it go from here? eScience 2019, 25 September 2019 Daniel S. Katz (d.katz@ieee.org, http://danielskatz.org, @danielskatz) Assistant Director for Scientific Software & Applications, NCSA Research Associate Professor, CS, ECE, iSchool
  • 2. e-Science in 2000 • “In November 2000 the Director General of UK Research Councils, Dr John Taylor, announced £98M funding for a new UK e-Science programme. • “In the future, e-Science will refer to the large scale science that will increasingly be carried out through distributed global collaborations enabled by the Internet. Typically, a feature of such collaborative scientific enterprises is that they will require access to very large data collections, very large scale computing resources and high performance visualisation back to the individual user scientists. • “The World Wide Web gave us access to information on Web pages written in html anywhere on the Internet. A much more powerful infrastructure is needed to support e- Science. Besides information stored in Web pages, scientists will need easy access to expensive remote facilities, to computing resources - either as dedicated Teraflop computers or cheap collections of PCs - and to information stored in dedicated databases. • “The Grid is an architecture proposed to bring all these issues together and make a reality of such a vision for e-Science.” https://web.archive.org/web/20040818222850/http://www.rcuk.ac.uk/escience_old/firstphase.shtml “e-Science is about global collaboration in key areas of science and the next generation of infrastructure that will enable it” -- Dr John Taylor
  • 3. e-Science in 2005 • Next generation of scientific research and experiments will be carried out by communities of researchers from organizations that span national boundaries • Activities will involve geographically distributed and heterogeneous resources such as computational systems, scientific instruments, databases, sensors, software components, networks, and people • Such large-scale and enhanced scientific endeavors, popularly termed as e- Science, are carried out via collaborations on a global scale • “Grid computing has emerged as one of the key computing paradigms that enable the creation and management of Internet-based utility computing infrastructure, called Cyberinfrastructure, for realization of e-Science and e-Business at the global level https://web.archive.org/web/20160911013712/http://www.cloudbus.org/escience/cfp.html
  • 4. eScience in 2012 • When planning the 2012 IEEE eScience conference: • We decided that the key distinguishing element of eScience was joint use of advances in infrastructure and advances in the use of that infrastructure to make advances in scholarly research (which we called science) • We used this to organize the call for papers and the sessions • Some papers focused more on advances in the infrastructure • Some more on advances in using it • All papers had to have some combination of these advances that advanced research
  • 5. Naming ourselves • We called this eScience because of its tie to eInfrastructure, sometimes called cyberinfrastructure
  • 6. Digressing to defining cyberinfrastructure • For an expansive view of what is infrastructure, we can use Craig Stewart’s definition of cyberinfrastructure: • “Computing systems, data storage systems, advanced instruments and data repositories, visualization environments, and people, all linked together by software and high performance networks to improve research productivity and enable breakthroughs not otherwise possible” • Note that this contains the same goal of advancing research as eScience. https://doi.org/10.1145/1878335.1878347
  • 7. Naming ourselves • We called this eScience because of its tie to eInfrastructure, sometimes called cyberinfrastructure • But as with other e* and i* and cyber* things, the eScience name feels a little dated • We need a new name • Is eScience just Science now? • Probably not • Coupling of advances in digital infrastructure with advances in how the infrastructure is used is still different than much of research • Plus, science in English is more limited than wissenschaft in German • Use research or scholarship or scholarly research instead? • Thinking of the inherent research & infrastructure symbiosis (cf. lichen), maybe: • Research and Infrastructure Development Symbiosis (RaIDS)
  • 8. Coupling advances in infrastructure and research • Sometimes a collaboration • Though not always a single team • Or even at a single time; can be a gap between paired advances • Useful to examine these collaborations and interactions • Specifically, the human aspects
  • 9. Coupled advances • For this type of progress to be made: • Infrastructure developers have to be aware of infrastructure users’ potential needs • Infrastructure users have to be aware of infrastructure developers’ possible offerings • This requires communication • After some advance in both that leads to progress in research, a match may be declared, where the infrastructure advance needs to be given credit for the research progress • I think we as a community need to talk about these aspects, communication and credit • Due to limited time, I will mostly focus on communications • Overall challenge: how can we improve them to improve eScience?
  • 10. Push & Pull • Developers make advances in infrastructure, intended to enable better/more advanced scholarship • Need to decide what advances to make • Based on what they think will be used • Based on what they think is possible • Need to communicate this to researchers • Advances in scholarship depend on advances in infrastructure • Researchers need to decide what infrastructure advances to try/invest in • Based on which are robust • Based on which are sustainable • Based on what others are using • Based on what they think is possible • Need to communicate their needs to infrastructure developers • Idea: is our role to work between these two groups?
  • 12. Problem 2: Thinking of what is possible is hard https://www.youtube.com/watch?v=jWTGsUyv8IE https://www.youtube.com/watch?v=jWTGsUyv8IE https://www.chron.com/neighborhood/bayarea/news/article/When-Boris-Yeltsin-went- grocery-shopping-in-Clear-5759129 “’Even the Politburo doesn't have this choice. Not even Mr. Gorbachev,’ he said. When he was told through his interpreter that there were thousands of items in the store for sale he didn't believe it. He had even thought that the store was staged, a show for him.” https://www.pinterest.com/pin/439593613604653533/
  • 13. Problem 3: Too much … https://www.etsy.com/listing/99710612/monkey-see-monkey-do-grown-up-t-shirt
  • 14. Push and pull today • Today, eScience (RaIDS) highlights collaborations that have succeeded, e.g. • Pegasus & Montage: https://pegasus.isi.edu/application-showcase/montage/ • Charm++ & NAMD: “Chapter 5: NAMD: Scalable Molecular Dynamics Based on the Charm++ Parallel Runtime System” • through papers and workshops • Idea: Can we improve these papers by requiring them to state the advances on which they are based? • How can RaIDS promote communication that leads to encourage additional successful collaborations?
  • 15. Push and pull solutions though communication • Idea: Can RaIDS bring in elements of matchmaking? • Facilitated discussions, aka charrettes, ideas labs, sandpits • Community road map and white paper processes (e.g. NASA Earth Science, HEP software) • Decadal surveys (e.g., astronomy) • Idea: Can RaIDS promote/improve/standardize catalogs? • XSEDE services: https://www.xsede.org/ecosystem/services • ELIXIR’s bio.tools • Idea: Can RaIDS promote infrastructure likely to be useful, e.g., a technology showcase, and research needs, e.g., a research challenges showcase
  • 16. Quick foray into credit • I don’t want to take this bus • We need to better tie research advances to the infrastructure (computing, data, software, etc.) advances that enable them • In our current system, best way is joint authorship for synchronous collaborations • eScience is a good platform for this now • And via citation for asynchronous collaborations • New standards emerging for data and software citation, ideas for computing (instrument) citations • Idea: RaIDS should encourage these via appropriate guidance to authors and reviewers
  • 17. Recap • We aren’t a grid conference nor one on global-scale work, nor do we focus only on science  we need a new name • I suggest Research and Infrastructure Development Symbiosis (RaIDS), but other ideas are equally welcome • Ideas for future years of RaIDS • Focus on role of attendees as people between infrastructure & applications • Improve papers by requiring them to state advances on which they are based • Bring in elements of matchmaking intended to lead to new collaborations • Promote/improve/standardize catalogs • Promote infrastructure likely to be useful, e.g., a technology showcase, and research needs, e.g., a research challenges showcase • Encourage infrastructure citation via appropriate guidance to authors and reviewers