PIDs, Data and Software:
How libraries can support researchers in an
evolving research landscape
Sarah Stewart, The British Library
M25 Consortium CPD25 Event – The Role of the Library in
Supporting Research. London Mathematical Society, June 28, 2018
www.bl.uk
Outline
• The Evolving (Digital) Research Landscape…
• Data
• Software
• PIDs
• Developing Research Support Services for Data
and Software
• Conclusion and Questions
Data, Software, PIDs – Oh My!
An Evolving Research Landscape…
• Research is ‘always already’ digital, and is becoming
increasingly linked and networked
• Open Research – fosters transparency, validity and
reproducibility of research
• Strong mandates in the UK from funders (e.g. UKRI,
Wellcome) to make data open
• Increasing push from publishers to make ‘non-traditional’
outputs such as data available online
• A role for Linked Open Data (LOD)?
The Research Graph (GESIS)
http://researchgraph.org/augment-api/
Data and the Digital Research Landscape
• Data as a research output (=credit and impact for
researchers!)
• Emergence of data journals, data repositories, global
data-sharing initiatives, scientific working committees
• Mandate from funders to make research data available
for 10+ years – digital preservation
• FORCE11 (2016): Make data FAIR – Findable, Accessible,
Interoperable and Reusable
• Data Management Plans as part of applications to
funders (e.g. UKRI, Wellcome)
The Importance of Research Data Management…
“In their parents' attic, in boxes in the garage, or stored on now-defunct
floppy disks — these are just some of the inaccessible places in which
scientists have admitted to keeping their old research data.”
http://www.nature.com/news/scientists-losing-data-at-a-rapid-rate-1.14416
Funder requirements…
“Publicly funded research data are a public good,
produced in the public interest, which should be made
openly available with as few restrictions as
possible…”
RCUK Common Principles on Data Policy
RCUK Research Data Policy Coverage (2014)
What are Data?
• Many formats, volumes, types, ranging
from physical specimens and archival
material to petabytes of high-throughput
automated measurements or
simulations
• Language of data is taken from the
STEM disciplines, but data also exists
for the arts and humanities
• Need a way to describe (to make
discoverable/findable), store, preserve
and ensure access, sharing, and re-use
if this is possible (it may not be!)
UKRI Definition of Data
“Research data are the evidence that underpins the answer to the
research question, and can be used to validate findings regardless
of its form (e.g. print, digital, or physical). These might be quantitative
information or qualitative statements collected by researchers in the
course of their work by experimentation, observation, modelling, interview
or other methods, or information derived from existing evidence. Data may
be raw or primary (e.g. direct from measurement or collection) or derived
from primary data for subsequent analysis or interpretation (e.g. cleaned
up or as an extract from a larger data set), or derived from existing
sources where the rights may be held by others….The primary purpose
of research data is to provide the information necessary to support
or validate a research project's observations, findings or outputs.”
– UKRI Concordat on Open Research Data (2016)
https://www.ukri.org/files/legacy/documents/concordatonopenresearchdata-pdf/
The Research Data Lifecycle
Software: What do I do with it?
• Lots of emphasis on ‘data’ management, but software in
research is often neglected.
• Software is sensitive to changes in its ‘environment’
• There is a lot of variation inherent in software (languages,
versions, licensing, etc.)
Software as ‘Data’
• ‘Software is used to create, interpret, present, manipulate and
manage data’ (Software Sustainability Institute)
• Data: ‘recorded factual material commonly retained by and
accepted…as necessary to validate research findings’
(EPSRC)
• Software = Data!
Obsolescence!
Software should be preserved if:
• Software can’t be separated from the data or digital object.
• Software is classified as a research output
• Software has intrinsic value
• More resources available at the Software Sustainability
Institute:
https://www.software.ac.uk/software-sustainability-institute
Treat software as valuable research output
PyRDM Green Shoots project
Zenodo integrates with GitHub
College survey on distributed version control
Software Sustainability Institute – I am a fellow
PIDs in Research
hinemizushima.com
Why Use Persistent Identifiers?
• Use of persistent identifiers has
increased as scholarly
communications become
increasingly digital.
• ORCID iDs and DOIs support open
science by enabling
interoperability between research
infrastructures.
• For instance, DataCite and
Crossref can use DOIs and
ORCID iDs, in addition to other
metadata, to map and link
documents, data and
researchers.
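The kind of linking described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the two records, their field names and the `10.1234` DOIs are hypothetical, and real services such as DataCite or Crossref expose far richer metadata. The ORCID iD shown is the one ORCID uses for its fictional sample researcher.

```python
from collections import defaultdict

# Hypothetical records for illustration only; the DOIs are made up.
records = [
    {
        "doi": "10.1234/dataset-a",
        "type": "dataset",
        "creators": ["0000-0002-1825-0097"],
    },
    {
        "doi": "10.1234/article-b",
        "type": "article",
        "creators": ["0000-0002-1825-0097"],
        "references": ["10.1234/dataset-a"],
    },
]


def outputs_by_researcher(records):
    """Map each ORCID iD to the DOIs of the outputs it is attached to."""
    index = defaultdict(list)
    for record in records:
        for orcid in record["creators"]:
            index[orcid].append(record["doi"])
    return dict(index)


def citing_outputs(records, doi):
    """Return the DOIs of records that reference the given DOI."""
    return [r["doi"] for r in records if doi in r.get("references", [])]
```

Because both people and outputs carry identifiers, a question such as "which articles use this dataset, and who wrote them?" becomes a lookup rather than a name-matching exercise.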
PIDs (Persistent Identifiers)
• ORCID iDs (Open Researcher and Contributor IDs)
• DOIs (Digital Object Identifiers)
What is an ORCID iD?
• ‘Open Researcher & Contributor ID’
• Developed by ORCID, a non-profit community-owned organisation
• Provides a solution to name ambiguity in research and scholarly
communications
• Unique, persistent identifier for you as a researcher/academic.
Linked to your name, rather than to your institution
• Can be applied to your research outputs to identify, validate and
confirm your authorship
• Can be used to track research outputs
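One practical detail not covered on the slide: an ORCID iD is a 16-character identifier whose final character is a check digit (ISO 7064 MOD 11-2), so systems can catch most typing errors before storing an iD. The sketch below is a minimal illustration; the function names are hypothetical and it validates the format only, not whether the iD is actually registered.

```python
def orcid_check_digit(base_digits: str) -> str:
    """Compute the ISO 7064 MOD 11-2 check character for the first 15 digits."""
    total = 0
    for ch in base_digits:
        total = (total + int(ch)) * 2
    result = (12 - total % 11) % 11
    return "X" if result == 10 else str(result)


def is_valid_orcid(orcid: str) -> bool:
    """Check the length, character set and check digit of an ORCID iD."""
    compact = orcid.replace("-", "").upper()
    if len(compact) != 16:
        return False
    if not all(ch in "0123456789" for ch in compact[:15]):
        return False
    return orcid_check_digit(compact[:15]) == compact[15]
```

For example, `is_valid_orcid("0000-0002-1825-0097")` accepts the iD ORCID uses for its fictional sample researcher, while changing the last digit makes the check fail.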
ORCID provides:
ORCID promotes System Interoperability:
DataCite (and DataCite UK)
• DataCite is a leading global non-profit organisation
that provides persistent identifiers (DOIs) for research
data. Our goal is to help the research community
locate, identify, and cite research data with
confidence.
• Supports the creation and allocation of DOIs and
accompanying metadata.
• Provides services that support the enhanced search
and discovery of research content.
• Promotes data citation and advocacy through our
community-building efforts and responsive
communication and outreach materials.
• DataCite UK is the UK’s national hub for the provision
of persistent identifiers (DOIs) for research data.
DOIs (Digital Object Identifiers)
• Persistent identifier used to uniquely identify objects (datasets,
software, journal articles, theses), standardised by the
International Organization for Standardization (ISO 26324)
• Presented as an alphanumeric string consisting of a prefix and
suffix separated by a slash ‘/’. The ‘10’ at the start of the prefix
marks the string as a DOI within the Handle System namespace. E.g.
10.1037/rmh0000008
• Built on the Handle System: a DOI is ‘resolvable’ because metadata
(such as a URL) is bound to it, so the DOI redirects to the object
it identifies.
• A DOI is persistent, but the location it points to may change, so it
is the publisher’s responsibility to keep the metadata attached to the
DOI up to date. Otherwise, the DOI will resolve to a dead link.
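The prefix/suffix structure and the idea of resolution can be shown with a short Python sketch. It makes some simplifying assumptions: the regular expression covers the common `10.NNNN/suffix` shape rather than every DOI the standard permits, the function names are hypothetical, and the code only builds the `https://doi.org/` resolver link without contacting the resolver.

```python
import re
from urllib.parse import quote

# Simplified pattern: "10." followed by 4-9 digits, a slash, then a non-empty suffix.
DOI_PATTERN = re.compile(r"^(10\.\d{4,9})/(\S+)$")


def split_doi(doi: str):
    """Split a DOI into its (prefix, suffix) parts or raise ValueError."""
    match = DOI_PATTERN.match(doi.strip())
    if match is None:
        raise ValueError(f"not a recognisable DOI: {doi!r}")
    return match.group(1), match.group(2)


def resolver_url(doi: str) -> str:
    """Build the doi.org link that redirects to wherever the object lives now."""
    prefix, suffix = split_doi(doi)
    return "https://doi.org/" + quote(f"{prefix}/{suffix}", safe="/")
```

With the example from the slide, `split_doi("10.1037/rmh0000008")` gives the prefix `10.1037` and the suffix `rmh0000008`, and `resolver_url` returns `https://doi.org/10.1037/rmh0000008`. The link stays the same even if the publisher later moves the landing page, provided the DOI metadata is updated.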
DOIs and FAIR Data
• DOIs help ensure that data (and metadata about that data) remain
findable and citable for the long term
• Can be searched for and made discoverable and findable
(through DataCite and Crossref, Google search, re3data)
• Access and re-use conditions can be clarified. If the data
cannot be made open, the metadata can explicitly state the
terms and conditions of access.
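As a rough illustration of metadata that states access conditions, the sketch below uses a plain Python dictionary. The field names are loosely modelled on the mandatory properties of the DataCite Metadata Schema (identifier, creators, titles, publisher, publication year, resource type) plus an optional rights statement, but the structure, the values and the helper functions are hypothetical and are not the schema's actual serialisation.

```python
# Simplified stand-in for a metadata record; all values are made up.
REQUIRED_FIELDS = (
    "identifier",
    "creators",
    "titles",
    "publisher",
    "publicationYear",
    "resourceType",
)

record = {
    "identifier": "10.1234/example-dataset",  # hypothetical DOI
    "creators": [{"name": "Example, Researcher"}],
    "titles": ["Example survey responses"],
    "publisher": "Example University Repository",
    "publicationYear": 2018,
    "resourceType": "Dataset",
    "rights": "Restricted: available on request to the data steward",
}


def missing_fields(record):
    """List required fields that are absent or empty in the record."""
    return [field for field in REQUIRED_FIELDS if not record.get(field)]


def access_statement(record):
    """Return the stated access conditions, or a note that none are given."""
    return record.get("rights") or "No access conditions stated"
```

Even when the dataset itself cannot be shared, a record like this keeps the work findable and tells a prospective user how access is granted.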
PIDs and FAIR Data
‘If you like it, put a PID on it…’
The Library can play a strong role in service
support for Research Data, software and PIDs
The ‘Stick-and-Carrot’ approach to Data
Management?
Why spend time on RDM?
• It is not a distraction from ‘real work’.
• You can work effectively and efficiently.
• Save time and reduce frustration in the future.
• Set up systems that work for you.
Engaging Directly with Researchers
• Embedded approach – meet with researchers in situ – in
their labs and offices
• One-on-one or group meetings
• Departmental meetings to inform on policy changes and
updates and provide insight into best practice.
Outreach – Love Your Data!
• PhD Training on RDM Basics and DMPOnline (including PhD-specific
DMPOnline template)
• RDM ‘Drop-in Clinics’
• RDM ‘Byte-Size’ sessions – informal sessions on various topics
• Imperial Data Circus
• Open Access Road Show
Findings from Imperial College RDM Policy
Development
• 60-100% of grant required to re-generate data used in
publications
• % of data that needs retaining to support publications: ~60%
• Data storage capacity will have to grow significantly
• Concerns around back-up and archiving, esp. considering data
volume
• Popularity of cloud services (as opposed to College storage)
• Researchers want a self-administered, secure, responsive solution
for data sharing, storing and archiving; open APIs preferred
• “Yes [storage] is really important. Basically, whenever we have been
out to talk to researchers, that's the thing they have latched on to and
want to talk about the most.” (10.1371/journal.pone.0114734)
The RDM Workflow at Imperial
RDM Infrastructure
Data Access Statement
The Library Supporting Researchers:
Infrastructure
• Consider workflows for research data
• Assist in the development of research data management plans
(use DMPOnline)
• Integration with existing systems (e.g. CRIS, grant systems)
• Use Your Metadata – Make work findable, discoverable and
accessible
Engagement
• Clear, direct communication
• Outreach and discussion
• Many benefits for researchers – increased efficiency and
impact of research
Thank You!
Any Questions?
sarah.stewart@bl.uk
datasets@bl.uk
https://datacite.org/
@BioStew
39

Más contenido relacionado

La actualidad más candente

Researchers guide March 2014
Researchers guide March 2014Researchers guide March 2014
Researchers guide March 2014
EISLibrarian
 
Virtual support_to_research_communities
Virtual  support_to_research_communitiesVirtual  support_to_research_communities
Virtual support_to_research_communities
СОБДиЮ
 
Information literacy
Information literacyInformation literacy
Information literacy
shechild
 
Stop Press: Libraries' Role in the Future of Publishing
Stop Press: Libraries' Role in the Future of PublishingStop Press: Libraries' Role in the Future of Publishing
Stop Press: Libraries' Role in the Future of Publishing
Danny Kingsley
 

La actualidad más candente (16)

Altmetrics and Social Media: Publicising, Discovering, Engaging
Altmetrics and Social Media: Publicising, Discovering, EngagingAltmetrics and Social Media: Publicising, Discovering, Engaging
Altmetrics and Social Media: Publicising, Discovering, Engaging
 
Psychology PG Skills Study Day
Psychology PG Skills Study DayPsychology PG Skills Study Day
Psychology PG Skills Study Day
 
Researchers guide March 2014
Researchers guide March 2014Researchers guide March 2014
Researchers guide March 2014
 
Digital Scholarship: building an online scholarly presence
Digital Scholarship: building an online scholarly presenceDigital Scholarship: building an online scholarly presence
Digital Scholarship: building an online scholarly presence
 
Virtual support_to_research_communities
Virtual  support_to_research_communitiesVirtual  support_to_research_communities
Virtual support_to_research_communities
 
Preserving Our Digital Heritage: Community Action via UK LOCKSS
Preserving Our Digital Heritage: Community Action via UK LOCKSSPreserving Our Digital Heritage: Community Action via UK LOCKSS
Preserving Our Digital Heritage: Community Action via UK LOCKSS
 
Information literacy
Information literacyInformation literacy
Information literacy
 
Ensuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly ResourcesEnsuring Continuing Access to Online Scholarly Resources
Ensuring Continuing Access to Online Scholarly Resources
 
Reference Rot and Linked Data: Threat and Remedy
Reference Rot and Linked Data: Threat and RemedyReference Rot and Linked Data: Threat and Remedy
Reference Rot and Linked Data: Threat and Remedy
 
Stop Press: Libraries' Role in the Future of Publishing
Stop Press: Libraries' Role in the Future of PublishingStop Press: Libraries' Role in the Future of Publishing
Stop Press: Libraries' Role in the Future of Publishing
 
Mpirical CCM4901 Feb 2016
Mpirical CCM4901 Feb 2016Mpirical CCM4901 Feb 2016
Mpirical CCM4901 Feb 2016
 
PSY4035 finding research info 2017
PSY4035 finding research info 2017 PSY4035 finding research info 2017
PSY4035 finding research info 2017
 
Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)
 
Research Data Management at The University of Edinburgh
Research Data Management at The University of EdinburghResearch Data Management at The University of Edinburgh
Research Data Management at The University of Edinburgh
 
Research data management at UAL
Research data management at UALResearch data management at UAL
Research data management at UAL
 
Is It Too Late to Ensure Continuity of Access to the Scholarly Record?
Is It Too Late to Ensure Continuity of Access to the Scholarly Record?Is It Too Late to Ensure Continuity of Access to the Scholarly Record?
Is It Too Late to Ensure Continuity of Access to the Scholarly Record?
 

Similar a PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving Research Landscape

Similar a PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving Research Landscape (20)

Research Data Management in GLAM: Managing Data for Cultural Heritage
Research Data Management in GLAM: Managing Data for Cultural HeritageResearch Data Management in GLAM: Managing Data for Cultural Heritage
Research Data Management in GLAM: Managing Data for Cultural Heritage
 
Data Strategy and Services at the British Library: Data, Software and PIDs
Data Strategy and Services at the British Library: Data, Software and PIDsData Strategy and Services at the British Library: Data, Software and PIDs
Data Strategy and Services at the British Library: Data, Software and PIDs
 
RDM Programme at University of Edinburgh
RDM Programme at University of EdinburghRDM Programme at University of Edinburgh
RDM Programme at University of Edinburgh
 
RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
 
Research Data Management at Imperial College London
Research Data Management at Imperial College LondonResearch Data Management at Imperial College London
Research Data Management at Imperial College London
 
RDM @ UoE
RDM @ UoERDM @ UoE
RDM @ UoE
 
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShareScottish Digital Library Consortium Meeting: Edinburgh DataShare
Scottish Digital Library Consortium Meeting: Edinburgh DataShare
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
 
RDM@Edinburgh
RDM@EdinburghRDM@Edinburgh
RDM@Edinburgh
 
RDM@Edinburgh
RDM@EdinburghRDM@Edinburgh
RDM@Edinburgh
 
Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...Supporting the development of a national Research Data Discovery Service – a ...
Supporting the development of a national Research Data Discovery Service – a ...
 
RDM Programme @ Edinburgh: Data Librarian Experience
RDM Programme @ Edinburgh: Data Librarian ExperienceRDM Programme @ Edinburgh: Data Librarian Experience
RDM Programme @ Edinburgh: Data Librarian Experience
 
Engaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciences
 
Staffing Research Data Services at University of Edinburgh
Staffing Research Data Services at University of EdinburghStaffing Research Data Services at University of Edinburgh
Staffing Research Data Services at University of Edinburgh
 
Research Data Service at the University of Edinburgh
Research Data Service at the University of EdinburghResearch Data Service at the University of Edinburgh
Research Data Service at the University of Edinburgh
 
Getting to grips with Research Data Management
Getting to grips with Research Data ManagementGetting to grips with Research Data Management
Getting to grips with Research Data Management
 
EDINA / Data Library Overview
EDINA / Data Library OverviewEDINA / Data Library Overview
EDINA / Data Library Overview
 
Getting to grips with research data management
Getting to grips with research data management Getting to grips with research data management
Getting to grips with research data management
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
 

Más de Sarah Anna Stewart

Más de Sarah Anna Stewart (9)

Library Carpentry Git, GitHub and GitPages Introduction Slides
Library Carpentry Git, GitHub and GitPages Introduction SlidesLibrary Carpentry Git, GitHub and GitPages Introduction Slides
Library Carpentry Git, GitHub and GitPages Introduction Slides
 
DataCite UK and British Library Update - DataCite UK Summer Client Meeting 2018
DataCite UK and British Library Update - DataCite UK Summer Client Meeting 2018DataCite UK and British Library Update - DataCite UK Summer Client Meeting 2018
DataCite UK and British Library Update - DataCite UK Summer Client Meeting 2018
 
Webs of Life and Data: Impacts of open and networked data on scientific pract...
Webs of Life and Data: Impacts of open and networked data on scientific pract...Webs of Life and Data: Impacts of open and networked data on scientific pract...
Webs of Life and Data: Impacts of open and networked data on scientific pract...
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...
 
Software Management Plans and Software as Data
Software Management Plans and Software as DataSoftware Management Plans and Software as Data
Software Management Plans and Software as Data
 
Research Data Management from a Software Engineering Perspective
Research Data Management from a Software Engineering PerspectiveResearch Data Management from a Software Engineering Perspective
Research Data Management from a Software Engineering Perspective
 
'Let a Thousand ORCIDs Bloom': ORCID iDs and the ORCID Project at Imperial Co...
'Let a Thousand ORCIDs Bloom': ORCID iDs and the ORCID Project at Imperial Co...'Let a Thousand ORCIDs Bloom': ORCID iDs and the ORCID Project at Imperial Co...
'Let a Thousand ORCIDs Bloom': ORCID iDs and the ORCID Project at Imperial Co...
 
Research Data Management - A DIY Guide: What? Why? How?
Research Data Management - A DIY Guide: What? Why? How?Research Data Management - A DIY Guide: What? Why? How?
Research Data Management - A DIY Guide: What? Why? How?
 
Neural Networks, Machine Learning and Extended Mind
Neural Networks, Machine Learning and Extended MindNeural Networks, Machine Learning and Extended Mind
Neural Networks, Machine Learning and Extended Mind
 

Último

Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 

Último (20)

How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptx
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 

PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving Research Landscape

  • 1. PIDs, Data and Software: How libraries can support researchers in an evolving research landscape Sarah Stewart, The British Library M25 Consortium CPD25 Event – The Role of the Library in Supporting Research. London Mathematical Society, June 28, 2018
  • 2. www.bl.uk Outline • The Evolving (Digital) Research Landscape… • Data • Software • PIDs • Developing Research Support Services for Data and Software • Conclusion and Questions 2
  • 4. www.bl.uk An Evolving Research Landscape… • Research is ‘always already’ digital, and becoming increasingly linked and networked • Open Research – Fosters transparency, validity and reproducibility of research • Strong mandates in the UK from funders (E.g. UKRI, Wellcome) to make data open. • increasingly, push from publishers to make ‘non- traditional’ outputs such as data available on-line • A role for Linked Open Data (LOD)? 4
  • 5. www.bl.uk The Research Graph (GESIS) 5 http://researchgraph.org/augment-api/
  • 6. www.bl.uk Data and the Digital Research Landscape • Data as a research output (=credit and impact for researchers!) • Emergence of data journals, data repositories, global data-sharing initiatives, scientific working committees • Mandate from funders to make research data available for 10+ years – digital preservation • Force11 (2016): Make data FAIR – Findable, Accessible, Interoperable and Re-Useable • Data Management Plans as part of applications to funders (e.g. UKRI, Wellcome) 6
  • 7. www.bl.uk The Importance of Research Data Management… “In their parents' attic, in boxes in the garage, or stored on now-defunct floppy disks — these are just some of the inaccessible places in which scientists have admitted to keeping their old research data.” http://www.nature.com/news/scientists-losing-data-at-a-rapid-rate- 1.14416
  • 8. www.bl.uk Funder requirements… “Publicly funded research data are a public good, produced in the public interest, which should be made openly available with as few restrictions as possible…” RCUK Common Principles on Data Policy
  • 9. www.bl.uk RCUK Research Data Policy Coverage (2014) 9
  • 10. www.bl.uk What are Data? • Many formats, volumes, types, ranging from physical specimens and archival material to petabytes of high-throughput automated measurements or simulations • Language of data is taken from the STEM disciplines, but data also exists for the arts and humanities • Need a way to describe (to make discoverable/findable), store, preserve and ensure access, sharing, and re-use if this is possible (it may not be!) 10
  • 11. www.bl.uk UKRI Definition of Data “Research data are the evidence that underpins the answer to the research question, and can be used to validate findings regardless of its form (e.g. print, digital, or physical). These might be quantitative information or qualitative statements collected by researchers in the course of their work by experimentation, observation, modelling, interview or other methods, or information derived from existing evidence. Data may be raw or primary (e.g. direct from measurement or collection) or derived from primary data for subsequent analysis or interpretation (e.g. cleaned up or as an extract from a larger data set), or derived from existing sources where the rights may be held by others….The primary purpose of research data is to provide the information necessary to support or validate a research project's observations, findings or outputs.” – UKRI Concordat on Open Research Data, (2016) https://www.ukri.org/files/legacy/documents/concordatonopenresearchdata-pdf/
  • 13. www.bl.uk Software: What do I do with it? • Lots of emphasis on ‘data’ management, but software in research is often neglected. • Software is sensitive to changes in its ‘environment’ • There is a lot of variation inherent in software (languages, versions, licensing, etc.)
  • 14. www.bl.uk Software as ‘Data’ • ‘Software is used to create, interpret, present, manipulate and manage data’ (Software Sustainability Institute) • Data: ‘recorded factual material commonly retained by and accepted…as necessary to validate research findings’ (EPSRC) • Software = Data!
  • 16. www.bl.uk Software should be preserved if: • Software can’t be separated from the data or digital object. • Software is classified as a research output • Software has intrinsic value • More resources available at the Software Sustainability Institute: https://www.software.ac.uk/software-sustainability-institute
  • 17. www.bl.uk Treat software as valuable research output PyRDM Green Shoots project Zenodo integrates with GitHub College survey on distributed version control Software Sustainability Institute – I a fellow
  • 19. www.bl.uk Why Use Persistent Identifiers? • Use of persistent identifiers has increased as scholarly communications become increasingly digital. • ORCIDs and DOIs support open science through supporting interoperability in research infrastructures. • For instance, DataCite, CrossRef can use DOIs and ORCID iDs in addition to other metadata to map and link documents, data and researchers.
  • 20. www.bl.uk PIDs (Persistent Identifiers) • ORCID iDs (Open Researcher and Contributor IDs) • DOIs (Digital Object Identifiers) 20
  • 21. www.bl.uk What is an ORCID iD? • ‘Open Researcher & Contributor ID’ • Developed by ORCID, a non-profit community-owned organisation • Provides a solution to name ambiguity in research and scholarly communications • Unique, persistent identifier for you as a researcher/academic. Linked to your name, rather than to your institution • Can be applied to your research outputs to identify, validate and confirm your authorship • Can be used to track research outputs
  • 23. www.bl.uk ORCID promotes System Interoperability:
  • 24. www.bl.uk DataCite (and DataCite UK) • DataCite is a leading global non-profit organisation that provides persistent identifiers (DOIs) for research data. Our goal is to help the research community locate, identify, and cite research data with confidence. • Supports the creation and allocation of DOIs and accompanying metadata. • Provides services that support the enhanced search and discovery of research content. • Promotes data citation and advocacy through our community-building efforts and responsive communication and outreach materials. • DataCite UK is the UK’s national hub for the provision of persistent identifiers (DOIs) for research data. 24
  • 25. www.bl.uk DOIs (Digital Object Identifiers) • Persistent identifier used to uniquely identify objects (datasets, software, journal articles, theses), standardised by the International Standards Organisation (ISO) • Presented as an alphanumeric code consisting of a prefix and suffix separated by a slash ‘/’ . The ‘10’ at the start of the DOI positions the DOI within DOI namespace. E.g. 10.1037/rmh0000008 • Uses a ‘handle’ system in which a DOI is ‘resolvable’ through binding metadata (such as a URL) to the specific DOI that describes it. • DOI is persistent, so it is the publisher’s responsibility to update the metadata attached to the DOI, otherwise, the DOI will resolve to a dead link. 25
  • 27. DOIs and FAIR Data
      • DOIs help ensure that data (and metadata about that data) remain identifiable and accessible for the long term
      • Data with DOIs can be searched for and made discoverable and findable (through DataCite and Crossref, Google search, re3data)
      • Access and re-use conditions can be clarified. If the data cannot be made open, the metadata can explicitly state the terms and conditions of access.
  • 29. ‘If you like it, put a PID on it…’
  • 30. The Library can play a strong role in service support for research data, software and PIDs
  • 32. Why spend time on RDM?
      • It is not a distraction from ‘real work’.
      • You can work effectively and efficiently.
      • Save time and reduce frustration in the future.
      • Set up systems that work for you.
  • 33. Engaging Directly with Researchers
      • Embedded approach – meet with researchers in situ, in their labs and offices
      • One-on-one or group meetings
      • Departmental meetings to inform on policy changes and updates and provide insight into best practice.
  • 34. Outreach – Love Your Data!
      • PhD training on RDM basics and DMPonline (including a PhD-specific DMPonline template)
      • RDM ‘Drop-in Clinics’
      • RDM ‘Byte-Size’ sessions – informal sessions on various topics
      • Imperial Data Circus
      • Open Access Road Show
  • 35. Findings from Imperial College RDM Policy Development
      • 60-100% of a grant would be required to regenerate the data used in publications
      • Share of data that needs retaining to support publications: ~60%
      • Data storage capacity will have to grow significantly
      • Concerns around back-up and archiving, especially considering data volume
      • Popularity of cloud services (as opposed to College storage)
      • Researchers want a self-administered, secure, responsive solution for data sharing, storing and archiving; open APIs preferred
      • “Yes [storage] is really important. Basically, whenever we have been out to talk to researchers, that's the thing they have latched on to and want to talk about the most.” (10.1371/journal.pone.0114734)
  • 38. The Library Supporting Researchers
      Infrastructure
      • Consider workflows for research data
      • Assist in the development of research data management plans (use DMPonline)
      • Integration with existing systems (e.g. CRIS, grant systems)
      • Use your metadata – make work findable, discoverable and accessible
      Engagement
      • Clear, direct communication
      • Outreach and discussion
      • Many benefits for researchers – increased efficiency and impact of research

Editor's notes

  1. It may be difficult to think about what’s happening in 20 years’ time, but if policies change, your research might be discredited if there is no data to support it or possibly…
  2. This is current for now, but policies do change, so keep up to date with what your funder, institution or publisher requires.
  3. Most of us find that we have many calls on our time, and that packing everything that needs to be done into the week is often a challenge. That being the case, it’s easy to feel as though research data management is simply one more thing to add to an already endless to-do list – or worse, that it’s a distraction from real work. However, there are a number of key reasons why it’s worth paying some attention to it. Good data management does require an investment of effort, but ultimately it can save you time by helping you work more efficiently. You want to complete your research project to the best of your ability, but with minimum stress, and good research data management is one of the tools that can help you do that. Think about the frustration of trying to track down a fact or a document you know you have somewhere. Good research data management – setting up an organizational system that works for you, and ensuring everything is properly filed or labelled to enable re-identification and retrieval – can make life a lot easier. And it’s not just a matter of saving time and reducing unnecessary effort (though clearly that’s a major benefit): having everything well ordered can also help you get a better feel for the shape and scope of your research material, which in turn can enable you to spot patterns or connections that might otherwise get missed. It’s also well worth doing because the data you’re producing or working with is valuable. As well as being valuable for your own research, the data might ultimately be of use to other researchers. Having everything well organized and properly labelled also has the potential to save you a lot of time at the end of a research project, when it comes to deciding what to do with your data – but more of that later. Finally, there may be requirements imposed by your funding body and/or the university which you need to meet.