SlideShare una empresa de Scribd logo
1 de 4
Descargar para leer sin conexión
Twenty Questions for Research Data Management
These twenty questions are designed to prompt and assist your thinking, as a research student, a postdoc
or an academic researcher at the beginning of a research project, and to form the basis of a workable
research data management plan that can both guide your on-going data management activities and inform
others about the nature and availability of your research data.
They will help you determining how best to safeguard your data from loss, how to describe your datasets
in ways that assist both yourself when returning to them in the future and others in their subsequent
interpretation, and how to publish your data in ways that maximize their usefulness to others and bring
maximum academic scholarly credit to yourself, to reward your efforts in acquiring, analysing,
describing, interpreting and publishing them in the first place.
You may not have immediate answers to all these questions. But, by seeking advice from your research
supervisor, colleagues and others in your institution with responsibilities for data management, you
should endeavour to discover them. Then, once in a while, you should revisit these questions and see
whether your data management practices can be improved, updating your answers.
The nature of your data
1

What is the subject discipline (domain, field) to which your research data relates?
Possible responses:
Quantum physics.
Cell biology.
Ornithology.

2

What is the exact nature (range, scope) of your research data?
Possible responses:
Long-distance quantum communication using entangled photons.
Protein chemistry and electron microscopy of cell membrane proteins.
Video field recordings of avian behaviour, and their quantitative analysis.

3

In what format(s), will you store your data in the short term after acquisition?
Possible responses:
Questionnaire response data will be stored on my laptop in a Microsoft Office Access 2007 database.
Raw video recording on digital video tapes on the shelf above my desk, edited videos in .mov format on my
laptop. numerical analyses in a spreadsheet (Microsoft Office Excel 2007 format) on my laptop.
On my research group’s cloud-based secure DataStage research data file store, in Zeiss confocal 3D image
format.

4

Who owns the data arising from your research, and the intellectual property rights relating to
them?
Possible responses:
Myself alone.
Myself and my research group leader.
My university.

Data descriptions (metadata, “data about data”)
5

How will your research datasets be described?
Possible responses:
The only description will be the filenames on my hard drive.
The only description will be the column and row labels in my spreadsheets.
The data will be described in handwritten notes in my lab notebook.
I will save metadata describing the data files in electronic form.
6

How will these descriptive metadata be created or captured?
Possible responses:
Instrument metadata are automatically included in each data file.
The only metadata will be the title and short textual description that I will manually complete in the Web
submission form, when depositing each dataset in my university’s data repository.
My data descriptions will be saved in spreadsheets or word processor documents.
Rich metadata conforming to a Minimal Information Standard appropriate to my research field will be
recorded at the time of data acquisition, using a metadata entry form, and will thus be available as a
metadata file to accompany my datasets during submission of the data to a data repository.

Data sharing
7

With whom will you share your research data in the short term, before publication of any
papers arising from their interpretation?
Possible responses:
My research supervisor only.
Members of my research group and trusted external collaborators.
Anyone who asks for them.
Everyone, by publishing the data online, since our research community is committed to the rapid sharing of
research results.

Data storage and backup
8

Where will you store your data in the short term, after acquisition?
Possible responses:
On my laptop.
On the computer connected to the microscope.
On the research group’s DataStage filestore.

9

Who is responsible for the immediate day-to-day management, storage and backup of the
data arising from your research?
Possible responses:
Myself alone.
My research group’s data manager.
Our departmental IT staff, who manage our research group’s DataStage research data management system.

10

How frequently will your research data be backed up for short-term data security?
Possible responses:
Whenever I remember to do so.
Nightly, using our research group’s DataStage research data management system connected to the
University’s automated backup service.

Data archiving
11

Where will your research data be archived for long-term preservation?
Possible responses:
Selected data will be included in the figures and tables of research papers published by my research group,
but we have no plans to archive and publish the full datasets.
As supplementary files attached to my journal articles on the publisher’s web site.
In the University’s DataBank data repository, run by the library service.
In appropriate genomics databases run by the European Bioinformatics Institute.

12

When will your research data be moved to a secure archive for long-term preservation and
publication?
Possible responses:
Our research data are already securely stored in an institutional data server.
Nightly.
Upon completion of each set of experiments.
When my research group leader decides it is appropriate.
Immediately after publication of my thesis.
Upon submission of our Nature paper, so that the data are available for reviewers.

13

Who will decide which of your research data are worth preserving?
Possible responses:
Myself alone.
Myself, in consultation with my research supervisor.
My research supervisor alone.

14

How (i.e. by what physical or electronic method) will you transfer your research datasets to
their long-term archive, under the curatorial care of a separate third-party, e.g. a data
repository?
Possible responses:
On physical hard drives that I will bring back from my field site by air.
By e-mailing files to our librarian.
By completion of the Web-based database submission form and uploading of the data files over the Internet.
By automated data packaging and repository submission over the Web from my local DataStage filestore,
using the SWORD repository submission protocol.

Data publication
15

For how long will you embargo your research data before it is published for others to see and
use?
Possible responses:
We will allow immediate public access to the data.
For one year, to permit us to exploit our hard-won research results.
Until the journal article describing our results has been published.

16

Why is public access to your research data to be restricted (if indeed it is)?
Possible responses:
We intend to make a patent application, and must avoid prior disclosure.
Don’t want to make locations of members of endangered species available to poachers.
The research data are confidential because of the arrangement my research group has made with the
commercial partner sponsoring our research.
My data form part of a long-term study upon which my research group is entirely reliant for its on-going
research publications and academic reputation. We only share this with trusted colleagues.
Confidential human patient data.
Questionnaire data collected in confidence from individuals – anonymized averaged data will be published.

17

Under what data-sharing license will you publish your research data?
Possible responses:
What is a data-sharing license?
Under a Creative Commons Open Data CC Zero public domain dedication and waiver, since my research
data are not covered by copyright.
Using a Creative Commons Attribution License, since my image data are copyrightable.

18

What persistent identifiers will be used to permit correct citation of your datasets?
Possible responses:
A Digital Object Identifier (DOI) issued by DataCite.
The accession number for the dataset issued by the database to which it is submitted.

19

What metadata will be published with the data to make them interpretable and reusable?
Possible responses:
I will expect users to be able to interpret the column and row labels in my spreadsheets.
The dataset will be described in the journal article we will publish, but will have no other metadata beyond
those required by the repository for data citation: Author, Date, Title, Source, Identifier.
An XML metadata file created in conformance with a Minimal Information standard will be submitted to the
repository as part of the data package, along with the data files.

Future data management
20

Who will be responsible for your data, once you have left your present research group?
Possible responses:
At this stage, I have no idea.
I’ll take my data with me and maintain responsibility.
My supervisor will make appropriate arrangements.
I hope the journal will maintain access to the supplementary information files associated with my article.
My University will assume long-term responsibility for the data I have chosen to preserve in its data archive.

----Notes
Creative Commons: Creative Commons is a non-profit organization that has developed a legal and technical
infrastructure for the licensing of copyright material and data in a standardised and machine-readable manner,
thereby facilitating open publication, sharing and innovation in the digital age.
DataCite: DataCite is an international organization that manages the issue of DOIs (Digital Object Identifiers) for
datasets. (DOIs are more commonly used to identify journal articles).
DataStage and DataBank: DataStage is a simple research data filestore and repository data submission system,
designed for deployment at the research group level. DataBank is a data repository for archiving and publishing
research data, designed for deployment at the institutional level. Both are open-source services for local or cloud
deployment developed together at Oxford University within the JISC University Modernization Fund DataFlow
Project, and both are now available in beta versions for third-party installation and use. Full Version 1.0 releases of
both DataStage and DataBank are scheduled for May 2012.
European Bioinformatics Institute: The EBI houses Europe’s primary databases for molecular sequence data,
genomics and bioinformatics, and shares data daily with similar institutions in the United States and Japan.
Minimal Information Standards for life science research specify minimal metadata requirements for certain types of
research data, are integrated by the MIBBI Project (Minimum Information for Biological and Biomedical
Investigations), and are described in [1].
SWORD2: The SWORD2 repository submission protocol is a standard protocol for on-line submissions to text or
data repositories.

Reference
[1]

Taylor et al. (2008). Promoting coherent minimum reporting guidelines for biological and biomedical
investigations: the MIBBI project. Nature Biotechnology 26 (8): 889-896. doi:10.1038/nbt0808-889.

Twenty Questions for Research Data Management was created by David Shotton, University of Oxford.
The original of this document is available from http://datamanagementplanning.wordpress.com/
2012/03/07/twenty-questions-for-research-data-management/.

This document is licensed under a Creative Commons Attribution 3.0 Unported License.

Más contenido relacionado

Destacado

Destacado (9)

Introduction to Research Data Management - 2014-02-26 - Mathematical, Physica...
Introduction to Research Data Management - 2014-02-26 - Mathematical, Physica...Introduction to Research Data Management - 2014-02-26 - Mathematical, Physica...
Introduction to Research Data Management - 2014-02-26 - Mathematical, Physica...
 
Preparing Your Research Material for the Future - 2016-02-22 - Humanities Div...
Preparing Your Research Material for the Future - 2016-02-22 - Humanities Div...Preparing Your Research Material for the Future - 2016-02-22 - Humanities Div...
Preparing Your Research Material for the Future - 2016-02-22 - Humanities Div...
 
Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...
Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...
Research Data Management: An Overview - 2014-05-12 - Humanities Division, Uni...
 
Preparing Your Research Material for the Future - 2015-05-20 - Humanities Div...
Preparing Your Research Material for the Future - 2015-05-20 - Humanities Div...Preparing Your Research Material for the Future - 2015-05-20 - Humanities Div...
Preparing Your Research Material for the Future - 2015-05-20 - Humanities Div...
 
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
 
Data Management Planning for Researchers - An Introduction - 2015-11-04 - Un...
 Data Management Planning for Researchers - An Introduction - 2015-11-04 - Un... Data Management Planning for Researchers - An Introduction - 2015-11-04 - Un...
Data Management Planning for Researchers - An Introduction - 2015-11-04 - Un...
 
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
Preparing Your Research Data for the Future - 2015-06-08 - Medical Sciences D...
 
Introduction to Research Data Management - 2016-02-03 - MPLS Division, Univer...
Introduction to Research Data Management - 2016-02-03 - MPLS Division, Univer...Introduction to Research Data Management - 2016-02-03 - MPLS Division, Univer...
Introduction to Research Data Management - 2016-02-03 - MPLS Division, Univer...
 
Introduction to Relational Databases
Introduction to Relational DatabasesIntroduction to Relational Databases
Introduction to Relational Databases
 

Más de Research Support Team, IT Services, University of Oxford

Más de Research Support Team, IT Services, University of Oxford (16)

Preparing Your Research Material for the Future - 2018-06-08 - Humanities Div...
Preparing Your Research Material for the Future - 2018-06-08 - Humanities Div...Preparing Your Research Material for the Future - 2018-06-08 - Humanities Div...
Preparing Your Research Material for the Future - 2018-06-08 - Humanities Div...
 
Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...
Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...
Preparing Your Research Material for the Future - 2017-02-22 - Humanities Div...
 
Research Data Management Plan: How to Write One - 2017-02-01 - University of ...
Research Data Management Plan: How to Write One - 2017-02-01 - University of ...Research Data Management Plan: How to Write One - 2017-02-01 - University of ...
Research Data Management Plan: How to Write One - 2017-02-01 - University of ...
 
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
 
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
 
Writing a Research Data Management Plan - 2016-11-09 - University of Oxford
Writing a Research Data Management Plan - 2016-11-09 - University of OxfordWriting a Research Data Management Plan - 2016-11-09 - University of Oxford
Writing a Research Data Management Plan - 2016-11-09 - University of Oxford
 
Preparing Your Research Material for the Future 2016-05-16 - Humanities Divis...
Preparing Your Research Material for the Future 2016-05-16 - Humanities Divis...Preparing Your Research Material for the Future 2016-05-16 - Humanities Divis...
Preparing Your Research Material for the Future 2016-05-16 - Humanities Divis...
 
Preparing Your Research Material for the Future - 2015-11-16 - Humanities Div...
Preparing Your Research Material for the Future - 2015-11-16 - Humanities Div...Preparing Your Research Material for the Future - 2015-11-16 - Humanities Div...
Preparing Your Research Material for the Future - 2015-11-16 - Humanities Div...
 
Preparing Your Research Data for the Future - 2015-03-02 - University of Oxfo...
Preparing Your Research Data for the Future - 2015-03-02 - University of Oxfo...Preparing Your Research Data for the Future - 2015-03-02 - University of Oxfo...
Preparing Your Research Data for the Future - 2015-03-02 - University of Oxfo...
 
RDM key resources handout (humanities version)
RDM key resources handout (humanities version)RDM key resources handout (humanities version)
RDM key resources handout (humanities version)
 
Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
 
Data Management Planning for Researchers - 2014-10-27 - University of Oxford
Data Management Planning for Researchers -  2014-10-27 - University of OxfordData Management Planning for Researchers -  2014-10-27 - University of Oxford
Data Management Planning for Researchers - 2014-10-27 - University of Oxford
 
DHOxSS 2014 - Introduction to Relational Databases
DHOxSS 2014 - Introduction to Relational DatabasesDHOxSS 2014 - Introduction to Relational Databases
DHOxSS 2014 - Introduction to Relational Databases
 
Sample dataset for documentation exercise (humanities version)
Sample dataset for documentation exercise (humanities version)Sample dataset for documentation exercise (humanities version)
Sample dataset for documentation exercise (humanities version)
 
Resources for Research Data Managers - handout
Resources for Research Data Managers -  handoutResources for Research Data Managers -  handout
Resources for Research Data Managers - handout
 
Resources for Research Data Managers - 2014-05-28 - University of Oxford
Resources for Research Data Managers - 2014-05-28 - University of OxfordResources for Research Data Managers - 2014-05-28 - University of Oxford
Resources for Research Data Managers - 2014-05-28 - University of Oxford
 

Último

Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxShobhayan Kirtania
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 

Último (20)

Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptx
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 

Twenty questions for research data management

  • 1. Twenty Questions for Research Data Management These twenty questions are designed to prompt and assist your thinking, as a research student, a postdoc or an academic researcher at the beginning of a research project, and to form the basis of a workable research data management plan that can both guide your on-going data management activities and inform others about the nature and availability of your research data. They will help you determining how best to safeguard your data from loss, how to describe your datasets in ways that assist both yourself when returning to them in the future and others in their subsequent interpretation, and how to publish your data in ways that maximize their usefulness to others and bring maximum academic scholarly credit to yourself, to reward your efforts in acquiring, analysing, describing, interpreting and publishing them in the first place. You may not have immediate answers to all these questions. But, by seeking advice from your research supervisor, colleagues and others in your institution with responsibilities for data management, you should endeavour to discover them. Then, once in a while, you should revisit these questions and see whether your data management practices can be improved, updating your answers. The nature of your data 1 What is the subject discipline (domain, field) to which your research data relates? Possible responses: Quantum physics. Cell biology. Ornithology. 2 What is the exact nature (range, scope) of your research data? Possible responses: Long-distance quantum communication using entangled photons. Protein chemistry and electron microscopy of cell membrane proteins. Video field recordings of avian behaviour, and their quantitative analysis. 3 In what format(s), will you store your data in the short term after acquisition? Possible responses: Questionnaire response data will be stored on my laptop in a Microsoft Office Access 2007 database. Raw video recording on digital video tapes on the shelf above my desk, edited videos in .mov format on my laptop. numerical analyses in a spreadsheet (Microsoft Office Excel 2007 format) on my laptop. On my research group’s cloud-based secure DataStage research data file store, in Zeiss confocal 3D image format. 4 Who owns the data arising from your research, and the intellectual property rights relating to them? Possible responses: Myself alone. Myself and my research group leader. My university. Data descriptions (metadata, “data about data”) 5 How will your research datasets be described? Possible responses: The only description will be the filenames on my hard drive. The only description will be the column and row labels in my spreadsheets. The data will be described in handwritten notes in my lab notebook. I will save metadata describing the data files in electronic form.
  • 2. 6 How will these descriptive metadata be created or captured? Possible responses: Instrument metadata are automatically included in each data file. The only metadata will be the title and short textual description that I will manually complete in the Web submission form, when depositing each dataset in my university’s data repository. My data descriptions will be saved in spreadsheets or word processor documents. Rich metadata conforming to a Minimal Information Standard appropriate to my research field will be recorded at the time of data acquisition, using a metadata entry form, and will thus be available as a metadata file to accompany my datasets during submission of the data to a data repository. Data sharing 7 With whom will you share your research data in the short term, before publication of any papers arising from their interpretation? Possible responses: My research supervisor only. Members of my research group and trusted external collaborators. Anyone who asks for them. Everyone, by publishing the data online, since our research community is committed to the rapid sharing of research results. Data storage and backup 8 Where will you store your data in the short term, after acquisition? Possible responses: On my laptop. On the computer connected to the microscope. On the research group’s DataStage filestore. 9 Who is responsible for the immediate day-to-day management, storage and backup of the data arising from your research? Possible responses: Myself alone. My research group’s data manager. Our departmental IT staff, who manage our research group’s DataStage research data management system. 10 How frequently will your research data be backed up for short-term data security? Possible responses: Whenever I remember to do so. Nightly, using our research group’s DataStage research data management system connected to the University’s automated backup service. Data archiving 11 Where will your research data be archived for long-term preservation? Possible responses: Selected data will be included in the figures and tables of research papers published by my research group, but we have no plans to archive and publish the full datasets. As supplementary files attached to my journal articles on the publisher’s web site. In the University’s DataBank data repository, run by the library service. In appropriate genomics databases run by the European Bioinformatics Institute. 12 When will your research data be moved to a secure archive for long-term preservation and publication?
  • 3. Possible responses: Our research data are already securely stored in an institutional data server. Nightly. Upon completion of each set of experiments. When my research group leader decides it is appropriate. Immediately after publication of my thesis. Upon submission of our Nature paper, so that the data are available for reviewers. 13 Who will decide which of your research data are worth preserving? Possible responses: Myself alone. Myself, in consultation with my research supervisor. My research supervisor alone. 14 How (i.e. by what physical or electronic method) will you transfer your research datasets to their long-term archive, under the curatorial care of a separate third-party, e.g. a data repository? Possible responses: On physical hard drives that I will bring back from my field site by air. By e-mailing files to our librarian. By completion of the Web-based database submission form and uploading of the data files over the Internet. By automated data packaging and repository submission over the Web from my local DataStage filestore, using the SWORD repository submission protocol. Data publication 15 For how long will you embargo your research data before it is published for others to see and use? Possible responses: We will allow immediate public access to the data. For one year, to permit us to exploit our hard-won research results. Until the journal article describing our results has been published. 16 Why is public access to your research data to be restricted (if indeed it is)? Possible responses: We intend to make a patent application, and must avoid prior disclosure. Don’t want to make locations of members of endangered species available to poachers. The research data are confidential because of the arrangement my research group has made with the commercial partner sponsoring our research. My data form part of a long-term study upon which my research group is entirely reliant for its on-going research publications and academic reputation. We only share this with trusted colleagues. Confidential human patient data. Questionnaire data collected in confidence from individuals – anonymized averaged data will be published. 17 Under what data-sharing license will you publish your research data? Possible responses: What is a data-sharing license? Under a Creative Commons Open Data CC Zero public domain dedication and waiver, since my research data are not covered by copyright. Using a Creative Commons Attribution License, since my image data are copyrightable. 18 What persistent identifiers will be used to permit correct citation of your datasets?
  • 4. Possible responses: A Digital Object Identifier (DOI) issued by DataCite. The accession number for the dataset issued by the database to which it is submitted. 19 What metadata will be published with the data to make them interpretable and reusable? Possible responses: I will expect users to be able to interpret the column and row labels in my spreadsheets. The dataset will be described in the journal article we will publish, but will have no other metadata beyond those required by the repository for data citation: Author, Date, Title, Source, Identifier. An XML metadata file created in conformance with a Minimal Information standard will be submitted to the repository as part of the data package, along with the data files. Future data management 20 Who will be responsible for your data, once you have left your present research group? Possible responses: At this stage, I have no idea. I’ll take my data with me and maintain responsibility. My supervisor will make appropriate arrangements. I hope the journal will maintain access to the supplementary information files associated with my article. My University will assume long-term responsibility for the data I have chosen to preserve in its data archive. ----Notes Creative Commons: Creative Commons is a non-profit organization that has developed a legal and technical infrastructure for the licensing of copyright material and data in a standardised and machine-readable manner, thereby facilitating open publication, sharing and innovation in the digital age. DataCite: DataCite is an international organization that manages the issue of DOIs (Digital Object Identifiers) for datasets. (DOIs are more commonly used to identify journal articles). DataStage and DataBank: DataStage is a simple research data filestore and repository data submission system, designed for deployment at the research group level. DataBank is a data repository for archiving and publishing research data, designed for deployment at the institutional level. Both are open-source services for local or cloud deployment developed together at Oxford University within the JISC University Modernization Fund DataFlow Project, and both are now available in beta versions for third-party installation and use. Full Version 1.0 releases of both DataStage and DataBank are scheduled for May 2012. European Bioinformatics Institute: The EBI houses Europe’s primary databases for molecular sequence data, genomics and bioinformatics, and shares data daily with similar institutions in the United States and Japan. Minimal Information Standards for life science research specify minimal metadata requirements for certain types of research data, are integrated by the MIBBI Project (Minimum Information for Biological and Biomedical Investigations), and are described in [1]. SWORD2: The SWORD2 repository submission protocol is a standard protocol for on-line submissions to text or data repositories. Reference [1] Taylor et al. (2008). Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the MIBBI project. Nature Biotechnology 26 (8): 889-896. doi:10.1038/nbt0808-889. Twenty Questions for Research Data Management was created by David Shotton, University of Oxford. The original of this document is available from http://datamanagementplanning.wordpress.com/ 2012/03/07/twenty-questions-for-research-data-management/. This document is licensed under a Creative Commons Attribution 3.0 Unported License.