Talk for the JIBS User Group / RLUK joint event entitled Demystifying Research Data : don't be scared be prepared.
Description of the JISC DataPool project
2. DataPool - A project for the research life cycle
• To embed the management of research data into the infrastructure and culture of our institution
• To integrate data management throughout the research life cycle - from proposal to storing and sharing
http://datapool.soton.ac.uk/
3. DataPool
Building Capacity, Developing Skills, Supporting Researchers
Poster diagram, four numbered quadrants:
1 System
• SharePoint
• Data Repository
• MetaData log
2 Policy
• Ratified
• Embedded
• Implemented
3 Guidance and Training Support
• Developing Skills
• Supporting researchers
4
• Discipline Exemplars
• Re-use / integration case studies
Byatt, D. (D.R.Byatt@soton.ac.uk); Hancock, P.J. (p.j.hancock@soton.ac.uk); Hitchcock, S. (sh94r@ecs.soton.ac.uk); White, W. (whw@soton.ac.uk)
4. IDMB Roadmap Years 1-3 Key points:
POLICY
• Institutional policy (DataPool)
INFRASTRUCTURE
• Scalable and sustainable business model for storage
– Initial model (IDMB)
• more evidence required for further investment (DataPool)
• Institutional data repository
– SharePoint and EPrints (DataPool)
SUPPORT
• A one-stop shop for data management advice and guidance
– http://www.southampton.ac.uk/library/research/researchdata/ (DataPool)
• In progress
5. Research Data Management Policy
• Underpins the project
• An iterative process
– views gathered initially through designated “data” contacts within Faculties
• Will inform data management processes
• Will influence data management plans - required by most funders
http://www.calendar.soton.ac.uk/sectionIV/research-data-management.html
7. DataPool - Informing the Infrastructure
Looking for a simple deposit system to:
• Support the research data workflow
• Capture metadata from existing University systems
– 3 level
• Project
• Core
• Discipline
• Manage access and facilitate sharing
• Be geared towards the individual researcher
• Be flexible and able to deal with different disciplines
11. Guidance, Training and Support
• Data life cycle begins with the research proposal
• Provide a network of services to support the researcher from proposal to deposit, based on existing services and expertise
• Internal
– Research and Innovation Service
– Doctoral Training Centres
– Library
• External
– Digital Curation Centre
– Archaeology Data Service
12. DataPool Service Model
adapted with permission from Martin Lewis (2012) Working with other RDM actors, slide 2
(http://www.rluk.ac.uk/content/clarifying-roles-libraries-research-data-management-discussion-day-find-creative-solutions Firefox only)
13. Guidance, Training and Support
• To enhance the support provided to researchers
– Website for guidance
– Workshops
– Training materials
» Introducing Research Data by Scott, M et al (2012)
» http://eprints.soton.ac.uk/338816/
• To provide training for academic, professional and support staff in support of this role in areas such as:
– Writing data management plans
– Access and restrictions to access
– Creating metadata
– Storing and discarding
» Working with University of Oxford
At the launch event we presented this poster with the strapline “Building Capacity, Developing Skills, Supporting Researchers”. This very much represents the inter-relation of the various elements of supporting research data, both internal and external. The top left is a summary of the different aspects of the project. The top right is a diagram of the technical infrastructure, in which the MetaData log, or catalogue, will be a key element.
DataPool is not intended to meet all the demands of the IDMB Roadmap. Our aim is to develop three key elements of years 1-3, and these form the basis of the areas covered in this presentation. We did have a rough draft of the policy by the end of the IDMB project, but this needed further consultation and to be “legalesed” (!). We did have an initial business model for storage, but this needs further refinement and evidence. We have been looking at SharePoint, as it offers a collaborative space, as well as EPrints. IDMB recommended a one-stop shop, and this seems a “favoured” approach, though very often not possible. We have launched a website linked to our policy, and I see this as a hub that tries to draw things together while directing out to areas of expertise.
Optional slide - our slightly scary looking Calendar! It does contain a definition of research data. This includes the word “significant”, and we would see what that covers as discipline specific. The IDMB survey suggested that most researchers expect to keep their data forever, except where they don’t think they have data.
A key element in this area is looking for ways to reduce the manual element in managing data, so that researchers avoid entering the same data too many times. Our focus has been on SharePoint, to provide a catalogue entry form and collaborative working space, and EPrints, to provide a deposit space. We are looking at creating three levels of metadata: two very basic levels, in a who, what, when style, and the discipline level, submitted as XML sheets.
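As a rough illustration only, the three metadata levels could be modelled along the following lines. This is a minimal sketch, not the DataPool implementation: the class names, the who/what/when field names, the XML element names and all sample values are made up for the example, and nothing here reflects the actual SharePoint or EPrints forms.

```python
from dataclasses import dataclass
from datetime import date
import xml.etree.ElementTree as ET


@dataclass
class BasicLevel:
    """A basic who/what/when description (hypothetical field names)."""
    who: str
    what: str
    when: date


@dataclass
class DatasetRecord:
    """One catalogue record holding all three levels (illustrative only)."""
    project: BasicLevel        # project-level metadata
    core: BasicLevel           # core dataset-level metadata
    discipline_xml: str = ""   # discipline-level metadata, supplied as XML text

    def discipline_fields(self) -> dict:
        """Parse the discipline XML into a flat {tag: text} mapping."""
        if not self.discipline_xml:
            return {}
        root = ET.fromstring(self.discipline_xml)
        return {child.tag: (child.text or "").strip() for child in root}


# Placeholder values, invented purely for the example.
record = DatasetRecord(
    project=BasicLevel("Example research group", "Example project", date(2012, 1, 1)),
    core=BasicLevel("A. Researcher", "Example dataset", date(2012, 6, 1)),
    discipline_xml="<discipline><instrument>example</instrument></discipline>",
)
print(record.discipline_fields())
```

Keeping the two basic levels as plain fields and the discipline level as free XML mirrors the idea that the basic levels are the same for everyone, while each discipline can supply its own sheet.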
Still very much in development and still needs to be tested. Housed on the Eduserv Cloud.
The aim is to offer a collaborative space along with the catalogue record forms for projects and datasets, and data management plans. If possible, fields would be auto-filled from other University databases, provided they permit this use and we can have a key identifier.
Our third strand is Guidance, Training and Support. This is the area where library staff are most involved, but it is still very much collaborative work involving the expertise of those working in other areas. We are currently mapping the information literacy lens on the Researcher Development Framework to the research lifecycle to identify the skills required to support this. We will look at those areas where awareness is required, and those where knowledge is essential. This is not specifically for library staff but involves all the various services that support research.
We are developing a service model that takes advantage of the expertise of the different professionals. So in our institution the library has skills in digital preservation etc. Replace “information” with “data” in many of our everyday roles and it becomes clear we have many of the necessary skills. Is metadata any more than the cataloguing of data?
We launched our website in support of our policy. Mark Scott has developed a guide, “Introducing Research Data”, that gives some examples in the science/engineering area. There is also an alternative to the DCC lifecycle model. We are looking to develop a model for identifying the key skills required by support staff, based on the information literacy lens on the Researcher Development Framework, with the University of Oxford, who also have an MRD project. We will then use that model to carry out a training needs analysis to inform joint training sessions.
Website launched 6th July. The library is a trusted, more stable location, perhaps less susceptible to the influence of the winds that blow in other areas of an institution. It is written for the early career researcher and covers general information, with the expectation that subject-specific material will be addressed at Faculty/discipline level. Some of the pages are under development and we are looking for input from the academic community.