Closing the gap: connection points between DMPs and repositories
Sarah Jones, Sam Rust & Magdalena Drafiova
10 June 2019, Open Repositories conference, Hamburg
dmponline@dcc.ac.uk
Workshop aims and scope
• Share current activity on machine-actionable DMPs
• Discuss latest/forthcoming features in DMPonline
• Understand repository needs
• Map out how DMPs and repositories can connect
• Use workshop outcomes to inform DMProadmap
Agenda
Time   Session                                                        Name
9:00   Aims and introductions                                         Sarah, Sam & Magdalena
9:45   DMPs – moves to machine-actionability                          Sarah
10:15  Demo of DMPonline features                                     Sam & Magdalena
10:30  Coffee
11:00  Discussion: mapping connections between DMPs and repositories  All
12:10  Wrap-up discussion and next steps                              Sarah, Sam & Magdalena
https://tiny.cc/DMP-workshop
What does a DMP contain?
Info on project
• Grant number
• Project name, description
• PI details
• …..
Info on data
• Types of data to be created
• Description of data / metadata
• Data volume
• Data format
• …..
Data management
• Where data will be stored and backed up
• Project partners needing access
• Ethical issues
• …..
Plans for sharing
• Which repository data will be deposited in
• Who will have access / permission to reuse
• Plans for data licensing
• Length of preservation
• …..
http://www.dcc.ac.uk/resources/data-management-plans/checklist
Why make DMPs machine-actionable?
Promote information flow across systems to support:
• Data discovery
• Capacity planning
• Aggregation/integration
• Policy compliance
• ….
From Flickr by highwaysengland, CC BY 2.0
IDCC workshop
• Defining a ‘perfect DMP world’
• Map use cases and priorities
• Resulted in white paper on
recommendations
http://www.dcc.ac.uk/events/workshops/postcard-future-tools-and-services-perfect-dmp-world
maDMP priority areas
• Common standards and protocols
• Leveraging persistent identifiers (PIDs)
• Capacity planning (institutional & data centre)
• Share/publish/deposit DMPs
• Evaluation and monitoring
• Disciplinary tailoring and recommendations
https://riojournal.com/article/13086
Who is working on this?
Research Data Alliance (RDA) is a global organisation
supporting collaboration to promote data management and
open sharing
Active DMPs Interest Group
Exposing DMPs Working Group
- develop use cases of what can be shared, when and with whom
- provide a reference model and alternative strategies for exposing DMPs
https://www.rd-alliance.org/groups/exposing-data-management-plans-wg
DMP Common Standards Working Group
- develop a common data model with a core set of elements
- provide reference implementations of the data model
https://www.rd-alliance.org/groups/dmp-common-standards-wg
CC BY-SA 4.0
DMP Common Standards - Outputs
• Common data model for machine-actionable DMPs
  - to model information from standard DMPs
  - NOT a template
  - NOT a questionnaire
  - modular design
  - core set of elements
  - domain specific extensions
• Reference implementations
  - ready to use models
  - JSON, XML, RDF, etc.
• Guidelines for adoption of the common data model
  - requirements for supporting systems
  - pilot studies
www.rd-alliance.org - @resdatall
Example
• Current DMPs – model questionnaires
<administrative_data>
<question>Who will be the Principal Investigator?</question>
<answer>The PI will be John Smith from our university.</answer>
</administrative_data>
• Machine-actionable DMPs – model information
{
  "dc:creator": [ {
    "foaf:name": "John Smith",
    "@id": "orcid.org/0000-1111-2222-3333",
    "foaf:mbox": "mailto:jsmith@tuwien.ac.at",
    "madmp:institution": "AT-Vienna-University-of-Technology"
  } ]
}
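To illustrate why modelled information is easier to reuse than free-text answers, here is a minimal Python sketch that reads the creator fragment above. It assumes the fragment is wrapped in braces so that it parses as JSON; the function names `strip_prefix` and `extract_creators` are hypothetical and not part of any RDA or DMPonline tooling.

```python
import json

# The slide's fragment, wrapped in braces so it is a complete JSON object.
MADMP_FRAGMENT = """
{
  "dc:creator": [
    {
      "foaf:name": "John Smith",
      "@id": "orcid.org/0000-1111-2222-3333",
      "foaf:mbox": "mailto:jsmith@tuwien.ac.at",
      "madmp:institution": "AT-Vienna-University-of-Technology"
    }
  ]
}
"""


def strip_prefix(value, prefix):
    """Return value without a leading prefix, if the prefix is present."""
    return value[len(prefix):] if value.startswith(prefix) else value


def extract_creators(madmp_json):
    """Return name, ORCID iD, email and institution for each creator."""
    record = json.loads(madmp_json)
    creators = []
    for person in record.get("dc:creator", []):
        creators.append({
            "name": person.get("foaf:name"),
            # Keep only the identifier part of the ORCID reference.
            "orcid": strip_prefix(person.get("@id", ""), "orcid.org/"),
            # Keep only the address part of the mailto: link.
            "email": strip_prefix(person.get("foaf:mbox", ""), "mailto:"),
            "institution": person.get("madmp:institution"),
        })
    return creators


creators = extract_creators(MADMP_FRAGMENT)
```

A repository could use fields like these to pre-fill depositor details, which is not possible with the questionnaire-style answer without text mining.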
Principles
• Reuse existing standards, e.g. Dublin Core, PREMIS
• Use PIDs whenever possible, e.g. ORCID
• Use controlled vocabularies
• Develop own concepts and vocabularies only when needed
Model in consultation: DCC plans to test
https://www.lucidchart.com/invitations/accept/ee26bc71-01a6-442a-b946-5b9c910fb926
Some recent activities in DMPonline
Activities to gather more structured information for reuse
• Connecting to the RDA Metadata Standards Directory to allow standards to be selected
• Connecting to the OpenAIRE API to pull in grant details for H2020 projects
• Supporting export to Zenodo and RIOjournal (in development/test)
Metadata Standards Directory (MSD) added as an answer type
The tool pulls in information from the MSD API so users can browse and select metadata standards
Utilising EC grant IDs in plans
• Harvest grant IDs from OpenAIRE API
• Provide look up when entering project details
• Enables the DMP to be linked with other project outputs
https://www.youtube.com/watch?v=qYQf9viGTHE
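The look-up step described above can be sketched as a simple search over grant records that have already been harvested. This is an illustration only: the records, the field names (`grant_id`, `acronym`, `title`) and the function `lookup_grants` are hypothetical, and the harvesting call to the OpenAIRE API is deliberately left out.

```python
# Hypothetical records standing in for grants harvested from OpenAIRE.
HARVESTED_GRANTS = [
    {"grant_id": "000001", "acronym": "ALPHA", "title": "Example project Alpha"},
    {"grant_id": "000002", "acronym": "BETA", "title": "Example project Beta"},
    {"grant_id": "123456", "acronym": "GAMMA", "title": "Example project Gamma"},
]


def lookup_grants(query, grants, limit=5):
    """Return grants whose ID starts with the query or whose name contains it."""
    needle = query.strip().lower()
    if not needle:
        return []
    matches = [
        grant for grant in grants
        if grant["grant_id"].startswith(needle)
        or needle in grant["acronym"].lower()
        or needle in grant["title"].lower()
    ]
    # Cap the list so a look-up widget only shows a handful of suggestions.
    return matches[:limit]


by_id = lookup_grants("0000", HARVESTED_GRANTS)
by_name = lookup_grants("gamma", HARVESTED_GRANTS)
```

Storing the selected grant ID in the plan, rather than a typed project name, is what makes it possible to join the DMP to other outputs recorded against the same grant.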
Easing DMP publishing workflows
• Integration with Zenodo
• DMPonline provides an export to Zenodo option
• User signs in to Zenodo
• DMP is uploaded as a PDF and the metadata is pre-populated
• Integration with RIOjournal
• User registers for ARPHA writing tool and obtains API key
• DMPonline stores this API key and provides a button to allow export of DMP to ARPHA writing tool
• DMP import complies with pre-defined DMP article template in RIO
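As an illustration of the "pre-populates metadata" step for Zenodo, the sketch below builds a deposition metadata body from plan details. It is not DMPonline's implementation: the function `build_zenodo_metadata` and the sample values are hypothetical, and the field names are assumed to follow Zenodo's deposit REST API (including the `datamanagementplan` publication type). Signing in and uploading the PDF are not shown.

```python
import json


def build_zenodo_metadata(title, description, creators):
    """Build a JSON-serialisable metadata body describing a DMP."""
    return {
        "metadata": {
            "title": title,
            "description": description,
            # Assumed Zenodo vocabulary: a DMP is a kind of publication.
            "upload_type": "publication",
            "publication_type": "datamanagementplan",
            # Drop empty fields (e.g. a missing ORCID iD) from each creator.
            "creators": [
                {key: value for key, value in creator.items() if value}
                for creator in creators
            ],
        }
    }


# Hypothetical plan details for illustration.
payload = build_zenodo_metadata(
    title="Data management plan: example project",
    description="DMP exported as a PDF.",
    creators=[
        {"name": "Smith, John", "affiliation": "Example University", "orcid": None},
    ],
)
body = json.dumps(payload)
```

Building the metadata from fields already held in the plan saves the user from retyping them at deposit time.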
Full text API
Request from the user group to expand the API so that the full text of plans can be harvested
Potential to increase functionality, with a link into piloting of the RDA common standard for DMPs
Benefits and guidelines on using the API: http://www.dcc.ac.uk/blog/sam-rust-explains-benefits-api-dmponline
Exercise
1. What information held in a DMP could be useful to a repository?
• List all possible examples
• Be precise about the details needed
2. How could the information be used?
• Define use cases e.g. capacity planning, deposit workflow, recommendations…
3. Map out the workflow of information exchange
• what is needed?
• when – once, at intervals?
• how to transfer info?
• what format is it needed in?
Take 15-20 mins per section. Agree on details to report back.
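As a starting point for discussion, here is one way the capacity planning use case could look if a repository received structured DMP information. The record layout (`repository`, `expected_volume_gb`) and the function `volume_by_repository` are hypothetical and only meant to make the exercise concrete.

```python
from collections import defaultdict

# Hypothetical extracts from several DMPs.
plans = [
    {"repository": "Repository A", "expected_volume_gb": 120.0},
    {"repository": "Repository B", "expected_volume_gb": 15.5},
    {"repository": "Repository A", "expected_volume_gb": 30.0},
    {"repository": None, "expected_volume_gb": 4.0},
]


def volume_by_repository(plans):
    """Sum the expected data volume per named repository."""
    totals = defaultdict(float)
    for plan in plans:
        # Plans that have not yet chosen a repository are grouped together.
        repository = plan.get("repository") or "undecided"
        totals[repository] += plan.get("expected_volume_gb") or 0.0
    return dict(totals)


totals = volume_by_repository(plans)
```

The same questions from the exercise apply to this sketch: which fields are needed, how often they should be refreshed, and in what format they should be exchanged.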
What next?
1. User groups
- March 2019 in Amsterdam
- April 2019 in Manchester
- Read more: http://www.dcc.ac.uk/blog/dmponline-user-group-where-next
- Prioritising requests from the users (full text API, one notification per review, search case sensitivity…)
2. Development plans - https://github.com/DMPRoadmap/roadmap/projects/5
- Conditional questions (#1722)
- Adding school/department field
- Statistics for the administrators
3. Newsletter
Subscribe: https://dcc.us1.list-manage.com/subscribe?u=dfcae3470db3a4ac2196a3671&id=c4710c28b8
Most recent issue: https://mailchi.mp/8c11dbd6093a/dmponline-newsletter-may-2019
- Latest updates (summary from virtual drop-in sessions, news …)
- Software updates (latest releases…)
- How-to videos (e.g. https://www.youtube.com/watch?v=qYQf9viGTHE)
- Knowledge exchange
- Days to remember (monthly virtual drop-in sessions, user groups, conferences…)
Connect & stay in touch
Helpdesk: DMPonline@dcc.ac.uk
Twitter: @DMPonline and #ActiveDMPs
Blog: http://www.dcc.ac.uk/news/DMPonline
User group: http://tiny.cc/DMPonline-user-group
Github: https://github.com/DMPRoadmap
Slack: http://tiny.cc/DMPRoadmap-slack