1. CDL: Supporting
the Research Life Cycle
Perry Willett
University of California Curation Center
California Digital Library
2. University of California:
• 10 campuses, 5 medical centers, 3 national laboratories
• 238,000 students
• 190,000 faculty members and staff
• $4.7 billion in research funding and external grants
3. California Digital Library:
• Part of the University of California
• Located organizationally in the Office of the President
4. UC3:
Partnership between CDL | 10 UC campuses | Peer institutions
Provide solutions, services, resources for digital assets
Pool & distribute diverse experience, expertise, & resources
5. A life cycle approach
Create, edit, share, and save
data management plans
Curation repository:
store, manage, and share research data
Self-service tool for metadata creation
and submission to Merritt data
repository
Create and manage
long-term identifiers
collect
Open Access publishing services /
dynamic research platform
plan
manage
share
6. A life cycle approach
Create, edit, share, and save
data management plans
Curation repository:
store, manage, and share research data
Create and manage
long-term identifiers
plan
Open Access publishing services /
dynamic research platform
Self-service tool for metadata creation
and submission to Merritt data
repository
7. DMPTool
• Connect researchers to resources
to create a data management plan
• NSF and directorates, NIH, NEH,
IMLS, foundations plus
• Customizable
Meeting funding agencies data management plan requirements
Primary Functions
1. Step-by-step “wizard”
2. Templates and examples
3. Links to institutional resources and agency information
4. Plan publication and sharing
8. • Precise identification of a dataset
(DOI or ARK)
• Credit to data producers and data
publishers
• A link from the traditional literature
to the data
• Exposure and research metrics for
datasets
(Web of Knowledge, Google)
Primary Functions
1. Create long term identifiers
2. Manage identifiers (and associated
metadata) over time
3. Resolve identifiers
EZID
Long term identifiers made easy
@ezidCDL
9. A life cycle approach
Create, edit, share, and save
data management plans
Curation repository:
store, manage, and share research data
Create and manage
long-term identifiers
collect
Open Access publishing services /
dynamic research platform
Self-service tool for metadata creation
and submission to Merritt data
repository
10.
11.
12.
13.
14. A life cycle approach
Create, edit, share, and save
data management plans
Curation repository:
store, manage, and share research data
Create and manage
long-term identifiers
Open Access publishing services /
dynamic research platform
Self-service tool for metadata creation
and submission to Merritt data
repository
share
15. Merritt
• Developed and supported in-house
• “Model free”
– No prescriptive requirements regarding format, structure, metadata,
or genre
• UI and REST API
• Strongly versioned
– Any change to data or metadata triggers a new version
– All previous versions can be re-instantiated for retrieval
– Intra-version compression (forward deltas) minimizes storage
duplication
16. Merritt
• Metadata
– User supplied: descriptive, …
– System augmented: technical, structural, provenance
• Replication and audit
– Across two technologies
• OpenStack/Swift
• WAN NFS
– Across two locations
• UCLA (two internal replicas)
• UCSD/SDSC (three internal replicas)
18. • UC’s institutional repository and publishing
platform
• 90,000 publications
• 78 open access journals
• Repository for UC’s Open Access policy:
“…future research articles authored by faculty at all 10
campuses of UC will be made available to the public at
no charge.”
eScholarship
20. For more information
UC3 Data Management Planning Resources
http://www.cdlib.org/uc3
https://dash.berkeley.edu
https://dmptool.org
http://ezid.cdlib.org
Twitter:
– @ezidCDL
– @UC3CDL
– @TheDMPTool
– @CalDigLib
Email: uc3@ucop.edu
Notas del editor
But first a very brief context setting.
Serving the 10 UC campuses
226,000 students
134,000 faculty and staff
But first a very brief context setting.
Serving the 10 UC campuses
226,000 students
134,000 faculty and staff
But first a very brief context setting.
Serving the 10 UC campuses
226,000 students
134,000 faculty and staff
What is a data management plan?
A document that describes what you will do with your data during and after you complete your research
The DMPTool “walks” scientists through the process of developing a concise, but comprehensive data management plan that could enable good stewardship of data and meet requirements of sponsors and home institutions.
Partners: University of Virginia Library, University of Illinois at Urbana-Champaign Library, and DataONE, UCLA, UCSD
The California Digital Library and its partners were awarded a $590,000 grant from the Alfred P. Sloan Foundation to fund further development of the popular Data Management Planning Tool in 2013. The bulk of the grant will go to the UC Curation Center (UC3) at the CDL to fund improvements to the DMPTool including expanded functionality, training modules, documentation and the creation of an open-source community to sustain the DMPTool in the future. Project partners are the University of Virginia Library, University of Illinois at Urbana-Champaign Library, and DataONE