This document provides an overview of the Dataverse Network Project, which is a repository for research data hosted at Harvard University. It allows researchers to deposit, share, and organize their data in a curated network. Key features include long-term preservation of data and metadata, access and sharing capabilities, and archiving best practices to promote data access and reproducibility. Researchers can create individual dataverses to organize their studies and deposit data through a web interface or via software installation. The network supports various file types and formats and provides data citation and version control.
3. DATAVERSE
NETWORK PROJECT
• repository for research data that takes care of
long-term preservation
• employs old archival practices while allowing
researchers to keep control of and receive
recognition for their data
4. DATA MANAGEMENT
• provides access and sharing capabilities
• allows researchers to deposit data in organized,
curated and citable network
• promotes access and sharing
5. ARCHIVING
• metadata is exported to XML
• data files reformatted for long-term access
• all versions are kept
• metadata and data are replicated to multiple locations
through LOCKSS
๏ Lots of Copies Keep Stuff Safe (Standford University)
11. UNIQUE SERVICES
• tubular data sets
๏ files with rows and columns (SPSS, STATA, CSV) can be subset
๏ user can extract only some of the variables
• social network data
๏ data that describes a network of entities and relationships
๏ sets uploaded in GraphML format to provide flexibility
12. Data sharing and archiving with control
and recognition for data authors
Persistent Data Citations
linking data to publications
Customized Branding
or embed on your site
Support for All File Types
any format
Data Restrictions
& terms of use options set by data author
Rich data support for certain file
formats
SPSS, Stata, R Data
metadata extraction, subset & R analysis
FITS Data
metadata extraction
Social Network Data (GraphML)
smart queries & subsetting
Data Visualizations
for time series
Data management, standards
and archival best practices
General and Domain-Specific Metadata
following metadata standards
Data Versioning
preserve & cite previous versions
Traffic & Downloads Tracking
for your data with Guestbook
Permanent Storage
preservation format; copies in
multiple locations
Harvard Dataverse Network Features
Learn more at: thedata.org or start searching and uploading at thedata.harvard.edu
Coming soon in Dataverse 4.0:
• Redesign of the entire user interface
• Dataverses can contain other dataverses
• Simplified workflows for creating an account, a dataverse, and datasets as well as uploading files
• Terminology changes:
• Study now called Dataset
• Cataloging Information now called Metadata (general, domain-specific and file metadata)
• Collections are now dataverses
Want to participate in Dataverse
4.0 testing? Sign up @
http://tinyurl.com/DVUserTesting
13. USING DATAVERSE
• web interface
๏ individual researcher can create a dataverse through a web form
on the DVN and deposit their own data sets
๏ dataverses are organized in studies which are given a data citation
so they can be referenced
• software installation
๏ institution can intall it on their servers and create their own
Dataverse Network
18. DATAVERSE REPORT
• Due November 1
• create dataverse
• add midterm study to the dataverse
• add datasets to the midterm study
๏ interview instrument, recorded interview, interview
transcript, data management plan
19. DATAVERSE REPORT
• report on your experience using DVN
• 1 page write-up:
๏ what you liked, did not like
๏ what was useful, what was confusing
๏ did you have to do a lot of searching on the website for help
๏ try searching for data sets: was it easy or hard