Tobin Magle presents on using the Open Science Framework (OSF) for collaborative data management. He discusses why data management is important throughout the research cycle. Key aspects of successful data management plans for collaborative projects include assigning roles, using a shared workspace like OSF, and implementing version control. OSF allows researchers to organize projects into components and files, link to add-ons, and control access for contributors.
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
Collaborative Data Management using OSF
1. Collaborative data
management using OSF
Tobin Magle
Data Management Specialist
Morgan Library
12-07-2016
http://www.slideshare.net/CTobinMagle/collaborative-data-management-using-osf
2. Outline
• Intro to data management services
• What is data management?
• Why should I care?
• Data Management Planning
• Collaboration tool: Open Science Framework
3. My Background: molecular microbiology
(1) CT Magle et al Infect Immun. 2014 Feb;82(2):618-25. doi: 10.1128/IAI.00444-13. Epub 2013 Nov 25.
(2) Sun W, Tanaka TQ, Magle CT, et al.. Sci Rep. 2014 Jan 17;4:3743. doi: 10.1038/srep03743.
5. One on one meetings
• How do I write a DMP?
• How do I organize my data?
• How do I clean and format my data?
• How do I automate my analyses?
• How do I get my data ready to share?
6. Data archiving service
• CSU Digital Repository
• Over 100 Datasets
• Satisfy requirements for
manuscripts and grants
• At no cost <1 TB
• $150/TB for 5 years
• $300/TB for >5 years
8. What is data
management?
The policies, practices and procedures needed to
manage the storage, access and preservation of data
produced from a research project
10. Why should I care?
• Good for research integrity
• Good for you
• Public good
• Collaboration is hard
Full lecture by Keith Baggerly, Bioinformatician
(University of Texas, MD Anderson Cancer Center)
https://www.youtube.com/watch?v=7gYIs7uYbMo
http://www.nytimes.com/2011/07/08/health/research/08genes.html
11. Where does data management
fit into research?
Throughout the whole research cycle
24. What is a data management plan?
• A description of how you plan
to describe, preserve and
share your research data.
• Often required by funders
• Collaboration takes extra planning
25. Successful DMPs include
• A data inventory
• A strategy for describing the data
• A plan for preserving the data
• A method for access to the data
http://help.osf.io/m/60347/l/618674-creating-a-data-management-plan-dmp
26. Successful collaborative DMPs include
• A data inventory
• A strategy for describing the data
• A plan for preserving the data
• A method for access to the data
http://help.osf.io/m/60347/l/618674-creating-a-data-management-plan-dmp
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3143734
• Assigned Roles
• Shared work space
• Context
• Version control
28. Organization rules
• Be consistent
• One directory per project
• Separate subdirectories for
• Raw data
• Processed data
• Code
• Output
• Make raw data read-only
• Make README files
http://help.osf.io/m/60347/l/611391-organizing-files
As one of my colleagues so kindly put it, I should tell you all “that I’m a weirdo”
Authoring
New storage – some that can run under your desk
Analytic workflow
Lab notebook
Data management plan
Institutional platforms like vivo
Publishing platforms like OJS and Ubiquity Press