SlideShare una empresa de Scribd logo
1 de 52
Creating a Data Management Plan
Kristin Briney, PhD
Data Services Librarian
This Session Will Answer
• Why am I being asked to create a DMP?
• What are the key parts of a DMP?
• How do I translate my research to each of
these parts?
You Will Leave With
• An understanding of the main parts of a data
management plan
• Knowledge of where to find resources and
assistance
 Rough outline of your data management plan
WHY AM I BEING ASKED TO CREATE
A DATA MANAGEMENT PLAN?
Why Data? Why Now?
• Data are DIGITAL
– Easy to copy and share
– Difficult to preserve
• Data are COMPUTABLE
– New avenues of research like data mining
• Data represent a FINANCIAL INVESTMENT
– Poor research funding climate
– Can no longer ignore data as a scholarly product
Many Funders Require DMPs
• NSF
• NEH
• NIH
• NOAA
• NASA
• …even more funders will require DMPs soon!
– White House OSTP Public Access memo
The Funder Perspective
• Data is a scholarly resource
– Data sharing akin to scholarly publishing
• Barriers to sharing are
– Organization
– Documentation
– Long-term management and preservation
 Hence data management plans
DMPs Help You Too!
• Don’t loose data
• Find data more easily
• Easier to analyze organized, documented data
• Avoid accusations of fraud & misconduct
• Get credit for your data
• Don’t drown in irrelevant data!
For each minute of planning at
beginning of a project, you will save
10 minutes of headache later
DMPs Help You Too!
A data management plan will make conducting
research easier for you…
…So if you are required to create a DMP, why
not use it to improve your practices?
WHAT ARE THE KEY PARTS OF A
DATA MANAGEMENT PLAN?
Actual NSF DMP Requirements
• The types of data, samples, physical
collections, software, curriculum materials, and other
materials to be produced in the course of the project
• The standards to be used for data and metadata format and
content
• Policies for access and sharing including provisions for
appropriate protection of
privacy, confidentiality, security, intellectual property, or
other rights or requirements
• Policies and provisions for re-use, re-distribution, and the
production of derivatives
• Plans for archiving data, samples, and other research
products, and for preservation of access to them
http://www.nsf.gov/pubs/policydocs/pappguide/nsf13001/gpg_2.jsp#dmp
Key Questions
1. What data will I create?
2. What standards will I use to document the
data?
3. How will I protect private/secure/confidential
data?
4. How will I archive and preserve the data?
5. How will I provide access to and allow reuse
of the data?
Be Aware
• Actual requirements vary by funder and
division
• Look up your requirements before you write
your DMP
HOW DO I TRANSLATE MY RESEARCH
TO EACH OF THESE PARTS?
1. WHAT TYPES OF DATA WILL I
CREATE?
What Are Data?
• “Research data is defined as the recorded
factual material commonly accepted in the
scientific community as necessary to validate
research findings”
– OMB Circular A-110
http://www.whitehouse.gov/omb/circulars_a110
What Are Data?
• Observational
– Sensor data, telemetry, survey data, sample
data, images
• Experimental
– Gene sequences, chromatograms, toroid magnetic
field data
• Simulation
– Climate models, economic models
• Derived or compiled
– Text and data mining, compiled database, 3D
models, data gathered from public documents
What Not To Share
• Laboratory notebooks
• Preliminary analyses
• Drafts of scientific papers
• Plans for future research
• Peer reviews or communications with
colleagues
• Physical Samples
No Data?
• Still need a data management plan
• Plans with no data and no sharing will likely be
examined more closely
– Carefully explain situation if you are in this
position
Exercise
• Conduct a quick inventory of the data you will
acquire
– What data will you collect?
– Is your data unique?
– How big will the data be?
– How fast will the data grow?
2. WHAT STANDARDS WILL I USE TO
DOCUMENT THE DATA?
What would someone unfamiliar
with your data need in order to find,
evaluate, understand, and reuse
them?
Documentation
• Consider the difference in documenting for
– someone inside your lab
– someone outside your lab but in your field
– someone outside your field
• Audience matters!
Documentation
Methods
• How the data were
gathered
• How the data should be
interpreted
• What you did
– Limitations on what you did
• …build trust in your data
Metadata
• What you’re looking at
• Who made it and when
• How it got there
• What it means
• What you can do with it
• …before you even look at
the file
Methods
• Examples of methods to document
– Code
– Survey
– Codebook
– Data dictionary
– Anything that lets someone reproduce your results
• Don’t forget the units!
Metadata
• Look for a metadata scheme before you collect
the data!
– Lots of metadata schemas available
– Easier to record metadata when collecting data than
to convert later
• Consult
– Disciplinary repository
• Repositories usually have required metadata schemas
– Your peers
– Subject librarian
Metadata Example: Dublin Core
• Contributor
– Jane Collaborator
• Creator
– Kristin Briney
• Date
– 2013 Apr 15
• Description
– A microscopy image of
cancerous breast tissues
under 20x zoom. This image is
my control, so it has only the
standard staining describe on
2013 Feb 2 in my notebook.
• Format
– JPEG
• Identifier
– IMG00057.jpg
• Relation
– Same sample as images
IMG00056.jpg and
IMG00055.jpg
• Subject
– Breast cancer
• Title
– Cancerous breast tissue
control
Exercise
• What methods information do you need to
preserve?
• What metadata standard will you use for your
data? -OR- Who will you contact to find a
relevant standard?
3. HOW WILL I PROTECT
PRIVATE/SECURE/CONFIDENTIAL DATA?
Security Issues
• Does your data fall under the following?
– HIPAA
• Health information
– FERPA
• Student information
– FISMA
• Government subcontractor
– Human subject research, etc.
 Ask for help!
Security Issues
• Secure storage
• Controlled access
• De-identification of personal information
• Security training
Security Questions
• Access permissions
– Who is allowed to access the data?
• Sharing
– Am I required to share? Can I actually share?
– Despite requirements, some data can’t be shared
• Responsibility
– Who will make sure the data stays secure?
UWM Security Resources
• UWM Information Security Office
– Visit: https://www4.uwm.edu/itsecurity/
– Email: infosec@uwm.edu
• Certificate in Information Security
• HIPAA
– https://www4.uwm.edu/legal/hipaa/index.cfm
• FERPA
– http://www4.uwm.edu/academics/ferpa.cfm
Exercise
• Do any regulations apply to your data?
• If so, who is allowed to access your secure
data? Who will be responsible for data
security?
4. HOW WILL I ARCHIVE AND
PRESERVE THE DATA?
Archiving Is Not Storage
• Storage is keeping files to access
• Archiving is about preservation
– Data should be readable and usable
– Data should be uncorrupted
• We can’t read some digital files from 10 years
ago
– This is what good digital preservation solves
Side Note
• If federally funded, you are required to retain
your data “for a period of three years from the
date of submission of the final expenditure
report.” AT LEAST.
• Better to keep on hand for at least 6 years
– Recent retraction in 6-year old paper for failure to
provide original data
• Preservation not an abstract issue
http://www.whitehouse.gov/omb/circulars_a110#53
http://retractionwatch.wordpress.com/2013/07/19/jci-paper-retracted-for-duplicated-panels-after-authors-cant-provide-
original-data/
File Formats
• Easy way to ensure long-term usability
• Use open file formats
– Open and standardized
– Well documented
– In wide use
– Examples: .txt, .tiff, .csv, .dbf
• Transform your data now, not later
– Keep both file types
Other Preservation Concerns
• Obsolescence
– Preserve software along with data
• Deterioration
– Keep more than 1 copy to avoid corruption
• Media
– ie. Can you still read a floppy disk?
– Periodically move data off outdated media
Find a Trustworthy Partner
• Find outside help
– Servers come and go, so do labs
• Off campus
– Disciplinary data repository
– Journal that accepts data
• Let someone else worry about this
Exercise
• What open file formats will you use to help
preserve your data?
• If there isn’t an adequate open format, what
software and hardware will you preserve?
5. HOW WILL I PROVIDE ACCESS TO
AND ALLOW REUSE OF THE DATA?
Why Share?
• Get more credit for your work
– In “studies that created gene expression
microarray data, we found that studies that made
data available in a public repository received 9% …
more citations than similar studies for which the
data was not made available”
– “The citation boost varied with date of dataset
deposition: a citation boost was most clear for
papers published in 2004 and 2005, at about 30%”
• Get credit for unpublishable results
https://peerj.com/preprints/1/ (2013 study)
Why Share?
• Make your funder happy
• Helps you find and use your data later
• Disprove misconduct or fraud accusations
• Stimulate new research
Audience
• Who is the audience for this data?
– Coworkers?
– Disciplinary/institutional colleague?
– Researchers in allied fields?
– Anyone?
• Audience will determine how to share the
data
Ways To Provide Access
• Hands-off options preferable
– Journal
– Disciplinary repository
• Embargoes may be possible here
– UWM Digital Commons
• Small, discrete datasets
• Other options
– By request
– On your lab website
Exercise
• Who is the audience for your data?
• Which way will you provide access?
RESOURCES
Resources
• Data Services Librarian
– briney@uwm.edu
• Data management information
– dataplan.uwm.edu
• UWM Information Security Office
– infosec@uwm.edu
Thank You
• This presentation is available on Slideshare
– http://www.slideshare.net/kbriney
• The content of this presentation is licensed under a Creative
Commons Attribution 3.0 Unported License (CC BY)
• Some content used with permission from Brad Houston and
Dorothea Salo
Questions?

Más contenido relacionado

La actualidad más candente

Data Science in Medicine and Health
Data Science in Medicine and HealthData Science in Medicine and Health
Data Science in Medicine and HealthSteve Tsang
 
Patient recruitment
Patient recruitmentPatient recruitment
Patient recruitmentswati2084
 
Lecture 19 research ethics (2)
Lecture 19 research ethics (2)Lecture 19 research ethics (2)
Lecture 19 research ethics (2)Dr Ghaiath Hussein
 
Handling Third Party Vendor Data_Katalyst HLS
Handling Third Party Vendor Data_Katalyst HLSHandling Third Party Vendor Data_Katalyst HLS
Handling Third Party Vendor Data_Katalyst HLSKatalyst HLS
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing dataSarah Jones
 
The future of FAIR
The future of FAIRThe future of FAIR
The future of FAIRSarah Jones
 
Strategies for Implementing CDISC
Strategies for Implementing CDISCStrategies for Implementing CDISC
Strategies for Implementing CDISCjbarag
 
Clinical data management basics
Clinical data management basicsClinical data management basics
Clinical data management basicsSurabhi Jain
 
The Role of Data Lakes in Healthcare
The Role of Data Lakes in HealthcareThe Role of Data Lakes in Healthcare
The Role of Data Lakes in HealthcarePerficient, Inc.
 
define_xml_tutorial .ppt
define_xml_tutorial .pptdefine_xml_tutorial .ppt
define_xml_tutorial .pptssuser660bb1
 
Approaching Data Quality
Approaching Data QualityApproaching Data Quality
Approaching Data QualityDATAVERSITY
 
Introduction to metadata management
Introduction to metadata managementIntroduction to metadata management
Introduction to metadata managementOpen Data Support
 
Big data, RWE and AI in Clinical Trials made simple
Big data, RWE and AI in Clinical Trials made simpleBig data, RWE and AI in Clinical Trials made simple
Big data, RWE and AI in Clinical Trials made simpleHadas Jacoby
 
CDISC SDTM Domain Presentation
CDISC SDTM Domain PresentationCDISC SDTM Domain Presentation
CDISC SDTM Domain PresentationAnkur Sharma
 
Clinical Data Management Plan_Katalyst HLS
Clinical Data Management Plan_Katalyst HLSClinical Data Management Plan_Katalyst HLS
Clinical Data Management Plan_Katalyst HLSKatalyst HLS
 
Understanding clinical data management
Understanding clinical data managementUnderstanding clinical data management
Understanding clinical data managementfinenessinstitute
 

La actualidad más candente (20)

Data management
Data managementData management
Data management
 
Data Science in Medicine and Health
Data Science in Medicine and HealthData Science in Medicine and Health
Data Science in Medicine and Health
 
Patient recruitment
Patient recruitmentPatient recruitment
Patient recruitment
 
Lecture 19 research ethics (2)
Lecture 19 research ethics (2)Lecture 19 research ethics (2)
Lecture 19 research ethics (2)
 
Handling Third Party Vendor Data_Katalyst HLS
Handling Third Party Vendor Data_Katalyst HLSHandling Third Party Vendor Data_Katalyst HLS
Handling Third Party Vendor Data_Katalyst HLS
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing data
 
Data Quality Control
Data Quality ControlData Quality Control
Data Quality Control
 
The future of FAIR
The future of FAIRThe future of FAIR
The future of FAIR
 
Data management
Data management Data management
Data management
 
Strategies for Implementing CDISC
Strategies for Implementing CDISCStrategies for Implementing CDISC
Strategies for Implementing CDISC
 
Clinical data management basics
Clinical data management basicsClinical data management basics
Clinical data management basics
 
The Role of Data Lakes in Healthcare
The Role of Data Lakes in HealthcareThe Role of Data Lakes in Healthcare
The Role of Data Lakes in Healthcare
 
define_xml_tutorial .ppt
define_xml_tutorial .pptdefine_xml_tutorial .ppt
define_xml_tutorial .ppt
 
Approaching Data Quality
Approaching Data QualityApproaching Data Quality
Approaching Data Quality
 
Introduction to metadata management
Introduction to metadata managementIntroduction to metadata management
Introduction to metadata management
 
Big data, RWE and AI in Clinical Trials made simple
Big data, RWE and AI in Clinical Trials made simpleBig data, RWE and AI in Clinical Trials made simple
Big data, RWE and AI in Clinical Trials made simple
 
CDISC SDTM Domain Presentation
CDISC SDTM Domain PresentationCDISC SDTM Domain Presentation
CDISC SDTM Domain Presentation
 
Clinical Data Management Plan_Katalyst HLS
Clinical Data Management Plan_Katalyst HLSClinical Data Management Plan_Katalyst HLS
Clinical Data Management Plan_Katalyst HLS
 
Understanding clinical data management
Understanding clinical data managementUnderstanding clinical data management
Understanding clinical data management
 
Pr. Peivand Pirouzi - Lung or Lung and Heart Transplants - Clinical trial pro...
Pr. Peivand Pirouzi - Lung or Lung and Heart Transplants - Clinical trial pro...Pr. Peivand Pirouzi - Lung or Lung and Heart Transplants - Clinical trial pro...
Pr. Peivand Pirouzi - Lung or Lung and Heart Transplants - Clinical trial pro...
 

Similar a Creating a Data Management Plan

How to write a data management plan
How to write a data management planHow to write a data management plan
How to write a data management planOpenExeter
 
Support Your Data, Kyoto University
Support Your Data, Kyoto UniversitySupport Your Data, Kyoto University
Support Your Data, Kyoto UniversityStephanie Simms
 
Data Management for librarians
Data Management for librariansData Management for librarians
Data Management for librariansC. Tobin Magle
 
Responsible Conduct of Research: Data Management
Responsible Conduct of Research: Data ManagementResponsible Conduct of Research: Data Management
Responsible Conduct of Research: Data ManagementKristin Briney
 
Conquering Chaos in the Age of Networked Science: Research Data Management
Conquering Chaos in the Age of Networked Science: Research Data ManagementConquering Chaos in the Age of Networked Science: Research Data Management
Conquering Chaos in the Age of Networked Science: Research Data ManagementKathryn Houk
 
Data Management Planning for Engineers
Data Management Planning for EngineersData Management Planning for Engineers
Data Management Planning for EngineersSherry Lake
 
Introduction to Data Management Planning
Introduction to Data Management PlanningIntroduction to Data Management Planning
Introduction to Data Management PlanningSarah Jones
 
Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016Fiona Nielsen
 
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...SEAD
 
Data Management Planning in the arts
Data Management Planning in the artsData Management Planning in the arts
Data Management Planning in the artsSarah Jones
 
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅kulibrarians
 
Data management planning – what it is and how to do it
Data management planning – what it is and how to do itData management planning – what it is and how to do it
Data management planning – what it is and how to do itariadnenetwork
 
Ariadne: Data Management Planning
Ariadne: Data Management PlanningAriadne: Data Management Planning
Ariadne: Data Management Planningariadnenetwork
 
Research Data Management for SOE
Research Data Management for SOEResearch Data Management for SOE
Research Data Management for SOELynda Kellam
 
Love Your Data Locally
Love Your Data LocallyLove Your Data Locally
Love Your Data LocallyErin D. Foster
 

Similar a Creating a Data Management Plan (20)

Research Data Management and your PhD
Research Data Management and your PhDResearch Data Management and your PhD
Research Data Management and your PhD
 
How to write a data management plan
How to write a data management planHow to write a data management plan
How to write a data management plan
 
Support Your Data, Kyoto University
Support Your Data, Kyoto UniversitySupport Your Data, Kyoto University
Support Your Data, Kyoto University
 
Managing your research data
Managing your research dataManaging your research data
Managing your research data
 
Data Management for librarians
Data Management for librariansData Management for librarians
Data Management for librarians
 
Responsible Conduct of Research: Data Management
Responsible Conduct of Research: Data ManagementResponsible Conduct of Research: Data Management
Responsible Conduct of Research: Data Management
 
Conquering Chaos in the Age of Networked Science: Research Data Management
Conquering Chaos in the Age of Networked Science: Research Data ManagementConquering Chaos in the Age of Networked Science: Research Data Management
Conquering Chaos in the Age of Networked Science: Research Data Management
 
Data Management Planning for Engineers
Data Management Planning for EngineersData Management Planning for Engineers
Data Management Planning for Engineers
 
Why managedata
Why managedataWhy managedata
Why managedata
 
Introduction to Data Management Planning
Introduction to Data Management PlanningIntroduction to Data Management Planning
Introduction to Data Management Planning
 
Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016
 
Research-Data-Management-and-your-PhD
Research-Data-Management-and-your-PhDResearch-Data-Management-and-your-PhD
Research-Data-Management-and-your-PhD
 
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...
Data Sets, Ensemble Cloud Computing, and the University Library: Getting the ...
 
Data Management Planning in the arts
Data Management Planning in the artsData Management Planning in the arts
Data Management Planning in the arts
 
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
20170222 ku-librarians勉強会 #211 :海外研修報告:英国大学図書館を北から南へ巡る旅
 
Data Management 101
Data Management 101Data Management 101
Data Management 101
 
Data management planning – what it is and how to do it
Data management planning – what it is and how to do itData management planning – what it is and how to do it
Data management planning – what it is and how to do it
 
Ariadne: Data Management Planning
Ariadne: Data Management PlanningAriadne: Data Management Planning
Ariadne: Data Management Planning
 
Research Data Management for SOE
Research Data Management for SOEResearch Data Management for SOE
Research Data Management for SOE
 
Love Your Data Locally
Love Your Data LocallyLove Your Data Locally
Love Your Data Locally
 

Más de Kristin Briney

NCURA Webinar on Open Data
NCURA Webinar on Open DataNCURA Webinar on Open Data
NCURA Webinar on Open DataKristin Briney
 
Leveling Up Data Management
Leveling Up Data ManagementLeveling Up Data Management
Leveling Up Data ManagementKristin Briney
 
Breaking the Data Management Barrier
Breaking the Data Management BarrierBreaking the Data Management Barrier
Breaking the Data Management BarrierKristin Briney
 
TEDxUWMilwaukee: Rethinking Research Data
TEDxUWMilwaukee: Rethinking Research DataTEDxUWMilwaukee: Rethinking Research Data
TEDxUWMilwaukee: Rethinking Research DataKristin Briney
 
Data Management 101 (2015)
Data Management 101 (2015)Data Management 101 (2015)
Data Management 101 (2015)Kristin Briney
 
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...Kristin Briney
 
Measuring Research Impact
Measuring Research ImpactMeasuring Research Impact
Measuring Research ImpactKristin Briney
 
Retaining Your Old Research Data
Retaining Your Old Research DataRetaining Your Old Research Data
Retaining Your Old Research DataKristin Briney
 
Organizing Your Research Data
Organizing Your Research DataOrganizing Your Research Data
Organizing Your Research DataKristin Briney
 
Documenting Your Research Data
Documenting Your Research DataDocumenting Your Research Data
Documenting Your Research DataKristin Briney
 
Storing Your Research Data
Storing Your Research DataStoring Your Research Data
Storing Your Research DataKristin Briney
 
Research Data & Digital Preservation - CUWL Conference 2014
Research Data & Digital Preservation - CUWL Conference 2014Research Data & Digital Preservation - CUWL Conference 2014
Research Data & Digital Preservation - CUWL Conference 2014Kristin Briney
 
Practical Data Management - ACRL DCIG Webinar
Practical Data Management - ACRL DCIG WebinarPractical Data Management - ACRL DCIG Webinar
Practical Data Management - ACRL DCIG WebinarKristin Briney
 
Electronic Laboratory Notebooks
Electronic Laboratory NotebooksElectronic Laboratory Notebooks
Electronic Laboratory NotebooksKristin Briney
 
Data Management Crash Course
Data Management Crash CourseData Management Crash Course
Data Management Crash CourseKristin Briney
 
Data Management Tips Handout
Data Management Tips HandoutData Management Tips Handout
Data Management Tips HandoutKristin Briney
 
Data Management Plan Checklist
Data Management Plan ChecklistData Management Plan Checklist
Data Management Plan ChecklistKristin Briney
 

Más de Kristin Briney (20)

NCURA Webinar on Open Data
NCURA Webinar on Open DataNCURA Webinar on Open Data
NCURA Webinar on Open Data
 
Internet Privacy
Internet PrivacyInternet Privacy
Internet Privacy
 
Leveling Up Data Management
Leveling Up Data ManagementLeveling Up Data Management
Leveling Up Data Management
 
Breaking the Data Management Barrier
Breaking the Data Management BarrierBreaking the Data Management Barrier
Breaking the Data Management Barrier
 
Twitter For Academics
Twitter For AcademicsTwitter For Academics
Twitter For Academics
 
TEDxUWMilwaukee: Rethinking Research Data
TEDxUWMilwaukee: Rethinking Research DataTEDxUWMilwaukee: Rethinking Research Data
TEDxUWMilwaukee: Rethinking Research Data
 
Data Management 101 (2015)
Data Management 101 (2015)Data Management 101 (2015)
Data Management 101 (2015)
 
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
NIH Data Policy or: How I Learned to Stop Worrying and Love the Data Manageme...
 
Data Management 101
Data Management 101Data Management 101
Data Management 101
 
Measuring Research Impact
Measuring Research ImpactMeasuring Research Impact
Measuring Research Impact
 
Retaining Your Old Research Data
Retaining Your Old Research DataRetaining Your Old Research Data
Retaining Your Old Research Data
 
Organizing Your Research Data
Organizing Your Research DataOrganizing Your Research Data
Organizing Your Research Data
 
Documenting Your Research Data
Documenting Your Research DataDocumenting Your Research Data
Documenting Your Research Data
 
Storing Your Research Data
Storing Your Research DataStoring Your Research Data
Storing Your Research Data
 
Research Data & Digital Preservation - CUWL Conference 2014
Research Data & Digital Preservation - CUWL Conference 2014Research Data & Digital Preservation - CUWL Conference 2014
Research Data & Digital Preservation - CUWL Conference 2014
 
Practical Data Management - ACRL DCIG Webinar
Practical Data Management - ACRL DCIG WebinarPractical Data Management - ACRL DCIG Webinar
Practical Data Management - ACRL DCIG Webinar
 
Electronic Laboratory Notebooks
Electronic Laboratory NotebooksElectronic Laboratory Notebooks
Electronic Laboratory Notebooks
 
Data Management Crash Course
Data Management Crash CourseData Management Crash Course
Data Management Crash Course
 
Data Management Tips Handout
Data Management Tips HandoutData Management Tips Handout
Data Management Tips Handout
 
Data Management Plan Checklist
Data Management Plan ChecklistData Management Plan Checklist
Data Management Plan Checklist
 

Último

Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxDr.Ibrahim Hassaan
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxChelloAnnAsuncion2
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 

Último (20)

Gas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptxGas measurement O2,Co2,& ph) 04/2024.pptx
Gas measurement O2,Co2,& ph) 04/2024.pptx
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptxGrade 9 Q4-MELC1-Active and Passive Voice.pptx
Grade 9 Q4-MELC1-Active and Passive Voice.pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 

Creating a Data Management Plan

  • 1. Creating a Data Management Plan Kristin Briney, PhD Data Services Librarian
  • 2. This Session Will Answer • Why am I being asked to create a DMP? • What are the key parts of a DMP? • How do I translate my research to each of these parts?
  • 3. You Will Leave With • An understanding of the main parts of a data management plan • Knowledge of where to find resources and assistance  Rough outline of your data management plan
  • 4. WHY AM I BEING ASKED TO CREATE A DATA MANAGEMENT PLAN?
  • 5. Why Data? Why Now? • Data are DIGITAL – Easy to copy and share – Difficult to preserve • Data are COMPUTABLE – New avenues of research like data mining • Data represent a FINANCIAL INVESTMENT – Poor research funding climate – Can no longer ignore data as a scholarly product
  • 6. Many Funders Require DMPs • NSF • NEH • NIH • NOAA • NASA • …even more funders will require DMPs soon! – White House OSTP Public Access memo
  • 7. The Funder Perspective • Data is a scholarly resource – Data sharing akin to scholarly publishing • Barriers to sharing are – Organization – Documentation – Long-term management and preservation  Hence data management plans
  • 8. DMPs Help You Too! • Don’t loose data • Find data more easily • Easier to analyze organized, documented data • Avoid accusations of fraud & misconduct • Get credit for your data • Don’t drown in irrelevant data!
  • 9. For each minute of planning at beginning of a project, you will save 10 minutes of headache later
  • 10. DMPs Help You Too! A data management plan will make conducting research easier for you… …So if you are required to create a DMP, why not use it to improve your practices?
  • 11. WHAT ARE THE KEY PARTS OF A DATA MANAGEMENT PLAN?
  • 12. Actual NSF DMP Requirements • The types of data, samples, physical collections, software, curriculum materials, and other materials to be produced in the course of the project • The standards to be used for data and metadata format and content • Policies for access and sharing including provisions for appropriate protection of privacy, confidentiality, security, intellectual property, or other rights or requirements • Policies and provisions for re-use, re-distribution, and the production of derivatives • Plans for archiving data, samples, and other research products, and for preservation of access to them http://www.nsf.gov/pubs/policydocs/pappguide/nsf13001/gpg_2.jsp#dmp
  • 13. Key Questions 1. What data will I create? 2. What standards will I use to document the data? 3. How will I protect private/secure/confidential data? 4. How will I archive and preserve the data? 5. How will I provide access to and allow reuse of the data?
  • 14. Be Aware • Actual requirements vary by funder and division • Look up your requirements before you write your DMP
  • 15. HOW DO I TRANSLATE MY RESEARCH TO EACH OF THESE PARTS?
  • 16. 1. WHAT TYPES OF DATA WILL I CREATE?
  • 17. What Are Data? • “Research data is defined as the recorded factual material commonly accepted in the scientific community as necessary to validate research findings” – OMB Circular A-110 http://www.whitehouse.gov/omb/circulars_a110
  • 18. What Are Data? • Observational – Sensor data, telemetry, survey data, sample data, images • Experimental – Gene sequences, chromatograms, toroid magnetic field data • Simulation – Climate models, economic models • Derived or compiled – Text and data mining, compiled database, 3D models, data gathered from public documents
  • 19. What Not To Share • Laboratory notebooks • Preliminary analyses • Drafts of scientific papers • Plans for future research • Peer reviews or communications with colleagues • Physical Samples
  • 20. No Data? • Still need a data management plan • Plans with no data and no sharing will likely be examined more closely – Carefully explain situation if you are in this position
  • 21. Exercise • Conduct a quick inventory of the data you will acquire – What data will you collect? – Is your data unique? – How big will the data be? – How fast will the data grow?
  • 22. 2. WHAT STANDARDS WILL I USE TO DOCUMENT THE DATA?
  • 23. What would someone unfamiliar with your data need in order to find, evaluate, understand, and reuse them?
  • 24. Documentation • Consider the difference in documenting for – someone inside your lab – someone outside your lab but in your field – someone outside your field • Audience matters!
  • 25. Documentation Methods • How the data were gathered • How the data should be interpreted • What you did – Limitations on what you did • …build trust in your data Metadata • What you’re looking at • Who made it and when • How it got there • What it means • What you can do with it • …before you even look at the file
  • 26. Methods • Examples of methods to document – Code – Survey – Codebook – Data dictionary – Anything that lets someone reproduce your results • Don’t forget the units!
  • 27. Metadata • Look for a metadata scheme before you collect the data! – Lots of metadata schemas available – Easier to record metadata when collecting data than to convert later • Consult – Disciplinary repository • Repositories usually have required metadata schemas – Your peers – Subject librarian
  • 28. Metadata Example: Dublin Core • Contributor – Jane Collaborator • Creator – Kristin Briney • Date – 2013 Apr 15 • Description – A microscopy image of cancerous breast tissues under 20x zoom. This image is my control, so it has only the standard staining describe on 2013 Feb 2 in my notebook. • Format – JPEG • Identifier – IMG00057.jpg • Relation – Same sample as images IMG00056.jpg and IMG00055.jpg • Subject – Breast cancer • Title – Cancerous breast tissue control
  • 29. Exercise • What methods information do you need to preserve? • What metadata standard will you use for your data? -OR- Who will you contact to find a relevant standard?
  • 30. 3. HOW WILL I PROTECT PRIVATE/SECURE/CONFIDENTIAL DATA?
  • 31. Security Issues • Does your data fall under the following? – HIPAA • Health information – FERPA • Student information – FISMA • Government subcontractor – Human subject research, etc.  Ask for help!
  • 32. Security Issues • Secure storage • Controlled access • De-identification of personal information • Security training
  • 33. Security Questions • Access permissions – Who is allowed to access the data? • Sharing – Am I required to share? Can I actually share? – Despite requirements, some data can’t be shared • Responsibility – Who will make sure the data stays secure?
  • 34. UWM Security Resources • UWM Information Security Office – Visit: https://www4.uwm.edu/itsecurity/ – Email: infosec@uwm.edu • Certificate in Information Security • HIPAA – https://www4.uwm.edu/legal/hipaa/index.cfm • FERPA – http://www4.uwm.edu/academics/ferpa.cfm
  • 35. Exercise • Do any regulations apply to your data? • If so, who is allowed to access your secure data? Who will be responsible for data security?
  • 36. 4. HOW WILL I ARCHIVE AND PRESERVE THE DATA?
  • 37. Archiving Is Not Storage • Storage is keeping files to access • Archiving is about preservation – Data should be readable and usable – Data should be uncorrupted • We can’t read some digital files from 10 years ago – This is what good digital preservation solves
  • 38. Side Note • If federally funded, you are required to retain your data “for a period of three years from the date of submission of the final expenditure report.” AT LEAST. • Better to keep on hand for at least 6 years – Recent retraction in 6-year old paper for failure to provide original data • Preservation not an abstract issue http://www.whitehouse.gov/omb/circulars_a110#53 http://retractionwatch.wordpress.com/2013/07/19/jci-paper-retracted-for-duplicated-panels-after-authors-cant-provide- original-data/
  • 39. File Formats • Easy way to ensure long-term usability • Use open file formats – Open and standardized – Well documented – In wide use – Examples: .txt, .tiff, .csv, .dbf • Transform your data now, not later – Keep both file types
  • 40. Other Preservation Concerns • Obsolescence – Preserve software along with data • Deterioration – Keep more than 1 copy to avoid corruption • Media – ie. Can you still read a floppy disk? – Periodically move data off outdated media
  • 41. Find a Trustworthy Partner • Find outside help – Servers come and go, so do labs • Off campus – Disciplinary data repository – Journal that accepts data • Let someone else worry about this
  • 42. Exercise • What open file formats will you use to help preserve your data? • If there isn’t an adequate open format, what software and hardware will you preserve?
  • 43. 5. HOW WILL I PROVIDE ACCESS TO AND ALLOW REUSE OF THE DATA?
  • 44. Why Share? • Get more credit for your work – In “studies that created gene expression microarray data, we found that studies that made data available in a public repository received 9% … more citations than similar studies for which the data was not made available” – “The citation boost varied with date of dataset deposition: a citation boost was most clear for papers published in 2004 and 2005, at about 30%” • Get credit for unpublishable results https://peerj.com/preprints/1/ (2013 study)
  • 45. Why Share? • Make your funder happy • Helps you find and use your data later • Disprove misconduct or fraud accusations • Stimulate new research
  • 46. Audience • Who is the audience for this data? – Coworkers? – Disciplinary/institutional colleague? – Researchers in allied fields? – Anyone? • Audience will determine how to share the data
  • 47. Ways To Provide Access • Hands-off options preferable – Journal – Disciplinary repository • Embargoes may be possible here – UWM Digital Commons • Small, discrete datasets • Other options – By request – On your lab website
  • 48. Exercise • Who is the audience for your data? • Which way will you provide access?
  • 50. Resources • Data Services Librarian – briney@uwm.edu • Data management information – dataplan.uwm.edu • UWM Information Security Office – infosec@uwm.edu
  • 51. Thank You • This presentation is available on Slideshare – http://www.slideshare.net/kbriney • The content of this presentation is licensed under a Creative Commons Attribution 3.0 Unported License (CC BY) • Some content used with permission from Brad Houston and Dorothea Salo