Research Data Management
Sarah Jones
DCC, University of Glasgow
sarah.jones@glasgow.ac.uk
Twitter: @sjDCC
University of the West of England, 9th July 2014
Funded by:
Programme
• Quiz of funders’ requirements
• Introduction to RDM
• Data management planning
• Demo of DMPonline
• Q&A
What is research data management?
“the active management and appraisal of data over the lifecycle of scholarly and scientific interest”
Data management is part of good research practice
Why manage your research data?
• To make your research easier!
• To stop yourself drowning in irrelevant stuff
• In case you need the data later
• To avoid accusations of fraud or bad science
• To share your data for others to use and learn from
• To get credit for producing it
• Because somebody else said to do so
RCUK Common Principles on Data Policy
“Publicly funded research data are a public good,
produced in the public interest, which should be
made openly available with as few restrictions as
possible in a timely and responsible manner that
does not harm intellectual property.”
www.rcuk.ac.uk/research/datapolicy
Why share data?
Benefits of data sharing (1)
www.nytimes.com/2010/08/13/health/research/13alzheimer.html?pagewanted=all&_r=0
“It was unbelievable. It’s not science
the way most of us have practiced
in our careers. But we all realised
that we would never get biomarkers
unless all of us parked our egos and
intellectual property noses outside
the door and agreed that all of our
data would be public immediately.”
Dr John Trojanowski, University of Pennsylvania
•... scientific breakthroughs
Benefits of data sharing (2)
“There is evidence that studies that make their
data available do indeed receive more citations
than similar studies that do not.”
Piwowar H. and Vision T.J. 2013 “Data reuse and the open data citation advantage” https://peerj.com/preprints/1.pdf
9% - 30% increase
•... more citations
If you plan to share your data...
• Have you got consent for sharing?
• Do any licences you’ve signed permit sharing?
• Is your data in suitable formats?
Decisions made early on affect what you can do later
Some formats are better for long-term
It’s preferable to opt for formats that are:
• Uncompressed
• Non-proprietary
• Open, documented
• Standard representation (ASCII, Unicode)
Data centres may have preferred formats for deposit, e.g.
Type | Recommended | Non-preferred
Tabular data | CSV, TSV, SPSS portable | Excel
Text | Plain text, HTML, RTF; PDF/A only if layout matters | Word
Media | Container: MP4, Ogg; Codec: Theora, Dirac, FLAC | Quicktime, H264
Images | TIFF, JPEG2000, PNG | GIF, JPG
Structured data | XML, RDF | RDBMS
Further examples: http://www.data-archive.ac.uk/create-manage/format/formats-table
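As a rough illustration of the table above, a short script could flag files that are in non-preferred formats before deposit. The extension lists below are an assumed reading of the table (for example, mapping "Excel" to .xls/.xlsx), not an official mapping from any data centre. Media and PDF files are left out because the container, codec or PDF/A status cannot be judged from the file extension alone.

```python
from pathlib import Path

# Assumed extension lists, loosely based on the table above (illustrative only).
RECOMMENDED = {".csv", ".tsv", ".por", ".txt", ".html", ".rtf",
               ".tif", ".tiff", ".jp2", ".png", ".xml", ".rdf"}
NON_PREFERRED = {".xls", ".xlsx", ".doc", ".docx", ".gif", ".jpg", ".jpeg"}


def classify(filename: str) -> str:
    """Return 'recommended', 'non-preferred' or 'unknown' for a file name."""
    ext = Path(filename).suffix.lower()
    if ext in RECOMMENDED:
        return "recommended"
    if ext in NON_PREFERRED:
        return "non-preferred"
    return "unknown"


for name in ["survey.CSV", "results.xlsx", "notes.pdf"]:
    print(name, "->", classify(name))
```

Anything reported as "unknown" needs a human decision; always check the preferred formats of the data centre you plan to deposit with.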
Documentation
What would someone unfamiliar with your
data need in order to find, evaluate,
understand, and reuse them?
Consider the differences between someone inside
your research group, someone outside your
group but in your field, and someone outside
your field.
Documentation and standards
Metadata: basic info e.g. title, author, dates, access rights...
Documentation: context, workflows, methods, code, data dictionary...
Use standards wherever possible for interoperability
www.dcc.ac.uk/resources/metadata-standards
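To make the "basic info" idea concrete, here is a minimal sketch of a metadata record and a check for empty fields. The field names are hypothetical and chosen only to mirror the examples on the slide (title, author, dates, access rights); a real project should use a metadata standard from its discipline instead.

```python
import json

# Hypothetical field names mirroring the slide; not a metadata standard.
REQUIRED_FIELDS = ("title", "author", "dates", "access_rights")


def missing_fields(record: dict) -> list[str]:
    """List required fields that are absent or empty in the record."""
    return [field for field in REQUIRED_FIELDS if not record.get(field)]


record = {
    "title": "Example interview transcripts",
    "author": "A. Researcher",
    "dates": {"collected": "2014-01/2014-06"},
    "access_rights": "",  # left empty on purpose to show the check
}

print(missing_fields(record))        # ['access_rights']
print(json.dumps(record, indent=2))  # plain-text form that can sit next to the data
```

Storing the record as a plain-text file alongside the data keeps it readable without special software.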
Tools for managing data
www.dcc.ac.uk/resources/external/tools-services/managing-active-research-data
Where to store your data?
• Your own drive (PC, server, flash drive, etc.)
– And if you lose it? Or it breaks?
• Somebody else’s drive
• Departmental drive
• “Cloud” drive
– Do they care as much about your data as you do?
How to back up?
• 3… 2… 1… backup!
– at least 3 copies of a file
– on at least 2 different media
– with at least 1 offsite
• Use managed services where possible e.g. University
filestores rather than local or external hard drives
• Ask central or local IT team for advice
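The 3-2-1 rule above can be written down as a simple check. This is only a sketch: the `Copy` class and its fields are made up for illustration, and what counts as a distinct "medium" or as "offsite" is a judgement you would make with your IT team.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Copy:
    """One stored copy of a file (all fields are illustrative)."""
    location: str  # e.g. "office PC"
    medium: str    # e.g. "internal disk", "network storage"
    offsite: bool  # True if kept away from the main workplace


def meets_3_2_1(copies: list[Copy]) -> bool:
    """At least 3 copies, on at least 2 different media, with at least 1 offsite."""
    enough_copies = len(copies) >= 3
    enough_media = len({c.medium for c in copies}) >= 2
    has_offsite = any(c.offsite for c in copies)
    return enough_copies and enough_media and has_offsite


copies = [
    Copy("office PC", "internal disk", offsite=False),
    Copy("university filestore", "network storage", offsite=False),
    Copy("external drive at home", "external disk", offsite=True),
]
print(meets_3_2_1(copies))      # True
print(meets_3_2_1(copies[:2]))  # False: only two copies and none offsite
```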
Archiving: data repositories
http://databib.org
http://service.re3data.org/search
Zenodo
•OpenAIRE-CERN joint effort
•Multidisciplinary repository
•Multiple data types
– Publications
– Long tail of research data
•Citable data (DOI)
•Links to funding, pubs, data & software
www.zenodo.org
Creative Commons limitations
• NC Non-Commercial
– What counts as commercial?
• SA Share Alike
– Reduces interoperability
• ND No Derivatives
– Severely restricts use
License your data for reuse
Outlines pros and cons of each approach and gives practical advice on how to implement your licence
www.dcc.ac.uk/resources/how-guides/license-research-data
Data citation
• Makes it easier for readers to locate
the data and validate findings
• Data citations ensure that data
contributors receive proper credit
• Can link to reuse to show impact
• Less danger of rival researchers
‘stealing’ results from those who
publish their data openly
www.dcc.ac.uk/resources/briefing-papers/introduction-curation/data-citation-and-linking
ImpactStory: Altmetrics
•https://impactstory.org
Getting your research out there
www.katiephd.com/twitter-and-science-publications
Managing and sharing data:
a best practice guide
• How to write a DMP
• Formatting your data
• Documentation
• Data sharing
• Ethics and consent
• Copyright
• …
http://data-archive.ac.uk/media/2894/managingsharing.pdf
Putting the pieces together...
...DMPs
Photo by Dread Pirate Jeff
http://www.flickr.com/photos/justageek/2851643792
What is a data management plan?
A brief plan written at the start of your project to define:
• how your data will be created
• how it will be documented
• who will access it
• where it will be stored
• who will back it up
• whether (and how) it will be shared & preserved
DMPs are often submitted as part of grant applications,
but are useful whenever you’re creating data.
Why YOU need a Data Management Plan
http://blogs.ch.cam.ac.uk/pmr/2011/08/01/why-you-need-a-data-management-plan
What if this was your laptop?
Which UK funders require a DMP?
www.dcc.ac.uk/resources/policy-and-legal/overview-funders-data-policies
DCC Checklist for a DMP
• 13 questions on what’s asked across the board
• Prompts / pointers to help researchers get started
• Guidance on how to answer
www.dcc.ac.uk/sites/default/files/documents/resource/DMP_Checklist_2013.pdf
Common themes in DMPs
1. Description of data to be collected / created
(i.e. content, type, format, volume...)
2. Standards / methodologies for data collection & management
3. Ethics and Intellectual Property
(highlight any restrictions on data sharing e.g. embargoes, confidentiality)
4. Plans for data sharing and access
(i.e. how, when, to whom)
5. Strategy for long-term preservation
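The five themes above can double as a checklist for a draft plan. The sketch below assumes a draft is held as a simple mapping from theme name to text; the theme labels are shortened paraphrases of the list above, and the structure is hypothetical rather than the format of any funder template or of DMPonline.

```python
# Shortened labels for the five common themes listed above (paraphrased).
THEMES = (
    "data description",
    "standards and methodologies",
    "ethics and intellectual property",
    "data sharing and access",
    "long-term preservation",
)


def uncovered_themes(plan: dict[str, str]) -> list[str]:
    """Return the themes that have no text (or only whitespace) in the draft."""
    return [theme for theme in THEMES if not plan.get(theme, "").strip()]


draft = {
    "data description": "Interview audio and transcripts, approx. 20 GB.",
    "data sharing and access": "   ",  # placeholder with nothing written yet
}
for theme in uncovered_themes(draft):
    print("Still to write:", theme)
```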
A useful framework to get you started
Think about why the questions are being asked – why is it useful to consider that topic?
Look at examples to help you understand what to write
www.icpsr.umich.edu/icpsrweb/content/datamanagement/dmp/framework.html
Tips for writing DMPs
• Seek advice - consult and collaborate
• Consider good practice for your field
• Base plans on available skills & support
• Make sure implementation is feasible
Example plans
• Technical plan submitted to AHRC by Bristol Uni
http://data.bris.ac.uk/research/planning/files/2013/08/data.bris-AHRC-example-Technical-
• Rural Economy & Land Use (RELU) programme examples
http://relu.data-archive.ac.uk/data-sharing/planning/examples
• UCSD example DMPs (20+ scientific plans for NSF)
http://rci.ucsd.edu/data-curation/examples.html
• My DMP – a satire (what not to write!)
http://ivory.idyll.org/blog/data-management.html
More at: https://dmponline.dcc.ac.uk/help#DMPhelp
Help from the DCC
•https://dmponline.dcc.ac.uk
•www.dcc.ac.uk/resources/how-guides/develop-data-plan
A web-based tool to help researchers
write data management plans
DMPonline demo
https://dmponline.dcc.ac.uk
Thanks – any questions?
DCC guidance, tools and case studies:
www.dcc.ac.uk/resources
Follow us on twitter:
@digitalcuration and #ukdcc
Credit to Dorothea Salo, Ryan Schryver and colleagues for content from the “Escaping Datageddon”
presentation for slides 4, 11 & 14, available at: http://www.slideshare.net/cavlec/escaping-datageddon
And to the Research360 project at the University of Bath for content from the “Managing your research
data” presentation for slide 10, available at: http://opus.bath.ac.uk/32296

Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 

Último (20)

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 

DC101 UWE

  • 1. Research Data Management Sarah Jones DCC, University of Glasgow sarah.jones@glasgow.ac.uk Twitter: @sjDCC •University of the West of England, 9th July 2014 Funded by:
  • 2. Programme • Quiz of funders’ requirements • Introduction to RDM • Data management planning • Demo of DMPonline • Q&A
  • 3. “the active management and appraisal of data over the lifecycle of scholarly and scientific interest” Data management is part of good research practice What is research data management?
  • 4. Why manage your research data? • To make your research easier! • To stop yourself drowning in irrelevant stuff • In case you need the data later • To avoid accusations of fraud or bad science • To share your data for others to use and learn from • To get credit for producing it • Because somebody else said to do so
  • 5. RCUK Common Principles on Data Policy “Publicly funded research data are a public good, produced in the public interest, which should be made openly available with as few restrictions as possible in a timely and responsible manner that does not harm intellectual property.” www.rcuk.ac.uk/research/datapolicy
  • 7. Benefits of data sharing (1) www.nytimes.com/2010/08/13/health/research/13alzheimer.html?pagewanted=all&_r=0 “It was unbelievable. It’s not science the way most of us have practiced in our careers. But we all realised that we would never get biomarkers unless all of us parked our egos and intellectual property noses outside the door and agreed that all of our data would be public immediately.” Dr John Trojanowski, University of Pennsylvania •... scientific breakthroughs
  • 8. Benefits of data sharing (2) “There is evidence that studies that make their data available do indeed receive more citations than similar studies that do not.” Piwowar H. and Vision T.J. 2013 “Data reuse and the open data citation advantage” https://peerj.com/preprints/1.pdf 9% - 30% increase •... more citations
  • 9. If you plan to share your data.... • Have you got consent for sharing? • Do any licences you’ve signed permit sharing? • Is your data in suitable formats? Decisions made early on affect what you can do later
  • 10. Some formats are better for long-term preservation. It’s preferable to opt for formats that are: • Uncompressed • Non-proprietary • Open, documented • Standard representation (ASCII, Unicode). Data centres may have preferred formats for deposit, e.g.
       Type | Recommended | Non-preferred
       Tabular data | CSV, TSV, SPSS portable | Excel
       Text | Plain text, HTML, RTF; PDF/A only if layout matters | Word
       Media | Container: MP4, Ogg; Codec: Theora, Dirac, FLAC | Quicktime, H264
       Images | TIFF, JPEG2000, PNG | GIF, JPG
       Structured data | XML, RDF | RDBMS
       Further examples: http://www.data-archive.ac.uk/create-manage/format/formats-table
  • 11. Documentation What would someone unfamiliar with your data need in order to find, evaluate, understand, and reuse them? Consider the differences between someone inside your research group, someone outside your group but in your field, and someone outside your field.
  • 12. Documentation and standards Metadata: basic info e.g. title, author, dates, access rights... Documentation: context, workflows, methods, code, data dictionary... Use standards wherever possible for interoperability www.dcc.ac.uk/resources/metadata-standards
  • 13. Tools for managing data www.dcc.ac.uk/resources/external/tools-services/managing-active-research-data
  • 14. Where to store your data? • Your own drive (PC, server, flash drive, etc.) – And if you lose it? Or it breaks? • Somebody else’s drive • Departmental drive • “Cloud” drive – Do they care as much about your data as you do?
  • 15. How to back up? • 3… 2… 1… backup! – at least 3 copies of a file – on at least 2 different media – with at least 1 offsite • Use managed services where possible e.g. University filestores rather than local or external hard drives • Ask central or local IT team for advice
  • 16. Archiving: data repositories http://databib.org http://service.re3data.org/search Zenodo •OpenAIRE-CERN joint effort •Multidisciplinary repository •Multiple data types – Publications – Long tail of research data •Citable data (DOI) •Links to funding, pubs, data & software www.zenodo.org
  • 17. License your data for reuse. Creative Commons limitations: • NC Non-Commercial: what counts as commercial? • SA Share Alike: reduces interoperability • ND No Derivatives: severely restricts use. The DCC guide outlines pros and cons of each approach and gives practical advice on how to implement your licence: www.dcc.ac.uk/resources/how-guides/license-research-data
  • 18. Data citation • Makes it easier for readers to locate the data and validate findings • Data citations ensure that data contributors receive proper credit • Can link to reuse to show impact • Less danger of rival researchers ‘stealing’ results from those who publish their data openly www.dcc.ac.uk/resources/briefing-papers/introduction-curation/data-citation-and-linking
  • 21. Managing and sharing data: a best practice guide • How to write a DMP • Formatting your data • Documentation • Data sharing • Ethics and consent • Copyright • … http://data-archive.ac.uk/media/2894/managingsharing.pdf
  • 22. Putting the pieces together... ...DMPs Photo by Dread Pirate Jeff http://www.flickr.com/photos/justageek/2851643792
  • 23. What is a data management plan? A brief plan written at the start of your project to define: • how your data will be created • how it will be documented • who will access it • where it will be stored • who will back it up • whether (and how) it will be shared & preserved. DMPs are often submitted as part of grant applications, but are useful whenever you’re creating data.
  • 24. Why YOU need a Data Management Plan http://blogs.ch.cam.ac.uk/pmr/2011/08/01/why-you-need-a-data-management-plan What if this was your laptop?
  • 25. Which UK funders require a DMP? •www.dcc.ac.uk/resources/policy-and-legal/overview-funders-data-policies
  • 26. DCC Checklist for a DMP • 13 questions on what’s asked across the board • Prompts / pointers to help researchers get started • Guidance on how to answer www.dcc.ac.uk/sites/default/files/documents/resource/DMP_Checklist_2013.pdf
  • 27. Common themes in DMPs 1. Description of data to be collected / created (i.e. content, type, format, volume...) 2. Standards / methodologies for data collection & management 3. Ethics and Intellectual Property (highlight any restrictions on data sharing e.g. embargoes, confidentiality) 4. Plans for data sharing and access (i.e. how, when, to whom) 5. Strategy for long-term preservation
  • 28. A useful framework to get you started Think about why the questions are being asked – why is it useful to consider that topic? Look at examples to help you understand what to write •www.icpsr.umich.edu/icpsrweb/content/datamanagement/dmp/framework.html
  • 29. Tips for writing DMPs • Seek advice - consult and collaborate • Consider good practice for your field • Base plans on available skills & support • Make sure implementation is feasible
  • 30. Example plans • Technical plan submitted to AHRC by Bristol Uni http://data.bris.ac.uk/research/planning/files/2013/08/data.bris-AHRC-example-Technical- • Rural Economy & Land Use (RELU) programme examples http://relu.data-archive.ac.uk/data-sharing/planning/examples • UCSD example DMPs (20+ scientific plans for NSF) http://rci.ucsd.edu/data-curation/examples.html • My DMP – a satire (what not to write!) http://ivory.idyll.org/blog/data-management.html More at: https://dmponline.dcc.ac.uk/help#DMPhelp
  • 31. Help from the DCC •https://dmponline.dcc.ac.uk •www.dcc.ac.uk/resources/how-guides/develop-data-plan A web-based tool to help researchers write data management plans
  • 33. Thanks – any questions? DCC guidance, tools and case studies: www.dcc.ac.uk/resources Follow us on twitter: @digitalcuration and #ukdcc Credit to Dorothea Salo, Ryan Schryver and colleagues for content from the “Escaping Datageddon” presentation for slides 4, 11 & 14, available at: http://www.slideshare.net/cavlec/escaping-datageddon And to the Research360 project at the University of Bath for content from the “Managing your research data” presentation for slide 10, available at: http://opus.bath.ac.uk/32296
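The "3… 2… 1… backup!" rule on slide 15 can be expressed as a small check. The following is only a rough sketch in Python: the `BackupCopy` class, its field names and the example locations are hypothetical and are not part of the slides or of any DCC tool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BackupCopy:
    """One copy of a file (hypothetical record type, for illustration only)."""
    location: str   # free-text label, e.g. "laptop"
    medium: str     # kind of storage medium, e.g. "ssd", "hdd"
    offsite: bool   # True if the copy is kept away from the main workplace


def satisfies_3_2_1(copies):
    """Return True if the copies meet the 3-2-1 rule from slide 15."""
    enough_copies = len(copies) >= 3                     # at least 3 copies
    enough_media = len({c.medium for c in copies}) >= 2  # on at least 2 different media
    has_offsite = any(c.offsite for c in copies)         # with at least 1 offsite
    return enough_copies and enough_media and has_offsite


# Example set of copies (assumed values)
copies = [
    BackupCopy("laptop", "ssd", offsite=False),
    BackupCopy("external drive", "hdd", offsite=False),
    BackupCopy("university filestore", "network storage", offsite=True),
]
print(satisfies_3_2_1(copies))
```

A check like this only counts copies; it says nothing about whether the copies are current or restorable, which is one reason the slide recommends managed services and advice from IT.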

Editor's notes

  1. Data is increasing in significance. It will unquestionably matter to your research careers, more than it does to your supervisors’ generation. Learn good data habits now! You’ll need them later.
  2. Some formats are better for data sharing and long-term preservation than others. It’s preferable to use formats that are uncompressed (e.g. large, high-quality files like .wav), non-proprietary (i.e. open) standards that are documented and well-understood. This aids preservation and interoperability. Some data centres have preferred formats for deposit so it’s worthwhile encouraging researchers to consult these to check.
  3. To make sure their data can be understood by themselves, their community and others, researchers should create metadata and documentation. Metadata is basic descriptive information to help identify and understand the structure of the data e.g. title, author... Documentation provides the wider context. It’s useful to share the methodology / workflow, software and any information needed to understand the data, e.g. an explanation of abbreviations or acronyms. There are lots of standards that can be used. The DCC started a catalogue of disciplinary metadata standards, which is now being taken forward as an international initiative via an RDA working group.
  4. The EC guidelines suggest selecting a suitable repository. The Databib and Re3data lists can be useful for this. They allow you to search and browse by subject. Re3data also allows you to restrict the search by certificates, open access repositories and persistent identifiers.
  5. Guidance from the DCC can also help researchers to understand data licensing. This guide outlines the pros and cons of each approach, e.g. the limitations of some CC options. Under Horizon 2020 it’s recommended that researchers use CC-0 or CC-BY to make data as open as possible.
  6. I recommend this ICPSR resource. It explains the importance of different questions as a pointer to how to answer. Examples are given. This is the most frequent request we get at DCC: examples help researchers think of what to write for their context.
  7. The DCC has produced a How-to guide on writing DMPs and developed a tool to help.
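As a rough illustration of note 2 and the format table on slide 10, a researcher could screen file names against the recommended and non-preferred formats before deposit. This Python sketch is an assumption-laden example: the file extensions are my own mapping of the format names on the slide (for instance `.por` for SPSS portable and `.jp2` for JPEG2000), the media formats are left out, and `classify_format` is a hypothetical helper, not a data-centre tool.

```python
from pathlib import Path

# Extensions assumed to correspond to the formats named on slide 10
RECOMMENDED = {
    ".csv", ".tsv", ".por",          # tabular data
    ".txt", ".html", ".rtf",         # text
    ".tif", ".tiff", ".jp2", ".png", # images
    ".xml", ".rdf",                  # structured data
}
NON_PREFERRED = {
    ".xls", ".xlsx",                 # Excel
    ".doc", ".docx",                 # Word
    ".gif", ".jpg", ".jpeg",         # images
}


def classify_format(filename):
    """Classify a file name as 'recommended', 'non-preferred' or 'unknown'."""
    suffix = Path(filename).suffix.lower()  # compare extensions case-insensitively
    if suffix in RECOMMENDED:
        return "recommended"
    if suffix in NON_PREFERRED:
        return "non-preferred"
    return "unknown"


for name in ["survey.CSV", "results.xlsx", "notes"]:
    print(name, "->", classify_format(name))
```

An extension is only a hint about a file's format, and individual data centres publish their own preferred lists, so a result of "recommended" here is a prompt to check with the repository rather than a guarantee of acceptance.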