SlideShare una empresa de Scribd logo
1 de 38
Descargar para leer sin conexión
This work is licensed under the Creative Commons Attribution 2.5 UK: Scotland License
S. Venkataraman, PhD
Research Data Specialist
Digital Curation Centre
s.venkataraman@ed.ac.uk
21st October 2019, Open Access Week Webinar
Principles of Research Data
Management
About the DCC
• Established in 2004
• Based in Edinburgh and Glasgow
• Works at national and international levels
• One of leading organisations in the world specialising in
training, consultancy, policy making and advocacy in digital
data management best practice and services provision
• Involved in many international consortia, schools and projects
• (Not involved in actual curation of any data!)
Is there a reproducibility crisis?
Baker, M. (2016)
“1,500 scientists lift
the lid on
reproducibility”,
Nature, 533:7604,
http://www.nature.com/n
ews/1-500-scientists-lift-
the-lid-on-
reproducibility-1.19970
Why make data available?
Who has heard of this before…?
Image CC-BY-SA by SangyaPundir
European perspective…
https://publications.europa.eu/en/publication-detail/-
/publication/7769a148-f1f6-11e8-9982-
01aa75ed71a1/language-en/format-PDF/source-
80611283
Slide CC-BY by Erik Schultes, Leiden UMC
What FAIR means: 15 principles
Comprehensive descriptions can be found at https://www.go-fair.org/fair-
principles/
Common misconceptions
• FAIR data does not have to be open
• The principles do not specify particular technologies or implementations
e.g. semantic web
• FAIR is not a standard to be followed or strict criteria – it’s a spectrum /
continuum
• It doesn’t only apply to the life sciences
All research data
Managed data
FAIR
data
Open
data
the
wild
Increasing that which is FAIR & open
Managed data
FAIR
data
Open
data
the
wild
as open as
possible, as
closed as
necessary
Image: ‘Balancing rocks’ by Viewminder CC-BY-SA-ND
www.flickr.com/photos/light_seeker/7780857224
RDM & the Data LifecycleImage CC-BY-SA by Janneke Staaks www.flickr.com/photos/jannekestaaks/14411397343
What is Research Data Management?
“the active management and
appraisal of data over the lifecycle of
scholarly and scientific interest”
Data management is part of good
research practice
Create
Document
Use
Store
Share
Preserve
Create
Document
Use
Store
Share
Preserve
Data creation tips
• Ensure consent forms, licences and agreements don’t restrict
opportunities to share data
• Choose appropriate formats
• Adopt a file naming convention
• Create metadata and documentation as you go
Ask for consent for data sharing
If not, data centres won’t be able to accept the data – regardless of any
conditions on the original grant.
www.data-archive.ac.uk/create-manage/consent-ethics/consent?index=3
Choose appropriate file formats
Different formats are good for different things
• open, lossless formats are more sustainable e.g. rtf, xml, tif, wav
• proprietary and/or compressed formats are less preservable but are often
in widespread use e.g. doc, jpg, mp3
One format for analysis then
convert to a standard format
Data centres may suggest preferred formats for deposit
https://www.ukdataservice.ac.uk/manage-data/format/recommended-formats
Type of data Recommended formats Acceptable formats
Tabular data with extensive metadata
variable labels, code labels, and defined missing values
SPSS portable format (.por)
delimited text and command ('setup') file (SPSS, Stata, SAS, etc.)
structured text or mark-up file of metadata information, e.g.
DDI XML file
proprietary formats of statistical packages: SPSS (.sav), Stata
(.dta), MS Access (.mdb/.accdb)
Tabular data with minimal metadata
column headings, variable names
comma-separated values (.csv)
tab-delimited file (.tab)
delimited text with SQL data definition statements
delimited text (.txt) with characters not present in data used as
delimiters
widely-used formats: MS Excel (.xls/.xlsx), MS Access
(.mdb/.accdb), dBase (.dbf), OpenDocument Spreadsheet (.ods)
Geospatial data
vector and raster data
ESRI Shapefile (.shp, .shx, .dbf, .prj, .sbx, .sbn optional)
geo-referenced TIFF (.tif, .tfw)
CAD data (.dwg)
tabular GIS attribute data
Geography Markup Language (.gml)
ESRI Geodatabase format (.mdb)
MapInfo Interchange Format (.mif) for vector data
Keyhole Mark-up Language (.kml)
Adobe Illustrator (.ai), CAD data (.dxf or .svg)
binary formats of GIS and CAD packages
Textual data Rich Text Format (.rtf)
plain text, ASCII (.txt)
eXtensible Mark-up Language (.xml) text according to an
appropriate Document Type Definition (DTD) or schema
Hypertext Mark-up Language (.html)
widely-used formats: MS Word (.doc/.docx)
some software-specific formats: NUD*IST, NVivo and ATLAS.ti
Image data TIFF 6.0 uncompressed (.tif) JPEG (.jpeg, .jpg, .jp2) if original created in this format
GIF (.gif)
TIFF other versions (.tif, .tiff)
RAW image format (.raw)
Photoshop files (.psd)
BMP (.bmp)
PNG (.png)
Adobe Portable Document Format (PDF/A, PDF) (.pdf)
Audio data Free Lossless Audio Codec (FLAC) (.flac) MPEG-1 Audio Layer 3 (.mp3) if original created in this format
Audio Interchange File Format (.aif)
Waveform Audio Format (.wav)
Video data MPEG-4 (.mp4)
OGG video (.ogv, .ogg)
motion JPEG 2000 (.mj2)
AVCHD video (.avchd)
Documentation and scripts Rich Text Format (.rtf)
PDF/UA, PDF/A or PDF (.pdf)
XHTML or HTML (.xhtml, .htm)
OpenDocument Text (.odt)
plain text (.txt)
widely-used formats: MS Word (.doc/.docx), MS Excel
(.xls/.xlsx)
XML marked-up text (.xml) according to an appropriate DTD or
schema, e.g. XHMTL 1.0
https://www.ukdataservice.ac.uk/manage-data/format/recommended-formats
How will you organise your data?
• Keep file and folder names short, but meaningful
• Agree a method for versioning
• Include dates in a set format e.g. YYYYMMDD
• Avoid using non-alphanumeric characters in file names
• Use hyphens or underscores not spaces e.g. day-sheet, day sheet
• Order the elements in the most appropriate way to retrieve the record
Example from ARM Climate Research Facility www.arm.gov/data/docs/plan
Create
Document
Use
Store
Share
Preserve
Documentation
Think about what is needed in order to evaluate, understand, and reuse the
data.
• Why was the data created?
• Have you documented what you did and how?
• Did you develop code to run analyses? If so, this should be kept and
shared too.
• Important to provide wider context for trust
What are metadata?
Metadata
• Standardised
• Structured
• Machine and human readable
Metadata helps to cite &
disambiguate data
Documentation aids reuse
Metadata
Documentation
Metadata standards
These can be general – such as Dublin Core
Or discipline specific
• Data Documentation Initiative (DDI) – social science
• Ecological Metadata Language (EML) - ecology
• Flexible Image Transport System (FITS) – astronomy
Search for standards in catalogues like:
http://rd-alliance.github.io/metadata-directory/
https://rdamsc.dcc.ac.uk/
“MTBLS1: A metabolomic study of urinary changes in type 2 diabetes in……”
Example courtesy of Ken Haug, European Bioinformatics Institute (EMBL-EBI)
Controlled vocabularies
e.g. SNOMED CT (clinical terms) or MeSH
• Defined terms + taxonomy
• Useful for selecting keywords to tag datasets
• You can find many ontologies in the BARTOC catalogue and elsewhere
➢ Organism A
➢ Term A1
➢ Term A2
➢ Term A3
➢ Term B1
➢ Term B2
➢ Term C4
➢ .
➢ .
➢ .
➢ Term n
► Organism B
► Term A1
► Term A2
► Term A3
► Term B1
► Term B2
► Term C4
► .
► .
► .
► Term n
…and ontologies?
Create
Document
Use
Store
Share
Preserve
Where will you store the data?
• Your own device (laptop, flash drive, server etc.)
– And if you lose it? Or it breaks?
• Departmental drives or university servers
• “Cloud” storage
– Do they care as much about your data as you do?
The decision will be based on how sensitive your data
are, how robust you need the storage to be, and
who needs access to the data and when
Third-party tools for collaboration
ownCloud
• Open source product with
Dropbox-like functionality
• Used by many universities and
service providers to offer
‘approved’ solution
https://owncloud.org
Using Dropbox and other cloud
services
Backup and preservation – not the
same thing!
Backups
• Used to take periodic snapshots of data in case the current version is
destroyed or lost
• Backups are copies of files stored for short or near-long-term
• Often performed on a somewhat frequent schedule
Archiving
• Used to preserve data for historical reference or potentially during
disasters
• Archives are usually the final version, stored for long-term, and generally
not copied over
• Often performed at the end of a project or during major milestones
Create
Document
Use
Store
Share
Preserve
Part of How To Attribute Creative Commons Photos by Foter, licensed CC BY SA 3.0
License research data openly
EUDAT licensing tool
Answer questions to determine which licence(s) are appropriate to use
https://ufal.github.io/public-license-selector/
Create
Document
Use
Store
Share
Preserve
Deposit in a data repository
http://databib.org
www.re3data.org
The Re3data catalogue can be searched to find a home for data
www.fosteropenscience.eu/
content/re3data-demo
Criteria for selecting a repository
• Better to use a domain specific repository if available
• Check they match particular data needs e.g. formats accepted, mixture of
Open and Restricted Access.
• Do they assign a persistent and globally unique identifier for sustainable
citations and to links back to particular researchers and grants?
• Look for certification as a ‘Trustworthy Digital Repository’ with an explicit
ambition to keep the data available in long term.
Icons to note open
access, licenses,
PIDs,
certificates…
What is a Persistent Identifier (PID)?
a long-lasting reference to a document, file or other object
• PIDs come in various forms e.g. ORCID, DOI, ISBN...
• Typically they’re actionable i.e. type it into web browser to access
• Many repositories will assign them on deposit
Questions?
Thank you!
For DCC resources see:
www.dcc.ac.uk/resources
Follow us on twitter:
@digitalcuration and #ukdcc

Más contenido relacionado

La actualidad más candente

Big Data... Big Analytics à travers les âges, les industries et les technologies
Big Data... Big Analytics à travers les âges, les industries et les technologiesBig Data... Big Analytics à travers les âges, les industries et les technologies
Big Data... Big Analytics à travers les âges, les industries et les technologiesHassan Lâasri
 
Creating a Data Management Plan
Creating a Data Management PlanCreating a Data Management Plan
Creating a Data Management PlanKristin Briney
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Simplilearn
 
Notes On Single View Of The Customer
Notes On Single View Of The CustomerNotes On Single View Of The Customer
Notes On Single View Of The CustomerAlan McSweeney
 
Datastorage in DNA
Datastorage in DNADatastorage in DNA
Datastorage in DNAAditya Nag
 
Business glossaries - The What, the Why, and the How
Business glossaries - The What, the Why, and the HowBusiness glossaries - The What, the Why, and the How
Business glossaries - The What, the Why, and the Howgeorgefirican
 
The importance of data
The importance of dataThe importance of data
The importance of dataAPNIC
 
Convincing Stakeholders Data Governance Is Essential
Convincing Stakeholders Data Governance Is EssentialConvincing Stakeholders Data Governance Is Essential
Convincing Stakeholders Data Governance Is EssentialDATAVERSITY
 
Data Analytics Project Presentation
Data Analytics Project PresentationData Analytics Project Presentation
Data Analytics Project PresentationRohit Vaze
 
Introduction to Open Science and EOSC
Introduction to Open Science and EOSCIntroduction to Open Science and EOSC
Introduction to Open Science and EOSCSarah Jones
 
DAS Slides: Data Quality Best Practices
DAS Slides: Data Quality Best PracticesDAS Slides: Data Quality Best Practices
DAS Slides: Data Quality Best PracticesDATAVERSITY
 
Linked Data: principles and examples
Linked Data: principles and examples Linked Data: principles and examples
Linked Data: principles and examples Victor de Boer
 
Research Lifecycles and RDM
Research Lifecycles and RDMResearch Lifecycles and RDM
Research Lifecycles and RDMMarieke Guy
 
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...Simplilearn
 

La actualidad más candente (20)

Big data
Big dataBig data
Big data
 
Big Data... Big Analytics à travers les âges, les industries et les technologies
Big Data... Big Analytics à travers les âges, les industries et les technologiesBig Data... Big Analytics à travers les âges, les industries et les technologies
Big Data... Big Analytics à travers les âges, les industries et les technologies
 
Creating a Data Management Plan
Creating a Data Management PlanCreating a Data Management Plan
Creating a Data Management Plan
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
 
Data Cleansing
Data CleansingData Cleansing
Data Cleansing
 
Notes On Single View Of The Customer
Notes On Single View Of The CustomerNotes On Single View Of The Customer
Notes On Single View Of The Customer
 
Datastorage in DNA
Datastorage in DNADatastorage in DNA
Datastorage in DNA
 
Business glossaries - The What, the Why, and the How
Business glossaries - The What, the Why, and the HowBusiness glossaries - The What, the Why, and the How
Business glossaries - The What, the Why, and the How
 
The importance of data
The importance of dataThe importance of data
The importance of data
 
Convincing Stakeholders Data Governance Is Essential
Convincing Stakeholders Data Governance Is EssentialConvincing Stakeholders Data Governance Is Essential
Convincing Stakeholders Data Governance Is Essential
 
Big data.
Big data.Big data.
Big data.
 
Data Analytics Project Presentation
Data Analytics Project PresentationData Analytics Project Presentation
Data Analytics Project Presentation
 
Introduction to Open Science and EOSC
Introduction to Open Science and EOSCIntroduction to Open Science and EOSC
Introduction to Open Science and EOSC
 
FAIR data overview
FAIR data overviewFAIR data overview
FAIR data overview
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Presentation on Big Data
Presentation on Big DataPresentation on Big Data
Presentation on Big Data
 
DAS Slides: Data Quality Best Practices
DAS Slides: Data Quality Best PracticesDAS Slides: Data Quality Best Practices
DAS Slides: Data Quality Best Practices
 
Linked Data: principles and examples
Linked Data: principles and examples Linked Data: principles and examples
Linked Data: principles and examples
 
Research Lifecycles and RDM
Research Lifecycles and RDMResearch Lifecycles and RDM
Research Lifecycles and RDM
 
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
 

Similar a OpenAIRE webinar: Principles of Research Data Management, with S. Venkataraman (DCC)

Basics of Research Data Management
Basics of Research Data ManagementBasics of Research Data Management
Basics of Research Data ManagementOpenAIRE
 
Data Archiving and Sharing
Data Archiving and SharingData Archiving and Sharing
Data Archiving and SharingC. Tobin Magle
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchersSarah Jones
 
File Formats for Preservation
File Formats for PreservationFile Formats for Preservation
File Formats for PreservationStephen Gray
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycleMarieke Guy
 
Take control of your PhD journey: Manage your research data according to best...
Take control of your PhD journey: Manage your research data according to best...Take control of your PhD journey: Manage your research data according to best...
Take control of your PhD journey: Manage your research data according to best...Lars Figenschou
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Managementdancrane_open
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersRebekah Cummings
 
Keep Calm and Curate
Keep Calm and CurateKeep Calm and Curate
Keep Calm and CurateGarethKnight
 
Data management for TA's
Data management for TA'sData management for TA's
Data management for TA'saaroncollie
 
Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016IzzyChad
 
FAIR BioData Management
FAIR BioData ManagementFAIR BioData Management
FAIR BioData ManagementUlrike Wittig
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016 Rebecca Raworth, MLIS
 
Research data management workshop April 2016
Research data management workshop April 2016Research data management workshop April 2016
Research data management workshop April 2016Rebecca Raworth, MLIS
 

Similar a OpenAIRE webinar: Principles of Research Data Management, with S. Venkataraman (DCC) (20)

Basics of Research Data Management
Basics of Research Data ManagementBasics of Research Data Management
Basics of Research Data Management
 
Data Archiving and Sharing
Data Archiving and SharingData Archiving and Sharing
Data Archiving and Sharing
 
Good Practice in Research Data Management
Good Practice in Research Data ManagementGood Practice in Research Data Management
Good Practice in Research Data Management
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchers
 
File Formats for Preservation
File Formats for PreservationFile Formats for Preservation
File Formats for Preservation
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycle
 
Take control of your PhD journey: Manage your research data according to best...
Take control of your PhD journey: Manage your research data according to best...Take control of your PhD journey: Manage your research data according to best...
Take control of your PhD journey: Manage your research data according to best...
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Management
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate Researchers
 
Keep Calm and Curate
Keep Calm and CurateKeep Calm and Curate
Keep Calm and Curate
 
Data management for TA's
Data management for TA'sData management for TA's
Data management for TA's
 
Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016
 
Resources for Research Data Managers - 2014-05-28 - University of Oxford
Resources for Research Data Managers - 2014-05-28 - University of OxfordResources for Research Data Managers - 2014-05-28 - University of Oxford
Resources for Research Data Managers - 2014-05-28 - University of Oxford
 
Data management
Data management Data management
Data management
 
FAIR BioData Management
FAIR BioData ManagementFAIR BioData Management
FAIR BioData Management
 
Introduction to RDM for trainee physicians
Introduction to RDM for trainee physiciansIntroduction to RDM for trainee physicians
Introduction to RDM for trainee physicians
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016
 
Research data management workshop April 2016
Research data management workshop April 2016Research data management workshop April 2016
Research data management workshop April 2016
 
Prototype Design of Open Access Institutional Repository
Prototype Design of Open Access Institutional RepositoryPrototype Design of Open Access Institutional Repository
Prototype Design of Open Access Institutional Repository
 

Más de OpenAIRE

10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community Call10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community CallOpenAIRE
 
9th Content Providers Community Call\
9th Content Providers Community Call\9th Content Providers Community Call\
9th Content Providers Community Call\OpenAIRE
 
OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)OpenAIRE
 
8th Content Providers Community Call
8th Content Providers Community Call8th Content Providers Community Call
8th Content Providers Community CallOpenAIRE
 
7th Content Providers Community Call
7th Content Providers Community Call7th Content Providers Community Call
7th Content Providers Community CallOpenAIRE
 
OpenAIRE PROVIDE Dashboard for Turkish repository managers
OpenAIRE PROVIDE Dashboard for Turkish repository managersOpenAIRE PROVIDE Dashboard for Turkish repository managers
OpenAIRE PROVIDE Dashboard for Turkish repository managersOpenAIRE
 
What will it cost to manage and share my data?
What will it cost to manage and share my data?What will it cost to manage and share my data?
What will it cost to manage and share my data?OpenAIRE
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)OpenAIRE
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)OpenAIRE
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)OpenAIRE
 
6th Content Providers Community Call
6th Content Providers Community Call6th Content Providers Community Call
6th Content Providers Community CallOpenAIRE
 
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing DataOpenAIRE
 
20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?OpenAIRE
 
20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open ScienceOpenAIRE
 
20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)OpenAIRE
 
20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open ScienceOpenAIRE
 
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing DataOpenAIRE
 
COVID-19: Activities, tools, best practice and contact points in Greece
 COVID-19: Activities, tools, best practice and contact points in Greece COVID-19: Activities, tools, best practice and contact points in Greece
COVID-19: Activities, tools, best practice and contact points in GreeceOpenAIRE
 
5th Content Providers Community Call
5th Content Providers Community Call5th Content Providers Community Call
5th Content Providers Community CallOpenAIRE
 
4th Content Providers Community Call
4th Content Providers Community Call4th Content Providers Community Call
4th Content Providers Community CallOpenAIRE
 

Más de OpenAIRE (20)

10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community Call10th OpenAIRE Content Providers Community Call
10th OpenAIRE Content Providers Community Call
 
9th Content Providers Community Call\
9th Content Providers Community Call\9th Content Providers Community Call\
9th Content Providers Community Call\
 
OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)OpenAIRE in the European Open Science Cloud (EOSC)
OpenAIRE in the European Open Science Cloud (EOSC)
 
8th Content Providers Community Call
8th Content Providers Community Call8th Content Providers Community Call
8th Content Providers Community Call
 
7th Content Providers Community Call
7th Content Providers Community Call7th Content Providers Community Call
7th Content Providers Community Call
 
OpenAIRE PROVIDE Dashboard for Turkish repository managers
OpenAIRE PROVIDE Dashboard for Turkish repository managersOpenAIRE PROVIDE Dashboard for Turkish repository managers
OpenAIRE PROVIDE Dashboard for Turkish repository managers
 
What will it cost to manage and share my data?
What will it cost to manage and share my data?What will it cost to manage and share my data?
What will it cost to manage and share my data?
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 3)
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 2)
 
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
Open Research Gateway for the ELIXIR-GR Infrastructure (Part 1)
 
6th Content Providers Community Call
6th Content Providers Community Call6th Content Providers Community Call
6th Content Providers Community Call
 
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200504_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
 
20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?20200504_Research Data & the GDPR: How Open is Open?
20200504_Research Data & the GDPR: How Open is Open?
 
20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science20200504_Data, Data Ownership and Open Science
20200504_Data, Data Ownership and Open Science
 
20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)20200429_Research Data & the GDPR: How Open is Open? (updated version)
20200429_Research Data & the GDPR: How Open is Open? (updated version)
 
20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science20200429_Data, Data Ownership and Open Science
20200429_Data, Data Ownership and Open Science
 
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
20200429_OpenAIRE Legal Policy Webinar: GDPR and Sharing Data
 
COVID-19: Activities, tools, best practice and contact points in Greece
 COVID-19: Activities, tools, best practice and contact points in Greece COVID-19: Activities, tools, best practice and contact points in Greece
COVID-19: Activities, tools, best practice and contact points in Greece
 
5th Content Providers Community Call
5th Content Providers Community Call5th Content Providers Community Call
5th Content Providers Community Call
 
4th Content Providers Community Call
4th Content Providers Community Call4th Content Providers Community Call
4th Content Providers Community Call
 

Último

A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Patrick Diehl
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 

Último (20)

A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?Is RISC-V ready for HPC workload? Maybe?
Is RISC-V ready for HPC workload? Maybe?
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 

OpenAIRE webinar: Principles of Research Data Management, with S. Venkataraman (DCC)

  • 1. This work is licensed under the Creative Commons Attribution 2.5 UK: Scotland License S. Venkataraman, PhD Research Data Specialist Digital Curation Centre s.venkataraman@ed.ac.uk 21st October 2019, Open Access Week Webinar Principles of Research Data Management
  • 2. About the DCC • Established in 2004 • Based in Edinburgh and Glasgow • Works at national and international levels • One of leading organisations in the world specialising in training, consultancy, policy making and advocacy in digital data management best practice and services provision • Involved in many international consortia, schools and projects • (Not involved in actual curation of any data!)
  • 3. Is there a reproducibility crisis? Baker, M. (2016) “1,500 scientists lift the lid on reproducibility”, Nature, 533:7604, http://www.nature.com/n ews/1-500-scientists-lift- the-lid-on- reproducibility-1.19970
  • 4. Why make data available?
  • 5. Who has heard of this before…? Image CC-BY-SA by SangyaPundir
  • 7. Slide CC-BY by Erik Schultes, Leiden UMC What FAIR means: 15 principles Comprehensive descriptions can be found at https://www.go-fair.org/fair- principles/
  • 8. Common misconceptions • FAIR data does not have to be open • The principles do not specify particular technologies or implementations e.g. semantic web • FAIR is not a standard to be followed or strict criteria – it’s a spectrum / continuum • It doesn’t only apply to the life sciences
  • 9. All research data Managed data FAIR data Open data the wild
  • 10. Increasing that which is FAIR & open Managed data FAIR data Open data the wild
  • 11. as open as possible, as closed as necessary Image: ‘Balancing rocks’ by Viewminder CC-BY-SA-ND www.flickr.com/photos/light_seeker/7780857224
  • 12. RDM & the Data LifecycleImage CC-BY-SA by Janneke Staaks www.flickr.com/photos/jannekestaaks/14411397343
  • 13. What is Research Data Management? “the active management and appraisal of data over the lifecycle of scholarly and scientific interest” Data management is part of good research practice Create Document Use Store Share Preserve
  • 15. Data creation tips • Ensure consent forms, licences and agreements don’t restrict opportunities to share data • Choose appropriate formats • Adopt a file naming convention • Create metadata and documentation as you go
  • 16. Ask for consent for data sharing If not, data centres won’t be able to accept the data – regardless of any conditions on the original grant. www.data-archive.ac.uk/create-manage/consent-ethics/consent?index=3
  • 17. Choose appropriate file formats Different formats are good for different things • open, lossless formats are more sustainable e.g. rtf, xml, tif, wav • proprietary and/or compressed formats are less preservable but are often in widespread use e.g. doc, jpg, mp3 One format for analysis then convert to a standard format Data centres may suggest preferred formats for deposit https://www.ukdataservice.ac.uk/manage-data/format/recommended-formats
  • 18. Type of data Recommended formats Acceptable formats Tabular data with extensive metadata variable labels, code labels, and defined missing values SPSS portable format (.por) delimited text and command ('setup') file (SPSS, Stata, SAS, etc.) structured text or mark-up file of metadata information, e.g. DDI XML file proprietary formats of statistical packages: SPSS (.sav), Stata (.dta), MS Access (.mdb/.accdb) Tabular data with minimal metadata column headings, variable names comma-separated values (.csv) tab-delimited file (.tab) delimited text with SQL data definition statements delimited text (.txt) with characters not present in data used as delimiters widely-used formats: MS Excel (.xls/.xlsx), MS Access (.mdb/.accdb), dBase (.dbf), OpenDocument Spreadsheet (.ods) Geospatial data vector and raster data ESRI Shapefile (.shp, .shx, .dbf, .prj, .sbx, .sbn optional) geo-referenced TIFF (.tif, .tfw) CAD data (.dwg) tabular GIS attribute data Geography Markup Language (.gml) ESRI Geodatabase format (.mdb) MapInfo Interchange Format (.mif) for vector data Keyhole Mark-up Language (.kml) Adobe Illustrator (.ai), CAD data (.dxf or .svg) binary formats of GIS and CAD packages Textual data Rich Text Format (.rtf) plain text, ASCII (.txt) eXtensible Mark-up Language (.xml) text according to an appropriate Document Type Definition (DTD) or schema Hypertext Mark-up Language (.html) widely-used formats: MS Word (.doc/.docx) some software-specific formats: NUD*IST, NVivo and ATLAS.ti Image data TIFF 6.0 uncompressed (.tif) JPEG (.jpeg, .jpg, .jp2) if original created in this format GIF (.gif) TIFF other versions (.tif, .tiff) RAW image format (.raw) Photoshop files (.psd) BMP (.bmp) PNG (.png) Adobe Portable Document Format (PDF/A, PDF) (.pdf) Audio data Free Lossless Audio Codec (FLAC) (.flac) MPEG-1 Audio Layer 3 (.mp3) if original created in this format Audio Interchange File Format (.aif) Waveform Audio Format (.wav) Video data MPEG-4 (.mp4) OGG video (.ogv, .ogg) motion JPEG 2000 (.mj2) AVCHD video (.avchd) Documentation and scripts Rich Text Format (.rtf) PDF/UA, PDF/A or PDF (.pdf) XHTML or HTML (.xhtml, .htm) OpenDocument Text (.odt) plain text (.txt) widely-used formats: MS Word (.doc/.docx), MS Excel (.xls/.xlsx) XML marked-up text (.xml) according to an appropriate DTD or schema, e.g. XHMTL 1.0 https://www.ukdataservice.ac.uk/manage-data/format/recommended-formats
  • 19. How will you organise your data? • Keep file and folder names short, but meaningful • Agree a method for versioning • Include dates in a set format e.g. YYYYMMDD • Avoid using non-alphanumeric characters in file names • Use hyphens or underscores not spaces e.g. day-sheet, day sheet • Order the elements in the most appropriate way to retrieve the record Example from ARM Climate Research Facility www.arm.gov/data/docs/plan
  • 21. Documentation Think about what is needed in order to evaluate, understand, and reuse the data. • Why was the data created? • Have you documented what you did and how? • Did you develop code to run analyses? If so, this should be kept and shared too. • Important to provide wider context for trust
  • 22. What are metadata? Metadata • Standardised • Structured • Machine and human readable Metadata helps to cite & disambiguate data Documentation aids reuse Metadata Documentation
  • 23. Metadata standards These can be general – such as Dublin Core Or discipline specific • Data Documentation Initiative (DDI) – social science • Ecological Metadata Language (EML) - ecology • Flexible Image Transport System (FITS) – astronomy Search for standards in catalogues like: http://rd-alliance.github.io/metadata-directory/ https://rdamsc.dcc.ac.uk/
  • 24. “MTBLS1: A metabolomic study of urinary changes in type 2 diabetes in……” Example courtesy of Ken Haug, European Bioinformatics Institute (EMBL-EBI) Controlled vocabularies
  • 25. e.g. SNOMED CT (clinical terms) or MeSH • Defined terms + taxonomy • Useful for selecting keywords to tag datasets • You can find many ontologies in the BARTOC catalogue and elsewhere ➢ Organism A ➢ Term A1 ➢ Term A2 ➢ Term A3 ➢ Term B1 ➢ Term B2 ➢ Term C4 ➢ . ➢ . ➢ . ➢ Term n ► Organism B ► Term A1 ► Term A2 ► Term A3 ► Term B1 ► Term B2 ► Term C4 ► . ► . ► . ► Term n …and ontologies?
  • 27. Where will you store the data? • Your own device (laptop, flash drive, server etc.) – And if you lose it? Or it breaks? • Departmental drives or university servers • “Cloud” storage – Do they care as much about your data as you do? The decision will be based on how sensitive your data are, how robust you need the storage to be, and who needs access to the data and when
  • 28. Third-party tools for collaboration ownCloud • Open source product with Dropbox-like functionality • Used by many universities and service providers to offer ‘approved’ solution https://owncloud.org Using Dropbox and other cloud services
  • 29. Backup and preservation – not the same thing! Backups • Used to take periodic snapshots of data in case the current version is destroyed or lost • Backups are copies of files stored for short or near-long-term • Often performed on a somewhat frequent schedule Archiving • Used to preserve data for historical reference or potentially during disasters • Archives are usually the final version, stored for long-term, and generally not copied over • Often performed at the end of a project or during major milestones
  • 31. Part of How To Attribute Creative Commons Photos by Foter, licensed CC BY SA 3.0 License research data openly
  • 32. EUDAT licensing tool Answer questions to determine which licence(s) are appropriate to use https://ufal.github.io/public-license-selector/
  • 34. Deposit in a data repository http://databib.org www.re3data.org The Re3data catalogue can be searched to find a home for data www.fosteropenscience.eu/ content/re3data-demo
  • 35. Criteria for selecting a repository • Better to use a domain specific repository if available • Check they match particular data needs e.g. formats accepted, mixture of Open and Restricted Access. • Do they assign a persistent and globally unique identifier for sustainable citations and to links back to particular researchers and grants? • Look for certification as a ‘Trustworthy Digital Repository’ with an explicit ambition to keep the data available in long term. Icons to note open access, licenses, PIDs, certificates…
  • 36. What is a Persistent Identifier (PID)? a long-lasting reference to a document, file or other object • PIDs come in various forms e.g. ORCID, DOI, ISBN... • Typically they’re actionable i.e. type it into web browser to access • Many repositories will assign them on deposit
  • 38. Thank you! For DCC resources see: www.dcc.ac.uk/resources Follow us on twitter: @digitalcuration and #ukdcc