SlideShare una empresa de Scribd logo
1 de 32
The benefits & practice of openness 
Sarah Jones 
Digital Curation Centre, University of Glasgow 
sarah.jones@glasgow.ac.uk 
Twitter: @sjDCC 
Boo(s)tcamp Open Science, KU Leuven, 24th October www.kuleuven.be/research/bootcamp
WHAT DOES IT MEAN 
TO BE OPEN? 
Image by biblioteekje CC-BY-NC-SA www.flickr.com/photos/biblioteekje/3992172265
What is open science? 
“science carried out and communicated in a 
manner which allows others to contribute, 
collaborate and add to the research effort, with 
all kinds of data, results and protocols made 
freely available at different stages of the 
research process.” 
Research Information Network, Open Science case studies 
www.rin.ac.uk/our-work/data-management-and-curation/ 
open-science-case-studies
Open methods 
• Documenting and sharing workflows and methods 
• Sharing code and tools to allow others to reproduce work 
• Using web based tools to facilitate collaboration and 
interaction from the outside world 
• Open netbook science – “when there is a URL to a 
laboratory notebook that is freely available and indexed 
on common search engines.” 
http://drexel-coas-elearning.blogspot.co.uk/2006/09/open-notebook-science.html
Open access to publications 
• Free, immediate, online access to the results of research 
• Make sure anyone can access your papers 
– Gold route: paying APCs to ensure publishers makes copy open 
– Green route: self-archiving Open Access copy in repository 
• Find out what your publisher allows on SHERPA RoMEO 
– www.sherpa.ac.uk/romeo
Open data 
“Open data and content can be freely used, 
modified and shared by anyone for any purpose” 
http://opendefinition.org 
Tim Berners-Lee’s proposal for five star open data - http://5stardata.info 
make your stuff available on the Web (whatever format) under an open licence 
make it available as structured data (e.g. Excel instead of a scan of a table) 
use non-proprietary formats (e.g. CSV instead of Excel) 
use URIs to denote things, so that people can point at your stuff 
link your data to other data to provide context
Openness at every stage 
Design 
www.myexperiment.org 
Experiment 
Release 
Publication Analysis 
www.taverna.org.uk 
www.labtrove.org 
Open science image CC BY-SA 3.0 by Greg Emmerich www.flickr.com/photos/gemmerich/6365692655 
impactstory.org 
www.datacite.org 
https://github.com 
www.sherpa.ac.uk/romeo 
http://openwetware.org
WHY SHOULD YOU BE OPEN? 
Image by wonderwebby CC-BY-NC-SA www.flickr.com/photos/wonderwebby/2723279491
It’s part of good research practice
Science as an open enterprise 
“Much of the remarkable growth of 
scientific understanding in recent 
centuries is due to open practices; open 
communication and deliberation sit at 
the heart of scientific practice.” 
Royal Society report calls for ‘intelligent 
openness’ whereby data are accessible, 
intelligible, assessable and usable. 
https://royalsociety.org/policy/projects/science-public-enterprise/Report
Some benefits of openness 
• You can access relevant literature – not behind pay walls 
• Ensures research is transparent and reproducible 
• Increased visibility, usage and impact of your work 
• New collaborations and research partnerships 
• Ensure long-term access to your outputs 
• Help increase the efficiency of research
Saving wasted time 
OA helps to reduce time spent finding/accessing material: 
“If around 60 minutes were characteristic for researchers 
(the average time spent trying to access the last research 
article they had difficulty accessing), then in the current 
environment the time spent dealing with research article 
access difficulties might be costing around DKK 540 
million (EUR 72 million) per year among specialist 
researchers in Denmark alone.” 
Access to research and technical information in Denmark, 
Houghton, Swan & Brown (2011) 
http://eprints.ecs.soton.ac.uk/22603
Cut down on academic fraud 
www.nature.com/news/2011/111101/full/479015a.html
Validation of results 
“It was a mistake in a spreadsheet that could 
have been easily overlooked: a few rows left 
out of an equation to average the values in a 
column. 
The spreadsheet was used to draw the 
conclusion of an influential 2010 economics 
paper: that public debt of more than 90% of 
GDP slows down growth. This conclusion was 
later cited by the International Monetary Fund 
and the UK Treasury to justify programmes 
of austerity that have arguably led to riots, 
poverty and lost jobs.” 
www.guardian.co.uk/politics/2013/apr/18/uncovered-error-george-osborne-austerity
Acceleration of the research process 
“As more papers are deposited and more scientists 
use the repository, the time between an article being 
deposited and being cited has been shrinking 
dramatically, year upon year. This is important for 
research uptake and progress, because it means that 
in this area of research, where articles are made 
available at – or frequently before – publication, the 
research cycle is accelerating.” 
Open Access: Why should we have it? Alma Swan 
www.keyperspectives.co.uk
More scientific breakthroughs 
“It was unbelievable. Its not science 
the way most of us have practiced in 
our careers. But we all realised that 
we would never get biomarkers unless 
all of us parked our egos and 
intellectual property noses outside 
the door and agreed that all of our 
data would be public immediately.” 
Dr John Trojanowski, University of Pennsylvania 
www.nytimes.com/2010/08/13/health/research/13alzheimer.html?pagewanted=all&_r=0
A citation advantage 
A study that analysed the citation counts of 10,555 papers on gene 
expression studies that created microarray data, showed: 
“studies that made data available in a public repository 
received 9% more citations than similar studies for 
which the data was not made available” 
Data reuse and the open data citation advantage, 
Piwowar, H. & Vision, T. https://peerj.com/articles/175
Increased use and economic benefit 
The case of NASA Landsat satellite imagery of the Earth’s surface: 
Up to 2008 
• Sold through the US Geological 
Survey for US$600 per scene 
• Sales of 19,000 scenes per year 
• Annual revenue of $11.4 million 
Since 2009 
• Freely available over the internet 
• Google Earth now uses the images 
• Transmission of 2,100,000 
scenes per year. 
• Estimated to have created value for 
the environmental management 
industry of $935 million, with direct 
benefit of more than $100 million 
per year to the US economy 
• Has stimulated the development of 
applications from a large number of 
companies worldwide 
http://earthobservatory.nasa.gov/IOTD/view.php?id=83394&src=ve
But there are also opportunity costs 
By Emilio Bruna 
http://brunalab.org/blog/2014/09/04/the-opportunity- 
cost-of-my-openscience-was-35-hours-690 
For his most recent paper: 
1. Double checking the main dataset and 
reformatting to submit to Dryad: 5 hours 
2. Creating complementary file and preparing 
metadata: 3 hours 
3. Submission of these two files and the 
metadata to Dryad: 45 minutes 
4. Preparing a map of the locations: 1 hour 
5. Submission of map to Figshare: 15 minutes 
6. Cleaning up and documenting the code, 
uploading it to GitHub: 25 hours 
7. Cost of archiving in Dryad: US$90 
8. Page Charges: $600
So what needs to change? 
Conclusions from Emilio Bruna: 
• Develop a better system of incentives from the 
community for archiving data and code 
• Teach our students how to do this NOW - it’s much easier 
if you develop good habits early 
• Minimise the actual and opportunity costs 
We need to stop telling people “You should” and get 
better at telling people “Here’s how”
HOW TO PRACTICE OPEN SCIENCE 
Image by Paul Downey CC-BY www.flickr.com/photos/psd/3925801816
Conducting science in the open: UsefulChem 
http://usefulchem.wikispaces.com/EXP284
Collaboration & sharing: MyExperiment 
www.myexperiment.org/workflows/16.html
Open access publication 
Immediate open 
access (via 
publisher) 
Pay Article Processing 
Charge (APC) - if required 
GOLD OA ROUTE 
Publish in an 
open access 
journal 
IF OPTION EXISTS 
e.g. a ‘hybrid’ journal 
(a subscription-based journal 
that has a paid open access 
GREEN OA ROUTE 
Self-archive in a 
repository, based 
on publisher policy. 
option) Immediate open 
access (via 
publisher) 
Pay Article 
Processing Charge 
(APC) 
Immediate or delayed 
open access, depending 
on publisher’s policy. 
Search for a repository 
http://opendoar.org 
Publish in a 
subscription-based 
journal 
Researcher 
decides where 
to publish 
Check SHERPA RoMEO 
to see what OA and self-archiving 
options are 
available 
www.sherpa.ac.uk/romeo
Sherpah RoMEO
Deposit in your local repository! 
• Speak to the library and deposit in Lirias - 
https://lirias.kuleuven.be 
• Consider other relevant repositories for your field too 
e.g. Arxiv - http://arxiv.org 
• Check OpenDOAR for examples - 
http://www.opendoar.org
OpenAIRE 
Open Access Infrastructure for research in Europe 
http://vimeo.com/108790101 
aggregates data on OA publications 
mines & enriches it content by linking thing together 
provides services & APIs e.g. 
to generate publication lists 
www.openaire.eu
How to make data open? 
1. Choose your dataset(s) 
• What can you may open? You may need to revisit this step if you 
encounter problems later. 
2. Apply an open license 
• Determine what IP exists. Apply a suitable licence e.g. CC-BY 
3. Make the data available 
• Provide the data in a suitable format. Use repositories. 
4. Make it discoverable 
• Post on the web, register in catalogues… 
https://okfn.org
Licensing research data 
Outlines pros and cons of each approach 
and gives practical advice on how to 
implement your licence 
CREATIVE COMMONS LIMITATIONS 
NC Non-Commercial 
What counts as commercial? 
SA Share Alike 
Reduces interoperability 
ND No Derivatives 
Severely restricts use 
Horizon 2020 Open 
Access guidelines point to: 
or 
www.dcc.ac.uk/resources/how-guides/license-research-data
Potential repositories 
http://service.re3data.org/search 
http://databib.org 
Zenodo 
• OpenAIRE-CERN joint effort 
• Multidisciplinary repository 
• Multiple data types 
– Publications 
– Long tail of research data 
• Citable data (DOI) 
• Links funding, publications, 
data & software 
www.zenodo.org
Metadata standards 
Use relevant standards for interoperability 
www.dcc.ac.uk/resources/metadata-standards
Thanks – any questions? 
Follow us on Twitter: 
@fosterscience and #fosteropenscience 
FOSTER training events and materials: 
www.fosteropenscience.eu/events

Más contenido relacionado

La actualidad más candente

Workshop at Oxford on publishing for early career researchers - April 2011
Workshop at Oxford on publishing for early career researchers - April 2011Workshop at Oxford on publishing for early career researchers - April 2011
Workshop at Oxford on publishing for early career researchers - April 2011
Jisc
 

La actualidad más candente (20)

Understanding Open Science: Definitions and framework
Understanding Open Science: Definitions and framework Understanding Open Science: Definitions and framework
Understanding Open Science: Definitions and framework
 
Open Science: Tools and platforms
Open Science: Tools and platformsOpen Science: Tools and platforms
Open Science: Tools and platforms
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, Potsdam
 
How can we ensure research data is re-usable? The role of Publishers in Resea...
How can we ensure research data is re-usable? The role of Publishers in Resea...How can we ensure research data is re-usable? The role of Publishers in Resea...
How can we ensure research data is re-usable? The role of Publishers in Resea...
 
The culture of researchData
The culture of researchDataThe culture of researchData
The culture of researchData
 
From Open Data to Open Science, by Geoffrey Boulton
 From Open Data to Open Science, by Geoffrey Boulton From Open Data to Open Science, by Geoffrey Boulton
From Open Data to Open Science, by Geoffrey Boulton
 
Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014Why science needs open data – Jisc and CNI conference 10 July 2014
Why science needs open data – Jisc and CNI conference 10 July 2014
 
Open science / open research
Open science / open researchOpen science / open research
Open science / open research
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina Leonelli
 
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
 
Liberating facts from the scientific literature - Jisc Digifest 2016
Liberating facts from the scientific literature - Jisc Digifest 2016Liberating facts from the scientific literature - Jisc Digifest 2016
Liberating facts from the scientific literature - Jisc Digifest 2016
 
Open Science (publishing) as-a-Service (Presentation by Paolo Manghi at the ...
Open Science (publishing) as-a-Service (Presentation by Paolo Manghi at the ...Open Science (publishing) as-a-Service (Presentation by Paolo Manghi at the ...
Open Science (publishing) as-a-Service (Presentation by Paolo Manghi at the ...
 
Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?Open Data in a Big Data World: easy to say, but hard to do?
Open Data in a Big Data World: easy to say, but hard to do?
 
Introduction to open access and how it helps in your research and increases t...
Introduction to open access and how it helps in your research and increases t...Introduction to open access and how it helps in your research and increases t...
Introduction to open access and how it helps in your research and increases t...
 
OpenAIRE: eInfrastructure for Open Science
OpenAIRE: eInfrastructure for Open ScienceOpenAIRE: eInfrastructure for Open Science
OpenAIRE: eInfrastructure for Open Science
 
Workshop at Oxford on publishing for early career researchers - April 2011
Workshop at Oxford on publishing for early career researchers - April 2011Workshop at Oxford on publishing for early career researchers - April 2011
Workshop at Oxford on publishing for early career researchers - April 2011
 
Data management: The new frontier for libraries
Data management: The new frontier for librariesData management: The new frontier for libraries
Data management: The new frontier for libraries
 
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
 
Research Data Management and the brave new world, By Paul Ayris
Research Data Management and the brave new world, By Paul AyrisResearch Data Management and the brave new world, By Paul Ayris
Research Data Management and the brave new world, By Paul Ayris
 
Open by default: the challenges of research data in Europe
Open by default: the challenges of research data in EuropeOpen by default: the challenges of research data in Europe
Open by default: the challenges of research data in Europe
 

Destacado

A Database Called The Web
A Database Called The WebA Database Called The Web
A Database Called The Web
Nathan Yergler
 
Open Content Library @ Taiwan
Open Content Library @ TaiwanOpen Content Library @ Taiwan
Open Content Library @ Taiwan
Jon Phillips
 
Semantic Search on the Public Web with Creative Commons
Semantic Search on the Public Web with Creative CommonsSemantic Search on the Public Web with Creative Commons
Semantic Search on the Public Web with Creative Commons
Mike Linksvayer
 

Destacado (20)

VOCAB: Week Nine Slides
VOCAB: Week Nine SlidesVOCAB: Week Nine Slides
VOCAB: Week Nine Slides
 
iMoot 2014 - Developing a Course in the Open: A Case Study
 iMoot 2014 - Developing a Course in the Open: A Case Study iMoot 2014 - Developing a Course in the Open: A Case Study
iMoot 2014 - Developing a Course in the Open: A Case Study
 
Creative Commons birthday Lessig Outline
Creative Commons birthday Lessig OutlineCreative Commons birthday Lessig Outline
Creative Commons birthday Lessig Outline
 
Diseño Libre
Diseño LibreDiseño Libre
Diseño Libre
 
A Database Called The Web
A Database Called The WebA Database Called The Web
A Database Called The Web
 
CC BUS
CC BUSCC BUS
CC BUS
 
Phillips Remix Cycle Pixelodeon 2007
Phillips   Remix Cycle   Pixelodeon 2007Phillips   Remix Cycle   Pixelodeon 2007
Phillips Remix Cycle Pixelodeon 2007
 
Digitópolis I: Diseño de Aplicaciones Interactivas para Creativos y Comunicad...
Digitópolis I: Diseño de Aplicaciones Interactivas para Creativos y Comunicad...Digitópolis I: Diseño de Aplicaciones Interactivas para Creativos y Comunicad...
Digitópolis I: Diseño de Aplicaciones Interactivas para Creativos y Comunicad...
 
Creative Commons: What every Educator needs to know
Creative Commons: What every Educator needs to knowCreative Commons: What every Educator needs to know
Creative Commons: What every Educator needs to know
 
Open Content Library @ Taiwan
Open Content Library @ TaiwanOpen Content Library @ Taiwan
Open Content Library @ Taiwan
 
Open Content Library LGM 2007
Open Content Library LGM 2007Open Content Library LGM 2007
Open Content Library LGM 2007
 
Enabling (Open) Scholarship
Enabling (Open) ScholarshipEnabling (Open) Scholarship
Enabling (Open) Scholarship
 
ChiLUG - Open Source and Creative Commons
ChiLUG - Open Source and Creative CommonsChiLUG - Open Source and Creative Commons
ChiLUG - Open Source and Creative Commons
 
LiveContent v1.0 CC Salon Presentation
LiveContent v1.0 CC Salon PresentationLiveContent v1.0 CC Salon Presentation
LiveContent v1.0 CC Salon Presentation
 
Phillips Building Communities Isummit 2007
Phillips   Building Communities   Isummit 2007Phillips   Building Communities   Isummit 2007
Phillips Building Communities Isummit 2007
 
Emerging Fields of Application for RMI: Search Engines and Users
Emerging Fields of Application for RMI: Search Engines and UsersEmerging Fields of Application for RMI: Search Engines and Users
Emerging Fields of Application for RMI: Search Engines and Users
 
Open Content Library Guadec 2007
Open Content Library Guadec 2007Open Content Library Guadec 2007
Open Content Library Guadec 2007
 
Semantic Search on the Public Web with Creative Commons
Semantic Search on the Public Web with Creative CommonsSemantic Search on the Public Web with Creative Commons
Semantic Search on the Public Web with Creative Commons
 
Refining Copyright Oscon 2007
Refining Copyright Oscon 2007Refining Copyright Oscon 2007
Refining Copyright Oscon 2007
 
Creative Commons: Enabling Access to Knowledge
Creative Commons: Enabling Access to KnowledgeCreative Commons: Enabling Access to Knowledge
Creative Commons: Enabling Access to Knowledge
 

Similar a Benefits and practice of open science

Science20brussels osimo april2013
Science20brussels osimo april2013Science20brussels osimo april2013
Science20brussels osimo april2013
osimod
 
Presentation of science 2.0 at European Astronomical Society
Presentation of science 2.0 at European Astronomical SocietyPresentation of science 2.0 at European Astronomical Society
Presentation of science 2.0 at European Astronomical Society
osimod
 

Similar a Benefits and practice of open science (20)

Learn to speak open
Learn to speak openLearn to speak open
Learn to speak open
 
Introduction to Open Science and EOSC
Introduction to Open Science and EOSCIntroduction to Open Science and EOSC
Introduction to Open Science and EOSC
 
Open science platforms
Open science platformsOpen science platforms
Open science platforms
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing data
 
Managing and sharing data: lessons from the European context
Managing and sharing data: lessons from the European contextManaging and sharing data: lessons from the European context
Managing and sharing data: lessons from the European context
 
Scholarship in a connected world: New ways to know, new ways to show
Scholarship in a connected world: New ways to know, new ways to showScholarship in a connected world: New ways to know, new ways to show
Scholarship in a connected world: New ways to know, new ways to show
 
Open access for researchers, policy makers and research managers - Short ver...
Open access  for researchers, policy makers and research managers - Short ver...Open access  for researchers, policy makers and research managers - Short ver...
Open access for researchers, policy makers and research managers - Short ver...
 
The OpenCon Intro to Open Data
The OpenCon Intro to Open DataThe OpenCon Intro to Open Data
The OpenCon Intro to Open Data
 
Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | Future
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing data
 
Science20brussels osimo april2013
Science20brussels osimo april2013Science20brussels osimo april2013
Science20brussels osimo april2013
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
 
Final Johnson Research Libraries and Computational Research
Final Johnson Research Libraries and Computational ResearchFinal Johnson Research Libraries and Computational Research
Final Johnson Research Libraries and Computational Research
 
Winning Horizon 2020 with Open Science
Winning Horizon 2020 with Open ScienceWinning Horizon 2020 with Open Science
Winning Horizon 2020 with Open Science
 
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...
 
Open Science - Free Science for a free Society
Open Science - Free Science for a free SocietyOpen Science - Free Science for a free Society
Open Science - Free Science for a free Society
 
Science 2.0
Science 2.0Science 2.0
Science 2.0
 
Presentation of science 2.0 at European Astronomical Society
Presentation of science 2.0 at European Astronomical SocietyPresentation of science 2.0 at European Astronomical Society
Presentation of science 2.0 at European Astronomical Society
 
Open Data (and Software, and other Research Artefacts) - A proper management
Open Data (and Software, and other Research Artefacts) -A proper managementOpen Data (and Software, and other Research Artefacts) -A proper management
Open Data (and Software, and other Research Artefacts) - A proper management
 
Scott Edmunds: Using FAIR principles for more Open & Democratic Science
Scott Edmunds: Using FAIR principles for more Open & Democratic ScienceScott Edmunds: Using FAIR principles for more Open & Democratic Science
Scott Edmunds: Using FAIR principles for more Open & Democratic Science
 

Más de Sarah Jones

Más de Sarah Jones (20)

Data training tips and tricks
Data training tips and tricksData training tips and tricks
Data training tips and tricks
 
EOSC and libraries
EOSC and librariesEOSC and libraries
EOSC and libraries
 
EOSC Association priorities and activities
EOSC Association priorities and activitiesEOSC Association priorities and activities
EOSC Association priorities and activities
 
Reflections on Open Science
Reflections on Open ScienceReflections on Open Science
Reflections on Open Science
 
MAR comments analysis
MAR comments analysisMAR comments analysis
MAR comments analysis
 
EOSC-MAR-update.pptx
EOSC-MAR-update.pptxEOSC-MAR-update.pptx
EOSC-MAR-update.pptx
 
Intro-EOSC.pptx
Intro-EOSC.pptxIntro-EOSC.pptx
Intro-EOSC.pptx
 
Why is EOSC so hard?
Why is EOSC so hard?Why is EOSC so hard?
Why is EOSC so hard?
 
The future of FAIR
The future of FAIRThe future of FAIR
The future of FAIR
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchers
 
Is Europe ready for Open Science
Is Europe ready for Open ScienceIs Europe ready for Open Science
Is Europe ready for Open Science
 
DMPonline: 10 years, 10 lessons
DMPonline: 10 years, 10 lessonsDMPonline: 10 years, 10 lessons
DMPonline: 10 years, 10 lessons
 
Do & don't of supporting Open Science
Do & don't of supporting Open ScienceDo & don't of supporting Open Science
Do & don't of supporting Open Science
 
Why institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIRWhy institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIR
 
It takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commonsIt takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commons
 
DMPTuuli - what's new?
DMPTuuli - what's new?DMPTuuli - what's new?
DMPTuuli - what's new?
 
DCC and FAIR initiatives
DCC and FAIR initiativesDCC and FAIR initiatives
DCC and FAIR initiatives
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Reflections on EOSC through the mirror of ARDC
Reflections on EOSC through the mirror of ARDCReflections on EOSC through the mirror of ARDC
Reflections on EOSC through the mirror of ARDC
 
Future EOSC roadmap
Future EOSC roadmapFuture EOSC roadmap
Future EOSC roadmap
 

Último

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Último (20)

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 

Benefits and practice of open science

  • 1. The benefits & practice of openness Sarah Jones Digital Curation Centre, University of Glasgow sarah.jones@glasgow.ac.uk Twitter: @sjDCC Boo(s)tcamp Open Science, KU Leuven, 24th October www.kuleuven.be/research/bootcamp
  • 2. WHAT DOES IT MEAN TO BE OPEN? Image by biblioteekje CC-BY-NC-SA www.flickr.com/photos/biblioteekje/3992172265
  • 3. What is open science? “science carried out and communicated in a manner which allows others to contribute, collaborate and add to the research effort, with all kinds of data, results and protocols made freely available at different stages of the research process.” Research Information Network, Open Science case studies www.rin.ac.uk/our-work/data-management-and-curation/ open-science-case-studies
  • 4. Open methods • Documenting and sharing workflows and methods • Sharing code and tools to allow others to reproduce work • Using web based tools to facilitate collaboration and interaction from the outside world • Open netbook science – “when there is a URL to a laboratory notebook that is freely available and indexed on common search engines.” http://drexel-coas-elearning.blogspot.co.uk/2006/09/open-notebook-science.html
  • 5. Open access to publications • Free, immediate, online access to the results of research • Make sure anyone can access your papers – Gold route: paying APCs to ensure publishers makes copy open – Green route: self-archiving Open Access copy in repository • Find out what your publisher allows on SHERPA RoMEO – www.sherpa.ac.uk/romeo
  • 6. Open data “Open data and content can be freely used, modified and shared by anyone for any purpose” http://opendefinition.org Tim Berners-Lee’s proposal for five star open data - http://5stardata.info make your stuff available on the Web (whatever format) under an open licence make it available as structured data (e.g. Excel instead of a scan of a table) use non-proprietary formats (e.g. CSV instead of Excel) use URIs to denote things, so that people can point at your stuff link your data to other data to provide context
  • 7. Openness at every stage Design www.myexperiment.org Experiment Release Publication Analysis www.taverna.org.uk www.labtrove.org Open science image CC BY-SA 3.0 by Greg Emmerich www.flickr.com/photos/gemmerich/6365692655 impactstory.org www.datacite.org https://github.com www.sherpa.ac.uk/romeo http://openwetware.org
  • 8. WHY SHOULD YOU BE OPEN? Image by wonderwebby CC-BY-NC-SA www.flickr.com/photos/wonderwebby/2723279491
  • 9. It’s part of good research practice
  • 10. Science as an open enterprise “Much of the remarkable growth of scientific understanding in recent centuries is due to open practices; open communication and deliberation sit at the heart of scientific practice.” Royal Society report calls for ‘intelligent openness’ whereby data are accessible, intelligible, assessable and usable. https://royalsociety.org/policy/projects/science-public-enterprise/Report
  • 11. Some benefits of openness • You can access relevant literature – not behind pay walls • Ensures research is transparent and reproducible • Increased visibility, usage and impact of your work • New collaborations and research partnerships • Ensure long-term access to your outputs • Help increase the efficiency of research
  • 12. Saving wasted time OA helps to reduce time spent finding/accessing material: “If around 60 minutes were characteristic for researchers (the average time spent trying to access the last research article they had difficulty accessing), then in the current environment the time spent dealing with research article access difficulties might be costing around DKK 540 million (EUR 72 million) per year among specialist researchers in Denmark alone.” Access to research and technical information in Denmark, Houghton, Swan & Brown (2011) http://eprints.ecs.soton.ac.uk/22603
  • 13. Cut down on academic fraud www.nature.com/news/2011/111101/full/479015a.html
  • 14. Validation of results “It was a mistake in a spreadsheet that could have been easily overlooked: a few rows left out of an equation to average the values in a column. The spreadsheet was used to draw the conclusion of an influential 2010 economics paper: that public debt of more than 90% of GDP slows down growth. This conclusion was later cited by the International Monetary Fund and the UK Treasury to justify programmes of austerity that have arguably led to riots, poverty and lost jobs.” www.guardian.co.uk/politics/2013/apr/18/uncovered-error-george-osborne-austerity
  • 15. Acceleration of the research process “As more papers are deposited and more scientists use the repository, the time between an article being deposited and being cited has been shrinking dramatically, year upon year. This is important for research uptake and progress, because it means that in this area of research, where articles are made available at – or frequently before – publication, the research cycle is accelerating.” Open Access: Why should we have it? Alma Swan www.keyperspectives.co.uk
  • 16. More scientific breakthroughs “It was unbelievable. Its not science the way most of us have practiced in our careers. But we all realised that we would never get biomarkers unless all of us parked our egos and intellectual property noses outside the door and agreed that all of our data would be public immediately.” Dr John Trojanowski, University of Pennsylvania www.nytimes.com/2010/08/13/health/research/13alzheimer.html?pagewanted=all&_r=0
  • 17. A citation advantage A study that analysed the citation counts of 10,555 papers on gene expression studies that created microarray data, showed: “studies that made data available in a public repository received 9% more citations than similar studies for which the data was not made available” Data reuse and the open data citation advantage, Piwowar, H. & Vision, T. https://peerj.com/articles/175
  • 18. Increased use and economic benefit The case of NASA Landsat satellite imagery of the Earth’s surface: Up to 2008 • Sold through the US Geological Survey for US$600 per scene • Sales of 19,000 scenes per year • Annual revenue of $11.4 million Since 2009 • Freely available over the internet • Google Earth now uses the images • Transmission of 2,100,000 scenes per year. • Estimated to have created value for the environmental management industry of $935 million, with direct benefit of more than $100 million per year to the US economy • Has stimulated the development of applications from a large number of companies worldwide http://earthobservatory.nasa.gov/IOTD/view.php?id=83394&src=ve
  • 19. But there are also opportunity costs By Emilio Bruna http://brunalab.org/blog/2014/09/04/the-opportunity- cost-of-my-openscience-was-35-hours-690 For his most recent paper: 1. Double checking the main dataset and reformatting to submit to Dryad: 5 hours 2. Creating complementary file and preparing metadata: 3 hours 3. Submission of these two files and the metadata to Dryad: 45 minutes 4. Preparing a map of the locations: 1 hour 5. Submission of map to Figshare: 15 minutes 6. Cleaning up and documenting the code, uploading it to GitHub: 25 hours 7. Cost of archiving in Dryad: US$90 8. Page Charges: $600
  • 20. So what needs to change? Conclusions from Emilio Bruna: • Develop a better system of incentives from the community for archiving data and code • Teach our students how to do this NOW - it’s much easier if you develop good habits early • Minimise the actual and opportunity costs We need to stop telling people “You should” and get better at telling people “Here’s how”
  • 21. HOW TO PRACTICE OPEN SCIENCE Image by Paul Downey CC-BY www.flickr.com/photos/psd/3925801816
  • 22. Conducting science in the open: UsefulChem http://usefulchem.wikispaces.com/EXP284
  • 23. Collaboration & sharing: MyExperiment www.myexperiment.org/workflows/16.html
  • 24. Open access publication Immediate open access (via publisher) Pay Article Processing Charge (APC) - if required GOLD OA ROUTE Publish in an open access journal IF OPTION EXISTS e.g. a ‘hybrid’ journal (a subscription-based journal that has a paid open access GREEN OA ROUTE Self-archive in a repository, based on publisher policy. option) Immediate open access (via publisher) Pay Article Processing Charge (APC) Immediate or delayed open access, depending on publisher’s policy. Search for a repository http://opendoar.org Publish in a subscription-based journal Researcher decides where to publish Check SHERPA RoMEO to see what OA and self-archiving options are available www.sherpa.ac.uk/romeo
  • 26. Deposit in your local repository! • Speak to the library and deposit in Lirias - https://lirias.kuleuven.be • Consider other relevant repositories for your field too e.g. Arxiv - http://arxiv.org • Check OpenDOAR for examples - http://www.opendoar.org
  • 27. OpenAIRE Open Access Infrastructure for research in Europe http://vimeo.com/108790101 aggregates data on OA publications mines & enriches it content by linking thing together provides services & APIs e.g. to generate publication lists www.openaire.eu
  • 28. How to make data open? 1. Choose your dataset(s) • What can you may open? You may need to revisit this step if you encounter problems later. 2. Apply an open license • Determine what IP exists. Apply a suitable licence e.g. CC-BY 3. Make the data available • Provide the data in a suitable format. Use repositories. 4. Make it discoverable • Post on the web, register in catalogues… https://okfn.org
  • 29. Licensing research data Outlines pros and cons of each approach and gives practical advice on how to implement your licence CREATIVE COMMONS LIMITATIONS NC Non-Commercial What counts as commercial? SA Share Alike Reduces interoperability ND No Derivatives Severely restricts use Horizon 2020 Open Access guidelines point to: or www.dcc.ac.uk/resources/how-guides/license-research-data
  • 30. Potential repositories http://service.re3data.org/search http://databib.org Zenodo • OpenAIRE-CERN joint effort • Multidisciplinary repository • Multiple data types – Publications – Long tail of research data • Citable data (DOI) • Links funding, publications, data & software www.zenodo.org
  • 31. Metadata standards Use relevant standards for interoperability www.dcc.ac.uk/resources/metadata-standards
  • 32. Thanks – any questions? Follow us on Twitter: @fosterscience and #fosteropenscience FOSTER training events and materials: www.fosteropenscience.eu/events

Notas del editor

  1. Initially I want to consider what it means to be open
  2. The Research Information Network ran a series of open science case studies a few years ago to look at openness across different disciplines. These included astronomy, bioinformatics, chemistry, clinical neuroimaging, epidemiology and language technology. Through this they came up with the definition. They defined open science as … The first aspect is about conducting science in an open way to encourage others to engage with it, and also sharing the outputs at various stages. I’ll unpack this definition over the next few slides
  3. Part of open science is working openly – having open methods. This involves documenting workflows and procedures so others can find and reuse your methods, sharing code and tools so others can reproduce your work, and using web based tools such as blogs and wikis to facilitate collaboration and interaction with others. You may come across the terms ‘open netbook science’. This was coined by Jean-Claude Bradley. It’s about having an online labnotebook that others can find and follow.
  4. Open science is also about providing open access to publications. This is well-established concept now, so one you’re probably familiar with. Open access is about ensuring free, immediate, online access to the results of research. There are two main routes to ensuring open access – gold where you pay publishers to make your article freely accessible so it’s not only available to those with a subscription, and green where you self-archive a copy (usually a pre-print version) in a repository. You can see what your publisher allows via Sherpa RoMEO – search for the journal title and it will tell you if there are gold, paid-for options, or what is permitted in terms of green routes (i.e. what version can be archived and under what embargo)
  5. Open science is also about making data available. Open data is data that can be used, modified and shared by anyone for any purpose. So for example it can be mined, exploited, reproduced, commercialised... Data under a non-commercial license wouldn’t be open as you’re restricting how it can be used and by whom. There are different levels of openness. Making content available on the web, in any format, under an open license is open data. It’s not necessarily that reusable though. By making it available as structured data, in non-proprietary formats, using URIs and linking to other data for context, makes it much more valuable.
  6. So we’re talking about being open at every stage, right from designing your research to conducting experiments and analysis, publishing results and releasing the associated outputs. There are tools that can help at various stages and I’ve listed a few here. Taverna for example, help with documenting workflows, while MyExperiment is a VLE that allows you to share workflows and collaborate. LabTrove is a blogging platform that integrates with instruments so you can automatically publish documentation, and OpenWetWare is a wiki to share information, know-how, and research protocols in biology & biological engineering. GitHub is a code repository to handle versioning and allow others to collaborate / reuse code. RoMEO is a lit of journal policies, re3data.org is a list of data repositories and DataCite and Impact story help you to track the citation, reuse and impact of your work.
  7. In the second half of the talk I’ve focus on reasons why you should be open – specifically the benefits and rewards
  8. I like the quote from Ewan Birney that it was never acceptable to publish papers without making data available. If you can evidence and validate your findings, how do others know that they’re right?
  9. The Royal Society report ‘Science as an Open Enterprise’ makes a similar point. This emphasises that much of the growth of scientific understanding is due to open practices. Being open about your work and encouraging feedback from others is at the heart of scientific practice. The report calls for ‘intelligent openness’ – data shouldn’t just be accessible, they need to be intelligible by others so they can assess and reuse them.
  10. What are the benefits to openness? When publications are open, it means you can access all the relevant literature regardless of whether you have a subscription or not. Open sharing also means that research is transparant and reproducible – you can validate results By sharing you will increase your visibility and the reuse of your work, which in turn leads to more collaboration and partnerships By depositing in repositories, you also safeguard your outputs since these are properly managed services that will curate your papers and data Sharing in general also makes research more efficient – it’s quicker and new discoveries are made faster
  11. There was a study done in Denmark on how open access saves time. They looked at the average time wasted looking for articles that were difficult to locate or gain access to (average = 60 mins per article), and extrapoloated this up to see what it was costing the sector annually. The estimate is €72 million in Denmark alone.
  12. Sharing data also cuts down on academic fraud. You’ve probably come across the case of Diederik Stapel – a former professor of social psychology at Tilberg University in the Netherlands. He had literally been making up data to support his hypotheses and various headlines had been picked up by the media e.g. meat eaters are more selfish than vegetarians. Some students questioned the results and the whole story unravelled, showing this had been going on for years. What’s troubling is that this had gone unnoticed. The data weren’t made available and checked, so such a history of fraud could develop.
  13. The validation of results is key. This article references an inadvertent error in a economics paper by Reinhadt and Rogoff. Missing some rows out of an average gave drastically different results – what was published suggested that countries with 90% debt ratios see their economies shrink by 0.1%. Instead, it should have found that they grow by 2.2% – less than those with lower debt ratios, but not a spiralling collapse. This mistake wasn’t picked up on initially as the data hadn’t been shared. The mistake fed into government policy - the findings of this paper were used as justification for austerity measures in the UK and various other countries in the EU.
  14. To look at things in a more positive light, this study shows how open access publishing accelerates the research process. It looks at citations rates for articles deposited in arXiv – a preprints repository in maths, physics, astronomy, computer science etc The time lag between deposit and citation of an article has been shrinking drastically year on year. Nowadays the research cycle in High-Energy Physics (HEP) is approaching maximum efficiency as a result of the early and free availability of articles that scientists in the field can use and build upon rapidly.
  15. Certain research communities have also seen the benefit of sharing data as it speeds up the process of discovery. This article shows how researchers in the field of Alzheimer’s research have agreed as a community to share data immediately to make scientific breakthroughs.
  16. There’s also a citation advantage for individual researchers. This study by Heather Piwowar and Todd Vision looked at 10,555 paper of gene expression studies that had shared the associated microarray data. Those studies that shared data received 9% more citations.
  17. There’s also an economic benefit, as seen by the case of the NASA landsat satellite images. These were sold until 2008 for $600 a scene. Now they’re freely available and used by Google Earth. Previously they sold 19,000 images a year, whereas now they transmit 2.1 million. The revenue has gone up incredibly too from $11.4 million to an estimated value of $935 million with direct benefit of more than $100 million. The release has also stimulated the development of applications from companies worldwide. This case study comes from the Royal Society Report on Science as an Open Enterprise.
  18. It’s not all positive though – otherwise why isn’t everyone already doing this. There is a certain amount of effort and cost to open science, which this blog post by Emilio Bruna highlights. He calculated the cost of sharing his data for one paper and came to a total of 35 hours and $690. He breaks this down into the cost of preparing the dataset, creating complementary metadata and associated files, cleaning up and documenting the code (which involves a big mental leap), and the charges applied.
  19. He suggests what needs to change and I think this is quite pertinent for today. We need a better system of incentives so you get more tangible returns from openness – better chances on grant applications, more likelihood of promotion… This is likely to evolve during your careers. If we teach people what to do now, it will become part of your processes and be easier in the long run. The learning curve won’t be as steep. We also need to focus not on why you should be open, but how – and that’s what today’s presentations should give you.
  20. I mentioned earlier about conducting science in the open. UsefulChem is an example of that. It’s a wiki platform where chemists can document their experiments. It’s effectively an open labnotebook. You can see here that the researcher has shared his aims, procedures, results, conclusion and a log of activities. There is also an option to comment.
  21. My Experiment is like a Virtual Research Environment. You can share scientific workflows that others can comment on and reuse. You can see in the sidebar that it’s clear how the workflow is licensed and who credit should be given to if reusing. There are also a number of social media features too e.g. ratings, favourites and stats for views and downloads.
  22. We also mentioned the importance of open access publishing, so I want to walk through the different options available. Essentially researchers may choose to publish in open access journals or traditional subscription-based journals. You can check what OA options are available by searching for your journal in the RoMEO directory. If you publish in an open access journal, an Article Processing Charge may be applied. Your article will then be made immediately available via the publisher. This is often terms ‘gold OA’ Alternatively you may choose to publish with a traditional publisher. Some subscription based journals are termed ‘hybrid journals’ as they also have a gold OA option. Essentially you pay to make your individual article freely available alongside others that remain available only to subscribers. Again, once you pay the APC, your article will be freely available to all via the publisher’s website. Another, cheaper route is to follow green OA. This essentially means that you take your copy of the paper (usually a pre-print, not the publisher’s final version) and deposit that in an OA repository. You can search for a repository via OpenDOAR, or check with your institution. There is usually a local institutional repository. Once you self-archive, your article will be freely available via the repository. This is often called delayed open access as publishers will normally impose an embargo in the region of 6-12 months so they can get money from providing access to the article first. A final point to note is that gold and green routes are not mutually exclusive. You can pay an APC and also deposit in a repository. In fact, there may be an obligation / encouragement on you to do this from your funder or uni.
  23. This is the Sherpah RoMEO service. You can search for your journal title and then see what is allowed. In this example, the author can archive a pre-print but not the publisher’s final version, and certain restrictions apply e.g. a 6 month embargo.
  24. Speak to your library as there is usually an institutional repository you can use. There may also be relevant repositories given your field e.g. Arxiv for maths, computing science, physics etc.
  25. OpenAIRE is also worth checking out. This is an EC-funded project to provide infrastructure for open access. They’ve recently released a short video that tells you how they can help. Essentially OpenAIRE aggregates metadata from different repositories to compile a complete list of publications and related outputs. They mine and enrich the content, de-duplicating entries and linking together publications with data, details about the project, authors, funders etc. OpenAIRE also provides a number of useful services & APIs, for example you can embed a publication list for your project in your website that is automatically updated whenever someone adds a new paper to a repository (this is harvested into OpenAIRE and pushed out to your list).
  26. The final aspect to discuss is open data. The Open Knowledge Foundation suggests four simple steps. First off you need to decide what to share. Not all data can be made openly available due to commercial restrictions or sensitivities. Once you’ve decided what to share, determine what IP exists and apply a suitable licence. You should then make the data available in a suitable format so others can bulk download it. Remember that for it to be useful you want to share appropriate metadata and documentation too. Using repositories is useful to make sure your data are properly managed and preserved for the long-term. The final step is to make your data discoverable. Put it online, tell others about it and add details to various registries so it gets found.
  27. Guidance from the DCC can also help researchers to understand data licensing. This guide outlines the pros and cons of each approach e.g. the limitations of some CC options The OA guidelines under Horizon 2020 point to CC-0 or CC-BY as a straightforward and effective way to make it possible for others to mine, exploit and reproduce the data. See p11 at: http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-pilot-guide_en.pdf
  28. You should select a suitable repository. The Databib and Re3data lists can be useful for this. They allow you to search and browse by subject. Re3data also allows you to restrict the search by certificates, open access repositories and persistent identifiers.
  29. To make sure their data can be understood by themselves, their community and others, researchers should create metadata and documentation. Metadata is basic descriptive information to help identify and understand the structure of the data e.g. title, author... Documentation provides the wider context. It’s useful to share the methodology / workflow, software and any information needed to understand the data e.g. explanation of abbreviations or acronyms There are lots of standards that can be used. The DCC started a catalogue of disciplinary metadata standards which is now being taken forward as an international initiative via an RDA working group