SlideShare a Scribd company logo
1 of 17
DATA SHARING AND DATA MANAGEMENT –
     WHAT ARE THEY ALL ABOUT?




  A joint presentation from the University of Queensland Library‘s
     Scholarly Publishing and Digitisation Service and the
                  Research Information Service.


  Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
Increasingly, data sharing is expected

by journal publishers

by funding bodies

by governments

by other researchers

by the public



                                    But why is it important?

          Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
BECAUSE, INCREASINGLY, YOU NEED THE
DATA TO UNDERSTAND THE RESEARCH


 Geoffrey Boulton* argues that since no journal can spare the space
 to publish the avalanche of data points that large-scale scientific
 experiments produce, the published paper has become more
 of an "advertisement" and the "science                                                 sits in the
 underlying data".

 * Quoted in An open and shut case? Debating the purposes of open science, a
  Royal Society PolicyLab meeting (mp3 file).
 http://downloads.royalsociety.org/audio/Policy/policylab/2011-9-01SPSOpenScience.mp3



 Geoffrey Boulton is a Fellow of the Royal Society and is currently leading the
 Society's project, Science as a Public Enterprise.




                    Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
WHY SHOULD RESEARCHERS SHARE DATA?


 ―The volume of scientific data, and the inter-
 connectedness of the systems under study, makes
 integration of data a necessity.
 ―… life scientists must integrate data from across
 biology and chemistry to comprehend disease and
 discover cures, and climate change scientists
 must integratedata from wildly
 diverse disciplines to understand our
 current state and predict the impact of new
 policies.‖

  Science Commons, Protocol for Implementing Open Access Data,
  http://sciencecommons.org/projects/publishing/open-access-data-protocol/




                  Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
WHY DON’T RESEARCHERS SHARE DATA?


―We find ourselves in a slightly
perverse situation where                                                     Scientists have tended to regard
scientists are very strongly                                                 their data as personal property.
incentivised to create peer-                                                   After all, it is they who worked
reviewed publications, but not                                                    hard to generate it—and
to share information in other                                                   ownership has never been
ways.‖                                                                              seriously challenged.

Timo Hannay                                                                  Geoffrey Boulton et al (2011) ‗Science
Nature Publishing Group, quoted                                              as a public enterprise: the case for open
in www.growingknowledge.bl.uk                                                 data‘, The Lancet, 377 (9778) : 1633-
                                                                                               1635.
                                                                              doi:10.1016/S0140-6736(11)60647-8




     ―Much more is understood about why
     researchers do not share data than
     about when, why, and how
     researchers do share data …‖

     Christine Borgman 2011 ‗The conundrum
     of sharing research data‘, (unpublished paper)
     http://works.bepress.com/borgman/244/




                          Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
RESEARCHERS WORRY ABOUT …

   No incentives
   • data sharing is not yet valued in the promotion or tenure process

   No time

   • for data clean up, managing requests, handling enquiries

   Loss of control

   • risks of data theft, misrepresentation, lack of attribution

   Legal concerns

   • copyright, IP, ownership, commercialisation, contracts

   Ethical concerns

   • confidentiality agreements, fear of accidental disclosure

   Lack of mechanisms

   • inadequate infrastructure, lack of advice



               Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
DO RESEARCHERS HAVE TO SHARE DATA?

Increasingly, data sharing is expected ... How, when, and what you
share will depend on:


    Formats – digital data is probably easier to share


    Restrictions, such as confidentiality, commercialisation


    Funder and publisher agreements


    Customary embargo periods


    Availability of repositories or other means for sharing




                 Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
BENEFITS OF DATA SHARING – TO YOU

•   Makes your research better known, which may
    attract grants and collaborators, and citizens who
    want to help answer your questions (crowd-sourcing)
•   Demonstrates the continuing use of your data and
    the relevance of your research

DATA SHARING THROUGH A REPOSITORY
•   Lets you focus on research instead of having to
    manage the data itself, or manage requests for data
•   Safeguards your investment of time and resources
•   Preserves your data – for your own benefit as well
    as others




               Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
BENEFITS OF DATA SHARING - TO RESEARCH

                                      • data sharing makes new                          kinds of
new areas of research
                                        research possible

   data collection                    • greater volumes can be managed


  data ―mash ups‖                     • different data can be combined


research collaboration                • work across continents and
                                        disciplines

    data analysis                     • greater scale is achievable

                                      • crowd-sourcing can generate, crunch and
  ―citizen‖ science                      fund your data


     saves time                       • improves research efficiency




                  Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
REASONS TO OPEN DATA UP

―First, technology has made computer code and large datasets more
    important to science and has opened up the prospect of sharing
    code and data at the click of a mouse.

―Second, there is public interest in making data available to other
   scientists to validate findings or re-use the data in new ways to
   advance knowledge.

―Third, much modern science is created using public funds, which
    should oblige scientists to maximise the utility of their findings for the
    public good.
―And last …there are many competent members of the public who wish
   to test for themselves some of the pronouncements of scientists by
   analysing the data on which such pronouncements are based.‖


Geoffrey Boulton et al (2011) Science as a public enterprise: the case for open
data, The Lancet, 377 (9778) : 1633-1635. doi:10.1016/S0140-6736(11)60647-8




                   Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
DATA SHARING CAN BE ACHIEVED BY :


    Publishing                    Datasets as                                           Linking to
    findings in                  supplements to                                       datasets from
 journals and at                    journal                                              journal
   conferences                    publications                                         publications


                                                                                     Sharing data
                              Depositing data in
Assigning DOIs to                                                                   informally with
                              a public repository
    datasets                                                                       colleagues or on
                                  or archive
                                                                                        request


                                                        Offering data with
           Posting datasets
                                                        different levels of
            on public Web
                                                        access, e.g. de-
                 sites
                                                         identified data




             Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
―Science is driven by data … society now relies on
      scientific data of diverse kinds; for example, in responding
      to disease outbreaks, managing resources, responding to
      climate change, and improving transportation.

      ―It is obvious that making data widely                    available              is an
      essential element of scientific research.‖




Science editorial Making Data Maximally Available, 11 February 2011, 331 (6018): 649.
DOI: 10.1126/science.1203354




                 Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
…   WHICH MEANS DATA HAS TO BE MANAGED

According to the Australian Code for the Responsible Conduct of
Research :
2.6 ―Researchers must manage research data and primary materials                                                                  …‖



Compliance with the Code is now a prerequisite for acceptance of
  National Health and Medical Research Council and Australian
  research Council funding.
Research funding agencies, such as the US National Science
   Foundation, now expect data management plans to be lodged as
   part of funding proposals.
This may soon happen in Australia.




                Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
WHO WANTS YOUR DATA?

Just about everyone!
Submitting a proposal to                                       Publishing in a Nature
  the ARC ?                                                      journal?
You must describe how you will                                 ―… authors are required to make
   share your research data (or                                   materials, data and associated
   explain why you cannot share                                   protocols promptly available to
   data).                                                         readers.‖


The Code for the Responsible Conduct of Research states:

2. The potential value of the material for further research should also
be considered, particularly where the research would be difficult or
impossible to repeat.

2.5.2 Research data should be made available for use by other
researchers unless this is prevented by ethical, privacy or
confidentiality matters.


                Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
So … having a data management
  plan is important.
A data management plan outlines how you will collect, organise,
   manage, store, secure, back up, preserve and share your data.
It should


     describe the data so others can understand its scope


     identify the person responsible for data management


     list any tools or software needed to create, process or visualise the
     data

     document compliance with relevant policies, legislation, codes of
     conduct and ethical guidelines




                     Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
ARE THERE UQ TEMPLATES OR
CHECKLISTS YOU CAN USE?

Checklists, templates and other tools for creating plans are
   currently being developed.
Final documentation will be available once UQ‘s research
    data management policy is approved.


In the meantime, check with your Research Information
     Service librarian who can help you

•   draft a basic plan

•   advise you about the           training on offer
•   refer you to expert          advice, including our factsheets




                 Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
STILL HAVE QUESTIONS?
                                                                                   Please contact
                                                                                   us – we want to
                                                                                   help !




•   Research Information Service
       http://www.library.uq.edu.au/ris/index.html
•   Scholarly Publishing and Digitisation Service
       http://www.library.uq.edu.au/about/spads.html




                    Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library

More Related Content

What's hot

Managing and Sharing Research Data: Good practices for an ideal world...in th...
Managing and Sharing Research Data: Good practices for an ideal world...in th...Managing and Sharing Research Data: Good practices for an ideal world...in th...
Managing and Sharing Research Data: Good practices for an ideal world...in th...Martin Donnelly
 
Scott Edmunds: Data Dissemination in the era of "Big-Data"
Scott Edmunds: Data Dissemination in the era of "Big-Data"Scott Edmunds: Data Dissemination in the era of "Big-Data"
Scott Edmunds: Data Dissemination in the era of "Big-Data"GigaScience, BGI Hong Kong
 
HathiTrust Research Center Secure Commons
HathiTrust Research Center Secure CommonsHathiTrust Research Center Secure Commons
HathiTrust Research Center Secure CommonsBeth Plale
 
RDFC2012 Open Access to Research Data
RDFC2012 Open Access to Research DataRDFC2012 Open Access to Research Data
RDFC2012 Open Access to Research DataGudmundur Thorisson
 
Managing and Sharing Research Data
Managing and Sharing Research DataManaging and Sharing Research Data
Managing and Sharing Research DataMartin Donnelly
 
Supporting UC Research Data Management
Supporting UC Research Data ManagementSupporting UC Research Data Management
Supporting UC Research Data Managementslabrams
 
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...University of California Curation Center
 
Data, librarians, and services
Data, librarians, and servicesData, librarians, and services
Data, librarians, and servicesAndrew Treloar
 
Open Data and the Panton Principles in the Humanities
Open Data and the Panton Principles in the HumanitiesOpen Data and the Panton Principles in the Humanities
Open Data and the Panton Principles in the HumanitiesOpen Knowledge Maps
 
Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data Handling
Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data HandlingScott Edmunds: GigaScience - Big-Data, Data Citation and Future Data Handling
Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data HandlingGigaScience, BGI Hong Kong
 
Big Data in the Arts and Humanities
Big Data in the Arts and HumanitiesBig Data in the Arts and Humanities
Big Data in the Arts and HumanitiesAndrew Prescott
 
Research Data Management for the Humanities and Social Sciences
Research Data Management for the Humanities and Social SciencesResearch Data Management for the Humanities and Social Sciences
Research Data Management for the Humanities and Social SciencesMartin Donnelly
 
Paradise Lost and The Right to Read is the Right to Mine
Paradise Lost and The Right to Read is the Right to MineParadise Lost and The Right to Read is the Right to Mine
Paradise Lost and The Right to Read is the Right to Minepetermurrayrust
 
2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)Dag Endresen
 
Research Data Management: a gentle introduction
Research Data Management: a gentle introductionResearch Data Management: a gentle introduction
Research Data Management: a gentle introductionMartin Donnelly
 
RDAP13 Lorrie Johnson: Facilitating Access to Scientific Data
RDAP13 Lorrie Johnson: Facilitating Access to Scientific DataRDAP13 Lorrie Johnson: Facilitating Access to Scientific Data
RDAP13 Lorrie Johnson: Facilitating Access to Scientific DataASIS&T
 
Needs for Data Management & Citation Throughout the Information Lifecycle
Needs for Data Management & Citation Throughout  the Information LifecycleNeeds for Data Management & Citation Throughout  the Information Lifecycle
Needs for Data Management & Citation Throughout the Information LifecycleMicah Altman
 

What's hot (19)

Managing and Sharing Research Data: Good practices for an ideal world...in th...
Managing and Sharing Research Data: Good practices for an ideal world...in th...Managing and Sharing Research Data: Good practices for an ideal world...in th...
Managing and Sharing Research Data: Good practices for an ideal world...in th...
 
Scott Edmunds: Data Dissemination in the era of "Big-Data"
Scott Edmunds: Data Dissemination in the era of "Big-Data"Scott Edmunds: Data Dissemination in the era of "Big-Data"
Scott Edmunds: Data Dissemination in the era of "Big-Data"
 
HathiTrust Research Center Secure Commons
HathiTrust Research Center Secure CommonsHathiTrust Research Center Secure Commons
HathiTrust Research Center Secure Commons
 
RDFC2012 Open Access to Research Data
RDFC2012 Open Access to Research DataRDFC2012 Open Access to Research Data
RDFC2012 Open Access to Research Data
 
Managing and Sharing Research Data
Managing and Sharing Research DataManaging and Sharing Research Data
Managing and Sharing Research Data
 
Supporting UC Research Data Management
Supporting UC Research Data ManagementSupporting UC Research Data Management
Supporting UC Research Data Management
 
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
Libraries and Research Data Curation: Barriers and Incentives for Preservatio...
 
Data, librarians, and services
Data, librarians, and servicesData, librarians, and services
Data, librarians, and services
 
Open Data and the Panton Principles in the Humanities
Open Data and the Panton Principles in the HumanitiesOpen Data and the Panton Principles in the Humanities
Open Data and the Panton Principles in the Humanities
 
Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data Handling
Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data HandlingScott Edmunds: GigaScience - Big-Data, Data Citation and Future Data Handling
Scott Edmunds: GigaScience - Big-Data, Data Citation and Future Data Handling
 
Cifar
CifarCifar
Cifar
 
April 23 NISO Virtual Conference: Dealing with the Data Deluge: Successful Te...
April 23 NISO Virtual Conference: Dealing with the Data Deluge: Successful Te...April 23 NISO Virtual Conference: Dealing with the Data Deluge: Successful Te...
April 23 NISO Virtual Conference: Dealing with the Data Deluge: Successful Te...
 
Big Data in the Arts and Humanities
Big Data in the Arts and HumanitiesBig Data in the Arts and Humanities
Big Data in the Arts and Humanities
 
Research Data Management for the Humanities and Social Sciences
Research Data Management for the Humanities and Social SciencesResearch Data Management for the Humanities and Social Sciences
Research Data Management for the Humanities and Social Sciences
 
Paradise Lost and The Right to Read is the Right to Mine
Paradise Lost and The Right to Read is the Right to MineParadise Lost and The Right to Read is the Right to Mine
Paradise Lost and The Right to Read is the Right to Mine
 
2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)2021-01-27--biodiversity-informatics-gbif-(52slides)
2021-01-27--biodiversity-informatics-gbif-(52slides)
 
Research Data Management: a gentle introduction
Research Data Management: a gentle introductionResearch Data Management: a gentle introduction
Research Data Management: a gentle introduction
 
RDAP13 Lorrie Johnson: Facilitating Access to Scientific Data
RDAP13 Lorrie Johnson: Facilitating Access to Scientific DataRDAP13 Lorrie Johnson: Facilitating Access to Scientific Data
RDAP13 Lorrie Johnson: Facilitating Access to Scientific Data
 
Needs for Data Management & Citation Throughout the Information Lifecycle
Needs for Data Management & Citation Throughout  the Information LifecycleNeeds for Data Management & Citation Throughout  the Information Lifecycle
Needs for Data Management & Citation Throughout the Information Lifecycle
 

Similar to Data sharing and data management – what are they all about?

British Library Datasets Programme 2010
British Library Datasets Programme 2010British Library Datasets Programme 2010
British Library Datasets Programme 2010ALISS
 
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...Sky Bristol
 
Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...African Open Science Platform
 
Open science curriculum for students, June 2019
Open science curriculum for students, June 2019Open science curriculum for students, June 2019
Open science curriculum for students, June 2019Dag Endresen
 
BioMed Central's open data initiatives
BioMed Central's open data initiativesBioMed Central's open data initiatives
BioMed Central's open data initiativesiainh_z
 
Open Data and Institutional Repositories
Open Data and Institutional RepositoriesOpen Data and Institutional Repositories
Open Data and Institutional RepositoriesRobin Rice
 
Forschungsdaten-Repositorien: Informationsinfrastrukturen für nachnutzbare F...
Forschungsdaten-Repositorien: Informationsinfrastrukturen für nachnutzbare F...Forschungsdaten-Repositorien: Informationsinfrastrukturen für nachnutzbare F...
Forschungsdaten-Repositorien: Informationsinfrastrukturen für nachnutzbare F...Heinz Pampel
 
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...Jonathan Tedds
 
Research data lifecycle diagram
Research data lifecycle diagramResearch data lifecycle diagram
Research data lifecycle diagramSteven Cracknell
 
Curating the Scholarly Record: Data Management and Research Libraries
Curating the Scholarly Record: Data Management and Research LibrariesCurating the Scholarly Record: Data Management and Research Libraries
Curating the Scholarly Record: Data Management and Research LibrariesKeith Webster
 
Winning Horizon 2020 with Open Science
Winning Horizon 2020 with Open ScienceWinning Horizon 2020 with Open Science
Winning Horizon 2020 with Open ScienceMartin Donnelly
 
Instituting an Institutional Repository for Sharing, Archiving, and Accessing...
Instituting an Institutional Repository for Sharing, Archiving, and Accessing...Instituting an Institutional Repository for Sharing, Archiving, and Accessing...
Instituting an Institutional Repository for Sharing, Archiving, and Accessing...Lynne Frederickson
 
The Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE WebinarThe Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE WebinarMartin Donnelly
 
The Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data PilotThe Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data PilotMartin Donnelly
 
Rewarding data publication: ipt.biodiversity.aq
Rewarding data publication: ipt.biodiversity.aqRewarding data publication: ipt.biodiversity.aq
Rewarding data publication: ipt.biodiversity.aqAnton Van de Putte
 

Similar to Data sharing and data management – what are they all about? (20)

British Library Datasets Programme 2010
British Library Datasets Programme 2010British Library Datasets Programme 2010
British Library Datasets Programme 2010
 
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...
Big Data R&D Strategy - Ensure the long term sustainability, access, and deve...
 
Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...
 
Open science curriculum for students, June 2019
Open science curriculum for students, June 2019Open science curriculum for students, June 2019
Open science curriculum for students, June 2019
 
BioMed Central's open data initiatives
BioMed Central's open data initiativesBioMed Central's open data initiatives
BioMed Central's open data initiatives
 
Open Data and Institutional Repositories
Open Data and Institutional RepositoriesOpen Data and Institutional Repositories
Open Data and Institutional Repositories
 
Forschungsdaten-Repositorien: Informationsinfrastrukturen für nachnutzbare F...
Forschungsdaten-Repositorien: Informationsinfrastrukturen für nachnutzbare F...Forschungsdaten-Repositorien: Informationsinfrastrukturen für nachnutzbare F...
Forschungsdaten-Repositorien: Informationsinfrastrukturen für nachnutzbare F...
 
20070919 Bkt Padua Esf Dfg Workshop Intro
20070919 Bkt Padua Esf Dfg Workshop Intro20070919 Bkt Padua Esf Dfg Workshop Intro
20070919 Bkt Padua Esf Dfg Workshop Intro
 
Simon hodson
Simon hodsonSimon hodson
Simon hodson
 
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
Jonathan Tedds Distinguished Lecture at DLab, UC Berkeley, 12 Sep 2013: "The ...
 
Engaging the Researcher in RDM
Engaging the Researcher in RDMEngaging the Researcher in RDM
Engaging the Researcher in RDM
 
Research data lifecycle diagram
Research data lifecycle diagramResearch data lifecycle diagram
Research data lifecycle diagram
 
Nicole Nogoy at the Auckland BMC RoadShow
Nicole Nogoy at the Auckland BMC RoadShowNicole Nogoy at the Auckland BMC RoadShow
Nicole Nogoy at the Auckland BMC RoadShow
 
Curating the Scholarly Record: Data Management and Research Libraries
Curating the Scholarly Record: Data Management and Research LibrariesCurating the Scholarly Record: Data Management and Research Libraries
Curating the Scholarly Record: Data Management and Research Libraries
 
Winning Horizon 2020 with Open Science
Winning Horizon 2020 with Open ScienceWinning Horizon 2020 with Open Science
Winning Horizon 2020 with Open Science
 
Instituting an Institutional Repository for Sharing, Archiving, and Accessing...
Instituting an Institutional Repository for Sharing, Archiving, and Accessing...Instituting an Institutional Repository for Sharing, Archiving, and Accessing...
Instituting an Institutional Repository for Sharing, Archiving, and Accessing...
 
Implementing and Institutional Repository for Sharing, Archiving, and Accessi...
Implementing and Institutional Repository for Sharing, Archiving, and Accessi...Implementing and Institutional Repository for Sharing, Archiving, and Accessi...
Implementing and Institutional Repository for Sharing, Archiving, and Accessi...
 
The Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE WebinarThe Horizon2020 Open Data Pilot - OpenAIRE Webinar
The Horizon2020 Open Data Pilot - OpenAIRE Webinar
 
The Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data PilotThe Horizon 2020 Open Data Pilot
The Horizon 2020 Open Data Pilot
 
Rewarding data publication: ipt.biodiversity.aq
Rewarding data publication: ipt.biodiversity.aqRewarding data publication: ipt.biodiversity.aq
Rewarding data publication: ipt.biodiversity.aq
 

Recently uploaded

DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 

Recently uploaded (20)

DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 

Data sharing and data management – what are they all about?

  • 1. DATA SHARING AND DATA MANAGEMENT – WHAT ARE THEY ALL ABOUT? A joint presentation from the University of Queensland Library‘s Scholarly Publishing and Digitisation Service and the Research Information Service. Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
  • 2. Increasingly, data sharing is expected by journal publishers by funding bodies by governments by other researchers by the public But why is it important? Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
  • 3. BECAUSE, INCREASINGLY, YOU NEED THE DATA TO UNDERSTAND THE RESEARCH Geoffrey Boulton* argues that since no journal can spare the space to publish the avalanche of data points that large-scale scientific experiments produce, the published paper has become more of an "advertisement" and the "science sits in the underlying data". * Quoted in An open and shut case? Debating the purposes of open science, a Royal Society PolicyLab meeting (mp3 file). http://downloads.royalsociety.org/audio/Policy/policylab/2011-9-01SPSOpenScience.mp3 Geoffrey Boulton is a Fellow of the Royal Society and is currently leading the Society's project, Science as a Public Enterprise. Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
  • 4. WHY SHOULD RESEARCHERS SHARE DATA? ―The volume of scientific data, and the inter- connectedness of the systems under study, makes integration of data a necessity. ―… life scientists must integrate data from across biology and chemistry to comprehend disease and discover cures, and climate change scientists must integratedata from wildly diverse disciplines to understand our current state and predict the impact of new policies.‖ Science Commons, Protocol for Implementing Open Access Data, http://sciencecommons.org/projects/publishing/open-access-data-protocol/ Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
  • 5. WHY DON’T RESEARCHERS SHARE DATA? ―We find ourselves in a slightly perverse situation where Scientists have tended to regard scientists are very strongly their data as personal property. incentivised to create peer- After all, it is they who worked reviewed publications, but not hard to generate it—and to share information in other ownership has never been ways.‖ seriously challenged. Timo Hannay Geoffrey Boulton et al (2011) ‗Science Nature Publishing Group, quoted as a public enterprise: the case for open in www.growingknowledge.bl.uk data‘, The Lancet, 377 (9778) : 1633- 1635. doi:10.1016/S0140-6736(11)60647-8 ―Much more is understood about why researchers do not share data than about when, why, and how researchers do share data …‖ Christine Borgman 2011 ‗The conundrum of sharing research data‘, (unpublished paper) http://works.bepress.com/borgman/244/ Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
  • 6. RESEARCHERS WORRY ABOUT … No incentives • data sharing is not yet valued in the promotion or tenure process No time • for data clean up, managing requests, handling enquiries Loss of control • risks of data theft, misrepresentation, lack of attribution Legal concerns • copyright, IP, ownership, commercialisation, contracts Ethical concerns • confidentiality agreements, fear of accidental disclosure Lack of mechanisms • inadequate infrastructure, lack of advice Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
  • 7. DO RESEARCHERS HAVE TO SHARE DATA? Increasingly, data sharing is expected ... How, when, and what you share will depend on: Formats – digital data is probably easier to share Restrictions, such as confidentiality, commercialisation Funder and publisher agreements Customary embargo periods Availability of repositories or other means for sharing Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
  • 8. BENEFITS OF DATA SHARING – TO YOU • Makes your research better known, which may attract grants and collaborators, and citizens who want to help answer your questions (crowd-sourcing) • Demonstrates the continuing use of your data and the relevance of your research DATA SHARING THROUGH A REPOSITORY • Lets you focus on research instead of having to manage the data itself, or manage requests for data • Safeguards your investment of time and resources • Preserves your data – for your own benefit as well as others Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
  • 9. BENEFITS OF DATA SHARING - TO RESEARCH • data sharing makes new kinds of new areas of research research possible data collection • greater volumes can be managed data ―mash ups‖ • different data can be combined research collaboration • work across continents and disciplines data analysis • greater scale is achievable • crowd-sourcing can generate, crunch and ―citizen‖ science fund your data saves time • improves research efficiency Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
  • 10. REASONS TO OPEN DATA UP ―First, technology has made computer code and large datasets more important to science and has opened up the prospect of sharing code and data at the click of a mouse. ―Second, there is public interest in making data available to other scientists to validate findings or re-use the data in new ways to advance knowledge. ―Third, much modern science is created using public funds, which should oblige scientists to maximise the utility of their findings for the public good. ―And last …there are many competent members of the public who wish to test for themselves some of the pronouncements of scientists by analysing the data on which such pronouncements are based.‖ Geoffrey Boulton et al (2011) Science as a public enterprise: the case for open data, The Lancet, 377 (9778) : 1633-1635. doi:10.1016/S0140-6736(11)60647-8 Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
  • 11. DATA SHARING CAN BE ACHIEVED BY : Publishing Datasets as Linking to findings in supplements to datasets from journals and at journal journal conferences publications publications Sharing data Depositing data in Assigning DOIs to informally with a public repository datasets colleagues or on or archive request Offering data with Posting datasets different levels of on public Web access, e.g. de- sites identified data Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
  • 12. ―Science is driven by data … society now relies on scientific data of diverse kinds; for example, in responding to disease outbreaks, managing resources, responding to climate change, and improving transportation. ―It is obvious that making data widely available is an essential element of scientific research.‖ Science editorial Making Data Maximally Available, 11 February 2011, 331 (6018): 649. DOI: 10.1126/science.1203354 Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
  • 13. WHICH MEANS DATA HAS TO BE MANAGED According to the Australian Code for the Responsible Conduct of Research : 2.6 ―Researchers must manage research data and primary materials …‖ Compliance with the Code is now a prerequisite for acceptance of National Health and Medical Research Council and Australian research Council funding. Research funding agencies, such as the US National Science Foundation, now expect data management plans to be lodged as part of funding proposals. This may soon happen in Australia. Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
  • 14. WHO WANTS YOUR DATA? Just about everyone! Submitting a proposal to Publishing in a Nature the ARC ? journal? You must describe how you will ―… authors are required to make share your research data (or materials, data and associated explain why you cannot share protocols promptly available to data). readers.‖ The Code for the Responsible Conduct of Research states: 2. The potential value of the material for further research should also be considered, particularly where the research would be difficult or impossible to repeat. 2.5.2 Research data should be made available for use by other researchers unless this is prevented by ethical, privacy or confidentiality matters. Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
  • 15. So … having a data management plan is important. A data management plan outlines how you will collect, organise, manage, store, secure, back up, preserve and share your data. It should describe the data so others can understand its scope identify the person responsible for data management list any tools or software needed to create, process or visualise the data document compliance with relevant policies, legislation, codes of conduct and ethical guidelines Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
  • 16. ARE THERE UQ TEMPLATES OR CHECKLISTS YOU CAN USE? Checklists, templates and other tools for creating plans are currently being developed. Final documentation will be available once UQ‘s research data management policy is approved. In the meantime, check with your Research Information Service librarian who can help you • draft a basic plan • advise you about the training on offer • refer you to expert advice, including our factsheets Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library
  • 17. STILL HAVE QUESTIONS? Please contact us – we want to help ! • Research Information Service http://www.library.uq.edu.au/ris/index.html • Scholarly Publishing and Digitisation Service http://www.library.uq.edu.au/about/spads.html Reproduced or adapted from original copyright content provided under Creative Commons licence by The University of Queensland Library