British Library Datasets Programme. JISC RSP Winter School, February 2011. Max Wilkinson
Today’s Talk
The British Library
The British Library: some facts and figures
Helping people advance knowledge to enrich lives.
British Library Act 1972: National centre for reference, study, bibliographical and other information services, in relation both to scientific and technological matters, and to the humanities.
Science and Innovation Investment Framework 2004-2014, H.M. Treasury (2004): UK research base must have ready and efficient access to information of all kinds, such as experimental data sets, journals, theses, conference proceedings and patents. This is the life blood of research and innovation.
The largest document supply service in the world. Secure e-delivery and ‘just in time’ digitisation enable desktop delivery within 2 hours.
National library of the UK. Serves researchers, business, libraries, education & the general public.
Collection includes over 2m sound recordings, 5m reports, theses and conference papers, the world’s largest patents collection (c.50m).
3 main sites in London and Yorkshire. Circa 2,000 staff.
Business and IP Centre: Providing inspiration, and enabling protection of creative capital and business development.
Collection fills over 600km of shelving and grows at 11km per year.
70 Tb of digital material through voluntary deposit.
Who do we serve?
Modern science relies on good data
Scholarly record Discovery Access Record Permanence Citation Metadata Exposure Trust Fabrics Copyright Scholarly record
The Foundation for Research
Current Situation
As a result…
Difficult to Discover. Good luck finding the data! “Source: Committee on Climate Change”
Data are diverse in the Digital Landscape
Re-join the gap… Articles; Underlying data
Datasets – first class citizens? Source: UKRDS Study: The Data Imperative. Managing the UK’s research data for future use (Feb 2009)
Scholarly record Discovery Access Record Permanence Citation Metadata Exposure Trust Fabrics Copyright Scholarly record
Research training based on scholarly communication  Discovery Access Record Permanence Citation Metadata Exposure Trust Fabrics Copyright Scholarly record Rarely includes data
Scholarly communication requires intellectual exchanges Discovery Access Record Permanence Citation Metadata Exposure Trust Fabrics Copyright Scholarly record No such data fabric
Scholarly discourse requires a record and provenance Discovery Access Record Permanence Citation Metadata Exposure Trust Fabrics Copyright Scholarly record Almost non-existent for data
The Datasets Programme
Two key concepts
Projects and activities: www.bl.uk/datasets. Follow us on Twitter @datasetsBL
A Key Component for Many Goals Persistent Identification Make Visible Find Access Track Impact Verify Reuse Cite ?
Citation using Digital Object Identifiers (DOIs). How to reference: Published Article (Abstract or full text). The DOI system offers an easy, internet actionable way to connect the article with the underlying publication. But a complete scholarly record would also link to the evidential datasets and their location, e.g. PANGAEA. doi:10.1038/nature05431
doi:10.1038/nature05431  leads to a landing page
Connecting an Article with the Underlying Data
doi:10.1594/PANGAEA.587840
Dataset citation using Digital Object Identifiers (DOIs): Data Citation
Projects – DataCite
DataCite. DataCite : Data Centres :: CrossRef : Publishers
Digital Object Identifier (DOI): Prefix, Suffix
DOI prefix
DOI suffix
Resolving a DOI
DOIs resolve to an open landing page
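As a rough illustration of the prefix/suffix structure and of the resolution step described on the slides above, the following Python sketch splits a DOI at its first slash and builds a resolver link. The function names are hypothetical, and the use of the public https://doi.org/ proxy is an assumption (the slides do not name a resolver); the two DOIs are the ones quoted in this deck.

```python
from typing import Tuple
from urllib.parse import quote


def split_doi(doi: str) -> Tuple[str, str]:
    """Split a DOI into (prefix, suffix) at the first slash.

    Hypothetical helper for illustration only. The prefix identifies
    the registrant and starts with "10."; the suffix is chosen by the
    registrant to identify the individual object.
    """
    doi = doi.strip()
    # Accept the "doi:" label used on the slides.
    if doi.lower().startswith("doi:"):
        doi = doi[4:]
    prefix, sep, suffix = doi.partition("/")
    if not sep or not prefix.startswith("10.") or not suffix:
        raise ValueError("not a DOI of the form 10.xxxx/suffix: %r" % doi)
    return prefix, suffix


def resolver_url(doi: str) -> str:
    """Build a link to the landing page via the DOI proxy.

    Assumption: the public proxy at https://doi.org/ is used.
    """
    prefix, suffix = split_doi(doi)
    # Percent-encode unusual characters in the suffix, keeping slashes.
    return "https://doi.org/%s/%s" % (prefix, quote(suffix, safe="/"))


if __name__ == "__main__":
    for example in ("doi:10.1038/nature05431", "doi:10.1594/PANGAEA.587840"):
        print(split_doi(example), resolver_url(example))
```

Following the returned link in a browser should lead to the landing page for the article or dataset, rather than to the data file itself.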
DataCite Service
Projects and activities: www.bl.uk/datasets
SageCite: Data citation in bioinformatics workflow. SageCite: Integration of data citation services into multi-contributor bio-informatics workflow. Establishing data attribution and credit mechanisms. ► INCENTIVE. Sage Bionetworks: Aggregating datasets from contributors to create massive coherent datasets that can be used for systems level analysis of disease.
Dryad UK: Repository sustainability. Dryad UK: Define a business case and pilot service integrating DataCite DOIs and dataset archiving into publisher workflows. ► SUSTAINABILITY. Leveraging the Dryad Consortium, which is addressing the acquisition and storage of long tail supplementary data.
For more information on the BL Datasets Programme

British Library Datasets Programme Feb 2011

Editor's notes

  1. Name of person presenting the data?
  2. Intro to British Library. Facts.
  3. [Slide to remind people we are focusing on research data] When you read a research article, you are reading someone’s interpretation of some underlying evidence. And that’s a subjective interpretation. When we talk about data, we are really talking about the solid evidence that underpins these research articles.
  4. Data is the foundation for research. It is an essential component of the scientific record. It is time-consuming and costly to produce, and re-acquisition may be impossible. It is therefore essential that it is preserved and shared.
  5. Despite the importance of this data, there is… As a result, datasets are … Many researchers are willing to share their data, but … This is like grey literature in the 80s and early 90s, before the web. If you wanted a conference paper, you either had to show up in person or know someone who was there. This split research institutes and universities into haves and have-nots. You had to be on the ‘secret paper passing network’ to learn about the hottest latest research. This means that almost 80% of researchers put their datasets where? On their laptops, desktops, desk drawers, departmental servers. This is not a serious way to run a serious business! The management practices vary tremendously: some will have good practices, but many will not, placing the data at risk. 88% said they would make data available and 43% expressed the need to access others’ data. Researchers who produce the essential data that drive new science are often unrewarded, and data centres have considerable challenges justifying their budget and existence. And how is the state of resource discovery for datasets? UKRDS Study: (1) Data is difficult to retain and manage once project funding ceases (compare grey and published articles); (2) Only 12% do not make their data available, but informal networks are predominant; (3) 43% expressed the need to access others’ data.
  6. As a result, datasets are: difficult to discover; difficult to access; in danger of being lost. This is widely recognised…
  7. [The Economist - citation appears in print and web versions, so not to save space] Good luck finding the data! Cannot: - Validate the author’s claims - Investigate the data for other interesting facts…
  8. I am here to tell you about the Datasets Programme, which has come about because of rapid changes in the digital landscape. People are generating and sharing ever increasing volumes of data. We refer to collections of data as datasets. While the nature of datasets varies across disciplines, researchers within each discipline typically agree on what constitutes a dataset for them. Examples of datasets include: (1) volcanic data; (2) a sound archive; (3) a cluster of chromosomes inside a breast cancer cell; (4) a UK poll of voting intention (blue Cons, red Labour, yellow Liberal). Within the Datasets Programme, we consider a dataset to be an organised collection of digital objects that is produced or consumed during research. We emphasise the role that the dataset plays in the research activity, its importance to researchers, its impact, and its potential for reuse. Despite the differing nature of datasets, many of the services required by researchers are shared, such as methods of citation, discovery, and preservation.
  9. The current situation for data isn’t good. Articles are well catered for by libraries and publishers. The underlying data is being neglected. Unsatisfactory.
  10. We can even ask the question: are datasets first class citizens in the record of science? Contrast this situation with the one that we have for research articles. Libraries ensure long-term storage and management of articles. There are well established services for giving access to articles. Nearly all published articles are held in multiple national libraries. Articles and citations form the backbone of impact analysis of researchers. Catalogues and full-text search support discovery. Clearly, this is an untenable situation and we need to take action!
  11. The datasets programme has been established to explore how the Library can help… Not only do we want to ensure data is preserved, we envision a future where… Our approach is to foster collaboration and…
  12. How can we achieve this? We are working on a number of projects – see www.bl.uk/datasets
  13. DataCite 2 We see Persistent identification as a key component for this…
  14. So what can organisations like the British Library do to help address these issues? Libraries have a reasonable level of credibility with identifiers and metadata to enable discovery and enhance access. We are cross-discipline, have established relationships with publishers, universities, researchers and funders, and play a core role in the national research infrastructure. We feel that we can address some of the barriers that we are seeing to data citation. We are clear that we do not want to re-invent the wheel and that we want to ensure that the right incentives are there.
  15. DataCite 3 The approach that DataCite is taking – using DOIs - has some important social benefits. Researchers, authors, publishers are comfortable, understand, and know how to use them. They put datasets on a level playing field with articles. [Add citation of data in an article… REAL ONE!]
  16. So what can organisations like the British Library do to help address these issues? Libraries have a reasonable level of credibility with identifiers and metadata to enable discovery and enhance access. We are cross-discipline, have established relationships with publishers, universities, researchers and funders, and play a core role in the national research infrastructure. We feel that we can address some of the barriers that we are seeing to data citation. We are clear that we do not want to re-invent the wheel and that we want to ensure that the right incentives are there.
  17. Example Project 1: DataCite. Our long term vision is to support researchers by providing methods for them to locate, identify, and cite research datasets with confidence. Members: Germany - TIB; Germany - Gesis Leibniz Institute; Germany - German Library of Medicine; United Kingdom - The British Library; France - INIST; Switzerland - ETH Zürich; Denmark - TIC; Netherlands - TU Delft; Canada - CISTI; Australia - ANDS; USA - CDL; USA - Purdue.
  18. Today we will be talking about DataCite, an international association of 15 organisations. We have just had our 1 year anniversary (DataCite was founded at the British Library in December 2009). We are working together to…
  19. What is a DOI? A unique identifier, similar in concept to an ISBN. It consists of a prefix and a suffix.
  20. (NOTE – this DOI will not resolve!)
  21. Built a service for minting DOIs. This is what we will tell you about today, BUT FIRST we will quickly introduce DOIs.
  22. How can we achieve this? We are working on a number of projects – see www.bl.uk/datasets