1. CrossRef Annual Meeting
Cambridge, MA – November 15, 2011
DataCite: the Perfect
Complement to CrossRef
James L. Mullins, PhD
Dean of Libraries and Professor
Libraries
2. “I like to think of data in three categories,
using a mining metaphor: ‘raw ore’,
‘concentrate’, and ‘virgin metal.’ The
question is which data are worth saving
and which throwing away?”
Arden Bement, Remarks before IATUL, June 21st, 2010,
Purdue University, West Lafayette, Indiana.
http://docs.lib.purdue.edu/iatul2010/conf/day1/7/
Libraries
3. Data/Data Sets
•Prolific growth – large and small science
•Lifeblood – science and engineeringresearch
•Modeling – demands massive amounts of data
•Funding Agencies – public accessibility
Libraries
4. Motivation
– No effective way to link
Articles between datasets and articles
Articles
– No widely used method to
identify datasets
– No widely used method to
Underlying cite datasets
Underlying data
data
Jan Brase – ORCID
Outreach Meeting –
September 17th,
2011 – CERN
Jan Brase –
November 2011
Libraries
5. The CrossRef Digital Object Identifier (DOI®)
System identifies Journals.
More than 98% of all DOI
registered are for scholarly
articles.
Kuhlmann, Holger; Freudenthal, Tim; Helmke, Peer; Meggers,
Helge (2004): Reconstruction of paleoceanography off NW
Africa during the last 40,000 years: influence of local and
regional factors on sediment accumulation. Marine Geology,
207(1-4), 209-224, doi:10.1016/j.margeo.2004.03.017
Libraries
6. •Establish easier access to scientific research data on the
Internet.
•Increase acceptance of research data as legitimate,
citable contributions to the scientific record.
•Support data archiving that will permit results to be
verified and re-purposed for future study.
http://www.datacite.org/ Libraries
7. • Descriptive Metadata - author (person
or corporate), research variables, etc.
• Subject descriptors - disciplinary
taxonomy
• Digital Object Identifier - persistent
identifier
The dataset with DOI:
Kuhlmann, H et al. (2009): Age models, iron intensity,
magnetic susceptibility records and dry bulk density
of sediment cores from around the Canary Islands.
doi:10.1594/PANGAEA.727522, Libraries
8. • Acceptance by research community of
the importance of data in research
• Assignment of authorship/creation (lack
of copyright control) – metadata & DOI
• Citation to dataset by research
undertaken and reported – through
DataCite
• Establishment of an h-index for dataset
impact Libraries
9. Deep Sea Research: Oceanographic Research Papers –
Elsevier, Inc.
CrossRef DOI
DataSet
Libraries
10. Deep Sea Research: Oceanographic Research Papers –
Elsevier, Inc.
DOI by DataCite
Libraries
11. DataCite Members
• Technische Informationsbibliothek (TIB)
• Canada Institute for Scientific and Technical Information (CISTI),
• California Digital Library, USA
• Purdue University, USA
• Office of Scientific and Technical
• Information (OSTI), USA
• Library of TU Delft,
• The Netherlands
• Technical Information
• Center of Denmark
• The British Library
• ZB Med, Germany
• ZBW, Germany
• Gesis, Germany
• Library of ETH Zürich
• L’Institut de l’Information Scientifique
• et Technique (INIST), France
• Swedish National Data Service (SND)
• Australian National Data Service (ANDS)
• Affiliated members:
• Digital Curation Center (UK)
• Microsoft Research
• Interuniversity Consortium for Political and Social Research (ICPSR)
• Korea Institute of Science and Technology Information (KISTI)
Libraries
12. DataCite Structure
International DOI
In the Foundation
United Member
States DataCite
Managing Agent
(TIB)
California Digital Office of Science and Associate
Library and Technological Stakeholder:
Purdue University Information (OSTI) ICPSR;
Libraries (EZID) Microsoft
… Research
Works
Data Centre
Data Centre Data Centre
Data Centre
Data Centre Data Centre with
13. “I like to think of data in three categories,
using a mining metaphor: ‘raw ore’,
‘concentrate’, and ‘virgin metal.’ The
question is which data are worth saving
and which throwing away?”
Arden Bement, Remarks before IATUL, June 21st, 2010,
Purdue University, West Lafayette, Indiana.
http://docs.lib.purdue.edu/iatul2010/conf/day1/7/
Libraries
14. Thank you.
Questions?
James L. Mullins, Purdue/DataCite: jmullins@purdue.edu
Patricia Cruse, CDL/DataCite: Patricia.Cruse@ucop.edu
Sharon Jordan, OSTI/DataCite: JordanS@osti.gov
Jan Brase, TIB/DataCite: Jan.Brase@tib.uni-hannover.de
Libraries