SlideShare una empresa de Scribd logo
1 de 31
IEDA DATA PUBLICATION
WORKSHOP
AGU Fall Meeting 2013
iedadata.org
Community-driven Data Services for
the Solid Earth Sciences
December 11, 2013

2

Data Publication: Definition
• Data publication with a small p
• Sharing data via web sites or submission to databases
• Data Publication with the big P
• Publication of data as part of scholarly communication
• Citable
This is what we are talking about!
• Persistent access
• Quality assurance (repository review or peer-review)
3

December 11, 2013

Data Publication: Why?
• Data access is key to science
• Data are the nucleus of scientific collaboration

re-use

• Scientific progress requires community that competes and collaborates

in pursuit of common goals
• Without access to the same materials no community exists
• Data are needed for scientific replication

reproduce

• The value of an article that can‟t be replicated:?
• Scholarly articles are summaries, not the actual research results
• Experimental data expensive to verify, observational data impossible
• Replication projects show: many published articles cannot be replicated

accessible at
http://www.slideshare.net/
4

December 11, 2013

Data Publication: Why now?
• Exponentially increasing data volumes
• Rapidly expanding cyberinfrastructure capabilities to mine

and analyze data
• New paradigms in publishing
• Growing enforcement of policies for open access to
research data

“In the last decades, rapid technical developments, such
as digital data and high-throughput techniques,
dramatically changed the scholarly publishing paradigm.
This requires new approaches in order to ensure
availability and usability of science data.”
http://www.icsu-wds.org/working-groups/data-publication
5

December 11, 2013

Data Policies
• Agencies
• Societies
• Journals

February 22, 2013

May 9, 2013
December 11, 2013

Data Policies

6
December 11, 2013

Data Policies

7
December 11, 2013

Data Publication: Why?
• Ensure proper citation of the data and credit to their

creator(s)
•

8
December 11, 2013

9

Data Citation Principles

http://www.force11.org/datacitation

“Sound, reproducible scholarship rests upon a foundation of
robust, accessible data. For this to be so in practice as well
as theory, data must be accorded due importance in the
practice of scholarship and in the enduring scholarly record.
In other words, data should be considered legitimate, citable
products of research. Data citation, like the citation of other
evidence and sources, is good research practice.”
10

December 11, 2013

Data Citation Principles (Force 11)
• Importance: Data should be considered legitimate, citable products of research. Data

•

•
•
•

•
•

•

citations should be accorded the same importance in the scholarly record as citations of
other research objects, such as publications.
Credit and attribution: Data citations should facilitate giving scholarly credit and normative
and legal attribution to all contributors to the data, recognizing that a single style or
mechanism of attribution may not be applicable to all data.
Evidence: Where a specific claim rests upon data, the corresponding data citation should be
provided.
Unique Identification: A data citation should include a persistent method for identification
that is machine actionable, globally unique, and widely used by a community.
Access: Data citations should facilitate access to the data themselves and to such
associated metadata, documentation, and other materials, as are necessary for both humans
and machines to make informed use of the referenced data.
Persistence: Metadata describing the data, and unique identifiers should persist, even
beyond the lifespan of the data they describe.
Versioning and granularity: Data citations should facilitate identification and access to
different versions and/or subsets of data. Citations should include sufficient detail to verifiably
link the citing work to the portion and version of data cited.
Interoperability and flexibility: Data citation methods should be sufficiently flexible to
accommodate the variant practices among communities but should not differ so much that
they compromise interoperability of data citation practices across communities.

http://www.force11.org/datacitation
11

December 11, 2013

Ongoing Debate & Development

http://www.slideshare.net
December 11, 2013

12
December 11, 2013

13

Data Citation Example
• Global Multi-Resolution Topography (GMRT) synthesis described by

•
•

•
•

Ryan et al., 2009.
Its data DOI is 10.1594/IEDA/100001.
Its citation would be: Ryan, William B.F. (2009): Global MultiResolution Topography (GMRT) synthesis. Integrated Earth Data
Applications (IEDA). http://dx.doi.org/10.1594/IEDA/100001
It can be accessed at the
URL: http://dx.doi.org/10.1594/IEDA/100001.
The data set DOI is different from the DOI for the publication that cites
the data set, doi: 10.1029/2008GC002332.
December 11, 2013

14

Data Publication: Benefits
• Scientific integrity
• publishing your data and citing its location in published research
papers can allow others to replicate, validate, or correct your
results, thereby improving the scientific record.
• Increase the impact of your research
• those who make use of your data and cite it in their own research
will help to increase your impact within your field and beyond it.
Users of your data may include those in other disciplines, sectors,
and countries.
• Preserve your data for your own future use
• by preparing your data for sharing with others, you will benefit by
being able to identify, retrieve, and understand the data yourself
after you have lost familiarity with it, perhaps several years hence.
15

December 11, 2013

Data Publication: Benefits

from: Heather A. Piwowar, “Sharing Detailed Research Data Is Associated with Increased Citation Rate”
http://precedings.nature.com/documents/361/version/1
December 11, 2013

RDA/WDS Publishing Data IG
• address practical aspects in publishing research data

16
17

December 11, 2013

Data Publication: Options

Conventional
publication
Institutional
Repositories

Disciplinary
Repositories
Data Article
18

December 11, 2013

Data Publication “Best Practice”

Trusted Data
Repository

Journal
Data
File

Data
Description

Reciprocal citation by DOI
December 11, 2013

19

Journal Guidelines: Examples
“Elsevier encourages authors to deposit raw experimental
data at relevant data repositories.”
http://www.elsevier.com/about/content-innovation/database-linking

“AGU encourages authors to identify and archive
their data in approved data centers.”
AGU Data Policy, approved by AGU Council Dec 8, 2013
20

Repository Requirements
“The use of published digital data, like the use of digitally published literature, depends
upon the ability to identify, authenticate, locate, access, and interpret them.”
Report of the CODATA - ICSTI Task Group on Data Citation Standards and Practices: “OUT OF CITE, OUT OF MIND”

• Open access
• Long-term preservation
• Persistent & unique identification
• Data quality assurance (peer-review?)

➥Deposition of data in ‘Trusted

Repositories’
December 11, 2013

21
22

Domain-Specific Repositories
• Are best poised to ensure „Fitness for Re-use‟ through

domain-specific data stewardship
• Must ensure professional data curation services
• Long-term archiving & access
• Persistent, unique identification
• Discoverability (metadata registration)

• Must integrate with the „scholarly communication

ecosystem‟
23

Adding Value: Domain-Specific Data
Stewardship
• Development, maintenance, and promotion of domain-

specific, community-based standards for data and metadata
• Provenance documentation, uncertainties, semantics (vocabularies,

taxonomy), formats

• Domain-specific guidelines, software tools, and user

support/training that facilitate data submission
• Harmonization & integration of data for advanced analysis
• Mapping of data to standards-based interfaces for
interoperability
24

GDJ „Approved Repositories‟
December 11, 2013

25
December 11, 2013

26

IEDA Infrastructure
• Cooperative Agreement with NSF
• Sustainable funding
• Formal community governance & guidance
• Disciplinary expertise
• Professional data management policies & procedures
• Persistent identification of data & samples (DOI, IGSN)
• Standards-compliant metadata catalog
• Long-term archiving agreements with National Geophysical Data
Center & Columbia University Libraries
• Risk management

• “Accreditation” as member of the World Data System
December 11, 2013

27

IEDA Repository Services
• Rich and standards-compliant metadata catalog
• Data publication with DataCite (Registration with DOI)
• Online data submission tools
• QA/QC of datasets and metadata
• Storage & risk management of submitted data

• Long-term preservation (via partners)
• Cross-referencing with journals, data citation index, etc.
December 11, 2013

Links to Journals

28
December 11, 2013

Links to Journals

29
December 11, 2013

IEDA Repository Services
• Investigator support
• Data Management Plan tool
• Data Compliance Report tool
• User Support
• Online submission tools
• Data templates
• Tutorials & Help pages
• YouTube videos
• Personal assistance (info@iedadata.org)
• Workshops, webinars, etc.
http://www.iedadata.org/help

30
31

December 11, 2013

IEDA Data Publication Process
User Submission

Data

Review

IEDA Data
Managers

Publication

IEDA Repository

Integration

IEDA Data
Managers

Synthesis
databases

DOI linking

Journal

Manuscript
Editors

Portal
DOI linking

Más contenido relacionado

La actualidad más candente

NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14
SEAD
 

La actualidad más candente (20)

SEAD: Lightweight Data Services for Sustainability Research
SEAD: Lightweight Data Services for Sustainability ResearchSEAD: Lightweight Data Services for Sustainability Research
SEAD: Lightweight Data Services for Sustainability Research
 
DataONE Education Module 10: Legal and Policy Issues
DataONE Education Module 10: Legal and Policy IssuesDataONE Education Module 10: Legal and Policy Issues
DataONE Education Module 10: Legal and Policy Issues
 
Data Policy for Open Science
Data Policy for Open ScienceData Policy for Open Science
Data Policy for Open Science
 
Data Publishing Models by Sünje Dallmeier-Tiessen
Data Publishing Models by Sünje Dallmeier-TiessenData Publishing Models by Sünje Dallmeier-Tiessen
Data Publishing Models by Sünje Dallmeier-Tiessen
 
RDAP 16: DMPs and Public Access: Agency and Data Service Experiences
RDAP 16: DMPs and Public Access: Agency and Data Service ExperiencesRDAP 16: DMPs and Public Access: Agency and Data Service Experiences
RDAP 16: DMPs and Public Access: Agency and Data Service Experiences
 
Levine - Data Curation; Ethics and Legal Considerations
Levine - Data Curation; Ethics and Legal ConsiderationsLevine - Data Curation; Ethics and Legal Considerations
Levine - Data Curation; Ethics and Legal Considerations
 
Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Ag Data Commons: A new USDA catalog and repository for agricultural research ...Ag Data Commons: A new USDA catalog and repository for agricultural research ...
Ag Data Commons: A new USDA catalog and repository for agricultural research ...
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
RDAP13 Mark Parsons: The Research Data Alliance: Making Data Work
RDAP13 Mark Parsons: The Research Data Alliance: Making Data WorkRDAP13 Mark Parsons: The Research Data Alliance: Making Data Work
RDAP13 Mark Parsons: The Research Data Alliance: Making Data Work
 
NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14NSF DataNet Partners Update at RDAP14
NSF DataNet Partners Update at RDAP14
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...
RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...
RDAP 16: DMPs and Public Access: An NIH Perspective (Panel 5, DMPs and Public...
 
Research Data Management for SOE
Research Data Management for SOEResearch Data Management for SOE
Research Data Management for SOE
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
Guidelines for OSTP Data Access Plans
Guidelines for OSTP Data Access PlansGuidelines for OSTP Data Access Plans
Guidelines for OSTP Data Access Plans
 
RDA Presentation to the International Federation of Library Associations
RDA Presentation to the International Federation of Library AssociationsRDA Presentation to the International Federation of Library Associations
RDA Presentation to the International Federation of Library Associations
 
DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?
 
The UC Curation Center (UC3): Developing Tools & Services for Managing Research
The UC Curation Center (UC3): Developing Tools & Services for Managing ResearchThe UC Curation Center (UC3): Developing Tools & Services for Managing Research
The UC Curation Center (UC3): Developing Tools & Services for Managing Research
 
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...
RDAP 16: Perspective on DMPs, Funders and Public Access (Panel 5: DMPs and Pu...
 
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinaiDataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
DataTags: Sharing Privacy Sensitive Data by Michael Bar-sinai
 

Similar a IEDA Data Publication Workshop @AGU

Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
Karlsruhe Institute of Technology (KIT)
 
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Natsuko Nicholls
 
2014 ALA MW SPARC-ACRL Forum Talk
2014 ALA MW SPARC-ACRL Forum Talk2014 ALA MW SPARC-ACRL Forum Talk
2014 ALA MW SPARC-ACRL Forum Talk
Paul Bracke
 

Similar a IEDA Data Publication Workshop @AGU (20)

How and Why to Share Your Data
How and Why to Share Your DataHow and Why to Share Your Data
How and Why to Share Your Data
 
Shareable by Design: Making Better Use of your Research
Shareable by Design: Making Better Use of your ResearchShareable by Design: Making Better Use of your Research
Shareable by Design: Making Better Use of your Research
 
Linking Data to Publications through Citation and Virtual Archives
Linking Data to Publications through Citation and Virtual ArchivesLinking Data to Publications through Citation and Virtual Archives
Linking Data to Publications through Citation and Virtual Archives
 
Instituting an Institutional Repository for Sharing, Archiving, and Accessing...
Instituting an Institutional Repository for Sharing, Archiving, and Accessing...Instituting an Institutional Repository for Sharing, Archiving, and Accessing...
Instituting an Institutional Repository for Sharing, Archiving, and Accessing...
 
Implementing and Institutional Repository for Sharing, Archiving, and Accessi...
Implementing and Institutional Repository for Sharing, Archiving, and Accessi...Implementing and Institutional Repository for Sharing, Archiving, and Accessi...
Implementing and Institutional Repository for Sharing, Archiving, and Accessi...
 
dkNET Office Hours: NIH Data Management and Sharing Mandate 05/03/2024
dkNET Office Hours: NIH Data Management and Sharing Mandate  05/03/2024dkNET Office Hours: NIH Data Management and Sharing Mandate  05/03/2024
dkNET Office Hours: NIH Data Management and Sharing Mandate 05/03/2024
 
Open Data and Institutional Repositories
Open Data and Institutional RepositoriesOpen Data and Institutional Repositories
Open Data and Institutional Repositories
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data management
 
Data Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach DataData Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach Data
 
Data publishing at the UQ Library
Data publishing at the UQ LibraryData publishing at the UQ Library
Data publishing at the UQ Library
 
Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
 
Standardising research data policies, research data network
Standardising research data policies, research data networkStandardising research data policies, research data network
Standardising research data policies, research data network
 
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
Enriching Scholarship 2014 Beyond the Journal Article: Publishing and Citing ...
 
Do It Yourself (DIY) Earth Science Collaboratories Using Best Practices and B...
Do It Yourself (DIY) Earth Science Collaboratories Using Best Practices and B...Do It Yourself (DIY) Earth Science Collaboratories Using Best Practices and B...
Do It Yourself (DIY) Earth Science Collaboratories Using Best Practices and B...
 
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
 
Research Data Service at the University of Edinburgh
Research Data Service at the University of EdinburghResearch Data Service at the University of Edinburgh
Research Data Service at the University of Edinburgh
 
RDA Update
RDA UpdateRDA Update
RDA Update
 
Data sharing as part of the research ecosystem
Data sharing as part of the research ecosystemData sharing as part of the research ecosystem
Data sharing as part of the research ecosystem
 
NISO Forum, Denver, Sept. 24, 2012: DataCite and Campus Data Services
NISO Forum, Denver, Sept. 24, 2012: DataCite and Campus Data ServicesNISO Forum, Denver, Sept. 24, 2012: DataCite and Campus Data Services
NISO Forum, Denver, Sept. 24, 2012: DataCite and Campus Data Services
 
2014 ALA MW SPARC-ACRL Forum Talk
2014 ALA MW SPARC-ACRL Forum Talk2014 ALA MW SPARC-ACRL Forum Talk
2014 ALA MW SPARC-ACRL Forum Talk
 

Más de Kerstin Lehnert

Boosting Data Science in Geochemistry: We Need Global Geochemical Data Standa...
Boosting Data Science in Geochemistry: We Need Global Geochemical Data Standa...Boosting Data Science in Geochemistry: We Need Global Geochemical Data Standa...
Boosting Data Science in Geochemistry: We Need Global Geochemical Data Standa...
Kerstin Lehnert
 

Más de Kerstin Lehnert (19)

Astromat Update on Developments 2021-01-29
Astromat Update on Developments 2021-01-29Astromat Update on Developments 2021-01-29
Astromat Update on Developments 2021-01-29
 
Data Services for Geochemical Data
Data Services for Geochemical DataData Services for Geochemical Data
Data Services for Geochemical Data
 
Lehnert_EGU201_SampleMetadataStandards
Lehnert_EGU201_SampleMetadataStandardsLehnert_EGU201_SampleMetadataStandards
Lehnert_EGU201_SampleMetadataStandards
 
Goldschmidt2019 Samples Workshop
Goldschmidt2019 Samples WorkshopGoldschmidt2019 Samples Workshop
Goldschmidt2019 Samples Workshop
 
Boosting Data Science in Geochemistry: We Need Global Geochemical Data Standa...
Boosting Data Science in Geochemistry: We Need Global Geochemical Data Standa...Boosting Data Science in Geochemistry: We Need Global Geochemical Data Standa...
Boosting Data Science in Geochemistry: We Need Global Geochemical Data Standa...
 
EGU 2018 Ian McHarg Lecture
EGU 2018 Ian McHarg LectureEGU 2018 Ian McHarg Lecture
EGU 2018 Ian McHarg Lecture
 
EarthCubeArchitectureWS_June2015
EarthCubeArchitectureWS_June2015EarthCubeArchitectureWS_June2015
EarthCubeArchitectureWS_June2015
 
Advancing Reproducible Science from Physical Samples: The IGSN and the iSampl...
Advancing Reproducible Science from Physical Samples: The IGSN and the iSampl...Advancing Reproducible Science from Physical Samples: The IGSN and the iSampl...
Advancing Reproducible Science from Physical Samples: The IGSN and the iSampl...
 
Making Small Data BIG (UT Austin, March 2016)
Making Small Data BIG (UT Austin, March 2016)Making Small Data BIG (UT Austin, March 2016)
Making Small Data BIG (UT Austin, March 2016)
 
IGSN: The International Geo Sample Number (DFG Roundtable)
IGSN: The International Geo Sample Number (DFG Roundtable)IGSN: The International Geo Sample Number (DFG Roundtable)
IGSN: The International Geo Sample Number (DFG Roundtable)
 
Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)
 
Data Standards & Best Practices for the Stratigraphic Record
Data Standards & Best Practices for the Stratigraphic RecordData Standards & Best Practices for the Stratigraphic Record
Data Standards & Best Practices for the Stratigraphic Record
 
Interdisciplinary Data Resources for Volcanology at the IEDA (Interdisciplina...
Interdisciplinary Data Resources for Volcanology at the IEDA (Interdisciplina...Interdisciplinary Data Resources for Volcanology at the IEDA (Interdisciplina...
Interdisciplinary Data Resources for Volcanology at the IEDA (Interdisciplina...
 
The Internet of Samples: IGSN in Action
The Internet of Samples: IGSN in ActionThe Internet of Samples: IGSN in Action
The Internet of Samples: IGSN in Action
 
Digital Representation of Physical Samples in Scientific Publications
Digital Representation of Physical Samples in Scientific PublicationsDigital Representation of Physical Samples in Scientific Publications
Digital Representation of Physical Samples in Scientific Publications
 
Lehnert: Making Small Data Big, IACS, April2015
Lehnert: Making Small Data Big, IACS, April2015Lehnert: Making Small Data Big, IACS, April2015
Lehnert: Making Small Data Big, IACS, April2015
 
IEDA: Making Small Data BIG Through Interdisciplinary Partnerships Among Long...
IEDA: Making Small Data BIG Through Interdisciplinary Partnerships Among Long...IEDA: Making Small Data BIG Through Interdisciplinary Partnerships Among Long...
IEDA: Making Small Data BIG Through Interdisciplinary Partnerships Among Long...
 
iSamples Research Coordination Network (C4P Webinar)
iSamples Research Coordination Network (C4P Webinar)iSamples Research Coordination Network (C4P Webinar)
iSamples Research Coordination Network (C4P Webinar)
 
MoonDB: Restoration & Synthesis of Planetary Geochemical Data
MoonDB: Restoration & Synthesis of Planetary Geochemical DataMoonDB: Restoration & Synthesis of Planetary Geochemical Data
MoonDB: Restoration & Synthesis of Planetary Geochemical Data
 

Último

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 

Último (20)

Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 

IEDA Data Publication Workshop @AGU

  • 1. IEDA DATA PUBLICATION WORKSHOP AGU Fall Meeting 2013 iedadata.org Community-driven Data Services for the Solid Earth Sciences
  • 2. December 11, 2013 2 Data Publication: Definition • Data publication with a small p • Sharing data via web sites or submission to databases • Data Publication with the big P • Publication of data as part of scholarly communication • Citable This is what we are talking about! • Persistent access • Quality assurance (repository review or peer-review)
  • 3. 3 December 11, 2013 Data Publication: Why? • Data access is key to science • Data are the nucleus of scientific collaboration re-use • Scientific progress requires community that competes and collaborates in pursuit of common goals • Without access to the same materials no community exists • Data are needed for scientific replication reproduce • The value of an article that can‟t be replicated:? • Scholarly articles are summaries, not the actual research results • Experimental data expensive to verify, observational data impossible • Replication projects show: many published articles cannot be replicated accessible at http://www.slideshare.net/
  • 4. 4 December 11, 2013 Data Publication: Why now? • Exponentially increasing data volumes • Rapidly expanding cyberinfrastructure capabilities to mine and analyze data • New paradigms in publishing • Growing enforcement of policies for open access to research data “In the last decades, rapid technical developments, such as digital data and high-throughput techniques, dramatically changed the scholarly publishing paradigm. This requires new approaches in order to ensure availability and usability of science data.” http://www.icsu-wds.org/working-groups/data-publication
  • 5. 5 December 11, 2013 Data Policies • Agencies • Societies • Journals February 22, 2013 May 9, 2013
  • 8. December 11, 2013 Data Publication: Why? • Ensure proper citation of the data and credit to their creator(s) • 8
  • 9. December 11, 2013 9 Data Citation Principles http://www.force11.org/datacitation “Sound, reproducible scholarship rests upon a foundation of robust, accessible data. For this to be so in practice as well as theory, data must be accorded due importance in the practice of scholarship and in the enduring scholarly record. In other words, data should be considered legitimate, citable products of research. Data citation, like the citation of other evidence and sources, is good research practice.”
  • 10. 10 December 11, 2013 Data Citation Principles (Force 11) • Importance: Data should be considered legitimate, citable products of research. Data • • • • • • • citations should be accorded the same importance in the scholarly record as citations of other research objects, such as publications. Credit and attribution: Data citations should facilitate giving scholarly credit and normative and legal attribution to all contributors to the data, recognizing that a single style or mechanism of attribution may not be applicable to all data. Evidence: Where a specific claim rests upon data, the corresponding data citation should be provided. Unique Identification: A data citation should include a persistent method for identification that is machine actionable, globally unique, and widely used by a community. Access: Data citations should facilitate access to the data themselves and to such associated metadata, documentation, and other materials, as are necessary for both humans and machines to make informed use of the referenced data. Persistence: Metadata describing the data, and unique identifiers should persist, even beyond the lifespan of the data they describe. Versioning and granularity: Data citations should facilitate identification and access to different versions and/or subsets of data. Citations should include sufficient detail to verifiably link the citing work to the portion and version of data cited. Interoperability and flexibility: Data citation methods should be sufficiently flexible to accommodate the variant practices among communities but should not differ so much that they compromise interoperability of data citation practices across communities. http://www.force11.org/datacitation
  • 11. 11 December 11, 2013 Ongoing Debate & Development http://www.slideshare.net
  • 13. December 11, 2013 13 Data Citation Example • Global Multi-Resolution Topography (GMRT) synthesis described by • • • • Ryan et al., 2009. Its data DOI is 10.1594/IEDA/100001. Its citation would be: Ryan, William B.F. (2009): Global MultiResolution Topography (GMRT) synthesis. Integrated Earth Data Applications (IEDA). http://dx.doi.org/10.1594/IEDA/100001 It can be accessed at the URL: http://dx.doi.org/10.1594/IEDA/100001. The data set DOI is different from the DOI for the publication that cites the data set, doi: 10.1029/2008GC002332.
  • 14. December 11, 2013 14 Data Publication: Benefits • Scientific integrity • publishing your data and citing its location in published research papers can allow others to replicate, validate, or correct your results, thereby improving the scientific record. • Increase the impact of your research • those who make use of your data and cite it in their own research will help to increase your impact within your field and beyond it. Users of your data may include those in other disciplines, sectors, and countries. • Preserve your data for your own future use • by preparing your data for sharing with others, you will benefit by being able to identify, retrieve, and understand the data yourself after you have lost familiarity with it, perhaps several years hence.
  • 15. 15 December 11, 2013 Data Publication: Benefits from: Heather A. Piwowar, “Sharing Detailed Research Data Is Associated with Increased Citation Rate” http://precedings.nature.com/documents/361/version/1
  • 16. December 11, 2013 RDA/WDS Publishing Data IG • address practical aspects in publishing research data 16
  • 17. 17 December 11, 2013 Data Publication: Options Conventional publication Institutional Repositories Disciplinary Repositories Data Article
  • 18. 18 December 11, 2013 Data Publication “Best Practice” Trusted Data Repository Journal Data File Data Description Reciprocal citation by DOI
  • 19. December 11, 2013 19 Journal Guidelines: Examples “Elsevier encourages authors to deposit raw experimental data at relevant data repositories.” http://www.elsevier.com/about/content-innovation/database-linking “AGU encourages authors to identify and archive their data in approved data centers.” AGU Data Policy, approved by AGU Council Dec 8, 2013
  • 20. 20 Repository Requirements “The use of published digital data, like the use of digitally published literature, depends upon the ability to identify, authenticate, locate, access, and interpret them.” Report of the CODATA - ICSTI Task Group on Data Citation Standards and Practices: “OUT OF CITE, OUT OF MIND” • Open access • Long-term preservation • Persistent & unique identification • Data quality assurance (peer-review?) ➥Deposition of data in ‘Trusted Repositories’
  • 22. 22 Domain-Specific Repositories • Are best poised to ensure „Fitness for Re-use‟ through domain-specific data stewardship • Must ensure professional data curation services • Long-term archiving & access • Persistent, unique identification • Discoverability (metadata registration) • Must integrate with the „scholarly communication ecosystem‟
  • 23. 23 Adding Value: Domain-Specific Data Stewardship • Development, maintenance, and promotion of domain- specific, community-based standards for data and metadata • Provenance documentation, uncertainties, semantics (vocabularies, taxonomy), formats • Domain-specific guidelines, software tools, and user support/training that facilitate data submission • Harmonization & integration of data for advanced analysis • Mapping of data to standards-based interfaces for interoperability
  • 26. December 11, 2013 26 IEDA Infrastructure • Cooperative Agreement with NSF • Sustainable funding • Formal community governance & guidance • Disciplinary expertise • Professional data management policies & procedures • Persistent identification of data & samples (DOI, IGSN) • Standards-compliant metadata catalog • Long-term archiving agreements with National Geophysical Data Center & Columbia University Libraries • Risk management • “Accreditation” as member of the World Data System
  • 27. December 11, 2013 27 IEDA Repository Services • Rich and standards-compliant metadata catalog • Data publication with DataCite (Registration with DOI) • Online data submission tools • QA/QC of datasets and metadata • Storage & risk management of submitted data • Long-term preservation (via partners) • Cross-referencing with journals, data citation index, etc.
  • 28. December 11, 2013 Links to Journals 28
  • 29. December 11, 2013 Links to Journals 29
  • 30. December 11, 2013 IEDA Repository Services • Investigator support • Data Management Plan tool • Data Compliance Report tool • User Support • Online submission tools • Data templates • Tutorials & Help pages • YouTube videos • Personal assistance (info@iedadata.org) • Workshops, webinars, etc. http://www.iedadata.org/help 30
  • 31. 31 December 11, 2013 IEDA Data Publication Process User Submission Data Review IEDA Data Managers Publication IEDA Repository Integration IEDA Data Managers Synthesis databases DOI linking Journal Manuscript Editors Portal DOI linking