SlideShare una empresa de Scribd logo
1 de 25
Descargar para leer sin conexión
Research Data Management in Economics Journals – Data Policies and
Data Description as prerequisites of reproducible research
Sven Vlaeminck / Ralf Toepfer | Leibniz-Information Centre for Economics (ZBW)
2013-5-31 | IASSIST 2013 | Cologne | Germany
Funded by
Table of Contents
> Project background, aims and expected output
> Some results of the project‘s analyses phase.
> Data sharing among economists
> Replicable research in economics journals
> Journals‘ infrastructures for managing research data

> Our approach to facilitate metadata creation for
researchers.
Project Background
Facilitate Replication of published empirical Findings in Economics.

"If the empirical basis for an article or book cannot be reproduced, of what use to the 
discipline are its conclusions? What purpose does an article like this serve?“
Gary King, 1995
Do you trust in applied economic research?
> Dewald et al. (1986) tried to replicate 54 articles of the Journal of
Money, Credit, and Banking (JMCB).
– They succeeded two times (3.7%).
> McCullough et al. (2006) tried to replicate 62 papers of the JMCB.
– They were able to replicate 14 (22.6%).
> McCullough et al. (2008) tried to replicate 117 articles of the
Federal Reserve Bank of St. Louis Review.
- They were able to replicate 9 (7.7%).
➔ Often, published economic research is not replicable.
Background of the Project (II):
> Explanations:
> Lack of incentives for sharing data / lack of recognition
and credit
> Infrequent implementation of data availability policies by
economics journals
> Infrastructure for publication-related research data
management is rarely available
> EDaWaX adresses these three challenges
The EDaWaX-Project
> EDaWaX analysed…
– …journals‘ data policies and made recommendations for those
guidelines.
– …analysed the data sharing behaviour of economists.
– … data centres and made recommendations where to host a
publication related data archive.
> EDaWaX…
– …used existing metadata schema to derive three „levels“ of
metadata to describe data and publications.
> Based on these findings we intend to develop & to implement a
publication-related data archive for an economic journal (using
CKAN).
Data Sharing among Economists
The current state of data sharing in the profession.

Funded by
The Status Quo among Economists…

…is not to share their data
Replicability and research data
management in economics journals
"Most results published in economics journals cannot be subjected to verification, even in 
principle, because authors typically are not required to make their data and code available for 
verification."]
McCullough, McGeary, Harrison (2006)

Funded by
Replicability in economics Journals

We built a sample of 141 economics journals to
evaluate the amount of journals equipped with
data policies.
Replicability in economics Journals

We found 40 journals who
claimed to have a „data policy“.  
Replicability in economics Journals

Only 29 of them had a useful
Data Availability Policy  
Replicability in economics Journals

24 out of these 29 policies
are mandatory for authors
Replicability in economics Journals

Only 12 journals required the
submission of data & code/syntax
Replicability in economics Journals

4 Journals (2.8% of the full sample) had more than 50% of all 
articles in two single issues accompanied by research data
The journals‘ infrastructure to provide
research data

Funded by
The current e-infrastructure is not sufficient…
Often, additional metadata is not created…
Summary: Publication-related research data
management in economics journals
> Economists rarely share their research data with others.
> Only a few journals own usefull data availability policies
AND enforce data availability.
> Research data often is available in form of zip-files only
and it is accessible mainly via the publisher‘s website.
> Mostly no specific metadata / no persistent identifiers
added
> Data sharing and research data management in
economics journals are still at its early stages!
How can we increase data availability and
reproducible research?
> Build a sound, usefull infrastructure to support RDM
> Improve data policies
> Rise awareness that data sharing and working up data is an
important (scientific) task.
> Support incentive-mechanisms, e.g.
– make data citable (-> DataCite)
– include research data in our disciplinairy portals.
– implement metrics and usage statistics.

> But: For doing so, we need additional metadata! Who can
create it?
Our approach to facilitate metadata
creation for researchers.

Funded by
How much documentation do we need?
And who is going to create it?
> For replication purposes it is necessary to use adequate
metadata schema (e.g. DDI3)
> …but extensive schema are not accepted by researchers.
What to do?
Our approach:
> We defined some „levels“ and functionalities/purposes
of metadata.
> Researcher chooses metadata „level“ and functionalities.
> Important to point out advantages and limitations of
each „level“ of data description!
Metadata for publication-related research data
> We identified four „levels“ of metadata – three of them
are available in our pilot application:
Level: Ensure citability (DataCite/da|ra) • 9 metadata fields
2. Level: Support findability (da|ra):
• 26 metadata fields
3. Level: Ensure linkability (da|ra) :
• up to 100 fields
1.

4. Level: Reproducibility (DDI3):
•

up to 846 fields
Vision
> “Making it convenient for scientists to describe, deposit,

and share their data and to access data from others,
plus promulgating the best data practices through
education and awareness will help the future of science
as well as the future of data preservation.”
(Tenopir 2011:20)
Thank you very much for your
attention!
Contact:
Sven Vlaeminck
s.vlaeminck@zbw.eu
Ralf Toepfer
r.toepfer@zbw.eu

Funded by

http://www.edawax.de (in English)

Más contenido relacionado

La actualidad más candente

Data mining course learning outcomes,Data Mining CMAP
Data mining course learning outcomes,Data Mining CMAPData mining course learning outcomes,Data Mining CMAP
Data mining course learning outcomes,Data Mining CMAP
jaya lakshmi
 
Data and Statistics library research at UCSD
Data and Statistics library research at UCSDData and Statistics library research at UCSD
Data and Statistics library research at UCSD
Annelise Sklar
 
Management of Data Collections
Management of Data CollectionsManagement of Data Collections
Management of Data Collections
abedejesus
 
Green is pre-icis workshop v1
Green is pre-icis workshop v1Green is pre-icis workshop v1
Green is pre-icis workshop v1
mmarrone
 

La actualidad más candente (20)

Data mining course learning outcomes,Data Mining CMAP
Data mining course learning outcomes,Data Mining CMAPData mining course learning outcomes,Data Mining CMAP
Data mining course learning outcomes,Data Mining CMAP
 
Real life application of statistics in engineering
Real life application of statistics in engineeringReal life application of statistics in engineering
Real life application of statistics in engineering
 
Data Mining: Applying data mining
Data Mining: Applying data miningData Mining: Applying data mining
Data Mining: Applying data mining
 
Introduction to Research methodology: Orientation for Doctoral Program Course...
Introduction to Research methodology: Orientation for Doctoral Program Course...Introduction to Research methodology: Orientation for Doctoral Program Course...
Introduction to Research methodology: Orientation for Doctoral Program Course...
 
Data mining
Data miningData mining
Data mining
 
Overview and library support for data management/sharing
Overview and library support for data management/sharingOverview and library support for data management/sharing
Overview and library support for data management/sharing
 
Data and Statistics library research at UCSD
Data and Statistics library research at UCSDData and Statistics library research at UCSD
Data and Statistics library research at UCSD
 
Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...
Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...
Poster RDAP13: A Workflow for Depositing to a Research Data Repository: A Cas...
 
EDI Training Module 5: Creating Clean Data foro Publishing
EDI Training Module 5:  Creating Clean Data foro PublishingEDI Training Module 5:  Creating Clean Data foro Publishing
EDI Training Module 5: Creating Clean Data foro Publishing
 
Baljeet ppt(1)2
Baljeet ppt(1)2Baljeet ppt(1)2
Baljeet ppt(1)2
 
Data Management Lab: Data management plan instructions
Data Management Lab: Data management plan instructionsData Management Lab: Data management plan instructions
Data Management Lab: Data management plan instructions
 
Big data and data mining
Big data and data miningBig data and data mining
Big data and data mining
 
dssi 10 12
dssi 10 12dssi 10 12
dssi 10 12
 
Data sharing as part of the research workflow
Data sharing as part of the research workflowData sharing as part of the research workflow
Data sharing as part of the research workflow
 
RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema...
RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema...RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema...
RDAP 15 EarthCollab: Connecting Scientific Information Sources using the Sema...
 
Data peer review workshop
Data peer review workshopData peer review workshop
Data peer review workshop
 
83341 ch24 jacobsen
83341 ch24 jacobsen83341 ch24 jacobsen
83341 ch24 jacobsen
 
Management of Data Collections
Management of Data CollectionsManagement of Data Collections
Management of Data Collections
 
Green is pre-icis workshop v1
Green is pre-icis workshop v1Green is pre-icis workshop v1
Green is pre-icis workshop v1
 
Using the library information resource network (lirn), jstor, or any
Using the library information resource network (lirn), jstor, or anyUsing the library information resource network (lirn), jstor, or any
Using the library information resource network (lirn), jstor, or any
 

Similar a Research Data Management in Economics Journals – Data Policies and Data Description as prerequisites of reproducible research

Similar a Research Data Management in Economics Journals – Data Policies and Data Description as prerequisites of reproducible research (20)

Paving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflowsPaving the way to open and interoperable research data service workflows
Paving the way to open and interoperable research data service workflows
 
Recognising data sharing
Recognising data sharingRecognising data sharing
Recognising data sharing
 
Paving the way to open and interoperable research data service workflows Prog...
Paving the way to open and interoperable research data service workflows Prog...Paving the way to open and interoperable research data service workflows Prog...
Paving the way to open and interoperable research data service workflows Prog...
 
Research Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and HumanitiesResearch Data Management and Sharing for the Social Sciences and Humanities
Research Data Management and Sharing for the Social Sciences and Humanities
 
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
 
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIESA SURVEY ON DATA MINING IN STEEL INDUSTRIES
A SURVEY ON DATA MINING IN STEEL INDUSTRIES
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
 
Predicting Online News Popularity
Predicting Online News Popularity Predicting Online News Popularity
Predicting Online News Popularity
 
THOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing ElsevierTHOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing Elsevier
 
Unit i
Unit iUnit i
Unit i
 
Meeting the NSF DMP Requirement: March 7, 2012
Meeting the NSF DMP Requirement: March 7, 2012Meeting the NSF DMP Requirement: March 7, 2012
Meeting the NSF DMP Requirement: March 7, 2012
 
Advanced Database System
Advanced Database SystemAdvanced Database System
Advanced Database System
 
Model management for systems biology projects
Model management for systems biology projectsModel management for systems biology projects
Model management for systems biology projects
 
Data Extraction
Data ExtractionData Extraction
Data Extraction
 
Systematic review article and Meta-analysis: Main steps for Successful writin...
Systematic review article and Meta-analysis: Main steps for Successful writin...Systematic review article and Meta-analysis: Main steps for Successful writin...
Systematic review article and Meta-analysis: Main steps for Successful writin...
 
A Survey And Taxonomy Of Distributed Data Mining Research Studies A Systemat...
A Survey And Taxonomy Of Distributed Data Mining Research Studies  A Systemat...A Survey And Taxonomy Of Distributed Data Mining Research Studies  A Systemat...
A Survey And Taxonomy Of Distributed Data Mining Research Studies A Systemat...
 
Analysis Of Data Mining Model For Successful Implementation Of Data Warehouse...
Analysis Of Data Mining Model For Successful Implementation Of Data Warehouse...Analysis Of Data Mining Model For Successful Implementation Of Data Warehouse...
Analysis Of Data Mining Model For Successful Implementation Of Data Warehouse...
 
6 ijaems sept-2015-6-a review of data security primitives in data mining
6 ijaems sept-2015-6-a review of data security primitives in data mining6 ijaems sept-2015-6-a review of data security primitives in data mining
6 ijaems sept-2015-6-a review of data security primitives in data mining
 
Looking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ EdinburghLooking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ Edinburgh
 
Data Mining of Project Management Data: An Analysis of Applied Research Studies.
Data Mining of Project Management Data: An Analysis of Applied Research Studies.Data Mining of Project Management Data: An Analysis of Applied Research Studies.
Data Mining of Project Management Data: An Analysis of Applied Research Studies.
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Último (20)

HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

Research Data Management in Economics Journals – Data Policies and Data Description as prerequisites of reproducible research

  • 1. Research Data Management in Economics Journals – Data Policies and Data Description as prerequisites of reproducible research Sven Vlaeminck / Ralf Toepfer | Leibniz-Information Centre for Economics (ZBW) 2013-5-31 | IASSIST 2013 | Cologne | Germany Funded by
  • 2. Table of Contents > Project background, aims and expected output > Some results of the project‘s analyses phase. > Data sharing among economists > Replicable research in economics journals > Journals‘ infrastructures for managing research data > Our approach to facilitate metadata creation for researchers.
  • 3. Project Background Facilitate Replication of published empirical Findings in Economics. "If the empirical basis for an article or book cannot be reproduced, of what use to the  discipline are its conclusions? What purpose does an article like this serve?“ Gary King, 1995
  • 4. Do you trust in applied economic research? > Dewald et al. (1986) tried to replicate 54 articles of the Journal of Money, Credit, and Banking (JMCB). – They succeeded two times (3.7%). > McCullough et al. (2006) tried to replicate 62 papers of the JMCB. – They were able to replicate 14 (22.6%). > McCullough et al. (2008) tried to replicate 117 articles of the Federal Reserve Bank of St. Louis Review. - They were able to replicate 9 (7.7%). ➔ Often, published economic research is not replicable.
  • 5. Background of the Project (II): > Explanations: > Lack of incentives for sharing data / lack of recognition and credit > Infrequent implementation of data availability policies by economics journals > Infrastructure for publication-related research data management is rarely available > EDaWaX adresses these three challenges
  • 6. The EDaWaX-Project > EDaWaX analysed… – …journals‘ data policies and made recommendations for those guidelines. – …analysed the data sharing behaviour of economists. – … data centres and made recommendations where to host a publication related data archive. > EDaWaX… – …used existing metadata schema to derive three „levels“ of metadata to describe data and publications. > Based on these findings we intend to develop & to implement a publication-related data archive for an economic journal (using CKAN).
  • 7. Data Sharing among Economists The current state of data sharing in the profession. Funded by
  • 8. The Status Quo among Economists… …is not to share their data
  • 9. Replicability and research data management in economics journals "Most results published in economics journals cannot be subjected to verification, even in  principle, because authors typically are not required to make their data and code available for  verification."] McCullough, McGeary, Harrison (2006) Funded by
  • 10. Replicability in economics Journals We built a sample of 141 economics journals to evaluate the amount of journals equipped with data policies.
  • 11. Replicability in economics Journals We found 40 journals who claimed to have a „data policy“.  
  • 12. Replicability in economics Journals Only 29 of them had a useful Data Availability Policy  
  • 13. Replicability in economics Journals 24 out of these 29 policies are mandatory for authors
  • 14. Replicability in economics Journals Only 12 journals required the submission of data & code/syntax
  • 15. Replicability in economics Journals 4 Journals (2.8% of the full sample) had more than 50% of all  articles in two single issues accompanied by research data
  • 16. The journals‘ infrastructure to provide research data Funded by
  • 17. The current e-infrastructure is not sufficient…
  • 18. Often, additional metadata is not created…
  • 19. Summary: Publication-related research data management in economics journals > Economists rarely share their research data with others. > Only a few journals own usefull data availability policies AND enforce data availability. > Research data often is available in form of zip-files only and it is accessible mainly via the publisher‘s website. > Mostly no specific metadata / no persistent identifiers added > Data sharing and research data management in economics journals are still at its early stages!
  • 20. How can we increase data availability and reproducible research? > Build a sound, usefull infrastructure to support RDM > Improve data policies > Rise awareness that data sharing and working up data is an important (scientific) task. > Support incentive-mechanisms, e.g. – make data citable (-> DataCite) – include research data in our disciplinairy portals. – implement metrics and usage statistics. > But: For doing so, we need additional metadata! Who can create it?
  • 21. Our approach to facilitate metadata creation for researchers. Funded by
  • 22. How much documentation do we need? And who is going to create it? > For replication purposes it is necessary to use adequate metadata schema (e.g. DDI3) > …but extensive schema are not accepted by researchers. What to do? Our approach: > We defined some „levels“ and functionalities/purposes of metadata. > Researcher chooses metadata „level“ and functionalities. > Important to point out advantages and limitations of each „level“ of data description!
  • 23. Metadata for publication-related research data > We identified four „levels“ of metadata – three of them are available in our pilot application: Level: Ensure citability (DataCite/da|ra) • 9 metadata fields 2. Level: Support findability (da|ra): • 26 metadata fields 3. Level: Ensure linkability (da|ra) : • up to 100 fields 1. 4. Level: Reproducibility (DDI3): • up to 846 fields
  • 24. Vision > “Making it convenient for scientists to describe, deposit, and share their data and to access data from others, plus promulgating the best data practices through education and awareness will help the future of science as well as the future of data preservation.” (Tenopir 2011:20)
  • 25. Thank you very much for your attention! Contact: Sven Vlaeminck s.vlaeminck@zbw.eu Ralf Toepfer r.toepfer@zbw.eu Funded by http://www.edawax.de (in English)