Do Authors Deposit on Time?
Tracking Open Access Policy Compliance
Drahomira Herrmannova
Nancy Pontika
Petr Knoth
June 4, 2019 – JCDL 2019, Urbana-Champaign, IL
Big Scientific Data and Text Analytics Group
Knowledge Media Institute, The Open University
Introduction
• Why we want Open Access (OA)
• Taxpayers should be able to read publicly funded research
• Help researchers at poorer institutions without access to
subscriptions
• Institutions suffer from rising journal subscription prices
• Funders introduce policies to encourage OA
• Notable examples:
• U.S. Public Access Plan
• U.S. NIH Public Access Policy
• UK REF 2021 Open Access Policy
• EC H2020 Open Access Policy
Growing number of OA policies
Source: http://roarmap.eprints.org/
Currently close to one thousand funder and institutional OA policies
OA policies
• Provide criteria for making papers OA
• Requirements, such as:
• Where should papers be made available (publication or deposit)
• When should papers be deposited
• What version should be deposited (e.g. pre-print vs. post-print)
• Allowed embargo periods
• Etc.
Research questions
What effect do OA policies have?
• Piwowar et al. (2018): At least 28% of all research papers are OA
• Larivière and Sugimoto (2018): More than two thirds of papers from selected funders (with an OA policy) were OA
• Gargouri et al. (2012): OA growth often due to retroactive self-archiving (often years after publication)
When do authors deposit?
Do they deposit in accordance with policies?
Deposit time lag
• What is deposit time lag?
• The difference between date of publication and date of deposit in a
repository expressed in days
• We study deposit time lag across
• Country
• Time
• Repository
• Discipline
Data
[Figures not included in transcript]
Deposit time lag calculation
• Deposit time lag = deposit date – publication date
• The difference is expressed in days
• Positive values: article deposited after publication
• Negative values: article deposited prior to publication
• Best: as low a value as possible
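The calculation above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the function name `deposit_time_lag` and the example dates are assumptions made here.

```python
from datetime import date


def deposit_time_lag(publication_date: date, deposit_date: date) -> int:
    """Return the deposit time lag in days (deposit date minus publication date)."""
    return (deposit_date - publication_date).days


# Hypothetical article deposited 30 days after publication: positive lag.
late = deposit_time_lag(date(2017, 3, 1), date(2017, 3, 31))

# Hypothetical article deposited 10 days before publication: negative lag.
early = deposit_time_lag(date(2017, 3, 11), date(2017, 3, 1))

print(late, early)  # 30 -10
```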
Dataset
• 2013-2018 publications
• Metadata from Crossref and CORE
Final dataset size:
Publications  808,984
Repositories  728
Countries     70
[Figure: Year of publication distribution]
Results: Deposit time lag per country
Results: Deposit time lag per country/year
• How has deposit time lag changed over time?
• Average deposit time lag per year of publication
• Two options:
1. Use all data
• Underestimates deposit time lag for all, but especially for newer publications
2. Put a maximum limit on deposit time lag for the analysis (for comparability)
• E.g. deposit at most a year later
• Underestimates deposit time lag for all, but especially for older publications
[Figure: Yearly deposits, toy example for 2013 publications and 2017 publications, 2011-2018]
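A small sketch may help show how the two options differ. The records below are invented toy values, and `mean_lag_per_year` is a hypothetical helper, not part of the published code. Option 1 averages every observed lag. Option 2 keeps only deposits made within a fixed window (here assumed to be 365 days) so that publication years are comparable.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical toy records: (publication year, deposit time lag in days).
# The 2017 papers have had little time to accumulate late deposits.
records = [
    (2013, 30), (2013, 400), (2013, 1500),
    (2017, 20), (2017, 200),
]


def mean_lag_per_year(records, max_lag_days=None):
    """Average lag per publication year, optionally dropping lags above a cap."""
    by_year = defaultdict(list)
    for year, lag in records:
        if max_lag_days is None or lag <= max_lag_days:
            by_year[year].append(lag)
    return {year: mean(lags) for year, lags in sorted(by_year.items())}


all_data = mean_lag_per_year(records)                  # option 1: use all data
capped = mean_lag_per_year(records, max_lag_days=365)  # option 2: one-year cap

print(all_data)
print(capped)
```

In this toy data the cap removes the two late 2013 deposits, so the 2013 average drops sharply while the 2017 average is unchanged. That is the sense in which a cap affects older publications most.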
Results: Deposit time lag per year/country
Option 1: All data Option 2: Max deposit time lag limit (1 yr)
Results: Deposit time lag per subject
Bars are not stacked, but overlaid
REF 2021 OA Policy
• In 2014, the UK introduced an OA Policy for its next research
assessment exercise (REF)
• Requirements
• Deposit final manuscript in an OA repository
• Deposit on publication/acceptance or within 3 months of it
• Applies to papers published since April 2016
• Sanction
• The OA requirement is linked to performance review
• Did the introduction of this mandatory policy affect deposit
time lag in the UK compared to other countries?
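The timing requirement can be expressed as a simple check on deposit time lag. The sketch below rests on two assumptions made here, not taken from the policy text: "since April 2016" is read as 1 April 2016, and "3 months" is approximated as 90 days. The function names are hypothetical.

```python
from datetime import date

POLICY_START = date(2016, 4, 1)  # assumption: "since April 2016" read as 1 April 2016
WINDOW_DAYS = 90                 # assumption: "3 months" approximated as 90 days


def in_policy_scope(publication_date: date) -> bool:
    """True if the paper was published on or after the assumed policy start."""
    return publication_date >= POLICY_START


def deposited_on_time(publication_date: date, deposit_date: date) -> bool:
    """True if the deposit time lag is within the assumed 90-day window."""
    lag_days = (deposit_date - publication_date).days
    return lag_days <= WINDOW_DAYS


# Hypothetical paper published 1 January 2017 and deposited 1 March 2017.
print(in_policy_scope(date(2017, 1, 1)))                      # True
print(deposited_on_time(date(2017, 1, 1), date(2017, 3, 1)))  # True
```

Deposits made before publication have a negative lag and therefore pass the check.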
Single vs any repository deposit time lag
1. Single repository deposit time lag
• Deposit time lag with respect to the publication's deposit date in a given repository
2. Any repository deposit time lag
• Deposit time lag with respect to the publication's earliest deposit date in any repository
Example: deposited in Repository 1 in 05/2017 and in Repository 2 in 09/2017
• Single repository deposit time lag for Repository 1 = 05/2017 – publication date
• Any repository deposit time lag for Repository 1 = min(05/2017, 09/2017) – publication date
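The two definitions can be illustrated with the example dates from the slide. This is a sketch only: the slide gives months, so the first day of each month and the publication date of 1 April 2017 are assumptions, as are the function names.

```python
from datetime import date


def single_repository_lag(publication_date, deposits, repository):
    """Lag in days relative to the deposit date in one given repository."""
    return (deposits[repository] - publication_date).days


def any_repository_lag(publication_date, deposits):
    """Lag in days relative to the earliest deposit date in any repository."""
    return (min(deposits.values()) - publication_date).days


publication_date = date(2017, 4, 1)  # hypothetical publication date
deposits = {
    "Repository 1": date(2017, 5, 1),  # 05/2017, day of month assumed
    "Repository 2": date(2017, 9, 1),  # 09/2017, day of month assumed
}

print(single_repository_lag(publication_date, deposits, "Repository 1"))  # 30
print(single_repository_lag(publication_date, deposits, "Repository 2"))  # 153
print(any_repository_lag(publication_date, deposits))                     # 30
```

The any repository lag is the same whichever repository is being evaluated, because it depends only on the earliest deposit.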
Results: UK REF compliance per year
Any repository deposit time lag
Results: Deposit time lag per repository
Full lines: Single repository deposit time lag
Dashed lines: Any repository deposit time lag
Results: Deposit time lag per year/country
Option 1: All data Option 2: Max deposit time lag limit
2014: UK introduces REF 2021 OA policy
Discussion
• Study assumption: if metadata is deposited, then the full text is also deposited
• Validation of full text deposits is complicated by the way OAI-PMH works
• Our study excludes publications that were never deposited
• To quantify missing deposits we would have to correctly match all CORE publications to their Crossref metadata
• Focus on deposit time lag rather than the proportion of missing deposits
• Matching between Crossref and CORE was done using metadata (titles, authors, publication years)
• Strict approach, results in high accuracy (~95.27%) but lower recall
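To make the trade-off of a strict matching approach concrete, here is one possible sketch of exact matching on normalised titles, author surnames and publication year. It is not the authors' procedure. The record layout (`id`, `title`, `authors` as "Surname, Given", `year`), the identifiers and all function names are assumptions made for illustration.

```python
import re
import unicodedata


def normalize(text: str) -> str:
    """Lowercase, strip accents and punctuation, collapse whitespace."""
    text = unicodedata.normalize("NFKD", text)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    text = re.sub(r"[^a-z0-9 ]+", " ", text.lower())
    return " ".join(text.split())


def match_key(record: dict) -> tuple:
    """Build a strict key: normalised title, set of surnames, publication year."""
    surnames = frozenset(normalize(name.split(",")[0]) for name in record["authors"])
    return (normalize(record["title"]), surnames, record["year"])


def match(crossref_records, core_records):
    """Pair records whose keys are identical; anything else stays unmatched."""
    index = {match_key(r): r for r in crossref_records}
    pairs = []
    for record in core_records:
        hit = index.get(match_key(record))
        if hit is not None:
            pairs.append((hit["id"], record["id"]))
    return pairs


# Hypothetical records; the identifiers are placeholders.
crossref = [{"id": "crossref-1", "title": "Do Authors Deposit on Time?",
             "authors": ["Herrmannova, Drahomira", "Pontika, Nancy", "Knoth, Petr"],
             "year": 2019}]
core = [
    {"id": "core-1", "title": "do authors deposit on time",
     "authors": ["Knoth, P.", "Pontika, N.", "Herrmannova, D."], "year": 2019},
    {"id": "core-2", "title": "Do Authors Deposit on Time?",
     "authors": ["Knoth, P."], "year": 2019},
]

print(match(crossref, core))  # [('crossref-1', 'core-1')]
```

Requiring all three fields to agree exactly makes false matches unlikely, but a record with an incomplete author list (like `core-2` here) is missed, which is how a strict approach lowers recall.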
Conclusions
• Time between publication and deposit decreased significantly in the 2013-2017 period globally
• By 472 days per country on average across all countries in our dataset
• After the introduction of the UK REF 2021 OA Policy, this decrease accelerated in the UK
• As of early 2018, UK publications are deposited immediately upon publication or even slightly before
• Key messages:
• Our observations support the argument for including a time-limited deposit requirement in OA policies
• Institutional practices play an important role in supporting OA policy adoption
Thank you!
Code: https://github.com/oacore/jcdl_2019
Data: https://doi.org/10.5281/zenodo.2605408

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 

Do Authors Deposit on Time? Tracking Open Access Policy Compliance

  • 1. Do Authors Deposit on Time? Tracking Open Access Policy Compliance. Drahomira Herrmannova, Nancy Pontika, Petr Knoth. June 4, 2019 – JCDL 2019, Urbana-Champaign, IL. Big Scientific Data and Text Analytics Group, Knowledge Media Institute, The Open University
  • 2. Introduction • Why we want Open Access (OA) • Taxpayers should be able to read publicly funded research • Help researchers at poorer institutions without access to subscriptions • Institutions suffer from rising journal subscription prices • Funders introduce policies to encourage OA • Notable examples: • U.S. Public Access Plan • U.S. NIH Public Access Policy • UK REF 2021 Open Access Policy • EC H2020 Open Access Policy 1/22
  • 3. Growing number of OA policies Source: http://roarmap.eprints.org/ Currently close to 1 thousand funder and institutional OA policies 2/22
  • 4. OA policies • Provide criteria for making papers OA • Requirements, such as: • Where should papers be made available (publication or deposit) • When should papers be deposited • What version should be deposited (e.g. pre-print vs. post-print) • Allowed embargo periods • Etc. 3/22
  • 5. Research questions What effect do OA policies have? 4/22
  • 6. Research questions • Piwowar et al. (2018): At least 28% of all research papers are OA • Lariviere and Sugimoto (2018): More than two thirds of papers from selected funders (with an OA policy) were OA • Gargouri et al. (2012): OA growth often due to retroactive self-archiving (often years after publication) 5/22
  • 8. Research questions When do authors deposit? Do they deposit in accordance with policies? 6/22
  • 9. Deposit time lag • What is deposit time lag? • The difference between date of publication and date of deposit in a repository expressed in days • We study deposit time lag across • Country • Time • Repository • Discipline 7/22
  • 14. Deposit time lag calculation • Deposit time lag = deposit date – publication date • The difference was expressed in days • Positive values: article deposited after publication • Negative values: article deposited prior to publication • Best: as low a value as possible 9/22
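The calculation on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the function name `deposit_time_lag` and the example dates are assumptions made here.

```python
from datetime import date


def deposit_time_lag(publication_date: date, deposit_date: date) -> int:
    """Return deposit date minus publication date, in days.

    Positive: deposited after publication.
    Negative: deposited prior to publication.
    """
    return (deposit_date - publication_date).days


# Hypothetical dates, for illustration only.
late = deposit_time_lag(date(2017, 3, 1), date(2017, 5, 30))
early = deposit_time_lag(date(2017, 3, 1), date(2017, 2, 19))
print(late, early)
```

Subtracting two `date` objects yields a `timedelta`, whose `days` attribute carries the sign, so no special handling is needed for deposits made before publication.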
  • 15. Dataset • 2013-2018 publications • Metadata from Crossref and CORE • Final dataset size: 808,984 publications, 728 repositories, 70 countries [Figure: year of publication distribution] 10/22
  • 16. Results: Deposit time lag per country 11/22
  • 17. Results: Deposit time lag per country/year • How has deposit time lag changed over time? • Average deposit time lag per year of publication? 12/22
  • 25. Results: Deposit time lag per country/year • Two options: 1. Use all data • Underestimates deposit time lag for all, but especially for newer publications 2. Put a maximum limit on deposit time lag for the analysis (for comparability) • E.g. deposit at most a year later • Underestimates deposit time lag for all, but especially for older publications [Figure: yearly deposits, toy example, 2011-2018, comparing 2013 publications and 2017 publications] 13/22
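The trade-off between the two options can be illustrated with a toy calculation. The sketch below is an assumption-laden example, not the study's analysis code: the records are made-up `(publication year, lag in days)` pairs, `mean_lag_per_year` is a hypothetical helper, and the one-year limit is taken to be 365 days.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical (publication year, deposit time lag in days) records.
records = [
    (2013, 30), (2013, 400), (2013, 1500),
    (2017, 20), (2017, 100), (2017, 300),
]


def mean_lag_per_year(rows, max_lag_days=None):
    """Average lag per publication year.

    If max_lag_days is given, only rows whose lag does not exceed it
    are kept (option 2); otherwise all rows are used (option 1).
    """
    by_year = defaultdict(list)
    for year, lag in rows:
        if max_lag_days is None or lag <= max_lag_days:
            by_year[year].append(lag)
    return {year: mean(lags) for year, lags in sorted(by_year.items())}


option_1 = mean_lag_per_year(records)                    # all data
option_2 = mean_lag_per_year(records, max_lag_days=365)  # limit of one year
print(option_1)
print(option_2)
```

In this toy data, the 2013 publications have had time to accumulate very late deposits that the 2017 publications cannot have yet, so option 1 compares unlike observation windows. Option 2 makes the years comparable, at the cost of discarding the late deposits altogether.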
  • 26. Results: Deposit time lag per year/country Option 1: All data Option 2: Max deposit time lag limit (1 yr) 14/22
  • 27. Results: Deposit time lag per subject Bars are not stacked, but overlaid 15/22
  • 28. REF 2021 OA Policy • In 2014, the UK introduced an OA Policy for its next research assessment exercise (REF) • Requirements • Deposit final manuscript in an OA repository • Deposit on publication/acceptance or within 3 months of it • Papers published since April 2016 • Sanction • The OA requirement is linked to performance review • Did the introduction of this mandatory policy affect deposit time lag in the UK compared to other countries? 16/22
  • 29. Single vs any repository deposit time lag 1. Single repository deposit time lag • Deposit time lag with respect to the publication's deposit date in a given repository 2. Any repository deposit time lag • Deposit time lag with respect to the publication's deposit date in any repository • Example: a publication deposited in Repository 1 in 05/2017 and in Repository 2 in 09/2017 • Single repository deposit time lag for Repository 1 = 05/2017 – publication date • Any repository deposit time lag for Repository 1 = min(05/2017, 09/2017) – publication date 17/22
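The two definitions can be sketched as follows. This is an illustrative example only: the function names are hypothetical, and because the slide gives deposit dates as month and year, the publication date and the days of the month used below are assumptions.

```python
from datetime import date

# Hypothetical dates for one publication deposited in two repositories.
publication_date = date(2017, 4, 1)
deposits = {
    "Repository 1": date(2017, 5, 15),
    "Repository 2": date(2017, 9, 15),
}


def single_repository_lag(publication_date, deposits, repository):
    """Lag relative to the deposit date in the given repository."""
    return (deposits[repository] - publication_date).days


def any_repository_lag(publication_date, deposits):
    """Lag relative to the earliest deposit date in any repository."""
    return (min(deposits.values()) - publication_date).days


lag_repo_1 = single_repository_lag(publication_date, deposits, "Repository 1")
lag_repo_2 = single_repository_lag(publication_date, deposits, "Repository 2")
lag_any = any_repository_lag(publication_date, deposits)
print(lag_repo_1, lag_repo_2, lag_any)
```

The any repository lag can never exceed a single repository lag for the same publication, since it always uses the earliest known deposit.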
  • 30. Results: UK REF compliance per year Any repository deposit time lag 18/22
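One way to turn deposit time lag into a per-year compliance rate is sketched below. It is a simplification under stated assumptions, not the study's method: "within 3 months" is approximated as 90 days, lag is measured from publication only (the policy also refers to acceptance), and the records and the helper `compliance_rate_per_year` are hypothetical.

```python
from collections import Counter

# Assumption: "within 3 months" approximated as 90 days.
COMPLIANCE_WINDOW_DAYS = 90

# Hypothetical (publication year, any repository lag in days) records.
records = [
    (2016, 10), (2016, 200),
    (2017, -5), (2017, 60), (2017, 95),
    (2018, 0),
]


def compliance_rate_per_year(rows, window_days=COMPLIANCE_WINDOW_DAYS):
    """Share of publications per year deposited within the window."""
    total, compliant = Counter(), Counter()
    for year, lag in rows:
        total[year] += 1
        if lag <= window_days:
            compliant[year] += 1
    return {year: compliant[year] / total[year] for year in sorted(total)}


rates = compliance_rate_per_year(records)
print(rates)
```

Deposits made before publication have a negative lag and therefore count as compliant under this rule.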
  • 31. Results: Deposit time lag per repository Full lines: Single repository deposit time lag Dashed lines: Any repository deposit time lag 19/22
  • 32. Results: Deposit time lag per year/country Option 1: All data Option 2: Max deposit time lag limit 2014: UK introduces REF 2021 OA policy 20/22
  • 35. Discussion • Study assumption: if metadata deposited, then the full text is also deposited • Validation of full text deposits complicated due to the way the OAI-PMH works • Our study excludes publications that were never deposited • To quantify missing deposits we would have to correctly match all CORE publications to their Crossref metadata • Focus on deposit time lag rather than the proportion of missing deposits • Matching between Crossref and CORE was done using metadata (titles, authors, publication years) • Strict approach, results in high accuracy (~95.27%) but lower recall 21/22
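A strict metadata match of the kind described can be sketched as an exact comparison of normalized keys. The slides do not give the actual matching procedure, so everything below is an assumption for illustration: the helpers `normalize` and `match_key`, the normalization rules, and the convention that the surname is the last token of each author name.

```python
import re
import unicodedata
from typing import Sequence


def normalize(text: str) -> str:
    """Lowercase, strip accents and punctuation, collapse whitespace."""
    text = unicodedata.normalize("NFKD", text)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    text = re.sub(r"[^a-z0-9 ]+", " ", text.lower())
    return " ".join(text.split())


def match_key(title: str, authors: Sequence[str], year: int) -> tuple:
    """Strict key: normalized title, sorted author surnames, year.

    Assumes "Given Family" name order and non-empty author names.
    """
    surnames = sorted(normalize(author).split()[-1] for author in authors)
    return (normalize(title), tuple(surnames), year)


# Two hypothetical records describing the same paper with minor differences.
crossref_key = match_key(
    "Do Authors Deposit on Time? Tracking Open Access Policy Compliance",
    ["Drahomira Herrmannova", "Nancy Pontika", "Petr Knoth"],
    2019,
)
core_key = match_key(
    "Do authors deposit on time?  Tracking open access policy compliance.",
    ["Petr Knoth", "Nancy Pontika", "Drahomira Herrmannová"],
    2019,
)
print(crossref_key == core_key)
```

Requiring all three fields to agree exactly after normalization keeps false matches rare but misses records with, for example, a differing publication year or a truncated title, which is consistent with the high accuracy and lower recall noted on the slide.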
  • 38. Conclusions • Time between publication and deposit has decreased significantly in the 2013-2017 period globally • By 472 days per country on average across all countries in our dataset • After introduction of the UK REF 2021 OA Policy this decrease in the UK has accelerated • As of early 2018, UK publications are deposited immediately upon publication or even slightly before • Key messages: • Our observations support the argument for the inclusion of a time-limited deposit requirement in OA policies • Institutional practices play an important role in supporting OA policy adoption 22/22
  • 39. Thank you! Code: https://github.com/oacore/jcdl_2019 Data: https://doi.org/10.5281/zenodo.2605408