Closing the scientific literature access gap with CORE - how to gain free access to millions of open access scientific papers

Dr Nancy Pontika
Open Access Aggregation Officer
twitter: @NancyPontika
October 21, 2020 – Kerala Library Association
Big Scientific Data and Text Analytics Group
Knowledge Media Institute, The Open University

OA aggregations and BOAI 2002
“To achieve open access to scholarly journal literature, we
recommend two complementary strategies.
• Self-Archiving: First, scholars need the tools and assistance to deposit
their refereed journal articles in open electronic archives, a practice
commonly called, self-archiving. When these archives conform to
standards created by the Open Archives Initiative, then search
engines and other tools can treat the separate archives as one. Users
then need not know which archives exist or where they are located in
order to find and make use of their contents.
• …”
Budapest Open Access Initiative, 2002
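
The Open Archives Initiative standard most commonly used for this purpose is OAI-PMH, where every archive answers the same request verbs. As a minimal sketch of how a tool can treat separate archives as one, the snippet below builds the same ListRecords request for several repositories. The repository endpoints are hypothetical placeholders, and the helper name is an assumption made for illustration.

```python
from urllib.parse import urlencode


def build_list_records_url(base_url, metadata_prefix="oai_dc", from_date=None, set_spec=None):
    """Build an OAI-PMH ListRecords request URL for one repository."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if from_date:
        params["from"] = from_date  # only records changed since this date
    if set_spec:
        params["set"] = set_spec  # optional selective harvesting by set
    return f"{base_url}?{urlencode(params)}"


# Hypothetical endpoints: real repositories publish their own OAI-PMH base URL.
endpoints = [
    "https://repo-a.example.org/oai",
    "https://repo-b.example.org/oai",
]

# The same request shape works for every archive, which is what lets an
# aggregator harvest many repositories with one piece of code.
urls = [build_list_records_url(e, from_date="2020-01-01") for e in endpoints]
for url in urls:
    print(url)
```

Because every conforming archive accepts the same parameters, adding a repository to the harvest is a matter of adding one base URL to the list.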

Global network of repositories
“A single scientific repository is of limited value, real benefits come
from the ability to exchange data within a network …
… interoperability allows us to exploit today's computational power
so that we can aggregate, data mine, create new tools and services,
and generate new knowledge from repository content.”
Confederation of Open Access Repositories (COAR)

CORE’s mission
Aggregate all open access research articles worldwide
Enrich this content and provide seamless access to it through a
set of data services

Introducing CORE
video url: https://core.ac.uk/about/#mission

World’s largest dataset of open access full texts
• 24,664,721 hosted full texts
• 24,936,921 links to full text
• 202,118,227 metadata records
• 146 countries
• 10,234 data providers
• 15TB of raw plain text

CORE processing pipeline
1. Metadata download, extraction and harmonisation
2. Full text download
3. Text extraction, section extraction
4. Metadata validation and enrichment (DOI, ORCID, etc.)
5. Thumbnail generation
6. References and citation contexts extraction
7. API enrichment (e.g. finding DOIs, linking to other systems)
8. Document type classification
9. Deduplication
10. Indexing
11. Exposing (data dumps, API, FastSync)
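
To make the deduplication step above more concrete, here is a minimal sketch of one possible approach: records are grouped by a normalised DOI, with a normalised title as the fallback when no DOI is present. The slides do not describe how CORE actually deduplicates, so the matching rule, the function names and the sample records are all assumptions made for illustration.

```python
import re


def normalise_doi(doi):
    """Lower-case a DOI and strip common prefixes such as 'doi:' or a resolver URL."""
    if not doi:
        return None
    doi = doi.strip().lower()
    doi = re.sub(r"^(https?://(dx\.)?doi\.org/|doi:)", "", doi)
    return doi or None


def dedup_key(record):
    """Prefer the DOI as the identity of a record; fall back to a normalised title."""
    doi = normalise_doi(record.get("doi"))
    if doi:
        return ("doi", doi)
    title = re.sub(r"[^a-z0-9]+", " ", record.get("title", "").lower()).strip()
    return ("title", title)


def deduplicate(records):
    """Keep the first record seen for each key, preserving input order."""
    seen = {}
    for record in records:
        seen.setdefault(dedup_key(record), record)
    return list(seen.values())


# Hypothetical records harvested from different data providers.
records = [
    {"doi": "https://doi.org/10.1000/XYZ123", "title": "An Example Paper"},
    {"doi": "doi:10.1000/xyz123", "title": "An example paper."},
    {"doi": None, "title": "A Thesis Without a DOI"},
    {"title": "A thesis without a DOI"},
]

unique = deduplicate(records)
print(len(unique))
```

A production system would likely need fuzzier matching (authors, year, full-text similarity), since exact title matches miss many real duplicates.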

CORE usage
Every day more than 1 million people access CORE papers

Alexa rank
Within top 2k global websites
URL: https://www.alexa.com/siteinfo/core.ac.uk

CORE Services
CONTENT DISCOVERY
Recommender
Discovery
MANAGING CONTENT
Repository Dashboard
Repository Edition
ACCESS TO RAW DATA
API
Dataset
FastSync
https://core.ac.uk/services/

Content Discovery
Video url: https://core.ac.uk/services/#content-discovery

CORE Search
• Full text search for OA content
• Faceted searching
• What you find is what you get
• Real change of data providers
wanting to be included
https://core.ac.uk

CORE Recommender
Recommending relevant
content to users from
across all free content
https://core.ac.uk/services/recommender/

CORE Recommender
Recommender plugin for
repositories
https://core.ac.uk/services/recommender/

CORE Discovery
• High coverage of freely
available content
• Best grip on open
repository content
• Repository integration
• Discovering documents
without a DOI
https://core.ac.uk/services/discovery/
https://chrome.google.com/webstore/detail/core-discovery/ockidfiihjhkngdalfnbeeepgfbmkmlh

CORE Discovery Repository integration
The majority of articles in repositories
are metadata only
CORE Discovery repository plugin:
• turns dead ends of user journeys
into journeys fulfilling users’
information needs
• makes repository content more
discoverable
https://core.ac.uk/services/discovery/

Managing Content: CORE Repository Dashboard
• Access content harvested
from the repository
• Enables content management
& take down requests
• Access to all detected
technical issues
• Statistics regarding the
repository content via IRUS-UK
https://core.ac.uk/services/repository-dashboard/
specifically designed for repository managers

CORE’s raw data services
Video url: https://core.ac.uk/services/#content-discovery

CORE API
• Real-time machine access to the
world's largest collection of open
access papers
• Harmonised access to data from
across the network of CORE
providers
• Direct machine access to full texts
of research papers
https://core.ac.uk/services/api/
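
As a rough illustration of what machine access could look like, the sketch below builds a search request using only the Python standard library. The base URL, the `/search/works` path, the `q` and `limit` parameters and the bearer-token header are assumptions based on the publicly documented v3 API and should be checked against the current documentation at the URL above; `YOUR_API_KEY` is a placeholder.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Assumed base URL of the CORE API; verify against the official documentation.
API_BASE = "https://api.core.ac.uk/v3"


def build_search_request(query, api_key, limit=10):
    """Prepare (but do not send) a search request for works matching the query."""
    url = f"{API_BASE}/search/works?{urlencode({'q': query, 'limit': limit})}"
    return Request(url, headers={"Authorization": f"Bearer {api_key}"})


def search(query, api_key, limit=10):
    """Send the request and decode the JSON response. Requires network access and a valid key."""
    with urlopen(build_search_request(query, api_key, limit)) as response:
        return json.load(response)


# Building the request needs no network access, so it is safe to inspect offline.
request = build_search_request("open access repositories", "YOUR_API_KEY", limit=5)
print(request.full_url)
```

Calling `search(...)` with a registered API key would then return the decoded JSON response for the query.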

CORE Dataset
• Download millions of research
papers for text and data analysis
• Prototype, analyse and mine your
data in your infrastructure
https://core.ac.uk/services/dataset/

CORE FastSync
• Keeps your data in sync with
research content from around the
world
• Fast and incremental updates as
soon as they become available. No
usage restrictions
• Based on ResourceSync
https://core.ac.uk/services/fastsync/
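
ResourceSync describes changes as sitemap-style XML documents in which each entry carries a `change` attribute (`created`, `updated` or `deleted`). The following is a minimal sketch of how a client could apply such a change list to a local copy. The sample document and its URLs are hypothetical and are not actual FastSync output; the function names are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# XML namespaces used by the Sitemap format and the ResourceSync extension.
NS = {
    "sm": "http://www.sitemaps.org/schemas/sitemap/0.9",
    "rs": "http://www.openarchives.org/rs/terms/",
}

# Hypothetical change list, shaped like a ResourceSync document.
SAMPLE_CHANGE_LIST = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
        xmlns:rs="http://www.openarchives.org/rs/terms/">
  <rs:md capability="changelist" from="2020-10-01T00:00:00Z"/>
  <url>
    <loc>https://example.org/papers/1.json</loc>
    <lastmod>2020-10-02T09:00:00Z</lastmod>
    <rs:md change="created"/>
  </url>
  <url>
    <loc>https://example.org/papers/2.json</loc>
    <lastmod>2020-10-03T12:30:00Z</lastmod>
    <rs:md change="updated"/>
  </url>
  <url>
    <loc>https://example.org/papers/3.json</loc>
    <lastmod>2020-10-04T08:15:00Z</lastmod>
    <rs:md change="deleted"/>
  </url>
</urlset>"""


def parse_change_list(xml_text):
    """Return one dict per changed resource: its location, timestamp and change type."""
    root = ET.fromstring(xml_text)
    changes = []
    for url in root.findall("sm:url", NS):
        md = url.find("rs:md", NS)
        changes.append({
            "loc": url.findtext("sm:loc", namespaces=NS),
            "lastmod": url.findtext("sm:lastmod", namespaces=NS),
            "change": md.get("change") if md is not None else None,
        })
    return changes


def apply_changes(local_copy, changes):
    """Update a {location: lastmod} mapping in place: drop deletions, record the rest."""
    for change in changes:
        if change["change"] == "deleted":
            local_copy.pop(change["loc"], None)
        else:
            local_copy[change["loc"]] = change["lastmod"]
    return local_copy


# A local copy that is out of date with respect to the change list.
local_copy = {
    "https://example.org/papers/2.json": "2020-09-01T00:00:00Z",
    "https://example.org/papers/3.json": "2020-09-01T00:00:00Z",
}
changes = parse_change_list(SAMPLE_CHANGE_LIST)
apply_changes(local_copy, changes)
print(sorted(local_copy))
```

Only the entries listed in the change list are touched, which is what makes incremental synchronisation cheaper than downloading a full data dump again.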

Working with partners
https://core.ac.uk/about/endorsements/

CORE ambassadors network
• community's feedback
• identify repositories
• post CORE news to local
venues
• offer advice
https://core.ac.uk/about/ambassadors/

CORE Ambassadors from India
1. Mayank Trivedi, University Librarian, Maharaja Sayajirao University of Baroda
2. Sarika Jain, Associate Professor, Amity University Uttar Pradesh
3. Shamprasad Pujar, Chief Librarian, Indira Gandhi Institute of Development Research
4. Faeem Ahmad, Librarian in charge, Indian Grain Storage Management and Research Institute
5. R. Sakthivel, Library, India
6. T. Ananth Kumar, Associate Professor, IFET College of Engineering
7. Shambulinga B. Jali, Chief Librarian, CMR University
8. K. Venugopal, Assistant Professor, Vivekanandha College of Arts and Sciences for Women
9. Munfar Kappil, Librarian, Nam College Kallikkandy, Kerala, India
10. Balraju Vattikulla, Library Assistant, URSC/ISRO
11. Piyush Mani Maurya, Lecturer, Zeal Institutes
12. Veerabhadra Swamy Pulletikurthi, Professor, Department of Management Studies
13. Abhilab Gupta, Student, University of Jammu
14. Kuldeep Pawar, Librarian, Arihant College of Arts, Commerce & Science, Pune, Maharashtra

Repositories harvested by CORE
153 repositories are from India – some examples
• Indian Institute of Astrophysics Repository
• Open Access Repository of IISc Research Publications
• Etheses - A Saurashtra University Library Service
• National Aerospace Laboratories Institutional Repository
• Dspace @ Vidyasagar University
• Osmania University Digital Library [OUDL]
• DSpace at Indian Institute of Management Kozhikode
• National Science Digital Library
• ePrints@Bangalore University
• Institutional Repository of Intellectual Contributions of Delhi Technological University
• Ministry of Earth Sciences, Government of India

Add your repository in the CORE collection
https://core.ac.uk/data-providers
