Presented during the International Open Access Week 2020 for the Kerala Library Association, October 21, 2020.
The presentation is about CORE, a global harvester of open access scientific content and the CORE services on content discovery, managing content and access to raw data.
A Critique of the Proposed National Education Policy Reform
Closing the scientific literature access gap with CORE - how to gain free access to millions of open access scientific papers
1. Closing the scientific literature access gap
with CORE - how to gain free access to
millions of open access scientific papers
Dr Nancy Pontika
Open Access Aggregation Officer
twitter: @NancyPontika
October 21, 2020 – Kerala Library Association
Big Scientific Data and Text Analytics Group
Knowledge Media Institute, The Open University
2. OA aggregations and BOAI 2002
“To achieve open access to scholarly journal literature, we
recommend two complementary strategies.
• Self-Archiving: First, scholars need the tools and assistance to deposit
their refereed journal articles in open electronic archives, a practice
commonly called, self-archiving. When these archives conform to
standards created by the Open Archives Initiative, then search
engines and other tools can treat the separate archives as one. Users
then need not know which archives exist or where they are located in
order to find and make use of their contents.
• …”
Budapest Open Access Initiative, 2002
3. Global network of repositories
“A single scientific repository is of limited value, real benefits come
from the ability to exchange data within a network …
… interoperability allows us to exploit today's computational power
so that we can aggregate, data mine, create new tools and services,
and generate new knowledge from repository content.”
Confederation of Open Access Repositories (COAR)
4. CORE’s mission
Aggregate all open access research articles worldwide
Enrich this content and provide seamless access to it through a
set of data services
6. World’s largest dataset of open access
full texts
24,664,721
hosted
full text
24,936,921
links
to full text
•202,118,227
metadata
records
146
countries
10,234
data
providers
15TB of raw
plain text
7. CORE processing pipeline
1. Metadata download, extraction and harmonisation
2. Full text download
3. Text extractions, sections extraction
4. Metadata validation and enrichment (DOI, ORCID, etc.)
5. Thumbnails generation
6. References and citation contexts extraction
7. API enrichment (e.g. finding DOIs, linking to other systems)
8. Document type classification
9. Deduplication
10.Indexing
11.Exposing (data dumps, API, FastSync)
12. CORE Search
11
• Full text search for OA content
• Faceted searching
• What you find is what you get
• Real change of data providers
wanting to be included
https://core.ac.uk
15. CORE Discovery
• High coverage of freely
available content
• Best grip on open
repository content
• Repository integration
• Discovering documents
without a DOI
https://core.ac.uk/services/discovery/
https://chrome.google.com/webstore/detail/core-discovery/ockidfiihjhkngdalfnbeeepgfbmkmlh
17. CORE Discovery Repository integration
Majority of articles in repositories
metadata only
CORE Discovery repository plugin:
• turns dead ends of user journeys
into journeys fulfilling users’
information needs
• makes repository content more
discoverable
https://core.ac.uk/services/discovery/
18. Managing Content : CORE Repository Dashboard
• Access content harvested
from the repository
• Enables content management
& take down requests
• Access to all detected
technical issues
• Statistics regarding the
repository content via IRUS-UK
https://core.ac.uk/services/repository-dashboard/
specifically designed for repository managers
19. CORE’s raw data services
Video url: https://core.ac.uk/services/#content-discovery
20. CORE API
• Real-time machine access to the
world's largest collection of open
access papers
• Harmonised access to data from
across the network of CORE
provider
• Direct machine access to full texts
of research papers
https://core.ac.uk/services/api/
21. CORE Dataset
• Download millions of research
papers for text and data analysis
• Prototype, analyse and mine your
data in your infrastructure
https://core.ac.uk/services/dataset/
22. CORE FastSync
• Keeps your data in sync with
research content from around the
world
• Fast and incremental updates as
soon as they become available. No
usage restrictions
• Based on ResourceSync
https://core.ac.uk/services/fastsync/
24. CORE ambassadors network
• community's feedback
• identify repositories
• post CORE news to local
venues
• offer advice
https://core.ac.uk/about/ambassadors/
25. CORE Ambassadors from India
1. Mayank Trivedi, University Librarian, Maharaja Sayajirao University of Baroda
2. Sarika Jain, Associate Professor, Amity University Uttar Pradesh
3. Shamprasad Pujar, Chief Librarian, Indira Gandhi Institute of Development Research
4. Faeem Ahmad, Librarian in charge, Indian Grain Storage Mangement and Research Institute
5. R. Sakthivel, Library, India
6. T. Ananth Kumar, Associate Professor, IFET College of Engineering
7. Shambulinga B. Jali, Chief Librarian, CMR University
8. K. Venugopal, Assistant Professor, Vivekanandha College of Arts and Sciences for Women
9. Munfar Kappil, Librarian, Nam College Kallikkandy, Kaerala, India
10. Balraju Vattikulla, Library Assistant, URSC/ISRO
11. Piyush Mani Maurya, Lecturer, Zeal Institutes
12. Veerabhadra Swamy Pulletikurthi, Professor, Department of Management Studies
13. Abhilab Gupta, Student, University of Jammu
14. Kuldeep Pawar, Librarian, Arihant College of Arts, Commerce & Science, Pune, Maharashtra
26. Repositories harvested by CORE
153 repositories are from India – some examples
• Indian Institute of Astrophysics Repository
• Open Access Repository of IISc Research Publications
• Etheses - A Saurashtra University Library Service
• National Aerospace Laboratories Institutional Repository
• Dspace @ Vidyasagar University
• Osmania University Digital Library [OUDL]
• DSpace at Indian Institute of Management Kozhikode
• National Science Digital Library
• ePrints@Bangalore University
• Institutional Repository of Intellectual Contributions of Delhi Technological University
• National Science Digital Library
• Ministry of Earth Sciences, Government of India
27. Add your repository in the CORE collection
https://core.ac.uk/data-providers
Highest coverage of freely available content. Our tests have shown CORE Discovery finding more free content than any other discovery system.
Free service for researchers by researchers. CORE Discovery is the only free content discovery extension developed by researchers for researchers. There is no major publisher or enterprise controlling and profiting from your usage data.
Best grip on open repository content. Due to CORE being a leader in harvesting open access literature, CORE Discovery has the best grip on open content from open repositories as opposed to other services that disproportionately focus only on content indexed in major commercial databases.
Repository integration and discovering documents without a DOI. The only service offering seamless and free integration into repositories. CORE Discovery is also the only discovery system that can locate scientific content even for items with an unknown DOI or which do not have a DOI.
Open access discovery tools locate freely available copies of research papers which might be behind the paywall
K[.*]io