1. Finding out about the preservation of e-journals: an overview of the PEPRS project and the Keepers Registry Beta service Fred Guy, EDINA, University of Edinburgh British Library, Boston Spa 13 th October 2011 on behalf of the Project Team – EDINA/ISSN-IC
17. The Agencies – the HathiTrust HathiTrust is a partnership of major research institutions and libraries working to ensure that the cultural record is preserved and accessible long into the future. There are more than fifty partners in HathiTrust, and membership is open to institutions worldwide. HathiTrust is a partnership of major research institutions and libraries working to ensure that the cultural record is preserved and accessible long into the future. There are more than fifty partners in HathiTrust, and membership is open to institutions worldwide. HathiTrust is a partnership of major research institutions and libraries working to ensure that the cultural record is preserved and accessible long into the future. There are more than fifty partners in HathiTrust, and membership is open to institutions worldwide. HathiTrust is a partnership of major research institutions and libraries working to ensure that the cultural record is preserved and accessible long into the future. There are more than fifty partners in HathiTrust, and membership is open to institutions worldwide. HathiTrust is a partnership of major research institutions and libraries working to ensure that the cultural record is preserved and accessible long into the future. There are more than fifty partners in HathiTrust, and membership is open to institutions worldwide.
18. What is in the vaults? http://www.flickr.com/photos/wka/4283285201 / http:// www.flickr.com/photos/mcfull/421644442/sizes/s/in/photostream /
20. Creating the database Agency data ISSN Register ISSNs The Keepers Registry ISSN-L + p-ISSN & e-ISSN Register metadata Agency metadata
21. Open Source components used in the Keepers Registry Abstract Perl API supporting search and retrieval. Based on YAZ toolkit. ZOOM http://zoom.z3950.org/api/ Z39.50 support in Perl Each preservation agency supplies custom data at the moment, so scripts will be created for each data source. ISSN data is in MARC21 format and will be processed using MARC::Record CPAN package Custom Perl and CPAN packages including MARC::Record http://search.cpan.org/~gmcharlt/MARC-Record-2.0.2/ Normalisation Data files will be collected using FTP and HTTP. Custom Perl and CPAN packages Harvester Provides structured text indexing and retrieval. Fast and scales well. Provides powerful and flexible text retrieval capabilities. Zebra http:// www.indexdata.dk /zebra/ Database: metadata hosted by the Keepers Registry Offers fast and easy development and is extremely flexible Apache::ASP http://www.apache-asp.org / User interface Comment Software choice Component
35. Phase 2: key stages Business Planning Establishment of Advisory Board 0.4 0.3 0.2 0.1 Software releases Full service operation The Keepers Registry Beta service start PEPRS Beta service start and end Dec-12 Aug-12 May-11 Apr-12 Feb-11 Dec-11 Oct-11 Aug-11 Apr-11 Dec-10 Aug-10 ACTIVITY Phase 2 start and end PEPRS Development activity
Going to talk about a metadata project which provides a key role in providing information for librarians and collection managers about the archiving arrangements for e-journals
This is the old scenario where users were faced with row upon row of printed journals.
Now users are inclined to access this information via computers although not quite like these ones!
All the statistics point to increased e-journal publication, expenditure and usage. RIN has done a lot of work to quantify the situation. Chart one shows availability by discipline and clearly it is no great surprise to learn that the sciences have a very high %. The second chart shows that the bigger publishers have moved into online in a bigger way than the smaller publishers but it is increasing no matter the size and more recent statistics would no doubt show significant increases. The third chart shows the increase in usage. There was a 23% increase in downloading between 2005/6 – 2006/07 and a 19% increase 2006-7. The increase is greater for the Scottish Higher Education Digital Libraries (SHEDL). 19.58% 2007-8 and 41.2% 2009-9.
Print aspects. Essentially under library control.
Essential aspect is that it is not under the control of the library as is the case with print.
Schemes have emerged to provide solutions for libraries but there is a key issue in trying to obtain a coherent overall view.
This is the key report on which PEPRS/theKeepers Registry is based.
There is a lot of background literature but the key report is that prepared by Rigtscm and Loughborough University. Essentially PEPRS has evolved from the findings in the report.
There are about 16 member libraries in the UKLOCKSS Alliance
Just under 50 participating publishers
12k titles preserved
How do we find what is in the vaults but more critically how can we avoid having to look into each vault separately?
This is a snow drop showing that the Keepers Registry is essentially an aggregation of metadata from a number of suppliers or as they are called in the KR context archiving agencies.
The key components. Essentially the KR is based upon metadata from the different participating agencies associated with authoritative metadata from the ISSN Register.
These are the open source components used in the KR.
Demonstration of the beta service.
The Home page. Search Box, list of agencies with last update date, project news and tabs for journal title, publisher and FAQ and Help
Entering an ISSN. (Note that the p-ISSN or the e-ISSN can be entered. The KR uses the ISSN-L which links the two).
The results screen. In this case there is only a single record.
Search results when there are a few possibilities.
Full record display
Full record.
Full record display showing first of all that there are four different agencies involved and secondly the status codes. Only using two at present: Preserved and In process.
HathiTrust summary information.
HathiTrust extent information.
Browsing journals.
Browsing publishers.
Some information about Phase 2.
Key stages for Phase 2.
ISSN issues which have been discovered.
Extent information. A variety of practices
Different terms used. We had planned to make progress with standardising the vocabulary used but haven’t really managed to take this too far.
PAP has been in contact with us and they are very keen to be able to use any API which we develop.
It was always the intention to add additional agencies. We have added the HathiTrust to the original 5 and will be adding the data from NSLC in the relatively near future. We have received the data and are assessing and analysing it at the moment. Should be adding it within the next couple of months. Have drawn up a document on inclusion criteria and have received some feedback on it and will be seeking more and then issuing it to the potential candidate agencies.