This document discusses using library circulation data to surface underused "long tail" library materials through search. Specifically:
1) Circulation data from research libraries could be aggregated to power discovery tools that recommend related materials based on usage, helping researchers find relevant sources in unique collections.
2) An initial pilot would test this approach using data from the John Rylands University Library to examine relationships between borrowing frequency and relevance.
3) Broader adoption could surface hidden collections, provide new discovery routes based on context over classification, and enable serendipitous research - but issues around data integration, ownership, and sustainability would need addressing.
3. And also… how sustainable would an API-based national shared service be?Can such a service support users and also library workflows such as collections management?RLUK, M25, Leeds University, Cambridge University, Sussex University.
4. Mimas delivers several key JISC national library & bibliographic services Copac Archives Hub Zetoc Journals Usage Stats Portal
5. --Aggregation of 50+ research & specialist libraries--50 million records +--1 million search sessions per month--Primary use case – locating long tail materials
6. --John Rylands University Library: --1.3 million bib records--600,000 search sessions per month--23% of records unique (cross checked against WorldCat)--40,000 students10 years of circulation data
13. market research reveals these users as… Centrifugal searchers ‘Berry-pickers’ from various trails Quite isolated and prone to pit-falls
14. And increasingly they just don’t ask librarians…They ask their tutors and each other where to look…
15. Researchers are suspicious about UGC, especially ratings & reviews, but…. they could see the immediate benefit of‘tacit’ recommender functions….
16.
17.
18.
19. What if? this represented a national aggregation of data gathered from the usage activity of these researchers, collected as they worked with a national aggregation of unique or rare research collections?
21. What can this mean? Surfacing and increasing usage of hidden collections ( & demonstrating value) Providing new routes to discovery based on use and disciplinary contexts (not traditional classification). Powering ‘centrifugal searching’ and discovery through serendipity Enabling new, original research – academic excellence…
22.
23. Next steps… Implement in JRUL OPAC Test hypothesis with academic users Share lessons & consider possible next steps with additional contributors Collect feedback from collections managers – useful for collections development & assessment? Analyse issues for sustainability
28. Can we make the data work harder to solve other shared problems?
29. Issues for sustainability Is there a clear-cut case for a national shared service here? Data model: data out = easy data in = not so much Licensing & Attribution: collective ownership of a collective pot? Is proof of our hypothesis key to sustainability?
Relatively plain sailing ‘til now – followed the guidelines of Dave Pattern to extract the data and create the appropriate algorithms for the recommenderThis is pulling from the API (which is not public yet, but will be)
Building on the work of MOSAIC, the SALT project will focus on a different use case where the barriers encountered by MOSAIC can be overcome. Academic researchers in the humanities make up a vast proportion of the institutional library OPAC usage. Much of the work of these researchers is monograph-based, and recent findings from Mimas and RIN indicate that a high level of postgraduate research centres on the use of unique or rare items held across the UK. These are the collections that make up the ‘Long Tail’ of UK research library collections. SALT will test the ‘long tail’ hypothesis in relation to advanced academic users of long tail collections held in UK research libraries which hold some of the richest heritage collections in the world. We will investigate how issues of relevance and frequency of borrowing might shift within the particular use case of humanities research, where low level of borrowing of rare or niche items may not necessarily equate with lower relevance to end users whose search behaviour is typically centrifugal and exploratory. We will look at the relationship between key critical texts and commonly read humanities secondary or primary research monographs that might occupy the ‘head’ of frequently borrowed items and follow activity trails to explore the relevancy of long tail, lesser borrowed, lesser known niche items.