SlideShare una empresa de Scribd logo
1 de 38
Descargar para leer sin conexión
•
•
•
•
•
•
•
•
•
•
••
•
•
•
•
•
•
•
•
•
•



•
•
•
•
•
•
•
Connecting the dots: Reader ratings, bibliographic data, and machine-learning algorithms for monograph selection
Connecting the dots: Reader ratings, bibliographic data, and machine-learning algorithms for monograph selection
Connecting the dots: Reader ratings, bibliographic data, and machine-learning algorithms for monograph selection

Más contenido relacionado

Más de NASIG

Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...
Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...
Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...NASIG
 
Bloomsbury digital resources
Bloomsbury digital resourcesBloomsbury digital resources
Bloomsbury digital resourcesNASIG
 
Web accessibility in the institutional repository crafting user centered sub...
Web accessibility in the institutional repository  crafting user centered sub...Web accessibility in the institutional repository  crafting user centered sub...
Web accessibility in the institutional repository crafting user centered sub...NASIG
 
Linked Data at Smithsonian Libraries
Linked Data at Smithsonian Libraries Linked Data at Smithsonian Libraries
Linked Data at Smithsonian Libraries NASIG
 
Walk this way: Online content platform migration experiences and collaboration
Walk this way: Online content platform migration experiences and collaboration Walk this way: Online content platform migration experiences and collaboration
Walk this way: Online content platform migration experiences and collaboration NASIG
 
Read & Publish – What It Takes to Implement a Seamless Model?
Read & Publish – What It Takes to Implement a Seamless Model?Read & Publish – What It Takes to Implement a Seamless Model?
Read & Publish – What It Takes to Implement a Seamless Model?NASIG
 
Mapping Domain Knowledge for Leading and Managing Change
Mapping Domain Knowledge for Leading and Managing ChangeMapping Domain Knowledge for Leading and Managing Change
Mapping Domain Knowledge for Leading and Managing ChangeNASIG
 
When to hold them when to fold them: reassessing big deals in 2020
When to hold them when to fold them: reassessing big deals in 2020When to hold them when to fold them: reassessing big deals in 2020
When to hold them when to fold them: reassessing big deals in 2020NASIG
 
Getting on the Same Page: Aligning ERM and LIbGuides Content
Getting on the Same Page: Aligning ERM and LIbGuides ContentGetting on the Same Page: Aligning ERM and LIbGuides Content
Getting on the Same Page: Aligning ERM and LIbGuides ContentNASIG
 
A multi-institutional model for advancing open access journals and reclaiming...
A multi-institutional model for advancing open access journals and reclaiming...A multi-institutional model for advancing open access journals and reclaiming...
A multi-institutional model for advancing open access journals and reclaiming...NASIG
 
Knowledge Bases: The Heart of Resource Management
Knowledge Bases: The Heart of Resource ManagementKnowledge Bases: The Heart of Resource Management
Knowledge Bases: The Heart of Resource ManagementNASIG
 
Practical approaches to linked data
Practical approaches to linked dataPractical approaches to linked data
Practical approaches to linked dataNASIG
 
Transforming library collections and supporting student learning with collect...
Transforming library collections and supporting student learning with collect...Transforming library collections and supporting student learning with collect...
Transforming library collections and supporting student learning with collect...NASIG
 
The Power of Cross-unit Data Sharing: Nontraditional Uses for ILLiad
The Power of Cross-unit Data Sharing: Nontraditional Uses for ILLiadThe Power of Cross-unit Data Sharing: Nontraditional Uses for ILLiad
The Power of Cross-unit Data Sharing: Nontraditional Uses for ILLiadNASIG
 
Bridging the Public Services and Technical Services Divide: Hosting a Librari...
Bridging the Public Services and Technical Services Divide: Hosting a Librari...Bridging the Public Services and Technical Services Divide: Hosting a Librari...
Bridging the Public Services and Technical Services Divide: Hosting a Librari...NASIG
 
Where do we keep that? The new Keepers Registry and the digital content in yo...
Where do we keep that? The new Keepers Registry and the digital content in yo...Where do we keep that? The new Keepers Registry and the digital content in yo...
Where do we keep that? The new Keepers Registry and the digital content in yo...NASIG
 
Exploring Perpetual Access
Exploring Perpetual AccessExploring Perpetual Access
Exploring Perpetual AccessNASIG
 
How do you start a Scholarly Communications program?
How do you start a Scholarly Communications program? How do you start a Scholarly Communications program?
How do you start a Scholarly Communications program? NASIG
 
Usability Beyond the Home Page: Bringing Usability into the Technical Service...
Usability Beyond the Home Page: Bringing Usability into the Technical Service...Usability Beyond the Home Page: Bringing Usability into the Technical Service...
Usability Beyond the Home Page: Bringing Usability into the Technical Service...NASIG
 
Trial by Fire and Then Some for Electronic Resources: Connecting the Communit...
Trial by Fire and Then Some for Electronic Resources: Connecting the Communit...Trial by Fire and Then Some for Electronic Resources: Connecting the Communit...
Trial by Fire and Then Some for Electronic Resources: Connecting the Communit...NASIG
 

Más de NASIG (20)

Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...
Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...
Full Text Coverage Ratios: A Simple Method of Article-Level Collections Analy...
 
Bloomsbury digital resources
Bloomsbury digital resourcesBloomsbury digital resources
Bloomsbury digital resources
 
Web accessibility in the institutional repository crafting user centered sub...
Web accessibility in the institutional repository  crafting user centered sub...Web accessibility in the institutional repository  crafting user centered sub...
Web accessibility in the institutional repository crafting user centered sub...
 
Linked Data at Smithsonian Libraries
Linked Data at Smithsonian Libraries Linked Data at Smithsonian Libraries
Linked Data at Smithsonian Libraries
 
Walk this way: Online content platform migration experiences and collaboration
Walk this way: Online content platform migration experiences and collaboration Walk this way: Online content platform migration experiences and collaboration
Walk this way: Online content platform migration experiences and collaboration
 
Read & Publish – What It Takes to Implement a Seamless Model?
Read & Publish – What It Takes to Implement a Seamless Model?Read & Publish – What It Takes to Implement a Seamless Model?
Read & Publish – What It Takes to Implement a Seamless Model?
 
Mapping Domain Knowledge for Leading and Managing Change
Mapping Domain Knowledge for Leading and Managing ChangeMapping Domain Knowledge for Leading and Managing Change
Mapping Domain Knowledge for Leading and Managing Change
 
When to hold them when to fold them: reassessing big deals in 2020
When to hold them when to fold them: reassessing big deals in 2020When to hold them when to fold them: reassessing big deals in 2020
When to hold them when to fold them: reassessing big deals in 2020
 
Getting on the Same Page: Aligning ERM and LIbGuides Content
Getting on the Same Page: Aligning ERM and LIbGuides ContentGetting on the Same Page: Aligning ERM and LIbGuides Content
Getting on the Same Page: Aligning ERM and LIbGuides Content
 
A multi-institutional model for advancing open access journals and reclaiming...
A multi-institutional model for advancing open access journals and reclaiming...A multi-institutional model for advancing open access journals and reclaiming...
A multi-institutional model for advancing open access journals and reclaiming...
 
Knowledge Bases: The Heart of Resource Management
Knowledge Bases: The Heart of Resource ManagementKnowledge Bases: The Heart of Resource Management
Knowledge Bases: The Heart of Resource Management
 
Practical approaches to linked data
Practical approaches to linked dataPractical approaches to linked data
Practical approaches to linked data
 
Transforming library collections and supporting student learning with collect...
Transforming library collections and supporting student learning with collect...Transforming library collections and supporting student learning with collect...
Transforming library collections and supporting student learning with collect...
 
The Power of Cross-unit Data Sharing: Nontraditional Uses for ILLiad
The Power of Cross-unit Data Sharing: Nontraditional Uses for ILLiadThe Power of Cross-unit Data Sharing: Nontraditional Uses for ILLiad
The Power of Cross-unit Data Sharing: Nontraditional Uses for ILLiad
 
Bridging the Public Services and Technical Services Divide: Hosting a Librari...
Bridging the Public Services and Technical Services Divide: Hosting a Librari...Bridging the Public Services and Technical Services Divide: Hosting a Librari...
Bridging the Public Services and Technical Services Divide: Hosting a Librari...
 
Where do we keep that? The new Keepers Registry and the digital content in yo...
Where do we keep that? The new Keepers Registry and the digital content in yo...Where do we keep that? The new Keepers Registry and the digital content in yo...
Where do we keep that? The new Keepers Registry and the digital content in yo...
 
Exploring Perpetual Access
Exploring Perpetual AccessExploring Perpetual Access
Exploring Perpetual Access
 
How do you start a Scholarly Communications program?
How do you start a Scholarly Communications program? How do you start a Scholarly Communications program?
How do you start a Scholarly Communications program?
 
Usability Beyond the Home Page: Bringing Usability into the Technical Service...
Usability Beyond the Home Page: Bringing Usability into the Technical Service...Usability Beyond the Home Page: Bringing Usability into the Technical Service...
Usability Beyond the Home Page: Bringing Usability into the Technical Service...
 
Trial by Fire and Then Some for Electronic Resources: Connecting the Communit...
Trial by Fire and Then Some for Electronic Resources: Connecting the Communit...Trial by Fire and Then Some for Electronic Resources: Connecting the Communit...
Trial by Fire and Then Some for Electronic Resources: Connecting the Communit...
 

Último

31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...Nguyen Thanh Tu Collection
 
6 ways Samsung’s Interactive Display powered by Android changes the classroom
6 ways Samsung’s Interactive Display powered by Android changes the classroom6 ways Samsung’s Interactive Display powered by Android changes the classroom
6 ways Samsung’s Interactive Display powered by Android changes the classroomSamsung Business USA
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...Nguyen Thanh Tu Collection
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationdeepaannamalai16
 
DBMSArchitecture_QueryProcessingandOptimization.pdf
DBMSArchitecture_QueryProcessingandOptimization.pdfDBMSArchitecture_QueryProcessingandOptimization.pdf
DBMSArchitecture_QueryProcessingandOptimization.pdfChristalin Nelson
 
Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research DiscourseAnita GoswamiGiri
 
Satirical Depths - A Study of Gabriel Okara's Poem - 'You Laughed and Laughed...
Satirical Depths - A Study of Gabriel Okara's Poem - 'You Laughed and Laughed...Satirical Depths - A Study of Gabriel Okara's Poem - 'You Laughed and Laughed...
Satirical Depths - A Study of Gabriel Okara's Poem - 'You Laughed and Laughed...HetalPathak10
 
Employablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptxEmployablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptxryandux83rd
 
Shark introduction Morphology and its behaviour characteristics
Shark introduction Morphology and its behaviour characteristicsShark introduction Morphology and its behaviour characteristics
Shark introduction Morphology and its behaviour characteristicsArubSultan
 
ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6Vanessa Camilleri
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxkarenfajardo43
 
Sulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their usesSulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their usesVijayaLaxmi84
 
Comparative Literature in India by Amiya dev.pptx
Comparative Literature in India by Amiya dev.pptxComparative Literature in India by Amiya dev.pptx
Comparative Literature in India by Amiya dev.pptxAvaniJani1
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptxDhatriParmar
 
The role of Geography in climate education: science and active citizenship
The role of Geography in climate education: science and active citizenshipThe role of Geography in climate education: science and active citizenship
The role of Geography in climate education: science and active citizenshipKarl Donert
 
4.9.24 Social Capital and Social Exclusion.pptx
4.9.24 Social Capital and Social Exclusion.pptx4.9.24 Social Capital and Social Exclusion.pptx
4.9.24 Social Capital and Social Exclusion.pptxmary850239
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWQuiz Club NITW
 

Último (20)

31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
 
6 ways Samsung’s Interactive Display powered by Android changes the classroom
6 ways Samsung’s Interactive Display powered by Android changes the classroom6 ways Samsung’s Interactive Display powered by Android changes the classroom
6 ways Samsung’s Interactive Display powered by Android changes the classroom
 
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
BÀI TẬP BỔ TRỢ TIẾNG ANH 8 - I-LEARN SMART WORLD - CẢ NĂM - CÓ FILE NGHE (BẢN...
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentation
 
DBMSArchitecture_QueryProcessingandOptimization.pdf
DBMSArchitecture_QueryProcessingandOptimization.pdfDBMSArchitecture_QueryProcessingandOptimization.pdf
DBMSArchitecture_QueryProcessingandOptimization.pdf
 
Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research Discourse
 
Satirical Depths - A Study of Gabriel Okara's Poem - 'You Laughed and Laughed...
Satirical Depths - A Study of Gabriel Okara's Poem - 'You Laughed and Laughed...Satirical Depths - A Study of Gabriel Okara's Poem - 'You Laughed and Laughed...
Satirical Depths - A Study of Gabriel Okara's Poem - 'You Laughed and Laughed...
 
Employablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptxEmployablity presentation and Future Career Plan.pptx
Employablity presentation and Future Career Plan.pptx
 
Shark introduction Morphology and its behaviour characteristics
Shark introduction Morphology and its behaviour characteristicsShark introduction Morphology and its behaviour characteristics
Shark introduction Morphology and its behaviour characteristics
 
ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6ICS 2208 Lecture Slide Notes for Topic 6
ICS 2208 Lecture Slide Notes for Topic 6
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
 
Sulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their usesSulphonamides, mechanisms and their uses
Sulphonamides, mechanisms and their uses
 
Comparative Literature in India by Amiya dev.pptx
Comparative Literature in India by Amiya dev.pptxComparative Literature in India by Amiya dev.pptx
Comparative Literature in India by Amiya dev.pptx
 
Paradigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTAParadigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTA
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
 
The role of Geography in climate education: science and active citizenship
The role of Geography in climate education: science and active citizenshipThe role of Geography in climate education: science and active citizenship
The role of Geography in climate education: science and active citizenship
 
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of EngineeringFaculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
 
Introduction to Research ,Need for research, Need for design of Experiments, ...
Introduction to Research ,Need for research, Need for design of Experiments, ...Introduction to Research ,Need for research, Need for design of Experiments, ...
Introduction to Research ,Need for research, Need for design of Experiments, ...
 
4.9.24 Social Capital and Social Exclusion.pptx
4.9.24 Social Capital and Social Exclusion.pptx4.9.24 Social Capital and Social Exclusion.pptx
4.9.24 Social Capital and Social Exclusion.pptx
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITW
 

Connecting the dots: Reader ratings, bibliographic data, and machine-learning algorithms for monograph selection

Notas del editor

  1. Goodreads ratings, metadata from oclc, python
  2. Briefly talk about the evolution of data, concepts of RS
  3. chronologically or geologically organized. Get all articles from 2007, manageable
  4. find books on a subject, by Wheeler—easy where he isn’t the first author—not that easy
  5. With the rapidly increasing generation of data, users experience information overload and face difficulties in navigating and processing the information available online. Human’s capacity to find information advances more slowly than the pace at which new information is made available. More recently, terms such as big data, machine learning, and artificial intelligence no longer appear only in research papers. They appear so often in popular media that the general public have some familiarity with the terms. Big data are high-volume, high velocity, and high variety information assets that require new forms of processing to enable enhanced decision making, insight discovery and process optimization.
  6. Different library tools generate tons of library big data, mainly catalogue and transactional data. Catalogue data provides metadata information, while transactional data such as circulation statistics offers insights on user behaviors. There exists plenty of opportunities to analyze library big data. The rapid increase in library data offers innovative ways to understand interactions with users in the library environment. Analytics of library big data support library innovations, personalized recommendation services, library user behavior analysis
  7. ML is the study of algorithms and statistical models that computer systems use to effectively perform a specific task without using explicit instructions, relying on patterns and inference instead. As this recent article says,…Giving enough data, we can train the computer to recognize patterns. It is very powerful. RS is the result of ML of metadata and user behavior data.
  8. More and more businesses seek automated methods to deliver results relevant to users with the development of machine learning. Recommender systems “analyze user profiles, content items, and the connections between them, and try to predict future user behavior”  It has been adopted by many major e-commerce businesses such as Amazon, Netflix, and Expedia, and has been widely implemented to predict product and media recommendations, making it a key factor in increasing product average order value and the number of items per order. Recommender systems lead to improved browser-to-buyer rates, extended cross selling opportunities and increased customer loyalty, while also reducing the time and effort spent by consumers on searching Recommender systems are widely used to suggest contacts, or activities on social media platforms, and to improve targeted ads by the advertising industry Practitioners can also develop new marketing strategies by integrating users’ current situations and future needs by offering contextually relevant socialized recommendations
  9. 35% of the purchases on Amazon are the result of their recommender system, according to McKinsey. During the Chinese global shopping festival of November 11, 2016, Alibaba achieved growth of up to 20% of their conversion rate using personalized landing pages, according to Alizila. Recommendations are responsible for 70% of the time people spend watching videos on YouTube. 75% of what people are watching on Netflix comes from recommendations, according to McKinsey. Employing a recommender system enables Netflix to save around $1 billion each year—less on ads
  10. BibTip used for the previous mentioned catalogue
  11. Hamlet is a tool that uses machine learning to explor MIT thesis collection. Plug in a thesis and find out which other theses are most conceptually similar. 
  12. The ScienceDirect free Recommendations service uses machine learning and your online activity to suggest research tailored to your needs. Once you've signed in, the algorithm uses your signed-in activity on ScienceDirect to understand your research interests. It then searches databases of more than 3,800 journals and over 37,000 book titles to find related content. The more frequently you sign in, the better it gets to know you, and the more relevant the recommendations you'll receive.
  13. Help library users discover material relevant to their research Predict individual preferences, help users make better choices
  14. Lu, J., Wu, D., Mao, M., Wang, W., & Zhang, G. (2015). Recommender system application developments: a survey. Decision Support Systems, 74, 12-32.
  15. Collaborative filtering-based recommendation techniques combine users in some way and creates recommendations for one user based on the preferences of another user. It helps make choices based on the opinions of other people who share similar interests. Content based pair specific users to library items based on the metadata of the item and what is known about the user. For example, if a user indicates in some way that they enjoy mystery novels, items identified as mystery novels might be recommended to them. 
  16. This project is to help librarians make collection decisions using the recommender system, and in this presentation we will illustrate several examples of building this system to aid in the selection of monographs. One example involves the merging of popular titles with reader rating data. We found that while The New York Times publishes best seller titles based on the rates of sales, they do not have any connection to user ratings. By leveraging data from Goodreads, the world’s largest site for readers and book recommendations, we will build a simple recommender system that produces The New York Times best seller titles that have higher user rating. Drawing on bibliographic data from highly rated best sellers, we used a few methods to suggest items with similar features.
  17. Data for this project was retrieved from APIs. Weekly hardcover fiction best sellers in the year of 2018 was collected using New York Times API. A total of 52 weeks of data, containing 15 best seller titles, was retrieved in CSV files. After combing 52 files and removing duplicated titles, a total of 192 titles comprise the 2018 New York Times best seller books. The file contains fields such as ISBN, title, author, subject and summary, with no review information.
  18. Goodreads also provides free API to retrieve review data. The project team collected reviews in 2018 by its top 99 reviewers. Among those, 21 had reviewed New York Times best seller titles, with a number of 1367 reviews available. Goodreads review data contains userID, rating, goodreadsID, and title. GoodreadsIDs were matched with ISBN to create a list of New York Times best sellers with rating.
  19. Bibliometric data came from WorldCat Search API. Unlike the free New York Times API and Goodreads API, WordCat Search API came with WorldCat services subscriber. Data was in XML file format, and Python Pandas library was used to parse the data. author, book summary, title, topic term, and genre
  20. Pandas for query csv files, numpy for correlation Sklearn for analysis, alternative: spaCy-Natural Language Processing in Python. It features NER, POS tagging, dependency parsing, word vectors and more.
  21. Bayesian Estimate Algorithm offers generalized recommendations based on item popularity score. The basic idea behind this algorithm is that items that are more popular and critically acclaimed will have a higher probability of being liked by the average users. Using this method, each title will generate a list of books with a similarity score. The higher the score, the more closed related the books are. The values are need to determined an appropriate value for m percentile as cutoff. (more high value more small output)
  22. The score here is similarity score, weighted rating
  23. Matrix factorization algorithm is a popular collaborative filtering model, been widely adopted in the recommendation field for a long time. Matrix factorization is the breaking down of one matrix into matrix of users and items. The main idea is to make recommendation in the large collaborative sparse matrix through analyzing two small and low-dimensional matrices, such as reader and books.
  24. Using SVD. The Singular-Value Decomposition (SVD), is a matrix decomposition method for reducing a matrix  in order to make certain subsequent matrix calculations simpler.
  25. Cosine similarity score is another one of popular content based recommendation techniques. It converts a collection of metadata text to a matrix of token counts, then compute the cosine similarity score for each word. The cosine score can take any value between -1 and 1. The higher the cosine score, the more similar the documents are to each other.
  26. By calculating cosine similarity score from metadata field author, title, topic term, and genre, a list of recommended book was generated.
  27. This method is frequently used in natural language processing. It converts a collection of summary to a matrix of TF-IDF that reflect how important a word is to each summary by giving a weight factor. After the calculation of TF-IDF, cosine similarity algorithm is used to measure the similarity for each word.
  28. There are limitations to recommendation systems. For this project, the project team used leisure reading titles as a pilot. As some recommendation techniques reply on reader reviews, the lack of availability of review activities for academic books makes it hard to use reader reviews to support recommendations for academics. On a larger scale, it raised the concern for privacy. User behavior data are sensitive information. Personalized recommendation service should not sacrifice user’s privacy. Besides, the algorithms that determine which users are similar and thus which recommendations to make are not often understood. Reidsma (2016) pointed out that "in librarianship over the past few decades, the profession has had to grapple with the perception that computers are better at finding relevant information than people." The algorithms that are doing the finding, however, often carry the same hidden biases that their programmers have. Reidsma (2016) encourages a broader understanding of algorithms in general and deeper understanding of recommendation algorithms in particular.
  29. This projects highlights several ways of recommendations by using New York Times bestseller titles. The technique has potential to be implemented in academic library settings, using content based metadata. Libraries could run reports for highly circulated items and find similar books to those items for collection development. It is the same with interlibrary loan data. Libraries could find out similar items to frequently requested interlibrary loan items to bridge the gap in the collection. Moreover, retrieving data from libraries, especially libraries with similar user profile, could generate cross-library recommendations. The recommender system will use machine-learning algorithms not only to simplify collection development for librarians, but will also help end users discover more items relevant to their interests.