Diversity in recommender systems - Bridging the gap between users and systems

Institut de Recherche en Informatique de Toulouse (IRIT) - UMR 5505

Bridging the gap between users and systems

Laurent CANDILLIER – Max CHEVALIER – Damien DUDOGNON – Josiane MOTHE

27/10/11

Diversity in recommender systems
 How to recommend documents related to a visited one
 Maximize the chances of retrieving at least one relevant document per user [Santos et al., 2010]
 Cover a wide range of users’ interests

 Context
 Blog platform
 Unknown user => no profile
 Diversity of users, diversity of their expectations

=> Diversify the recommendations

What is diversity?
 Definitions from the literature
 Topicality
 Related to a particular topic [Xu and Chen, 2006]

 Diversity
 Topical diversity
 Extrinsic: solve ambiguity [Radlinski et al., 2009]

 Intrinsic: avoid redundancy [Clarke et al., 2008]

 Serendipity
 Attractive and surprising documents [Herlocker et al., 2004]


Approaches to diversify IR results
 Clustering
 Identify aspects
 Reorder depending on the aspects covered

 Examples
 K-Means [Bi et al., 2009]
 Hierarchical Clustering [Meij et al., 2010]
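
The "reorder depending on the aspects covered" step can be sketched as a round-robin over clusters. This is only an illustration: the function name `reorder_by_cluster` and the `cluster_of` mapping are assumptions, and the cluster labels are taken as given (for instance from k-means or hierarchical clustering) rather than computed here.

```python
from collections import defaultdict
from itertools import zip_longest


def reorder_by_cluster(ranked_docs, cluster_of):
    """Interleave a ranked list so that the top results cover different clusters.

    ranked_docs: document ids, best first.
    cluster_of:  hypothetical mapping doc id -> cluster label, computed beforehand.
    """
    buckets = defaultdict(list)
    for doc in ranked_docs:
        # The original ranking order is preserved inside each cluster.
        buckets[cluster_of[doc]].append(doc)

    reordered = []
    # Take the best remaining document of every cluster, layer by layer.
    for layer in zip_longest(*buckets.values()):
        reordered.extend(doc for doc in layer if doc is not None)
    return reordered


ranked = ["d1", "d2", "d3", "d4", "d5"]
clusters = {"d1": "a", "d2": "a", "d3": "b", "d4": "a", "d5": "c"}
reordered = reorder_by_cluster(ranked, clusters)
print(reordered)
```

With this toy input, the first three positions each come from a different cluster before any cluster is visited a second time.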


 Sliding Windows
 Reorder the retrieved documents
 Select documents using metrics
 Similarity with the visited document

 Similarity with the current recommended document list

 Examples
 MMR [Carbonell and Goldstein, 1998]
 Intra-list similarity [Ziegler et al., 2005]
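
The two metrics listed above (similarity with the visited document, similarity with the documents already recommended) are what MMR combines. A minimal greedy sketch follows, assuming precomputed similarities; the names `mmr_rerank`, `sim_to_visited`, `sim_between` and the trade-off parameter `lam` are illustrative choices, not taken from the slides.

```python
def mmr_rerank(candidates, sim_to_visited, sim_between, k, lam=0.7):
    """Greedy Maximal Marginal Relevance selection.

    candidates:     list of document ids.
    sim_to_visited: dict doc id -> similarity with the visited document.
    sim_between:    function (doc_a, doc_b) -> similarity between two documents.
    lam:            trade-off, 1.0 = relevance only, 0.0 = novelty only.
    """
    selected = []
    remaining = list(candidates)
    while remaining and len(selected) < k:

        def mmr_score(doc):
            # Redundancy is the highest similarity with an already selected document.
            redundancy = max((sim_between(doc, s) for s in selected), default=0.0)
            return lam * sim_to_visited[doc] - (1 - lam) * redundancy

        best = max(remaining, key=mmr_score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy data (assumed): "a" and "b" are near duplicates, "c" is different.
to_visited = {"a": 0.9, "b": 0.85, "c": 0.5}
pair_sim = {
    frozenset(("a", "b")): 0.95,
    frozenset(("a", "c")): 0.1,
    frozenset(("b", "c")): 0.1,
}


def sim_between(x, y):
    return pair_sim[frozenset((x, y))]


diverse_top2 = mmr_rerank(["a", "b", "c"], to_visited, sim_between, k=2, lam=0.5)
relevance_top2 = mmr_rerank(["a", "b", "c"], to_visited, sim_between, k=2, lam=1.0)
print(diverse_top2, relevance_top2)
```

With `lam=0.5` the near duplicate "b" is pushed down in favour of "c"; with `lam=1.0` the list falls back to pure similarity ranking.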


 Serendipity
 Alternative to topical diversity
 Similarity not only based on the content

 Examples
 Organizational similarity [Cabanac et al., 2007]
 Temporal diversity [Lathia et al., 2010]


Analysis of the TREC Web 2009 results
 Hypothesis
 Diversity of approaches
 No single approach fits all users’ needs
 Approaches are complementary
 Combining them is valuable

 Goals
 Analyse the results obtained with approaches that have
 The same goal
 Similar performance

=> Determine whether diversity exists


 Experimental framework
 Reference IR corpus (TREC Web 2009)

 Two IR contexts
 Adhoc task

 Diversity task

 Compare the results (runs) of the 4 best approaches of each task
 Similar performance according to IR metrics
 MAP for the adhoc task
 NDCG for the diversity task
 Overlap for each pair of runs, as an indicator of diversity
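
The pairwise overlap measure can be sketched as follows. The slides do not give the exact formula, so this assumes that overlap is the share of the top-k documents two runs have in common for a single topic; the function names and the toy runs are hypothetical.

```python
from itertools import combinations


def top_k_overlap(run_a, run_b, k=10):
    """Fraction of the top-k documents shared by two ranked runs."""
    return len(set(run_a[:k]) & set(run_b[:k])) / k


def mean_pairwise_overlap(runs, k=10):
    """Average top-k overlap over every pair of runs (dict: run name -> ranked doc ids)."""
    pairs = list(combinations(runs.values(), 2))
    return sum(top_k_overlap(a, b, k) for a, b in pairs) / len(pairs)


# Toy runs for one topic (assumed data), compared on their top 4 documents.
runs = {
    "run1": ["d1", "d2", "d3", "d4"],
    "run2": ["d3", "d4", "d5", "d6"],
    "run3": ["d7", "d8", "d1", "d9"],
}
average = mean_pairwise_overlap(runs, k=4)
print(f"{average:.2%}")
```

In a full evaluation this value would presumably be computed per topic and then averaged over all topics.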


 Adhoc Task

 Top 10 documents
 Overlap: 22.4%

 Precision: 0.384

 Overlap max < 30%


 Diversity Task

 Top 10 documents
 Overlap: 6.3%

 Overlap max < 15%


 Conclusions
 Two distinct approaches are unlikely to return the same (relevant) documents
 Low average overlap

 Diversity of approaches
 No approach significantly better than others
 A combination can be valuable

 TREC tasks focus on topicality and topical diversity
 They cannot be used to evaluate other types of diversity
 A user study is necessary [Hayes et al., 2002]


Users’ Study
 Our intuitions
 Most of the time, users want topicality
 Get focused information

 Sometimes, they want diversity
 Broaden the subject

 Serendipity
 Discover new information


Users’ Study
 Goals
 Verify our intuitions
 Prove that diversified recommendations answer a larger range of users’ needs

 Context of experimentation
 34 M.Sc. students (management field)
 Blog post recommendations


Users’ Study
 Experimental Framework
 Select a document


Users’ Study
 Read the selected document


Users’ Study
 Compute the recommendation lists

[Figure: Approach 1 to Approach 5, with List 1 (random) and List 2 (fused)]
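
The slides do not specify how the fused list is built. One simple possibility, shown purely as an assumption, is a round-robin interleaving of the per-approach lists with duplicate removal; `fuse_round_robin` and the toy lists are hypothetical.

```python
def fuse_round_robin(lists_by_approach, k=5):
    """Interleave several ranked lists into one list of at most k distinct documents.

    lists_by_approach: dict approach name -> ranked document ids.
    This is an assumed fusion strategy, not the one used in the study.
    """
    fused = []
    seen = set()
    max_len = max((len(docs) for docs in lists_by_approach.values()), default=0)
    for depth in range(max_len):
        for docs in lists_by_approach.values():
            if depth < len(docs) and docs[depth] not in seen:
                seen.add(docs[depth])
                fused.append(docs[depth])
                if len(fused) == k:
                    return fused
    return fused


lists = {
    "searchsim": ["p1", "p2", "p3"],
    "mlt": ["p1", "p4"],
    "kmeans": ["p5", "p6"],
}
fused = fuse_round_robin(lists, k=4)
print(fused)
```

Each approach contributes its best unseen document in turn, so no single approach dominates the top of the fused list.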



Users’ Study
 Present recommendation lists for assessment
Which list best meets your needs?


Users’ Study
 Present recommendation lists for assessment
Which list is the most diversified?


Users’ Study
 Assessment of all documents


Users’ Study
 Approaches used: topicality
 searchsim
 Vector-space model
 Document title as query
 mlt
 Apache Solr MoreLikeThis module
 Document content as query
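
A vector-space ranking with the document title as query, in the spirit of searchsim, can be sketched with raw term frequencies and cosine similarity. The weighting scheme of the actual system is not given in the slides; the tokenisation, function names and toy corpus below are assumptions.

```python
import math
from collections import Counter


def tf_vector(text):
    """Bag-of-words term-frequency vector (lowercased, whitespace tokenisation)."""
    return Counter(text.lower().split())


def cosine(u, v):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(u[t] * v[t] for t in u.keys() & v.keys())
    norm = math.sqrt(sum(c * c for c in u.values())) * math.sqrt(
        sum(c * c for c in v.values())
    )
    return dot / norm if norm else 0.0


def rank_by_title(title, corpus):
    """Rank documents (dict: id -> text) by similarity with the title used as query."""
    query = tf_vector(title)
    scored = [(cosine(query, tf_vector(text)), doc_id) for doc_id, text in corpus.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    # Documents sharing no term with the title are dropped.
    return [doc_id for score, doc_id in scored if score > 0]


corpus = {
    "p1": "recommender systems and diversity",
    "p2": "cooking pasta at home",
    "p3": "diversity",
}
ranked = rank_by_title("diversity in recommender systems", corpus)
print(ranked)
```

Using the full document content as the query instead of the title, as mlt does, would only change the first argument of `rank_by_title` in this sketch.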


Users’ Study
 Approaches used: topical diversity
 kmeans
 K-means clustering
 One element per cluster


Users’ Study
 Approaches used: serendipity
 blogart
 Random selection from the same blog
 topcateg
 Popular documents in the same category
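
The two serendipity approaches can be sketched on a small in-memory collection. The post fields (`id`, `blog`, `category`, `views`) and the use of a view count as the popularity signal are assumptions made for the example; the slides do not say how popularity is measured.

```python
import random


def blogart(posts, visited, k, rng=None):
    """Random selection of up to k other posts from the same blog as the visited post."""
    rng = rng or random.Random()
    pool = [p for p in posts if p["blog"] == visited["blog"] and p["id"] != visited["id"]]
    return rng.sample(pool, min(k, len(pool)))


def topcateg(posts, visited, k):
    """The k most popular other posts in the same category (popularity = assumed view count)."""
    pool = [
        p for p in posts
        if p["category"] == visited["category"] and p["id"] != visited["id"]
    ]
    return sorted(pool, key=lambda p: p["views"], reverse=True)[:k]


# Hypothetical posts.
posts = [
    {"id": "p1", "blog": "b1", "category": "travel", "views": 10},
    {"id": "p2", "blog": "b1", "category": "food", "views": 50},
    {"id": "p3", "blog": "b2", "category": "travel", "views": 30},
    {"id": "p4", "blog": "b2", "category": "travel", "views": 5},
]
visited = posts[0]
same_blog = blogart(posts, visited, k=2, rng=random.Random(0))
popular = topcateg(posts, visited, k=2)
print([p["id"] for p in same_blog], [p["id"] for p in popular])
```

Neither function looks at the content of the visited post, which is what distinguishes these approaches from the topical ones.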


Users’ Study
 Approaches used

 Same analysis as in the TREC experiments
 Same results

 Overlap is low (< 10%)

=> High diversity


Users’ Study
 Results
 Distribution of relevant documents
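 Each approach compared with the fused list (the label of the third slice was lost in extraction)

Approach    Approach   Fused   Third slice
blogart     35%        65%     0%
kmeans      52.5%      21.3%   26.2%
mlt         54.7%      32.8%   12.5%
searchsim   52.4%      38.9%   8.7%
topcateg    8.8%       91.2%   0%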



Users’ Study
 Results
 Relevant documents are mainly retrieved by the topical approaches
 But at least 20% are retrieved only by the fused list

 The fused list matches a wider range of needs
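
A distribution of relevant documents between one approach and the fused list can be computed as sketched below. The slides do not state how the shares were derived, so this assumes three exclusive categories (approach only, fused only, both) over the documents judged relevant; all names and data are hypothetical.

```python
def relevance_distribution(approach_list, fused_list, relevant):
    """Share of relevant documents found only by the approach, only by fused, or by both."""
    from_approach = set(approach_list) & relevant
    from_fused = set(fused_list) & relevant
    total = len(from_approach | from_fused)
    if total == 0:
        return {"approach_only": 0.0, "fused_only": 0.0, "both": 0.0}
    return {
        "approach_only": len(from_approach - from_fused) / total,
        "fused_only": len(from_fused - from_approach) / total,
        "both": len(from_approach & from_fused) / total,
    }


# Toy assessment (assumed data): documents judged relevant by one user.
shares = relevance_distribution(
    approach_list=["d1", "d2", "d3"],
    fused_list=["d3", "d4", "d5"],
    relevant={"d1", "d3", "d4", "d5"},
)
print(shares)
```

A high "fused_only" share is the kind of evidence that would support the claim that the fused list reaches needs a single approach misses.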


Conclusions and future work
 Conclusions
 Diversity of users’ expectations
 No one approach to rule them all
 A diversity of approaches
 Complementary

 Fused

 Diversity helps recommender systems (RS) fit more users’ needs


 Future work
 Real scale experiment
 OverBlog platform

 Repeat the user study
 More users (international call for participation)
 Avoid the biases revealed by this first study
 e.g. a more detailed form

=> Deeper analysis


 Future work
 Improve the model
 Refine the fusing process
 Add a learning process to weight each approach
 For every visited document, find the proportion of documents coming from each approach (log analysis)
 Better match with real users’ needs
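
The log analysis mentioned above could start from something like the following sketch, which computes, for every visited document, the proportion of followed recommendations coming from each approach. The log format (pairs of visited document and originating approach) is an assumption, as are the function name and the data.

```python
from collections import Counter, defaultdict


def approach_proportions(click_log):
    """Per visited document, proportion of clicked recommendations by approach.

    click_log: iterable of (visited_doc, approach) pairs, one per clicked
               recommendation (hypothetical log format).
    """
    per_doc = defaultdict(Counter)
    for visited_doc, approach in click_log:
        per_doc[visited_doc][approach] += 1

    proportions = {}
    for visited_doc, counts in per_doc.items():
        total = sum(counts.values())
        proportions[visited_doc] = {a: n / total for a, n in counts.items()}
    return proportions


log = [
    ("post1", "mlt"),
    ("post1", "mlt"),
    ("post1", "blogart"),
    ("post2", "kmeans"),
]
weights = approach_proportions(log)
print(weights)
```

These proportions could then serve as per-document weights when deciding how many slots each approach gets in the fused list.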


Thank you for your attention

Questions?


References
W. Bi, X. Yu, Y. Liu, F. Guan, Z. Peng, H. Xu, and X. Cheng, “ICTNET at Web Track 2009 diversity task”, Text REtrieval Conf., 2009

G. Cabanac, M. Chevalier, C. Chrisment, and C. Julien, “An Original Usage-based Metrics for Building a Unified View of Corporate Documents”, Inter. Conf. on Database and Expert Systems Applications, LNCS vol. 4653, 2007, pp. 202-212

J. Carbonell and J. Goldstein, “The use of MMR, diversity-based reranking for reordering documents and producing summaries”, ACM Conf. on Research and Development in Information Retrieval, 1998, pp. 335-336

C. L. A. Clarke, M. Kolla, G. V. Cormack, O. Vechtomova, A. Ashkan, S. Büttcher, and I. MacKinnon, “Novelty and Diversity in Information Retrieval Evaluation”, ACM Conf. on Research and Development in Information Retrieval, 2008, pp. 659-666

C. Hayes, P. Massa, P. Avesani, and P. Cunningham, “An online evaluation framework for recommender systems”, Workshop on Personalization and Recommendation in E-Commerce, 2002

J. L. Herlocker, J. A. Konstan, L. G. Terveen, and J. T. Riedl, “Evaluating Collaborative Filtering Recommender Systems”, ACM Trans. Information Systems, 22(1), 2004, pp. 5-53

N. Lathia, S. Hailes, L. Capra, and X. Amatriain, “Temporal diversity in recommender systems”, ACM Conf. on Research and Development in Information Retrieval, 2010, pp. 210-217

E. Meij, J. He, W. Weerkamp, and M. de Rijke, “Topical Diversity and Relevance Feedback”, Text REtrieval Conf., 2010

F. Radlinski, P. N. Bennett, B. Carterette, and T. Joachims, “Redundancy, diversity and interdependent document relevance”, SIGIR Forum, 43(2), 2009, pp. 46-52

R. L. T. Santos, C. Macdonald, and I. Ounis, “Selectively Diversifying Web Search Results”, ACM Inter. Conf. on Information and Knowledge Management, 2010

Y. C. Xu and Z. Chen, “Relevance judgment: What do information users consider beyond topicality”, Journal of the American Society for Information Science and Technology, 57(7), 2006, pp. 961-973

C. Ziegler, S. McNee, J. A. Konstan, and G. Lausen, “Improving recommendation lists through topic diversification”, Inter. Conf. on World Wide Web, 2005, pp. 22-32
