SlideShare una empresa de Scribd logo
1 de 41
Descargar para leer sin conexión
XPLODIV: An Exploitation-Exploration Aware Diversification
Approach for Recommender Systems
Andrea Barraza-Urbina, Benjamin Heitmann, Conor Hayes, Angela Carrillo-Ramos
The 28th International FLAIRS Conference
May 18-20, 2015
Hollywood, Florida, USA
Pontificia Universidad Javeriana
Facultad de Ingeniería
Maestría en Ingeniería de Sistemas y
Computación
Bogotá, Colombia
The Insight Centre for Data Analytics
Unit for Information Mining and Retrieval (UIMR)
National University of Ireland
Galway, Ireland
Centre for Data Analytics
Agenda
Introduction
XPLODIV
Diversification
Approach
Conclusion and
Future Work
Experimental
Validation
Literature
Review
2
Centre for Data Analytics
Introduction
Conclusion and
Future Work
Experimental
Validation
Literature
Review
3
XPLODIV
Diversification
Approach
Agenda
4
>50%
of Job Applications
are due to
Recommendation
~75%
of Watched Movies
are due to
Recommendation
Tools that help users identify interesting
products by means of personalized
suggestions.
Discovery
Recommender Systems
5
User-Item Matrix
Rating
Recommender Systems
Centre for Data Analytics
The task of selecting a subset of k elements from a broader set S in order to
maximize an objective function that considers both the relevance and
diversity of the k elements.
DiversityRelevance
A set is diverse if there is a high level of heterogeneity
(dissimilarity) between the items in the collection.
6
The Diversification Problem
Centre for Data Analytics
Search Space
User Profile
Recommend 10 movies to a user…
Movie Recommendation System
7
The Diversification Problem
Comedy
ActionDrama
Centre for Data Analytics
What happens if the user is
no longer interested in Action
movies?
Organize by relevance…
8
The Diversification Problem
User Profile
Comedy
ActionDrama
Centre for Data Analytics
VS.
Diversity
-Variety
-Balance
-Disparity
Relevance
Relevance
Diversity
In response to user profile ambiguity and the redundancy among results…
9
The Diversification Problem
Centre for Data Analytics
• Offering items representative of the variety of the user’s tastes.
• Offering novel products to explore unknown user preferences.
• Novelty can be achieved depending on how far or diverse an item is
from the user’s past experience.
Discovery
Exploitation of the User Profile
Exploration of novel products
10
Exploitation vs. Exploration
Centre for Data Analytics
Design a diversification technique that:
Research Goal
Centre for Data Analytics
XPLODIV
Diversification
Approach
Conclusion and
Future Work
Experimental
Validation
Literature
Review
12
Introduction
Agenda
Centre for Data AnalyticsAnalysis of Diversification Techniques
Information Retrieval Recommender Systems
[Carb98] [Agra09] [Sant10] [Zhen12] [Smyt01] [Zieg05] [Varg12] [Adom09]
Type of Solution
Greedy Optimization -
Explicit Approach -
Implicit Approach -
Trade-off diversity vs. relevance
Control of diversity vs.
relevance trade-off
? ?
Trade-off exploitation vs. exploration
Encourages Discovery ? ? ? ?
Control of exploitation vs.
exploration trade-off
-
13
Centre for Data Analytics
Information Retrieval Recommender Systems
[Carb98] [Agra09] [Sant10] [Zhen12] [Smyt01] [Zieg05] [Varg12] [Adom09]
Type of Solution
Greedy Optimization -
Explicit Approach -
Implicit Approach -
Trade-off diversity vs. relevance
Control of diversity vs.
relevance trade-off
? ?
Trade-off exploitation vs. exploration
Encourages Discovery ? - - - ? ? - ?
Control of exploitation vs.
exploration trade-off
- - - - -
Analysis of Diversification Techniques
14
Control of diversity vs. relevance
trade-off
Centre for Data AnalyticsAnalysis of Diversification Techniques
Information Retrieval Recommender Systems
[Carb98] [Agra09] [Sant10] [Zhen12] [Smyt01] [Zieg05] [Varg12] [Adom09]
Type of Solution
Greedy Optimization + + + + + + + -
Explicit Approach - + + + - - + -
Implicit Approach + - - - + + - -
Trade-off diversity vs. relevance
Control of diversity vs.
relevance trade-off
+ - + + + + ? ?
Trade-off exploitation vs. exploration
Encourages Discovery ? ? ? - ?
Control of exploitation vs.
exploration trade-off
-
15
Control of Exploitation vs. Exploration
trade-off
Encourages Discovery
Current solutions are mostly inspired
by work in Information Retrieval
Centre for Data Analytics
XPLODIV
Diversification
Approach
Conclusion and
Future Work
Experimental
Validation
Literature
Review
16
Introduction
Agenda
Centre for Data Analytics
Traditional
Recommendation
Algorithm
Candidate
Items Final Diversified
Recommendation
List
User
Profiles
Item
Profiles
XPLODIV: Exploitation-Exploration
Diversification Technique
Diversification
Technique
XPLODIV
We formulate our approach as a:
• Post-Filtering Technique
• Greedy optimization problem
17
Centre for Data Analytics
XPLODIV 𝑖, 𝕌, ℝ = 𝛼 ∙ 𝑟𝑒𝑙 𝑖 + 1 − 𝛼 ∙ 𝑑𝑖𝑣 𝑖, ℝ ∙ 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑖𝑡(𝑖, 𝕌) + 1 − 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑟𝑒 𝑖, 𝕌
XPLODIV has four core dimensions:
Relevance
𝑟𝑒𝑙 𝑖
Diversity
𝑑𝑖𝑣 𝑖, ℝ
Exploitation
𝑥𝑝𝑙𝑜𝑖𝑡 𝑖, 𝕌
Exploration
𝑥𝑝𝑙𝑜𝑟𝑒 𝑖, 𝕌
• Each dimension must be normalized to return a value in the range [0,1].
• 1 is the highest desirable value.
18
XPLODIV
Centre for Data Analytics
XPLODIV 𝑖, 𝕌, ℝ = 𝛼 ∙ 𝑟𝑒𝑙 𝑖 + 1 − 𝛼 ∙ 𝑑𝑖𝑣 𝑖, ℝ ∙ 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑖𝑡(𝑖, 𝕌) + 1 − 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑟𝑒 𝑖, 𝕌
The approach has two control parameters:
• The parameter 𝜶 controls the trade-off between relevance and
diversity.
• The parameter 𝜷 controls the trade-off between exploitation
and exploration.
19
XPLODIV
Centre for Data Analytics
The relevance dimension gives priority to items that have high
predicted rating.
How relevant is the item we are
evaluating?
20
XPLODIV: Relevance Dimension
Normalized Predicted Rating
Centre for Data Analytics
Average pairwise dissimilarity of an
element i to a set ℝ
Minimum distance of an element i to a
set ℝ
How distant is the item being
evaluated from those previously
selected?
21
XPLODIV: Diversity Dimension
The diversity dimension measures how diverse an item i is in relation to a set
of items ℝ.
Centre for Data Analytics
The exploitation dimension gives priority to items that exploit known user
preference information.
Probability of high rating of similar
items
How representative is the item
being evaluated of items found in
the user profile?
22
XPLODIV: Exploitation Dimension
Centre for Data Analytics
Diversity of item i to user profile 𝕌 Average pairwise dissimilarity
Minimum dissimilarity
How novel is the item being
evaluated for the user?
23
XPLODIV: Exploration Dimension
The exploration dimension gives priority to items that allow the user to
discover and explore the unknown.
Centre for Data Analytics
24
XPLODIV
𝜶
XPLODIV 𝑖, 𝕌, ℝ = 𝛼 ∙ 𝑟𝑒𝑙 𝑖 + 1 − 𝛼 ∙ 𝑑𝑖𝑣 𝑖, ℝ ∙ 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑖𝑡(𝑖, 𝕌) + 1 − 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑟𝑒 𝑖, 𝕌
𝜷
Centre for Data Analytics
XPLODIV
RELEVANCE DIVERSITY EXPLOITATION EXPLORATION
Average
Dissimilarity
Minimum
Dissimilarity
Dimension
Instantiation
Alternatives
Importance of
Associated
Preference
KNN Importance of
Associated
Preference
User Profile
Novelty
Neighborhood
Novelty
25
XPLODIV 𝑖, 𝕌, ℝ = 𝛼 ∙ 𝑟𝑒𝑙 𝑖 + 1 − 𝛼 ∙ 𝑑𝑖𝑣 𝑖, ℝ ∙ 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑖𝑡(𝑖, 𝕌) + 1 − 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑟𝑒 𝑖, 𝕌
Normalized
Predicted Rating
Centre for Data Analytics
Introduction
XPLODIV
Diversification
Approach
Conclusion and
Future Work
Experimental
Validation
Literature
Review
26
Agenda
Centre for Data Analytics
Experimental Validation
Claim I
XPLODIV can be tuned towards different
configurations of relevance, diversity,
exploitation and exploration.
Claim II
XPLODIV produces results comparable to
baseline techniques in terms of
relevance and diversity.
27
Centre for Data Analytics
• 100,000 ratings
• 943 users
• 1682 movies
Dataset
Quantitative Tests
28
Experimental Set-Up
Centre for Data Analytics
Baselines
29
Experimental Set-Up
• No Diversity: returns the top k of candidate items.
• Random Diversity: returns a random selection of k items from
candidate items.
• Maximal Marginal Relevance (MMR) with α=0.5 : returns k items
selected with the technique MMR.
• Representative of implicit diversification approaches.
• Proposed by Carbonell et al. 1998.
• Has served as foundation for many related more recent approaches.
Quantitative Tests
Centre for Data Analytics
XPLODIV
RELEVANCE DIVERSITY EXPLOITATION EXPLORATION
Average
Dissimilarity
Minimum
Dissimilarity
Dimension
Instantiation
Alternatives
Importance of
Associated
Preference
KNN Importance of
Associated
Preference
User Profile
Novelty
Neighborhood
Novelty
30
XPLODIV 𝑖, 𝕌, ℝ = 𝛼 ∙ 𝑟𝑒𝑙 𝑖 + 1 − 𝛼 ∙ 𝑑𝑖𝑣 𝑖, ℝ ∙ 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑖𝑡(𝑖, 𝕌) + 1 − 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑟𝑒 𝑖, 𝕌
Normalized
Predicted Rating
Centre for Data Analytics
XPLODIV
RELEVANCE DIVERSITY EXPLOITATION EXPLORATION
Average
Dissimilarity
Minimum
Dissimilarity
Dimension
Instantiation
Alternatives
Importance of
Associated
Preference
KNN Importance of
Associated
Preference
User Profile
Novelty
Neighborhood
Novelty
31
XPLODIV 𝑖, 𝕌, ℝ = 𝛼 ∙ 𝑟𝑒𝑙 𝑖 + 1 − 𝛼 ∙ 𝑑𝑖𝑣 𝑖, ℝ ∙ 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑖𝑡(𝑖, 𝕌) + 1 − 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑟𝑒 𝑖, 𝕌
Normalized
Predicted Rating
Centre for Data Analytics
No Bias
• 𝛼 = 0.5, 𝛽 = 0.5.
Relevance Bias
• 𝛼 = 0.8, 𝛽 = 0.5.
Exploitation Bias
• 𝛼 = 0.2, 𝛽 = 0.7.
Exploration Bias
• 𝛼 = 0.2, 𝛽 = 0.3.
Pure Exploitation
• 𝛼 = 0.0, 𝛽 = 1.0.
Pure Exploration
• 𝛼 = 0.0, 𝛽 = 0.0.
XPLODIV Test Cases
32
Experimental Set-Up
Exploitation Bias
Exploration Bias
𝜷
1.0
0.0
Relevance Bias
Diversity Bias
𝜶
1.0
0.0
33
Candidate
Items Final Diversified
Recommendation
List
User
Profiles
Item
Profiles
Recommendation Algorithm User-User Collaborative Filtering
Apache Mahout
Size 100
Matrix
User - Movie
Matrix
Movie - Genre
Traditional
Recommendation
Algorithm
Jaccard similarity coefficient to measure
similarity between Movie Items
Prototype
Centre for Data Analytics
Metrics
DIVERSITY
RELEVANCE
Exploration
Perspectives
Pairwise Intra-list
Dissimilarity
nDCG
Dissimilarity Threshold
Percentage
Metrics
User Profile
ExploitationExploitation
34
How well each item from the User
Profile is represented by the set of
selected items?
How different are selected items
from each other?
How relevant are selected items
considering their rank position?
What is the percentage of novel
items in the set of selected items?
3535
Tendency Graph
0.0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.0
No Diversity Random
Diversity
MMR XploDiv
No Bias
XploDiv
Relevance
Bias
XploDiv
Pure
Exploitation
XploDiv
Exploitation
Bias
XploDiv
Pure
Exploration
XploDiv
Exploration
Bias
Relevance Diversity Exploitation Exploration
3636
-3.86% -0.57%
-14.56%
-7.10%
-13.24%
-9.33%
-14.19%
-7.30%
137.52%
27.23%
4.78% 7.67%3.07%
-0.75%
19.33%
-8.17%
-31.28%
-27.82%
82.80%
33.25%
-97.61%
37.83%
113.65%
98.86%
-100%
-75%
-50%
-25%
0%
25%
50%
75%
100%
125%
150%
MMR
XploDiv
RelevanceBias
XploDiv
PureExploitation
XploDiv
ExploitationBias
XploDiv
PureExploration
XploDiv
ExplorationBias
Relevance Diversity Exploitation Exploration
Loss-Gain Graph relative to “No Diversity”
Loss/GainPercentage
37
Our solution:
• Generates results comparable to baseline and state-of-the-art
techniques.
• Can be tuned towards more explorative or exploitative
recommendations.
Claim I Claim II
Summary
Centre for Data Analytics
Introduction
XPLODIV
Diversification
Approach
Conclusion and
Future Work
Experimental
Validation
Literature
Review
38
Agenda
Centre for Data Analytics
Conclusion
39
Contributions:
1. Analytical comparison of related work.
2. Exploitation-Exploration Diversification Technique XPLODIV.
• Generates comparable results to baseline and state-of-the-art techniques.
• Explicitly considers the factor of exploration.
• Can be tuned to offer "exploitative diversity" or "explorative diversity" with
controlled sacrifice over relevance.
Centre for Data Analytics
Future Work
• Dynamically learn values for the control parameters 𝛼 and 𝛽 to adapt XPLODIV to different
user profile and dataset characteristics.
• The use of XPLODIV as an aggregation strategy for results generated by different
recommendation algorithms (Hybrid Recommendation Systems).
• Design diversification strategies, based on XPLODIV, to enhance a Traditional
Recommendation algorithm.
• Example. Adapt XPLODIV to select a diverse set of neighbors in a Collaborative Filtering
Recommendation System.
40
41
Acknowledgements: This research was made possible by
funding from Science Foundation Ireland under grant number
SFI/12/RC/2289 (Insight) and by the Master's Program of the
Computer Science Department at the Pontificia Universidad
Javeriana, Bogotá.

Más contenido relacionado

La actualidad más candente

Alleviating cold-user start problem with users' social network data in recomm...
Alleviating cold-user start problem with users' social network data in recomm...Alleviating cold-user start problem with users' social network data in recomm...
Alleviating cold-user start problem with users' social network data in recomm...Eduardo Castillejo Gil
 
Hybridisation Techniques for Cold-Starting Context-Aware Recommender Systems
Hybridisation Techniques for Cold-Starting Context-Aware Recommender SystemsHybridisation Techniques for Cold-Starting Context-Aware Recommender Systems
Hybridisation Techniques for Cold-Starting Context-Aware Recommender SystemsMatthias Braunhofer
 
Recommendation system using collaborative deep learning
Recommendation system using collaborative deep learningRecommendation system using collaborative deep learning
Recommendation system using collaborative deep learningRitesh Sawant
 
Generative Adversarial Networks and Their Medical Imaging Applications
Generative Adversarial Networks and Their Medical Imaging ApplicationsGenerative Adversarial Networks and Their Medical Imaging Applications
Generative Adversarial Networks and Their Medical Imaging ApplicationsKyuhwan Jung
 
Extended pso algorithm for improvement problems k means clustering algorithm
Extended pso algorithm for improvement problems k means clustering algorithmExtended pso algorithm for improvement problems k means clustering algorithm
Extended pso algorithm for improvement problems k means clustering algorithmIJMIT JOURNAL
 
Generative Adversarial Networks and Their Applications in Medical Imaging
Generative Adversarial Networks  and Their Applications in Medical ImagingGenerative Adversarial Networks  and Their Applications in Medical Imaging
Generative Adversarial Networks and Their Applications in Medical ImagingSanghoon Hong
 
Noorbehbahani a survey on instance selection for active learning
Noorbehbahani a survey on instance selection for active learningNoorbehbahani a survey on instance selection for active learning
Noorbehbahani a survey on instance selection for active learningnoorbehbahani
 
Heuristics for the Maximal Diversity Selection Problem
Heuristics for the Maximal Diversity Selection ProblemHeuristics for the Maximal Diversity Selection Problem
Heuristics for the Maximal Diversity Selection ProblemIJMER
 
Survey on Various Classification Techniques in Data Mining
Survey on Various Classification Techniques in Data MiningSurvey on Various Classification Techniques in Data Mining
Survey on Various Classification Techniques in Data Miningijsrd.com
 
Reliable payment gateway.pdf
Reliable payment gateway.pdfReliable payment gateway.pdf
Reliable payment gateway.pdfsam26a
 
Evaluating the Use of Clustering for Automatically Organising Digital Library...
Evaluating the Use of Clustering for Automatically Organising Digital Library...Evaluating the Use of Clustering for Automatically Organising Digital Library...
Evaluating the Use of Clustering for Automatically Organising Digital Library...pathsproject
 
Distributed Representation-based Recommender Systems in E-commerce
Distributed Representation-based Recommender Systems in E-commerceDistributed Representation-based Recommender Systems in E-commerce
Distributed Representation-based Recommender Systems in E-commerceRakuten Group, Inc.
 
Credit card fraud detection using python machine learning
Credit card fraud detection using python machine learningCredit card fraud detection using python machine learning
Credit card fraud detection using python machine learningSandeep Garg
 
Probability density estimation using Product of Conditional Experts
Probability density estimation using Product of Conditional ExpertsProbability density estimation using Product of Conditional Experts
Probability density estimation using Product of Conditional ExpertsChirag Gupta
 
Barga Data Science lecture 5
Barga Data Science lecture 5Barga Data Science lecture 5
Barga Data Science lecture 5Roger Barga
 
A Scalable Approach for Efficiently Generating Structured Dataset Topic Profiles
A Scalable Approach for Efficiently Generating Structured Dataset Topic ProfilesA Scalable Approach for Efficiently Generating Structured Dataset Topic Profiles
A Scalable Approach for Efficiently Generating Structured Dataset Topic ProfilesBesnik Fetahu
 
Barga Data Science lecture 9
Barga Data Science lecture 9Barga Data Science lecture 9
Barga Data Science lecture 9Roger Barga
 
Recommender Systems Fairness Evaluation via Generalized Cross Entropy
Recommender Systems Fairness Evaluation via Generalized Cross EntropyRecommender Systems Fairness Evaluation via Generalized Cross Entropy
Recommender Systems Fairness Evaluation via Generalized Cross EntropyVito Walter Anelli
 
IRJET- Online Course Recommendation System
IRJET- Online Course Recommendation SystemIRJET- Online Course Recommendation System
IRJET- Online Course Recommendation SystemIRJET Journal
 

La actualidad más candente (20)

Alleviating cold-user start problem with users' social network data in recomm...
Alleviating cold-user start problem with users' social network data in recomm...Alleviating cold-user start problem with users' social network data in recomm...
Alleviating cold-user start problem with users' social network data in recomm...
 
Hybridisation Techniques for Cold-Starting Context-Aware Recommender Systems
Hybridisation Techniques for Cold-Starting Context-Aware Recommender SystemsHybridisation Techniques for Cold-Starting Context-Aware Recommender Systems
Hybridisation Techniques for Cold-Starting Context-Aware Recommender Systems
 
Recommendation system using collaborative deep learning
Recommendation system using collaborative deep learningRecommendation system using collaborative deep learning
Recommendation system using collaborative deep learning
 
Generative Adversarial Networks and Their Medical Imaging Applications
Generative Adversarial Networks and Their Medical Imaging ApplicationsGenerative Adversarial Networks and Their Medical Imaging Applications
Generative Adversarial Networks and Their Medical Imaging Applications
 
Extended pso algorithm for improvement problems k means clustering algorithm
Extended pso algorithm for improvement problems k means clustering algorithmExtended pso algorithm for improvement problems k means clustering algorithm
Extended pso algorithm for improvement problems k means clustering algorithm
 
Generative Adversarial Networks and Their Applications in Medical Imaging
Generative Adversarial Networks  and Their Applications in Medical ImagingGenerative Adversarial Networks  and Their Applications in Medical Imaging
Generative Adversarial Networks and Their Applications in Medical Imaging
 
Noorbehbahani a survey on instance selection for active learning
Noorbehbahani a survey on instance selection for active learningNoorbehbahani a survey on instance selection for active learning
Noorbehbahani a survey on instance selection for active learning
 
Heuristics for the Maximal Diversity Selection Problem
Heuristics for the Maximal Diversity Selection ProblemHeuristics for the Maximal Diversity Selection Problem
Heuristics for the Maximal Diversity Selection Problem
 
Survey on Various Classification Techniques in Data Mining
Survey on Various Classification Techniques in Data MiningSurvey on Various Classification Techniques in Data Mining
Survey on Various Classification Techniques in Data Mining
 
Reliable payment gateway.pdf
Reliable payment gateway.pdfReliable payment gateway.pdf
Reliable payment gateway.pdf
 
Evaluating the Use of Clustering for Automatically Organising Digital Library...
Evaluating the Use of Clustering for Automatically Organising Digital Library...Evaluating the Use of Clustering for Automatically Organising Digital Library...
Evaluating the Use of Clustering for Automatically Organising Digital Library...
 
Clustering
ClusteringClustering
Clustering
 
Distributed Representation-based Recommender Systems in E-commerce
Distributed Representation-based Recommender Systems in E-commerceDistributed Representation-based Recommender Systems in E-commerce
Distributed Representation-based Recommender Systems in E-commerce
 
Credit card fraud detection using python machine learning
Credit card fraud detection using python machine learningCredit card fraud detection using python machine learning
Credit card fraud detection using python machine learning
 
Probability density estimation using Product of Conditional Experts
Probability density estimation using Product of Conditional ExpertsProbability density estimation using Product of Conditional Experts
Probability density estimation using Product of Conditional Experts
 
Barga Data Science lecture 5
Barga Data Science lecture 5Barga Data Science lecture 5
Barga Data Science lecture 5
 
A Scalable Approach for Efficiently Generating Structured Dataset Topic Profiles
A Scalable Approach for Efficiently Generating Structured Dataset Topic ProfilesA Scalable Approach for Efficiently Generating Structured Dataset Topic Profiles
A Scalable Approach for Efficiently Generating Structured Dataset Topic Profiles
 
Barga Data Science lecture 9
Barga Data Science lecture 9Barga Data Science lecture 9
Barga Data Science lecture 9
 
Recommender Systems Fairness Evaluation via Generalized Cross Entropy
Recommender Systems Fairness Evaluation via Generalized Cross EntropyRecommender Systems Fairness Evaluation via Generalized Cross Entropy
Recommender Systems Fairness Evaluation via Generalized Cross Entropy
 
IRJET- Online Course Recommendation System
IRJET- Online Course Recommendation SystemIRJET- Online Course Recommendation System
IRJET- Online Course Recommendation System
 

Similar a XPLODIV: An Exploitation-Exploration Aware Diversification Approach for Recommender Systems

Data analytics and visualization
Data analytics and visualizationData analytics and visualization
Data analytics and visualizationVini Vasundharan
 
Incorporating Diversity in a Learning to Rank Recommender System
Incorporating Diversity in a Learning to Rank Recommender SystemIncorporating Diversity in a Learning to Rank Recommender System
Incorporating Diversity in a Learning to Rank Recommender SystemJacek Wasilewski
 
Empirical Evaluation of Active Learning in Recommender Systems
Empirical Evaluation of Active Learning in Recommender SystemsEmpirical Evaluation of Active Learning in Recommender Systems
Empirical Evaluation of Active Learning in Recommender SystemsUniversity of Bergen
 
WWW 2021report public
WWW 2021report publicWWW 2021report public
WWW 2021report publicTakuma Oda
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender SystemsMarcel Kurovski
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systemsinovex GmbH
 
Recommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringRecommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringChangsung Moon
 
A Content Boosted Hybrid Recommendation System
A Content Boosted Hybrid Recommendation SystemA Content Boosted Hybrid Recommendation System
A Content Boosted Hybrid Recommendation SystemSeval Çapraz
 
Artificial intelligence and IoT
Artificial intelligence and IoTArtificial intelligence and IoT
Artificial intelligence and IoTVeselin Pizurica
 
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...Daniel Valcarce
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysisAcad
 
Владимир Гулин, Mail.Ru Group, Learning to rank using clickthrough data
Владимир Гулин, Mail.Ru Group, Learning to rank using clickthrough dataВладимир Гулин, Mail.Ru Group, Learning to rank using clickthrough data
Владимир Гулин, Mail.Ru Group, Learning to rank using clickthrough dataMail.ru Group
 
Tech meetup Data Driven - Codemotion
Tech meetup Data Driven - Codemotion Tech meetup Data Driven - Codemotion
Tech meetup Data Driven - Codemotion antimo musone
 
Marketing Analytics with R Lifting Campaign Success Rates
Marketing Analytics with R Lifting Campaign Success RatesMarketing Analytics with R Lifting Campaign Success Rates
Marketing Analytics with R Lifting Campaign Success RatesRevolution Analytics
 

Similar a XPLODIV: An Exploitation-Exploration Aware Diversification Approach for Recommender Systems (20)

Data analytics and visualization
Data analytics and visualizationData analytics and visualization
Data analytics and visualization
 
Incorporating Diversity in a Learning to Rank Recommender System
Incorporating Diversity in a Learning to Rank Recommender SystemIncorporating Diversity in a Learning to Rank Recommender System
Incorporating Diversity in a Learning to Rank Recommender System
 
User Personality and the New User Problem in a Context-­‐Aware POI Recommende...
User Personality and the New User Problem in a Context-­‐Aware POI Recommende...User Personality and the New User Problem in a Context-­‐Aware POI Recommende...
User Personality and the New User Problem in a Context-­‐Aware POI Recommende...
 
Empirical Evaluation of Active Learning in Recommender Systems
Empirical Evaluation of Active Learning in Recommender SystemsEmpirical Evaluation of Active Learning in Recommender Systems
Empirical Evaluation of Active Learning in Recommender Systems
 
WWW 2021report public
WWW 2021report publicWWW 2021report public
WWW 2021report public
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
An overview on diversity and Software Testing
An overview on diversity and Software TestingAn overview on diversity and Software Testing
An overview on diversity and Software Testing
 
Recommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringRecommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative Filtering
 
A Content Boosted Hybrid Recommendation System
A Content Boosted Hybrid Recommendation SystemA Content Boosted Hybrid Recommendation System
A Content Boosted Hybrid Recommendation System
 
master_thesis.pdf
master_thesis.pdfmaster_thesis.pdf
master_thesis.pdf
 
DEA
DEADEA
DEA
 
Thesis Presentation
Thesis PresentationThesis Presentation
Thesis Presentation
 
Artificial intelligence and IoT
Artificial intelligence and IoTArtificial intelligence and IoT
Artificial intelligence and IoT
 
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...
 
Cluster analysis
Cluster analysisCluster analysis
Cluster analysis
 
Владимир Гулин, Mail.Ru Group, Learning to rank using clickthrough data
Владимир Гулин, Mail.Ru Group, Learning to rank using clickthrough dataВладимир Гулин, Mail.Ru Group, Learning to rank using clickthrough data
Владимир Гулин, Mail.Ru Group, Learning to rank using clickthrough data
 
Descriptive Analytics: Data Reduction
 Descriptive Analytics: Data Reduction Descriptive Analytics: Data Reduction
Descriptive Analytics: Data Reduction
 
Tech meetup Data Driven - Codemotion
Tech meetup Data Driven - Codemotion Tech meetup Data Driven - Codemotion
Tech meetup Data Driven - Codemotion
 
Marketing Analytics with R Lifting Campaign Success Rates
Marketing Analytics with R Lifting Campaign Success RatesMarketing Analytics with R Lifting Campaign Success Rates
Marketing Analytics with R Lifting Campaign Success Rates
 

Último

CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptxpallavirawat456
 
Citronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayCitronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayupadhyaymani499
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...Universidade Federal de Sergipe - UFS
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxpriyankatabhane
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPirithiRaju
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubaikojalkojal131
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPirithiRaju
 
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxSTOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxMurugaveni B
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxmaryFF1
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringPrajakta Shinde
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuinethapagita
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxNandakishor Bhaurao Deshmukh
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naJASISJULIANOELYNV
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologycaarthichand2003
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxpriyankatabhane
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxBerniceCayabyab1
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxmalonesandreagweneth
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024innovationoecd
 

Último (20)

CHROMATOGRAPHY PALLAVI RAWAT.pptx
CHROMATOGRAPHY  PALLAVI RAWAT.pptxCHROMATOGRAPHY  PALLAVI RAWAT.pptx
CHROMATOGRAPHY PALLAVI RAWAT.pptx
 
Citronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayCitronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyay
 
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
REVISTA DE BIOLOGIA E CIÊNCIAS DA TERRA ISSN 1519-5228 - Artigo_Bioterra_V24_...
 
Speech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptxSpeech, hearing, noise, intelligibility.pptx
Speech, hearing, noise, intelligibility.pptx
 
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdfPests of Blackgram, greengram, cowpea_Dr.UPR.pdf
Pests of Blackgram, greengram, cowpea_Dr.UPR.pdf
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
 
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptxSTOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
STOPPED FLOW METHOD & APPLICATION MURUGAVENI B.pptx
 
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptxECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
ECG Graph Monitoring with AD8232 ECG Sensor & Arduino.pptx
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical Engineering
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by na
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technology
 
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptxMicrophone- characteristics,carbon microphone, dynamic microphone.pptx
Microphone- characteristics,carbon microphone, dynamic microphone.pptx
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024
 

XPLODIV: An Exploitation-Exploration Aware Diversification Approach for Recommender Systems

  • 1. XPLODIV: An Exploitation-Exploration Aware Diversification Approach for Recommender Systems Andrea Barraza-Urbina, Benjamin Heitmann, Conor Hayes, Angela Carrillo-Ramos The 28th International FLAIRS Conference May 18-20, 2015 Hollywood, Florida, USA Pontificia Universidad Javeriana Facultad de Ingeniería Maestría en Ingeniería de Sistemas y Computación Bogotá, Colombia The Insight Centre for Data Analytics Unit for Information Mining and Retrieval (UIMR) National University of Ireland Galway, Ireland
  • 2. Centre for Data Analytics Agenda Introduction XPLODIV Diversification Approach Conclusion and Future Work Experimental Validation Literature Review 2
  • 3. Centre for Data Analytics Introduction Conclusion and Future Work Experimental Validation Literature Review 3 XPLODIV Diversification Approach Agenda
  • 4. 4 >50% of Job Applications are due to Recommendation ~75% of Watched Movies are due to Recommendation Tools that help users identify interesting products by means of personalized suggestions. Discovery Recommender Systems
  • 6. Centre for Data Analytics The task of selecting a subset of k elements from a broader set S in order to maximize an objective function that considers both the relevance and diversity of the k elements. DiversityRelevance A set is diverse if there is a high level of heterogeneity (dissimilarity) between the items in the collection. 6 The Diversification Problem
  • 7. Centre for Data Analytics Search Space User Profile Recommend 10 movies to a user… Movie Recommendation System 7 The Diversification Problem Comedy ActionDrama
  • 8. Centre for Data Analytics What happens if the user is no longer interested in Action movies? Organize by relevance… 8 The Diversification Problem User Profile Comedy ActionDrama
  • 9. Centre for Data Analytics VS. Diversity -Variety -Balance -Disparity Relevance Relevance Diversity In response to user profile ambiguity and the redundancy among results… 9 The Diversification Problem
  • 10. Centre for Data Analytics • Offering items representative of the variety of the user’s tastes. • Offering novel products to explore unknown user preferences. • Novelty can be achieved depending on how far or diverse an item is from the user’s past experience. Discovery Exploitation of the User Profile Exploration of novel products 10 Exploitation vs. Exploration
  • 11. Centre for Data Analytics Design a diversification technique that: Research Goal
  • 12. Centre for Data Analytics XPLODIV Diversification Approach Conclusion and Future Work Experimental Validation Literature Review 12 Introduction Agenda
  • 13. Centre for Data AnalyticsAnalysis of Diversification Techniques Information Retrieval Recommender Systems [Carb98] [Agra09] [Sant10] [Zhen12] [Smyt01] [Zieg05] [Varg12] [Adom09] Type of Solution Greedy Optimization - Explicit Approach - Implicit Approach - Trade-off diversity vs. relevance Control of diversity vs. relevance trade-off ? ? Trade-off exploitation vs. exploration Encourages Discovery ? ? ? ? Control of exploitation vs. exploration trade-off - 13
  • 14. Centre for Data Analytics Information Retrieval Recommender Systems [Carb98] [Agra09] [Sant10] [Zhen12] [Smyt01] [Zieg05] [Varg12] [Adom09] Type of Solution Greedy Optimization - Explicit Approach - Implicit Approach - Trade-off diversity vs. relevance Control of diversity vs. relevance trade-off ? ? Trade-off exploitation vs. exploration Encourages Discovery ? - - - ? ? - ? Control of exploitation vs. exploration trade-off - - - - - Analysis of Diversification Techniques 14 Control of diversity vs. relevance trade-off
  • 15. Centre for Data AnalyticsAnalysis of Diversification Techniques Information Retrieval Recommender Systems [Carb98] [Agra09] [Sant10] [Zhen12] [Smyt01] [Zieg05] [Varg12] [Adom09] Type of Solution Greedy Optimization + + + + + + + - Explicit Approach - + + + - - + - Implicit Approach + - - - + + - - Trade-off diversity vs. relevance Control of diversity vs. relevance trade-off + - + + + + ? ? Trade-off exploitation vs. exploration Encourages Discovery ? ? ? - ? Control of exploitation vs. exploration trade-off - 15 Control of Exploitation vs. Exploration trade-off Encourages Discovery Current solutions are mostly inspired by work in Information Retrieval
  • 16. Centre for Data Analytics XPLODIV Diversification Approach Conclusion and Future Work Experimental Validation Literature Review 16 Introduction Agenda
  • 17. Centre for Data Analytics Traditional Recommendation Algorithm Candidate Items Final Diversified Recommendation List User Profiles Item Profiles XPLODIV: Exploitation-Exploration Diversification Technique Diversification Technique XPLODIV We formulate our approach as a: • Post-Filtering Technique • Greedy optimization problem 17
  • 18. Centre for Data Analytics XPLODIV 𝑖, 𝕌, ℝ = 𝛼 ∙ 𝑟𝑒𝑙 𝑖 + 1 − 𝛼 ∙ 𝑑𝑖𝑣 𝑖, ℝ ∙ 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑖𝑡(𝑖, 𝕌) + 1 − 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑟𝑒 𝑖, 𝕌 XPLODIV has four core dimensions: Relevance 𝑟𝑒𝑙 𝑖 Diversity 𝑑𝑖𝑣 𝑖, ℝ Exploitation 𝑥𝑝𝑙𝑜𝑖𝑡 𝑖, 𝕌 Exploration 𝑥𝑝𝑙𝑜𝑟𝑒 𝑖, 𝕌 • Each dimension must be normalized to return a value in the range [0,1]. • 1 is the highest desirable value. 18 XPLODIV
  • 19. Centre for Data Analytics XPLODIV 𝑖, 𝕌, ℝ = 𝛼 ∙ 𝑟𝑒𝑙 𝑖 + 1 − 𝛼 ∙ 𝑑𝑖𝑣 𝑖, ℝ ∙ 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑖𝑡(𝑖, 𝕌) + 1 − 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑟𝑒 𝑖, 𝕌 The approach has two control parameters: • The parameter 𝜶 controls the trade-off between relevance and diversity. • The parameter 𝜷 controls the trade-off between exploitation and exploration. 19 XPLODIV
  • 20. Centre for Data Analytics The relevance dimension gives priority to items that have high predicted rating. How relevant is the item we are evaluating? 20 XPLODIV: Relevance Dimension Normalized Predicted Rating
  • 21. Centre for Data Analytics Average pairwise dissimilarity of an element i to a set ℝ Minimum distance of an element i to a set ℝ How distant is the item being evaluated from those previously selected? 21 XPLODIV: Diversity Dimension The diversity dimension measures how diverse an item i is in relation to a set of items ℝ.
  • 22. Centre for Data Analytics The exploitation dimension gives priority to items that exploit known user preference information. Probability of high rating of similar items How representative is the item being evaluated of items found in the user profile? 22 XPLODIV: Exploitation Dimension
  • 23. Centre for Data Analytics Diversity of item i to user profile 𝕌 Average pairwise dissimilarity Minimum dissimilarity How novel is the item being evaluated for the user? 23 XPLODIV: Exploration Dimension The exploration dimension gives priority to items that allow the user to discover and explore the unknown.
  • 24. Centre for Data Analytics 24 XPLODIV 𝜶 XPLODIV 𝑖, 𝕌, ℝ = 𝛼 ∙ 𝑟𝑒𝑙 𝑖 + 1 − 𝛼 ∙ 𝑑𝑖𝑣 𝑖, ℝ ∙ 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑖𝑡(𝑖, 𝕌) + 1 − 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑟𝑒 𝑖, 𝕌 𝜷
  • 25. Centre for Data Analytics XPLODIV RELEVANCE DIVERSITY EXPLOITATION EXPLORATION Average Dissimilarity Minimum Dissimilarity Dimension Instantiation Alternatives Importance of Associated Preference KNN Importance of Associated Preference User Profile Novelty Neighborhood Novelty 25 XPLODIV 𝑖, 𝕌, ℝ = 𝛼 ∙ 𝑟𝑒𝑙 𝑖 + 1 − 𝛼 ∙ 𝑑𝑖𝑣 𝑖, ℝ ∙ 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑖𝑡(𝑖, 𝕌) + 1 − 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑟𝑒 𝑖, 𝕌 Normalized Predicted Rating
  • 26. Centre for Data Analytics Introduction XPLODIV Diversification Approach Conclusion and Future Work Experimental Validation Literature Review 26 Agenda
  • 27. Centre for Data Analytics Experimental Validation Claim I XPLODIV can be tuned towards different configurations of relevance, diversity, exploitation and exploration. Claim II XPLODIV produces results comparable to baseline techniques in terms of relevance and diversity. 27
  • 28. Centre for Data Analytics • 100,000 ratings • 943 users • 1682 movies Dataset Quantitative Tests 28 Experimental Set-Up
  • 29. Centre for Data Analytics Baselines 29 Experimental Set-Up • No Diversity: returns the top k of candidate items. • Random Diversity: returns a random selection of k items from candidate items. • Maximal Marginal Relevance (MMR) with α=0.5 : returns k items selected with the technique MMR. • Representative of implicit diversification approaches. • Proposed by Carbonell et al. 1998. • Has served as foundation for many related more recent approaches. Quantitative Tests
  • 30. Centre for Data Analytics XPLODIV RELEVANCE DIVERSITY EXPLOITATION EXPLORATION Average Dissimilarity Minimum Dissimilarity Dimension Instantiation Alternatives Importance of Associated Preference KNN Importance of Associated Preference User Profile Novelty Neighborhood Novelty 30 XPLODIV 𝑖, 𝕌, ℝ = 𝛼 ∙ 𝑟𝑒𝑙 𝑖 + 1 − 𝛼 ∙ 𝑑𝑖𝑣 𝑖, ℝ ∙ 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑖𝑡(𝑖, 𝕌) + 1 − 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑟𝑒 𝑖, 𝕌 Normalized Predicted Rating
  • 31. Centre for Data Analytics XPLODIV RELEVANCE DIVERSITY EXPLOITATION EXPLORATION Average Dissimilarity Minimum Dissimilarity Dimension Instantiation Alternatives Importance of Associated Preference KNN Importance of Associated Preference User Profile Novelty Neighborhood Novelty 31 XPLODIV 𝑖, 𝕌, ℝ = 𝛼 ∙ 𝑟𝑒𝑙 𝑖 + 1 − 𝛼 ∙ 𝑑𝑖𝑣 𝑖, ℝ ∙ 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑖𝑡(𝑖, 𝕌) + 1 − 𝛽 ∙ 𝑥𝑝𝑙𝑜𝑟𝑒 𝑖, 𝕌 Normalized Predicted Rating
  • 32. Centre for Data Analytics No Bias • 𝛼 = 0.5, 𝛽 = 0.5. Relevance Bias • 𝛼 = 0.8, 𝛽 = 0.5. Exploitation Bias • 𝛼 = 0.2, 𝛽 = 0.7. Exploration Bias • 𝛼 = 0.2, 𝛽 = 0.3. Pure Exploitation • 𝛼 = 0.0, 𝛽 = 1.0. Pure Exploration • 𝛼 = 0.0, 𝛽 = 0.0. XPLODIV Test Cases 32 Experimental Set-Up Exploitation Bias Exploration Bias 𝜷 1.0 0.0 Relevance Bias Diversity Bias 𝜶 1.0 0.0
  • 33. 33 Candidate Items Final Diversified Recommendation List User Profiles Item Profiles Recommendation Algorithm User-User Collaborative Filtering Apache Mahout Size 100 Matrix User - Movie Matrix Movie - Genre Traditional Recommendation Algorithm Jaccard similarity coefficient to measure similarity between Movie Items Prototype
  • 34. Centre for Data Analytics Metrics DIVERSITY RELEVANCE Exploration Perspectives Pairwise Intra-list Dissimilarity nDCG Dissimilarity Threshold Percentage Metrics User Profile ExploitationExploitation 34 How well each item from the User Profile is represented by the set of selected items? How different are selected items from each other? How relevant are selected items considering their rank position? What is the percentage of novel items in the set of selected items?
  • 35. 3535 Tendency Graph 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 No Diversity Random Diversity MMR XploDiv No Bias XploDiv Relevance Bias XploDiv Pure Exploitation XploDiv Exploitation Bias XploDiv Pure Exploration XploDiv Exploration Bias Relevance Diversity Exploitation Exploration
  • 37. 37 Our solution: • Generates results comparable to baseline and state-of-the-art techniques. • Can be tuned towards more explorative or exploitative recommendations. Claim I Claim II Summary
  • 38. Centre for Data Analytics Introduction XPLODIV Diversification Approach Conclusion and Future Work Experimental Validation Literature Review 38 Agenda
  • 39. Centre for Data Analytics Conclusion 39 Contributions: 1. Analytical comparison of related work. 2. Exploitation-Exploration Diversification Technique XPLODIV. • Generates comparable results to baseline and state-of-the-art techniques. • Explicitly considers the factor of exploration. • Can be tuned to offer "exploitative diversity" or "explorative diversity" with controlled sacrifice over relevance.
  • 40. Centre for Data Analytics Future Work • Dynamically learn values for the control parameters 𝛼 and 𝛽 to adapt XPLODIV to different user profile and dataset characteristics. • The use of XPLODIV as an aggregation strategy for results generated by different recommendation algorithms (Hybrid Recommendation Systems). • Design diversification strategies, based on XPLODIV, to enhance a Traditional Recommendation algorithm. • Example. Adapt XPLODIV to select a diverse set of neighbors in a Collaborative Filtering Recommendation System. 40
  • 41. 41 Acknowledgements: This research was made possible by funding from Science Foundation Ireland under grant number SFI/12/RC/2289 (Insight) and by the Master's Program of the Computer Science Department at the Pontificia Universidad Javeriana, Bogotá.

Notas del editor

  1. NETFLIX - http://techblog.netflix.com/2012/04/netflix-recommendations-beyond-5-stars.html LINKEDIN- http://es.slideshare.net/anmolbhasin/beyond-ratings-andfollowers-recsys-2012 AMAZON - http://glinden.blogspot.ie/2006/12/35-of-sales-from-recommendations.html TODO – Agregar la referencia de las citas Recommendation Systems are tools that help users find products they might like Products that the user might not have been looking In other words, recommenders help users discover new interesting products specially in a situation of information overload ----------------------------------------------------------- Antes de introducir el problema de diversificación en Sist. De Reco Es importante definir brevemente que son los Sistemas de Reco Los Sist. De Reco son herramientas que ofrecen sugerencias proactivas Estas sugerencias tienden a ser de productos que el usuario desconoce y no encontraría por su cuenta Incentivando asi al usuario a descubrir nuevos productos e intereses Estos sistemas se han convertido en herramientas importantes en e-commerce Por ejemplo Amazon indica que 35% de sus ventas provienen de alguna recomendación
  2. In a conventional recsys, a rating score is used to represent “how relevant an item is for a user” However, rating information is incomplete, and the recommendation system problem is then to estimate the unknown ratings After this, the system offers as recommedations ítems that have the highest predicted rating, or that would be the most relevant for the user --------------------------------------------- En un sistema de reco tradicional Los usuarios le colocan un puntaje o rating a productos el cual indica qué tanto el gusta el producto al usuario Estos puntajes se guardan en la matriz usuario-producto No obstante, es improbable q un usuario le coloque ratings a todos los productos posibles y el problema de recomendación Es estimar los ratings que los usuarios le colocarían a productos que no han evaluado Los distintos tipos de sistemas de recomendación difieren en la forma de estimar ratings
  3. There is a trade-off between selecting ítems of a higher relevance and obtaining diverse results ----------------------------------------------- Mejorar Presentar Recommender System primero y q soluciona cosas para info overload y eso Usar la palabra relevance en vez de quality The Conventional view is that there is a TRADE-OFF between relevance and diversity ----------------------------------------------- El problema de la diversificación consiste en seleccionar un subconjunto de elementos diversos de un conjunto de mayor tamaño El problema consiste en que existe un trade-off entre diversidad y calidad Es decir, al aumentar la diversidad general disminuye la calidad del conjunto A continuación ofreceremos un ejemplo en el ambiente de Sistemas de Recuperación de Información
  4. Con un Sistema de Recuperación de Información sin diversificación Se ordenan los resultados por orden de relevancia y se obtendría una lista de resultados donde los primeros elementos son de Java Lenguaje de Programación Pero : ¿Qué sucede si el usuario no se encuentra buscando Lenguaje de Programación Java?
  5. Dado que no se conocen las verdaderas intenciones del usuario Una solución es agregar diversidad a los resultados, a pesar de que se sacrifique relevancia
  6. In this manner, we can add diversity by offering … Que quede super clear que estamos introduciendo something new que es EXPLORATION Que exploration esta ligado con novelty q se define con diversity Resaltarlo bastante q sea un Strong Point Novelty can be adequately measured by diversity La definición conventional de diversity in recos hace solo exploitation … la queremos aumentar para agregar exploration La razón para tomar en cuenta diversidad en sistemas de reco es parecida a la de sistemas de recuperación de info También se desea eliminar la redundancia de los resultados Además la información asociada a las preferencias de los usuarios es incompleta Y con el propósito de responder a esta ambigüedad con diversidad se buscaría cubrir la variedad de los gustos del usuario No obstante, los sistemas de recomendación tienen un factor adicional importante y es ofrecerle al usuario productos novedosos Estos productos son los que le van a permitir al usuario descubrir y este es un factor fundamental Por estos motivos en sistemas de reco no solo se desea Explotar la información que tenemos dentro del perfil del usuario Pero también se desea ofrecer productos novedosos que le permitan al usuario explorar
  7. Adding diversity by exploiting the user profile or adding diversity by exploring new products…
  8. Idea de los puntos verdes y rojos Mostrar esto por partes y las conclusiones de cada parte Agregar las convenciones en la diapositiva Que sea binario según conor … yo creo q los circulos verdes y rojos puede ser Realmente resaltar que ninguno hace lo de abajo Relacionar más esta con la 12 En cuanto a las técnicas de diversificación Definimos un conjunto de criterios para comparar las técnicas El primer conjunto ayuda a entender el tipo de solución ofrecida El segundo conjunto analiza el trade-off entre diversidad vs relevancia En tercer lugar se analiza el trade-off entre explotación y exploración Y finalmente se explora la forma en qué se están comparando los elementos
  9. Idea de los puntos verdes y rojos Mostrar esto por partes y las conclusiones de cada parte Agregar las convenciones en la diapositiva Que sea binario según conor … yo creo q los circulos verdes y rojos puede ser Realmente resaltar que ninguno hace lo de abajo Relacionar más esta con la 12 En cuanto a las técnicas de diversificación Definimos un conjunto de criterios para comparar las técnicas El primer conjunto ayuda a entender el tipo de solución ofrecida El segundo conjunto analiza el trade-off entre diversidad vs relevancia En tercer lugar se analiza el trade-off entre explotación y exploración Y finalmente se explora la forma en qué se están comparando los elementos
  10. Idea de los puntos verdes y rojos Mostrar esto por partes y las conclusiones de cada parte Agregar las convenciones en la diapositiva Que sea binario según conor … yo creo q los circulos verdes y rojos puede ser Realmente resaltar que ninguno hace lo de abajo Relacionar más esta con la 12 En cuanto a las técnicas de diversificación Definimos un conjunto de criterios para comparar las técnicas El primer conjunto ayuda a entender el tipo de solución ofrecida El segundo conjunto analiza el trade-off entre diversidad vs relevancia En tercer lugar se analiza el trade-off entre explotación y exploración Y finalmente se explora la forma en qué se están comparando los elementos
  11. Como solución proponemos la técnica de diversificación XploDiv Esta es una técnica postfiltrado basada en un algoritmo voraz
  12. Decir que cada una se va explicar en detalle más adelante
  13. I’m using a fairly standard diversity and relevance metrics so i will focus more on the explanation of how we measure exploitation and exploration La relevancia se encuentra relacionada con el rating estimado por el algoritmo de recomendación Alto predicted rating alta relevancia
  14. Con la dimensión de diversidad se desea verificar la diversidad que el producto que estamos evaluando Le aportaría a R Esta medida se encuentra relacionada con la distancia del producto con respecto al conjunto de productos seleccionados previamente Respondiendo a la pregunta: Qué tan distante es el producto que estamos evaluando con respecto a los previamente seleccionados
  15. La dimensión de explotación busca darle mayor puntaje a los productos que se encuentran relacionados con el perfil del usuario Para lograrlo definimos como medida la Probabilidad de que productos dentro del perfil del usuario que son parecidos al producto que Se esta evaluando… tengan un rating alto Para más aclaración se pueden referir al documento de la memoria o lo podemos hacer en la sesión de preguntas
  16. Exploración es lo opuesto a explotación Se desea dar prioridad a productos que se encuentren lejanos del perfil del usuario los cuales son los productos novedosos
  17. Cuidado con la palabra hypothesis Cuidado con los colores Implícita o Explicita
  18. Implícita o Explicita
  19. Implícita o Explicita
  20. Mejorar resultados más detallados se pueden observar en el documento de las memorias
  21. Implícita o Explicita
  22. Implícita o Explicita
  23. How the perspectives interact with each other
  24. - All XploDiv approaches had a relevance
  25. Our technique is an improvement over current work, because in addition to providing comparable results, it can be tuned towards more diverse explorative results or more diverse exploitative results. Para lograrlo… En primer lugar llevamos a cabo una revisión bibliográfica La cual obtuvo como resultado una comparación analítica de trabajos relacionados Hasta donde sabemos una comparación similar con criterios no se encuentra en la literatura Considerando la revisión bibliografica se define una técnica novedosa de diversificación denominada XploDiv Para validar dicha técnica se desarrollo un prototipo funcional y Se define un Framework para Evaluar Sist. De Reo que permite evaluar resultados desde distintas perspectivas Contribuciones más importantes: Exploitation Exploration diversification technique denominada XploDiv Toma en cuenta el factor de exploración. Se puede ajustar para ofrecer resultados con “diversidad explotativa” o “diversidad explorativa”. Diversity-Aware Evaluation Framework for Recommender Systems Identifica y organiza métricas para evaluar diversidad en Sistemas de Recomendación. Define nuevas métricas para evaluar explotación y exploración. Comparación Analítica de Trabajos Relacionados Resalta ventajas y desventajas de trabajos relacionados.
  26. Lo de aggregation technique mejor traducir como lo puso angela … that takes results from different recos and selects o algo Para lograrlo… En primer lugar llevamos a cabo una revisión bibliográfica La cual obtuvo como resultado una comparación analítica de trabajos relacionados Hasta donde sabemos una comparación similar con criterios no se encuentra en la literatura Considerando la revisión bibliografica se define una técnica novedosa de diversificación denominada XploDiv Para validar dicha técnica se desarrollo un prototipo funcional y Se define un Framework para Evaluar Sist. De Reo que permite evaluar resultados desde distintas perspectivas Contribuciones más importantes: Exploitation Exploration diversification technique denominada XploDiv Toma en cuenta el factor de exploración. Se puede ajustar para ofrecer resultados con “diversidad explotativa” o “diversidad explorativa”. Diversity-Aware Evaluation Framework for Recommender Systems Identifica y organiza métricas para evaluar diversidad en Sistemas de Recomendación. Define nuevas métricas para evaluar explotación y exploración. Comparación Analítica de Trabajos Relacionados Resalta ventajas y desventajas de trabajos relacionados.
  27. En sintensis Al estudiar las técnicas de diversificación Se observa que existen dos formas de agregar diversidad a Sist. De Reco Un sist. Tradcional. Recibe como entrada un perfil de usuario y perfil de productos y estima los ratings desconocidos de un usuario Después Con una técnica de post-filtrdo o de ranking se organizan los resultados de acuerdo a la relevancia La diversidad se puede agregar para enriquecer un algoritmo de recomendación o para enriquecer la técnica de postfiltrado No obstante de los trabajos estudiados se resalta que ninguno considera explícitamente el factor de explotación Esto se debe a que la mayoría de trabajos para sistemas de reco son inspirados de sist. de recu de info donde ofrecer Novedad no es un factor importante A pesar de, en Sistemas de Reco la novedad es un factor fundamental
  28. Solo decir que especificamente nosotros decidimos enfocarnos en un postfiltering approach Resaltamos que en trabajos relacionados observamos q las técnicas para enriquecer un sist. Tradicional de reco Tienden a ser adaptaciones de técnicas de postfiltering por estos motivos Nos parece importante enfocarnos en técnicas de postfiltering que se pueden extender con mayor facilidad
  29. Implícita o Explicita
  30. Para definir el umbral tao visualizamos la distribución de distancias de los resultados con respecto al perfil del usuario en un histograma Se observan dos tipos de resultados que son los de explotación pura y los de exploración pura Se resalta que en los resultados de exploración pura se observa que existe una alta frecuencia de resultados que se encuentran muy cercanos a 1 Del diagrama concluimos que la distancia 0.9 es un buen umbral para diferenciar productos explorativos de explotativos
  31. Implícita o Explicita
  32. Implícita o Explicita
  33. No tener la ecuación todavia si decides agregar esta diapositiva En la aprox voraz se tiene …