Slides for the UMAP 2013 paper "Exploiting the Semantic Similarity of Contextual Situations for Pre-Filtering Recommendation"
1. Exploiting the Semantic Similarity of
Contextual Situations for Pre-Filtering
Recommendation
Victor Codina
Francesco Ricci
Luigi Ceccaroni
UMAP 2013
2. UMAP – June 2013, Rome, Italy
Outline
Context-Aware Recommender Systems (CARS)
State of the art
Novel contextual pre-filtering approach
Evaluation
3. Context matters
Main assumption: items can be experienced differently by the users depending on the current context
4. Context-Aware Recommender Systems (CARS)
CARS are locally adapted prediction models
Adapted to the "local" user contextual situation
Goal: to find the right level of contextualization for a target user and contextual situation
Global model (all user ratings) vs. local model (user ratings in context)
Difficult task: the optimal level of contextualization depends on several factors
Context relevance, enough ratings in context, …
5. Outline
Context-Aware Recommender Systems (CARS)
State of the art
Novel contextual pre-filtering approach
Evaluation
6. Exact pre-filtering
A direct approach that builds a strict local prediction model for each possible contextual situation
Only ratings acquired in the target context are used
Main limitation: its lack of flexibility
It always uses the maximum level of contextualization
It produces too-narrow prediction models that fail when:
The target context is not relevant enough
Not enough ratings were acquired in the situation (sparsity problem)
7. Generalized pre-filtering
It exploits context taxonomies to build generalized local models when needed (Adomavicius et al., 2005)
Example: a pre-defined context taxonomy from an e-commerce application (figure omitted)
Performance is limited by the quality of the context model
The pre-defined generalization may or may not suit the data
8. Approaches based on extended Matrix Factorization (MF)
All build a global MF model that integrates contextual information as additional model parameters
This implies that all ratings are used during model learning
Two main ways of extending MF with context:
Tensor Factorization, where context is integrated using high-order MF techniques (Karatzoglou et al., 2010)
Context-Aware MF, where context is integrated as contextual biases associated with the items/users (Baltrunas et al., 2011)
Main limitations:
Computational complexity (especially in Tensor Factorization)
Potential similarities between situations are not exploited
9. Outline
Context-Aware Recommender Systems (CARS)
State of the art
Novel contextual pre-filtering approach
Evaluation
10. Our solution: semantic pre-filtering
Similar idea as in generalized pre-filtering:
Our approach also builds semantically related local models
Key difference: we use a novel notion of semantic similarity between conditions
Based on their effect on the users' ratings (how they alter the rating value)
Advantages:
No external knowledge is needed (only rating data)
Non-conventional (cross-factor) similarities can be found
11. Similarity calculation between conditions based on their implicit semantics (I)
Main assumption: two conditions are similar if they influence the users' ratings similarly
Example (item-based): a condition-by-item co-occurrence matrix with rows sunny, family, and rainy and columns Natural Park 1, Natural Park 2, Natural Park 3, Walking Route 1, Museum 1, and Museum 2, where each entry stores the influence of the condition on that item's ratings (the slide shows only the sign of each entry)
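A minimal sketch of how such a condition-by-item influence matrix might be built. The paper's exact influence formula is not reproduced on the slide; using the deviation of an item's in-context mean rating from its overall mean is an assumed instantiation:

```python
import numpy as np

def influence_matrix(ratings, conditions, items):
    """Condition-by-item matrix of contextual influences.

    `ratings` is a list of (item, condition, rating) tuples.
    Each entry estimates how a condition shifts an item's ratings:
    the item's mean rating under the condition minus the item's
    overall mean rating (positive or negative, as on the slide).
    """
    M = np.zeros((len(conditions), len(items)))
    for j, item in enumerate(items):
        item_ratings = [r for (i, c, r) in ratings if i == item]
        if not item_ratings:
            continue  # item never rated: leave its column at zero
        overall = np.mean(item_ratings)
        for k, cond in enumerate(conditions):
            in_ctx = [r for (i, c, r) in ratings if i == item and c == cond]
            if in_ctx:
                M[k, j] = np.mean(in_ctx) - overall
    return M
```

For instance, if an item averages 3.7 overall but 4.5 when rated under `sunny`, the `sunny` entry for that item is positive, matching the sign-only view shown on the slide.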
12. Similarity calculation between conditions based on their implicit semantics (II)
To mitigate the sparsity of the co-occurrence matrix, we reduce its dimensionality by applying SVD
Example: each condition (sunny, family, rainy) is mapped to a vector of latent features (Latent Feature 1, 2, 3)
Conditions are then compared by the cosine similarity of their latent feature vectors
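The SVD step and the cosine comparison can be sketched in NumPy as follows; the truncation rank `k` is an assumed parameter:

```python
import numpy as np

def condition_similarities(M, k=2):
    """Reduce the condition-by-item influence matrix M with a
    truncated SVD, then compare conditions by the cosine similarity
    of their k-dimensional latent feature vectors."""
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    latent = U[:, :k] * s[:k]          # one latent vector per condition
    norms = np.linalg.norm(latent, axis=1, keepdims=True)
    norms[norms == 0] = 1.0            # guard against all-zero vectors
    unit = latent / norms
    return unit @ unit.T               # pairwise cosine similarity matrix
```

Two conditions that influence the same items in the same way end up with nearly parallel latent vectors, and hence a cosine similarity close to 1.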
13. Similarity calculation between conditions: an example in a tourism recommender
Similarities calculated from a tourism rating data set
(Top-5 similar conditions to cold weather)
14. Similarity calculation between situations defined by several conditions
We experimented with two pairwise similarity strategies for estimating a global similarity between two situations:
Best-pairs: only the best-matching pairs of conditions are considered
All-pairs: all possible pairs of condition similarities are considered
Example: with the contextual factors Weather, Mood, and Day of the week, the target situation (sunny, sad, unknown) is compared against the candidate situation (unknown, happy, weekend)
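A sketch of the two aggregation strategies, assuming a `cond_sim` function like the one derived from the latent feature vectors. The exact aggregation formulas are not shown on the slides, so both bodies are assumed instantiations:

```python
from itertools import product

def situation_similarity(target, candidate, cond_sim, strategy="best"):
    """Aggregate condition-to-condition similarities into a
    situation-to-situation similarity.

    `target` and `candidate` are the known conditions of each
    situation (unknown factors omitted); `cond_sim(a, b)` returns
    the semantic similarity of two conditions. 'best' keeps, for
    each target condition, only its best-matching candidate
    condition; 'all' averages over every pair.
    """
    pairs = [(a, b) for a, b in product(target, candidate)]
    if not pairs:
        return 0.0
    if strategy == "best":
        best = [max(cond_sim(a, b) for b in candidate) for a in target]
        return sum(best) / len(best)
    sims = [cond_sim(a, b) for a, b in pairs]
    return sum(sims) / len(sims)
```

In the slide's example, `target = ["sunny", "sad"]` and `candidate = ["happy", "weekend"]`: best-pairs keeps only the strongest match per target condition, while all-pairs averages all four cross-factor similarities.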
15. Building the semantically related local prediction models
Pipeline: given the target contextual situation (c*) and the similarity threshold (β), all ratings (Y) go through ratings filtering to obtain the relevant ratings (X), which are used for local prediction model building and, finally, rating predictions
FOR EACH (user u, item i) in Y DO
    IF a rating of i by u in c* exists THEN
        ADD it to X
    ELSE
        GET Sui = { ratings of i by u in situations c with Sim(c, c*) ≥ β }
        ADD AVG(Sui) to X
    END IF
END FOR
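The filtering loop on the slide can be sketched in Python; the data layout (a dict from user-item pairs to per-situation ratings) and the handling of empty similarity sets are assumptions:

```python
def semantic_prefilter(Y, target, sim, beta):
    """Select the relevant ratings X for a target situation.

    `Y` maps (user, item) to a dict {situation: rating};
    `sim(c, target)` is the situation-to-situation semantic
    similarity and `beta` the similarity threshold. A rating given
    exactly in the target situation is kept as-is; otherwise the
    ratings from sufficiently similar situations are averaged.
    """
    X = {}
    for (u, i), by_situation in Y.items():
        if target in by_situation:
            X[(u, i)] = by_situation[target]  # exact match: keep it
            continue
        S_ui = [r for c, r in by_situation.items() if sim(c, target) >= beta]
        if S_ui:  # skip pairs with no rating in a similar-enough situation
            X[(u, i)] = sum(S_ui) / len(S_ui)
    return X
```

With `beta = 1` only exact matches survive (exact pre-filtering); with `beta = -1` every rating qualifies (the global model), matching the threshold behavior described on the next slide.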
16. The similarity threshold controls the level of contextualization
If β = 1, it works as exact pre-filtering
Sim(c, c*) = 1 only when c = c*
If β = −1, it works as the global model
All ratings are used for training the model
Two methods to set the optimal threshold (β):
Using a global threshold for all possible target situations
Using a local threshold per target situation
17. Outline
Context-Aware Recommender Systems (CARS)
State of the art
Novel contextual pre-filtering approach
Evaluation
18. Experimental Setup (I)
Semantic pre-filtering vs. Context-Aware MF (CAMF)
CAMF-CC and CAMF-CI variants (Baltrunas et al., 2011)
Two approaches as baselines:
Exact pre-filtering
Bias MF (context-free)
Exact and semantic pre-filtering use Bias MF to build the local rating prediction models
Bias MF prediction: r̂(u,i) = μ + b(u) + b(i) + q(i)·p(u), where μ is the overall rating average, b(u) the user bias, and b(i) the item bias
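Bias MF is standard biased matrix factorization; a minimal sketch of its prediction rule, with parameter names assumed for illustration:

```python
import numpy as np

def biasmf_predict(mu, b_user, b_item, P, Q, u, i):
    """Bias MF rating prediction: the overall rating average plus
    the user bias, the item bias, and the dot product of the user
    and item latent factor vectors."""
    return mu + b_user[u] + b_item[i] + P[u] @ Q[i]
```

Pre-filtering approaches train this model only on the filtered ratings X, while the context-free baseline trains it on all ratings.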
19. Experimental Setup (II)
Rating prediction accuracy measured in terms of:
Mean Absolute Error (MAE)
Root Mean Squared Error (RMSE)
Per-user training/test splitting protocol:
For each user, 5 test ratings were randomly selected
Only users with more than 10 ratings were used for testing
We discarded a test rating if the user had not rated at least 1 additional item in the same contextual situation
A necessary condition for applying MF with exact pre-filtering
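The two error metrics above are standard and can be computed as:

```python
import math

def mae(actual, predicted):
    """Mean Absolute Error: average absolute deviation."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

def rmse(actual, predicted):
    """Root Mean Squared Error: like MAE, but penalizes
    large errors more heavily via squaring."""
    return math.sqrt(
        sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)
    )
```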
20. Data sets
5 contextually tagged rating data sets with different types of contextual information:
One condition per situation in Music and Tourism
Several conditions per situation in the other data sets
In MovieLens and LibraryThing, user tags are used as context
21. Performance comparison for data sets with one condition per situation
Bar chart: error reduction (MAE) with respect to Bias MF
22. Performance comparison for data sets with several conditions per situation
Bar chart: error reduction (MAE) with respect to Bias MF
23. Conclusions
A novel contextual pre-filtering approach, which outperforms state-of-the-art context-aware MF models by aggregating rating data acquired in situations that are similar to the target contextual situation
A novel notion of semantic similarity between conditions, which estimates two conditions as similar if they influence the users' ratings similarly
24. Future work
New semantic pre-filtering variants:
To improve the precision of situation-to-situation similarities
To better exploit the similarities during ratings filtering
Extended evaluation:
New data sets / offline metrics (e.g., top-N recommendation)
User study
New semantically enhanced context-aware approaches:
Using the post-filtering strategy
Using the contextual modeling strategy (e.g., extended MF)
25. Exploiting the Semantic Similarity of Contextual Situations for Pre-Filtering Recommendation
Any comments or questions?
Victor Codina
vcodina@lsi.upc.edu
Speaker notes
Introduce myself. This work was done in collaboration with Francesco Ricci from the Free University of Bolzano and with Luigi Ceccaroni from BDigital, a technology centre in Barcelona. Main message: what do we present in this work?
Divided into two parts: the first about the main ideas behind CARS and the limitations of current approaches; the second about our novel pre-filtering approach and its preliminary evaluation.
Main assumption (ratings can be influenced by the current context). Explain the examples (different weather conditions completely change the user experience of visiting these two points of interest).
How do we see CARS? As locally adapted prediction models. Main goal: to find the optimal level of contextualization. But determining this optimal level is a difficult task: several factors influence it (the relevance of the situation, the quantity of ratings acquired in this context, …).
This is the approach that rigidly implements the idea of locally adapted prediction models. Its main limitation is its lack of flexibility, and therefore it only works when, in the target context, …
A solution based on the exploitation of hierarchical relations between conditions. Its main limitation is that its performance depends on the pre-defined context taxonomy, which may or may not suit the data. Here you can see an example of such a taxonomy in an e-commerce application.
A more recent family of CARS approaches consists of extending the well-known MF prediction model with additional parameters, so with these approaches a global model is built. There are two main ways of extending MF: TF and contextual baseline predictors. Main limitations: computational complexity; similarities between situations/conditions are not exploited.
Similarly to …, we also propose a pre-filtering approach that… The key difference is that we use a novel notion of semantic similarity based on …, instead of relying on pre-defined context taxonomies. Main advantages.
The main assumption of our method for semantic similarity calculation is that… The method consists of building a co-occurrence matrix where each entry measures the influence of the condition with respect to an item or user, which can be positive or negative. This table shows an example of such a co-occurrence matrix using items of the system, in this case points of interest. To simplify, here we only show the sign of the influence, but we actually store real values. As shown in this formula, we measure the influence… Future work: instead of measuring the influence of each condition separately, measure it using situations directly (only the most popular ones?) (maybe useful in tag data sets). Ricci's comment: explain the formula better… (5 min extra)
In order to mitigate the sparsity problem of high-dimensional matrices… Cosine similarity of the latent feature vectors.
As an example, here I show some of the similarities estimated… In particular, it only shows the top-5 conditions similar to the weather condition, using item-based or user-based semantics. Clear differences: item-based similarities reflect that the conditions influence the same items similarly, while user-based similarities reflect that the conditions influence the same users similarly. Unexpected similarities.
In complex situations, defined by several conditions, we need a similarity measure that can compare them. We experimented with two pairwise similarity strategies, which compute a global similarity… Best-pairs = only the best-matching pairs of conditions, as shown in the example… All-pairs = all the possible similarities.
Having these similarities between situations, our pre-filtering approach works as follows: given… and given…, we select as relevant ratings those that were acquired… After this process, if we have several ratings for the same user and item, we… Once the relevant ratings are selected, we use them to…
The threshold controls the level of contextualization (1 = exact pre-filtering, −1 = global model). It can be learnt at different granularities. Two variants: one using a global threshold…
Performance comparison with respect to the state of the art. An MF model is used for building the local models in the pre-filtering approaches in order to properly compare performance (but other recommendation models can also be used). In particular, we have used a standard MF…
We have considered 5 data sets.
The bar charts show performance using the data sets where… Here we compare two variants: one using the global threshold… On the one hand, the semantic pre-filtering variants outperform… On the other hand, the one using global thresholds achieves the best results in both data sets (change color).