SlideShare una empresa de Scribd logo
1 de 69
Descargar para leer sin conexión
DOCTORAL SYMPOSIUM
Exploring Statistical Language Models for
Recommender Systems
RecSys 2015
16 - 20 September, Vienna, Austria
Daniel Valcarce
@dvalcarce
Information Retrieval Lab
University of A Coruña
Spain
Motivation
1
Information Retrieval vs Information Filtering (1)
Information Retrieval (IR) Information Filtering (IF)
2
Information Retrieval vs Information Filtering (1)
Information Retrieval (IR)
Goal: Retrieve relevant
documents according to the
information need of a user
Information Filtering (IF)
2
Information Retrieval vs Information Filtering (1)
Information Retrieval (IR)
Goal: Retrieve relevant
documents according to the
information need of a user
Examples: Search engines
(web, multimedia...)
Information Filtering (IF)
2
Information Retrieval vs Information Filtering (1)
Information Retrieval (IR)
Goal: Retrieve relevant
documents according to the
information need of a user
Examples: Search engines
(web, multimedia...)
Information Filtering (IF)
Goal: Select relevant items
from an information stream
for a given user
2
Information Retrieval vs Information Filtering (1)
Information Retrieval (IR)
Goal: Retrieve relevant
documents according to the
information need of a user
Examples: Search engines
(web, multimedia...)
Information Filtering (IF)
Goal: Select relevant items
from an information stream
for a given user
Examples: spam filters,
recommender systems
2
Information Retrieval vs Information Filtering (1)
Information Retrieval (IR)
Goal: Retrieve relevant
documents according to the
information need of a user
Examples: Search engines
(web, multimedia...)
Input: The user’s query
(explicit).
Information Filtering (IF)
Goal: Select relevant items
from an information stream
for a given user
Examples: spam filters,
recommender systems
Input: The user’s history
(implicit).
2
Information Retrieval vs Information Filtering (2)
Some people consider them different fields:
U. Hanani, B. Shapira and P. Shoval: Information
Filtering: Overview of Issues, Research and Systems in
User Modeling and User-Adapted Interaction (2001)
3
Information Retrieval vs Information Filtering (2)
Some people consider them different fields:
U. Hanani, B. Shapira and P. Shoval: Information
Filtering: Overview of Issues, Research and Systems in
User Modeling and User-Adapted Interaction (2001)
While other consider them the same thing:
N. J. Belkin and W. B. Croft: Information filtering and
information retrieval: two sides of the same coin? in
Communications of the ACM (1992)
3
Information Retrieval vs Information Filtering (2)
Some people consider them different fields:
U. Hanani, B. Shapira and P. Shoval: Information
Filtering: Overview of Issues, Research and Systems in
User Modeling and User-Adapted Interaction (2001)
While other consider them the same thing:
N. J. Belkin and W. B. Croft: Information filtering and
information retrieval: two sides of the same coin? in
Communications of the ACM (1992)
What is undeniable is that they are closely related:
Why not apply techniques from one field to the other?
3
Information Retrieval vs Information Filtering (2)
Some people consider them different fields:
U. Hanani, B. Shapira and P. Shoval: Information
Filtering: Overview of Issues, Research and Systems in
User Modeling and User-Adapted Interaction (2001)
While other consider them the same thing:
N. J. Belkin and W. B. Croft: Information filtering and
information retrieval: two sides of the same coin? in
Communications of the ACM (1992)
What is undeniable is that they are closely related:
Why not apply techniques from one field to the other?
It has already been done!
3
Information Retrieval vs Information Filtering (3)
Information Retrieval (IR)
Some retrieval techniques are:
Information Filtering (IF)
Some CF techniques are:
4
Information Retrieval vs Information Filtering (3)
Information Retrieval (IR)
Some retrieval techniques are:
Vector: Vector Space Model
Information Filtering (IF)
Some CF techniques are:
Vector: Pairwise similarities
(cosine, Pearson)
4
Information Retrieval vs Information Filtering (3)
Information Retrieval (IR)
Some retrieval techniques are:
Vector: Vector Space Model
MF: Latent Semantic
Indexing (LSI)
Information Filtering (IF)
Some CF techniques are:
Vector: Pairwise similarities
(cosine, Pearson)
MF: SVD, NMF
4
Information Retrieval vs Information Filtering (3)
Information Retrieval (IR)
Some retrieval techniques are:
Vector: Vector Space Model
MF: Latent Semantic
Indexing (LSI)
Probabilistic: LDA
Information Filtering (IF)
Some CF techniques are:
Vector: Pairwise similarities
(cosine, Pearson)
MF: SVD, NMF
Probabilistic: LDA and
other PGMs
4
Information Retrieval vs Information Filtering (3)
Information Retrieval (IR)
Some retrieval techniques are:
Vector: Vector Space Model
MF: Latent Semantic
Indexing (LSI)
Probabilistic: LDA,
Language Models (LM)
Information Filtering (IF)
Some CF techniques are:
Vector: Pairwise similarities
(cosine, Pearson)
MF: SVD, NMF
Probabilistic: LDA and
other PGMs
4
Language Models for Recommendation: Research goals
Language Models (LM) represented a breakthrough in
Information Retrieval:
State-of-the-art technique for text retrieval
Solid statistical foundation
5
Language Models for Recommendation: Research goals
Language Models (LM) represented a breakthrough in
Information Retrieval:
State-of-the-art technique for text retrieval
Solid statistical foundation
Maybe they can also be useful in RecSys:
5
Language Models for Recommendation: Research goals
Language Models (LM) represented a breakthrough in
Information Retrieval:
State-of-the-art technique for text retrieval
Solid statistical foundation
Maybe they can also be useful in RecSys:
Are LM a good framework for Collaborative Filtering?
5
Language Models for Recommendation: Research goals
Language Models (LM) represented a breakthrough in
Information Retrieval:
State-of-the-art technique for text retrieval
Solid statistical foundation
Maybe they can also be useful in RecSys:
Are LM a good framework for Collaborative Filtering?
Can LM be adapted to deal with temporal (TARS) and/or
contextual information (CARS)?
5
Language Models for Recommendation: Research goals
Language Models (LM) represented a breakthrough in
Information Retrieval:
State-of-the-art technique for text retrieval
Solid statistical foundation
Maybe they can also be useful in RecSys:
Are LM a good framework for Collaborative Filtering?
Can LM be adapted to deal with temporal (TARS) and/or
contextual information (CARS)?
A principled formulation of LM that combines
Content-Based and Collaborative Filtering?
5
Language Models for Recommendation: Research goals
Language Models (LM) represented a breakthrough in
Information Retrieval:
State-of-the-art technique for text retrieval
Solid statistical foundation
Maybe they can also be useful in RecSys:
Are LM a good framework for Collaborative Filtering?
Can LM be adapted to deal with temporal (TARS) and/or
contextual information (CARS)?
A principled formulation of LM that combines
Content-Based and Collaborative Filtering?
5
Language Models for Recommendation: Related work
There is little work done in using Language Models for CF:
J. Wang, A. P. de Vries and M. J. Reinders: A User-Item
Relevance Model for Log-based Collaborative Filtering
in ECIR 2006
6
Language Models for Recommendation: Related work
There is little work done in using Language Models for CF:
J. Wang, A. P. de Vries and M. J. Reinders: A User-Item
Relevance Model for Log-based Collaborative Filtering
in ECIR 2006
A. Bellogín, J. Wang and P. Castells: Bridging
Memory-Based Collaborative Filtering and Text
Retrieval in Information Retrieval (2013)
6
Language Models for Recommendation: Related work
There is little work done in using Language Models for CF:
J. Wang, A. P. de Vries and M. J. Reinders: A User-Item
Relevance Model for Log-based Collaborative Filtering
in ECIR 2006
A. Bellogín, J. Wang and P. Castells: Bridging
Memory-Based Collaborative Filtering and Text
Retrieval in Information Retrieval (2013)
J. Parapar, A. Bellogín, P. Castells and Á. Barreiro:
Relevance-Based Language Modelling for Recommender
Systems in Information Processing & Management (2013)
6
Language Models for Recommendation: Related work
There is little work done in using Language Models for CF:
J. Wang, A. P. de Vries and M. J. Reinders: A User-Item
Relevance Model for Log-based Collaborative Filtering
in ECIR 2006
A. Bellogín, J. Wang and P. Castells: Bridging
Memory-Based Collaborative Filtering and Text
Retrieval in Information Retrieval (2013)
J. Parapar, A. Bellogín, P. Castells and Á. Barreiro:
Relevance-Based Language Modelling for Recommender
Systems in Information Processing & Management (2013)
6
Relevance-Based Language Models
for Collaborative Filtering
6
Relevance-Based Language Models
Relevance-Based Language Models or Relevance Models (RM)
are a pseudo-relevance feedback technique from IR.
Pseudo-relevance feedback is an automatic query expansion
technique.
The expanded query is expected to yield better results than the
original one.
7
Pseudo-relevance feedback
Information need
8
Pseudo-relevance feedback
Information need
query
8
Pseudo-relevance feedback
Information need
query Retrieval
System
8
Pseudo-relevance feedback
Information need
query Retrieval
System
8
Pseudo-relevance feedback
Information need
query Retrieval
System
8
Pseudo-relevance feedback
Information need
query Retrieval
System
8
Pseudo-relevance feedback
Information need
query Retrieval
System
Query
Expansion
expanded
query
8
Pseudo-relevance feedback
Information need
query Retrieval
System
Query
Expansion
expanded
query
8
Relevance-Based Language Models for CF Recommendation (1)
IR RecSys
User’s query User’s profile
mostˆ1,populatedˆ1,stateˆ2 Titanicˆ2,Avatarˆ3,Sharkˆ5
Documents
Neighbours
Terms
Items
9
Relevance-Based Language Models for CF Recommendation (2)
Parapar et al. (2013):
RM2 : p(i|Ru) ∝ p(i)
j∈Iu v∈Vu
p(i|v) p(v)
p(i)
p(j|v)
Iu is the set of items rated by the user u
Vu is neighbourhood of the user u. This is computed using a
clustering algorithm
p(i|u) is computed smoothing the maximum likelihood
estimate with the probability in the collection
p(i) and p(v) are the item and user priors
10
Relevance-Based Language Models for CF Recommendation (2)
Parapar et al. (2013):
RM2 : p(i|Ru) ∝ p(i)
j∈Iu v∈Vu
p(i|v) p(v)
p(i)
p(j|v)
Iu is the set of items rated by the user u
Vu is neighbourhood of the user u. This is computed
using a clustering algorithm
p(i|u) is computed smoothing the maximum likelihood
estimate with the probability in the collection
p(i) and p(v) are the item and user priors
10
Relevance-Based Language Models for CF Recommendation (2)
Parapar et al. (2013):
RM2 : p(i|Ru) ∝ p(i)
j∈Iu v∈Vu
p(i|v) p(v)
p(i)
p(j|v)
Iu is the set of items rated by the user u
Vu is neighbourhood of the user u. This is computed using a
clustering algorithm
p(i|u) is computed smoothing the maximum likelihood
estimate with the probability in the collection
p(i) and p(v) are the item and user priors
10
Relevance-Based Language Models for CF Recommendation (2)
Parapar et al. (2013):
RM2 : p(i|Ru) ∝ p(i)
j∈Iu v∈Vu
p(i|v) p(v)
p(i)
p(j|v)
Iu is the set of items rated by the user u
Vu is neighbourhood of the user u. This is computed using a
clustering algorithm
p(i|u) is computed smoothing the maximum likelihood
estimate with the probability in the collection
p(i) and p(v) are the item and user priors
10
Smoothing methods
10
Smoothing in RM2
RM2 : p(i|Ru) ∝ p(i)
j∈Iu v∈Vu
p(i|v) p(v)
p(i)
p(j|v)
p(i|u) is computed smoothing the maximum likelihood
estimate:
pml(i|u) =
ru,i
j∈Iu
ru,j
with the probability in the collection:
p(i|C) = v∈U rv,i
j∈I, v∈U rv,j
11
Why use smoothing?
In Information Retrieval, smoothing provides:
A way to deal with data sparsity
The inverse document frequency (IDF) role
Document length normalisation
12
Why use smoothing?
In Information Retrieval, smoothing provides:
A way to deal with data sparsity
The inverse document frequency (IDF) role
Document length normalisation
In RecSys, we have the same problems:
Data sparsity
Item popularity vs item specificity
Profiles with different lengths
12
Smoothing techniques
Jelinek-Mercer (JM): Linear interpolation. Parameter λ.
pλ(i|u) = (1 − λ) pml(i|u) + λ p(i|C)
Dirichlet priors (DP): Bayesian analysis. Parameter µ.
pµ(i|u) =
ru,i + µ p(i|C)
µ + j∈Iu
ru,j
Absolute Discounting (AD): Subtract a constant δ.
pδ(i|u) =
max(ru,i − δ, 0) + δ |Iu| p(i|C)
j∈Iu
ru,j
13
Experiments with smoothing
13
Smoothing: ranking accuracy
0.20
0.25
0.30
0.35
0 100 200 300 400 500 600 700 800 900 1000
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
nDCG@10
µ
λ, δ
RM2 + AD
RM2 + JM
RM2 + DP
Figure: nDCG@10 values of RM2 varying the smoothing method
using 400 nearest neighbours according to Pearson’s correlation on
MovieLens 100k dataset
14
Smoothing: diversity
0.010
0.015
0.020
0.025
0.030
0 100 200 300 400 500 600 700 800 900 1000
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
Gini@10
µ
λ, δ
RM2 + AD
RM2 + JM
RM2 + DP
Figure: Gini@10 values of RM2 varying the smoothing method using
400 nearest neighbours according to Pearson’s correlation on
MovieLens 100k dataset
15
Smoothing: novelty
7.5
8.0
8.5
9.0
9.5
0 100 200 300 400 500 600 700 800 900 1000
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
MSI@10
µ
λ, δ
RM2 + AD
RM2 + JM
RM2 + DP
Figure: MSI@10 values of RM2 varying the smoothing method using
400 nearest neighbours according to Pearson’s correlation on
MovieLens 100k dataset
16
More about smoothings in RM2 for CF
More details about smoothings in:
D. Valcarce, J. Parapar, Á. Barreiro: A Study of
Smoothing Methods for Relevance-Based Language
Modelling of Recommender Systems in ECIR 2015
17
Priors
17
Priors in RM2
RM2 : p(i|Ru) ∝ p(i)
j∈Iu v∈Vu
p(i|v) p(v)
p(i)
p(j|v)
p(i) and p(v) are the item and user priors:
Enable to introduce a priori information into the model
18
Priors in RM2
RM2 : p(i|Ru) ∝ p(i)
j∈Iu v∈Vu
p(i|v) p(v)
p(i)
p(j|v)
p(i) and p(v) are the item and user priors:
Enable to introduce a priori information into the model
Provide a principled way of modelling business rules!
18
Prior estimates
Uniform (U) Linear (L)
User prior pU(u) =
1
|U|
pL(u) = i∈Iu
ru,i
v∈U j∈Iv
rv,j
Item prior pU(i) =
1
|I|
pL(i) =
u∈Ui
ru,i
j∈I v∈Uj
rv,j
19
Experiments with priors
19
Priors on MovieLens 100k
User prior Item prior nDCG@10 Gini@10 MSI@10
Linear Linear 0.0922 0.4603 28.4284
Uniform Linear 0.2453 0.2027 16.4022
Uniform Uniform 0.3296 0.0256 6.8273
Linear Uniform 0.3423 0.0264 6.7848
Table: nDCG@10, Gini@10 and MSI@10 values of RM2 varying the
prior estimates using 400 nearest neighbours according to Pearson’s
correlation on MovieLens 100k dataset and Absolute Discounting
(δ = 0.1)
More priors in
D. Valcarce, J. Parapar and Á. Barreiro: A Study of Priors
for Relevance-Based Language Modelling of
Recommender Systems in RecSys 2015!
20
Comparison with other CF algorithms
20
Priors on MovieLens 100k
Algorithm nDCG@10 Gini@10 MSI@10
SVD 0.0946 0.0109 14.6129
SVD++ 0.1113 0.0126 14.9574
NNCosNgbr 0.1771 0.0344 16.8222
UIR-Item 0.2188 0.0124 5.2337
PureSVD 0.3595 0.1364 11.8841
RM2-JM 0.3175 0.0232 9.1087
RM2-DP 0.3274 0.0251 9.2181
RM2-AD 0.3296 0.0256 9.2409
RM2-AD-L-U 0.3423 0.0264 9.2004
Table: nDCG@10, Gini@10 and MSI@10 values of different CF
recommendation algorithms
21
Conclusions and future directions
21
Conclusions
IR techniques can be employed in RecSys
Not only methods such as SVD...
but also Language Models!
22
Conclusions
IR techniques can be employed in RecSys
Not only methods such as SVD...
but also Language Models!
Language Models provide a principled and interpretable
framework for recommendation.
22
Conclusions
IR techniques can be employed in RecSys
Not only methods such as SVD...
but also Language Models!
Language Models provide a principled and interpretable
framework for recommendation.
Relevance-Based Language Models are competitive, but there is
room for improvements:
More sophisticated priors
22
Conclusions
IR techniques can be employed in RecSys
Not only methods such as SVD...
but also Language Models!
Language Models provide a principled and interpretable
framework for recommendation.
Relevance-Based Language Models are competitive, but there is
room for improvements:
More sophisticated priors
Neighbourhood computation
◦ Different similarity metrics: cosine, Kullback–Leibler
divergence
◦ Matrix factorisation: NMF, SVD
◦ Spectral clustering: NC
22
Future work
Improve novelty and diversity figures:
RM2 performance is similar to PureSVD in terms of nDCG
but it fails in terms of diversity and novelty
23
Future work
Improve novelty and diversity figures:
RM2 performance is similar to PureSVD in terms of nDCG
but it fails in terms of diversity and novelty
Introduce more evidences in the LM framework apart from
ratings:
Content-based information (hybrid recommender)
Temporal and contextual information (TARS & CARS)
23
Thank you!
@dvalcarce
http://www.dc.fi.udc.es/~dvalcarce
Time and Context in Language Models
Time:
X. Li and W. B. Croft: Time-based Language Models in
CIKM 2003
K. Berberich, S. Bedathur, O. Alonso and G. Weikum: A
language modeling approach for temporal information
needs in ECIR 2010
Context:
H. Rode and D. Hiemstra: Conceptual Language Models
for Context-Aware Text Retrieval in TREC 2004
L. Azzopardi: Incorporating Context within the
Language Modeling Approach for ad hoc Information
Retrieval. PhD Thesis (2005)
25

Más contenido relacionado

La actualidad más candente

Matching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sourcesMatching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sources
IJwest
 
Schema-agnositc queries over large-schema databases: a distributional semanti...
Schema-agnositc queries over large-schema databases: a distributional semanti...Schema-agnositc queries over large-schema databases: a distributional semanti...
Schema-agnositc queries over large-schema databases: a distributional semanti...
Andre Freitas
 
Text Data Mining
Text Data MiningText Data Mining
Text Data Mining
KU Leuven
 
Text data mining1
Text data mining1Text data mining1
Text data mining1
KU Leuven
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mapping
butest
 

La actualidad más candente (20)

Role of Text Mining in Search Engine
Role of Text Mining in Search EngineRole of Text Mining in Search Engine
Role of Text Mining in Search Engine
 
FAIRness through a novel combination of Web technologies
FAIRness through a novel combination of Web technologiesFAIRness through a novel combination of Web technologies
FAIRness through a novel combination of Web technologies
 
Matching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sourcesMatching and merging anonymous terms from web sources
Matching and merging anonymous terms from web sources
 
ABSTAT: Ontology-driven Linked Data Summaries with Pattern Minimalization
ABSTAT: Ontology-driven Linked Data Summaries with Pattern MinimalizationABSTAT: Ontology-driven Linked Data Summaries with Pattern Minimalization
ABSTAT: Ontology-driven Linked Data Summaries with Pattern Minimalization
 
SPARQL Query Verbalization for Explaining Semantic Search Engine Queries
SPARQL Query Verbalization for Explaining Semantic Search Engine QueriesSPARQL Query Verbalization for Explaining Semantic Search Engine Queries
SPARQL Query Verbalization for Explaining Semantic Search Engine Queries
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSIS
 
Context, Perspective, and Generalities in a Knowledge Ontology
Context, Perspective, and Generalities in a Knowledge OntologyContext, Perspective, and Generalities in a Knowledge Ontology
Context, Perspective, and Generalities in a Knowledge Ontology
 
Information Extraction
Information ExtractionInformation Extraction
Information Extraction
 
Lecture20 xing
Lecture20 xingLecture20 xing
Lecture20 xing
 
Tutorial - Introduction to Rule Technologies and Systems
Tutorial - Introduction to Rule Technologies and SystemsTutorial - Introduction to Rule Technologies and Systems
Tutorial - Introduction to Rule Technologies and Systems
 
Lexical Distribution in Citation Contexts through the IMRaD Standard - ECIR-2...
Lexical Distribution in Citation Contexts through the IMRaD Standard - ECIR-2...Lexical Distribution in Citation Contexts through the IMRaD Standard - ECIR-2...
Lexical Distribution in Citation Contexts through the IMRaD Standard - ECIR-2...
 
Schema-agnositc queries over large-schema databases: a distributional semanti...
Schema-agnositc queries over large-schema databases: a distributional semanti...Schema-agnositc queries over large-schema databases: a distributional semanti...
Schema-agnositc queries over large-schema databases: a distributional semanti...
 
Text Data Mining
Text Data MiningText Data Mining
Text Data Mining
 
Text data mining1
Text data mining1Text data mining1
Text data mining1
 
Harvester_presentaion
Harvester_presentaionHarvester_presentaion
Harvester_presentaion
 
Ontology Mapping
Ontology MappingOntology Mapping
Ontology Mapping
 
Gleaning Types for Literals in RDF with Application to Entity Summarization
Gleaning Types for Literals in RDF with Application to Entity SummarizationGleaning Types for Literals in RDF with Application to Entity Summarization
Gleaning Types for Literals in RDF with Application to Entity Summarization
 
Big Data & Text Mining
Big Data & Text MiningBig Data & Text Mining
Big Data & Text Mining
 
Textmining Information Extraction
Textmining Information ExtractionTextmining Information Extraction
Textmining Information Extraction
 
Data Integration Ontology Mapping
Data Integration Ontology MappingData Integration Ontology Mapping
Data Integration Ontology Mapping
 

Destacado

Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Alejandro Bellogin
 
Recommender systemms search engines
Recommender systemms search enginesRecommender systemms search engines
Recommender systemms search engines
ברכה שפירא
 
Two sides of the same coin with text 2
Two sides of the same coin with text 2Two sides of the same coin with text 2
Two sides of the same coin with text 2
LiteracyCenter
 
Design of recommender systems
Design of recommender systemsDesign of recommender systems
Design of recommender systems
Rashmi Sinha
 

Destacado (10)

A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...
A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...
A Study of Priors for Relevance-Based Language Modelling of Recommender Syste...
 
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
 
Is Unlabeled Data Suitable for Multiclass SVM-based Web Page Classification?
Is Unlabeled Data Suitable for Multiclass SVM-based Web Page Classification?Is Unlabeled Data Suitable for Multiclass SVM-based Web Page Classification?
Is Unlabeled Data Suitable for Multiclass SVM-based Web Page Classification?
 
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
Improving Memory-Based Collaborative Filtering by Neighbour Selection based o...
 
Recommender systemms search engines
Recommender systemms search enginesRecommender systemms search engines
Recommender systemms search engines
 
Two sides of the same coin with text 2
Two sides of the same coin with text 2Two sides of the same coin with text 2
Two sides of the same coin with text 2
 
Models for Information Retrieval and Recommendation
Models for Information Retrieval and RecommendationModels for Information Retrieval and Recommendation
Models for Information Retrieval and Recommendation
 
Tutorial: Context-awareness In Information Retrieval and Recommender Systems
Tutorial: Context-awareness In Information Retrieval and Recommender SystemsTutorial: Context-awareness In Information Retrieval and Recommender Systems
Tutorial: Context-awareness In Information Retrieval and Recommender Systems
 
Semantics-aware Techniques for Social Media Analysis, User Modeling and Recom...
Semantics-aware Techniques for Social Media Analysis, User Modeling and Recom...Semantics-aware Techniques for Social Media Analysis, User Modeling and Recom...
Semantics-aware Techniques for Social Media Analysis, User Modeling and Recom...
 
Design of recommender systems
Design of recommender systemsDesign of recommender systems
Design of recommender systems
 

Similar a Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS Slides]

Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking  Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking
Mohamed BEN ELLEFI
 
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early AdoptersApril 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
National Information Standards Organization (NISO)
 
CLAIR: Computational Linguistics And Information Retrieval
CLAIR: Computational Linguistics And Information RetrievalCLAIR: Computational Linguistics And Information Retrieval
CLAIR: Computational Linguistics And Information Retrieval
butest
 
Faceted Navigation (LACASIS Fall Workshop 2005)
Faceted Navigation (LACASIS Fall Workshop 2005)Faceted Navigation (LACASIS Fall Workshop 2005)
Faceted Navigation (LACASIS Fall Workshop 2005)
Bradley Allen
 

Similar a Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS Slides] (20)

07 04-06
07 04-0607 04-06
07 04-06
 
Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking  Profile-based Dataset Recommendation for RDF Data Linking
Profile-based Dataset Recommendation for RDF Data Linking
 
Semantics-aware Content-based Recommender Systems
Semantics-aware Content-based Recommender SystemsSemantics-aware Content-based Recommender Systems
Semantics-aware Content-based Recommender Systems
 
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early AdoptersApril 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
 
CLAIR: Computational Linguistics And Information Retrieval
CLAIR: Computational Linguistics And Information RetrievalCLAIR: Computational Linguistics And Information Retrieval
CLAIR: Computational Linguistics And Information Retrieval
 
Information_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_HabibInformation_Retrieval_Models_Nfaoui_El_Habib
Information_Retrieval_Models_Nfaoui_El_Habib
 
Information Retrieval Models for Recommender Systems - PhD slides
Information Retrieval Models for Recommender Systems - PhD slidesInformation Retrieval Models for Recommender Systems - PhD slides
Information Retrieval Models for Recommender Systems - PhD slides
 
The Progress of BIBFRAME, by Angela Kroeger
The Progress of BIBFRAME, by Angela KroegerThe Progress of BIBFRAME, by Angela Kroeger
The Progress of BIBFRAME, by Angela Kroeger
 
Faceted Navigation (LACASIS Fall Workshop 2005)
Faceted Navigation (LACASIS Fall Workshop 2005)Faceted Navigation (LACASIS Fall Workshop 2005)
Faceted Navigation (LACASIS Fall Workshop 2005)
 
Closing the Gap: Data Models for Documentary Linguistics
Closing the Gap: Data Models for Documentary LinguisticsClosing the Gap: Data Models for Documentary Linguistics
Closing the Gap: Data Models for Documentary Linguistics
 
A Framework to Automatically Extract Funding Information from Text
A Framework to Automatically Extract Funding Information from TextA Framework to Automatically Extract Funding Information from Text
A Framework to Automatically Extract Funding Information from Text
 
Semantic Search Component
Semantic Search ComponentSemantic Search Component
Semantic Search Component
 
From Linked Data to Semantic Applications
From Linked Data to Semantic ApplicationsFrom Linked Data to Semantic Applications
From Linked Data to Semantic Applications
 
Introduction to Ontology Concepts and Terminology
Introduction to Ontology Concepts and TerminologyIntroduction to Ontology Concepts and Terminology
Introduction to Ontology Concepts and Terminology
 
RDA for Original Catalogers
RDA for Original CatalogersRDA for Original Catalogers
RDA for Original Catalogers
 
Profiling Linked Open Data
Profiling Linked Open DataProfiling Linked Open Data
Profiling Linked Open Data
 
Word Embedding In IR
Word Embedding In IRWord Embedding In IR
Word Embedding In IR
 
Inteligent Catalogue Final
Inteligent Catalogue FinalInteligent Catalogue Final
Inteligent Catalogue Final
 
Recent Research and Developments on Recommender Systems in TEL
Recent Research and Developments on Recommender Systems in TELRecent Research and Developments on Recommender Systems in TEL
Recent Research and Developments on Recommender Systems in TEL
 
introduction to data mining tutorial
introduction to data mining tutorial introduction to data mining tutorial
introduction to data mining tutorial
 

Más de Daniel Valcarce

Más de Daniel Valcarce (8)

On the Robustness and Discriminative Power of IR Metrics for Top-N Recommenda...
On the Robustness and Discriminative Power of IR Metrics for Top-N Recommenda...On the Robustness and Discriminative Power of IR Metrics for Top-N Recommenda...
On the Robustness and Discriminative Power of IR Metrics for Top-N Recommenda...
 
LiMe: Linear Methods for Pseudo-Relevance Feedback [SAC '18 Slides]
LiMe: Linear Methods for Pseudo-Relevance Feedback [SAC '18 Slides]LiMe: Linear Methods for Pseudo-Relevance Feedback [SAC '18 Slides]
LiMe: Linear Methods for Pseudo-Relevance Feedback [SAC '18 Slides]
 
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...
Additive Smoothing for Relevance-Based Language Modelling of Recommender Syst...
 
Computing Neighbourhoods with Language Models in a Collaborative Filtering Sc...
Computing Neighbourhoods with Language Models in a Collaborative Filtering Sc...Computing Neighbourhoods with Language Models in a Collaborative Filtering Sc...
Computing Neighbourhoods with Language Models in a Collaborative Filtering Sc...
 
When Recommenders Met Big Data: an Architectural Proposal and Evaluation [CER...
When Recommenders Met Big Data: an Architectural Proposal and Evaluation [CER...When Recommenders Met Big Data: an Architectural Proposal and Evaluation [CER...
When Recommenders Met Big Data: an Architectural Proposal and Evaluation [CER...
 
A Study of Smoothing Methods for Relevance-Based Language Modelling of Recomm...
A Study of Smoothing Methods for Relevance-Based Language Modelling of Recomm...A Study of Smoothing Methods for Relevance-Based Language Modelling of Recomm...
A Study of Smoothing Methods for Relevance-Based Language Modelling of Recomm...
 
Language Models for Collaborative Filtering Neighbourhoods [ECIR '16 Slides]
Language Models for Collaborative Filtering Neighbourhoods [ECIR '16 Slides]Language Models for Collaborative Filtering Neighbourhoods [ECIR '16 Slides]
Language Models for Collaborative Filtering Neighbourhoods [ECIR '16 Slides]
 
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
Efficient Pseudo-Relevance Feedback Methods for Collaborative Filtering Recom...
 

Último

Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
amitlee9823
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
MarinCaroMartnezBerg
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
amitlee9823
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Último (20)

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 

Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS Slides]

  • 1. DOCTORAL SYMPOSIUM Exploring Statistical Language Models for Recommender Systems RecSys 2015 16 - 20 September, Vienna, Austria Daniel Valcarce @dvalcarce Information Retrieval Lab University of A Coruña Spain
  • 3. Information Retrieval vs Information Filtering (1) Information Retrieval (IR) Information Filtering (IF) 2
  • 4. Information Retrieval vs Information Filtering (1) Information Retrieval (IR) Goal: Retrieve relevant documents according to the information need of a user Information Filtering (IF) 2
  • 5. Information Retrieval vs Information Filtering (1) Information Retrieval (IR) Goal: Retrieve relevant documents according to the information need of a user Examples: Search engines (web, multimedia...) Information Filtering (IF) 2
  • 6. Information Retrieval vs Information Filtering (1) Information Retrieval (IR) Goal: Retrieve relevant documents according to the information need of a user Examples: Search engines (web, multimedia...) Information Filtering (IF) Goal: Select relevant items from an information stream for a given user 2
  • 7. Information Retrieval vs Information Filtering (1) Information Retrieval (IR) Goal: Retrieve relevant documents according to the information need of a user Examples: Search engines (web, multimedia...) Information Filtering (IF) Goal: Select relevant items from an information stream for a given user Examples: spam filters, recommender systems 2
  • 8. Information Retrieval vs Information Filtering (1) Information Retrieval (IR) Goal: Retrieve relevant documents according to the information need of a user Examples: Search engines (web, multimedia...) Input: The user’s query (explicit). Information Filtering (IF) Goal: Select relevant items from an information stream for a given user Examples: spam filters, recommender systems Input: The user’s history (implicit). 2
  • 9. Information Retrieval vs Information Filtering (2) Some people consider them different fields: U. Hanani, B. Shapira and P. Shoval: Information Filtering: Overview of Issues, Research and Systems in User Modeling and User-Adapted Interaction (2001) 3
  • 10. Information Retrieval vs Information Filtering (2) Some people consider them different fields: U. Hanani, B. Shapira and P. Shoval: Information Filtering: Overview of Issues, Research and Systems in User Modeling and User-Adapted Interaction (2001) While other consider them the same thing: N. J. Belkin and W. B. Croft: Information filtering and information retrieval: two sides of the same coin? in Communications of the ACM (1992) 3
  • 11. Information Retrieval vs Information Filtering (2) Some people consider them different fields: U. Hanani, B. Shapira and P. Shoval: Information Filtering: Overview of Issues, Research and Systems in User Modeling and User-Adapted Interaction (2001) While other consider them the same thing: N. J. Belkin and W. B. Croft: Information filtering and information retrieval: two sides of the same coin? in Communications of the ACM (1992) What is undeniable is that they are closely related: Why not apply techniques from one field to the other? 3
  • 12. Information Retrieval vs Information Filtering (2) Some people consider them different fields: U. Hanani, B. Shapira and P. Shoval: Information Filtering: Overview of Issues, Research and Systems in User Modeling and User-Adapted Interaction (2001) While other consider them the same thing: N. J. Belkin and W. B. Croft: Information filtering and information retrieval: two sides of the same coin? in Communications of the ACM (1992) What is undeniable is that they are closely related: Why not apply techniques from one field to the other? It has already been done! 3
  • 13. Information Retrieval vs Information Filtering (3) Information Retrieval (IR) Some retrieval techniques are: Information Filtering (IF) Some CF techniques are: 4
  • 14. Information Retrieval vs Information Filtering (3) Information Retrieval (IR) Some retrieval techniques are: Vector: Vector Space Model Information Filtering (IF) Some CF techniques are: Vector: Pairwise similarities (cosine, Pearson) 4
  • 15. Information Retrieval vs Information Filtering (3) Information Retrieval (IR) Some retrieval techniques are: Vector: Vector Space Model MF: Latent Semantic Indexing (LSI) Information Filtering (IF) Some CF techniques are: Vector: Pairwise similarities (cosine, Pearson) MF: SVD, NMF 4
  • 16. Information Retrieval vs Information Filtering (3) Information Retrieval (IR) Some retrieval techniques are: Vector: Vector Space Model MF: Latent Semantic Indexing (LSI) Probabilistic: LDA Information Filtering (IF) Some CF techniques are: Vector: Pairwise similarities (cosine, Pearson) MF: SVD, NMF Probabilistic: LDA and other PGMs 4
  • 17. Information Retrieval vs Information Filtering (3) Information Retrieval (IR) Some retrieval techniques are: Vector: Vector Space Model MF: Latent Semantic Indexing (LSI) Probabilistic: LDA, Language Models (LM) Information Filtering (IF) Some CF techniques are: Vector: Pairwise similarities (cosine, Pearson) MF: SVD, NMF Probabilistic: LDA and other PGMs 4
  • 18. Language Models for Recommendation: Research goals Language Models (LM) represented a breakthrough in Information Retrieval: State-of-the-art technique for text retrieval Solid statistical foundation 5
  • 19. Language Models for Recommendation: Research goals Language Models (LM) represented a breakthrough in Information Retrieval: State-of-the-art technique for text retrieval Solid statistical foundation Maybe they can also be useful in RecSys: 5
  • 20. Language Models for Recommendation: Research goals Language Models (LM) represented a breakthrough in Information Retrieval: State-of-the-art technique for text retrieval Solid statistical foundation Maybe they can also be useful in RecSys: Are LM a good framework for Collaborative Filtering? 5
  • 21. Language Models for Recommendation: Research goals Language Models (LM) represented a breakthrough in Information Retrieval: State-of-the-art technique for text retrieval Solid statistical foundation Maybe they can also be useful in RecSys: Are LM a good framework for Collaborative Filtering? Can LM be adapted to deal with temporal (TARS) and/or contextual information (CARS)? 5
  • 22. Language Models for Recommendation: Research goals Language Models (LM) represented a breakthrough in Information Retrieval: State-of-the-art technique for text retrieval Solid statistical foundation Maybe they can also be useful in RecSys: Are LM a good framework for Collaborative Filtering? Can LM be adapted to deal with temporal (TARS) and/or contextual information (CARS)? A principled formulation of LM that combines Content-Based and Collaborative Filtering? 5
  • 23. Language Models for Recommendation: Research goals Language Models (LM) represented a breakthrough in Information Retrieval: State-of-the-art technique for text retrieval Solid statistical foundation Maybe they can also be useful in RecSys: Are LM a good framework for Collaborative Filtering? Can LM be adapted to deal with temporal (TARS) and/or contextual information (CARS)? A principled formulation of LM that combines Content-Based and Collaborative Filtering? 5
  • 24. Language Models for Recommendation: Related work There is little work done in using Language Models for CF: J. Wang, A. P. de Vries and M. J. Reinders: A User-Item Relevance Model for Log-based Collaborative Filtering in ECIR 2006 6
  • 25. Language Models for Recommendation: Related work There is little work done in using Language Models for CF: J. Wang, A. P. de Vries and M. J. Reinders: A User-Item Relevance Model for Log-based Collaborative Filtering in ECIR 2006 A. Bellogín, J. Wang and P. Castells: Bridging Memory-Based Collaborative Filtering and Text Retrieval in Information Retrieval (2013) 6
  • 26. Language Models for Recommendation: Related work There is little work done in using Language Models for CF: J. Wang, A. P. de Vries and M. J. Reinders: A User-Item Relevance Model for Log-based Collaborative Filtering in ECIR 2006 A. Bellogín, J. Wang and P. Castells: Bridging Memory-Based Collaborative Filtering and Text Retrieval in Information Retrieval (2013) J. Parapar, A. Bellogín, P. Castells and Á. Barreiro: Relevance-Based Language Modelling for Recommender Systems in Information Processing & Management (2013) 6
  • 27. Language Models for Recommendation: Related work There is little work done in using Language Models for CF: J. Wang, A. P. de Vries and M. J. Reinders: A User-Item Relevance Model for Log-based Collaborative Filtering in ECIR 2006 A. Bellogín, J. Wang and P. Castells: Bridging Memory-Based Collaborative Filtering and Text Retrieval in Information Retrieval (2013) J. Parapar, A. Bellogín, P. Castells and Á. Barreiro: Relevance-Based Language Modelling for Recommender Systems in Information Processing & Management (2013) 6
  • 28. Relevance-Based Language Models for Collaborative Filtering 6
  • 29. Relevance-Based Language Models Relevance-Based Language Models or Relevance Models (RM) are a pseudo-relevance feedback technique from IR. Pseudo-relevance feedback is an automatic query expansion technique. The expanded query is expected to yield better results than the original one. 7
  • 36. Pseudo-relevance feedback Information need query Retrieval System Query Expansion expanded query 8
  • 37. Pseudo-relevance feedback Information need query Retrieval System Query Expansion expanded query 8
  • 38. Relevance-Based Language Models for CF Recommendation (1) IR RecSys User’s query User’s profile mostˆ1,populatedˆ1,stateˆ2 Titanicˆ2,Avatarˆ3,Sharkˆ5 Documents Neighbours Terms Items 9
  • 39. Relevance-Based Language Models for CF Recommendation (2) Parapar et al. (2013): RM2 : p(i|Ru) ∝ p(i) j∈Iu v∈Vu p(i|v) p(v) p(i) p(j|v) Iu is the set of items rated by the user u Vu is neighbourhood of the user u. This is computed using a clustering algorithm p(i|u) is computed smoothing the maximum likelihood estimate with the probability in the collection p(i) and p(v) are the item and user priors 10
  • 40. Relevance-Based Language Models for CF Recommendation (2) Parapar et al. (2013): RM2 : p(i|Ru) ∝ p(i) j∈Iu v∈Vu p(i|v) p(v) p(i) p(j|v) Iu is the set of items rated by the user u Vu is neighbourhood of the user u. This is computed using a clustering algorithm p(i|u) is computed smoothing the maximum likelihood estimate with the probability in the collection p(i) and p(v) are the item and user priors 10
  • 41. Relevance-Based Language Models for CF Recommendation (2) Parapar et al. (2013): RM2 : p(i|Ru) ∝ p(i) j∈Iu v∈Vu p(i|v) p(v) p(i) p(j|v) Iu is the set of items rated by the user u Vu is neighbourhood of the user u. This is computed using a clustering algorithm p(i|u) is computed smoothing the maximum likelihood estimate with the probability in the collection p(i) and p(v) are the item and user priors 10
  • 42. Relevance-Based Language Models for CF Recommendation (2) Parapar et al. (2013): RM2 : p(i|Ru) ∝ p(i) j∈Iu v∈Vu p(i|v) p(v) p(i) p(j|v) Iu is the set of items rated by the user u Vu is neighbourhood of the user u. This is computed using a clustering algorithm p(i|u) is computed smoothing the maximum likelihood estimate with the probability in the collection p(i) and p(v) are the item and user priors 10
  • 44. Smoothing in RM2 RM2 : p(i|Ru) ∝ p(i) j∈Iu v∈Vu p(i|v) p(v) p(i) p(j|v) p(i|u) is computed smoothing the maximum likelihood estimate: pml(i|u) = ru,i j∈Iu ru,j with the probability in the collection: p(i|C) = v∈U rv,i j∈I, v∈U rv,j 11
  • 45. Why use smoothing? In Information Retrieval, smoothing provides: A way to deal with data sparsity The inverse document frequency (IDF) role Document length normalisation 12
  • 46. Why use smoothing? In Information Retrieval, smoothing provides: A way to deal with data sparsity The inverse document frequency (IDF) role Document length normalisation In RecSys, we have the same problems: Data sparsity Item popularity vs item specificity Profiles with different lengths 12
  • 47. Smoothing techniques Jelinek-Mercer (JM): Linear interpolation. Parameter λ. pλ(i|u) = (1 − λ) pml(i|u) + λ p(i|C) Dirichlet priors (DP): Bayesian analysis. Parameter µ. pµ(i|u) = ru,i + µ p(i|C) µ + j∈Iu ru,j Absolute Discounting (AD): Subtract a constant δ. pδ(i|u) = max(ru,i − δ, 0) + δ |Iu| p(i|C) j∈Iu ru,j 13
  • 49. Smoothing: ranking accuracy 0.20 0.25 0.30 0.35 0 100 200 300 400 500 600 700 800 900 1000 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 nDCG@10 µ λ, δ RM2 + AD RM2 + JM RM2 + DP Figure: nDCG@10 values of RM2 varying the smoothing method using 400 nearest neighbours according to Pearson’s correlation on MovieLens 100k dataset 14
  • 50. Smoothing: diversity 0.010 0.015 0.020 0.025 0.030 0 100 200 300 400 500 600 700 800 900 1000 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Gini@10 µ λ, δ RM2 + AD RM2 + JM RM2 + DP Figure: Gini@10 values of RM2 varying the smoothing method using 400 nearest neighbours according to Pearson’s correlation on MovieLens 100k dataset 15
  • 51. Smoothing: novelty 7.5 8.0 8.5 9.0 9.5 0 100 200 300 400 500 600 700 800 900 1000 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 MSI@10 µ λ, δ RM2 + AD RM2 + JM RM2 + DP Figure: MSI@10 values of RM2 varying the smoothing method using 400 nearest neighbours according to Pearson’s correlation on MovieLens 100k dataset 16
  • 52. More about smoothings in RM2 for CF More details about smoothings in: D. Valcarce, J. Parapar, Á. Barreiro: A Study of Smoothing Methods for Relevance-Based Language Modelling of Recommender Systems in ECIR 2015 17
  • 54. Priors in RM2 RM2 : p(i|Ru) ∝ p(i) j∈Iu v∈Vu p(i|v) p(v) p(i) p(j|v) p(i) and p(v) are the item and user priors: Enable to introduce a priori information into the model 18
  • 55. Priors in RM2 RM2 : p(i|Ru) ∝ p(i) j∈Iu v∈Vu p(i|v) p(v) p(i) p(j|v) p(i) and p(v) are the item and user priors: Enable to introduce a priori information into the model Provide a principled way of modelling business rules! 18
  • 56. Prior estimates Uniform (U) Linear (L) User prior pU(u) = 1 |U| pL(u) = i∈Iu ru,i v∈U j∈Iv rv,j Item prior pU(i) = 1 |I| pL(i) = u∈Ui ru,i j∈I v∈Uj rv,j 19
  • 58. Priors on MovieLens 100k User prior Item prior nDCG@10 Gini@10 MSI@10 Linear Linear 0.0922 0.4603 28.4284 Uniform Linear 0.2453 0.2027 16.4022 Uniform Uniform 0.3296 0.0256 6.8273 Linear Uniform 0.3423 0.0264 6.7848 Table: nDCG@10, Gini@10 and MSI@10 values of RM2 varying the prior estimates using 400 nearest neighbours according to Pearson’s correlation on MovieLens 100k dataset and Absolute Discounting (δ = 0.1) More priors in D. Valcarce, J. Parapar and Á. Barreiro: A Study of Priors for Relevance-Based Language Modelling of Recommender Systems in RecSys 2015! 20
  • 59. Comparison with other CF algorithms 20
  • 60. Priors on MovieLens 100k Algorithm nDCG@10 Gini@10 MSI@10 SVD 0.0946 0.0109 14.6129 SVD++ 0.1113 0.0126 14.9574 NNCosNgbr 0.1771 0.0344 16.8222 UIR-Item 0.2188 0.0124 5.2337 PureSVD 0.3595 0.1364 11.8841 RM2-JM 0.3175 0.0232 9.1087 RM2-DP 0.3274 0.0251 9.2181 RM2-AD 0.3296 0.0256 9.2409 RM2-AD-L-U 0.3423 0.0264 9.2004 Table: nDCG@10, Gini@10 and MSI@10 values of different CF recommendation algorithms 21
  • 61. Conclusions and future directions 21
  • 62. Conclusions IR techniques can be employed in RecSys Not only methods such as SVD... but also Language Models! 22
  • 63. Conclusions IR techniques can be employed in RecSys Not only methods such as SVD... but also Language Models! Language Models provide a principled and interpretable framework for recommendation. 22
  • 64. Conclusions IR techniques can be employed in RecSys Not only methods such as SVD... but also Language Models! Language Models provide a principled and interpretable framework for recommendation. Relevance-Based Language Models are competitive, but there is room for improvements: More sophisticated priors 22
  • 65. Conclusions IR techniques can be employed in RecSys Not only methods such as SVD... but also Language Models! Language Models provide a principled and interpretable framework for recommendation. Relevance-Based Language Models are competitive, but there is room for improvements: More sophisticated priors Neighbourhood computation ◦ Different similarity metrics: cosine, Kullback–Leibler divergence ◦ Matrix factorisation: NMF, SVD ◦ Spectral clustering: NC 22
  • 66. Future work Improve novelty and diversity figures: RM2 performance is similar to PureSVD in terms of nDCG but it fails in terms of diversity and novelty 23
  • 67. Future work Improve novelty and diversity figures: RM2 performance is similar to PureSVD in terms of nDCG but it fails in terms of diversity and novelty Introduce more evidences in the LM framework apart from ratings: Content-based information (hybrid recommender) Temporal and contextual information (TARS & CARS) 23
  • 69. Time and Context in Language Models Time: X. Li and W. B. Croft: Time-based Language Models in CIKM 2003 K. Berberich, S. Bedathur, O. Alonso and G. Weikum: A language modeling approach for temporal information needs in ECIR 2010 Context: H. Rode and D. Hiemstra: Conceptual Language Models for Context-Aware Text Retrieval in TREC 2004 L. Azzopardi: Incorporating Context within the Language Modeling Approach for ad hoc Information Retrieval. PhD Thesis (2005) 25