SlideShare una empresa de Scribd logo
1 de 34
Descargar para leer sin conexión
Matrix Factorization Techniques For Recommender
                           Systems


                                      Reporter: Lei Guo



                                http://ir.sdu.edu.cn/index.htm




Copyright  2012 by IRLAB@SDU                                    -1-
Agenda


 Recommender System Strategies
 Matrix Factorization Methods
 Basic Factorization Model
 Learning Algorithms
 Adding Biases
 Additional Input Sources
 Temporal Dynamics
 Inputs With Varing Confidence Level




Copyright  2012 by IRLAB@SDU           -2-
Recommender System Strategies


 Contented-based recommendation
      – Creates a profile for each user or product to characterize its nature
             Eg. Movie profile include attributes regarding its genre, the paticipating
              actors, its box office popularity…
             Eg. User profile might include demographic information or answers provided
              on a suitable questionnaire
      – Programs use these profiles associate users with mathing products




Copyright  2012 by IRLAB@SDU                                                      -3-
 Simple approach
      – Compute the similarity of an unseen item with the user profile based on the
        keyword overlap (e.g. using the Dice coefficient)
                          2 ∗|𝑘𝑒𝑦𝑤𝑜𝑟𝑑𝑠 𝑏 𝑖 ∩𝑘𝑒𝑦𝑤𝑜𝑟𝑑𝑠 𝑏 𝑗 |
      – sim(bi, bj) =        𝑘𝑒𝑦𝑤𝑜𝑟𝑑𝑠 𝑏 𝑖 +𝑘𝑒𝑦𝑤𝑜𝑟𝑑𝑠 𝑏 𝑗 |
      – Or combine multiple metrics in a weighted approach
      – Require gathering external information that might not be available
 Collaborative-based recommendation
      – Relies only on past user behavior (eg. Previous transactions or puduct ratings)
      – Also known as collaborative filtering (CF)
 Basic assumption and idea
      – Users give ratings to catalog items (implicitly or explicitly)
      – Customers who had similar tastes in the past, will have similar tastes in the future

Copyright  2012 by IRLAB@SDU                                                     -4-
Item1      Item2   Item3       Item4       Item5
       Alice           5         3        4           4          ?
      User1            3         1        2           3          3
      User2            4         3        4           3          5
      User3            3         3        1           5          4
      User4            1         5        5           2          1
       – The goal is to estimate Alice's rating for this item
 Neighborhood methods
 User-oriented approach
       – Centered on computing the relationships between users
                                                     Pearson correlation




Copyright  2012 by IRLAB@SDU                                              -5-
 Item-oriented approach
      – Evaluates a user’s preference for an item based on ratings of “neighboring”
        item by the same user
      – Look for items that are similar to Item5
      – Take Alice's ratings for these items to predict the rating for Item5

                                Item1   Item2   Item3    Item4     Item5
                  Alice          5       3       4         4          ?
                 User1           3       1       2         3          3
                 User2           4       3       4         3          5
                 User3           3       3       1         5          4
                 User4           1       5       5         2          1



Copyright  2012 by IRLAB@SDU                                                  -6-
 Latent factor models
      – An alternative approach that tries to explain the ratings by charactering both
        items and users with the more holistic goal to uncover the latent features
      – For movies, these latent factors might measure obvious dimensions such as
        comedy versus drama, amount of action, or orientation to children
      – For users, each factor measures how much the users likes movies that score
        high on the corresponding movie factor
      – PLAS
      – Neural networks
      – Latent Dirichlet Allocation
      – Matrix factorization (eg. SVD)
      – …



Copyright  2012 by IRLAB@SDU                                                    -7-
Matrix Factorization Methods

 Characteristic
      – Characterizes both items and users by vectors of factors infered from item
        rating patterns
      – High correspondence between item and user factors leads to a
        recommendation
 Rely on matrix types of input data
      – One dimention representing user
      – The other representing items
 Two data types
      – High-quality explicit feedfack
             Includs explicit input by users regarding their interest in products
             We refer to explicit user feedback as ratings
             Usually sparse matrix, since any single user is likely to have rated only a small
              percentage of possible items
Copyright  2012 by IRLAB@SDU                                                            -8-
 Implicit feedback
      – Which indirectly reflects opion by observing user behavior
             Purchase history, browsing history, search patterns, mouse movements
      – Usually denotes the presence or absence of an event
      – Typically represented by a densely filled matrix




Copyright  2012 by IRLAB@SDU                                                        -9-
Basic Matrix Factorization Model


 Formalization
      – Map both users and items to a joint latent factor space of dimensionality f
      – User-item interactions are modeled as inner products in that space
      – Each item i is associated with a vector qi ,and each user u is associated with a
        a vector pu,
             qi measures the extent to which the item possesses those factors
             pu measure the extent of interest the user has in items
             the resulting dot product qiTpu caputures the interaction between user u and item i –
                the user’s overall interest in the item’s characteristics
      – The appoximates user u’s rating of item i, which is denoted by rui, leading to
        the estimate

      – The major chanllenge is computing the mapping of each item and user to
        factor vecotrs qi,pu

Copyright  2012 by IRLAB@SDU                                                                  - 10 -
– We can caputure the latent relationships between users and items
      – We can produce a low-dimensional representation of the original rating
        matrix
      – Factor rating matrix R using SVD obtain Q, S, P
      – Reduce the matrix S to dimension k
      – Compute two resultant matrices: QkSk (qT)and SkPk(p)
 Predicting task
      – These resultant matrices can now be used to compute the recommendation
        score for any user and item
      – We can simply calculte the dot product of the ith row of q and the uth column
        of p




Copyright  2012 by IRLAB@SDU                                                    - 11 -
 Top-N Recommendation Task
      – We consider customer preference data as binary by treating each non-zero
        entry of the customer-product matrix as “1”
      – We are only interested in whether a customer consumed a particular product
        but not how much the user liked
 First Step (neighborhood formation in the reduced space)
      –   Factor rating matrix R using SVD obtain Q, S, P
      –   Reduce the matrix S to dimension k
      –   Compute resultant matrices: PkSk
      –   Performed vector similarity (cosine similarity) to form the neighborhood
 Second Step(Top-N Recommendation)
      – we scan through the purchase record of each of the k neighbors
      – Compute the frequency count on the products they purchased
      – Return the most frequently purchased N items for the target customer

Copyright  2012 by IRLAB@SDU                                                    - 12 -
 Difficulties
      – High portion of missing values caused by sparseness in the user-item rating
        matrix
      – Conventional SVD is undefined when knowledge about the matrix is
        incomplete
      – Carelessly addressing only the relatively few kown entries is highly prone to
        overfitting
 Solutions
 Fill missing values
      – Earlier systems relied on imputation to fill in missing rating and make the
        rating matrix dense
      – Such as using the average ratings for user and item
      – However, (1)imputation can be very expensive as it significantly increases the
        amount of data and (2) inaccurate imputation might distort the data.
Copyright  2012 by IRLAB@SDU                                                    - 13 -
 Modeling directly the observed ratings only
      – Avoiding overfitting through a regularized model
      – The learn the factor vectors(pu and qi), the system minimizes the regularized
        squared error on the set of known ratings:




      – The goal is to generalize those previous ratings in a way that predicts future
        unkown ratings
      – The constant controls the extent of regularization




Copyright  2012 by IRLAB@SDU                                                     - 14 -
Learning Algorithms


 Stochastic gradient desent
      – Also known as incremental learning
      – For each given training case, the system predicts rui and computes the
        associated prediction error



      – Learning rule




Copyright  2012 by IRLAB@SDU                                                    - 15 -
 Alternating least squares
      – Because both qi and pu are unknowns, the object function is not convex
      – However, if fix one of the unknowns, the optimization problems quadratic and
        can be solved optimally
      – Als techniques rotate between fixing the qi’s and fixing the pu’s
      – When all pu’s are fixed, the system recomputes the qi’s by sovling a least-
        squares problem

      – we can fix the matrix P as some matrix   , such that minimization problem
        would be equivalent to



      – Analogously, we can fix Q as    ,


Copyright  2012 by IRLAB@SDU                                                 - 16 -
 Learning rule




      – It can be shown that the only possible minimum is the global one
      – so that P and Q must converge to the true SVD subspace




Copyright  2012 by IRLAB@SDU                                              - 17 -
Adding Biases


 We tries to capture the interactions between users and items that
  produce the different rating values



 However, typical collaborative filtering data exhibits large systematic
  tendencies for some users to give higher ratings than others
 And for some items to receive higher ratings than others
      – Some products are widely perceived as better(or worse) than others
 Its unwise to explain the full rating value in this form
 We should identify the user and item bias



Copyright  2012 by IRLAB@SDU                                                - 18 -
 A first-order approximation of the bias is as follows:


      – The bias accounts for the user and item effects, where      denotes the
        overall average rating
      – bu and bi indicate the observed deviations of user u and item i.
 For example
      – Suppose we want to estimate user john’s rating of the movie Titanic
      – And the averagerating over all movies is 3.7 stars
      – Titanic is better than an average movie, so it tends to be rated 0.5 starts
        above the average movie
      – John is a critical user, who tends to rate 0.3 stars lower than the average
      – Thus, the estimate for Titanic’s rating by John would be (3.7+0.5-0.3)


Copyright  2012 by IRLAB@SDU                                                     - 19 -
 The estimation becomes:


 The system learns by miniming the squared error function




 The stochastic gradient descent learning rule becomes:




Copyright  2012 by IRLAB@SDU                                - 20 -
Additional Input Sources


 A system must deal with cold start problem, wherein many users supply
  very few ratings
 We need to incorporate additional sources to relieve this problem
 Use implicit feedback to gain insight into user preferences
      – They can gather behavior information of user’s
      – Eg. A retailer can use its customers’ purchases or browsing history to learn
        their tendencies
      – For simplicity, consider a case with a boolean implicit feedback
      – The exact model is as follows




Copyright  2012 by IRLAB@SDU                                                    - 21 -
 Several types of implicit feedback can be simultaneously introduced into
  the model by using extra sets of item factors
 Eg. Demographics, such as gender, age group, zip code, income level
      – Use a distinct factor vector yj(2) corresponds to each attribute to describe a
        user
 The matrix factorization model should integrate all signal sources:





Copyright  2012 by IRLAB@SDU                                                     - 22 -
Temproal Dynamics


 Time-drifting nature
      – So far, the presented models have been static
      – In reality, product perception and popularity constantly change as new
        selections emerge
      – Similarly, customer’s inclinations evolve, leading them to redefine their taste
      – The system should account for the temporal effects




Copyright  2012 by IRLAB@SDU                                                     - 23 -
 Multiple sources of temporal dynamics
 Item-side effects:
      – Product perception and popularity are constantly changing
      – Seasonal patterns influence items’ popularity
 User-side effects:
      –   Customers ever redefine their taste
      –   Transient, short-term bias; anchoring
      –   Drifting rating scale
      –   Change of rater within household
 Multiple sources: Both items and users are changing over time
 Multiple targets: Each user/item forms a unique time series
  Scarce data per target
Copyright  2012 by IRLAB@SDU                                       - 24 -
Addressing temporal dynamics

 Factor model conveniently allows separately treating different aspects
 We observe changes in:
      1.    Rating scale of individual users
      2.    Popularity of individual items
      3.    User preferences




                 rui (t )    bu (t )  bi (t )  q pu (t )
                                                    T
                                                    i


Copyright  2012 by IRLAB@SDU                                       - 25 -
Parameterizing the model


                     rui (t )    bu (t )  bi (t )  qiT pu (t )

 Use functional forms: bu(t)=f(u,t), bi(t)=g(i,t), pu(t)=h(u,t)
 Need to find adequate f(), g(), h()
 General guidelines:
      –   Items show slower temporal changes
      –   Users exhibit frequent and sudden changes
      –   Factors –pu(t)– are expensive to model
      –   Gain flexibility by heavily parameterizing the functions


Copyright  2012 by IRLAB@SDU                                         - 26 -
 Take item biases and user biases as an example
 It is more of a challenge on the users side
      – we would like a finer resolution for users to detect very short lived temporal
        effects
      – we do not expect enough ratings per user to produce reliable estimates for
        isolated bins
 One simple modeling choice
      – uses a linear function to capture a possible gradual drift of user bias



      – Timedependent user-bias




Copyright  2012 by IRLAB@SDU                                                     - 27 -
 item biases bi(t)
 split the item biases into time-based bins and how?
      – The decision of how to split the timeline into bins should balance the desire to
        achieve finer resolution (hence, smaller bins) with the need for enough ratings
        per bin (hence, larger bins).
 The movie bias is split into a stationary part and a time changing part




Copyright  2012 by IRLAB@SDU                                                    - 28 -
Inputs With Varying Confidence Levels

 Are all observed ratings deserve the same weight or confidence?
      – For example, massive advertising might influence votes for certain items,
        which do not aptly reflect longer-term characteristics
      – Adversarial users might try to tilt the ratings of certain items
 The same in implicit feedback
      – The system works with a cruder binary representation (like or not like)
      – A zero value may not means not liking
             The user might be unaware of the existence of the item, or unable to consume
              it due to its price
      – Consuming an item can also be the result of factors different from preferring
        it
             a user may watch a TV show just because she is staying on the channel of the
              previously watched show
             a consumer may buy an item as gift for someone else, despite not liking the
              item for himself

Copyright  2012 by IRLAB@SDU                                                       - 29 -
 The notion of confidence
      – Confidence can stem from available number values that describe the
        frequency of actions
             How much time the user watched a certain show
             How frequently a user bought a certain item
      – Various factors that have nothing to do with user preferences might cause a
        one-time event
      – A recurring event is more likely to reflect user opinion
 We introduce a set of variables            , which measure our confidence in
  observing
      – In general, as rui grows, we have a stronger
      – Indication that the user indeed likes the item



Copyright  2012 by IRLAB@SDU                                                  - 30 -
 A plausible choice for cui would be



          – This way, we have some minimal confidence in pui for every user-item pair,
            but as we observe more evidence for positive preference, our confidence in
            pui = 1 increases accordingly.
          – The rate of increase is controlled by the constant α
     Our goal is to find a vector pu for each user u and a vector of item qi that
      will factor user preferences


Min p* ,q* ,b*     
                 known rui
                                                    T
                                                     u   
                                                             2
                                                                 
                             cui rui  (   bu  bi  p qi )   pu
                                                                       2
                                                                            qi  bu2  bi2
                                                                                2
                                                                                              
    Copyright  2012 by IRLAB@SDU                                                   - 31 -
Conclusions


 Basic Factorization Model
 Adding Biases
 Additional Input Sources
 Temporal Dynamics
 Inputs With Varing Confidence Level




Copyright  2012 by IRLAB@SDU           - 32 -
References
[1] Matrix factorization techniques for recommender systerms, Yehuda Koren,2009.
[2]Application of Dimensionality Reduction in Recommender System-A case study,B.M. Sarwar,
     KDD, 2000.
[3] A Guide to Singular Value Decomposition for collaborative filtering. Chih-Chao Ma
[4] Factorization meets the neighborhood: a multifacedted collaborative fitering model.
     Yehuda Koren, KDD’08.
[5]Collaborative filtering for implicit feedback datasets. Yehuda Koren, ICDM’08.
[6]Collaborative filtering with temporal dynamics. Yehuda Koren, KDD’09.
[7] Recommender systems –An introduction, Dietmar Jannach, Cambridge university
     press,2011.
[8] Recommender Systems Handbook, Francesco Ricci, Springer , 2011.




Copyright  2012 by IRLAB@SDU                                                      - 33 -
Thanks




Copyright  2012 by IRLAB@SDU   - 34 -

Más contenido relacionado

La actualidad más candente

Recommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filteringRecommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filteringViet-Trung TRAN
 
Matrix Factorization In Recommender Systems
Matrix Factorization In Recommender SystemsMatrix Factorization In Recommender Systems
Matrix Factorization In Recommender SystemsYONG ZHENG
 
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Xavier Amatriain
 
Recommendation System
Recommendation SystemRecommendation System
Recommendation SystemAnamta Sayyed
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation SystemsTrieu Nguyen
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architectureLiang Xiang
 
Machine Learning for Recommender Systems MLSS 2015 Sydney
Machine Learning for Recommender Systems MLSS 2015 SydneyMachine Learning for Recommender Systems MLSS 2015 Sydney
Machine Learning for Recommender Systems MLSS 2015 SydneyAlexandros Karatzoglou
 
Crafting Recommenders: the Shallow and the Deep of it!
Crafting Recommenders: the Shallow and the Deep of it! Crafting Recommenders: the Shallow and the Deep of it!
Crafting Recommenders: the Shallow and the Deep of it! Sudeep Das, Ph.D.
 
Recommender systems using collaborative filtering
Recommender systems using collaborative filteringRecommender systems using collaborative filtering
Recommender systems using collaborative filteringD Yogendra Rao
 
Recent advances in deep recommender systems
Recent advances in deep recommender systemsRecent advances in deep recommender systems
Recent advances in deep recommender systemsNAVER Engineering
 
Udacity webinar on Recommendation Systems
Udacity webinar on Recommendation SystemsUdacity webinar on Recommendation Systems
Udacity webinar on Recommendation SystemsAxel de Romblay
 
Matrix factorization
Matrix factorizationMatrix factorization
Matrix factorizationLuis Serrano
 
Movie Recommender System Using Artificial Intelligence
Movie Recommender System Using Artificial Intelligence Movie Recommender System Using Artificial Intelligence
Movie Recommender System Using Artificial Intelligence Shrutika Oswal
 
Tutorial on Deep Learning in Recommender System, Lars summer school 2019
Tutorial on Deep Learning in Recommender System, Lars summer school 2019Tutorial on Deep Learning in Recommender System, Lars summer school 2019
Tutorial on Deep Learning in Recommender System, Lars summer school 2019Anoop Deoras
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender SystemsJustin Basilico
 

La actualidad más candente (20)

Recommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filteringRecommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filtering
 
Matrix Factorization In Recommender Systems
Matrix Factorization In Recommender SystemsMatrix Factorization In Recommender Systems
Matrix Factorization In Recommender Systems
 
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
 
Recommendation System
Recommendation SystemRecommendation System
Recommendation System
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation Systems
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architecture
 
Content based filtering
Content based filteringContent based filtering
Content based filtering
 
Recommender systems
Recommender systemsRecommender systems
Recommender systems
 
Machine Learning for Recommender Systems MLSS 2015 Sydney
Machine Learning for Recommender Systems MLSS 2015 SydneyMachine Learning for Recommender Systems MLSS 2015 Sydney
Machine Learning for Recommender Systems MLSS 2015 Sydney
 
Crafting Recommenders: the Shallow and the Deep of it!
Crafting Recommenders: the Shallow and the Deep of it! Crafting Recommenders: the Shallow and the Deep of it!
Crafting Recommenders: the Shallow and the Deep of it!
 
Recommender systems using collaborative filtering
Recommender systems using collaborative filteringRecommender systems using collaborative filtering
Recommender systems using collaborative filtering
 
Recent advances in deep recommender systems
Recent advances in deep recommender systemsRecent advances in deep recommender systems
Recent advances in deep recommender systems
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Udacity webinar on Recommendation Systems
Udacity webinar on Recommendation SystemsUdacity webinar on Recommendation Systems
Udacity webinar on Recommendation Systems
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Matrix factorization
Matrix factorizationMatrix factorization
Matrix factorization
 
Movie Recommender System Using Artificial Intelligence
Movie Recommender System Using Artificial Intelligence Movie Recommender System Using Artificial Intelligence
Movie Recommender System Using Artificial Intelligence
 
Tutorial on Deep Learning in Recommender System, Lars summer school 2019
Tutorial on Deep Learning in Recommender System, Lars summer school 2019Tutorial on Deep Learning in Recommender System, Lars summer school 2019
Tutorial on Deep Learning in Recommender System, Lars summer school 2019
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
Recommender system
Recommender systemRecommender system
Recommender system
 

Destacado

Simple Matrix Factorization for Recommendation in Mahout
Simple Matrix Factorization for Recommendation in MahoutSimple Matrix Factorization for Recommendation in Mahout
Simple Matrix Factorization for Recommendation in MahoutData Science London
 
Latent factor models for Collaborative Filtering
Latent factor models for Collaborative FilteringLatent factor models for Collaborative Filtering
Latent factor models for Collaborative Filteringsscdotopen
 
Beginners Guide to Non-Negative Matrix Factorization
Beginners Guide to Non-Negative Matrix FactorizationBeginners Guide to Non-Negative Matrix Factorization
Beginners Guide to Non-Negative Matrix FactorizationBenjamin Bengfort
 
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...Yun-Nung (Vivian) Chen
 
Handling Missing Attributes using Matrix Factorization 
Handling Missing Attributes using Matrix Factorization Handling Missing Attributes using Matrix Factorization 
Handling Missing Attributes using Matrix Factorization CS, NcState
 
Echo in WebRTC; Why?
Echo in WebRTC; Why?Echo in WebRTC; Why?
Echo in WebRTC; Why?Muaz Khan
 
Microcalcification oriented content based mammogram retrieval for breast canc...
Microcalcification oriented content based mammogram retrieval for breast canc...Microcalcification oriented content based mammogram retrieval for breast canc...
Microcalcification oriented content based mammogram retrieval for breast canc...Lazaros Tsochatzidis
 
この Visualization がすごい2014 〜データ世界を彩るツール6選〜
この Visualization がすごい2014 〜データ世界を彩るツール6選〜この Visualization がすごい2014 〜データ世界を彩るツール6選〜
この Visualization がすごい2014 〜データ世界を彩るツール6選〜Takahiro Inoue
 
Introduction to Matrix Factorization Methods Collaborative Filtering
Introduction to Matrix Factorization Methods Collaborative FilteringIntroduction to Matrix Factorization Methods Collaborative Filtering
Introduction to Matrix Factorization Methods Collaborative FilteringDKALab
 

Destacado (11)

Simple Matrix Factorization for Recommendation in Mahout
Simple Matrix Factorization for Recommendation in MahoutSimple Matrix Factorization for Recommendation in Mahout
Simple Matrix Factorization for Recommendation in Mahout
 
Latent factor models for Collaborative Filtering
Latent factor models for Collaborative FilteringLatent factor models for Collaborative Filtering
Latent factor models for Collaborative Filtering
 
Beginners Guide to Non-Negative Matrix Factorization
Beginners Guide to Non-Negative Matrix FactorizationBeginners Guide to Non-Negative Matrix Factorization
Beginners Guide to Non-Negative Matrix Factorization
 
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...
Matrix Factorization with Knowledge Graph Propagation for Unsupervised Spoken...
 
Handling Missing Attributes using Matrix Factorization 
Handling Missing Attributes using Matrix Factorization Handling Missing Attributes using Matrix Factorization 
Handling Missing Attributes using Matrix Factorization 
 
Echo in WebRTC; Why?
Echo in WebRTC; Why?Echo in WebRTC; Why?
Echo in WebRTC; Why?
 
Microcalcification oriented content based mammogram retrieval for breast canc...
Microcalcification oriented content based mammogram retrieval for breast canc...Microcalcification oriented content based mammogram retrieval for breast canc...
Microcalcification oriented content based mammogram retrieval for breast canc...
 
Multidimensional RNN
Multidimensional RNNMultidimensional RNN
Multidimensional RNN
 
この Visualization がすごい2014 〜データ世界を彩るツール6選〜
この Visualization がすごい2014 〜データ世界を彩るツール6選〜この Visualization がすごい2014 〜データ世界を彩るツール6選〜
この Visualization がすごい2014 〜データ世界を彩るツール6選〜
 
FaaS-and-Furious
FaaS-and-FuriousFaaS-and-Furious
FaaS-and-Furious
 
Introduction to Matrix Factorization Methods Collaborative Filtering
Introduction to Matrix Factorization Methods Collaborative FilteringIntroduction to Matrix Factorization Methods Collaborative Filtering
Introduction to Matrix Factorization Methods Collaborative Filtering
 

Similar a Matrix Factorization Techniques For Recommender Systems

Chapter 02 collaborative recommendation
Chapter 02   collaborative recommendationChapter 02   collaborative recommendation
Chapter 02 collaborative recommendationAravindharamanan S
 
Chapter 02 collaborative recommendation
Chapter 02   collaborative recommendationChapter 02   collaborative recommendation
Chapter 02 collaborative recommendationAravindharamanan S
 
Summary of a Recommender Systems Survey paper
Summary of a Recommender Systems Survey paperSummary of a Recommender Systems Survey paper
Summary of a Recommender Systems Survey paperChangsung Moon
 
Lecture Notes on Recommender System Introduction
Lecture Notes on Recommender System IntroductionLecture Notes on Recommender System Introduction
Lecture Notes on Recommender System IntroductionPerumalPitchandi
 
Ronny lempelyahooindiabigthinkerapril2013
Ronny lempelyahooindiabigthinkerapril2013Ronny lempelyahooindiabigthinkerapril2013
Ronny lempelyahooindiabigthinkerapril2013Muthusamy Chelliah
 
REAL-TIME RECOMMENDATION SYSTEMS
REAL-TIME RECOMMENDATION SYSTEMS REAL-TIME RECOMMENDATION SYSTEMS
REAL-TIME RECOMMENDATION SYSTEMS BigDataCloud
 
3e recommendation engines_meetup
3e recommendation engines_meetup3e recommendation engines_meetup
3e recommendation engines_meetupPranab Ghosh
 
Recommender Systems from A to Z – The Right Dataset
Recommender Systems from A to Z – The Right DatasetRecommender Systems from A to Z – The Right Dataset
Recommender Systems from A to Z – The Right DatasetCrossing Minds
 
IRJET - Recommendation System using Big Data Mining on Social Networks
IRJET -  	  Recommendation System using Big Data Mining on Social NetworksIRJET -  	  Recommendation System using Big Data Mining on Social Networks
IRJET - Recommendation System using Big Data Mining on Social NetworksIRJET Journal
 
Explain Yourself: Why You Get the Recommendations You Do
Explain Yourself: Why You Get the Recommendations You DoExplain Yourself: Why You Get the Recommendations You Do
Explain Yourself: Why You Get the Recommendations You DoDatabricks
 
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...Sc Huang
 
IRJET- Online Course Recommendation System
IRJET- Online Course Recommendation SystemIRJET- Online Course Recommendation System
IRJET- Online Course Recommendation SystemIRJET Journal
 
Recommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringRecommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringChangsung Moon
 
Tag And Tag Based Recommender
Tag And Tag Based RecommenderTag And Tag Based Recommender
Tag And Tag Based Recommendergu wendong
 
Methodology - Conceptual Database Design Transparencies .pptx
Methodology -  Conceptual Database Design Transparencies .pptxMethodology -  Conceptual Database Design Transparencies .pptx
Methodology - Conceptual Database Design Transparencies .pptxbirhanugirmay559
 
Book Recommendation System
Book Recommendation SystemBook Recommendation System
Book Recommendation SystemIRJET Journal
 
AI in Entertainment – Movie Recommendation System
AI in Entertainment – Movie Recommendation SystemAI in Entertainment – Movie Recommendation System
AI in Entertainment – Movie Recommendation SystemIRJET Journal
 
Item Based Collaborative Filtering Recommendation Algorithms
Item Based Collaborative Filtering Recommendation AlgorithmsItem Based Collaborative Filtering Recommendation Algorithms
Item Based Collaborative Filtering Recommendation Algorithmsnextlib
 
Overview of Movie Recommendation System using Machine learning by R programmi...
Overview of Movie Recommendation System using Machine learning by R programmi...Overview of Movie Recommendation System using Machine learning by R programmi...
Overview of Movie Recommendation System using Machine learning by R programmi...IRJET Journal
 

Similar a Matrix Factorization Techniques For Recommender Systems (20)

Chapter 02 collaborative recommendation
Chapter 02   collaborative recommendationChapter 02   collaborative recommendation
Chapter 02 collaborative recommendation
 
Chapter 02 collaborative recommendation
Chapter 02   collaborative recommendationChapter 02   collaborative recommendation
Chapter 02 collaborative recommendation
 
Summary of a Recommender Systems Survey paper
Summary of a Recommender Systems Survey paperSummary of a Recommender Systems Survey paper
Summary of a Recommender Systems Survey paper
 
Lecture Notes on Recommender System Introduction
Lecture Notes on Recommender System IntroductionLecture Notes on Recommender System Introduction
Lecture Notes on Recommender System Introduction
 
Ronny lempelyahooindiabigthinkerapril2013
Ronny lempelyahooindiabigthinkerapril2013Ronny lempelyahooindiabigthinkerapril2013
Ronny lempelyahooindiabigthinkerapril2013
 
REAL-TIME RECOMMENDATION SYSTEMS
REAL-TIME RECOMMENDATION SYSTEMS REAL-TIME RECOMMENDATION SYSTEMS
REAL-TIME RECOMMENDATION SYSTEMS
 
3e recommendation engines_meetup
3e recommendation engines_meetup3e recommendation engines_meetup
3e recommendation engines_meetup
 
Recommender Systems from A to Z – The Right Dataset
Recommender Systems from A to Z – The Right DatasetRecommender Systems from A to Z – The Right Dataset
Recommender Systems from A to Z – The Right Dataset
 
IRJET - Recommendation System using Big Data Mining on Social Networks
IRJET -  	  Recommendation System using Big Data Mining on Social NetworksIRJET -  	  Recommendation System using Big Data Mining on Social Networks
IRJET - Recommendation System using Big Data Mining on Social Networks
 
Explain Yourself: Why You Get the Recommendations You Do
Explain Yourself: Why You Get the Recommendations You DoExplain Yourself: Why You Get the Recommendations You Do
Explain Yourself: Why You Get the Recommendations You Do
 
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...
 
IRJET- Online Course Recommendation System
IRJET- Online Course Recommendation SystemIRJET- Online Course Recommendation System
IRJET- Online Course Recommendation System
 
Recommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringRecommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative Filtering
 
Tag And Tag Based Recommender
Tag And Tag Based RecommenderTag And Tag Based Recommender
Tag And Tag Based Recommender
 
Methodology - Conceptual Database Design Transparencies .pptx
Methodology -  Conceptual Database Design Transparencies .pptxMethodology -  Conceptual Database Design Transparencies .pptx
Methodology - Conceptual Database Design Transparencies .pptx
 
Book Recommendation System
Book Recommendation SystemBook Recommendation System
Book Recommendation System
 
User Personality and the New User Problem in a Context-­‐Aware POI Recommende...
User Personality and the New User Problem in a Context-­‐Aware POI Recommende...User Personality and the New User Problem in a Context-­‐Aware POI Recommende...
User Personality and the New User Problem in a Context-­‐Aware POI Recommende...
 
AI in Entertainment – Movie Recommendation System
AI in Entertainment – Movie Recommendation SystemAI in Entertainment – Movie Recommendation System
AI in Entertainment – Movie Recommendation System
 
Item Based Collaborative Filtering Recommendation Algorithms
Item Based Collaborative Filtering Recommendation AlgorithmsItem Based Collaborative Filtering Recommendation Algorithms
Item Based Collaborative Filtering Recommendation Algorithms
 
Overview of Movie Recommendation System using Machine learning by R programmi...
Overview of Movie Recommendation System using Machine learning by R programmi...Overview of Movie Recommendation System using Machine learning by R programmi...
Overview of Movie Recommendation System using Machine learning by R programmi...
 

Último

Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 

Último (20)

Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 

Matrix Factorization Techniques For Recommender Systems

  • 1. Matrix Factorization Techniques For Recommender Systems Reporter: Lei Guo http://ir.sdu.edu.cn/index.htm Copyright  2012 by IRLAB@SDU -1-
  • 2. Agenda  Recommender System Strategies  Matrix Factorization Methods  Basic Factorization Model  Learning Algorithms  Adding Biases  Additional Input Sources  Temporal Dynamics  Inputs With Varing Confidence Level Copyright  2012 by IRLAB@SDU -2-
  • 3. Recommender System Strategies  Contented-based recommendation – Creates a profile for each user or product to characterize its nature  Eg. Movie profile include attributes regarding its genre, the paticipating actors, its box office popularity…  Eg. User profile might include demographic information or answers provided on a suitable questionnaire – Programs use these profiles associate users with mathing products Copyright  2012 by IRLAB@SDU -3-
  • 4.  Simple approach – Compute the similarity of an unseen item with the user profile based on the keyword overlap (e.g. using the Dice coefficient) 2 ∗|𝑘𝑒𝑦𝑤𝑜𝑟𝑑𝑠 𝑏 𝑖 ∩𝑘𝑒𝑦𝑤𝑜𝑟𝑑𝑠 𝑏 𝑗 | – sim(bi, bj) = 𝑘𝑒𝑦𝑤𝑜𝑟𝑑𝑠 𝑏 𝑖 +𝑘𝑒𝑦𝑤𝑜𝑟𝑑𝑠 𝑏 𝑗 | – Or combine multiple metrics in a weighted approach – Require gathering external information that might not be available  Collaborative-based recommendation – Relies only on past user behavior (eg. Previous transactions or puduct ratings) – Also known as collaborative filtering (CF)  Basic assumption and idea – Users give ratings to catalog items (implicitly or explicitly) – Customers who had similar tastes in the past, will have similar tastes in the future Copyright  2012 by IRLAB@SDU -4-
  • 5. Item1 Item2 Item3 Item4 Item5 Alice 5 3 4 4 ? User1 3 1 2 3 3 User2 4 3 4 3 5 User3 3 3 1 5 4 User4 1 5 5 2 1 – The goal is to estimate Alice's rating for this item  Neighborhood methods  User-oriented approach – Centered on computing the relationships between users Pearson correlation Copyright  2012 by IRLAB@SDU -5-
  • 6.  Item-oriented approach – Evaluates a user’s preference for an item based on ratings of “neighboring” item by the same user – Look for items that are similar to Item5 – Take Alice's ratings for these items to predict the rating for Item5 Item1 Item2 Item3 Item4 Item5 Alice 5 3 4 4 ? User1 3 1 2 3 3 User2 4 3 4 3 5 User3 3 3 1 5 4 User4 1 5 5 2 1 Copyright  2012 by IRLAB@SDU -6-
  • 7.  Latent factor models – An alternative approach that tries to explain the ratings by charactering both items and users with the more holistic goal to uncover the latent features – For movies, these latent factors might measure obvious dimensions such as comedy versus drama, amount of action, or orientation to children – For users, each factor measures how much the users likes movies that score high on the corresponding movie factor – PLAS – Neural networks – Latent Dirichlet Allocation – Matrix factorization (eg. SVD) – … Copyright  2012 by IRLAB@SDU -7-
  • 8. Matrix Factorization Methods  Characteristic – Characterizes both items and users by vectors of factors infered from item rating patterns – High correspondence between item and user factors leads to a recommendation  Rely on matrix types of input data – One dimention representing user – The other representing items  Two data types – High-quality explicit feedfack  Includs explicit input by users regarding their interest in products  We refer to explicit user feedback as ratings  Usually sparse matrix, since any single user is likely to have rated only a small percentage of possible items Copyright  2012 by IRLAB@SDU -8-
  • 9.  Implicit feedback – Which indirectly reflects opion by observing user behavior  Purchase history, browsing history, search patterns, mouse movements – Usually denotes the presence or absence of an event – Typically represented by a densely filled matrix Copyright  2012 by IRLAB@SDU -9-
  • 10. Basic Matrix Factorization Model  Formalization – Map both users and items to a joint latent factor space of dimensionality f – User-item interactions are modeled as inner products in that space – Each item i is associated with a vector qi ,and each user u is associated with a a vector pu,  qi measures the extent to which the item possesses those factors  pu measure the extent of interest the user has in items  the resulting dot product qiTpu caputures the interaction between user u and item i – the user’s overall interest in the item’s characteristics – The appoximates user u’s rating of item i, which is denoted by rui, leading to the estimate – The major chanllenge is computing the mapping of each item and user to factor vecotrs qi,pu Copyright  2012 by IRLAB@SDU - 10 -
  • 11. – We can caputure the latent relationships between users and items – We can produce a low-dimensional representation of the original rating matrix – Factor rating matrix R using SVD obtain Q, S, P – Reduce the matrix S to dimension k – Compute two resultant matrices: QkSk (qT)and SkPk(p)  Predicting task – These resultant matrices can now be used to compute the recommendation score for any user and item – We can simply calculte the dot product of the ith row of q and the uth column of p Copyright  2012 by IRLAB@SDU - 11 -
  • 12.  Top-N Recommendation Task – We consider customer preference data as binary by treating each non-zero entry of the customer-product matrix as “1” – We are only interested in whether a customer consumed a particular product but not how much the user liked  First Step (neighborhood formation in the reduced space) – Factor rating matrix R using SVD obtain Q, S, P – Reduce the matrix S to dimension k – Compute resultant matrices: PkSk – Performed vector similarity (cosine similarity) to form the neighborhood  Second Step(Top-N Recommendation) – we scan through the purchase record of each of the k neighbors – Compute the frequency count on the products they purchased – Return the most frequently purchased N items for the target customer Copyright  2012 by IRLAB@SDU - 12 -
  • 13.  Difficulties – High portion of missing values caused by sparseness in the user-item rating matrix – Conventional SVD is undefined when knowledge about the matrix is incomplete – Carelessly addressing only the relatively few kown entries is highly prone to overfitting  Solutions  Fill missing values – Earlier systems relied on imputation to fill in missing rating and make the rating matrix dense – Such as using the average ratings for user and item – However, (1)imputation can be very expensive as it significantly increases the amount of data and (2) inaccurate imputation might distort the data. Copyright  2012 by IRLAB@SDU - 13 -
  • 14.  Modeling directly the observed ratings only – Avoiding overfitting through a regularized model – The learn the factor vectors(pu and qi), the system minimizes the regularized squared error on the set of known ratings: – The goal is to generalize those previous ratings in a way that predicts future unkown ratings – The constant controls the extent of regularization Copyright  2012 by IRLAB@SDU - 14 -
  • 15. Learning Algorithms  Stochastic gradient desent – Also known as incremental learning – For each given training case, the system predicts rui and computes the associated prediction error – Learning rule Copyright  2012 by IRLAB@SDU - 15 -
  • 16.  Alternating least squares – Because both qi and pu are unknowns, the object function is not convex – However, if fix one of the unknowns, the optimization problems quadratic and can be solved optimally – Als techniques rotate between fixing the qi’s and fixing the pu’s – When all pu’s are fixed, the system recomputes the qi’s by sovling a least- squares problem – we can fix the matrix P as some matrix , such that minimization problem would be equivalent to – Analogously, we can fix Q as , Copyright  2012 by IRLAB@SDU - 16 -
  • 17.  Learning rule – It can be shown that the only possible minimum is the global one – so that P and Q must converge to the true SVD subspace Copyright  2012 by IRLAB@SDU - 17 -
  • 18. Adding Biases  We tries to capture the interactions between users and items that produce the different rating values  However, typical collaborative filtering data exhibits large systematic tendencies for some users to give higher ratings than others  And for some items to receive higher ratings than others – Some products are widely perceived as better(or worse) than others  Its unwise to explain the full rating value in this form  We should identify the user and item bias Copyright  2012 by IRLAB@SDU - 18 -
  • 19.  A first-order approximation of the bias is as follows: – The bias accounts for the user and item effects, where denotes the overall average rating – bu and bi indicate the observed deviations of user u and item i.  For example – Suppose we want to estimate user john’s rating of the movie Titanic – And the averagerating over all movies is 3.7 stars – Titanic is better than an average movie, so it tends to be rated 0.5 starts above the average movie – John is a critical user, who tends to rate 0.3 stars lower than the average – Thus, the estimate for Titanic’s rating by John would be (3.7+0.5-0.3) Copyright  2012 by IRLAB@SDU - 19 -
  • 20.  The estimation becomes:  The system learns by miniming the squared error function  The stochastic gradient descent learning rule becomes: Copyright  2012 by IRLAB@SDU - 20 -
  • 21. Additional Input Sources  A system must deal with cold start problem, wherein many users supply very few ratings  We need to incorporate additional sources to relieve this problem  Use implicit feedback to gain insight into user preferences – They can gather behavior information of user’s – Eg. A retailer can use its customers’ purchases or browsing history to learn their tendencies – For simplicity, consider a case with a boolean implicit feedback – The exact model is as follows Copyright  2012 by IRLAB@SDU - 21 -
  • 22.  Several types of implicit feedback can be simultaneously introduced into the model by using extra sets of item factors  Eg. Demographics, such as gender, age group, zip code, income level – Use a distinct factor vector yj(2) corresponds to each attribute to describe a user  The matrix factorization model should integrate all signal sources:  Copyright  2012 by IRLAB@SDU - 22 -
  • 23. Temproal Dynamics  Time-drifting nature – So far, the presented models have been static – In reality, product perception and popularity constantly change as new selections emerge – Similarly, customer’s inclinations evolve, leading them to redefine their taste – The system should account for the temporal effects Copyright  2012 by IRLAB@SDU - 23 -
  • 24.  Multiple sources of temporal dynamics  Item-side effects: – Product perception and popularity are constantly changing – Seasonal patterns influence items’ popularity  User-side effects: – Customers ever redefine their taste – Transient, short-term bias; anchoring – Drifting rating scale – Change of rater within household  Multiple sources: Both items and users are changing over time  Multiple targets: Each user/item forms a unique time series   Scarce data per target Copyright  2012 by IRLAB@SDU - 24 -
  • 25. Addressing temporal dynamics  Factor model conveniently allows separately treating different aspects  We observe changes in: 1. Rating scale of individual users 2. Popularity of individual items 3. User preferences rui (t )    bu (t )  bi (t )  q pu (t ) T i Copyright  2012 by IRLAB@SDU - 25 -
  • 26. Parameterizing the model rui (t )    bu (t )  bi (t )  qiT pu (t )  Use functional forms: bu(t)=f(u,t), bi(t)=g(i,t), pu(t)=h(u,t)  Need to find adequate f(), g(), h()  General guidelines: – Items show slower temporal changes – Users exhibit frequent and sudden changes – Factors –pu(t)– are expensive to model – Gain flexibility by heavily parameterizing the functions Copyright  2012 by IRLAB@SDU - 26 -
  • 27.  Take item biases and user biases as an example  It is more of a challenge on the users side – we would like a finer resolution for users to detect very short lived temporal effects – we do not expect enough ratings per user to produce reliable estimates for isolated bins  One simple modeling choice – uses a linear function to capture a possible gradual drift of user bias – Timedependent user-bias Copyright  2012 by IRLAB@SDU - 27 -
  • 28.  item biases bi(t)  split the item biases into time-based bins and how? – The decision of how to split the timeline into bins should balance the desire to achieve finer resolution (hence, smaller bins) with the need for enough ratings per bin (hence, larger bins).  The movie bias is split into a stationary part and a time changing part Copyright  2012 by IRLAB@SDU - 28 -
  • 29. Inputs With Varying Confidence Levels  Are all observed ratings deserve the same weight or confidence? – For example, massive advertising might influence votes for certain items, which do not aptly reflect longer-term characteristics – Adversarial users might try to tilt the ratings of certain items  The same in implicit feedback – The system works with a cruder binary representation (like or not like) – A zero value may not means not liking  The user might be unaware of the existence of the item, or unable to consume it due to its price – Consuming an item can also be the result of factors different from preferring it  a user may watch a TV show just because she is staying on the channel of the previously watched show  a consumer may buy an item as gift for someone else, despite not liking the item for himself Copyright  2012 by IRLAB@SDU - 29 -
  • 30.  The notion of confidence – Confidence can stem from available number values that describe the frequency of actions  How much time the user watched a certain show  How frequently a user bought a certain item – Various factors that have nothing to do with user preferences might cause a one-time event – A recurring event is more likely to reflect user opinion  We introduce a set of variables , which measure our confidence in observing – In general, as rui grows, we have a stronger – Indication that the user indeed likes the item Copyright  2012 by IRLAB@SDU - 30 -
  • 31.  A plausible choice for cui would be – This way, we have some minimal confidence in pui for every user-item pair, but as we observe more evidence for positive preference, our confidence in pui = 1 increases accordingly. – The rate of increase is controlled by the constant α  Our goal is to find a vector pu for each user u and a vector of item qi that will factor user preferences Min p* ,q* ,b*  known rui  T u  2  cui rui  (   bu  bi  p qi )   pu 2  qi  bu2  bi2 2  Copyright  2012 by IRLAB@SDU - 31 -
  • 32. Conclusions  Basic Factorization Model  Adding Biases  Additional Input Sources  Temporal Dynamics  Inputs With Varing Confidence Level Copyright  2012 by IRLAB@SDU - 32 -
  • 33. References [1] Matrix factorization techniques for recommender systerms, Yehuda Koren,2009. [2]Application of Dimensionality Reduction in Recommender System-A case study,B.M. Sarwar, KDD, 2000. [3] A Guide to Singular Value Decomposition for collaborative filtering. Chih-Chao Ma [4] Factorization meets the neighborhood: a multifacedted collaborative fitering model. Yehuda Koren, KDD’08. [5]Collaborative filtering for implicit feedback datasets. Yehuda Koren, ICDM’08. [6]Collaborative filtering with temporal dynamics. Yehuda Koren, KDD’09. [7] Recommender systems –An introduction, Dietmar Jannach, Cambridge university press,2011. [8] Recommender Systems Handbook, Francesco Ricci, Springer , 2011. Copyright  2012 by IRLAB@SDU - 33 -
  • 34. Thanks Copyright  2012 by IRLAB@SDU - 34 -