SlideShare una empresa de Scribd logo
1 de 39
Descargar para leer sin conexión
Recommender System:
Algorithms & Architecture
     xiangliang@hulu.com
Outline
    Problem
    Data
•

    Algorithms
•

    Cold start
•

    Architecture
•
•
                         Recommender
                            System
Problem
Recommend items to users to make user, content
partner, websites happy!
Data
• User behaviors data

  Page view         All user        Very Large
  Behavior          User            Size


  Watch video       All user        Large
  Favorite          Register user   Middle
  Vote              Register user   Middle
  Add to playlist   Register user   Small
  Facebook like     Register user   Small
  Share             Register user   Small
  Review            Register user   Small
Data
• Which data is most
  important                     Page view         All user        Very Large
                                Behavior          User            Size



  – Main behavior in the
                                Watch video       All user        Large


    website
                                Favorite          Register user   Middle
                                Vote              Register user   Middle

  – All user can have such      Add to playlist   Register user   Small

    behavior
                                Facebook like     Register user   Small


  – Cost
                                Share             Register user   Small
                                Review            Register user   Small

  – Reflect user interests on
    items
Data
• Data Structure
  – User ID
  – Item ID
  – Behavior Type
  – Behavior Content
  – Context
     • Timestamp
     • Location
     • Mood
                   Sheldon watch Star Trek with his friends at home
Algorithms
                              Recommender
                              System Method



   Collaborative        Content          Social
                                                    ……
     Filtering          Filtering       Filtering


                   Latent Factor
Graph-based                               ……
                      Model


Neighborhood
                       ……
   -based



 User-based         Item-based            ……
Neighborhood-based
• User-based
  – Digg
• Item-based
  – Amazon, Netflix, YouTube, Hulu, …
User-based
• Algorithm
  – For user u, find a set of users S(u) have similar
    preference as u.
  – Recommend popular items among users in S(u)
    to user u.
User-based CF

pui =           ∑
        v∈S ( u , K ) ∩ N ( i )
                                  wuv rvi


            N (u ) ∩ N (v)
w uv =
            N (u ) ∪ N (v)
Item-based
• Algorithm
  – For user u, get items set N(u) this user like
    before.
  – Recommend items which are similar to many
    items in N(u) to user u.
Item-based CF

pui =           ∑
        j∈S ( i , K ) ∩ N ( u )
                                  w ji ruj


           N (i ) ∩ N ( j )
w ij =
           N (i ) ∪ N ( j )
Item-based CF




Why not use w ij =                      ?
                     N (i ) ∩ N ( j )
                          N (i )
Neighborhood-based
• User-based vs. Item-based
                    User-based              Item-based

  Scalability       Bad when user size is   Bad when item size is
                    large                   large
  Explanation       Bad                     Good

  Novelty           Bad                     Good

  Coverage          Bad                     Good

  Cold start        Bad for new users       Bad for new items

  Performance       Need to get many        Only need to get
                    users history           current user’s history
References
• Amazon.com Recommendations item-to-
  item Collaborative Filtering.
• Empirical Analysis of Predictive Algorithms
  for Collaborative Filtering.
Graph-based
• Users’ behaviors on items can be
  represented by bi-part graph.
 A       1   A       1   A       1   A   1

 B       2   B       2   B       2   B   2

 C       3   C       3   C       3   C   3

 D       4   D       4   D       4   D   4
Graph-based
• Two nodes will have high relevance if
  – There are many paths in graph between two
    nodes.
  – Most of paths between two nodes is short.
  – Most paths do not go through nodes with high
    out-degree.
Graph-based
• Advantage
  – Heterogeneous data            A   1
     • Multiple user behaviors
     • Social Network
                                  B   2

     • Context (Time, Location)   C   3

• Disadvantage                    D   4

  – Statistical-based
  – High cost for long path
References
• A Graph-based Recommender System for
  Digital Library.
• Random-walk computation of similarities
  between nodes of a graph with application
  to collaborative recommendation.
Latent Factor Model
• Users and items are connect by latent
  features.

       A                        1
                   a
       B                        2
                   b
       C                        3
                   c
       D                        4
Latent Factor Model
       rui = ∑ puk qik
       ˆ
                        k
Science Fiction   0.5       Science Fiction   0.9


   Universe       0.9          Universe       0.9


   Physical       0.8          Physical       0.5


 Space Travel     0.8        Space Travel     0.7


  Animation       0.3         Animation       0.1


  Romance         0.0         Romance         0.0
Latent Factor Model
• How to get p, q?

 min ∑ (rui − ∑ puk qik ) + λ ( pu       + qi )
                         2           2       2

      ( u ,i )    k

 = α (eui qik − λ puk )
 puk +
= α (eui puk − λ qik )
qik +
Latent Factor Model
• How to define rui
  – Rating prediction
  – Top-N recommendation
     • Implicit feedback data: only have positive samples
       and missing values, how to select negative samples?
Latent Factor Model
    1 (Sci-fi)         2 (Crime)         3 (Family)        4 (Horror)

                                                         The Blair Witch
The invisible Man        Jaws          101 Dalmatians
                                                             Project
 Frankenstein
                                         Back to the
 Meets the Wolf     Lethal Weapon                        Pacific Heights
                                           Future
     Man

    Godzilla          Total Recall     Groundhog Day     Stir of Echoes

  Star Wars VI      Reservoir Dogs         Tarzan          Dead Calm

The Terminator       Donnie Brasco     The Aristocats      Phantasm

                                       The Jungle Book
      Alien           The Fugitive                       Sleepy Hollow
                                              2

     Alien 2        La shou Shen tan        Antz          The Faculty
Latent Factor Model
• Advantage
  – High accuracy in rating prediction
  – Auto group items
  – Scalability is good
  – Learning-based
• Disadvantage
  – Incremental updating
  – Real-time
  – Explanation
References
• http://www.informatik.uni-
  trier.de/~ley/db/indices/a-
  tree/k/Koren:Yehuda.html
Cold Start
• Problems
  – User cold start : new users
  – Item cold start : new items
  – System cold start : new systems
User Cold Start
• How to recommend items to new users?
  – Non-personalization recommendation
     • Most popular items
     • Highly Rated items
  – Using user register profile (Age, Gender, …)
User Cold Start
• Example: Gender and TV shows




 Data comes from IMDB : http://www.imdb.com/title/tt0412142/ratings
User Cold Start

Male
Age : 20-30
Theoretical physicist
Doctor
American
Irreligious
How to get user interest quickly
• When new user comes, his feedback on
  what items can help us better understand
  his interest?
  – Not very popular
  – Can represent a group of items
  – Users who like this item have different
    preference with users who dislike this item
Item Cold Start
• How to recommend new items to user?
  – Do not recommend




                       How to recommend news??
Item Cold Start
• How to recommend new items to user?
  – Using content information
                 Machine
                            Data Mining   Recommendation
                 Learning
System Cold Start
• How to design recommender system when
  there is no user?
  – Pandora : Music Genome Project
  – Jinni : Movie Genome Project
Architecture
• Feature-based recommendation framework:

      A                      1
                  a
      B                      2
                  b
      C                      3
                  c
      D                      4


      User      Feature     Item
Architecture




    Male


   Scientist


   Physics
Architecture
• Advantage:
  – Heterogeneous data
  – Reasonable Explanation
• Disadvantage:
  – Do not support user-based methods
Open Questions
• How to weight multiple behaviors?
• How to improve diversity, novelty?
• How to build feedback loop?
Thanks!

Más contenido relacionado

La actualidad más candente

Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Alexandros Karatzoglou
 
Boston ML - Architecting Recommender Systems
Boston ML - Architecting Recommender SystemsBoston ML - Architecting Recommender Systems
Boston ML - Architecting Recommender SystemsJames Kirk
 
Recent advances in deep recommender systems
Recent advances in deep recommender systemsRecent advances in deep recommender systems
Recent advances in deep recommender systemsNAVER Engineering
 
Recommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filteringRecommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filteringViet-Trung TRAN
 
Recommendation system
Recommendation systemRecommendation system
Recommendation systemAkshat Thakar
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation SystemsTrieu Nguyen
 
Recommendation Systems Basics
Recommendation Systems BasicsRecommendation Systems Basics
Recommendation Systems BasicsJarin Tasnim Khan
 
Building a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engineBuilding a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engineNYC Predictive Analytics
 
Recommendation System
Recommendation SystemRecommendation System
Recommendation SystemAnamta Sayyed
 
Recommendation Systems
Recommendation SystemsRecommendation Systems
Recommendation SystemsRobin Reni
 
Recommender systems using collaborative filtering
Recommender systems using collaborative filteringRecommender systems using collaborative filtering
Recommender systems using collaborative filteringD Yogendra Rao
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender SystemsLior Rokach
 
Recommendation engines
Recommendation enginesRecommendation engines
Recommendation enginesGeorgian Micsa
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectiveXavier Amatriain
 
Collaborative Filtering using KNN
Collaborative Filtering using KNNCollaborative Filtering using KNN
Collaborative Filtering using KNNŞeyda Hatipoğlu
 
Recommendation system
Recommendation system Recommendation system
Recommendation system Vikrant Arya
 
Sequential Decision Making in Recommendations
Sequential Decision Making in RecommendationsSequential Decision Making in Recommendations
Sequential Decision Making in RecommendationsJaya Kawale
 
A Hybrid Recommendation system
A Hybrid Recommendation systemA Hybrid Recommendation system
A Hybrid Recommendation systemPranav Prakash
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender systemStanley Wang
 

La actualidad más candente (20)

Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial Deep Learning for Recommender Systems RecSys2017 Tutorial
Deep Learning for Recommender Systems RecSys2017 Tutorial
 
Boston ML - Architecting Recommender Systems
Boston ML - Architecting Recommender SystemsBoston ML - Architecting Recommender Systems
Boston ML - Architecting Recommender Systems
 
Recent advances in deep recommender systems
Recent advances in deep recommender systemsRecent advances in deep recommender systems
Recent advances in deep recommender systems
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Recommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filteringRecommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filtering
 
Recommendation system
Recommendation systemRecommendation system
Recommendation system
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation Systems
 
Recommendation Systems Basics
Recommendation Systems BasicsRecommendation Systems Basics
Recommendation Systems Basics
 
Building a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engineBuilding a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engine
 
Recommendation System
Recommendation SystemRecommendation System
Recommendation System
 
Recommendation Systems
Recommendation SystemsRecommendation Systems
Recommendation Systems
 
Recommender systems using collaborative filtering
Recommender systems using collaborative filteringRecommender systems using collaborative filtering
Recommender systems using collaborative filtering
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Recommendation engines
Recommendation enginesRecommendation engines
Recommendation engines
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspective
 
Collaborative Filtering using KNN
Collaborative Filtering using KNNCollaborative Filtering using KNN
Collaborative Filtering using KNN
 
Recommendation system
Recommendation system Recommendation system
Recommendation system
 
Sequential Decision Making in Recommendations
Sequential Decision Making in RecommendationsSequential Decision Making in Recommendations
Sequential Decision Making in Recommendations
 
A Hybrid Recommendation system
A Hybrid Recommendation systemA Hybrid Recommendation system
A Hybrid Recommendation system
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender system
 

Destacado

Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemMilind Gokhale
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Xavier Amatriain
 
Introduction to R Package Recommendation System Competition
Introduction to R Package Recommendation System CompetitionIntroduction to R Package Recommendation System Competition
Introduction to R Package Recommendation System CompetitionNYC Predictive Analytics
 
A Data Scientist in the Music Industry
A Data Scientist in the Music IndustryA Data Scientist in the Music Industry
A Data Scientist in the Music IndustryData Science London
 
Jeremy Schiff, Senior Manager, Data Science, OpenTable at MLconf NYC
Jeremy Schiff, Senior Manager, Data Science, OpenTable at MLconf NYCJeremy Schiff, Senior Manager, Data Science, OpenTable at MLconf NYC
Jeremy Schiff, Senior Manager, Data Science, OpenTable at MLconf NYCMLconf
 
Past present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry PerspectivePast present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry PerspectiveXavier Amatriain
 
[Final]collaborative filtering and recommender systems
[Final]collaborative filtering and recommender systems[Final]collaborative filtering and recommender systems
[Final]collaborative filtering and recommender systemsFalitokiniaina Rabearison
 
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Xavier Amatriain
 
How to Build a Recommendation Engine on Spark
How to Build a Recommendation Engine on SparkHow to Build a Recommendation Engine on Spark
How to Build a Recommendation Engine on SparkCaserta
 
Microservice Architecture with CQRS and Event Sourcing
Microservice Architecture with CQRS and Event SourcingMicroservice Architecture with CQRS and Event Sourcing
Microservice Architecture with CQRS and Event SourcingBen Wilcock
 
Factorization Machines with libFM
Factorization Machines with libFMFactorization Machines with libFM
Factorization Machines with libFMLiangjie Hong
 
Matrix factorization
Matrix factorizationMatrix factorization
Matrix factorizationrubyyc
 
Nonnegative Matrix Factorization
Nonnegative Matrix FactorizationNonnegative Matrix Factorization
Nonnegative Matrix FactorizationTatsuya Yokota
 
Neighbor methods vs matrix factorization - case studies of real-life recommen...
Neighbor methods vs matrix factorization - case studies of real-life recommen...Neighbor methods vs matrix factorization - case studies of real-life recommen...
Neighbor methods vs matrix factorization - case studies of real-life recommen...Domonkos Tikk
 
Collaborative Filtering with Spark
Collaborative Filtering with SparkCollaborative Filtering with Spark
Collaborative Filtering with SparkChris Johnson
 
Intro to Factorization Machines
Intro to Factorization MachinesIntro to Factorization Machines
Intro to Factorization MachinesPavel Kalaidin
 
آموزش محاسبات عددی - بخش دوم
آموزش محاسبات عددی - بخش دومآموزش محاسبات عددی - بخش دوم
آموزش محاسبات عددی - بخش دومfaradars
 
Lecture 6 lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
Lecture 6   lu factorization & determinants - section 2-5 2-7 3-1 and 3-2Lecture 6   lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
Lecture 6 lu factorization & determinants - section 2-5 2-7 3-1 and 3-2njit-ronbrown
 
Matrix Factorization Technique for Recommender Systems
Matrix Factorization Technique for Recommender SystemsMatrix Factorization Technique for Recommender Systems
Matrix Factorization Technique for Recommender SystemsAladejubelo Oluwashina
 

Destacado (20)

Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation System
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
 
Introduction to R Package Recommendation System Competition
Introduction to R Package Recommendation System CompetitionIntroduction to R Package Recommendation System Competition
Introduction to R Package Recommendation System Competition
 
A Data Scientist in the Music Industry
A Data Scientist in the Music IndustryA Data Scientist in the Music Industry
A Data Scientist in the Music Industry
 
Jeremy Schiff, Senior Manager, Data Science, OpenTable at MLconf NYC
Jeremy Schiff, Senior Manager, Data Science, OpenTable at MLconf NYCJeremy Schiff, Senior Manager, Data Science, OpenTable at MLconf NYC
Jeremy Schiff, Senior Manager, Data Science, OpenTable at MLconf NYC
 
Past present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry PerspectivePast present and future of Recommender Systems: an Industry Perspective
Past present and future of Recommender Systems: an Industry Perspective
 
Collaborative filtering
Collaborative filteringCollaborative filtering
Collaborative filtering
 
[Final]collaborative filtering and recommender systems
[Final]collaborative filtering and recommender systems[Final]collaborative filtering and recommender systems
[Final]collaborative filtering and recommender systems
 
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
Recsys 2016 tutorial: Lessons learned from building real-life recommender sys...
 
How to Build a Recommendation Engine on Spark
How to Build a Recommendation Engine on SparkHow to Build a Recommendation Engine on Spark
How to Build a Recommendation Engine on Spark
 
Microservice Architecture with CQRS and Event Sourcing
Microservice Architecture with CQRS and Event SourcingMicroservice Architecture with CQRS and Event Sourcing
Microservice Architecture with CQRS and Event Sourcing
 
Factorization Machines with libFM
Factorization Machines with libFMFactorization Machines with libFM
Factorization Machines with libFM
 
Matrix factorization
Matrix factorizationMatrix factorization
Matrix factorization
 
Nonnegative Matrix Factorization
Nonnegative Matrix FactorizationNonnegative Matrix Factorization
Nonnegative Matrix Factorization
 
Neighbor methods vs matrix factorization - case studies of real-life recommen...
Neighbor methods vs matrix factorization - case studies of real-life recommen...Neighbor methods vs matrix factorization - case studies of real-life recommen...
Neighbor methods vs matrix factorization - case studies of real-life recommen...
 
Collaborative Filtering with Spark
Collaborative Filtering with SparkCollaborative Filtering with Spark
Collaborative Filtering with Spark
 
Intro to Factorization Machines
Intro to Factorization MachinesIntro to Factorization Machines
Intro to Factorization Machines
 
آموزش محاسبات عددی - بخش دوم
آموزش محاسبات عددی - بخش دومآموزش محاسبات عددی - بخش دوم
آموزش محاسبات عددی - بخش دوم
 
Lecture 6 lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
Lecture 6   lu factorization & determinants - section 2-5 2-7 3-1 and 3-2Lecture 6   lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
Lecture 6 lu factorization & determinants - section 2-5 2-7 3-1 and 3-2
 
Matrix Factorization Technique for Recommender Systems
Matrix Factorization Technique for Recommender SystemsMatrix Factorization Technique for Recommender Systems
Matrix Factorization Technique for Recommender Systems
 

Similar a Recommender system algorithm and architecture

Facets and Pivoting for Flexible and Usable Linked Data Exploration
Facets and Pivoting for Flexible and Usable Linked Data ExplorationFacets and Pivoting for Flexible and Usable Linked Data Exploration
Facets and Pivoting for Flexible and Usable Linked Data ExplorationRoberto García
 
Ronny lempelyahooindiabigthinkerapril2013
Ronny lempelyahooindiabigthinkerapril2013Ronny lempelyahooindiabigthinkerapril2013
Ronny lempelyahooindiabigthinkerapril2013Muthusamy Chelliah
 
Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011Ernesto Mislej
 
Crowdsourcing for HCI Research with Amazon Mechanical Turk
Crowdsourcing for HCI Research with Amazon Mechanical TurkCrowdsourcing for HCI Research with Amazon Mechanical Turk
Crowdsourcing for HCI Research with Amazon Mechanical TurkEd Chi
 
Bootstrapping Recommendations with Neo4j
Bootstrapping Recommendations with Neo4jBootstrapping Recommendations with Neo4j
Bootstrapping Recommendations with Neo4jMax De Marzi
 
Bootstrapping Recommendations OSCON 2015
Bootstrapping Recommendations OSCON 2015Bootstrapping Recommendations OSCON 2015
Bootstrapping Recommendations OSCON 2015Max De Marzi
 
Project Progress Report - Recommender Systems for Social Networks
Project Progress Report - Recommender Systems for Social NetworksProject Progress Report - Recommender Systems for Social Networks
Project Progress Report - Recommender Systems for Social Networksamirhhz
 
Big Data Real Time Training in Chennai
Big Data Real Time Training in ChennaiBig Data Real Time Training in Chennai
Big Data Real Time Training in ChennaiVijay Susheedran C G
 
Big Data 101 - An introduction
Big Data 101 - An introductionBig Data 101 - An introduction
Big Data 101 - An introductionNeeraj Tewari
 
Buidling large scale recommendation engine
Buidling large scale recommendation engineBuidling large scale recommendation engine
Buidling large scale recommendation engineKeeyong Han
 
CSC 8101 Non Relational Databases
CSC 8101 Non Relational DatabasesCSC 8101 Non Relational Databases
CSC 8101 Non Relational Databasessjwoodman
 
Recommendation Systems Roadtrip
Recommendation Systems RoadtripRecommendation Systems Roadtrip
Recommendation Systems RoadtripThe Real Dyl
 
yt: Growing and Engaging a Community of Practice
yt: Growing and Engaging a Community of Practiceyt: Growing and Engaging a Community of Practice
yt: Growing and Engaging a Community of Practicematthewturk
 
Managing Online Business Communities
Managing Online Business CommunitiesManaging Online Business Communities
Managing Online Business CommunitiesSteffen Staab
 
Bring Satellite and Drone Imagery into your Data Science Workflows
Bring Satellite and Drone Imagery into your Data Science WorkflowsBring Satellite and Drone Imagery into your Data Science Workflows
Bring Satellite and Drone Imagery into your Data Science WorkflowsDatabricks
 
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3Dr. Aparna Varde
 
Building Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media AnalysisBuilding Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media Analysisikanow
 
Sparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya HristakevaSparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya HristakevaSpark Summit
 
Recommendations and User Understanding at StumbleUpon
Recommendations and User Understandingat StumbleUponRecommendations and User Understandingat StumbleUpon
Recommendations and User Understanding at StumbleUponDebora Donato
 
Immersive Recommendation
Immersive RecommendationImmersive Recommendation
Immersive Recommendation承剛 謝
 

Similar a Recommender system algorithm and architecture (20)

Facets and Pivoting for Flexible and Usable Linked Data Exploration
Facets and Pivoting for Flexible and Usable Linked Data ExplorationFacets and Pivoting for Flexible and Usable Linked Data Exploration
Facets and Pivoting for Flexible and Usable Linked Data Exploration
 
Ronny lempelyahooindiabigthinkerapril2013
Ronny lempelyahooindiabigthinkerapril2013Ronny lempelyahooindiabigthinkerapril2013
Ronny lempelyahooindiabigthinkerapril2013
 
Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011Recommender Systems! @ASAI 2011
Recommender Systems! @ASAI 2011
 
Crowdsourcing for HCI Research with Amazon Mechanical Turk
Crowdsourcing for HCI Research with Amazon Mechanical TurkCrowdsourcing for HCI Research with Amazon Mechanical Turk
Crowdsourcing for HCI Research with Amazon Mechanical Turk
 
Bootstrapping Recommendations with Neo4j
Bootstrapping Recommendations with Neo4jBootstrapping Recommendations with Neo4j
Bootstrapping Recommendations with Neo4j
 
Bootstrapping Recommendations OSCON 2015
Bootstrapping Recommendations OSCON 2015Bootstrapping Recommendations OSCON 2015
Bootstrapping Recommendations OSCON 2015
 
Project Progress Report - Recommender Systems for Social Networks
Project Progress Report - Recommender Systems for Social NetworksProject Progress Report - Recommender Systems for Social Networks
Project Progress Report - Recommender Systems for Social Networks
 
Big Data Real Time Training in Chennai
Big Data Real Time Training in ChennaiBig Data Real Time Training in Chennai
Big Data Real Time Training in Chennai
 
Big Data 101 - An introduction
Big Data 101 - An introductionBig Data 101 - An introduction
Big Data 101 - An introduction
 
Buidling large scale recommendation engine
Buidling large scale recommendation engineBuidling large scale recommendation engine
Buidling large scale recommendation engine
 
CSC 8101 Non Relational Databases
CSC 8101 Non Relational DatabasesCSC 8101 Non Relational Databases
CSC 8101 Non Relational Databases
 
Recommendation Systems Roadtrip
Recommendation Systems RoadtripRecommendation Systems Roadtrip
Recommendation Systems Roadtrip
 
yt: Growing and Engaging a Community of Practice
yt: Growing and Engaging a Community of Practiceyt: Growing and Engaging a Community of Practice
yt: Growing and Engaging a Community of Practice
 
Managing Online Business Communities
Managing Online Business CommunitiesManaging Online Business Communities
Managing Online Business Communities
 
Bring Satellite and Drone Imagery into your Data Science Workflows
Bring Satellite and Drone Imagery into your Data Science WorkflowsBring Satellite and Drone Imagery into your Data Science Workflows
Bring Satellite and Drone Imagery into your Data Science Workflows
 
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
 
Building Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media AnalysisBuilding Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media Analysis
 
Sparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya HristakevaSparking Science up with Research Recommendations by Maya Hristakeva
Sparking Science up with Research Recommendations by Maya Hristakeva
 
Recommendations and User Understanding at StumbleUpon
Recommendations and User Understandingat StumbleUponRecommendations and User Understandingat StumbleUpon
Recommendations and User Understanding at StumbleUpon
 
Immersive Recommendation
Immersive RecommendationImmersive Recommendation
Immersive Recommendation
 

Más de Liang Xiang

Phd. Thesis : Temporal Recommendation
Phd. Thesis : Temporal RecommendationPhd. Thesis : Temporal Recommendation
Phd. Thesis : Temporal RecommendationLiang Xiang
 
动态推荐系统关键技术研究
动态推荐系统关键技术研究动态推荐系统关键技术研究
动态推荐系统关键技术研究Liang Xiang
 
动态推荐系统关键技术研究
动态推荐系统关键技术研究动态推荐系统关键技术研究
动态推荐系统关键技术研究Liang Xiang
 
How to do model ensemble
How to do model ensembleHow to do model ensemble
How to do model ensembleLiang Xiang
 
Temporal recommendation on graphs via long and short-term
Temporal recommendation on graphs via long  and short-termTemporal recommendation on graphs via long  and short-term
Temporal recommendation on graphs via long and short-termLiang Xiang
 

Más de Liang Xiang (6)

Kddcup2011
Kddcup2011Kddcup2011
Kddcup2011
 
Phd. Thesis : Temporal Recommendation
Phd. Thesis : Temporal RecommendationPhd. Thesis : Temporal Recommendation
Phd. Thesis : Temporal Recommendation
 
动态推荐系统关键技术研究
动态推荐系统关键技术研究动态推荐系统关键技术研究
动态推荐系统关键技术研究
 
动态推荐系统关键技术研究
动态推荐系统关键技术研究动态推荐系统关键技术研究
动态推荐系统关键技术研究
 
How to do model ensemble
How to do model ensembleHow to do model ensemble
How to do model ensemble
 
Temporal recommendation on graphs via long and short-term
Temporal recommendation on graphs via long  and short-termTemporal recommendation on graphs via long  and short-term
Temporal recommendation on graphs via long and short-term
 

Último

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 

Último (20)

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...Swan(sea) Song – personal research during my six years at Swansea ... and bey...
Swan(sea) Song – personal research during my six years at Swansea ... and bey...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 

Recommender system algorithm and architecture

  • 1. Recommender System: Algorithms & Architecture xiangliang@hulu.com
  • 2. Outline Problem Data • Algorithms • Cold start • Architecture • • Recommender System
  • 3. Problem Recommend items to users to make user, content partner, websites happy!
  • 4. Data • User behaviors data Page view All user Very Large Behavior User Size Watch video All user Large Favorite Register user Middle Vote Register user Middle Add to playlist Register user Small Facebook like Register user Small Share Register user Small Review Register user Small
  • 5. Data • Which data is most important Page view All user Very Large Behavior User Size – Main behavior in the Watch video All user Large website Favorite Register user Middle Vote Register user Middle – All user can have such Add to playlist Register user Small behavior Facebook like Register user Small – Cost Share Register user Small Review Register user Small – Reflect user interests on items
  • 6. Data • Data Structure – User ID – Item ID – Behavior Type – Behavior Content – Context • Timestamp • Location • Mood Sheldon watch Star Trek with his friends at home
  • 7. Algorithms Recommender System Method Collaborative Content Social …… Filtering Filtering Filtering Latent Factor Graph-based …… Model Neighborhood …… -based User-based Item-based ……
  • 8. Neighborhood-based • User-based – Digg • Item-based – Amazon, Netflix, YouTube, Hulu, …
  • 9. User-based • Algorithm – For user u, find a set of users S(u) have similar preference as u. – Recommend popular items among users in S(u) to user u.
  • 10. User-based CF pui = ∑ v∈S ( u , K ) ∩ N ( i ) wuv rvi N (u ) ∩ N (v) w uv = N (u ) ∪ N (v)
  • 11. Item-based • Algorithm – For user u, get items set N(u) this user like before. – Recommend items which are similar to many items in N(u) to user u.
  • 12. Item-based CF pui = ∑ j∈S ( i , K ) ∩ N ( u ) w ji ruj N (i ) ∩ N ( j ) w ij = N (i ) ∪ N ( j )
  • 13. Item-based CF Why not use w ij = ? N (i ) ∩ N ( j ) N (i )
  • 14. Neighborhood-based • User-based vs. Item-based User-based Item-based Scalability Bad when user size is Bad when item size is large large Explanation Bad Good Novelty Bad Good Coverage Bad Good Cold start Bad for new users Bad for new items Performance Need to get many Only need to get users history current user’s history
  • 15. References • Amazon.com Recommendations item-to- item Collaborative Filtering. • Empirical Analysis of Predictive Algorithms for Collaborative Filtering.
  • 16. Graph-based • Users’ behaviors on items can be represented by bi-part graph. A 1 A 1 A 1 A 1 B 2 B 2 B 2 B 2 C 3 C 3 C 3 C 3 D 4 D 4 D 4 D 4
  • 17. Graph-based • Two nodes will have high relevance if – There are many paths in graph between two nodes. – Most of paths between two nodes is short. – Most paths do not go through nodes with high out-degree.
  • 18. Graph-based • Advantage – Heterogeneous data A 1 • Multiple user behaviors • Social Network B 2 • Context (Time, Location) C 3 • Disadvantage D 4 – Statistical-based – High cost for long path
  • 19. References • A Graph-based Recommender System for Digital Library. • Random-walk computation of similarities between nodes of a graph with application to collaborative recommendation.
  • 20. Latent Factor Model • Users and items are connect by latent features. A 1 a B 2 b C 3 c D 4
  • 21. Latent Factor Model rui = ∑ puk qik ˆ k Science Fiction 0.5 Science Fiction 0.9 Universe 0.9 Universe 0.9 Physical 0.8 Physical 0.5 Space Travel 0.8 Space Travel 0.7 Animation 0.3 Animation 0.1 Romance 0.0 Romance 0.0
  • 22. Latent Factor Model • How to get p, q? min ∑ (rui − ∑ puk qik ) + λ ( pu + qi ) 2 2 2 ( u ,i ) k = α (eui qik − λ puk ) puk + = α (eui puk − λ qik ) qik +
  • 23. Latent Factor Model • How to define rui – Rating prediction – Top-N recommendation • Implicit feedback data: only have positive samples and missing values, how to select negative samples?
  • 24. Latent Factor Model 1 (Sci-fi) 2 (Crime) 3 (Family) 4 (Horror) The Blair Witch The invisible Man Jaws 101 Dalmatians Project Frankenstein Back to the Meets the Wolf Lethal Weapon Pacific Heights Future Man Godzilla Total Recall Groundhog Day Stir of Echoes Star Wars VI Reservoir Dogs Tarzan Dead Calm The Terminator Donnie Brasco The Aristocats Phantasm The Jungle Book Alien The Fugitive Sleepy Hollow 2 Alien 2 La shou Shen tan Antz The Faculty
  • 25. Latent Factor Model • Advantage – High accuracy in rating prediction – Auto group items – Scalability is good – Learning-based • Disadvantage – Incremental updating – Real-time – Explanation
  • 26. References • http://www.informatik.uni- trier.de/~ley/db/indices/a- tree/k/Koren:Yehuda.html
  • 27. Cold Start • Problems – User cold start : new users – Item cold start : new items – System cold start : new systems
  • 28. User Cold Start • How to recommend items to new users? – Non-personalization recommendation • Most popular items • Highly Rated items – Using user register profile (Age, Gender, …)
  • 29. User Cold Start • Example: Gender and TV shows Data comes from IMDB : http://www.imdb.com/title/tt0412142/ratings
  • 30. User Cold Start Male Age : 20-30 Theoretical physicist Doctor American Irreligious
  • 31. How to get user interest quickly • When new user comes, his feedback on what items can help us better understand his interest? – Not very popular – Can represent a group of items – Users who like this item have different preference with users who dislike this item
  • 32. Item Cold Start • How to recommend new items to user? – Do not recommend How to recommend news??
  • 33. Item Cold Start • How to recommend new items to user? – Using content information Machine Data Mining Recommendation Learning
  • 34. System Cold Start • How to design recommender system when there is no user? – Pandora : Music Genome Project – Jinni : Movie Genome Project
  • 35. Architecture • Feature-based recommendation framework: A 1 a B 2 b C 3 c D 4 User Feature Item
  • 36. Architecture Male Scientist Physics
  • 37. Architecture • Advantage: – Heterogeneous data – Reasonable Explanation • Disadvantage: – Do not support user-based methods
  • 38. Open Questions • How to weight multiple behaviors? • How to improve diversity, novelty? • How to build feedback loop?