SlideShare una empresa de Scribd logo
1 de 17
iTALS
Fast ALS-based tensor factorization for context-aware
recommendation from implicit feedback
Balázs Hidasi – balazs.hidasi@gravityrd.com
Domonkos Tikk – domonkos.tikk@gravityrd.com




 ECML/PKDD, 25TH SEPTEMBER 2012, BRISTOL
Overview
• Implicit feedback problem
• Context-awareness
 ▫ Seasonality
 ▫ Sequentaility
• iTALS
 ▫ Model
 ▫ Learning
 ▫ Prediction
• Experiments
Feedback types
• Feedback: user-item interraction (events)
• Explicit:
  ▫ Preferences explicitely coded
  ▫ E.g.: Ratings
• Implicit:
  ▫ Preferences not coded explicitely
  ▫ E.g.: purchase history
Problems with implicit feedback
• Noisy positive preferences
 ▫ E.g.: bought & disappointed
• No negative feedback available
 ▫ E.g.: had no info on item

• Usually evaluated by ranking metrics
 ▫ Can not be directly optimized
Why to use implicit feedback?
• Every user provides
• Some magnitudes larger amount of information
  than explicit feedback
• More important in practice
 ▫ Explicit algorithms are for the biggest only
Context-awareness
• Context: any information associated with events
• Context state: a possible value of the context
  dimension
• Context-awareness
  ▫ Usage of context information
  ▫ Incorporating additional informations into the method
  ▫ Different predictions for same user given different
    context states
  ▫ Can greatly outperform context-unaware methods
     Context segmentates items/users well
Seasonality as context
• Season: a time period
  ▫ E.g.: a week
• Timeband: given interval in season
  ▫ Context-states
  ▫ E.g.: days
• Assumed:
  ▫ aggregated behaviour in a given
    timeband is similar inbetween
    seasons                            User   Item     Date       Context
  ▫ and different for different         1      A     12/07/2010      1
    timebands                           2      B     15/07/2010      3
  ▫ E.g.: daily/weekly routines         1      B     15/07/2010      3
                                        …      …         …
                                        1      A     19/07/2010      1
Sequentiality
• Bought A after B
  ▫ B is the context-state of the user’s
    event on A
• Usefullness
  ▫ Some items are bought together
  ▫ Some items bought repetetively
  ▫ Some are not
• Association rule like information
  incorporated into model as
  context
  ▫ Here: into factorization methods
• Can learn negated rules
  ▫ If C then not A
M3
iTALS - Model
• Binary tensor
  ▫ D dimensional




                                         User
  ▫ User – item – context(s)




                                    M1
                                                 T
• Importance weights
  ▫ Lower weights to zeroes (NIF)
  ▫ Higher weights to cells with                Item
    more events                                 M2
• Cells approximated by sum of
  the values in the elementwise
  product of D vectors
  ▫ Each for a dimension
  ▫ Low dimensional vectors
iTALS - Learning
•
iTALS - Prediction
• Sum of values in elementwise product of vectors
 ▫ User-item: scalar product of feature vectors
 ▫ User-item-context: weighted scalar product of
   feature vectors
• Context-state dependent reweighting of features
• E.g.:
 ▫ Third feature = horror movie
 ▫ Context state1 = Friday night  third feature high
 ▫ Context state2 = Sunday afternoon  third
   feature low
Experiments
• 5 databases
  ▫ 3 implicit
     Online grocery shopping
     VoD consumption
     Music listening habits
  ▫ 2 implicitizied explicit
     Netflix
     MovieLens 10M
• Recall@20 as primary evaluation metric
• Baseline: context-unaware method in every context-
  state
Scalability
                                      Running times on the Grocery dataset
                     350

                     300
                                                                                iALS epoch
Running time (sec)




                     250
                                                                                iTALS (time bands) epoch
                     200
                                                                                iTALS (sequence) epoch
                     150
                                                                                iALS calc. matrix
                     100
                                                                                iTALS (time bands) calc.
                                                                                matrix
                      50
                                                                                iTALS (sequence) calc. matrix
                      0
                           10   20   30     40   50   60   70   80   90   100
                                          Number of factors (K)
Results – Recall@20
0.1200

0.1000

0.0800

0.0600

0.0400

0.0200

0.0000
             VoD        Grocery        LastFM            Netflix           MovieLens
     iALS (MF method)   iCA baseline   iTALS (seasonality)         iTALS (sequence)
Results – Precision-recall curves
0.25        Recall-precision (Grocery)
0.20
0.15
0.10                                                            iALS

0.05
0.00                                                            iCA baseline
    0.00   0.05          0.10       0.15          0.20
                  Recall-precision (VoD)                        iTALS
0.08                                                            (season)

0.06                                                            iTALS
                                                                (sequence)
0.04

0.02

0.00
    0.00    0.05           0.10            0.15          0.20
Summary
• iTALS is a
  ▫   scalable
  ▫   context-aware
  ▫   factorization method
  ▫   on implicit feedback data
• The problem is modelled
  ▫ by approximating the values of a binary tensor
  ▫ with elementwise product of short vectors
  ▫ using importance weighting
• Learning can be efficiently done by using
  ▫ ALS
  ▫ and other computation time reducing tricks
• Recommendation accuracy is significantly better than iCA-baseline
• Introduced a new context type: sequentiality
  ▫ association rule like information in factorization framework
Thank you for your attention!



For more of my recommender systems related research visit my website: http://www.hidasi.eu




                                         T

Más contenido relacionado

Destacado

Approximate modeling of continuous context in factorization algorithms (CaRR1...
Approximate modeling of continuous context in factorization algorithms (CaRR1...Approximate modeling of continuous context in factorization algorithms (CaRR1...
Approximate modeling of continuous context in factorization algorithms (CaRR1...
Balázs Hidasi
 

Destacado (6)

Approximate modeling of continuous context in factorization algorithms (CaRR1...
Approximate modeling of continuous context in factorization algorithms (CaRR1...Approximate modeling of continuous context in factorization algorithms (CaRR1...
Approximate modeling of continuous context in factorization algorithms (CaRR1...
 
Context aware factorization methods for implicit feedback based recommendatio...
Context aware factorization methods for implicit feedback based recommendatio...Context aware factorization methods for implicit feedback based recommendatio...
Context aware factorization methods for implicit feedback based recommendatio...
 
Context-aware similarities within the factorization framework (CaRR 2013 pres...
Context-aware similarities within the factorization framework (CaRR 2013 pres...Context-aware similarities within the factorization framework (CaRR 2013 pres...
Context-aware similarities within the factorization framework (CaRR 2013 pres...
 
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
Parallel Recurrent Neural Network Architectures for Feature-rich Session-base...
 
Deep learning: the future of recommendations
Deep learning: the future of recommendationsDeep learning: the future of recommendations
Deep learning: the future of recommendations
 
Deep learning to the rescue - solving long standing problems of recommender ...
Deep learning to the rescue - solving long standing problems of recommender ...Deep learning to the rescue - solving long standing problems of recommender ...
Deep learning to the rescue - solving long standing problems of recommender ...
 

Similar a iTALS: implicit tensor factorization for context-aware recommendations (ECML/PKDD 2012 presentation)

Optimising digital content delivery
Optimising digital content deliveryOptimising digital content delivery
Optimising digital content delivery
Tamas Jambor
 
H formal lab report
H formal lab reportH formal lab report
H formal lab report
stephm32
 

Similar a iTALS: implicit tensor factorization for context-aware recommendations (ECML/PKDD 2012 presentation) (15)

Optimising digital content delivery
Optimising digital content deliveryOptimising digital content delivery
Optimising digital content delivery
 
H formal lab report
H formal lab reportH formal lab report
H formal lab report
 
Evaluating Data Freshness in Large Scale Replicated Databases
Evaluating Data Freshness in Large Scale Replicated DatabasesEvaluating Data Freshness in Large Scale Replicated Databases
Evaluating Data Freshness in Large Scale Replicated Databases
 
IoT with Azure Machine Learning and InfluxDB
IoT with Azure Machine Learning and InfluxDBIoT with Azure Machine Learning and InfluxDB
IoT with Azure Machine Learning and InfluxDB
 
It's all about the timing
It's all about the timingIt's all about the timing
It's all about the timing
 
Mining Big Data Streams with APACHE SAMOA
Mining Big Data Streams with APACHE SAMOAMining Big Data Streams with APACHE SAMOA
Mining Big Data Streams with APACHE SAMOA
 
Time Series Databases for IoT (On-premises and Azure)
Time Series Databases for IoT (On-premises and Azure)Time Series Databases for IoT (On-premises and Azure)
Time Series Databases for IoT (On-premises and Azure)
 
Mining big data streams with APACHE SAMOA by Albert Bifet
Mining big data streams with APACHE SAMOA by Albert BifetMining big data streams with APACHE SAMOA by Albert Bifet
Mining big data streams with APACHE SAMOA by Albert Bifet
 
Effectiveness and code optimization in Java
Effectiveness and code optimization in JavaEffectiveness and code optimization in Java
Effectiveness and code optimization in Java
 
Scylla Summit 2018: OLAP or OLTP? Why Not Both?
Scylla Summit 2018: OLAP or OLTP? Why Not Both?Scylla Summit 2018: OLAP or OLTP? Why Not Both?
Scylla Summit 2018: OLAP or OLTP? Why Not Both?
 
ETALIS_RuleML_2011_Retractions
ETALIS_RuleML_2011_RetractionsETALIS_RuleML_2011_Retractions
ETALIS_RuleML_2011_Retractions
 
Modeling the IoT with TitanDB and Cassandra
Modeling the IoT with TitanDB and CassandraModeling the IoT with TitanDB and Cassandra
Modeling the IoT with TitanDB and Cassandra
 
DL'12 mastro at work
DL'12 mastro at workDL'12 mastro at work
DL'12 mastro at work
 
Data mining Basics and complete description
Data mining Basics and complete description Data mining Basics and complete description
Data mining Basics and complete description
 
Database Health Check
Database Health CheckDatabase Health Check
Database Health Check
 

Último

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Último (20)

Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 

iTALS: implicit tensor factorization for context-aware recommendations (ECML/PKDD 2012 presentation)

  • 1. iTALS Fast ALS-based tensor factorization for context-aware recommendation from implicit feedback Balázs Hidasi – balazs.hidasi@gravityrd.com Domonkos Tikk – domonkos.tikk@gravityrd.com ECML/PKDD, 25TH SEPTEMBER 2012, BRISTOL
  • 2. Overview • Implicit feedback problem • Context-awareness ▫ Seasonality ▫ Sequentaility • iTALS ▫ Model ▫ Learning ▫ Prediction • Experiments
  • 3. Feedback types • Feedback: user-item interraction (events) • Explicit: ▫ Preferences explicitely coded ▫ E.g.: Ratings • Implicit: ▫ Preferences not coded explicitely ▫ E.g.: purchase history
  • 4. Problems with implicit feedback • Noisy positive preferences ▫ E.g.: bought & disappointed • No negative feedback available ▫ E.g.: had no info on item • Usually evaluated by ranking metrics ▫ Can not be directly optimized
  • 5. Why to use implicit feedback? • Every user provides • Some magnitudes larger amount of information than explicit feedback • More important in practice ▫ Explicit algorithms are for the biggest only
  • 6. Context-awareness • Context: any information associated with events • Context state: a possible value of the context dimension • Context-awareness ▫ Usage of context information ▫ Incorporating additional informations into the method ▫ Different predictions for same user given different context states ▫ Can greatly outperform context-unaware methods  Context segmentates items/users well
  • 7. Seasonality as context • Season: a time period ▫ E.g.: a week • Timeband: given interval in season ▫ Context-states ▫ E.g.: days • Assumed: ▫ aggregated behaviour in a given timeband is similar inbetween seasons User Item Date Context ▫ and different for different 1 A 12/07/2010 1 timebands 2 B 15/07/2010 3 ▫ E.g.: daily/weekly routines 1 B 15/07/2010 3 … … … 1 A 19/07/2010 1
  • 8. Sequentiality • Bought A after B ▫ B is the context-state of the user’s event on A • Usefullness ▫ Some items are bought together ▫ Some items bought repetetively ▫ Some are not • Association rule like information incorporated into model as context ▫ Here: into factorization methods • Can learn negated rules ▫ If C then not A
  • 9. M3 iTALS - Model • Binary tensor ▫ D dimensional User ▫ User – item – context(s) M1 T • Importance weights ▫ Lower weights to zeroes (NIF) ▫ Higher weights to cells with Item more events M2 • Cells approximated by sum of the values in the elementwise product of D vectors ▫ Each for a dimension ▫ Low dimensional vectors
  • 11. iTALS - Prediction • Sum of values in elementwise product of vectors ▫ User-item: scalar product of feature vectors ▫ User-item-context: weighted scalar product of feature vectors • Context-state dependent reweighting of features • E.g.: ▫ Third feature = horror movie ▫ Context state1 = Friday night  third feature high ▫ Context state2 = Sunday afternoon  third feature low
  • 12. Experiments • 5 databases ▫ 3 implicit  Online grocery shopping  VoD consumption  Music listening habits ▫ 2 implicitizied explicit  Netflix  MovieLens 10M • Recall@20 as primary evaluation metric • Baseline: context-unaware method in every context- state
  • 13. Scalability Running times on the Grocery dataset 350 300 iALS epoch Running time (sec) 250 iTALS (time bands) epoch 200 iTALS (sequence) epoch 150 iALS calc. matrix 100 iTALS (time bands) calc. matrix 50 iTALS (sequence) calc. matrix 0 10 20 30 40 50 60 70 80 90 100 Number of factors (K)
  • 14. Results – Recall@20 0.1200 0.1000 0.0800 0.0600 0.0400 0.0200 0.0000 VoD Grocery LastFM Netflix MovieLens iALS (MF method) iCA baseline iTALS (seasonality) iTALS (sequence)
  • 15. Results – Precision-recall curves 0.25 Recall-precision (Grocery) 0.20 0.15 0.10 iALS 0.05 0.00 iCA baseline 0.00 0.05 0.10 0.15 0.20 Recall-precision (VoD) iTALS 0.08 (season) 0.06 iTALS (sequence) 0.04 0.02 0.00 0.00 0.05 0.10 0.15 0.20
  • 16. Summary • iTALS is a ▫ scalable ▫ context-aware ▫ factorization method ▫ on implicit feedback data • The problem is modelled ▫ by approximating the values of a binary tensor ▫ with elementwise product of short vectors ▫ using importance weighting • Learning can be efficiently done by using ▫ ALS ▫ and other computation time reducing tricks • Recommendation accuracy is significantly better than iCA-baseline • Introduced a new context type: sequentiality ▫ association rule like information in factorization framework
  • 17. Thank you for your attention! For more of my recommender systems related research visit my website: http://www.hidasi.eu T