2. Spotify in Numbers
• Started in 2006, now available in 59 markets
• 100+ Million active users
• 30+ Million tracks
• 20,000 new songs added per day
• 2+ Billion user-generated playlists
7. Today, we’ll talk about 3 types of models
‣ Latent Factor Models
‣ Deep Learning Audio models
‣ NLP models (which are also latent factor models …)
8. Let's start off with Latent Factor Models
“Compact” representation for each user and item (song): f-dimensional vectors
[Figure: users (e.g. Rohan) and songs (e.g. Track a) each map to a row of a latent factor matrix]
User Vector Matrix X: (m x f), for m users · Song Vector Matrix Y: (n x f), for n songs
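As a sketch of how these matrices are used (toy numbers and hypothetical track names, not real learned vectors): a user's predicted affinity for a song is the dot product of the two f-dimensional vectors.

```python
# Toy f = 3 latent vectors (illustrative values, not learned from data).
user_vectors = {"rohan": [0.9, 0.1, 0.4]}        # rows of X (m x f)
song_vectors = {"track_a": [0.8, 0.0, 0.5],      # rows of Y (n x f)
                "track_b": [0.1, 0.9, 0.2]}

def score(user, track):
    """Predicted affinity: dot product of the user and song latent vectors."""
    return sum(u * t for u, t in zip(user_vectors[user], song_vectors[track]))

print(score("rohan", "track_a"))  # higher score -> better candidate to recommend
print(score("rohan", "track_b"))
```

Ranking songs by this score for a given user is the basic recommendation step once X and Y have been learned.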
9. If we were to visualize a few Artist Latent Factors
10. Implicit Feedback (Hu et al. 2008)
‣ If a user u listens to an item i, the dot product of the user vector and item vector should be as close to 1 as possible.
‣ Also takes into account the confidence that user u likes item i
‣ Solve with Alternating Gradient Descent or Alternating Least Squares (ALS)
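The model above can be sketched as a toy weighted-ALS loop, assuming NumPy; the play counts, `alpha`, and `lam` are illustrative values, not the paper's or Spotify's settings.

```python
import numpy as np

# Toy implicit-feedback play counts R (3 users x 4 songs); illustrative only.
R = np.array([[5, 0, 1, 0],
              [0, 3, 0, 2],
              [4, 0, 0, 1]], dtype=float)

alpha, lam, f = 40.0, 0.1, 2
P = (R > 0).astype(float)   # binary preference p_ui: 1 if listened, else 0
C = 1.0 + alpha * R         # confidence c_ui = 1 + alpha * r_ui

rng = np.random.default_rng(0)
X = rng.normal(scale=0.1, size=(R.shape[0], f))   # user factors
Y = rng.normal(scale=0.1, size=(R.shape[1], f))   # song factors

def als_step(fixed, P, C, lam):
    """Closed-form update for one side's factors, other side held fixed."""
    out = np.empty((P.shape[0], fixed.shape[1]))
    for u in range(P.shape[0]):
        Cu = np.diag(C[u])
        A = fixed.T @ Cu @ fixed + lam * np.eye(fixed.shape[1])
        out[u] = np.linalg.solve(A, fixed.T @ Cu @ P[u])
    return out

for _ in range(10):          # alternate: solve for users, then for songs
    X = als_step(Y, P, C, lam)
    Y = als_step(X, P.T, C.T, lam)

pred = X @ Y.T               # approaches 1 where confidence is high
```

Each half-step is an ordinary ridge regression, which is why the alternation converges quickly in practice.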
11. Logistic Matrix Factorization (Johnson 2014)
‣ Model the probability of a user clicking on an item with the logistic function.
‣ Maximize the likelihood of the observations R, given ….
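The per-interaction probability in Logistic MF can be sketched in plain Python; the bias terms `b_u` and `b_i` follow Johnson 2014, but the numbers here are made up.

```python
import math

def p_interaction(x_u, y_i, b_u=0.0, b_i=0.0):
    """Logistic MF: P(user u interacts with item i) is the logistic
    (sigmoid) of the dot product plus per-user / per-item biases."""
    z = sum(a * b for a, b in zip(x_u, y_i)) + b_u + b_i
    return 1.0 / (1.0 + math.exp(-z))

p = p_interaction([0.9, 0.1], [0.8, 0.0], b_u=0.1)  # a value in (0, 1)
```

Training then maximizes the likelihood of the observed interaction matrix under these probabilities, with regularization on the vectors.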
12. Recent Advances in MF
‣ Different loss functions (rank loss)
‣ Use of side information (demographics, metadata)
‣ Use of context (where, when)
‣ Deep Learning CF models
13. Deep Learning on Audio
http://benanne.github.io/2014/08/05/spotify-cnns.html
14. NLP Models For Recommendations
Document : User Session
Word : Song
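Under this session-as-document analogy, skip-gram training pairs can be generated from a listening session like this (toy track names, hypothetical helper):

```python
def skipgram_pairs(session, window=2):
    """Each track is a 'word'; its context is the neighbouring tracks
    in the same listening session, within the given window."""
    pairs = []
    for i, center in enumerate(session):
        lo, hi = max(0, i - window), min(len(session), i + window + 1)
        pairs.extend((center, session[j]) for j in range(lo, hi) if j != i)
    return pairs

session = ["track_a", "track_b", "track_c"]
print(skipgram_pairs(session, window=1))
# [('track_a', 'track_b'), ('track_b', 'track_a'),
#  ('track_b', 'track_c'), ('track_c', 'track_b')]
```

These (center, context) pairs are exactly what a word2vec-style model consumes; no text is involved, only co-listened tracks.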
15. Word2Vec (Mikolov et al. 2013)
‣ Each word / track has an input and an output vector representation.
‣ Output is a vector space with similar items living close to each other in cosine distance (and awesome vector algebra properties).
[Figure: skip-gram architecture with a softmax output layer]
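Similarity in the learned space is just cosine distance; a minimal sketch:

```python
import math

def cosine_similarity(u, v):
    """Tracks played in similar contexts end up close in cosine distance."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

print(cosine_similarity([1.0, 0.0], [1.0, 0.0]))  # 1.0
```

Nearest neighbours under this measure give "tracks similar to X", and the vector-algebra property lets you combine vectors (e.g. sums of a user's recent tracks) before searching.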
16. Sequential Data? RNNs?
‣ Output layer is the same as word2vec: a softmax. Predict the next item from the hidden state.
‣ Recurrent connection carries the session history.
‣ Learn output weights and biases for each item.
https://erikbern.com/2014/06/28/recurrent-neural-networks-for-collaborative-filtering/
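A minimal sketch of the recurrent update and softmax output described above, in plain Python with toy weights `W` and `U` (hypothetical dimensions, not the blog post's model):

```python
import math

def matvec(M, v):
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in M]

def rnn_step(h_prev, x, W, U):
    """Recurrent connection: h_t = tanh(W h_{t-1} + U x_t)."""
    z = [a + b for a, b in zip(matvec(W, h_prev), matvec(U, x))]
    return [math.tanh(v) for v in z]

def softmax(scores):
    """Same output layer as word2vec: a distribution over candidate next tracks."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

# Toy 2-dim hidden state; x is the current track's input representation.
W = [[0.1, 0.0], [0.0, 0.1]]
U = [[0.5, 0.0], [0.0, 0.5]]
h = rnn_step([0.0, 0.0], [1.0, 0.0], W, U)
probs = softmax([1.0, 2.0, 3.0])   # per-item scores -> next-track probabilities
```

The hidden state `h` is updated once per played track, so the prediction at each step conditions on the whole session so far.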
17. User Representations?
‣ Word2vec can output word / track representations, but what about user representations?
‣ Simple aggregation (bag of words)?
Averaging problems
‣ Doc2Vec?
Retrain every time there is new user activity
‣ Clustering?
Loses vector-addition information
‣ Learn the user vector through an RNN?
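The bag-of-words aggregation above is just a component-wise mean of the user's track vectors (toy data; hypothetical helper). It ignores listening order entirely, which is the "averaging problem" that motivates the RNN alternatives.

```python
def average_user_vector(track_vectors):
    """Bag-of-words user representation: mean of the user's track vectors.
    Discards all sequence information about how the tracks were played."""
    f = len(track_vectors[0])
    n = len(track_vectors)
    return [sum(v[k] for v in track_vectors) / n for k in range(f)]

print(average_user_vector([[1.0, 0.0], [0.0, 1.0]]))  # [0.5, 0.5]
```

Note that two users with very different tastes can average to the same mid-point vector, another face of the same problem.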
18. Another RNN approach
‣ Assume item vectors are fixed
‣ Try to learn the next item vector in the sequence
‣ For long-term intents, train the RNN to predict further ahead in the future
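Since the item vectors are held fixed, the training objective can be sketched as regressing the next track's vector (hypothetical helper, a sketch of the objective only):

```python
def next_vector_loss(predicted, target):
    """Squared distance between the RNN's predicted vector and the
    (fixed) vector of the track that actually came next."""
    return sum((p - t) ** 2 for p, t in zip(predicted, target))

print(next_vector_loss([0.5, 0.5], [1.0, 0.0]))  # 0.5
```

Predicting k steps ahead with the same loss, rather than just the immediate next track, is one way to push the model toward longer-term intent.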
19. Challenges, what lies ahead
Side information in embedding models: removing regional biases, external genre information, lyrics, Facebook / Twitter account data, [cover art, who knows :)]
Deep Learning
Transfer Learning
Outlier Detection
20. Thank You!
You can reach me @
Email: rohanag@spotify.com
Twitter: @rohanag