Music Personalization At Spotify

Music Personalization
@
Spotify
Vidhya Murali
@vid052
RecSys 2016

Spotify’s Big Data
‣ Started in 2006, now available in 58 countries
‣ 100+ million active users, 35+ million paid subscribers
‣ 30+ million songs in our catalog, ~20K added every
day
‣ 2+ billion playlists
‣ 1 TB of log data every day
‣ Hadoop cluster with ~2500 nodes

Personalization @ Spotify
Features:
Discover
Discover Weekly
Fresh Finds
Home
Radio
Release Radar
5

Approaches
‣Manual Curation by Experts
‣Metadata (e.g: Label Provided Data, News, Blogs)
‣Audio Signals
‣Collaborative Filtering
‣ Hybrid

Latent Factor Models
“Compact” representation for each user and items(songs): f-dimensional
vectors

vectors
Vidhya
Rise
.. . . . .
.. . . . .
.. . . . .
.. . . . .
.. . . . .
.. .
.. .
.. .
.. .
. .
...
...
...
...
..
mUsers
Songs

vectors
Vidhya
Rise
.. . . . .
.. . . . .
.. . . . .
.. . . . .
.. . . . .
.. .
.. .
.. .
.. .
. .
...
...
...
...
..
mUsers
Songs
User Vector
Matrix: X: (m x f)

vectors
Vidhya
Rise
.. . . . .
.. . . . .
.. . . . .
.. . . . .
.. . . . .
.. .
.. .
.. .
.. .
. .
...
...
...
...
..
mUsers
Songs
User Vector
Matrix: X: (m x f)
Song Vector
Matrix: Y: (n x f)

vectors
(here, f = 2)
Vidhya
Rise
.. . . . .
.. . . . .
.. . . . .
.. . . . .
.. . . . .
.. .
.. .
.. .
.. .
. .
...
...
...
...
..
mUsers
Songs
User Vector
Matrix: X: (m x f)
Song Vector
Matrix: Y: (n x f)

NLP Models work great on Playlists!

Document : Playlist

Document : Playlist
Word : Song

[1] http://benanne.github.io/2014/08/05/spotify-cnns.html
Deep Learning on Audio

Vectors
“COMPACT” representation for users and items musical fingerprint.
Normalized Song Vectors

Vectors
“COMPACT” representation for users and items musical fingerprint.
User Vector

Why Vectors?
Encodes higher order dependencies
Users and Items in the same latent space
User - Item recommendations
Item - Item similarities
Easy to scale up
Complexity is linear in order of latent factors

Recommendations
15
User Vector

Ranking
Similarity score can be used for ranking

Ranking
Balance relevance, diversity, popularity, freshness

Ranking
Heuristic based

Ranking
Heuristic based
MAB
Interactions
Impressions
Clicks
Streams

Music Personalization Data Flow

Challenges Unique to Spotify
Scale of catalog
Music is “niche”
Music consumption has heavy correlation to users’ context
Repeated consumption of music is NOT so uncommon.

Challenge Accepted!
Cold start problem for both users and new music/upcoming artists:
Content Based Signals
Real Time Recommendations
Measuring Quality:
Implicit: A/B Test Metrics
Explicit: Feedback from social forums
Scam Attacks:
Rule based model to detect scammers
Humans choices are not always predictable:
Faith in humanity

What Next?
‣Personalization!
‣Content signals such as lyrics, audio, images
‣Expanded Catalog: Shows, Podcasts
‣New Markets
21

Thank You!
You can reach me @
Email: vidhya@spotify.com
Twitter: @vid052
23

Music Personalization At Spotify

Recomendados

Recomendados

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Destacado

Destacado (7)

Similar a Music Personalization At Spotify

Similar a Music Personalization At Spotify (20)

Último

Último (20)

Music Personalization At Spotify