SlideShare una empresa de Scribd logo
#nowplaying-RS: A New Benchmark
Dataset for Building Context-Aware
Music Recommender Systems
ASMITA PODDAR EVA ZANGERLE YI-HSUAN YANG
National Institute of University of Academia Sinica,
Technology, Rourkela, India Innsbruck, Austria Taiwan
05/07/2018 Asmita Poddar
Motivation
 Twitter and Spotify are ideal for information retrieval and
analysis
 We need a consolidated dataset containing different data types
like item content features, user contexts, as well as timestamps
of the user-item association.
 Develop an integrated system to jointly model different data
types relevant to different kinds of recommendation.
05/07/2018 Asmita Poddar
Outline
 Related datasets
 About #nowplaying-RS
• What?
• How?
• Statistics
 Recommendation tasks
• Methods
• Evaluation metric
• Baseline Results
05/07/2018 Asmita Poddar
Related Datasets for Music
Recommendation
05/07/2018 Asmita Poddar
The Dataset
The #Nowplaying-RS dataset consists of 11.6 million Listening
Events (LEs) including:
140K Users
350K Tracks
45K Artists
45K Hashtags
5290 Sentiments
05/07/2018 Asmita Poddar
How?
05/07/2018 Asmita Poddar
05/07/2018 Asmita Poddar
Distribution of Tweets Over Time
 Poor speaking is a reaction to fear.
 Presentations are not journal articles. They’re a
completely different communication, and they
require different skills.
05/07/2018 Asmita Poddar
Red bars represent the number of unique users
Blue bars represent the number of tweets.
Histogram showing the variation of average
sentiment scores of the hashtags in the dataset.
The sentiment sores have been scaled in the
range of [0,1].
05/07/2018 Asmita Poddar
Affective Information
General LEs
05/07/2018 Asmita Poddar
Histograms of Median Time Distances
Per User
LEs with affect-related hashtags
The Experiments
 Context-aware
Recommendation
05/07/2018 Asmita Poddar
 Context-aware
Next-song
Recommendation
Context-Aware Recommendation
05/07/2018 Asmita Poddar
Context-aware Next-Song Recommendation
 Sequence of songs listened to by the user can be created
from the timestamps
 Mean length of sequences per user:123
 Median length of sequences per user: 13
 We use previous-”N” songs listened to by a user to create
recommendations
05/07/2018 Asmita Poddar
Creation of Training and Test Sets
05/07/2018 Asmita Poddar
Scenarios
05/07/2018 Asmita Poddar
Method
05/07/2018 Asmita Poddar
Factorization Machines
Supervised learning model
They model interactions of variables by mapping them (the
interactions) to a low dimensional space
[ Steffen Rendle. 2012. Factorization Machines with libFM. ACM Trans. Intell.
Syst. Technol. 3, 3, Article 57 (May 2012), 22 pages. DOI=
http://dx.doi.org/10.1145/2168752.2168771 ]
Evaluation Metric
05/07/2018 Asmita Poddar
Mean Reciprocal Rank (MRR)
Results
05/07/2018 Asmita Poddar
Conclusion
05/07/2018 Asmita Poddar
 We present the #nowplaying-RS dataset, which can be
used to compare and evaluate large-scale
recommendation approaches in a real-life setting.
 We demonstrate various features of the dataset.
 We showcase two possible use cases of the dataset in our
paper.
Future work
05/07/2018 Asmita Poddar
 Use the combination of mood-related hashtags and timestamps
to track how people use music to modulate their emotion.
 Explore the scope of using the various features of the dataset to
improve music recommendation quality using neural networks.
 Perform session-based music recommendation.
Links and Contact Info
05/07/2018 Asmita Poddar
 Link to dataset: http://dbis-nowplaying.uibk.ac.at/#nowplayingrs
 Link to implementation code:
https://github.com/asmitapoddar/nowplaying-RS-Music-Reco-FM
: asmita_poddar
: asmita.poddar@gmail.com
: https://sites.google.com/view/asmitapoddar
Thank You!
05/07/2018 Asmita Poddar

Más contenido relacionado

Similar a #nowplaying-RS: A New Benchmark Dataset for Building Context-Aware Music Recommender Systems

Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
Osman Ali
 
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
IRJET Journal
 
Metadata as Standard: improving Interoperability through the Research Data Al...
Metadata as Standard: improving Interoperability through the Research Data Al...Metadata as Standard: improving Interoperability through the Research Data Al...
Metadata as Standard: improving Interoperability through the Research Data Al...
AIMS (Agricultural Information Management Standards)
 
Music Recommendation System
Music Recommendation SystemMusic Recommendation System
Music Recommendation System
IRJET Journal
 
Data mining with Rattle For R
Data mining with Rattle For RData mining with Rattle For R
Data mining with Rattle For R
Akhil Anil
 
Big data and machine learning @ Spotify
Big data and machine learning @ SpotifyBig data and machine learning @ Spotify
Big data and machine learning @ Spotify
Oscar Carlsson
 
Graph Analysis Trends and Opportunities -- CMG Performance and Capacity 2014
Graph Analysis Trends and Opportunities -- CMG Performance and Capacity 2014Graph Analysis Trends and Opportunities -- CMG Performance and Capacity 2014
Graph Analysis Trends and Opportunities -- CMG Performance and Capacity 2014
Jason Riedy
 
A study of Data Quality and Analytics
A study of Data Quality and AnalyticsA study of Data Quality and Analytics
A study of Data Quality and Analytics
Ali Habeeb
 
music recommendation system ,Based on Million Song Dataset
music recommendation system ,Based on Million Song Datasetmusic recommendation system ,Based on Million Song Dataset
music recommendation system ,Based on Million Song Dataset
SandipKumarPratihari
 
Emofy
Emofy Emofy
BATCH 1 FIRST REVIEW-1.pptx
BATCH 1 FIRST REVIEW-1.pptxBATCH 1 FIRST REVIEW-1.pptx
BATCH 1 FIRST REVIEW-1.pptx
SurajRavi16
 
Skillshare - Let's talk about R in Data Journalism
Skillshare - Let's talk about R in Data JournalismSkillshare - Let's talk about R in Data Journalism
Skillshare - Let's talk about R in Data Journalism
School of Data
 
NESMA - More than just points
NESMA - More than just pointsNESMA - More than just points
NESMA - More than just points
Nesma
 
IRJET- Feeling based Music Recommnendation System using Sensors
IRJET- Feeling based Music Recommnendation System using SensorsIRJET- Feeling based Music Recommnendation System using Sensors
IRJET- Feeling based Music Recommnendation System using Sensors
IRJET Journal
 
flankr: EPS presentation
flankr: EPS presentationflankr: EPS presentation
flankr: EPS presentation
JimGrange
 
Data analytics using R programming
Data analytics using R programmingData analytics using R programming
Data analytics using R programming
Umang Singh
 
Top 10 statistics tools to get better data insights
Top 10 statistics tools to get better data insightsTop 10 statistics tools to get better data insights
Top 10 statistics tools to get better data insights
Stat Analytica
 
Report for internship
Report for internshipReport for internship
Report for internship
Salman Khan
 
PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...
PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...
PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...
Mladen Jovanovic
 
ATAGTR2017 Analytics Testing
ATAGTR2017 Analytics TestingATAGTR2017 Analytics Testing
ATAGTR2017 Analytics Testing
Agile Testing Alliance
 

Similar a #nowplaying-RS: A New Benchmark Dataset for Building Context-Aware Music Recommender Systems (20)

Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
IRJET- Implementation of Emotion based Music Recommendation System using SVM ...
 
Metadata as Standard: improving Interoperability through the Research Data Al...
Metadata as Standard: improving Interoperability through the Research Data Al...Metadata as Standard: improving Interoperability through the Research Data Al...
Metadata as Standard: improving Interoperability through the Research Data Al...
 
Music Recommendation System
Music Recommendation SystemMusic Recommendation System
Music Recommendation System
 
Data mining with Rattle For R
Data mining with Rattle For RData mining with Rattle For R
Data mining with Rattle For R
 
Big data and machine learning @ Spotify
Big data and machine learning @ SpotifyBig data and machine learning @ Spotify
Big data and machine learning @ Spotify
 
Graph Analysis Trends and Opportunities -- CMG Performance and Capacity 2014
Graph Analysis Trends and Opportunities -- CMG Performance and Capacity 2014Graph Analysis Trends and Opportunities -- CMG Performance and Capacity 2014
Graph Analysis Trends and Opportunities -- CMG Performance and Capacity 2014
 
A study of Data Quality and Analytics
A study of Data Quality and AnalyticsA study of Data Quality and Analytics
A study of Data Quality and Analytics
 
music recommendation system ,Based on Million Song Dataset
music recommendation system ,Based on Million Song Datasetmusic recommendation system ,Based on Million Song Dataset
music recommendation system ,Based on Million Song Dataset
 
Emofy
Emofy Emofy
Emofy
 
BATCH 1 FIRST REVIEW-1.pptx
BATCH 1 FIRST REVIEW-1.pptxBATCH 1 FIRST REVIEW-1.pptx
BATCH 1 FIRST REVIEW-1.pptx
 
Skillshare - Let's talk about R in Data Journalism
Skillshare - Let's talk about R in Data JournalismSkillshare - Let's talk about R in Data Journalism
Skillshare - Let's talk about R in Data Journalism
 
NESMA - More than just points
NESMA - More than just pointsNESMA - More than just points
NESMA - More than just points
 
IRJET- Feeling based Music Recommnendation System using Sensors
IRJET- Feeling based Music Recommnendation System using SensorsIRJET- Feeling based Music Recommnendation System using Sensors
IRJET- Feeling based Music Recommnendation System using Sensors
 
flankr: EPS presentation
flankr: EPS presentationflankr: EPS presentation
flankr: EPS presentation
 
Data analytics using R programming
Data analytics using R programmingData analytics using R programming
Data analytics using R programming
 
Top 10 statistics tools to get better data insights
Top 10 statistics tools to get better data insightsTop 10 statistics tools to get better data insights
Top 10 statistics tools to get better data insights
 
Report for internship
Report for internshipReport for internship
Report for internship
 
PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...
PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...
PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...
 
ATAGTR2017 Analytics Testing
ATAGTR2017 Analytics TestingATAGTR2017 Analytics Testing
ATAGTR2017 Analytics Testing
 

Último

Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
saastr
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Jeffrey Haguewood
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
Webinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data WarehouseWebinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data Warehouse
Federico Razzoli
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
IndexBug
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
Ivanti
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
Edge AI and Vision Alliance
 
Project Management Semester Long Project - Acuity
Project Management Semester Long Project - AcuityProject Management Semester Long Project - Acuity
Project Management Semester Long Project - Acuity
jpupo2018
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 

Último (20)

Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
Deep Dive: AI-Powered Marketing to Get More Leads and Customers with HyperGro...
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
Webinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data WarehouseWebinar: Designing a schema for a Data Warehouse
Webinar: Designing a schema for a Data Warehouse
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial IntelligenceAI 101: An Introduction to the Basics and Impact of Artificial Intelligence
AI 101: An Introduction to the Basics and Impact of Artificial Intelligence
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
June Patch Tuesday
June Patch TuesdayJune Patch Tuesday
June Patch Tuesday
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
“Building and Scaling AI Applications with the Nx AI Manager,” a Presentation...
 
Project Management Semester Long Project - Acuity
Project Management Semester Long Project - AcuityProject Management Semester Long Project - Acuity
Project Management Semester Long Project - Acuity
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 

#nowplaying-RS: A New Benchmark Dataset for Building Context-Aware Music Recommender Systems

  • 1. #nowplaying-RS: A New Benchmark Dataset for Building Context-Aware Music Recommender Systems ASMITA PODDAR EVA ZANGERLE YI-HSUAN YANG National Institute of University of Academia Sinica, Technology, Rourkela, India Innsbruck, Austria Taiwan 05/07/2018 Asmita Poddar
  • 2. Motivation  Twitter and Spotify are ideal for information retrieval and analysis  We need a consolidated dataset containing different data types like item content features, user contexts, as well as timestamps of the user-item association.  Develop an integrated system to jointly model different data types relevant to different kinds of recommendation. 05/07/2018 Asmita Poddar
  • 3. Outline  Related datasets  About #nowplaying-RS • What? • How? • Statistics  Recommendation tasks • Methods • Evaluation metric • Baseline Results 05/07/2018 Asmita Poddar
  • 4. Related Datasets for Music Recommendation 05/07/2018 Asmita Poddar
  • 5. The Dataset The #Nowplaying-RS dataset consists of 11.6 million Listening Events (LEs) including: 140K Users 350K Tracks 45K Artists 45K Hashtags 5290 Sentiments 05/07/2018 Asmita Poddar
  • 8. Distribution of Tweets Over Time  Poor speaking is a reaction to fear.  Presentations are not journal articles. They’re a completely different communication, and they require different skills. 05/07/2018 Asmita Poddar Red bars represent the number of unique users Blue bars represent the number of tweets.
  • 9. Histogram showing the variation of average sentiment scores of the hashtags in the dataset. The sentiment sores have been scaled in the range of [0,1]. 05/07/2018 Asmita Poddar Affective Information
  • 10. General LEs 05/07/2018 Asmita Poddar Histograms of Median Time Distances Per User LEs with affect-related hashtags
  • 11. The Experiments  Context-aware Recommendation 05/07/2018 Asmita Poddar  Context-aware Next-song Recommendation
  • 13. Context-aware Next-Song Recommendation  Sequence of songs listened to by the user can be created from the timestamps  Mean length of sequences per user:123  Median length of sequences per user: 13  We use previous-”N” songs listened to by a user to create recommendations 05/07/2018 Asmita Poddar
  • 14. Creation of Training and Test Sets 05/07/2018 Asmita Poddar
  • 16. Method 05/07/2018 Asmita Poddar Factorization Machines Supervised learning model They model interactions of variables by mapping them (the interactions) to a low dimensional space [ Steffen Rendle. 2012. Factorization Machines with libFM. ACM Trans. Intell. Syst. Technol. 3, 3, Article 57 (May 2012), 22 pages. DOI= http://dx.doi.org/10.1145/2168752.2168771 ]
  • 17. Evaluation Metric 05/07/2018 Asmita Poddar Mean Reciprocal Rank (MRR)
  • 19. Conclusion 05/07/2018 Asmita Poddar  We present the #nowplaying-RS dataset, which can be used to compare and evaluate large-scale recommendation approaches in a real-life setting.  We demonstrate various features of the dataset.  We showcase two possible use cases of the dataset in our paper.
  • 20. Future work 05/07/2018 Asmita Poddar  Use the combination of mood-related hashtags and timestamps to track how people use music to modulate their emotion.  Explore the scope of using the various features of the dataset to improve music recommendation quality using neural networks.  Perform session-based music recommendation.
  • 21. Links and Contact Info 05/07/2018 Asmita Poddar  Link to dataset: http://dbis-nowplaying.uibk.ac.at/#nowplayingrs  Link to implementation code: https://github.com/asmitapoddar/nowplaying-RS-Music-Reco-FM : asmita_poddar : asmita.poddar@gmail.com : https://sites.google.com/view/asmitapoddar

Notas del editor

  1. Asmita – NITR At Taiwan TIGP Scholarship internship
  2. Who cares? Spotify, youtube Smart applications Twitter: 336 mio (11% of internet connected people), Spotify: 160 mio (5%) Users of music streaming services are becoming more particular about the songs they want to listen – depending on their mood, activity, time of day etc. Hence, streaming companies like Pandora, Spotify, to stay more competitive in todays’s market have to provide music recommendations which are relevant to a user. To do so, the best way is to train your RS based on context of the user, ie, context-aware music recommendation. Next song generation, sequential RS, session based RS, playlist generation etc. are recommendation tasks. To develop algos and do research for these for these tasks, we need a large scale dataset which will provide the necessary data. Address different types of recommendation like context-based, sequence-based/ next-song or session-based recommendation. it is a largescale, context-aware dataset; ii) the dataset contains timestamps for LEs, for sequence-based recommendation; iii) the hashtags contained are a unique feature : give an idea of the listening context and emotional state of the user at time of listening to track; for sentiment-aware approaches to recommendation and retrieval; iv) contains item content features of the songs v) it facilitates the development of an integrated system that jointly models different data types relevant to recommendation Develop algos for music recommendation for personalise music streaming, smart cars, smart speakers etc.
  3. Before talking about the details of #nowplaying-RS, I want to give an overview of the other music datasets that can be used for music recommendation. (Be sure of the dates and references of these datasets). Uniqueness: First, we provide a publicly available and extensive dataset of LEs, particularly suited for (sequential) context-aware recommendations. Second, we provide a great variety of context and content information about the users and tracks, as well as clues of the underlying emotions, activities, and thoughts of the users through hashtags.
  4. Sentiments are those sentiment scores of hashtags, which can be resolved using at least one of the 5 sentiment dictionaries: AFINN [26], Opinion Lexicon [27], SentiStrength [28], Vader [29] and the Sentiment Hashtag lexicon
  5. Barplot depicting the weekly distribution of the number of tweets (LEs) and number of users tweeting over time. Each bar represents the span of a week. Red bars represent the number of unique users and blue bars represent the number of tweets. This has been used to split the dataset into training and test sets.
  6. Lot od contextual information can be retrieved from the hashtags as can be seen from the examples. The sentiment information can give us valuable intelligence about the moods of the user. We get a wide range of mood information from the dataset. We can derive the change in mood of the user over time.
  7. This information is useful while making sequence based reco. We see that there are larger number of users with LEs having a short time span (50% of users have a time span of 0–37 hours) between successive LEs in general. About 50% of users who have used affect-related hashtags with LEs have a time span of 0–52 hours between successive LEs. This allows us to model user preferences or mood on a nearly daily basis, making this dataset useful for sequence-based recommendation.
  8. For our experiments, we took into account the user’s most recent song preference into consideration by taking N = 1, i.e., the “last song” that a user had listened to is used as the context information to infer the ”next song” that a user would likely listen to.
  9. The resulting set of 10 tracks can be subsequently used as input to an RS. Concretely, it can be understood that track recommendation under the POP USER population method is more difficult, because all the 10 candidate tracks are known to the user and likely the recommender has to rely on contextual features to pick the right one.
  10. Work well with sparse datasets. RS usually creates a sparse matrix
  11. Higher the MRR, better the recommendation
  12. Evaluation results in mean reciprocal rank (MRR) for (a) context-aware recommendation for the POP RND and POP USER settings and (b) context-aware next song recommendation for N = 1, using the POP RND setting. ‘CB’ and ‘Cx’ indicate whether the method is content-based or context-based, respectively. [Base] indicates a part of the input to FM. For (a) [Base] = User ID+Track ID and (b) [Base] = User ID+Track ID+Previous Track ID. We highlight the top two results per column using bold font.