SlideShare una empresa de Scribd logo
1 de 31
Denis studying/working to be a faculty/researcher (Denis Parra || Denis Parra-Santander) PhD Student http://www.sis.pitt.edu/~dparra/ 1 March 18th 2011 PAWS Lab – School of information Sciences – University of Pittsburgh
What is this presentation about? A short introduction of myself A description of my research interests and what I have been doing about it in the latest years 2
I.1 Where are you from? I am from Chile, a country that looks like a chile pepper, but, paradoxically, people don’t eat much spicy food. Chile ≠ [red hot chile pepper]      &&      Chile ≠ México 3
I.2 Are youfrom Santiago, the capital? Good try. One third of the 16 million Chileans lives in Santiago. But Chile is a looong country, in the north is hot and dry, in the south is very cold. I live in Valdivia, a city with rainy weather. Very Hot! Here I Live! Valdivia Very Cold! 4
I.3 Which activities do you like to do? I like playing tennis, running & rowing I like writing poetry. Check some poems herein Spanish (translated to English) I like reading novels, my favorite authors are J. L. Borges, Fyodor Dostoyevsky & James Joyce (right now I’m reading a Roberto Bolaño’s novel) I like listening to music, from Blues to Lady Gaga, passing by Pink Floyd, Radiohead and Los Jaivas. I like watching movies like “A Clockwork Orange” by S. Kubrick and “Underground” by E. Kusturica. I also like surrealistic movies like “The Holy Mountain” by Alejandro Jodorowsky. 5
I.4 OK, but now let’s talk about work… (1997 - 2002) I have BS in Engineering with emphasis in Informatics from Universidad Austral de Chile. This is a 6 year program, my undergrad thesis was titled “SPORAS: An Adaptive Web Platform based on a Multiagent System and Ontologies” (my first link to Dr. Brusilovsky’s field, Adaptive Hypermedia) Then, I worked in several projects of e-learning, developing on Open Source LMS such as Dokeos and Moodle (2003-2004) later on, I worked as IT Manager and consultant for an aquaculture company, Aqua Cards, in the South of Chile (2005-2007) I was also teaching OOP (Java), Matlab, and Introduction to Software Engineering (2004, 2006-2007) In 2007 I co-founded a company, Perceptum TI. 6
I.5 … and what about research? In 2008 I started the PhD program and I joined the PAWS lab (lead by Dr. Brusilovsky) … so here is where this presentation starts Tag-based recommendations Spreading Activation for recommender systems Related projects CourseAgent TagTheMap Conference Navigator Latent Communities This Presentation “Walk the Talk”: Mapping explicit  7
I.6.1 Tag-based recommendations Main topic: Lack of ratings in most items of many systems pushes to look for alternatives to apply user and item-based Collaborative Filtering. We explore 2 variants: neighbor-weighted and tag-based BM25. Presented a workshop paper in HT’09, p1 Presented a short-paper (poster) at Recsys ’09, p2 Presented a short-paper at WI 09, p3 8
I.6.2 Spreading activation Presented a paper in a Workshop of Recsys 2009, p4 Look for a way to apply Spreading activation for recommendations in order to: Make use of the multidimensional network structure of Folksonomies (users, items, tags) Find an scalable algorithm (compared to state-of-the art FolkRank, SVD and LDA-based) that makes use of local topology/neighborhood 9
I.6.3. Related Projects 10
Part II: so finally… This project is based on the work of my internship at Telefónica Research (Barcelona, Spain) in the Summer of 2010 Paper submitted to UMAP 2011:  Walk the Talk Analyzing the relation between implicit and explicit feedback for preference elicitation (I am co-author with Dr. Xavier Amatriain) 11
II.1 Introduction (1/2) Explicit feedback: scarcity (people are not especially eager to rate) Implicit feedback: Is less scarce, but (Hu et al., 2008) There’s no negative feedback Noisy Preference v/s Confidence Lack of evaluation metrics 12
II.1 Introduction (2/2) Which variables better account for the amount of times a user listens to online albums?  Is it possible to map implicit behavior to explicit preference (ratings)? Study with Last.fm users:  Part I: demographics and online music consumption Part II: Rating 100 albums collected from their last.fm user profile 13
II.2 About last.fm 14
II.2.1 Survey Screenshots Requirements: 18 y.o.,  scrobblings > 5000 15
II.2.2 Survey Part I Pre-req: 18 years old & 5,000 min playcount (scrobblings) # Users: 151 users started, 127 completed, 114 after filtering outliers. 82% were male and 18% were female.  From 23 different countries, main were Spain (25 users), U.S. (15 users), and UK (16 users). 80% used 20 or more hours per week of internet. 50% of users listening to music for over 20 hours per week. 9% did not attend music concerts. 30% went to 11 or more concerts a year. 35% said that they only read music magazines or blogs sometimes, but 20% did it every week. 50% of our subjects admitted rating music online never or seldom. 45% of our subjects said they bought 1 to 10 physical records a year. However, a non-negligible 18% said they did not buy any. 35% of our subjects report never buy music online, 8% say they do it once a month or more. 14% preferred to listen to single tracks while over 45% preferred listening to full albums. The other 40% reported listening to music either way. 16
II.2.3 Survey Part II For item (album) sampling, we accounted for Implicit Feedback (IF): playcount for a user on a given item. Changed to scale [1-3], 3 means being more listened to. Global Popularity (GP): global playcount for all users on a given item [1-3]. Changed to scale [1-3], 3 means being more listened to.  Recentness (R) : time ellapsed since user played a given item. Changed to scale [1-3], 3 means being listened to more recently. 17
II.3 General Analysis Initial assumption: Rating and IF (# playcount) must be strongly correlated. 18
II.3.1 Distribution of ratings Average rating: ,[object Object],3.206316 ,[object Object],3.611144 19
II.3.2 Implicit Feedback  5 1 0 20
II.3.3 Recentness 5 0 1 21
II.3.4 Global  Popularity 5 0 1 22
Effect of Track or CD 5 0 1 23
II.3.5 General Analysis - Findings We “see” strong positive correlation between ratings and implicit feedback We “see” some level of  positive correlation between ratings and recentness We don’t expect a significant relations between ratings and global popularity. On demographic data: Just listening to track or album shows a significant effect (using ANOVA) 24
II.4 Regression Analysis Including Recentness increases R2 in more than 10% [ 1  -> 2] Including GP increases R2, not much compared to RE + IF [ 1 -> 3] Not Including GP, but including interaction between IF and RE improves the variance of the DV explained by the regression model. [ 2 -> 4 ] 25
II.4.1 Regression Analysis We tested conclusions of regression analysis by predicting the score, using RMSE and 10-fold cross validations Results of regression analysis are supported. 26
II.4.2 Regression Analysis Including track or CD Including this variable that seemed to have an effect in the general analysis, helped to improve accuracy of the model 27
II.5 Conclusions Using a linear model, Implicit feedback and recentness can help to predict explicit feedback (playcount) Global popularity doesn’t show a significant improvement in the prediction task (discussion) Our model can help to relate implicit and explicit feedback, helping to evaluate and compare explicit and implicit recommender systems. Ongoing Work? 28
THANKS for spending your time listening to this talk  Questions? dap89@pitt.edu 29
Survey part I results 30
Graphics comparing % of ratings given 2 variables 31

Más contenido relacionado

Similar a Currents steps to be a researcher and faculty

Reflection AssignmentThis week there will be no formal discu.docx
Reflection AssignmentThis week there will be no formal discu.docxReflection AssignmentThis week there will be no formal discu.docx
Reflection AssignmentThis week there will be no formal discu.docx
ringrid1
 
Due Date 1159 p.m. EST, Sunday of Unit 4 Points
 Due Date  1159 p.m. EST, Sunday of Unit 4 Points   Due Date  1159 p.m. EST, Sunday of Unit 4 Points
Due Date 1159 p.m. EST, Sunday of Unit 4 Points
MargaritoWhitt221
 
Siguse 2009 Symposium Program
Siguse 2009 Symposium ProgramSiguse 2009 Symposium Program
Siguse 2009 Symposium Program
siguse_history
 
Reu13 orientation
Reu13 orientationReu13 orientation
Reu13 orientation
gestrine
 
Module 3 reflections - literacies
Module 3 reflections - literaciesModule 3 reflections - literacies
Module 3 reflections - literacies
lorettamartin3
 
Studying and Using Social Media in Academic Research_Paton_Chris
Studying and Using Social Media in Academic Research_Paton_ChrisStudying and Using Social Media in Academic Research_Paton_Chris
Studying and Using Social Media in Academic Research_Paton_Chris
yan_stanford
 
StevenCurtis_Thesis_LivingLabsforSustainability
StevenCurtis_Thesis_LivingLabsforSustainabilityStevenCurtis_Thesis_LivingLabsforSustainability
StevenCurtis_Thesis_LivingLabsforSustainability
Steven Curtis
 
Using A Mixed Methods Approach
Using A Mixed Methods ApproachUsing A Mixed Methods Approach
Using A Mixed Methods Approach
Deb Birch
 

Similar a Currents steps to be a researcher and faculty (20)

Tolerance Essay In English For Students. Online assignment writing service.
Tolerance Essay In English For Students. Online assignment writing service.Tolerance Essay In English For Students. Online assignment writing service.
Tolerance Essay In English For Students. Online assignment writing service.
 
Reflection AssignmentThis week there will be no formal discu.docx
Reflection AssignmentThis week there will be no formal discu.docxReflection AssignmentThis week there will be no formal discu.docx
Reflection AssignmentThis week there will be no formal discu.docx
 
Spreading the Word! Librarians and OER (OER14, April 2014)
Spreading the Word! Librarians and OER (OER14, April 2014)  Spreading the Word! Librarians and OER (OER14, April 2014)
Spreading the Word! Librarians and OER (OER14, April 2014)
 
Spreading the Word! Librarians and OER (OER14, April 2014)
Spreading the Word! Librarians and OER (OER14, April 2014) Spreading the Word! Librarians and OER (OER14, April 2014)
Spreading the Word! Librarians and OER (OER14, April 2014)
 
Essay On Internet. Advantages and Disadvantages of Internet Essay
Essay On Internet. Advantages and Disadvantages of Internet EssayEssay On Internet. Advantages and Disadvantages of Internet Essay
Essay On Internet. Advantages and Disadvantages of Internet Essay
 
Why OER?
Why OER?Why OER?
Why OER?
 
Does iTunes Provide Everything You Need To Be Entertained. Anywhere. Anytime?
Does iTunes Provide Everything You Need To Be Entertained. Anywhere. Anytime?Does iTunes Provide Everything You Need To Be Entertained. Anywhere. Anytime?
Does iTunes Provide Everything You Need To Be Entertained. Anywhere. Anytime?
 
Due Date 1159 p.m. EST, Sunday of Unit 4 Points
 Due Date  1159 p.m. EST, Sunday of Unit 4 Points   Due Date  1159 p.m. EST, Sunday of Unit 4 Points
Due Date 1159 p.m. EST, Sunday of Unit 4 Points
 
Data matters-bournemouth-2015
Data matters-bournemouth-2015Data matters-bournemouth-2015
Data matters-bournemouth-2015
 
Siguse 2009 Symposium Program
Siguse 2009 Symposium ProgramSiguse 2009 Symposium Program
Siguse 2009 Symposium Program
 
Reu13 orientation
Reu13 orientationReu13 orientation
Reu13 orientation
 
Open debate setting-the-scene-v2_270315
Open debate setting-the-scene-v2_270315Open debate setting-the-scene-v2_270315
Open debate setting-the-scene-v2_270315
 
Module 3 reflections - literacies
Module 3 reflections - literaciesModule 3 reflections - literacies
Module 3 reflections - literacies
 
The Pragmatic Environmentalist
The Pragmatic EnvironmentalistThe Pragmatic Environmentalist
The Pragmatic Environmentalist
 
Studying and Using Social Media in Academic Research_Paton_Chris
Studying and Using Social Media in Academic Research_Paton_ChrisStudying and Using Social Media in Academic Research_Paton_Chris
Studying and Using Social Media in Academic Research_Paton_Chris
 
StevenCurtis_Thesis_LivingLabsforSustainability
StevenCurtis_Thesis_LivingLabsforSustainabilityStevenCurtis_Thesis_LivingLabsforSustainability
StevenCurtis_Thesis_LivingLabsforSustainability
 
Social network innovation in the internet’s global coffeehouses
Social network innovation in the internet’s global coffeehousesSocial network innovation in the internet’s global coffeehouses
Social network innovation in the internet’s global coffeehouses
 
Using Social Media in Canadian Academic Libraries: A 2010 CARL ABRC Libraries...
Using Social Media in Canadian Academic Libraries: A 2010 CARL ABRC Libraries...Using Social Media in Canadian Academic Libraries: A 2010 CARL ABRC Libraries...
Using Social Media in Canadian Academic Libraries: A 2010 CARL ABRC Libraries...
 
Using A Mixed Methods Approach
Using A Mixed Methods ApproachUsing A Mixed Methods Approach
Using A Mixed Methods Approach
 
Communities of interpretation2014
Communities of interpretation2014Communities of interpretation2014
Communities of interpretation2014
 

Más de Denis Parra Santander

Más de Denis Parra Santander (9)

The Effect of Explanations & Algorithmic Accuracy on Visual Recommender Syste...
The Effect of Explanations & Algorithmic Accuracy on Visual Recommender Syste...The Effect of Explanations & Algorithmic Accuracy on Visual Recommender Syste...
The Effect of Explanations & Algorithmic Accuracy on Visual Recommender Syste...
 
Do Better ImageNet Models Transfer Better... for Image Recommendation?
Do Better ImageNet Models Transfer Better... for Image Recommendation?Do Better ImageNet Models Transfer Better... for Image Recommendation?
Do Better ImageNet Models Transfer Better... for Image Recommendation?
 
Human-Centered Machine Learning: Harnessing Visualization and Interactivity f...
Human-Centered Machine Learning: Harnessing Visualization and Interactivity f...Human-Centered Machine Learning: Harnessing Visualization and Interactivity f...
Human-Centered Machine Learning: Harnessing Visualization and Interactivity f...
 
Social Aspects of Interactive Recommender Systems
Social Aspects of Interactive Recommender SystemsSocial Aspects of Interactive Recommender Systems
Social Aspects of Interactive Recommender Systems
 
Interactive Recommender Systems
Interactive Recommender SystemsInteractive Recommender Systems
Interactive Recommender Systems
 
Data Fusion for Dealing with the Recommendation Problem
Data Fusion for Dealing with the Recommendation ProblemData Fusion for Dealing with the Recommendation Problem
Data Fusion for Dealing with the Recommendation Problem
 
Research on Recommender Systems: Beyond Ratings and Lists
Research on Recommender Systems: Beyond Ratings and ListsResearch on Recommender Systems: Beyond Ratings and Lists
Research on Recommender Systems: Beyond Ratings and Lists
 
The Effect of Different Set-based Visualizations on User Exploration of Reco...
The Effect of Different Set-based  Visualizations on User Exploration of Reco...The Effect of Different Set-based  Visualizations on User Exploration of Reco...
The Effect of Different Set-based Visualizations on User Exploration of Reco...
 
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
 

Último

Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
MateoGardella
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
MateoGardella
 

Último (20)

Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Gardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch LetterGardella_PRCampaignConclusion Pitch Letter
Gardella_PRCampaignConclusion Pitch Letter
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.Gardella_Mateo_IntellectualProperty.pdf.
Gardella_Mateo_IntellectualProperty.pdf.
 

Currents steps to be a researcher and faculty

  • 1. Denis studying/working to be a faculty/researcher (Denis Parra || Denis Parra-Santander) PhD Student http://www.sis.pitt.edu/~dparra/ 1 March 18th 2011 PAWS Lab – School of information Sciences – University of Pittsburgh
  • 2. What is this presentation about? A short introduction of myself A description of my research interests and what I have been doing about it in the latest years 2
  • 3. I.1 Where are you from? I am from Chile, a country that looks like a chile pepper, but, paradoxically, people don’t eat much spicy food. Chile ≠ [red hot chile pepper] && Chile ≠ México 3
  • 4. I.2 Are youfrom Santiago, the capital? Good try. One third of the 16 million Chileans lives in Santiago. But Chile is a looong country, in the north is hot and dry, in the south is very cold. I live in Valdivia, a city with rainy weather. Very Hot! Here I Live! Valdivia Very Cold! 4
  • 5. I.3 Which activities do you like to do? I like playing tennis, running & rowing I like writing poetry. Check some poems herein Spanish (translated to English) I like reading novels, my favorite authors are J. L. Borges, Fyodor Dostoyevsky & James Joyce (right now I’m reading a Roberto Bolaño’s novel) I like listening to music, from Blues to Lady Gaga, passing by Pink Floyd, Radiohead and Los Jaivas. I like watching movies like “A Clockwork Orange” by S. Kubrick and “Underground” by E. Kusturica. I also like surrealistic movies like “The Holy Mountain” by Alejandro Jodorowsky. 5
  • 6. I.4 OK, but now let’s talk about work… (1997 - 2002) I have BS in Engineering with emphasis in Informatics from Universidad Austral de Chile. This is a 6 year program, my undergrad thesis was titled “SPORAS: An Adaptive Web Platform based on a Multiagent System and Ontologies” (my first link to Dr. Brusilovsky’s field, Adaptive Hypermedia) Then, I worked in several projects of e-learning, developing on Open Source LMS such as Dokeos and Moodle (2003-2004) later on, I worked as IT Manager and consultant for an aquaculture company, Aqua Cards, in the South of Chile (2005-2007) I was also teaching OOP (Java), Matlab, and Introduction to Software Engineering (2004, 2006-2007) In 2007 I co-founded a company, Perceptum TI. 6
  • 7. I.5 … and what about research? In 2008 I started the PhD program and I joined the PAWS lab (lead by Dr. Brusilovsky) … so here is where this presentation starts Tag-based recommendations Spreading Activation for recommender systems Related projects CourseAgent TagTheMap Conference Navigator Latent Communities This Presentation “Walk the Talk”: Mapping explicit 7
  • 8. I.6.1 Tag-based recommendations Main topic: Lack of ratings in most items of many systems pushes to look for alternatives to apply user and item-based Collaborative Filtering. We explore 2 variants: neighbor-weighted and tag-based BM25. Presented a workshop paper in HT’09, p1 Presented a short-paper (poster) at Recsys ’09, p2 Presented a short-paper at WI 09, p3 8
  • 9. I.6.2 Spreading activation Presented a paper in a Workshop of Recsys 2009, p4 Look for a way to apply Spreading activation for recommendations in order to: Make use of the multidimensional network structure of Folksonomies (users, items, tags) Find an scalable algorithm (compared to state-of-the art FolkRank, SVD and LDA-based) that makes use of local topology/neighborhood 9
  • 11. Part II: so finally… This project is based on the work of my internship at Telefónica Research (Barcelona, Spain) in the Summer of 2010 Paper submitted to UMAP 2011: Walk the Talk Analyzing the relation between implicit and explicit feedback for preference elicitation (I am co-author with Dr. Xavier Amatriain) 11
  • 12. II.1 Introduction (1/2) Explicit feedback: scarcity (people are not especially eager to rate) Implicit feedback: Is less scarce, but (Hu et al., 2008) There’s no negative feedback Noisy Preference v/s Confidence Lack of evaluation metrics 12
  • 13. II.1 Introduction (2/2) Which variables better account for the amount of times a user listens to online albums? Is it possible to map implicit behavior to explicit preference (ratings)? Study with Last.fm users: Part I: demographics and online music consumption Part II: Rating 100 albums collected from their last.fm user profile 13
  • 15. II.2.1 Survey Screenshots Requirements: 18 y.o., scrobblings > 5000 15
  • 16. II.2.2 Survey Part I Pre-req: 18 years old & 5,000 min playcount (scrobblings) # Users: 151 users started, 127 completed, 114 after filtering outliers. 82% were male and 18% were female. From 23 different countries, main were Spain (25 users), U.S. (15 users), and UK (16 users). 80% used 20 or more hours per week of internet. 50% of users listening to music for over 20 hours per week. 9% did not attend music concerts. 30% went to 11 or more concerts a year. 35% said that they only read music magazines or blogs sometimes, but 20% did it every week. 50% of our subjects admitted rating music online never or seldom. 45% of our subjects said they bought 1 to 10 physical records a year. However, a non-negligible 18% said they did not buy any. 35% of our subjects report never buy music online, 8% say they do it once a month or more. 14% preferred to listen to single tracks while over 45% preferred listening to full albums. The other 40% reported listening to music either way. 16
  • 17. II.2.3 Survey Part II For item (album) sampling, we accounted for Implicit Feedback (IF): playcount for a user on a given item. Changed to scale [1-3], 3 means being more listened to. Global Popularity (GP): global playcount for all users on a given item [1-3]. Changed to scale [1-3], 3 means being more listened to. Recentness (R) : time ellapsed since user played a given item. Changed to scale [1-3], 3 means being listened to more recently. 17
  • 18. II.3 General Analysis Initial assumption: Rating and IF (# playcount) must be strongly correlated. 18
  • 19.
  • 22. II.3.4 Global Popularity 5 0 1 22
  • 23. Effect of Track or CD 5 0 1 23
  • 24. II.3.5 General Analysis - Findings We “see” strong positive correlation between ratings and implicit feedback We “see” some level of positive correlation between ratings and recentness We don’t expect a significant relations between ratings and global popularity. On demographic data: Just listening to track or album shows a significant effect (using ANOVA) 24
  • 25. II.4 Regression Analysis Including Recentness increases R2 in more than 10% [ 1 -> 2] Including GP increases R2, not much compared to RE + IF [ 1 -> 3] Not Including GP, but including interaction between IF and RE improves the variance of the DV explained by the regression model. [ 2 -> 4 ] 25
  • 26. II.4.1 Regression Analysis We tested conclusions of regression analysis by predicting the score, using RMSE and 10-fold cross validations Results of regression analysis are supported. 26
  • 27. II.4.2 Regression Analysis Including track or CD Including this variable that seemed to have an effect in the general analysis, helped to improve accuracy of the model 27
  • 28. II.5 Conclusions Using a linear model, Implicit feedback and recentness can help to predict explicit feedback (playcount) Global popularity doesn’t show a significant improvement in the prediction task (discussion) Our model can help to relate implicit and explicit feedback, helping to evaluate and compare explicit and implicit recommender systems. Ongoing Work? 28
  • 29. THANKS for spending your time listening to this talk  Questions? dap89@pitt.edu 29
  • 30. Survey part I results 30
  • 31. Graphics comparing % of ratings given 2 variables 31