SlideShare una empresa de Scribd logo
1 de 11
Descargar para leer sin conexión
Tag Recommendation in Social
Bookmarking sites like Deli.cio.us
Varun Ahuja (201206628)
Vinay Singri (201305592)
Tanuj Sharma ( 201101138 )
Introduction
Automated process of suggesting
relevant keywords given a
dataset
Given link L, description D, and
user U, a set of personalized tags
CT(L) are suggested with help
from given dataset.
First Approach – STaR ( Social Tag
Recommender System )
Divided in 3 major steps – Pre-processing,
Indexing and Recommendation
Pre-processing – Remove useless tags, Case
Folding, Spam Removal
Indexing – Index existing tags against users.
Recommendation – Combine outputs of Title to
Tag, Resource Profile, User Profile
Recommender.
Problems in First Approach
Not all tags from the dataset
appeared.
Low Precision and Low Recall
Without crawling the given link,
this approach gives low accuracy
Final Approach – Supervised Learning Model
Modelled as a ranking problem of
candidate tags of a given URL
Consists of 3 stages –
◦Candidates Tag Extraction
◦SVM Features Construction
◦Ranking Process
Ranking SVM is used for ranking candidate
tags.
Candidates Tag Extraction
Extracted from –
◦Description field of link L
◦Tags assigned by the same user U
previously
◦Tags to assigned to the same link L by other
users
Given link L, user U, candidate tags
CT{L} = { description(L) union Tags(U) union
Tags(L) }
SVM Features Construction
5 features used for each Candidate Tag ( CT ) –
Candidate Tag's Term Frequency (TF) in link's description
terms
Candidate Tag's Term Frequency (TF) in link's URL terms
Candidate Tag’s Term Frequency (TF) in T{Rj} (tags
assigned to the same URL in the training data).
Candidate Tag’s Term Frequency (TF) in T{Ui} (tags
assigned previously by user in the training data.)
Times of candidate tag being assigned as a tag in the
training data.
Ranking
For any link in test dataset, Candidate
Tags are extracted
Features stored for each candidate tag.
SVM ranking model ranks the candidate
tags from top to bottom
Top K tags selected
Tools Used
Future Work
Extension to various datasets
Giving more enriched
recommendation for the seed URL
Candidate Tags can be expanded
using content similarity based KNN
model.
References
 STaR: a Social Tag Recommender System Cataldo
Musto, Fedelucio Narducci, Marco de Gemmis,
Pasquale Lops, and Giovanni Semeraro
Department of Computer Science, University of
Bari, Italy
• Social Tag Prediction Base on
Supervised Ranking Model
Hao Cao, Maoqiang Xie, Lian Xue, Chunhua Liu, Fei
Teng and Yalou Huang
College of Software, Nankai University, Tianjin,
P.R.China

Más contenido relacionado

La actualidad más candente

When relevance is not enough
When relevance is not enoughWhen relevance is not enough
When relevance is not enough
moresmile
 
Hybrid recommender systems
Hybrid recommender systemsHybrid recommender systems
Hybrid recommender systems
renataghisloti
 

La actualidad más candente (20)

Interaction Design Patterns in Recommender Systems
Interaction Design Patterns in Recommender SystemsInteraction Design Patterns in Recommender Systems
Interaction Design Patterns in Recommender Systems
 
Performance analysis of the
Performance analysis of thePerformance analysis of the
Performance analysis of the
 
13 sdm-blda-slides
13 sdm-blda-slides13 sdm-blda-slides
13 sdm-blda-slides
 
Phd thesis final presentation
Phd thesis   final presentationPhd thesis   final presentation
Phd thesis final presentation
 
When relevance is not enough
When relevance is not enoughWhen relevance is not enough
When relevance is not enough
 
Recommenders, Topics, and Text
Recommenders, Topics, and TextRecommenders, Topics, and Text
Recommenders, Topics, and Text
 
Hybrid recommender systems
Hybrid recommender systemsHybrid recommender systems
Hybrid recommender systems
 
Tutorial on Relationship Mining In Online Social Networks
Tutorial on Relationship Mining In Online Social NetworksTutorial on Relationship Mining In Online Social Networks
Tutorial on Relationship Mining In Online Social Networks
 
Task oriented word embedding for text classification
Task oriented word embedding for text classificationTask oriented word embedding for text classification
Task oriented word embedding for text classification
 
Preliminary Findings: A Comparative Study of User- and Cataloger-Assigned Sub...
Preliminary Findings: A Comparative Study of User- and Cataloger-Assigned Sub...Preliminary Findings: A Comparative Study of User- and Cataloger-Assigned Sub...
Preliminary Findings: A Comparative Study of User- and Cataloger-Assigned Sub...
 
Email Classification
Email ClassificationEmail Classification
Email Classification
 
Models for Information Retrieval and Recommendation
Models for Information Retrieval and RecommendationModels for Information Retrieval and Recommendation
Models for Information Retrieval and Recommendation
 
Online Learning to Rank
Online Learning to RankOnline Learning to Rank
Online Learning to Rank
 
Mobile Computing
Mobile ComputingMobile Computing
Mobile Computing
 
Collaborative Filtering
Collaborative FilteringCollaborative Filtering
Collaborative Filtering
 
Query expansion
Query expansionQuery expansion
Query expansion
 
Recommender systems using collaborative filtering
Recommender systems using collaborative filteringRecommender systems using collaborative filtering
Recommender systems using collaborative filtering
 
Rate-it! A general purpose Polling Platform (based on graph dbms Neo4j) [ENG ...
Rate-it! A general purpose Polling Platform (based on graph dbms Neo4j) [ENG ...Rate-it! A general purpose Polling Platform (based on graph dbms Neo4j) [ENG ...
Rate-it! A general purpose Polling Platform (based on graph dbms Neo4j) [ENG ...
 
Alexandr Lyalyuk - PHP Machine Learning and user-oriented content
Alexandr Lyalyuk - PHP Machine Learning and user-oriented contentAlexandr Lyalyuk - PHP Machine Learning and user-oriented content
Alexandr Lyalyuk - PHP Machine Learning and user-oriented content
 
presentation
presentationpresentation
presentation
 

Destacado

Oeof Community Presentation
Oeof Community PresentationOeof Community Presentation
Oeof Community Presentation
guest2755a3
 
GPGPU algorithms in games
GPGPU algorithms in gamesGPGPU algorithms in games
GPGPU algorithms in games
zlatan4177
 

Destacado (14)

La Formacio D Una Capital La Ciutat Medieval Dossier Gotic 20 Secundaria
La Formacio D Una Capital La Ciutat Medieval Dossier Gotic 20 SecundariaLa Formacio D Una Capital La Ciutat Medieval Dossier Gotic 20 Secundaria
La Formacio D Una Capital La Ciutat Medieval Dossier Gotic 20 Secundaria
 
El Park Guell De Ciutat Jardi A Jardi De La Ciutat Dossier Park 17 Primaria
El Park Guell De Ciutat Jardi A Jardi De La Ciutat Dossier Park 17 PrimariaEl Park Guell De Ciutat Jardi A Jardi De La Ciutat Dossier Park 17 Primaria
El Park Guell De Ciutat Jardi A Jardi De La Ciutat Dossier Park 17 Primaria
 
Viatge A Barcino Dossier Romans P 2ed
Viatge A Barcino Dossier Romans P 2edViatge A Barcino Dossier Romans P 2ed
Viatge A Barcino Dossier Romans P 2ed
 
Sant Cugat
Sant CugatSant Cugat
Sant Cugat
 
La Formacio D Una Capital La Ciutat Medieval Dossier Gotic 17 Primaria
La Formacio D Una Capital La Ciutat Medieval Dossier Gotic 17 PrimariaLa Formacio D Una Capital La Ciutat Medieval Dossier Gotic 17 Primaria
La Formacio D Una Capital La Ciutat Medieval Dossier Gotic 17 Primaria
 
Report v1
Report v1Report v1
Report v1
 
La Formacio D Una Capital La Ciutat Medieval Dossier Gotic 17 Primaria
La Formacio D Una Capital La Ciutat Medieval Dossier Gotic 17 PrimariaLa Formacio D Una Capital La Ciutat Medieval Dossier Gotic 17 Primaria
La Formacio D Una Capital La Ciutat Medieval Dossier Gotic 17 Primaria
 
Viatge A Barcino Dossier Romans P 2ed
Viatge A Barcino Dossier Romans P 2edViatge A Barcino Dossier Romans P 2ed
Viatge A Barcino Dossier Romans P 2ed
 
Programa Educatiu Muhba Del Curs 2009 2010 Muhba Escoles 09 10
Programa Educatiu Muhba Del Curs 2009 2010 Muhba Escoles 09 10Programa Educatiu Muhba Del Curs 2009 2010 Muhba Escoles 09 10
Programa Educatiu Muhba Del Curs 2009 2010 Muhba Escoles 09 10
 
Programa Educatiu Muhba Del Curs 2009 2010 Muhba Escoles 09 10
Programa Educatiu Muhba Del Curs 2009 2010 Muhba Escoles 09 10Programa Educatiu Muhba Del Curs 2009 2010 Muhba Escoles 09 10
Programa Educatiu Muhba Del Curs 2009 2010 Muhba Escoles 09 10
 
Oeof Community Presentation
Oeof Community PresentationOeof Community Presentation
Oeof Community Presentation
 
osasuna
osasuna osasuna
osasuna
 
Historiaurrea
HistoriaurreaHistoriaurrea
Historiaurrea
 
GPGPU algorithms in games
GPGPU algorithms in gamesGPGPU algorithms in games
GPGPU algorithms in games
 

Similar a Tag recommendation in social bookmarking sites like deli

Data Science Task.pdf by the topper world
Data Science Task.pdf by the topper worldData Science Task.pdf by the topper world
Data Science Task.pdf by the topper world
TanishaChouhan4
 

Similar a Tag recommendation in social bookmarking sites like deli (20)

A Survey on Decision Support Systems in Social Media
A Survey on Decision Support Systems in Social MediaA Survey on Decision Support Systems in Social Media
A Survey on Decision Support Systems in Social Media
 
A Survey on Decision Support Systems in Social Media
A Survey on Decision Support Systems in Social MediaA Survey on Decision Support Systems in Social Media
A Survey on Decision Support Systems in Social Media
 
A Survey on Decision Support Systems in Social Media
A Survey on Decision Support Systems in Social MediaA Survey on Decision Support Systems in Social Media
A Survey on Decision Support Systems in Social Media
 
Slideshow ire
Slideshow ireSlideshow ire
Slideshow ire
 
Slideshow ire
Slideshow ireSlideshow ire
Slideshow ire
 
IRJET- Analysis of Question and Answering Recommendation System
IRJET-  	  Analysis of Question and Answering Recommendation SystemIRJET-  	  Analysis of Question and Answering Recommendation System
IRJET- Analysis of Question and Answering Recommendation System
 
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
Evaluating and Enhancing Efficiency of Recommendation System using Big Data A...
 
IRJET- Classifying Twitter Data in Multiple Classes based on Sentiment Class ...
IRJET- Classifying Twitter Data in Multiple Classes based on Sentiment Class ...IRJET- Classifying Twitter Data in Multiple Classes based on Sentiment Class ...
IRJET- Classifying Twitter Data in Multiple Classes based on Sentiment Class ...
 
Sentiment Analysis on Product Reviews Using Supervised Learning Techniques
Sentiment Analysis on Product Reviews Using Supervised Learning TechniquesSentiment Analysis on Product Reviews Using Supervised Learning Techniques
Sentiment Analysis on Product Reviews Using Supervised Learning Techniques
 
Project prSentiment Analysis of Twitter Data Using Machine Learning Approach...
Project prSentiment Analysis  of Twitter Data Using Machine Learning Approach...Project prSentiment Analysis  of Twitter Data Using Machine Learning Approach...
Project prSentiment Analysis of Twitter Data Using Machine Learning Approach...
 
Movie Recommender System Using Artificial Intelligence
Movie Recommender System Using Artificial Intelligence Movie Recommender System Using Artificial Intelligence
Movie Recommender System Using Artificial Intelligence
 
Profile Analysis of Users in Data Analytics Domain
Profile Analysis of   Users in Data Analytics DomainProfile Analysis of   Users in Data Analytics Domain
Profile Analysis of Users in Data Analytics Domain
 
Network Based Intrusion Detection System using Filter Based Feature Selection...
Network Based Intrusion Detection System using Filter Based Feature Selection...Network Based Intrusion Detection System using Filter Based Feature Selection...
Network Based Intrusion Detection System using Filter Based Feature Selection...
 
Data Science Task.pdf by the topper world
Data Science Task.pdf by the topper worldData Science Task.pdf by the topper world
Data Science Task.pdf by the topper world
 
Study on Relavance Feature Selection Methods
Study on Relavance Feature Selection MethodsStudy on Relavance Feature Selection Methods
Study on Relavance Feature Selection Methods
 
IRJET- Detection of Ranking Fraud in Mobile Applications
IRJET-  	  Detection of Ranking Fraud in Mobile ApplicationsIRJET-  	  Detection of Ranking Fraud in Mobile Applications
IRJET- Detection of Ranking Fraud in Mobile Applications
 
Recsys2016 Tutorial by Xavier and Deepak
Recsys2016 Tutorial by Xavier and DeepakRecsys2016 Tutorial by Xavier and Deepak
Recsys2016 Tutorial by Xavier and Deepak
 
Twitter Sentiment Analysis: An Unsupervised Approach
Twitter Sentiment Analysis: An Unsupervised ApproachTwitter Sentiment Analysis: An Unsupervised Approach
Twitter Sentiment Analysis: An Unsupervised Approach
 
IRJET- Sentimental Analysis for Online Reviews using Machine Learning Algorithms
IRJET- Sentimental Analysis for Online Reviews using Machine Learning AlgorithmsIRJET- Sentimental Analysis for Online Reviews using Machine Learning Algorithms
IRJET- Sentimental Analysis for Online Reviews using Machine Learning Algorithms
 
Automated Question Paper Generator And Answer Checker Using Information Retri...
Automated Question Paper Generator And Answer Checker Using Information Retri...Automated Question Paper Generator And Answer Checker Using Information Retri...
Automated Question Paper Generator And Answer Checker Using Information Retri...
 

Último

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 

Tag recommendation in social bookmarking sites like deli

  • 1. Tag Recommendation in Social Bookmarking sites like Deli.cio.us Varun Ahuja (201206628) Vinay Singri (201305592) Tanuj Sharma ( 201101138 )
  • 2. Introduction Automated process of suggesting relevant keywords given a dataset Given link L, description D, and user U, a set of personalized tags CT(L) are suggested with help from given dataset.
  • 3. First Approach – STaR ( Social Tag Recommender System ) Divided in 3 major steps – Pre-processing, Indexing and Recommendation Pre-processing – Remove useless tags, Case Folding, Spam Removal Indexing – Index existing tags against users. Recommendation – Combine outputs of Title to Tag, Resource Profile, User Profile Recommender.
  • 4. Problems in First Approach Not all tags from the dataset appeared. Low Precision and Low Recall Without crawling the given link, this approach gives low accuracy
  • 5. Final Approach – Supervised Learning Model Modelled as a ranking problem of candidate tags of a given URL Consists of 3 stages – ◦Candidates Tag Extraction ◦SVM Features Construction ◦Ranking Process Ranking SVM is used for ranking candidate tags.
  • 6. Candidates Tag Extraction Extracted from – ◦Description field of link L ◦Tags assigned by the same user U previously ◦Tags to assigned to the same link L by other users Given link L, user U, candidate tags CT{L} = { description(L) union Tags(U) union Tags(L) }
  • 7. SVM Features Construction 5 features used for each Candidate Tag ( CT ) – Candidate Tag's Term Frequency (TF) in link's description terms Candidate Tag's Term Frequency (TF) in link's URL terms Candidate Tag’s Term Frequency (TF) in T{Rj} (tags assigned to the same URL in the training data). Candidate Tag’s Term Frequency (TF) in T{Ui} (tags assigned previously by user in the training data.) Times of candidate tag being assigned as a tag in the training data.
  • 8. Ranking For any link in test dataset, Candidate Tags are extracted Features stored for each candidate tag. SVM ranking model ranks the candidate tags from top to bottom Top K tags selected
  • 10. Future Work Extension to various datasets Giving more enriched recommendation for the seed URL Candidate Tags can be expanded using content similarity based KNN model.
  • 11. References  STaR: a Social Tag Recommender System Cataldo Musto, Fedelucio Narducci, Marco de Gemmis, Pasquale Lops, and Giovanni Semeraro Department of Computer Science, University of Bari, Italy • Social Tag Prediction Base on Supervised Ranking Model Hao Cao, Maoqiang Xie, Lian Xue, Chunhua Liu, Fei Teng and Yalou Huang College of Software, Nankai University, Tianjin, P.R.China