SlideShare una empresa de Scribd logo
1 de 33
WebCamp 07 Social Networks
Topics Tags and Trends in the
Blogosphere
Conor Hayes
DERI, Galway
WebCamp 07
Social Networks
Outline
 Blogs and the Blogosphere
 Linking Blogs - Content vs. Tags
 Topics and Bloggers
 User entropy
 Topic drift
 Blog reactivity
 Identifying consistent, topic–relevant blogs
WebCamp 07
Social Networks
Blogs
 Web site with journal style entries
 Dated in reverse chronological order
 Generally written by a single user
 Regularly updated
 Distributed publishing
 Easy to maintain and update
 Exponential growth:
 Technorati: 50 millions blogs (July 2006), doubling in size
every 6 months
 Increasingly important indicator of public opinion on
politics, technology, current affairs
WebCamp 07
Social Networks
Blogs vs. Usenet
 Blogosphere is user-centred
 Distributed architecture
 Topic organisation is locally defined by tags
 Not easy to find relevant posts related to the same topic
 Usenet is topic-centred
 Logically centralised architecture
 Topic organisation is a priori defined by newsgroup
heading, and subject headers
 Users know where to go to find information on a particular
topic
WebCamp 07
Social Networks
A topic-centred blogosphere?
 Semantic Web
 Link blogs using machine readable metadata : SIOC
 Tagging
 Tags: simple propositional entities, locally defined
 Link analysis:
 Majority of blogs have little or no inward connections
 Blog Roll, Comment List
 Relatively static group
 ‘Conventional’ Knowledge discovery techniques
 Clustering + online recommender systems
WebCamp 07
Social Networks
Nearest Neighbour Recommender
 Method :
1. Periodically, identify a set of nearest neighbour
blogs – one set for each topic the user is
interested in.
2. Select matching posts from these neighbours
 What are the implications of User drift ?
 How quickly does the neighbourhood set change
 What is the relationship between Users and
Topics?
 How consistently are bloggers attached to topics?
 Which neighbours consistently provide the most
topic- relevant information?
WebCamp 07
Social Networks
Experiments
 We cluster blog data over different time periods
 user entropy: measures whether bloggers remain
together over time
 topic drift: measure blogger behaviour in relation to
Topic growth and drift:
 We identify the most relevant blogs in each cluster
using tag analysis
WebCamp 07
Social Networks
Data
 We collected blog data from Jan16 to Feb 27, 2006
 7200 blogs in total
 We created 6 data sets, one for each week
 mean of 4250 blogs per week
 70% overlap between consecutive weeks
 Each instance in each data set contains the posts
from a single tag, from a single blogger
 An instance is only included in the data set for a
week only if the user has posted in that week
WebCamp 07
Social Networks
Clustering
 Goals: Uncover latent structures reflecting topics in
the collection and provide a means of summarisation
 Spherical k-means : partitions document corpus into
k disjoint groups of documents
 Produces interpretable concept summary for each
group
 Clustering quality:
 Blogs in the same cluster should
be similar;
 Blogs in different clusters should
be dissimilar.
 Hr: Ratio of intra- to inter-
cluster similarity
WebCamp 07
Social Networks
Clustering….
WebCamp 07
Social Networks
Partitions the document space
WebCamp 07
Social Networks
At window win t+n ?
WebCamp 07
Social Networks
Methodology
 We cluster each data set in date order at different k
 We reuse the cluster centroids in window t to seed the
clusters in window t+1
WebCamp 07
Social Networks
User Drift
 We define User Entropy: a measure of the degree
of user dispersion between windows
wint+n
q : number of clusters at wint+n
containing users from cluster r
nr
i
: number of users from cluster r
contained in cluster i at wint+n
nr: number of users from cluster r
available at wint+n
wint
WebCamp 07
Social Networks
Ur as the Interval Increases
WebCamp 07
Social Networks
Proportion of Users
= mean fraction of dataset contained in top 20% of clusters
= mean fraction of dataset contained in bottom 20% of clusters
WebCamp 07
Social Networks
User Drift vs. Cluster Strength
(mean) correlation: Hr vs. Ur at k
-0.9
-0.8
-0.7
-0.6
-0.5
-0.4
-0.3
-0.2
-0.1
0
0 10 20 30 40 50 60 70 80 90 100
k
pearsonR.
WebCamp 07
Social Networks
User Drift Conclusions
 Even where n is low, user dispersion occurs
 As n increases user entropy also increases,
suggesting that the ‘relationship’ between users
based on shared topics is short lived
 User dispersion is related to cluster strength
 Strong clusters experience less user drift than weak
clusters
 However, the fraction of data from strong clusters is
smaller than the fraction from weak clusters, by at
least a factor of 2
 We will return to user entropy later in the talk
WebCamp 07
Social Networks
Topic drift
 inter window similarity Wr
t+1
 Wr
t+1
for a cluster r at wint is the similarity between the
centroid of cluster r and the centroid of the
corresponding cluster r at wint+n
Wr
t+n
= cos(Cr,t, Cr,t+n)
 Intuitively, Wr
t+n
is a measure of the drift of the
concept centroid, Cr,from wint to wint+n
WebCamp 07
Social Networks
Topic Drift vs. User Drift
(mean) correlation: Wr vs. Ur at k
-0.9
-0.8
-0.7
-0.6
-0.5
-0.4
-0.3
-0.2
-0.1
0
0 10 20 30 40 50 60 70 80 90 100
k
pearsonR.
WebCamp 07
Social Networks
A Model of Topic Drift
 Topic drift is related to user
drift, but topics may be
more stable than users
 By observation: rate of
topic change is less than
rate of user drift
 Our analysis so far would
suggest that users are not
firmly fixed to topics, rather
they drift between topics
over time.
Type a = {X,Y,Z}
Type b= {P,Q,R}
WebCamp 07
Social Networks
Muhammad Cartoon Controversy
time
Jan 16
Jan 23
Jan 30
Feb 06
Feb 13
Feb 20
WebCamp 07
Social Networks
Observations
 The blogosphere responds quickly to breaking news
stories
 The relationship between topic and user drift is
pronounced where topic drift is extreme
 Otherwise, there is steady turnover of users around
relatively stable concepts
 Users ‘float’ between topics
WebCamp 07
Social Networks
Identifying the most relevant blogs
 Technorati uses a tag cloud to link posts by different
blogs together
WebCamp 07
Social Networks
Tag clouds
 In previous work we showed that tags perform badly
at grouping similar blog posts together
WebCamp 07
Social Networks
Tag clouds
 Clustering followed by tag analysis allowed us to
determine clusters contain strong concepts
 It also allowed us to fragment the global tag space to
produce local tag clouds
WebCamp 07
Social Networks
A-bloggers
 A-bloggers: only a portion (<0.4) of blogs in each
cluster contributes tags to the local tag cloud
description
WebCamp 07
Social Networks
A-bloggers are
1. more similar to each other than c- bloggers
2. more similar to the cluster centroid (topic definition)
than c-bloggers
3. more similar to pages retrieved from Google using
the topic description
WebCamp 07
Social Networks
Entropy: a-blogs vs c-blogs
 A-blog entropy is lower
 As interval increases a-blogs experience smaller
increases in entropy
 Suggests that a-bloggers tend to write consistently
about the same things over time
WebCamp 07
Social Networks
A-blogs Example
 Cluster 28 in Win5; k =50
 Cluster description: mobile, internet, weblog, web, patent
A-blogs
1) “Comunications: technology, economic and social issues at the intersection of
telecom, mobility and the Internet”
2) “IP Blawg”: technology and Intellectual property blog
3) “Small business IP management blog: Patent, Trademark, Copyright, Internet,
and Technology Law”
4) “Open Gardens: Wireless mobility, Digital convergence - Mobile web 2.0”
5) “Mobile Enterprise Weblog: the voice of enterprise mobility management”
C-blogs
1) “Digital Music Den: Digital Music, online music marketing”
2) "icarusindie.com – blog about nothing”: general computing and technology
3) “Dunkie's Saga” - personal blog: personal, cars, games, quizzes, some
technology
4) “Complex Christ – a vision for church that is organic, networked, decentralized,
bottom-up, emergent, communal, flexible, always evolving”
5) “Philips Brooks patent infringement updates”: legal blog on general patent
issues (pharmaceutical as well as technological)
WebCamp 07
Social Networks
Conclusions
 We have accumulated empirical evidence to suggest
that a-bloggers are topic authorities
 Tend to form tight subgroups close to cluster topic
definition
 Consistently more similar to pages ranked by Google using
the cluster topic definition
 Tend to stay together at differerent clusterings over time. In
other words they tend to write regularly about the same
topic
 What characteristics does the a-blogger have?
 A blogger that is aware of a wider potential readership and
chooses his/her tags so that they can be understood easily
by others
 Writes regularly in depth about fairly narrowly defined
subjects
 New professional bloggers
WebCamp 07
Social Networks
Future Work
 Produce tag hierarchies using hierarchical clustering
 Combine this with the work of Hak Lae Kim and Dr.
Suk-Hyung Hwang:
 Formal concept model of a blog social network using
tags and content analysis
 Tag recommender: Enrich SIOC topic descriptions
with tag cloud meta data
 Develop a set of style features to classify blogs
WebCamp 07
Social Networks
References
 Hayes, C., Avesani, P (2007) Using Tags and Clustering to
Identify Topic Relevant Blogs in Proceedings of the
International Conference on Weblogs and Social Media
(ICWSM 07)
 Hayes, C., Avesani, P., Veeramachaneni, S. (2007) An
Analysis of the Use of Tags in a Blog Recommender
System. In proceedings of IJCAI-07, the International Joint
Conference on Artificial Intelligence
 Hayes, C., Avesani, P., Veeramachaneni, S. (2006) An
Analysis of Bloggers and Topics for a Blog Recommender
System in proceedings of the Workshop on Web Mining
(Webmine 06) , 7th European Conference on Machine
Learning and the 10th European Conference on Principles

Más contenido relacionado

La actualidad más candente

SRS presentation
SRS presentationSRS presentation
SRS presentationslavaxx
 
Implementation of Privacy Policy Specification System for User Uploaded Image...
Implementation of Privacy Policy Specification System for User Uploaded Image...Implementation of Privacy Policy Specification System for User Uploaded Image...
Implementation of Privacy Policy Specification System for User Uploaded Image...rahulmonikasharma
 
Creating A Culture Of Collaboration In Online Communities
Creating A Culture Of Collaboration In Online CommunitiesCreating A Culture Of Collaboration In Online Communities
Creating A Culture Of Collaboration In Online CommunitieseKindling.org
 
Social Web Course @VU Amsterdam: Final Student Presentations
Social Web Course @VU Amsterdam: Final Student PresentationsSocial Web Course @VU Amsterdam: Final Student Presentations
Social Web Course @VU Amsterdam: Final Student PresentationsLora Aroyo
 
Social Bookmarking
Social BookmarkingSocial Bookmarking
Social BookmarkingChris
 
Social Software for Empowerment
Social Software for EmpowermentSocial Software for Empowerment
Social Software for EmpowermenteKindling.org
 
Lecture 4: How do we MINE, ANALYSE & VISUALISE the Social Web? (VU Amsterdam ...
Lecture 4: How do we MINE, ANALYSE & VISUALISE the Social Web? (VU Amsterdam ...Lecture 4: How do we MINE, ANALYSE & VISUALISE the Social Web? (VU Amsterdam ...
Lecture 4: How do we MINE, ANALYSE & VISUALISE the Social Web? (VU Amsterdam ...Lora Aroyo
 
Reference Services & Social Networking - Being on the cutting edge of engagment
Reference Services & Social Networking - Being on the cutting edge of engagmentReference Services & Social Networking - Being on the cutting edge of engagment
Reference Services & Social Networking - Being on the cutting edge of engagmentAriel Dagan
 
Tagging - Can User Generated Content Improve Our Services?
Tagging - Can User Generated Content Improve Our Services?Tagging - Can User Generated Content Improve Our Services?
Tagging - Can User Generated Content Improve Our Services?guestff5a190a
 
HT06, Position Paper, Tagging, Taxonomy, Flickr, Academic Article, ToRead, Pr...
HT06, Position Paper, Tagging, Taxonomy, Flickr, Academic Article, ToRead, Pr...HT06, Position Paper, Tagging, Taxonomy, Flickr, Academic Article, ToRead, Pr...
HT06, Position Paper, Tagging, Taxonomy, Flickr, Academic Article, ToRead, Pr...cameron
 
Social Networks and Social Capital
Social Networks and Social CapitalSocial Networks and Social Capital
Social Networks and Social CapitalGiorgos Cheliotis
 
RSS and Social Bookmarking
RSS and Social BookmarkingRSS and Social Bookmarking
RSS and Social BookmarkingNGRF
 
Web 2.0/Library 2.0
Web 2.0/Library 2.0Web 2.0/Library 2.0
Web 2.0/Library 2.0guesta715d0
 
ADLUG 2008 Web 2.0 - Library 2.0 presentation
ADLUG 2008 Web 2.0 - Library 2.0 presentationADLUG 2008 Web 2.0 - Library 2.0 presentation
ADLUG 2008 Web 2.0 - Library 2.0 presentation@CULT Srl
 
The Power of Known Peers: A Study in Two Domains
The Power of Known Peers: A Study in Two DomainsThe Power of Known Peers: A Study in Two Domains
The Power of Known Peers: A Study in Two DomainsPeter Brusilovsky
 
Social network analysis for modeling & tuning social media website
Social network analysis for modeling & tuning social media websiteSocial network analysis for modeling & tuning social media website
Social network analysis for modeling & tuning social media websiteEdward B. Rockower
 

La actualidad más candente (19)

SRS presentation
SRS presentationSRS presentation
SRS presentation
 
Blogosphere
BlogosphereBlogosphere
Blogosphere
 
Implementation of Privacy Policy Specification System for User Uploaded Image...
Implementation of Privacy Policy Specification System for User Uploaded Image...Implementation of Privacy Policy Specification System for User Uploaded Image...
Implementation of Privacy Policy Specification System for User Uploaded Image...
 
Creating A Culture Of Collaboration In Online Communities
Creating A Culture Of Collaboration In Online CommunitiesCreating A Culture Of Collaboration In Online Communities
Creating A Culture Of Collaboration In Online Communities
 
Social Web Course @VU Amsterdam: Final Student Presentations
Social Web Course @VU Amsterdam: Final Student PresentationsSocial Web Course @VU Amsterdam: Final Student Presentations
Social Web Course @VU Amsterdam: Final Student Presentations
 
Social Bookmarking
Social BookmarkingSocial Bookmarking
Social Bookmarking
 
Social Software for Empowerment
Social Software for EmpowermentSocial Software for Empowerment
Social Software for Empowerment
 
Lecture 4: How do we MINE, ANALYSE & VISUALISE the Social Web? (VU Amsterdam ...
Lecture 4: How do we MINE, ANALYSE & VISUALISE the Social Web? (VU Amsterdam ...Lecture 4: How do we MINE, ANALYSE & VISUALISE the Social Web? (VU Amsterdam ...
Lecture 4: How do we MINE, ANALYSE & VISUALISE the Social Web? (VU Amsterdam ...
 
Reference Services & Social Networking - Being on the cutting edge of engagment
Reference Services & Social Networking - Being on the cutting edge of engagmentReference Services & Social Networking - Being on the cutting edge of engagment
Reference Services & Social Networking - Being on the cutting edge of engagment
 
Tagging - Can User Generated Content Improve Our Services?
Tagging - Can User Generated Content Improve Our Services?Tagging - Can User Generated Content Improve Our Services?
Tagging - Can User Generated Content Improve Our Services?
 
HT06, Position Paper, Tagging, Taxonomy, Flickr, Academic Article, ToRead, Pr...
HT06, Position Paper, Tagging, Taxonomy, Flickr, Academic Article, ToRead, Pr...HT06, Position Paper, Tagging, Taxonomy, Flickr, Academic Article, ToRead, Pr...
HT06, Position Paper, Tagging, Taxonomy, Flickr, Academic Article, ToRead, Pr...
 
Social Networks and Social Capital
Social Networks and Social CapitalSocial Networks and Social Capital
Social Networks and Social Capital
 
Blogosphere
BlogosphereBlogosphere
Blogosphere
 
RSS and Social Bookmarking
RSS and Social BookmarkingRSS and Social Bookmarking
RSS and Social Bookmarking
 
Web 2.0/Library 2.0
Web 2.0/Library 2.0Web 2.0/Library 2.0
Web 2.0/Library 2.0
 
Web 2.0 Presentation Tools & Resources: Flickr, SlideShare, Zoho Show & More
Web 2.0 Presentation Tools & Resources: Flickr, SlideShare, Zoho Show & MoreWeb 2.0 Presentation Tools & Resources: Flickr, SlideShare, Zoho Show & More
Web 2.0 Presentation Tools & Resources: Flickr, SlideShare, Zoho Show & More
 
ADLUG 2008 Web 2.0 - Library 2.0 presentation
ADLUG 2008 Web 2.0 - Library 2.0 presentationADLUG 2008 Web 2.0 - Library 2.0 presentation
ADLUG 2008 Web 2.0 - Library 2.0 presentation
 
The Power of Known Peers: A Study in Two Domains
The Power of Known Peers: A Study in Two DomainsThe Power of Known Peers: A Study in Two Domains
The Power of Known Peers: A Study in Two Domains
 
Social network analysis for modeling & tuning social media website
Social network analysis for modeling & tuning social media websiteSocial network analysis for modeling & tuning social media website
Social network analysis for modeling & tuning social media website
 

Destacado

Andrew Page - The demand for search in a social network
Andrew Page - The demand for search in a social networkAndrew Page - The demand for search in a social network
Andrew Page - The demand for search in a social networkDERIGalway
 
Titanic by Jermaine Monero
Titanic by Jermaine MoneroTitanic by Jermaine Monero
Titanic by Jermaine MoneroDafydd Humphreys
 
Gabriela Avram - Where is the knowledge: reflections on social networking in ...
Gabriela Avram - Where is the knowledge: reflections on social networking in ...Gabriela Avram - Where is the knowledge: reflections on social networking in ...
Gabriela Avram - Where is the knowledge: reflections on social networking in ...DERIGalway
 
Work houses by shaquille brown
Work houses by shaquille brownWork houses by shaquille brown
Work houses by shaquille brownDafydd Humphreys
 
John Breslin - Introduction / overview
John Breslin - Introduction / overviewJohn Breslin - Introduction / overview
John Breslin - Introduction / overviewDERIGalway
 
pigyear
pigyearpigyear
pigyeared2k
 
Crimewatch UK - What crime was like in Early Modern Times by Qasim Khan
Crimewatch UK - What crime was like in Early Modern Times by Qasim KhanCrimewatch UK - What crime was like in Early Modern Times by Qasim Khan
Crimewatch UK - What crime was like in Early Modern Times by Qasim KhanDafydd Humphreys
 
Pro dj
Pro djPro dj
Pro djadolan
 
Weather Phenomena by Ali Hussain
Weather Phenomena by Ali HussainWeather Phenomena by Ali Hussain
Weather Phenomena by Ali HussainDafydd Humphreys
 
Valdis Krebs - Social network analysis: 1987-2007
Valdis Krebs - Social network analysis: 1987-2007Valdis Krebs - Social network analysis: 1987-2007
Valdis Krebs - Social network analysis: 1987-2007DERIGalway
 

Destacado (15)

Andrew Page - The demand for search in a social network
Andrew Page - The demand for search in a social networkAndrew Page - The demand for search in a social network
Andrew Page - The demand for search in a social network
 
Titanic by Jermaine Monero
Titanic by Jermaine MoneroTitanic by Jermaine Monero
Titanic by Jermaine Monero
 
test
testtest
test
 
Gabriela Avram - Where is the knowledge: reflections on social networking in ...
Gabriela Avram - Where is the knowledge: reflections on social networking in ...Gabriela Avram - Where is the knowledge: reflections on social networking in ...
Gabriela Avram - Where is the knowledge: reflections on social networking in ...
 
Indians by Ali Hussain
Indians by Ali HussainIndians by Ali Hussain
Indians by Ali Hussain
 
Work houses by shaquille brown
Work houses by shaquille brownWork houses by shaquille brown
Work houses by shaquille brown
 
cliffs by jake cudd
cliffs by jake cuddcliffs by jake cudd
cliffs by jake cudd
 
Romans by Anuj Patel
Romans by Anuj PatelRomans by Anuj Patel
Romans by Anuj Patel
 
John Breslin - Introduction / overview
John Breslin - Introduction / overviewJohn Breslin - Introduction / overview
John Breslin - Introduction / overview
 
pigyear
pigyearpigyear
pigyear
 
Crimewatch UK - What crime was like in Early Modern Times by Qasim Khan
Crimewatch UK - What crime was like in Early Modern Times by Qasim KhanCrimewatch UK - What crime was like in Early Modern Times by Qasim Khan
Crimewatch UK - What crime was like in Early Modern Times by Qasim Khan
 
Pro dj
Pro djPro dj
Pro dj
 
work house by Gary Eden
work house by Gary Edenwork house by Gary Eden
work house by Gary Eden
 
Weather Phenomena by Ali Hussain
Weather Phenomena by Ali HussainWeather Phenomena by Ali Hussain
Weather Phenomena by Ali Hussain
 
Valdis Krebs - Social network analysis: 1987-2007
Valdis Krebs - Social network analysis: 1987-2007Valdis Krebs - Social network analysis: 1987-2007
Valdis Krebs - Social network analysis: 1987-2007
 

Similar a Conor Hayes - Topics, tags and trends in the blogosphere

Using Tags and Clustering to Identify Topic-specific Blogs
Using Tags and Clustering to Identify Topic-specific BlogsUsing Tags and Clustering to Identify Topic-specific Blogs
Using Tags and Clustering to Identify Topic-specific BlogsConor Hayes
 
Interlinking Online Communities and Enriching Social Software with the Semant...
Interlinking Online Communities and Enriching Social Software with the Semant...Interlinking Online Communities and Enriching Social Software with the Semant...
Interlinking Online Communities and Enriching Social Software with the Semant...John Breslin
 
Wikis as Social Networks: Evolution and Dynamics
Wikis as Social Networks:Evolution and Dynamics Wikis as Social Networks:Evolution and Dynamics
Wikis as Social Networks: Evolution and Dynamics Ralf Klamma
 
InternshipPoster
InternshipPosterInternshipPoster
InternshipPosterRu Zhao
 
The path to an hybrid open source paradigm
The path to an hybrid open source paradigmThe path to an hybrid open source paradigm
The path to an hybrid open source paradigmJonathan Challener
 
Klout as an Example Application of Topics-oriented NLP APIs
Klout as an Example Application of Topics-oriented NLP APIsKlout as an Example Application of Topics-oriented NLP APIs
Klout as an Example Application of Topics-oriented NLP APIsTyler Singletary
 
IRJET- Finding Related Forum Posts through Intention-Based Segmentation
IRJET-  	  Finding Related Forum Posts through Intention-Based SegmentationIRJET-  	  Finding Related Forum Posts through Intention-Based Segmentation
IRJET- Finding Related Forum Posts through Intention-Based SegmentationIRJET Journal
 
14420-Article Text-17938-1-2-20201228.pdf
14420-Article Text-17938-1-2-20201228.pdf14420-Article Text-17938-1-2-20201228.pdf
14420-Article Text-17938-1-2-20201228.pdfMehwishKanwal14
 
[ICDE 2014] Incremental Cluster Evolution Tracking from Highly Dynamic Networ...
[ICDE 2014] Incremental Cluster Evolution Tracking from Highly Dynamic Networ...[ICDE 2014] Incremental Cluster Evolution Tracking from Highly Dynamic Networ...
[ICDE 2014] Incremental Cluster Evolution Tracking from Highly Dynamic Networ...Pei Lee
 
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...GUANGYUAN PIAO
 
Prediction of Reaction towards Textual Posts in Social Networks
Prediction of Reaction towards Textual Posts in Social NetworksPrediction of Reaction towards Textual Posts in Social Networks
Prediction of Reaction towards Textual Posts in Social NetworksMohamed El-Geish
 
Web 2.0: Implications for Library Services
Web 2.0: Implications for Library ServicesWeb 2.0: Implications for Library Services
Web 2.0: Implications for Library ServicesADINET Ahmedabad
 
Open Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and ExchangeOpen Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and Exchangelagoze
 
Extracting Key Terms From Noisy and Multi-theme Documents
Extracting Key Terms From Noisy and Multi-theme DocumentsExtracting Key Terms From Noisy and Multi-theme Documents
Extracting Key Terms From Noisy and Multi-theme Documentsmaria.grineva
 
Effective Extraction of Thematically Grouped Key Terms From Text
Effective Extraction of Thematically Grouped Key Terms From TextEffective Extraction of Thematically Grouped Key Terms From Text
Effective Extraction of Thematically Grouped Key Terms From Textmaria.grineva
 

Similar a Conor Hayes - Topics, tags and trends in the blogosphere (20)

Using Tags and Clustering to Identify Topic-specific Blogs
Using Tags and Clustering to Identify Topic-specific BlogsUsing Tags and Clustering to Identify Topic-specific Blogs
Using Tags and Clustering to Identify Topic-specific Blogs
 
Interlinking Online Communities and Enriching Social Software with the Semant...
Interlinking Online Communities and Enriching Social Software with the Semant...Interlinking Online Communities and Enriching Social Software with the Semant...
Interlinking Online Communities and Enriching Social Software with the Semant...
 
Wikis as Social Networks: Evolution and Dynamics
Wikis as Social Networks:Evolution and Dynamics Wikis as Social Networks:Evolution and Dynamics
Wikis as Social Networks: Evolution and Dynamics
 
InternshipPoster
InternshipPosterInternshipPoster
InternshipPoster
 
Notes on mining social media updated
Notes on mining social media updatedNotes on mining social media updated
Notes on mining social media updated
 
The path to an hybrid open source paradigm
The path to an hybrid open source paradigmThe path to an hybrid open source paradigm
The path to an hybrid open source paradigm
 
Klout as an Example Application of Topics-oriented NLP APIs
Klout as an Example Application of Topics-oriented NLP APIsKlout as an Example Application of Topics-oriented NLP APIs
Klout as an Example Application of Topics-oriented NLP APIs
 
IRJET- Finding Related Forum Posts through Intention-Based Segmentation
IRJET-  	  Finding Related Forum Posts through Intention-Based SegmentationIRJET-  	  Finding Related Forum Posts through Intention-Based Segmentation
IRJET- Finding Related Forum Posts through Intention-Based Segmentation
 
14420-Article Text-17938-1-2-20201228.pdf
14420-Article Text-17938-1-2-20201228.pdf14420-Article Text-17938-1-2-20201228.pdf
14420-Article Text-17938-1-2-20201228.pdf
 
[ICDE 2014] Incremental Cluster Evolution Tracking from Highly Dynamic Networ...
[ICDE 2014] Incremental Cluster Evolution Tracking from Highly Dynamic Networ...[ICDE 2014] Incremental Cluster Evolution Tracking from Highly Dynamic Networ...
[ICDE 2014] Incremental Cluster Evolution Tracking from Highly Dynamic Networ...
 
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...
SEMANTiCS2016 - Exploring Dynamics and Semantics of User Interests for User ...
 
Prediction of Reaction towards Textual Posts in Social Networks
Prediction of Reaction towards Textual Posts in Social NetworksPrediction of Reaction towards Textual Posts in Social Networks
Prediction of Reaction towards Textual Posts in Social Networks
 
Web 2.0: Implications for Library Services
Web 2.0: Implications for Library ServicesWeb 2.0: Implications for Library Services
Web 2.0: Implications for Library Services
 
CSE509 Lecture 5
CSE509 Lecture 5CSE509 Lecture 5
CSE509 Lecture 5
 
Webware Webinar
Webware WebinarWebware Webinar
Webware Webinar
 
Open Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and ExchangeOpen Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and Exchange
 
Extracting Key Terms From Noisy and Multi-theme Documents
Extracting Key Terms From Noisy and Multi-theme DocumentsExtracting Key Terms From Noisy and Multi-theme Documents
Extracting Key Terms From Noisy and Multi-theme Documents
 
Understanding User-Community Engagement by Multi-faceted Features: A Case ...
Understanding User-Community Engagement by Multi-faceted Features: A Case ...Understanding User-Community Engagement by Multi-faceted Features: A Case ...
Understanding User-Community Engagement by Multi-faceted Features: A Case ...
 
library 2.0
library 2.0library 2.0
library 2.0
 
Effective Extraction of Thematically Grouped Key Terms From Text
Effective Extraction of Thematically Grouped Key Terms From TextEffective Extraction of Thematically Grouped Key Terms From Text
Effective Extraction of Thematically Grouped Key Terms From Text
 

Último

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observabilityitnewsafrica
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 

Último (20)

Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security ObservabilityGlenn Lazarus- Why Your Observability Strategy Needs Security Observability
Glenn Lazarus- Why Your Observability Strategy Needs Security Observability
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 

Conor Hayes - Topics, tags and trends in the blogosphere

  • 1. WebCamp 07 Social Networks Topics Tags and Trends in the Blogosphere Conor Hayes DERI, Galway
  • 2. WebCamp 07 Social Networks Outline  Blogs and the Blogosphere  Linking Blogs - Content vs. Tags  Topics and Bloggers  User entropy  Topic drift  Blog reactivity  Identifying consistent, topic–relevant blogs
  • 3. WebCamp 07 Social Networks Blogs  Web site with journal style entries  Dated in reverse chronological order  Generally written by a single user  Regularly updated  Distributed publishing  Easy to maintain and update  Exponential growth:  Technorati: 50 millions blogs (July 2006), doubling in size every 6 months  Increasingly important indicator of public opinion on politics, technology, current affairs
  • 4. WebCamp 07 Social Networks Blogs vs. Usenet  Blogosphere is user-centred  Distributed architecture  Topic organisation is locally defined by tags  Not easy to find relevant posts related to the same topic  Usenet is topic-centred  Logically centralised architecture  Topic organisation is a priori defined by newsgroup heading, and subject headers  Users know where to go to find information on a particular topic
  • 5. WebCamp 07 Social Networks A topic-centred blogosphere?  Semantic Web  Link blogs using machine readable metadata : SIOC  Tagging  Tags: simple propositional entities, locally defined  Link analysis:  Majority of blogs have little or no inward connections  Blog Roll, Comment List  Relatively static group  ‘Conventional’ Knowledge discovery techniques  Clustering + online recommender systems
  • 6. WebCamp 07 Social Networks Nearest Neighbour Recommender  Method : 1. Periodically, identify a set of nearest neighbour blogs – one set for each topic the user is interested in. 2. Select matching posts from these neighbours  What are the implications of User drift ?  How quickly does the neighbourhood set change  What is the relationship between Users and Topics?  How consistently are bloggers attached to topics?  Which neighbours consistently provide the most topic- relevant information?
  • 7. WebCamp 07 Social Networks Experiments  We cluster blog data over different time periods  user entropy: measures whether bloggers remain together over time  topic drift: measure blogger behaviour in relation to Topic growth and drift:  We identify the most relevant blogs in each cluster using tag analysis
  • 8. WebCamp 07 Social Networks Data  We collected blog data from Jan16 to Feb 27, 2006  7200 blogs in total  We created 6 data sets, one for each week  mean of 4250 blogs per week  70% overlap between consecutive weeks  Each instance in each data set contains the posts from a single tag, from a single blogger  An instance is only included in the data set for a week only if the user has posted in that week
  • 9. WebCamp 07 Social Networks Clustering  Goals: Uncover latent structures reflecting topics in the collection and provide a means of summarisation  Spherical k-means : partitions document corpus into k disjoint groups of documents  Produces interpretable concept summary for each group  Clustering quality:  Blogs in the same cluster should be similar;  Blogs in different clusters should be dissimilar.  Hr: Ratio of intra- to inter- cluster similarity
  • 12. WebCamp 07 Social Networks At window win t+n ?
  • 13. WebCamp 07 Social Networks Methodology  We cluster each data set in date order at different k  We reuse the cluster centroids in window t to seed the clusters in window t+1
  • 14. WebCamp 07 Social Networks User Drift  We define User Entropy: a measure of the degree of user dispersion between windows wint+n q : number of clusters at wint+n containing users from cluster r nr i : number of users from cluster r contained in cluster i at wint+n nr: number of users from cluster r available at wint+n wint
  • 15. WebCamp 07 Social Networks Ur as the Interval Increases
  • 16. WebCamp 07 Social Networks Proportion of Users = mean fraction of dataset contained in top 20% of clusters = mean fraction of dataset contained in bottom 20% of clusters
  • 17. WebCamp 07 Social Networks User Drift vs. Cluster Strength (mean) correlation: Hr vs. Ur at k -0.9 -0.8 -0.7 -0.6 -0.5 -0.4 -0.3 -0.2 -0.1 0 0 10 20 30 40 50 60 70 80 90 100 k pearsonR.
  • 18. WebCamp 07 Social Networks User Drift Conclusions  Even where n is low, user dispersion occurs  As n increases user entropy also increases, suggesting that the ‘relationship’ between users based on shared topics is short lived  User dispersion is related to cluster strength  Strong clusters experience less user drift than weak clusters  However, the fraction of data from strong clusters is smaller than the fraction from weak clusters, by at least a factor of 2  We will return to user entropy later in the talk
  • 19. WebCamp 07 Social Networks Topic drift  inter window similarity Wr t+1  Wr t+1 for a cluster r at wint is the similarity between the centroid of cluster r and the centroid of the corresponding cluster r at wint+n Wr t+n = cos(Cr,t, Cr,t+n)  Intuitively, Wr t+n is a measure of the drift of the concept centroid, Cr,from wint to wint+n
  • 20. WebCamp 07 Social Networks Topic Drift vs. User Drift (mean) correlation: Wr vs. Ur at k -0.9 -0.8 -0.7 -0.6 -0.5 -0.4 -0.3 -0.2 -0.1 0 0 10 20 30 40 50 60 70 80 90 100 k pearsonR.
  • 21. WebCamp 07 Social Networks A Model of Topic Drift  Topic drift is related to user drift, but topics may be more stable than users  By observation: rate of topic change is less than rate of user drift  Our analysis so far would suggest that users are not firmly fixed to topics, rather they drift between topics over time. Type a = {X,Y,Z} Type b= {P,Q,R}
  • 22. WebCamp 07 Social Networks Muhammad Cartoon Controversy time Jan 16 Jan 23 Jan 30 Feb 06 Feb 13 Feb 20
  • 23. WebCamp 07 Social Networks Observations  The blogosphere responds quickly to breaking news stories  The relationship between topic and user drift is pronounced where topic drift is extreme  Otherwise, there is steady turnover of users around relatively stable concepts  Users ‘float’ between topics
  • 24. WebCamp 07 Social Networks Identifying the most relevant blogs  Technorati uses a tag cloud to link posts by different blogs together
  • 25. WebCamp 07 Social Networks Tag clouds  In previous work we showed that tags perform badly at grouping similar blog posts together
  • 26. WebCamp 07 Social Networks Tag clouds  Clustering followed by tag analysis allowed us to determine clusters contain strong concepts  It also allowed us to fragment the global tag space to produce local tag clouds
  • 27. WebCamp 07 Social Networks A-bloggers  A-bloggers: only a portion (<0.4) of blogs in each cluster contributes tags to the local tag cloud description
  • 28. WebCamp 07 Social Networks A-bloggers are 1. more similar to each other than c- bloggers 2. more similar to the cluster centroid (topic definition) than c-bloggers 3. more similar to pages retrieved from Google using the topic description
  • 29. WebCamp 07 Social Networks Entropy: a-blogs vs c-blogs  A-blog entropy is lower  As interval increases a-blogs experience smaller increases in entropy  Suggests that a-bloggers tend to write consistently about the same things over time
  • 30. WebCamp 07 Social Networks A-blogs Example  Cluster 28 in Win5; k =50  Cluster description: mobile, internet, weblog, web, patent A-blogs 1) “Comunications: technology, economic and social issues at the intersection of telecom, mobility and the Internet” 2) “IP Blawg”: technology and Intellectual property blog 3) “Small business IP management blog: Patent, Trademark, Copyright, Internet, and Technology Law” 4) “Open Gardens: Wireless mobility, Digital convergence - Mobile web 2.0” 5) “Mobile Enterprise Weblog: the voice of enterprise mobility management” C-blogs 1) “Digital Music Den: Digital Music, online music marketing” 2) "icarusindie.com – blog about nothing”: general computing and technology 3) “Dunkie's Saga” - personal blog: personal, cars, games, quizzes, some technology 4) “Complex Christ – a vision for church that is organic, networked, decentralized, bottom-up, emergent, communal, flexible, always evolving” 5) “Philips Brooks patent infringement updates”: legal blog on general patent issues (pharmaceutical as well as technological)
  • 31. WebCamp 07 Social Networks Conclusions  We have accumulated empirical evidence to suggest that a-bloggers are topic authorities  Tend to form tight subgroups close to cluster topic definition  Consistently more similar to pages ranked by Google using the cluster topic definition  Tend to stay together at differerent clusterings over time. In other words they tend to write regularly about the same topic  What characteristics does the a-blogger have?  A blogger that is aware of a wider potential readership and chooses his/her tags so that they can be understood easily by others  Writes regularly in depth about fairly narrowly defined subjects  New professional bloggers
  • 32. WebCamp 07 Social Networks Future Work  Produce tag hierarchies using hierarchical clustering  Combine this with the work of Hak Lae Kim and Dr. Suk-Hyung Hwang:  Formal concept model of a blog social network using tags and content analysis  Tag recommender: Enrich SIOC topic descriptions with tag cloud meta data  Develop a set of style features to classify blogs
  • 33. WebCamp 07 Social Networks References  Hayes, C., Avesani, P (2007) Using Tags and Clustering to Identify Topic Relevant Blogs in Proceedings of the International Conference on Weblogs and Social Media (ICWSM 07)  Hayes, C., Avesani, P., Veeramachaneni, S. (2007) An Analysis of the Use of Tags in a Blog Recommender System. In proceedings of IJCAI-07, the International Joint Conference on Artificial Intelligence  Hayes, C., Avesani, P., Veeramachaneni, S. (2006) An Analysis of Bloggers and Topics for a Blog Recommender System in proceedings of the Workshop on Web Mining (Webmine 06) , 7th European Conference on Machine Learning and the 10th European Conference on Principles