SlideShare a Scribd company logo
1 of 13
Download to read offline
Digital Enterprise Research Institute                                                               www.deri.ie




                            Towards Cross-Community
                              Information Diffusion
                                  Maximisation
                  Václav Belák, Samantha Lam, Conor Hayes



© Copyright 2011 Digital Enterprise Research Institute. All rights reserved.




                                                                               Enabling Networked Knowledge
Motivation
Digital Enterprise Research Institute                                        www.deri.ie


   •  Information cascades of high interest in marketing, CRM, etc.
   •  A common approach is to maximise information diffusion by
      targeting influential actors
   •  In the context of many online communities (e.g. discussion
      fora) the information is shared to the community as a whole
      and not to individual actors




  common case – targeting individuals    cross-community case – targeting communities

                                                     Enabling Networked Knowledge
Objectives
Digital Enterprise Research Institute                             www.deri.ie




   •  Our main hypothesis is that it is possible to efficiently
      spread a message over the information flow network by
      targeting highly influential communities


   •  The main problem is then formulated as a prediction of
      the set of communities to target such that the message is
      spread over the network as much as possible
       •  Spread over the actors, i.e. user activation fraction
       •  Spread over the communities, i.e. community
          activation fraction


                                             Enabling Networked Knowledge
Methods: Definition of Impact
Digital Enterprise Research Institute                                   www.deri.ie



  •  We propose (Belák et al., ‘12) to take two factors into account:
      1.  degree of community membership of the users
      2.  centrality of the users within each community




  •  Impact of community A on community B defined as an average centrality of
     actors from A within B, weighted by their membership in A

                                                   Enabling Networked Knowledge
Methods: Targeting
                                Communities
Digital Enterprise Research Institute                                              www.deri.ie

   •  Level of dispersion (heterogeneity) of total impact of community i can be
      measured as an entropy of an i-th row/column of the impact matrix

   •  We propose to target communities by means of the product of the total
      impact of community i and its entropy: impact focus (IF)

   •  We simulated the diffusion by extending Independent Cascade (ICM) and
      Linear Threshold (LTM) Models (Kempe et al., ‘03)
        1.  Take q target communities and sample s users from each of them
        2.  Run the original models from the union of sampled users
   •  Information diffusion network derived from the reply-to network:
                                             replies to
                                        i       rji       j


                                            information
                                        i                 j
                                              flow wij

                                                              Enabling Networked Knowledge
Evaluation Strategy
Digital Enterprise Research Institute                                       www.deri.ie


         •  IF compared with random targeting (R), and group in-degree (GI)
            (Everett & Borgatti, ’99)

         •  The main aim was to investigate robustness of our framework with
            respect to:
              •  Character of the system
              •  Diffusion models
              •  User and Community Activation Fractions

         •  Procedural outline
             1.  Target q communities using one of the heuristics evaluated on
                 the data from time-slice t
             2.  Run the diffusion model on the network from time-slice t+1
             3.  Compute an average user and community spreads over all
                 pairs (t, t+1)


                                                    Enabling Networked Knowledge
Evaluation Data-Sets
Digital Enterprise Research Institute                                              www.deri.ie



  •  51 weeks of data of the largest Irish
     discussion board system
  •  Segmented using 1 week sliding window
      •  1 week window represents approx. 84% of
         cross-fora posting activity
  •  540 communities, 5.3k users/snapshot (avg)



                            •  5 years of data from the technical support fora of SAP
                            •  Used only for the diffusion experiments
                            •  Segmented using 2 months sliding window
                                •  2 months represent approx. 50% of cross-fora posting
                                   activity
                            •  33 communities, 2k users/snapshot (avg)

                                                            Enabling Networked Knowledge
User Act. Fraction
Digital Enterprise Research Institute                                                                                                                                  www.deri.ie



                                                                             One targeted community
                                                     q=1, Boards−LTM                                                                  q=1, SAP−LTM
                                           0.8




                                                                                                                           0.30
                                           0.7




                                                                                                                           0.25
                                           0.6
       mean user activation fraction (u)




                                                                                       mean user activation fraction (u)

                                                                                                                           0.20
                                           0.5




                                                                                                                           0.15
                                           0.4




                                                                                                                           0.10
                                           0.3




                                                                                                                           0.05
                                           0.2




                                                                                  IF                                                                              IF
                                                                                  GI                                                                              GI
                                                                                                                           0.00
                                           0.1




                                                                                  R                                                                               R


                                                 5          10              15    20                                              5          10              15   20

                                                     user sample size (s)                                                             user sample size (s)




                                                                                                                                  Enabling Networked Knowledge
Community Act. Fr.
Digital Enterprise Research Institute                                                                                                                                              www.deri.ie



                                                                                   One targeted community
                                                            q=1, Boards−LTM                                                                       q=1, SAP−LTM




                                                                                                                                       0.5
                                                  0.8
                                                  0.7




                                                                                                                                       0.4
         mean community activation fraction (c)




                                                                                              mean community activation fraction (c)
                                                  0.6




                                                                                                                                       0.3
                                                  0.5
                                                  0.4




                                                                                                                                       0.2
                                                  0.3




                                                                                                                                       0.1
                                                  0.2




                                                                                         IF                                                                                   IF
                                                                                         GI                                                                                   GI
                                                  0.1




                                                                                                                                       0.0

                                                                                         R                                                                                    R


                                                        5          10              15    20                                                   5          10              15   20

                                                            user sample size (s)                                                                  user sample size (s)




                                                                                                                                             Enabling Networked Knowledge
Community Act. Fr.
Digital Enterprise Research Institute                                                                                                                                              www.deri.ie



                                                                             Five targeted communities
                                                             q=5, Boards−LTM                                                                      q=5, SAP−LTM




                                                                                                                                       0.5
                                                   0.8
                                                   0.7




                                                                                                                                       0.4
          mean community activation fraction (c)




                                                                                              mean community activation fraction (c)
                                                   0.6




                                                                                                                                       0.3
                                                   0.5
                                                   0.4




                                                                                                                                       0.2
                                                   0.3




                                                                                                                                       0.1
                                                   0.2




                                                                                         IF                                                                                   IF
                                                                                         GI                                                                                   GI
                                                   0.1




                                                                                                                                       0.0

                                                                                         R                                                                                    R


                                                         5          10              15   20                                                   5          10              15   20

                                                             user sample size (s)                                                                 user sample size (s)




                                                                                                                                             Enabling Networked Knowledge
Results Highlights
Digital Enterprise Research Institute                                     www.deri.ie


       •  Diffusion process became saturated at approximately 80% of users
          or communities in Boards, and 30% in SAP
           •  More efficient to target few communities

       •  Impact Focus outperformed the other two strategies with respect to
          both user and community activation fractions, namely for small
          number of targeted communities (i.e. [1, 2]) and
          seed users (i.e. [1, 20])
           •  Diminishing returns

       •  For high number of targeted communities and seed users, random
          strategy outperformed the other two with respect to community
          activation fractions in SAP data-set
            •  SAP network fragmented into many small components, which
               made it hard to reach peripheral communities


                                                   Enabling Networked Knowledge
Conclusion
Digital Enterprise Research Institute                               www.deri.ie



       •  The evaluation demonstrated that the framework
           •  is able to identify highly influential communities
           •  can predict which communities to target s.t. the
              message spreads efficiently over both individual users
              and communities

       •  We aim to extend it with content analysis
           •  E.g. What are the most influential communities with
              respect to a particular topic?

       •  We will also investigate empirically-observed topic
          cascades and modify our models accordingly if needed


                                             Enabling Networked Knowledge
Questions?
Digital Enterprise Research Institute                                       www.deri.ie




      References

      •  Belák V., Lam S., Hayes C. Cross-Community Influence in Discussion
         Fora. ICWSM. AAAI, 2012.
      •  M. Everett and S. Borgatti. The centrality of groups and classes. J. of
         Mathematical Sociology, 23(3):181–201, 1999.
      •  D. Kempe, J. Kleinberg, and É. Tardos. Maximizing the spread of
         influence through a social network. SIGKDD. ACM, 2003.

                                                     Enabling Networked Knowledge

More Related Content

What's hot

Cristina Torrecillas: "Building evidence to measure the socio-economic impact...
Cristina Torrecillas: "Building evidence to measure the socio-economic impact...Cristina Torrecillas: "Building evidence to measure the socio-economic impact...
Cristina Torrecillas: "Building evidence to measure the socio-economic impact...TELECENTRE EUROPE
 
Connect With Customers And Your Social Networks Using Oracle Fusion Crm
Connect With Customers And Your Social Networks Using Oracle Fusion CrmConnect With Customers And Your Social Networks Using Oracle Fusion Crm
Connect With Customers And Your Social Networks Using Oracle Fusion CrmJerome Leonard
 
Network UniverCity- OU Lounge
Network UniverCity- OU LoungeNetwork UniverCity- OU Lounge
Network UniverCity- OU Loungedharmesh gangani
 
Pulse survey july 2012
Pulse survey july 2012Pulse survey july 2012
Pulse survey july 2012mdarder
 
Pulse survey july 2012
Pulse survey july 2012Pulse survey july 2012
Pulse survey july 2012Pulse_Intranet
 
E democracy, visualization, open data, digital citizenship
E democracy, visualization, open data, digital citizenshipE democracy, visualization, open data, digital citizenship
E democracy, visualization, open data, digital citizenship@cristobalcobo
 
Pulse survey july 2012 22
Pulse survey july 2012 22Pulse survey july 2012 22
Pulse survey july 2012 22Pulse_Intranet
 
Pulse survey july 2012
Pulse survey july 2012Pulse survey july 2012
Pulse survey july 2012Pulse_Intranet
 
Dundu - The philosophy that enables a group
Dundu - The philosophy that enables a groupDundu - The philosophy that enables a group
Dundu - The philosophy that enables a groupFabian Seewald
 
Newcastle 3 8 2012
Newcastle 3 8 2012Newcastle 3 8 2012
Newcastle 3 8 2012Mazzara1976
 
Webinar: Enterprise Social Networking to Foster Employee Engagement
Webinar: Enterprise Social Networking  to Foster Employee Engagement Webinar: Enterprise Social Networking  to Foster Employee Engagement
Webinar: Enterprise Social Networking to Foster Employee Engagement tibbr
 
Knowledge Management
Knowledge ManagementKnowledge Management
Knowledge ManagementAKAGroup
 
Open Data for Health of Natural Capital
Open Data for Health of Natural CapitalOpen Data for Health of Natural Capital
Open Data for Health of Natural CapitalOpen Knowledge Canada
 
Hypersoft Operational and Organizational Intelligence
Hypersoft Operational and Organizational IntelligenceHypersoft Operational and Organizational Intelligence
Hypersoft Operational and Organizational IntelligencePavelHypersoft
 
Collaborative eResearch in a Social Cloud
Collaborative eResearch in a Social CloudCollaborative eResearch in a Social Cloud
Collaborative eResearch in a Social CloudSimon Caton
 
Lll wces2012 barcelona
Lll wces2012 barcelonaLll wces2012 barcelona
Lll wces2012 barcelonaInês Messias
 

What's hot (18)

Cristina Torrecillas: "Building evidence to measure the socio-economic impact...
Cristina Torrecillas: "Building evidence to measure the socio-economic impact...Cristina Torrecillas: "Building evidence to measure the socio-economic impact...
Cristina Torrecillas: "Building evidence to measure the socio-economic impact...
 
Connect With Customers And Your Social Networks Using Oracle Fusion Crm
Connect With Customers And Your Social Networks Using Oracle Fusion CrmConnect With Customers And Your Social Networks Using Oracle Fusion Crm
Connect With Customers And Your Social Networks Using Oracle Fusion Crm
 
Network UniverCity- OU Lounge
Network UniverCity- OU LoungeNetwork UniverCity- OU Lounge
Network UniverCity- OU Lounge
 
Pulse survey july 2012
Pulse survey july 2012Pulse survey july 2012
Pulse survey july 2012
 
Pulse survey july 2012
Pulse survey july 2012Pulse survey july 2012
Pulse survey july 2012
 
E democracy, visualization, open data, digital citizenship
E democracy, visualization, open data, digital citizenshipE democracy, visualization, open data, digital citizenship
E democracy, visualization, open data, digital citizenship
 
Networked Innovation And Collaboration
Networked Innovation And CollaborationNetworked Innovation And Collaboration
Networked Innovation And Collaboration
 
Pulse survey july 2012 22
Pulse survey july 2012 22Pulse survey july 2012 22
Pulse survey july 2012 22
 
Pulse survey july 2012
Pulse survey july 2012Pulse survey july 2012
Pulse survey july 2012
 
Dundu - The philosophy that enables a group
Dundu - The philosophy that enables a groupDundu - The philosophy that enables a group
Dundu - The philosophy that enables a group
 
Newcastle 3 8 2012
Newcastle 3 8 2012Newcastle 3 8 2012
Newcastle 3 8 2012
 
Opening up government
Opening up governmentOpening up government
Opening up government
 
Webinar: Enterprise Social Networking to Foster Employee Engagement
Webinar: Enterprise Social Networking  to Foster Employee Engagement Webinar: Enterprise Social Networking  to Foster Employee Engagement
Webinar: Enterprise Social Networking to Foster Employee Engagement
 
Knowledge Management
Knowledge ManagementKnowledge Management
Knowledge Management
 
Open Data for Health of Natural Capital
Open Data for Health of Natural CapitalOpen Data for Health of Natural Capital
Open Data for Health of Natural Capital
 
Hypersoft Operational and Organizational Intelligence
Hypersoft Operational and Organizational IntelligenceHypersoft Operational and Organizational Intelligence
Hypersoft Operational and Organizational Intelligence
 
Collaborative eResearch in a Social Cloud
Collaborative eResearch in a Social CloudCollaborative eResearch in a Social Cloud
Collaborative eResearch in a Social Cloud
 
Lll wces2012 barcelona
Lll wces2012 barcelonaLll wces2012 barcelona
Lll wces2012 barcelona
 

Similar to Towards Maximising Cross-Community Information Diffusion

Workforce Intelligence and Social Analytics: Opportunity at the Confluence
Workforce Intelligence and Social Analytics: Opportunity at the ConfluenceWorkforce Intelligence and Social Analytics: Opportunity at the Confluence
Workforce Intelligence and Social Analytics: Opportunity at the ConfluenceYvette Cameron
 
Gis - open source potentials
Gis  - open source potentialsGis  - open source potentials
Gis - open source potentialsTim Willoughby
 
ESM Readiness Assessment:Organisational Semiotics Perspective
ESM Readiness Assessment:Organisational Semiotics PerspectiveESM Readiness Assessment:Organisational Semiotics Perspective
ESM Readiness Assessment:Organisational Semiotics PerspectiveAimee Jacobs
 
Open Entrepreneurship_Teigland, Di Gangi, Yetis
Open Entrepreneurship_Teigland, Di Gangi, YetisOpen Entrepreneurship_Teigland, Di Gangi, Yetis
Open Entrepreneurship_Teigland, Di Gangi, YetisRobin Teigland
 
Nfa workshop introductions_wdonnelly
Nfa workshop introductions_wdonnellyNfa workshop introductions_wdonnelly
Nfa workshop introductions_wdonnellyShane Dempsey
 
Learning Analytics in a Mobile World - A Community Information Systems Perspe...
Learning Analytics in a Mobile World - A Community Information Systems Perspe...Learning Analytics in a Mobile World - A Community Information Systems Perspe...
Learning Analytics in a Mobile World - A Community Information Systems Perspe...Ralf Klamma
 
2012 03 16 (uc3m) emadrid rklamma rwth au analitica aprendizaje mundo movil p...
2012 03 16 (uc3m) emadrid rklamma rwth au analitica aprendizaje mundo movil p...2012 03 16 (uc3m) emadrid rklamma rwth au analitica aprendizaje mundo movil p...
2012 03 16 (uc3m) emadrid rklamma rwth au analitica aprendizaje mundo movil p...eMadrid network
 
Social networking text mining - analytics in km 13.dec.2011
Social networking   text mining - analytics in km 13.dec.2011Social networking   text mining - analytics in km 13.dec.2011
Social networking text mining - analytics in km 13.dec.2011HCL Technologies
 
TruSIS: Trust Accross Social Network
TruSIS: Trust Accross Social NetworkTruSIS: Trust Accross Social Network
TruSIS: Trust Accross Social NetworkLora Aroyo
 
The JLeRN Experiment: Dev8eD 2012 Learning Registry Workshop
The JLeRN Experiment: Dev8eD 2012 Learning Registry Workshop The JLeRN Experiment: Dev8eD 2012 Learning Registry Workshop
The JLeRN Experiment: Dev8eD 2012 Learning Registry Workshop Sarah Currier
 
OrteliusMicroserviceVisionaries2022_Why do you need a microservice catalog to...
OrteliusMicroserviceVisionaries2022_Why do you need a microservice catalog to...OrteliusMicroserviceVisionaries2022_Why do you need a microservice catalog to...
OrteliusMicroserviceVisionaries2022_Why do you need a microservice catalog to...Turja Narayan Chaudhuri
 
Meetup 11 here&now_megatriscomp design methodpartii_v0.2
Meetup 11 here&now_megatriscomp design methodpartii_v0.2Meetup 11 here&now_megatriscomp design methodpartii_v0.2
Meetup 11 here&now_megatriscomp design methodpartii_v0.2Francesco Rago
 
Next Generation Internet
Next Generation InternetNext Generation Internet
Next Generation InternetSabiha M
 
Social business and innovation
Social business and innovationSocial business and innovation
Social business and innovationJohn Mancini
 
Gephi icwsm-tutorial
Gephi icwsm-tutorialGephi icwsm-tutorial
Gephi icwsm-tutorialcsedays
 
Civil Service Live 2012 - using the G-Cloud to Herd Cats
Civil Service Live 2012 - using the G-Cloud to Herd CatsCivil Service Live 2012 - using the G-Cloud to Herd Cats
Civil Service Live 2012 - using the G-Cloud to Herd CatsKahootz
 

Similar to Towards Maximising Cross-Community Information Diffusion (20)

Workforce Intelligence and Social Analytics: Opportunity at the Confluence
Workforce Intelligence and Social Analytics: Opportunity at the ConfluenceWorkforce Intelligence and Social Analytics: Opportunity at the Confluence
Workforce Intelligence and Social Analytics: Opportunity at the Confluence
 
Gis - open source potentials
Gis  - open source potentialsGis  - open source potentials
Gis - open source potentials
 
ESM Readiness Assessment:Organisational Semiotics Perspective
ESM Readiness Assessment:Organisational Semiotics PerspectiveESM Readiness Assessment:Organisational Semiotics Perspective
ESM Readiness Assessment:Organisational Semiotics Perspective
 
Open Entrepreneurship_Teigland, Di Gangi, Yetis
Open Entrepreneurship_Teigland, Di Gangi, YetisOpen Entrepreneurship_Teigland, Di Gangi, Yetis
Open Entrepreneurship_Teigland, Di Gangi, Yetis
 
Nfa workshop introductions_wdonnelly
Nfa workshop introductions_wdonnellyNfa workshop introductions_wdonnelly
Nfa workshop introductions_wdonnelly
 
Learning Analytics in a Mobile World - A Community Information Systems Perspe...
Learning Analytics in a Mobile World - A Community Information Systems Perspe...Learning Analytics in a Mobile World - A Community Information Systems Perspe...
Learning Analytics in a Mobile World - A Community Information Systems Perspe...
 
2012 03 16 (uc3m) emadrid rklamma rwth au analitica aprendizaje mundo movil p...
2012 03 16 (uc3m) emadrid rklamma rwth au analitica aprendizaje mundo movil p...2012 03 16 (uc3m) emadrid rklamma rwth au analitica aprendizaje mundo movil p...
2012 03 16 (uc3m) emadrid rklamma rwth au analitica aprendizaje mundo movil p...
 
Social networking text mining - analytics in km 13.dec.2011
Social networking   text mining - analytics in km 13.dec.2011Social networking   text mining - analytics in km 13.dec.2011
Social networking text mining - analytics in km 13.dec.2011
 
TruSIS: Trust Accross Social Network
TruSIS: Trust Accross Social NetworkTruSIS: Trust Accross Social Network
TruSIS: Trust Accross Social Network
 
A Methodology for Building the Internet of Things
A Methodology for Building the Internet of ThingsA Methodology for Building the Internet of Things
A Methodology for Building the Internet of Things
 
Willie Donnelly IFIF
Willie Donnelly IFIFWillie Donnelly IFIF
Willie Donnelly IFIF
 
The JLeRN Experiment: Dev8eD 2012 Learning Registry Workshop
The JLeRN Experiment: Dev8eD 2012 Learning Registry Workshop The JLeRN Experiment: Dev8eD 2012 Learning Registry Workshop
The JLeRN Experiment: Dev8eD 2012 Learning Registry Workshop
 
OrteliusMicroserviceVisionaries2022_Why do you need a microservice catalog to...
OrteliusMicroserviceVisionaries2022_Why do you need a microservice catalog to...OrteliusMicroserviceVisionaries2022_Why do you need a microservice catalog to...
OrteliusMicroserviceVisionaries2022_Why do you need a microservice catalog to...
 
2 1-research roadmap task force michele missikoff
2 1-research roadmap task force michele missikoff2 1-research roadmap task force michele missikoff
2 1-research roadmap task force michele missikoff
 
Meetup 11 here&now_megatriscomp design methodpartii_v0.2
Meetup 11 here&now_megatriscomp design methodpartii_v0.2Meetup 11 here&now_megatriscomp design methodpartii_v0.2
Meetup 11 here&now_megatriscomp design methodpartii_v0.2
 
02 Living Labs and Smart Cities Alvaro Oliveira
02 Living Labs and Smart Cities Alvaro Oliveira02 Living Labs and Smart Cities Alvaro Oliveira
02 Living Labs and Smart Cities Alvaro Oliveira
 
Next Generation Internet
Next Generation InternetNext Generation Internet
Next Generation Internet
 
Social business and innovation
Social business and innovationSocial business and innovation
Social business and innovation
 
Gephi icwsm-tutorial
Gephi icwsm-tutorialGephi icwsm-tutorial
Gephi icwsm-tutorial
 
Civil Service Live 2012 - using the G-Cloud to Herd Cats
Civil Service Live 2012 - using the G-Cloud to Herd CatsCivil Service Live 2012 - using the G-Cloud to Herd Cats
Civil Service Live 2012 - using the G-Cloud to Herd Cats
 

More from Václav Belák

Targeting Communities to Maximise Information Diffusion
Targeting Communities to Maximise Information DiffusionTargeting Communities to Maximise Information Diffusion
Targeting Communities to Maximise Information DiffusionVáclav Belák
 
Cross-Community Influence in Discussion Fora
Cross-Community Influence in Discussion ForaCross-Community Influence in Discussion Fora
Cross-Community Influence in Discussion ForaVáclav Belák
 
Life-Cycles and Mutual Effects of Scientific Communities: RSWebSci2010 poster
Life-Cycles and Mutual Effects of Scientific Communities: RSWebSci2010 posterLife-Cycles and Mutual Effects of Scientific Communities: RSWebSci2010 poster
Life-Cycles and Mutual Effects of Scientific Communities: RSWebSci2010 posterVáclav Belák
 
Supporting Self-Organization in Politics by the Semantic Web Technologies
Supporting Self-Organization in Politics by the Semantic Web TechnologiesSupporting Self-Organization in Politics by the Semantic Web Technologies
Supporting Self-Organization in Politics by the Semantic Web TechnologiesVáclav Belák
 
Life-Cycles and Mutual Effects of Scientific Communities: ASNA 2010
Life-Cycles and Mutual Effects of Scientific Communities: ASNA 2010Life-Cycles and Mutual Effects of Scientific Communities: ASNA 2010
Life-Cycles and Mutual Effects of Scientific Communities: ASNA 2010Václav Belák
 

More from Václav Belák (6)

Vaclav Belak PhD Viva
Vaclav Belak PhD VivaVaclav Belak PhD Viva
Vaclav Belak PhD Viva
 
Targeting Communities to Maximise Information Diffusion
Targeting Communities to Maximise Information DiffusionTargeting Communities to Maximise Information Diffusion
Targeting Communities to Maximise Information Diffusion
 
Cross-Community Influence in Discussion Fora
Cross-Community Influence in Discussion ForaCross-Community Influence in Discussion Fora
Cross-Community Influence in Discussion Fora
 
Life-Cycles and Mutual Effects of Scientific Communities: RSWebSci2010 poster
Life-Cycles and Mutual Effects of Scientific Communities: RSWebSci2010 posterLife-Cycles and Mutual Effects of Scientific Communities: RSWebSci2010 poster
Life-Cycles and Mutual Effects of Scientific Communities: RSWebSci2010 poster
 
Supporting Self-Organization in Politics by the Semantic Web Technologies
Supporting Self-Organization in Politics by the Semantic Web TechnologiesSupporting Self-Organization in Politics by the Semantic Web Technologies
Supporting Self-Organization in Politics by the Semantic Web Technologies
 
Life-Cycles and Mutual Effects of Scientific Communities: ASNA 2010
Life-Cycles and Mutual Effects of Scientific Communities: ASNA 2010Life-Cycles and Mutual Effects of Scientific Communities: ASNA 2010
Life-Cycles and Mutual Effects of Scientific Communities: ASNA 2010
 

Recently uploaded

Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 

Recently uploaded (20)

Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 

Towards Maximising Cross-Community Information Diffusion

  • 1. Digital Enterprise Research Institute www.deri.ie Towards Cross-Community Information Diffusion Maximisation Václav Belák, Samantha Lam, Conor Hayes © Copyright 2011 Digital Enterprise Research Institute. All rights reserved. Enabling Networked Knowledge
  • 2. Motivation Digital Enterprise Research Institute www.deri.ie •  Information cascades of high interest in marketing, CRM, etc. •  A common approach is to maximise information diffusion by targeting influential actors •  In the context of many online communities (e.g. discussion fora) the information is shared to the community as a whole and not to individual actors common case – targeting individuals cross-community case – targeting communities Enabling Networked Knowledge
  • 3. Objectives Digital Enterprise Research Institute www.deri.ie •  Our main hypothesis is that it is possible to efficiently spread a message over the information flow network by targeting highly influential communities •  The main problem is then formulated as a prediction of the set of communities to target such that the message is spread over the network as much as possible •  Spread over the actors, i.e. user activation fraction •  Spread over the communities, i.e. community activation fraction Enabling Networked Knowledge
  • 4. Methods: Definition of Impact Digital Enterprise Research Institute www.deri.ie •  We propose (Belák et al., ‘12) to take two factors into account: 1.  degree of community membership of the users 2.  centrality of the users within each community •  Impact of community A on community B defined as an average centrality of actors from A within B, weighted by their membership in A Enabling Networked Knowledge
  • 5. Methods: Targeting Communities Digital Enterprise Research Institute www.deri.ie •  Level of dispersion (heterogeneity) of total impact of community i can be measured as an entropy of an i-th row/column of the impact matrix •  We propose to target communities by means of the product of the total impact of community i and its entropy: impact focus (IF) •  We simulated the diffusion by extending Independent Cascade (ICM) and Linear Threshold (LTM) Models (Kempe et al., ‘03) 1.  Take q target communities and sample s users from each of them 2.  Run the original models from the union of sampled users •  Information diffusion network derived from the reply-to network: replies to i rji j information i j flow wij Enabling Networked Knowledge
  • 6. Evaluation Strategy Digital Enterprise Research Institute www.deri.ie •  IF compared with random targeting (R), and group in-degree (GI) (Everett & Borgatti, ’99) •  The main aim was to investigate robustness of our framework with respect to: •  Character of the system •  Diffusion models •  User and Community Activation Fractions •  Procedural outline 1.  Target q communities using one of the heuristics evaluated on the data from time-slice t 2.  Run the diffusion model on the network from time-slice t+1 3.  Compute an average user and community spreads over all pairs (t, t+1) Enabling Networked Knowledge
  • 7. Evaluation Data-Sets Digital Enterprise Research Institute www.deri.ie •  51 weeks of data of the largest Irish discussion board system •  Segmented using 1 week sliding window •  1 week window represents approx. 84% of cross-fora posting activity •  540 communities, 5.3k users/snapshot (avg) •  5 years of data from the technical support fora of SAP •  Used only for the diffusion experiments •  Segmented using 2 months sliding window •  2 months represent approx. 50% of cross-fora posting activity •  33 communities, 2k users/snapshot (avg) Enabling Networked Knowledge
  • 8. User Act. Fraction Digital Enterprise Research Institute www.deri.ie One targeted community q=1, Boards−LTM q=1, SAP−LTM 0.8 0.30 0.7 0.25 0.6 mean user activation fraction (u) mean user activation fraction (u) 0.20 0.5 0.15 0.4 0.10 0.3 0.05 0.2 IF IF GI GI 0.00 0.1 R R 5 10 15 20 5 10 15 20 user sample size (s) user sample size (s) Enabling Networked Knowledge
  • 9. Community Act. Fr. Digital Enterprise Research Institute www.deri.ie One targeted community q=1, Boards−LTM q=1, SAP−LTM 0.5 0.8 0.7 0.4 mean community activation fraction (c) mean community activation fraction (c) 0.6 0.3 0.5 0.4 0.2 0.3 0.1 0.2 IF IF GI GI 0.1 0.0 R R 5 10 15 20 5 10 15 20 user sample size (s) user sample size (s) Enabling Networked Knowledge
  • 10. Community Act. Fr. Digital Enterprise Research Institute www.deri.ie Five targeted communities q=5, Boards−LTM q=5, SAP−LTM 0.5 0.8 0.7 0.4 mean community activation fraction (c) mean community activation fraction (c) 0.6 0.3 0.5 0.4 0.2 0.3 0.1 0.2 IF IF GI GI 0.1 0.0 R R 5 10 15 20 5 10 15 20 user sample size (s) user sample size (s) Enabling Networked Knowledge
  • 11. Results Highlights Digital Enterprise Research Institute www.deri.ie •  Diffusion process became saturated at approximately 80% of users or communities in Boards, and 30% in SAP •  More efficient to target few communities •  Impact Focus outperformed the other two strategies with respect to both user and community activation fractions, namely for small number of targeted communities (i.e. [1, 2]) and seed users (i.e. [1, 20]) •  Diminishing returns •  For high number of targeted communities and seed users, random strategy outperformed the other two with respect to community activation fractions in SAP data-set •  SAP network fragmented into many small components, which made it hard to reach peripheral communities Enabling Networked Knowledge
  • 12. Conclusion Digital Enterprise Research Institute www.deri.ie •  The evaluation demonstrated that the framework •  is able to identify highly influential communities •  can predict which communities to target s.t. the message spreads efficiently over both individual users and communities •  We aim to extend it with content analysis •  E.g. What are the most influential communities with respect to a particular topic? •  We will also investigate empirically-observed topic cascades and modify our models accordingly if needed Enabling Networked Knowledge
  • 13. Questions? Digital Enterprise Research Institute www.deri.ie References •  Belák V., Lam S., Hayes C. Cross-Community Influence in Discussion Fora. ICWSM. AAAI, 2012. •  M. Everett and S. Borgatti. The centrality of groups and classes. J. of Mathematical Sociology, 23(3):181–201, 1999. •  D. Kempe, J. Kleinberg, and É. Tardos. Maximizing the spread of influence through a social network. SIGKDD. ACM, 2003. Enabling Networked Knowledge