SlideShare una empresa de Scribd logo
1 de 27
Networkx & Gephi Tutorial
          #pydata
     Gilad Lotan | @gilgul
link
#gayrights, #lgbt, #jesus,                          #palestine, #OWS, #immigration,
#flipflop, #jobs, #economy                          #abortion
                             #republican, #dems,
                             #economics, #amnesty
#Debates / Ohio
#Debates / Ohio


Politicos




            Ohio based Media




            OSU Students
• Node network properties
  – from immediate connections
                                                                     indegree=3
    • indegree
      how many directed edges (arcs) are incident on a node
                                                                   outdegree=2
    • outdegree
      how many directed edges (arcs) originate at a node
                                                                     degree=5

    • degree (in or out)
      number of edges incident on a node


  – from the entire graph
    • centrality (betweenness, closeness)

                                                  Source: Lada Adamic (SI508-F08)
Example Graph Types
• Complete Graph



• Bipartite Graph
  – Vertices can be divided into two disjoint sets
  – Ex: students & schools
Social Network Attributes
• Scale Free
  – Degree distribution follows a power law
  – Barabasi et al (‘99): mapped the topology of a portion
    of the web



• Small World
  – Most nodes are not neighbors, but can be reached by
    small number of hops
  – Watts & Strogatz (’98)
  – Properties: cliques, sub networks with high clustering
    coefficient, most pairs of nodes connected by at least
    one short path
(Zachary) Karate club graph

                              social network of friendships
                              between 34 members of a karate
                              club at a US university in the
                              1970s.

                              Standard test network for
                              clustering algorithms -> during
                              the observation period the club
                              broke up into two separate clubs
                              over a conflict.
Graph Measures
• Centrality
  – Betweenness
  – Closeness
  – Eigenvector
  – Degree


• Clustering Coefficient (clique)
• Modularity
Graph Layout
• Open Ord
  – Better distinguishes clusters
• Yifan Hu
• Force Atlas
• Fruchterman Reingold
  – Graph as a system of mass particles
    (nodes:particles, edges:springs)
Networkx
Graph Generators
Generate Twitter Graph
graphml file



               nodes




               edges
Twitter Users with Python in their Bios
• 2 days of Twitter data (Oct 24th and 25th)
• Total: 4246 users (62k tweets)
• @mikanyan1 tweeted 795 times
Pythonistas on
    Twitter
Pythonistas on
                                                 Twitter
                                                  Spanish Speakers
              English / European


                                                                Chinese




Python
(the snake)


                                                     Japanese




                        Musicians, Artists
Twitter User Community: Data Science
• Grepped from Twitter bios over 1 week:
"data science|data scientist|machine learning|data strateg”


• 1053 Users
• 14k Tweets
• Most tweeting users:
   – @data_nerd (659)
   – @Chantel_Esworth (562)
   – @Da5_12 (253)
Dataists on Twitter
Thank You

   Gilad Lotan
 Twitter: @gilgul
Github: giladlotan

Más contenido relacionado

La actualidad más candente

From Pandas to Koalas: Reducing Time-To-Insight for Virgin Hyperloop's Data
From Pandas to Koalas: Reducing Time-To-Insight for Virgin Hyperloop's DataFrom Pandas to Koalas: Reducing Time-To-Insight for Virgin Hyperloop's Data
From Pandas to Koalas: Reducing Time-To-Insight for Virgin Hyperloop's DataDatabricks
 
아마존 혁신의 배경 및 Digital Innovation Program 소개 – 김중수, AWS 사업개발 담당/ 김성락, LG 인화원 책...
아마존 혁신의 배경 및 Digital Innovation Program 소개 – 김중수, AWS  사업개발 담당/ 김성락, LG 인화원 책...아마존 혁신의 배경 및 Digital Innovation Program 소개 – 김중수, AWS  사업개발 담당/ 김성락, LG 인화원 책...
아마존 혁신의 배경 및 Digital Innovation Program 소개 – 김중수, AWS 사업개발 담당/ 김성락, LG 인화원 책...Amazon Web Services Korea
 
Aurora Serverless: Scalable, Cost-Effective Application Deployment (DAT336) -...
Aurora Serverless: Scalable, Cost-Effective Application Deployment (DAT336) -...Aurora Serverless: Scalable, Cost-Effective Application Deployment (DAT336) -...
Aurora Serverless: Scalable, Cost-Effective Application Deployment (DAT336) -...Amazon Web Services
 
Data Modeling with Neo4j
Data Modeling with Neo4jData Modeling with Neo4j
Data Modeling with Neo4jNeo4j
 
Reklam Mesajında stil, ton ve yapım formatları
Reklam Mesajında stil, ton ve yapım formatlarıReklam Mesajında stil, ton ve yapım formatları
Reklam Mesajında stil, ton ve yapım formatlarıumrtnr
 
The current state of generative AI
The current state of generative AIThe current state of generative AI
The current state of generative AIBenjaminlapid1
 
Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...
Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...
Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...Neo4j
 
Graph Based Recommendation Systems at eBay
Graph Based Recommendation Systems at eBayGraph Based Recommendation Systems at eBay
Graph Based Recommendation Systems at eBayDataStax Academy
 
Algida Medya Planlaması
Algida Medya PlanlamasıAlgida Medya Planlaması
Algida Medya Planlamasıyaseminbig
 
XYZ Kahve Firması Sosyal Medya Pazarlama Planı
XYZ Kahve Firması Sosyal Medya Pazarlama PlanıXYZ Kahve Firması Sosyal Medya Pazarlama Planı
XYZ Kahve Firması Sosyal Medya Pazarlama PlanıMurat Yaman
 
NLP for Biomedical Applications
NLP for Biomedical ApplicationsNLP for Biomedical Applications
NLP for Biomedical ApplicationsNVIDIA
 
How to deploy machine learning models into production
How to deploy machine learning models into productionHow to deploy machine learning models into production
How to deploy machine learning models into productionDataWorks Summit
 
AWS Summit Seoul 2023 | 아마존의 공급망 전략을 배워보고, 우리 회사에 적용하기
AWS Summit Seoul 2023 | 아마존의 공급망 전략을 배워보고, 우리 회사에 적용하기AWS Summit Seoul 2023 | 아마존의 공급망 전략을 배워보고, 우리 회사에 적용하기
AWS Summit Seoul 2023 | 아마존의 공급망 전략을 배워보고, 우리 회사에 적용하기Amazon Web Services Korea
 
Neo4j Presentation
Neo4j PresentationNeo4j Presentation
Neo4j PresentationMax De Marzi
 
Sosyal Medya Proje Dosyası (Örnek) - Social Media Project File (Example)
Sosyal Medya Proje Dosyası (Örnek) - Social Media Project File (Example)Sosyal Medya Proje Dosyası (Örnek) - Social Media Project File (Example)
Sosyal Medya Proje Dosyası (Örnek) - Social Media Project File (Example)Volkan Çelik
 
2023年はTiDBの時代!
2023年はTiDBの時代!2023年はTiDBの時代!
2023年はTiDBの時代!Tomotaka6
 
Large Language Models Bootcamp
Large Language Models BootcampLarge Language Models Bootcamp
Large Language Models BootcampData Science Dojo
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
 
Strateji Ornek
Strateji OrnekStrateji Ornek
Strateji OrnekAretiasus
 
Intro to Neo4j presentation
Intro to Neo4j presentationIntro to Neo4j presentation
Intro to Neo4j presentationjexp
 

La actualidad más candente (20)

From Pandas to Koalas: Reducing Time-To-Insight for Virgin Hyperloop's Data
From Pandas to Koalas: Reducing Time-To-Insight for Virgin Hyperloop's DataFrom Pandas to Koalas: Reducing Time-To-Insight for Virgin Hyperloop's Data
From Pandas to Koalas: Reducing Time-To-Insight for Virgin Hyperloop's Data
 
아마존 혁신의 배경 및 Digital Innovation Program 소개 – 김중수, AWS 사업개발 담당/ 김성락, LG 인화원 책...
아마존 혁신의 배경 및 Digital Innovation Program 소개 – 김중수, AWS  사업개발 담당/ 김성락, LG 인화원 책...아마존 혁신의 배경 및 Digital Innovation Program 소개 – 김중수, AWS  사업개발 담당/ 김성락, LG 인화원 책...
아마존 혁신의 배경 및 Digital Innovation Program 소개 – 김중수, AWS 사업개발 담당/ 김성락, LG 인화원 책...
 
Aurora Serverless: Scalable, Cost-Effective Application Deployment (DAT336) -...
Aurora Serverless: Scalable, Cost-Effective Application Deployment (DAT336) -...Aurora Serverless: Scalable, Cost-Effective Application Deployment (DAT336) -...
Aurora Serverless: Scalable, Cost-Effective Application Deployment (DAT336) -...
 
Data Modeling with Neo4j
Data Modeling with Neo4jData Modeling with Neo4j
Data Modeling with Neo4j
 
Reklam Mesajında stil, ton ve yapım formatları
Reklam Mesajında stil, ton ve yapım formatlarıReklam Mesajında stil, ton ve yapım formatları
Reklam Mesajında stil, ton ve yapım formatları
 
The current state of generative AI
The current state of generative AIThe current state of generative AI
The current state of generative AI
 
Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...
Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...
Neo4j Demo: Using Knowledge Graphs to Classify Diabetes Patients (GlaxoSmithK...
 
Graph Based Recommendation Systems at eBay
Graph Based Recommendation Systems at eBayGraph Based Recommendation Systems at eBay
Graph Based Recommendation Systems at eBay
 
Algida Medya Planlaması
Algida Medya PlanlamasıAlgida Medya Planlaması
Algida Medya Planlaması
 
XYZ Kahve Firması Sosyal Medya Pazarlama Planı
XYZ Kahve Firması Sosyal Medya Pazarlama PlanıXYZ Kahve Firması Sosyal Medya Pazarlama Planı
XYZ Kahve Firması Sosyal Medya Pazarlama Planı
 
NLP for Biomedical Applications
NLP for Biomedical ApplicationsNLP for Biomedical Applications
NLP for Biomedical Applications
 
How to deploy machine learning models into production
How to deploy machine learning models into productionHow to deploy machine learning models into production
How to deploy machine learning models into production
 
AWS Summit Seoul 2023 | 아마존의 공급망 전략을 배워보고, 우리 회사에 적용하기
AWS Summit Seoul 2023 | 아마존의 공급망 전략을 배워보고, 우리 회사에 적용하기AWS Summit Seoul 2023 | 아마존의 공급망 전략을 배워보고, 우리 회사에 적용하기
AWS Summit Seoul 2023 | 아마존의 공급망 전략을 배워보고, 우리 회사에 적용하기
 
Neo4j Presentation
Neo4j PresentationNeo4j Presentation
Neo4j Presentation
 
Sosyal Medya Proje Dosyası (Örnek) - Social Media Project File (Example)
Sosyal Medya Proje Dosyası (Örnek) - Social Media Project File (Example)Sosyal Medya Proje Dosyası (Örnek) - Social Media Project File (Example)
Sosyal Medya Proje Dosyası (Örnek) - Social Media Project File (Example)
 
2023年はTiDBの時代!
2023年はTiDBの時代!2023年はTiDBの時代!
2023年はTiDBの時代!
 
Large Language Models Bootcamp
Large Language Models BootcampLarge Language Models Bootcamp
Large Language Models Bootcamp
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 
Strateji Ornek
Strateji OrnekStrateji Ornek
Strateji Ornek
 
Intro to Neo4j presentation
Intro to Neo4j presentationIntro to Neo4j presentation
Intro to Neo4j presentation
 

Destacado

Clustering of graphs and search of assemblages
Clustering of graphs and search of assemblagesClustering of graphs and search of assemblages
Clustering of graphs and search of assemblagesData-Centric_Alliance
 
[20140830, Pycon2014] NetworkX를 이용한 네트워크 분석
[20140830, Pycon2014] NetworkX를 이용한 네트워크 분석[20140830, Pycon2014] NetworkX를 이용한 네트워크 분석
[20140830, Pycon2014] NetworkX를 이용한 네트워크 분석Kyunghoon Kim
 
A Fast and Dirty Intro to NetworkX (and D3)
A Fast and Dirty Intro to NetworkX (and D3)A Fast and Dirty Intro to NetworkX (and D3)
A Fast and Dirty Intro to NetworkX (and D3)Lynn Cherny
 
NetworkX - python graph analysis and visualization @ PyHug
NetworkX - python graph analysis and visualization @ PyHugNetworkX - python graph analysis and visualization @ PyHug
NetworkX - python graph analysis and visualization @ PyHugJimmy Lai
 
A comparative study of social network analysis tools
A comparative study of social network analysis toolsA comparative study of social network analysis tools
A comparative study of social network analysis toolsDavid Combe
 
Ch01 네트워크와+소켓+프로그래밍+[호환+모드]
Ch01 네트워크와+소켓+프로그래밍+[호환+모드]Ch01 네트워크와+소켓+프로그래밍+[호환+모드]
Ch01 네트워크와+소켓+프로그래밍+[호환+모드]지환 김
 

Destacado (9)

Clustering of graphs and search of assemblages
Clustering of graphs and search of assemblagesClustering of graphs and search of assemblages
Clustering of graphs and search of assemblages
 
Network stats using Gephi
Network stats using GephiNetwork stats using Gephi
Network stats using Gephi
 
[20140830, Pycon2014] NetworkX를 이용한 네트워크 분석
[20140830, Pycon2014] NetworkX를 이용한 네트워크 분석[20140830, Pycon2014] NetworkX를 이용한 네트워크 분석
[20140830, Pycon2014] NetworkX를 이용한 네트워크 분석
 
A Fast and Dirty Intro to NetworkX (and D3)
A Fast and Dirty Intro to NetworkX (and D3)A Fast and Dirty Intro to NetworkX (and D3)
A Fast and Dirty Intro to NetworkX (and D3)
 
NetworkX - python graph analysis and visualization @ PyHug
NetworkX - python graph analysis and visualization @ PyHugNetworkX - python graph analysis and visualization @ PyHug
NetworkX - python graph analysis and visualization @ PyHug
 
A comparative study of social network analysis tools
A comparative study of social network analysis toolsA comparative study of social network analysis tools
A comparative study of social network analysis tools
 
Gephi Quick Start (Japanese)
Gephi Quick Start (Japanese)Gephi Quick Start (Japanese)
Gephi Quick Start (Japanese)
 
Ch01 네트워크와+소켓+프로그래밍+[호환+모드]
Ch01 네트워크와+소켓+프로그래밍+[호환+모드]Ch01 네트워크와+소켓+프로그래밍+[호환+모드]
Ch01 네트워크와+소켓+프로그래밍+[호환+모드]
 
Gephi Quick Start
Gephi Quick StartGephi Quick Start
Gephi Quick Start
 

Similar a Networkx & Gephi Tutorial #Pydata NYC

Network sampling, community detection
Network sampling, community detectionNetwork sampling, community detection
Network sampling, community detectionroberval mariano
 
Social Networks and Computer Science
Social Networks and Computer ScienceSocial Networks and Computer Science
Social Networks and Computer Sciencedragonmeteor
 
The Slashdot Zoo: Mining a Social Network with Negative Edges
The Slashdot Zoo:  Mining a Social Network with Negative EdgesThe Slashdot Zoo:  Mining a Social Network with Negative Edges
The Slashdot Zoo: Mining a Social Network with Negative EdgesJérôme KUNEGIS
 
Sylva workshop.gt that camp.2012
Sylva workshop.gt that camp.2012Sylva workshop.gt that camp.2012
Sylva workshop.gt that camp.2012CameliaN
 
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...Denis Parra Santander
 
Social network analysis
Social network analysisSocial network analysis
Social network analysisCaleb Jones
 
2010 june - personal democracy forum - marc smith - mapping political socia...
2010   june - personal democracy forum - marc smith - mapping political socia...2010   june - personal democracy forum - marc smith - mapping political socia...
2010 june - personal democracy forum - marc smith - mapping political socia...Marc Smith
 
Community Structure, Interaction and Evolution Analysis of Online Social Netw...
Community Structure, Interaction and Evolution Analysis of Online Social Netw...Community Structure, Interaction and Evolution Analysis of Online Social Netw...
Community Structure, Interaction and Evolution Analysis of Online Social Netw...Symeon Papadopoulos
 
Group and Community Detection in Social Networks
Group and Community Detection in Social NetworksGroup and Community Detection in Social Networks
Group and Community Detection in Social NetworksKent State University
 
4C13 J.15 Larson "Twitter based discourse community"
4C13 J.15 Larson "Twitter based discourse community"4C13 J.15 Larson "Twitter based discourse community"
4C13 J.15 Larson "Twitter based discourse community"rhetoricked
 
2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network Analysis2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network AnalysisMarc Smith
 
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...Marc Smith
 
Community detection in graphs
Community detection in graphsCommunity detection in graphs
Community detection in graphsNicola Barbieri
 
Social network analysis basics
Social network analysis basicsSocial network analysis basics
Social network analysis basicsPradeep Kumar
 
2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 TutorialAlexander Pico
 
Network Construction and Visualization.pdf
Network Construction and Visualization.pdfNetwork Construction and Visualization.pdf
Network Construction and Visualization.pdfFrancisco Restivo
 
Finding Key Influencers and Viral Topics in Twitter Networks Related to ISIS,...
Finding Key Influencers and Viral Topics in Twitter Networks Related to ISIS,...Finding Key Influencers and Viral Topics in Twitter Networks Related to ISIS,...
Finding Key Influencers and Viral Topics in Twitter Networks Related to ISIS,...Steve Kramer
 
项亮 推荐系统实践 从入门到精通
项亮 推荐系统实践 从入门到精通 项亮 推荐系统实践 从入门到精通
项亮 推荐系统实践 从入门到精通 topgeek
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architectureLiang Xiang
 

Similar a Networkx & Gephi Tutorial #Pydata NYC (20)

Network sampling, community detection
Network sampling, community detectionNetwork sampling, community detection
Network sampling, community detection
 
Social Networks and Computer Science
Social Networks and Computer ScienceSocial Networks and Computer Science
Social Networks and Computer Science
 
The Slashdot Zoo: Mining a Social Network with Negative Edges
The Slashdot Zoo:  Mining a Social Network with Negative EdgesThe Slashdot Zoo:  Mining a Social Network with Negative Edges
The Slashdot Zoo: Mining a Social Network with Negative Edges
 
Sylva workshop.gt that camp.2012
Sylva workshop.gt that camp.2012Sylva workshop.gt that camp.2012
Sylva workshop.gt that camp.2012
 
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
 
Social network analysis
Social network analysisSocial network analysis
Social network analysis
 
2010 june - personal democracy forum - marc smith - mapping political socia...
2010   june - personal democracy forum - marc smith - mapping political socia...2010   june - personal democracy forum - marc smith - mapping political socia...
2010 june - personal democracy forum - marc smith - mapping political socia...
 
Community Structure, Interaction and Evolution Analysis of Online Social Netw...
Community Structure, Interaction and Evolution Analysis of Online Social Netw...Community Structure, Interaction and Evolution Analysis of Online Social Netw...
Community Structure, Interaction and Evolution Analysis of Online Social Netw...
 
PMED Undergraduate Workshop - Communities & Classification in Disease Data -...
PMED Undergraduate Workshop - Communities & Classification in Disease Data  -...PMED Undergraduate Workshop - Communities & Classification in Disease Data  -...
PMED Undergraduate Workshop - Communities & Classification in Disease Data -...
 
Group and Community Detection in Social Networks
Group and Community Detection in Social NetworksGroup and Community Detection in Social Networks
Group and Community Detection in Social Networks
 
4C13 J.15 Larson "Twitter based discourse community"
4C13 J.15 Larson "Twitter based discourse community"4C13 J.15 Larson "Twitter based discourse community"
4C13 J.15 Larson "Twitter based discourse community"
 
2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network Analysis2013 NodeXL Social Media Network Analysis
2013 NodeXL Social Media Network Analysis
 
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
 
Community detection in graphs
Community detection in graphsCommunity detection in graphs
Community detection in graphs
 
Social network analysis basics
Social network analysis basicsSocial network analysis basics
Social network analysis basics
 
2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial
 
Network Construction and Visualization.pdf
Network Construction and Visualization.pdfNetwork Construction and Visualization.pdf
Network Construction and Visualization.pdf
 
Finding Key Influencers and Viral Topics in Twitter Networks Related to ISIS,...
Finding Key Influencers and Viral Topics in Twitter Networks Related to ISIS,...Finding Key Influencers and Viral Topics in Twitter Networks Related to ISIS,...
Finding Key Influencers and Viral Topics in Twitter Networks Related to ISIS,...
 
项亮 推荐系统实践 从入门到精通
项亮 推荐系统实践 从入门到精通 项亮 推荐系统实践 从入门到精通
项亮 推荐系统实践 从入门到精通
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architecture
 

Más de Gilad Lotan

Gilad Lotan, News Xchange 2014, Algorithmic Power
Gilad Lotan, News Xchange 2014, Algorithmic PowerGilad Lotan, News Xchange 2014, Algorithmic Power
Gilad Lotan, News Xchange 2014, Algorithmic PowerGilad Lotan
 
Data Science of Messy Metrics
Data Science of Messy MetricsData Science of Messy Metrics
Data Science of Messy MetricsGilad Lotan
 
A Networked Take on Influence: what we learn from data
A Networked Take on Influence: what we learn from dataA Networked Take on Influence: what we learn from data
A Networked Take on Influence: what we learn from dataGilad Lotan
 
Networked Audiences: what we learn from data / Gilad Lotan / IPZ2012
Networked Audiences: what we learn from data / Gilad Lotan / IPZ2012Networked Audiences: what we learn from data / Gilad Lotan / IPZ2012
Networked Audiences: what we learn from data / Gilad Lotan / IPZ2012Gilad Lotan
 
Cbnweekly big data
Cbnweekly big data Cbnweekly big data
Cbnweekly big data Gilad Lotan
 
SocialFlow - 140confNYC
SocialFlow - 140confNYCSocialFlow - 140confNYC
SocialFlow - 140confNYCGilad Lotan
 
Seeing your Invisible Audience
Seeing your Invisible AudienceSeeing your Invisible Audience
Seeing your Invisible AudienceGilad Lotan
 
Understanding the Hebrew Blogosphere
Understanding the Hebrew BlogosphereUnderstanding the Hebrew Blogosphere
Understanding the Hebrew BlogosphereGilad Lotan
 
imPulse: materials and interactive design
imPulse: materials and interactive designimPulse: materials and interactive design
imPulse: materials and interactive designGilad Lotan
 
Gilad Presentation on Digital Hollywood
Gilad Presentation on Digital HollywoodGilad Presentation on Digital Hollywood
Gilad Presentation on Digital HollywoodGilad Lotan
 

Más de Gilad Lotan (11)

Gilad Lotan, News Xchange 2014, Algorithmic Power
Gilad Lotan, News Xchange 2014, Algorithmic PowerGilad Lotan, News Xchange 2014, Algorithmic Power
Gilad Lotan, News Xchange 2014, Algorithmic Power
 
Data Science of Messy Metrics
Data Science of Messy MetricsData Science of Messy Metrics
Data Science of Messy Metrics
 
A Networked Take on Influence: what we learn from data
A Networked Take on Influence: what we learn from dataA Networked Take on Influence: what we learn from data
A Networked Take on Influence: what we learn from data
 
Networked Audiences: what we learn from data / Gilad Lotan / IPZ2012
Networked Audiences: what we learn from data / Gilad Lotan / IPZ2012Networked Audiences: what we learn from data / Gilad Lotan / IPZ2012
Networked Audiences: what we learn from data / Gilad Lotan / IPZ2012
 
Cbnweekly big data
Cbnweekly big data Cbnweekly big data
Cbnweekly big data
 
SocialFlow - 140confNYC
SocialFlow - 140confNYCSocialFlow - 140confNYC
SocialFlow - 140confNYC
 
Seeing your Invisible Audience
Seeing your Invisible AudienceSeeing your Invisible Audience
Seeing your Invisible Audience
 
Understanding the Hebrew Blogosphere
Understanding the Hebrew BlogosphereUnderstanding the Hebrew Blogosphere
Understanding the Hebrew Blogosphere
 
Times Open
Times OpenTimes Open
Times Open
 
imPulse: materials and interactive design
imPulse: materials and interactive designimPulse: materials and interactive design
imPulse: materials and interactive design
 
Gilad Presentation on Digital Hollywood
Gilad Presentation on Digital HollywoodGilad Presentation on Digital Hollywood
Gilad Presentation on Digital Hollywood
 

Último

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 

Último (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 

Networkx & Gephi Tutorial #Pydata NYC

  • 1. Networkx & Gephi Tutorial #pydata Gilad Lotan | @gilgul
  • 3.
  • 4.
  • 5.
  • 6. #gayrights, #lgbt, #jesus, #palestine, #OWS, #immigration, #flipflop, #jobs, #economy #abortion #republican, #dems, #economics, #amnesty
  • 8. #Debates / Ohio Politicos Ohio based Media OSU Students
  • 9. • Node network properties – from immediate connections indegree=3 • indegree how many directed edges (arcs) are incident on a node outdegree=2 • outdegree how many directed edges (arcs) originate at a node degree=5 • degree (in or out) number of edges incident on a node – from the entire graph • centrality (betweenness, closeness) Source: Lada Adamic (SI508-F08)
  • 10. Example Graph Types • Complete Graph • Bipartite Graph – Vertices can be divided into two disjoint sets – Ex: students & schools
  • 11.
  • 12. Social Network Attributes • Scale Free – Degree distribution follows a power law – Barabasi et al (‘99): mapped the topology of a portion of the web • Small World – Most nodes are not neighbors, but can be reached by small number of hops – Watts & Strogatz (’98) – Properties: cliques, sub networks with high clustering coefficient, most pairs of nodes connected by at least one short path
  • 13. (Zachary) Karate club graph social network of friendships between 34 members of a karate club at a US university in the 1970s. Standard test network for clustering algorithms -> during the observation period the club broke up into two separate clubs over a conflict.
  • 14. Graph Measures • Centrality – Betweenness – Closeness – Eigenvector – Degree • Clustering Coefficient (clique) • Modularity
  • 15. Graph Layout • Open Ord – Better distinguishes clusters • Yifan Hu • Force Atlas • Fruchterman Reingold – Graph as a system of mass particles (nodes:particles, edges:springs)
  • 19.
  • 20. graphml file nodes edges
  • 21. Twitter Users with Python in their Bios • 2 days of Twitter data (Oct 24th and 25th) • Total: 4246 users (62k tweets) • @mikanyan1 tweeted 795 times
  • 22. Pythonistas on Twitter
  • 23. Pythonistas on Twitter Spanish Speakers English / European Chinese Python (the snake) Japanese Musicians, Artists
  • 24.
  • 25. Twitter User Community: Data Science • Grepped from Twitter bios over 1 week: "data science|data scientist|machine learning|data strateg” • 1053 Users • 14k Tweets • Most tweeting users: – @data_nerd (659) – @Chantel_Esworth (562) – @Da5_12 (253)
  • 27. Thank You Gilad Lotan Twitter: @gilgul Github: giladlotan

Notas del editor

  1. Homophily
  2. Endogenous Trend – information spread
  3. Exogenous information spread
  4. Hashtags have emerged as a way for people to gather around topics or events.
  5. - Mitt romney: #gayrights, #lgbt, #jesus, #flipflop, #jobs, #economy- Newt Gingrich: #palestine, #OWS, #immigration, #abortion (he famously said – “Stop whining, take a bath and get a job!”Equal: #republican, #dems, #economics, #amnestyCo-occurence
  6. Networkx supports
  7. Zachary's Karate Club Graph describes the friendships between the members of a US karate club in the 1970s. The significant feature of this social network is that the club president and the instructor were involved in a dispute (some might say: a fight) over the issue of how much to charge for lessons. This split the club into two factions, one centred around the president, and the other centred around the instructor.
  8. Betweenness – number of shortest paths from all vertices that pass through that node / positioningCloseness – how fast it will take to spread information from s to all other nodes sequentially / distance of s from all other actors in a networkEigenvector – measure of the influence of a node (page rank, connections to high scoring nodes contribute more to the score)Clustering Coefficient – measure of degree to which nodes in a graph tend to cluster together (how close to being a clique = 1)
  9. NetworkX is a Python language software package for the creation, manipulation, and study of the structure, dynamics, and function of complex networks.NetworkX was born in May 2002. The original version was designed and written by AricHagberg, Dan Schult, and Pieter Swart in 2002 and 2003. The first public release was in April 2005.
  10. Python – user description2 days of Twitter data-