SlideShare a Scribd company logo
1 of 22
Gossip-based Partitioning
and Replication
Middle-ware for
Online Social Networks
Muhammad Anis Uddin Nasir
(EMDC/ICT/LCN)
Supervisor: Šarūnas Girdzijauskas
Examiner: Johan Montelius
Online Social Networks
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
•Vertices •Edges •Metadata
Ioanna Antonio Vaidas
Aras
Vasia
Anis
Mudit
Manos
2
LeandroJohan
Existing Solutions
• Relational Databases
- MySQL Cluster
• Key-Value stores
- Cassandra, Amazon Dynamo
• Document Databases
- MongoDB, CouchDB
• Graph Databases
- Neo4j, Titans
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 3
Why Existing Solutions are not
enough?
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
5
3
4
2
1
10
8
9
7
6
4
Why Existing Solutions are not
enough?
• Random Partitioning
• Social Request
- E.g., gather new feeds
from all the friends
• Enforcing Data
Locality
• Random partitioning
can lead to full
replication!
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
5
3
4
2
1
10
8
9
7
6
1 4 7 82 3 5 6 10 9
1’ 4’ 7’ 8’ 9’ 2’ 3’ 6’5’ 10’
5
Social Graphs are not Random
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 6
Graph Partitioning
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 7
JA-BE-JA- edge-cut
8/27/2013
Muhammad Anis Uddin Nasir- Gossip-based Partitioning and
Replication Middle-ware
Server A Server B
6
3
5
2
1
4
76’
3’
1’
4’
7’
• Edge Cut = 3 links, 3+2=5 replicas to
maintain
8
SPAR- Minimizing Replicas
8/27/2013
Muhammad Anis Uddin Nasir- Gossip-based Partitioning and
Replication Middle-ware
Server A Server B
6
3
5
2
1
4
76’
3’2’
5’
• Edge Cut = 4 links, 2+2=4 replicas to
maintain
9
Initialization
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
5
3
4
2
1
10
8
9
7
6
1 4 7 82 3 5 6 10 9
1’ 4’ 7’ 8’ 9’ 2’ 3’ 6’5’ 10’
• Node Addition
- Assign it to server with
minimum master
• Edge Addition
- Check if Nodes are Local
- Else create replicas to
maintain locality
10
Gossip Phase
• Cost Function
- Count number of replicas
- For current and new server
• Peer Selection
- Local, Random, Hybrid
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
5
3
4
2
1
10
8
9
7
6
1 4 7 82 3 5 6 10 9
1’ 4’ 7’ 8’ 9’ 5’ 10’
11
2’ 3’ 6’
Gossip Phase
• Cost Function
- Count number of replicas
- For existing and new server
• Peer Selection
- Local, Random, Hybrid
• Simulated Annealing
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
5
3
4
2
1
10
8
9
7
6
6 4 7 82 3 5 1 10 9
4’ 8’ 9’ 3’ 5’ 10’6’ 1’
4 10
12
Simulated Annealing
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 13
Algorithms
Algorithm Random SPAR JA-BE-JA Gossip-based
Data locality
Decentralized
Load Balancing
Fault tolerance
Avoiding Local
Optima
Availability
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 14
Datasets
Datasets Vertices Edges
Synth-C 2,000 20,000
Synth-HC 2,000 20,000
Synth-PL 2,000 20,000
SNAP-Facebook 4,039 88,234
WSON-Facebook 60,290 1,545,686
SNAP-Twitter 81,306 1,768,149
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 15
Evaluation- with datasets
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
0
2
4
6
8
10
12
Random
SPAR
JA-BE-JA
Gossip-based
ReplicationOverhead
>3x gain
compared to
Random
Partitioning
≈2x gain
compared to
SPAR
• Number of Servers =16, Replication factor=2
16
Evaluation- with replication factor
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
ReplicationOverhead
• Number of Servers =16
0
1
2
3
4
5
6
7
8
9
10
f=0
f=2
Random Graphs
generates maximum
replication overhead Real Graphs
generates minimum
replication
overhead
Data locality is
achieved by fault
tolerance replicas
17
Evaluation- with servers
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
ReplicationOverhead
• Replication factor =2
Number of Servers
WSON-Facebook
18
0
2
4
6
8
10
12
14
16
18
20
8 16 32 64
Random
SPAR
JA-BE-JA
Gossip-based
Gossip-based
generates minimum
replication
overhead
Replication
overhead
increases non
linearly
>4x gain
compared to
Random
Partitioning
0
2
4
6
8
10
12
14
16
18
20
8 16 32 64
Gossip-based
Evaluation- dynamicity
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware
• Number of Servers =16, Replication factor=2
0.2
0.25
0.3
0.35
0.4
0.45
0
50
100
150
200
250
300
350
400
450
500
550
600
650
700
750
800
850
900
0.2
0.25
0.3
0.35
0.4
0.45
0
50
100
150
200
250
300
350
400
450
500
550
600
650
700
750
800
850
900
950
1000
SNAP-Twitter SNAP-Facebook
Number of cycles Number of cycles
ReplicationOverhead
ReplicationOverhead
Spikes show
bulk edge
addition
Algorithm
Stabilization
19
Transition state,
i.e., reducing the
number of replicas
after new edge
additions
Conclusion
• Random Partitioning does not provide efficient
solution of Online Social Networks
• Minimizing Replicas can help to achieve better
partitioning
• Gossip-based heuristic was proposed to solve the
minimization problem while achieving the global
optima
• Algorithm able to handle different datasets and
adjusts with dynamic nature of OSNs
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 20
Gossip-based Partitioning
and Replication
Middle-ware for
Online Social Networks
Muhammad Anis Uddin Nasir
(EMDC/ICT/LCN)
Supervisor: Šarūnas Girdzijauskas
Examiner: Johan Montelius
Future Work
• Execution of the algorithm with large datasets using
parallel graph processing frameworks like
GraphLab and Apache Girpah
• Load Balancing using both Master and Replicas and
providing different consistency levels
• Smart Replication to provide data locality for highly
interactive nodes
• Implement different consistency strategies based to
access patterns
8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 22

More Related Content

Viewers also liked

The Power of Both Choices: Practical Load Balancing for Distributed Stream Pr...
The Power of Both Choices: Practical Load Balancing for Distributed Stream Pr...The Power of Both Choices: Practical Load Balancing for Distributed Stream Pr...
The Power of Both Choices: Practical Load Balancing for Distributed Stream Pr...Anis Nasir
 
Gossipy2
Gossipy2Gossipy2
Gossipy2NBCC
 
World War Two
World War TwoWorld War Two
World War Twosystemed1
 
Software-Defined Networking , Survey of HotSDN 2012
Software-Defined Networking , Survey of HotSDN 2012Software-Defined Networking , Survey of HotSDN 2012
Software-Defined Networking , Survey of HotSDN 2012Jason TC HOU (侯宗成)
 
Workplace Gossip
Workplace GossipWorkplace Gossip
Workplace GossipYuGao
 
Software-Defined Networking(SDN):A New Approach to Networking
Software-Defined Networking(SDN):A New Approach to NetworkingSoftware-Defined Networking(SDN):A New Approach to Networking
Software-Defined Networking(SDN):A New Approach to NetworkingAnju Ann
 
Good and Bad Power Point Examples Ed Tech
Good and Bad Power Point Examples Ed TechGood and Bad Power Point Examples Ed Tech
Good and Bad Power Point Examples Ed TechLynnylu
 

Viewers also liked (14)

The Power of Both Choices: Practical Load Balancing for Distributed Stream Pr...
The Power of Both Choices: Practical Load Balancing for Distributed Stream Pr...The Power of Both Choices: Practical Load Balancing for Distributed Stream Pr...
The Power of Both Choices: Practical Load Balancing for Distributed Stream Pr...
 
Gossipy2
Gossipy2Gossipy2
Gossipy2
 
World War Two
World War TwoWorld War Two
World War Two
 
Gossiping
GossipingGossiping
Gossiping
 
Software-Defined Networking , Survey of HotSDN 2012
Software-Defined Networking , Survey of HotSDN 2012Software-Defined Networking , Survey of HotSDN 2012
Software-Defined Networking , Survey of HotSDN 2012
 
Gossip
GossipGossip
Gossip
 
Workplace Gossip
Workplace GossipWorkplace Gossip
Workplace Gossip
 
Gossip
GossipGossip
Gossip
 
How To Deal With Gossip
How To Deal With GossipHow To Deal With Gossip
How To Deal With Gossip
 
Gossip & Rumor In the Workplace
Gossip & Rumor In the WorkplaceGossip & Rumor In the Workplace
Gossip & Rumor In the Workplace
 
Men V/S Women
Men V/S WomenMen V/S Women
Men V/S Women
 
Software-Defined Networking(SDN):A New Approach to Networking
Software-Defined Networking(SDN):A New Approach to NetworkingSoftware-Defined Networking(SDN):A New Approach to Networking
Software-Defined Networking(SDN):A New Approach to Networking
 
Good and Bad Power Point Examples Ed Tech
Good and Bad Power Point Examples Ed TechGood and Bad Power Point Examples Ed Tech
Good and Bad Power Point Examples Ed Tech
 
STEAL THIS PRESENTATION!
STEAL THIS PRESENTATION! STEAL THIS PRESENTATION!
STEAL THIS PRESENTATION!
 

Similar to Gossip based partitioning and replication for Online Social Networks

Getting started with Cassandra 2.1
Getting started with Cassandra 2.1Getting started with Cassandra 2.1
Getting started with Cassandra 2.1Viswanath J
 
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014Johnny Miller
 
Using Riak for Events storage and analysis at Booking.com
Using Riak for Events storage and analysis at Booking.comUsing Riak for Events storage and analysis at Booking.com
Using Riak for Events storage and analysis at Booking.comDamien Krotkine
 
Global Azure Virtual 2020 What's new on Azure IaaS for SQL VMs
Global Azure Virtual 2020 What's new on Azure IaaS for SQL VMsGlobal Azure Virtual 2020 What's new on Azure IaaS for SQL VMs
Global Azure Virtual 2020 What's new on Azure IaaS for SQL VMsMarco Obinu
 
Data Science Lab Meetup: Cassandra and Spark
Data Science Lab Meetup: Cassandra and SparkData Science Lab Meetup: Cassandra and Spark
Data Science Lab Meetup: Cassandra and SparkChristopher Batey
 
Sharding Methods for MongoDB
Sharding Methods for MongoDBSharding Methods for MongoDB
Sharding Methods for MongoDBMongoDB
 
Lawrence Livermore Labs talk 2011
Lawrence Livermore Labs talk 2011Lawrence Livermore Labs talk 2011
Lawrence Livermore Labs talk 2011MapR Technologies
 
Agility and Scalability with MongoDB
Agility and Scalability with MongoDBAgility and Scalability with MongoDB
Agility and Scalability with MongoDBMongoDB
 
D108636GC10_les01.pptx
D108636GC10_les01.pptxD108636GC10_les01.pptx
D108636GC10_les01.pptxSuresh569521
 
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)Alex Rasmussen
 
10 Ways to Scale with Redis - LA Redis Meetup 2019
10 Ways to Scale with Redis - LA Redis Meetup 201910 Ways to Scale with Redis - LA Redis Meetup 2019
10 Ways to Scale with Redis - LA Redis Meetup 2019Dave Nielsen
 
Scaling with sync_replication using Galera and EC2
Scaling with sync_replication using Galera and EC2Scaling with sync_replication using Galera and EC2
Scaling with sync_replication using Galera and EC2Marco Tusa
 
impervious cover
impervious coverimpervious cover
impervious coverJames Yang
 
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...Data Con LA
 
Highly available, scalable and secure data with Cassandra and DataStax Enterp...
Highly available, scalable and secure data with Cassandra and DataStax Enterp...Highly available, scalable and secure data with Cassandra and DataStax Enterp...
Highly available, scalable and secure data with Cassandra and DataStax Enterp...Johnny Miller
 
lastfm contentdashboards project description
lastfm contentdashboards project descriptionlastfm contentdashboards project description
lastfm contentdashboards project descriptionGaurav Bhardwaj
 
Shaping the Future of Travel with MongoDB
Shaping the Future of Travel with MongoDBShaping the Future of Travel with MongoDB
Shaping the Future of Travel with MongoDBMongoDB
 

Similar to Gossip based partitioning and replication for Online Social Networks (20)

Getting started with Cassandra 2.1
Getting started with Cassandra 2.1Getting started with Cassandra 2.1
Getting started with Cassandra 2.1
 
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
Apache Cassandra For Java Developers - Why, What and How. LJC @ UCL October 2014
 
Geode - Day 1
Geode - Day 1Geode - Day 1
Geode - Day 1
 
MySQL vs. MonetDB
MySQL vs. MonetDBMySQL vs. MonetDB
MySQL vs. MonetDB
 
Using Riak for Events storage and analysis at Booking.com
Using Riak for Events storage and analysis at Booking.comUsing Riak for Events storage and analysis at Booking.com
Using Riak for Events storage and analysis at Booking.com
 
Global Azure Virtual 2020 What's new on Azure IaaS for SQL VMs
Global Azure Virtual 2020 What's new on Azure IaaS for SQL VMsGlobal Azure Virtual 2020 What's new on Azure IaaS for SQL VMs
Global Azure Virtual 2020 What's new on Azure IaaS for SQL VMs
 
EDNS0 Client-Subnet for DNS Based CDNs
EDNS0 Client-Subnet for DNS Based CDNs EDNS0 Client-Subnet for DNS Based CDNs
EDNS0 Client-Subnet for DNS Based CDNs
 
Data Science Lab Meetup: Cassandra and Spark
Data Science Lab Meetup: Cassandra and SparkData Science Lab Meetup: Cassandra and Spark
Data Science Lab Meetup: Cassandra and Spark
 
Sharding Methods for MongoDB
Sharding Methods for MongoDBSharding Methods for MongoDB
Sharding Methods for MongoDB
 
Lawrence Livermore Labs talk 2011
Lawrence Livermore Labs talk 2011Lawrence Livermore Labs talk 2011
Lawrence Livermore Labs talk 2011
 
Agility and Scalability with MongoDB
Agility and Scalability with MongoDBAgility and Scalability with MongoDB
Agility and Scalability with MongoDB
 
D108636GC10_les01.pptx
D108636GC10_les01.pptxD108636GC10_les01.pptx
D108636GC10_les01.pptx
 
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)
TritonSort: A Balanced Large-Scale Sorting System (NSDI 2011)
 
10 Ways to Scale with Redis - LA Redis Meetup 2019
10 Ways to Scale with Redis - LA Redis Meetup 201910 Ways to Scale with Redis - LA Redis Meetup 2019
10 Ways to Scale with Redis - LA Redis Meetup 2019
 
Scaling with sync_replication using Galera and EC2
Scaling with sync_replication using Galera and EC2Scaling with sync_replication using Galera and EC2
Scaling with sync_replication using Galera and EC2
 
impervious cover
impervious coverimpervious cover
impervious cover
 
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...
Big Data Day LA 2016/ NoSQL track - Analytics at the Speed of Light with Redi...
 
Highly available, scalable and secure data with Cassandra and DataStax Enterp...
Highly available, scalable and secure data with Cassandra and DataStax Enterp...Highly available, scalable and secure data with Cassandra and DataStax Enterp...
Highly available, scalable and secure data with Cassandra and DataStax Enterp...
 
lastfm contentdashboards project description
lastfm contentdashboards project descriptionlastfm contentdashboards project description
lastfm contentdashboards project description
 
Shaping the Future of Travel with MongoDB
Shaping the Future of Travel with MongoDBShaping the Future of Travel with MongoDB
Shaping the Future of Travel with MongoDB
 

Recently uploaded

microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024Janet Corral
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 

Recently uploaded (20)

microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 

Gossip based partitioning and replication for Online Social Networks

  • 1. Gossip-based Partitioning and Replication Middle-ware for Online Social Networks Muhammad Anis Uddin Nasir (EMDC/ICT/LCN) Supervisor: Šarūnas Girdzijauskas Examiner: Johan Montelius
  • 2. Online Social Networks 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware •Vertices •Edges •Metadata Ioanna Antonio Vaidas Aras Vasia Anis Mudit Manos 2 LeandroJohan
  • 3. Existing Solutions • Relational Databases - MySQL Cluster • Key-Value stores - Cassandra, Amazon Dynamo • Document Databases - MongoDB, CouchDB • Graph Databases - Neo4j, Titans 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 3
  • 4. Why Existing Solutions are not enough? 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 5 3 4 2 1 10 8 9 7 6 4
  • 5. Why Existing Solutions are not enough? • Random Partitioning • Social Request - E.g., gather new feeds from all the friends • Enforcing Data Locality • Random partitioning can lead to full replication! 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 5 3 4 2 1 10 8 9 7 6 1 4 7 82 3 5 6 10 9 1’ 4’ 7’ 8’ 9’ 2’ 3’ 6’5’ 10’ 5
  • 6. Social Graphs are not Random 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 6
  • 7. Graph Partitioning 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 7
  • 8. JA-BE-JA- edge-cut 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware Server A Server B 6 3 5 2 1 4 76’ 3’ 1’ 4’ 7’ • Edge Cut = 3 links, 3+2=5 replicas to maintain 8
  • 9. SPAR- Minimizing Replicas 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware Server A Server B 6 3 5 2 1 4 76’ 3’2’ 5’ • Edge Cut = 4 links, 2+2=4 replicas to maintain 9
  • 10. Initialization 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 5 3 4 2 1 10 8 9 7 6 1 4 7 82 3 5 6 10 9 1’ 4’ 7’ 8’ 9’ 2’ 3’ 6’5’ 10’ • Node Addition - Assign it to server with minimum master • Edge Addition - Check if Nodes are Local - Else create replicas to maintain locality 10
  • 11. Gossip Phase • Cost Function - Count number of replicas - For current and new server • Peer Selection - Local, Random, Hybrid 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 5 3 4 2 1 10 8 9 7 6 1 4 7 82 3 5 6 10 9 1’ 4’ 7’ 8’ 9’ 5’ 10’ 11 2’ 3’ 6’
  • 12. Gossip Phase • Cost Function - Count number of replicas - For existing and new server • Peer Selection - Local, Random, Hybrid • Simulated Annealing 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 5 3 4 2 1 10 8 9 7 6 6 4 7 82 3 5 1 10 9 4’ 8’ 9’ 3’ 5’ 10’6’ 1’ 4 10 12
  • 13. Simulated Annealing 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 13
  • 14. Algorithms Algorithm Random SPAR JA-BE-JA Gossip-based Data locality Decentralized Load Balancing Fault tolerance Avoiding Local Optima Availability 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 14
  • 15. Datasets Datasets Vertices Edges Synth-C 2,000 20,000 Synth-HC 2,000 20,000 Synth-PL 2,000 20,000 SNAP-Facebook 4,039 88,234 WSON-Facebook 60,290 1,545,686 SNAP-Twitter 81,306 1,768,149 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 15
  • 16. Evaluation- with datasets 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 0 2 4 6 8 10 12 Random SPAR JA-BE-JA Gossip-based ReplicationOverhead >3x gain compared to Random Partitioning ≈2x gain compared to SPAR • Number of Servers =16, Replication factor=2 16
  • 17. Evaluation- with replication factor 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware ReplicationOverhead • Number of Servers =16 0 1 2 3 4 5 6 7 8 9 10 f=0 f=2 Random Graphs generates maximum replication overhead Real Graphs generates minimum replication overhead Data locality is achieved by fault tolerance replicas 17
  • 18. Evaluation- with servers 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware ReplicationOverhead • Replication factor =2 Number of Servers WSON-Facebook 18 0 2 4 6 8 10 12 14 16 18 20 8 16 32 64 Random SPAR JA-BE-JA Gossip-based Gossip-based generates minimum replication overhead Replication overhead increases non linearly >4x gain compared to Random Partitioning 0 2 4 6 8 10 12 14 16 18 20 8 16 32 64 Gossip-based
  • 19. Evaluation- dynamicity 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware • Number of Servers =16, Replication factor=2 0.2 0.25 0.3 0.35 0.4 0.45 0 50 100 150 200 250 300 350 400 450 500 550 600 650 700 750 800 850 900 0.2 0.25 0.3 0.35 0.4 0.45 0 50 100 150 200 250 300 350 400 450 500 550 600 650 700 750 800 850 900 950 1000 SNAP-Twitter SNAP-Facebook Number of cycles Number of cycles ReplicationOverhead ReplicationOverhead Spikes show bulk edge addition Algorithm Stabilization 19 Transition state, i.e., reducing the number of replicas after new edge additions
  • 20. Conclusion • Random Partitioning does not provide efficient solution of Online Social Networks • Minimizing Replicas can help to achieve better partitioning • Gossip-based heuristic was proposed to solve the minimization problem while achieving the global optima • Algorithm able to handle different datasets and adjusts with dynamic nature of OSNs 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 20
  • 21. Gossip-based Partitioning and Replication Middle-ware for Online Social Networks Muhammad Anis Uddin Nasir (EMDC/ICT/LCN) Supervisor: Šarūnas Girdzijauskas Examiner: Johan Montelius
  • 22. Future Work • Execution of the algorithm with large datasets using parallel graph processing frameworks like GraphLab and Apache Girpah • Load Balancing using both Master and Replicas and providing different consistency levels • Smart Replication to provide data locality for highly interactive nodes • Implement different consistency strategies based to access patterns 8/27/2013 Muhammad Anis Uddin Nasir- Gossip-based Partitioning and Replication Middle-ware 22