PUNCH: Partitioning Using Natural-Cut Heuristics

PUNCH
Partitioning Using Natural-Cut Heuristics

Daniel Delling (Microsoft Research)
Andrew V. Goldberg (Microsoft Research)
Ilya Razenshteyn (Moscow University)
Renato F. Werneck (Microsoft Research)

May 19, 2010

Motivation

Goal: process a continental-sized road network in parallel
(Europe: 18M nodes and 43M arcs).
The ﬁrst natural step: divide it into “small” parts with few
arcs between them.
Partition problems are NP-hard, but routinely solved using
diﬀerent heuristics.

Applications: routing on road networks
Idea: Precompute distances between boundary nodes of each cell.

Overlay Graph:
Nodes — boundary nodes
Edges between boundary nodes,

Search Graph:
t
Source and target cell,
s
Overlay graph,
Use bidirectional Dijkstra
Number of cut edges aﬀects the performance heavily.
More applications: arc-ﬂags and reach.

Existing solvers

METIS [KK’99], SCOTCH [PR’96], KAPPA [HSS’10],
KASPAR [OS’10], KAFFPA [SS’10]. General purpose, some
are fast, some produce very good solution.
There are many more . . .
Our goal: partitioner tailored to road networks,
emphasize quality, still fast enough in practice.

Formal deﬁnition

Input: undirected graph G = (V , E ).
Result: partition (V = V1 ∪ V2 ∪ . . . ∪ Vk , with Vi ∩ Vj = ∅).
Goal: minimize number of edges between Vi .
Two common variants:
given U, require |Vi | ≤ U for every i,
given k and , require |Vi | ≤ (1 + ) n/k .
We focus on the ﬁrst one.
Rebalancing is possible.

Intuition

Road networks: dense regions (grids, cities) interleaved with
natural cuts (mountains, parks, rivers, deserts, sparse areas,
freeways).

Summary of the algorithm

Filtering:
Contracts dense regions,
Reduces graph size,
Preserves natural cuts structure.
Assembly phase:
Works with much smaller graph,
Finds actual partition.

Outline of the talk

Introduction

Natural cuts

Assembly phase

Experiments

Conclusion

Natural cuts

Sparse sets that
separate dense areas.
Minimum s–t cuts are
trivial (average degree
< 3).
Sparsest cuts would be
OK, but they are
intractable.
Our notion of natural
cut is both tractable and
useful.

Natural cuts

Pick centers in a
randomized manner.
v Compute minimum cut
between the core and the
ring.
Repeat until every node
is inside of at least two
cores.

Natural cuts

Pick centers in a
randomized manner.
U/10 nodes ring.
cores.

Natural cuts

Pick centers in a
randomized manner.
U/10 nodes ring.
cores.
U nodes

Natural cuts
Take a union of all natural cuts found and contract everything
between them.
The resulting graph is much smaller than the original one.
U = 106 — 18M nodes to 10K nodes
U = 103 — 18M nodes to 1.3M nodes

Tiny cuts

The most obvious natural cuts — 1-cuts and 2-cuts.
We handle them explicitly before processing natural cuts.
Greatly decreases graph size (by half) and overall running
time,

Assembly phase

Three ingredients:
Greedy algorithm,
Local search,
Multistart and combination heuristics (optional).

Greedy algorithm

We combine well-connected small fragments in a randomized
fashion.
Repeat until maximal.
Finds initial partition.

The local search

Pick two neighboring cells, disassemble them, apply greedy
algorithm to the subproblem.
Repeat several times for every pair of neighboring cells.

Multistart and combination heuristics

Since the local search is typically much faster than the natural cuts
detection, we can use the following two heuristics:
Multistart: since the local search is randomized, we can
repeat it several times.
Combination: keep track of several solutions, and combine
them from time to time.

Experimental evaluation

C++/OpenMP
Tested on Western Europe map (18M nodes, 43M arcs).
Machine: Intel Xeon X5680 (two six-core 3.33GHz CPUs)
with DDR3-1333MHz RAM.

A typical use-case

Europe, U = 64K .
Tiny cuts contraction: 25 seconds (18M nodes to 9M nodes).
Natural cuts identiﬁcation: 50 seconds (12 cores, 9M nodes to
100K nodes).
Greedy + local search: only 5 seconds (12 cores).

Running times on Europe

Tiny cuts
200

Natural cuts
Greedy + Local search
180

q
160
140
120

q
Time (s)

100
80

q
60

q
40

q

q
q q q q q q q
20

q
q
q
q
q
q q q
0

210 212 214 216 218 220 222

maximum cell size

Inﬂuence of ϕ
The local search tries every edge ϕ times.
15000

Dependence on phi

q
14000
13000
cut size

12000

q

q
q
q
q
q
q
q
11000

q q
10000

0.1 1 10 100 1000 10000

time (s)

Multistart and combination
Combination helps!
15000

Dependence on phi
Multistart
q
Combination
14000
13000
cut size

12000

q

q
q
q
q
q q q
q q
q
11000

q q
q
q
q
10000

0.1 1 10 100 1000 10000

time (s)

Balanced partitions

Recall that there are two variants of requirements on |Vi |:
given U, require |Vi | ≤ U for every i,
given k and , require |Vi | ≤ (1 + ) n/k .
PUNCH solves the ﬁrst, but most existing solvers ﬁnd
-balanced partitions.
Rebalancing:
Run PUNCH with U = (1 + ) n/k ,
If there are too many regions, redistribute them.

Balanced partitions: Europe, = 0.03 (quality and time)

Cut size
K PUNCH KAFFPA KASPAR KAPPA SCOTCH METIS
2 129 130 133 — 469 —
4 309 412 355 543 952 846
8 634 749 774 986 1667 1675
16 1293 1454 1401 1760 2922 3519
32 2289 2428 2595 3186 4336 7424
64 3828 4240 4502 5290 6772 11313

Time
K PUNCH KAFFPA KASPAR KAPPA SCOTCH METIS
2 255 1013 1946 — 12 —
4 215 1823 2168 441 25 29
8 176 2067 2232 418 39 29
16 151 2340 2553 498 52 31
32 130 2445 2599 418 65 31
64 145 2533 2534 308 77 30

Concluding remarks

PUNCH is good for partitioning road networks.
It doesn’t work as well on instances without natural cuts.

PUNCH: Partitioning Using Natural-Cut Heuristics

Recomendados

Recomendados

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Similar a PUNCH: Partitioning Using Natural-Cut Heuristics

Similar a PUNCH: Partitioning Using Natural-Cut Heuristics (20)

Último

Último (20)

PUNCH: Partitioning Using Natural-Cut Heuristics