
Daniel A. Spielman

MIT

Fast, Randomized Algorithms for Partitioning, Sparsification, and the Solution of Linear Systems

Joint work with Shang-Hua Teng (Boston University)

Papers

Nearly-Linear Time Algorithms for Graph Partitioning, Graph Sparsification, and Solving Linear Systems (S-Teng '04)

The Mixing Rate of Markov Chains, an Isoperimetric Inequality, and Computing the Volume (Lovász-Simonovits '93)

The Eigenvalues of Random Symmetric Matrices (Füredi-Komlós '81)

Overview

Preconditioning to solve linear systems: find B that approximates A, such that solving By = c is easy.

Three techniques:

Augmented spanning trees [Vaidya '90]

Sparsification: approximate the graph by a sparse graph

Partitioning: runtime proportional to the number of nodes removed

Outline

I. Linear system solvers

II. Sparsification using partitioning and random sampling

III. Graph partitioning by truncated random walks

Tools: eigenvalues of random graphs; combinatorial and spectral graph theory.

Diagonally Dominant Matrices

Solve Ax = b, where each diagonal entry of A is at least the sum of the absolute values of the other entries in its row.

Example: Laplacian matrices of weighted graphs: A(i,j) = negative of the weight from i to j, and each diagonal A(i,i) = the weighted degree of node i.
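As a concrete illustration (mine, not from the talk), a minimal sketch of building a weighted-graph Laplacian and checking diagonal dominance; the graph and helper name are made up for the example:

```python
import numpy as np

def laplacian(n, weighted_edges):
    """Laplacian of a weighted graph: L[i][j] = -w(i,j) for an edge,
    L[i][i] = weighted degree of node i."""
    L = np.zeros((n, n))
    for i, j, w in weighted_edges:
        L[i, j] -= w
        L[j, i] -= w
        L[i, i] += w
        L[j, j] += w
    return L

L = laplacian(3, [(0, 1, 2.0), (1, 2, 1.0)])
# Diagonally dominant: each diagonal is at least the sum of the
# absolute values of the other entries in its row (equality here).
for i in range(3):
    assert L[i, i] >= sum(abs(L[i, j]) for j in range(3) if j != i)
```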

Complexity of Solving Ax = b, A positive semi-definite

General direct methods:

Gaussian elimination (Cholesky): O(n^3)

Fast matrix inversion: O(n^2.376)

Conjugate gradient: O(mn)

n is the dimension, m is the number of non-zeros.

Complexity of Solving Ax = b, A positive semi-definite, structured

Path: forward and backward elimination, O(n)

Tree: like the path, working up from the leaves, O(n)

Planar: nested dissection [Lipton-Rose-Tarjan '79], O(n^1.5)

These methods express A = L L^T.
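To make the path case concrete, a sketch of forward and backward elimination on a tridiagonal (path-structured) system; the helper name is illustrative, and we assume the system is nonsingular (e.g., a Dirichlet problem):

```python
import numpy as np

def solve_tridiagonal(d, off, b):
    """Solve a tridiagonal system in O(n): row i reads
    off[i-1]*x[i-1] + d[i]*x[i] + off[i]*x[i+1] = b[i]."""
    n = len(d)
    d, b = d.astype(float).copy(), b.astype(float).copy()
    for i in range(1, n):                 # forward elimination
        w = off[i - 1] / d[i - 1]
        d[i] -= w * off[i - 1]
        b[i] -= w * b[i - 1]
    x = np.zeros(n)
    x[-1] = b[-1] / d[-1]
    for i in range(n - 2, -1, -1):        # backward substitution
        x[i] = (b[i] - off[i] * x[i + 1]) / d[i]
    return x

# Dirichlet Laplacian of a path: diagonal 2, off-diagonals -1.
n = 5
x = solve_tridiagonal(2 * np.ones(n), -np.ones(n - 1), np.ones(n))
```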

Iterative Methods: Preconditioned Conjugate Gradient

Find an easy B that approximates A. Iteratively solve Ax = b in time about sqrt(kappa(A,B)) * (m + time to solve By = c) * log(1/eps), where kappa(A,B), the relative condition number, measures the quality of the approximation.

(We omit the conjugate gradient details for the rest of the talk.)
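A minimal sketch of the standard preconditioned conjugate gradient loop (not the talk's specific solver); solve_B plays the role of solving By = c, and the Jacobi preconditioner below is only a stand-in for a subgraph preconditioner:

```python
import numpy as np

def pcg(A, b, solve_B, tol=1e-8, max_iter=1000):
    """Solve Ax = b, where solve_B(r) applies the preconditioner B^{-1}."""
    x = np.zeros_like(b)
    r = b - A @ x
    z = solve_B(r)
    p = z.copy()
    rz = r @ z
    for _ in range(max_iter):
        Ap = A @ p
        alpha = rz / (p @ Ap)
        x = x + alpha * p
        r = r - alpha * Ap
        if np.linalg.norm(r) <= tol * np.linalg.norm(b):
            break
        z = solve_B(r)
        rz_new = r @ z
        p = z + (rz_new / rz) * p
        rz = rz_new
    return x

A = np.array([[3.0, -1, -1], [-1, 3, -1], [-1, -1, 3]])
x = pcg(A, np.array([1.0, 2, 3]), lambda r: r / np.diag(A))
```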

Main Result

For symmetric, diagonally dominant A

General: m log^{O(1)} n log(1/eps)

Planar:

Vaidya's Subgraph Preconditioners

Precondition A by the Laplacian of a subgraph, B.

[Figure: a graph A on vertices 1-4 and its subgraph B]

Vaidya '90: relate kappa(A,B) to a graph embedding (congestion/dilation); use the MST, or an augmented MST.

History

Gremban, Miller ‘96 Steiner vertices, many details

Bern, Boman, Chen, Gilbert, Hendrickson, Nguyen, Toledo All the details, algebraic approach

Joshi ’97, Reif ‘98 Recursive, multi-level approach

planar:

planar:

Maggs, Miller, Parekh, Ravi, Woo ‘02after preprocessing

History

Boman, Hendrickson '01: low-stretch spanning trees

S-Teng '03: augmented low-stretch trees; clustering, partitioning, sparsification, recursion; lower-stretch spanning trees

Bounding kappa(A,B)

Write B ≼ A if A - B is positive semi-definite, that is, x^T B x <= x^T A x for all x; and A ≽ 0 if A is positive semi-definite.

For A the Laplacian of a graph with edges E: x^T A x = the sum over (i,j) in E of w_ij (x_i - x_j)^2.

The relative condition number kappa(A,B) is the ratio of the largest to the smallest generalized eigenvalue of the pair (A, B).

For B a subgraph of A, we have B ≼ A, and so only the largest generalized eigenvalue remains to be bounded.

Fundamental Inequality

[Figure: the edge (1,8) routed along the path 1-2-3-4-5-6-7-8]

(x_1 - x_8)^2 <= 7 [(x_1 - x_2)^2 + (x_2 - x_3)^2 + ... + (x_7 - x_8)^2]

In general, if an edge (u,v) is routed along a path of length k, then (x_u - x_v)^2 <= k times the sum of (x_i - x_j)^2 over the path edges, by Cauchy-Schwarz.

Application to eigenvalues of graphs

lambda_1 = 0 for Laplacian matrices, and lambda_2 = the minimum of x^T A x / x^T x over x orthogonal to the all-ones vector.

Example: for the complete graph on n nodes, all non-zero eigenvalues equal n.

For the path, the test vector x = (-7, -5, -3, -1, 1, 3, 5, 7) (here n = 8) shows lambda_2 = O(1/n^2).

Lower bound on lambda_2 [Guattery-Leighton-Miller '97]: route every edge of the complete graph K_n along the path. The fundamental inequality gives x^T L_{K_n} x <= O(n^3) x^T L_{path} x, and since x^T L_{K_n} x = n x^T x for x orthogonal to the all-ones vector, lambda_2(path) >= Omega(1/n^2).
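A quick numerical check (mine, not the talk's) that lambda_2 of the path is of order 1/n^2:

```python
import numpy as np

n = 64
# Laplacian of the path on n vertices.
L = 2 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
L[0, 0] = L[-1, -1] = 1
lam2 = np.sort(np.linalg.eigvalsh(L))[1]
# Exact value is 2(1 - cos(pi/n)) ~ pi^2/n^2, matching the Theta(1/n^2) bounds.
print(lam2, np.pi**2 / n**2)
```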

Preconditioning with a Spanning Tree B

[Figure: a graph A and a spanning tree B]

Every edge of A not in B has a unique path in B.

When B is a tree, the fundamental inequality bounds the largest generalized eigenvalue by the total stretch of the edges of A over B.

Low-Stretch Spanning Trees

The stretch of an edge (u,v) is the length of the tree path from u to v (relative to the edge's own length); the total stretch is the sum over all edges of A.

Theorem (Alon-Karp-Peleg-West '91): every graph has a spanning tree of total stretch m * 2^{O(sqrt(log n log log n))}.

Theorem (Elkin-Emek-S-Teng '04): every graph has a spanning tree of total stretch O(m log^2 n log log n).

Theorem (Boman-Hendrickson '01): preconditioning with a spanning tree B of total stretch sigma gives kappa(A,B) <= sigma.

Vaidya's Augmented Spanning Trees

B: a spanning tree plus s edges in total.

Adding Edges to a Tree

Partition the tree into t sub-trees, balancing stretch. For sub-trees connected in A, add one such bridge edge, carefully (shown in blue in the figure).

Theorem: this bounds kappa(A,B) in general, with a better bound if A is planar; the bound improves as the number t of sub-trees grows.
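To make "stretch" concrete, a small sketch (my example, unweighted) computing the total stretch of a spanning tree:

```python
from collections import deque

def tree_path_length(tree_adj, u, v):
    """BFS distance from u to v in the tree (unweighted)."""
    dist = {u: 0}
    q = deque([u])
    while q:
        x = q.popleft()
        if x == v:
            return dist[x]
        for y in tree_adj[x]:
            if y not in dist:
                dist[y] = dist[x] + 1
                q.append(y)
    raise ValueError("v not reachable from u")

def total_stretch(n, edges, tree_edges):
    """Sum, over all graph edges, of their path length in the spanning tree."""
    tree_adj = {i: [] for i in range(n)}
    for u, v in tree_edges:
        tree_adj[u].append(v)
        tree_adj[v].append(u)
    return sum(tree_path_length(tree_adj, u, v) for u, v in edges)

# Cycle on 6 vertices; tree = the path 0-1-2-3-4-5. Edge (5,0) has stretch 5.
edges = [(i, (i + 1) % 6) for i in range(6)]
print(total_stretch(6, edges, edges[:-1]))   # 5 tree edges of stretch 1, plus 5
```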

Sparsification (Feder-Motwani '91, Benczur-Karger '96)

All graphs can be well-approximated by a sparse graph.

See also: D. Eppstein, Z. Galil, G.F. Italiano, and T. Spencer '93; D. Eppstein, Z. Galil, G.F. Italiano, and A. Nissenzweig '97.

Benczur-Karger '96: can find a sparse subgraph H, with O(n log n / eps^2) edges, such that every cut in H has weight within a (1 ± eps) factor of its weight in G.

We need the stronger spectral approximation x^T L_H x ≈ x^T L_G x for all x. (The edges of H are a subset of those of G, but with different weights.)

Example: Complete Graph

If A is the Laplacian of K_n, all non-zero eigenvalues are n.

If B is the Laplacian of a d-regular Ramanujan expander, all non-zero eigenvalues satisfy d - 2 sqrt(d-1) <= lambda <= d + 2 sqrt(d-1).

And so (n/d) B approximates A well: kappa(A, (n/d)B) <= (d + 2 sqrt(d-1)) / (d - 2 sqrt(d-1)).

Example: Dumbbell

A: two copies of K_n joined by a single middle edge.

If B does not contain the middle edge, the approximation fails: take x constant on each copy, with different constants; then x^T B x = 0 but x^T A x > 0. So the middle edge must be kept.

Example: Grid plus edge

[Figure: a grid of unit-weight edges plus one heavy edge of weight k; the relevant quantities compare (m-1)^2 + k(m-1) against (m-1)^2]

Morals: random sampling alone is not sufficient, and cut approximation is not sufficient.

Conductance

A cut is a partition of the vertices into (S, V - S).

Conductance of S: Phi(S) = (weight of the edges leaving S) / min(vol(S), vol(V - S)), where vol(S) is the sum of the degrees of the vertices in S.

Conductance of G: Phi_G = the minimum of Phi(S) over all S.

[Figure: cuts of low and high conductance]
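A direct transcription of the definition (a sketch; the adjacency is given as a dense weight matrix):

```python
import numpy as np

def conductance(W, S):
    """Phi(S) = weight of edges leaving S / min(vol(S), vol(complement))."""
    n = W.shape[0]
    in_S = np.zeros(n, dtype=bool)
    in_S[list(S)] = True
    vol_S = W[in_S].sum()            # sum of degrees inside S
    vol_rest = W[~in_S].sum()
    cut = W[in_S][:, ~in_S].sum()    # weight crossing the cut
    return cut / min(vol_S, vol_rest)

# Dumbbell-like example: two triangles joined by one edge.
W = np.zeros((6, 6))
for i, j in [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]:
    W[i, j] = W[j, i] = 1.0
print(conductance(W, {0, 1, 2}))     # 1/7: one crossing edge, volume 7
```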

Conductance and Sparsification

If the conductance is high (an expander): can precondition by random sampling.

If the conductance is low: can partition the graph by removing few edges.

Decomposition: a partition of the vertex set that removes few edges, such that the graph on each part has high conductance.

Graph Decomposition

Lemma: there exists a partition of the vertices V = V_1 ∪ ... ∪ V_k such that each V_i has large conductance, and at most half the edges cross the partition.

Alg: sample the edges inside each part; recurse on the crossing edges.

Graph Decomposition Exists

Proof: let S be the largest set of small conductance with at most half the volume. If no such S exists, the whole graph already has high conductance, and we are done.

[Figure: the set S and its complement V - S]

Otherwise, remove the edges crossing (S, V - S). If S is small, only recurse in S. If S is big, then V - S is not too big. Either way, the recursion depth is bounded.

Sparsification from Graph Decomposition

Alg: sample the edges inside each part; recurse on the crossing edges.

Theorem: there exists B with n log^{O(1)} n edges.

Remaining problem: we need to find the partition in nearly-linear time.
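A toy sketch of the decompose-sample-recurse scheme. The partitioner here is plain spectral bisection, a stand-in for the talk's nearly-linear-time partitioner; the sampling rule is the degree-based one described in Part II; everything is illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_within(W, k):
    """Keep edge (i,j) with prob p = min(1, k/min(d_i, d_j)); scale kept weights by 1/p."""
    deg = W.sum(axis=1)
    H = np.zeros_like(W)
    n = W.shape[0]
    for i in range(n):
        for j in range(i + 1, n):
            if W[i, j] > 0:
                p = min(1.0, k / min(deg[i], deg[j]))
                if rng.random() < p:
                    H[i, j] = H[j, i] = W[i, j] / p
    return H

def spectral_split(W):
    """Stand-in partitioner (NOT the talk's algorithm): sign of the Fiedler vector."""
    L = np.diag(W.sum(axis=1)) - W
    fiedler = np.linalg.eigh(L)[1][:, 1]
    return fiedler >= 0

def sparsify(W, k, depth=0):
    """Sample the edges inside the two parts; recurse on the crossing edges."""
    if depth > 20 or (W > 0).sum() <= 2 * k * W.shape[0]:   # already sparse
        return W.copy()
    side = spectral_split(W)
    inside = np.outer(side, side) | np.outer(~side, ~side)
    return sample_within(W * inside, k) + sparsify(W * ~inside, k, depth + 1)
```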

Sparsification

Thm: for all A, we can find such a B with n log^{O(1)} n edges.

Thm: given G, we can find a subgraph H that spectrally approximates G, with #edges(H) < n log^{O(1)} n, in time m log^{O(1)} n.

General solve time: m log^{O(1)} n log(1/eps).

Cheeger's Inequality [Sinclair-Jerrum '89]

For a Laplacian L, and D the diagonal matrix with D(i,i) = degree of node i:

lambda_2(D^{-1}L) / 2 <= Phi_G <= sqrt(2 lambda_2(D^{-1}L))

Useful if lambda_2 is big.

Random Sampling

Given a graph G, randomly sample to get G~, so that the difference of the degree-normalized Laplacians of G and G~ is small in norm, where D~ is the diagonal matrix of the degrees in G~. Useful if the conductance of G is big.

If that norm is small, then for all x orthogonal to the (mutual) nullspace, x^T L~ x ≈ x^T L x.

But we don't want D in there. D has no impact: a norm bound stated with D~ implies the same bound with D.

Work with the adjacency matrix instead. This makes no difference, because L = D - A, where A(i,j) = the weight of the edge from i to j, and 0 if i = j.

Random Sampling Rule

Choose a parameter k governing the sparsity of G~.

Keep edge (i,j) with probability p_ij = min(1, k / min(d_i, d_j)).

If we keep the edge, raise its weight by a factor 1/p_ij.

Reweighting guarantees E[A~] = A. And the choice of p_ij guarantees we expect at most kn edges in G~, since the sum over edges of k/min(d_i, d_j) is at most the sum over edges of k(1/d_i + 1/d_j) = kn.
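The rule in code, with an empirical check of both guarantees (a sketch; the exact probability in the paper may differ in constants, so treat p_ij here as illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_graph(W, k):
    """One sample: keep (i,j) with p = min(1, k/min(d_i, d_j)), weight scaled by 1/p."""
    deg = W.sum(axis=1)
    H = np.zeros_like(W)
    n = W.shape[0]
    for i in range(n):
        for j in range(i + 1, n):
            if W[i, j] > 0:
                p = min(1.0, k / min(deg[i], deg[j]))
                if rng.random() < p:
                    H[i, j] = H[j, i] = W[i, j] / p
    return H

# Random test graph; check E[H] = W entrywise and expected #edges <= k*n.
W = (rng.random((30, 30)) < 0.4).astype(float)
W = np.triu(W, 1) + np.triu(W, 1).T
samples = [sample_graph(W, k=3) for _ in range(2000)]
print(np.abs(np.mean(samples, axis=0) - W).max())      # close to 0
print(np.mean([(H > 0).sum() / 2 for H in samples]))   # at most 3 * 30
```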

Random Sampling Theorem

Theorem: with high probability, the sampled A~ is close to A in norm; the bound is useful for graphs of high conductance.

From now on, all edges have weight 1: we will prove the unweighted case.

Analysis by Trace

Trace = sum of the diagonal entries = sum of the eigenvalues.

For even k: ||M||^k = max_i |lambda_i|^k <= sum_i lambda_i^k = Tr(M^k).
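A one-off numerical check (mine) of the norm-trace inequality for even k:

```python
import numpy as np

rng = np.random.default_rng(1)
M = rng.standard_normal((50, 50))
M = (M + M.T) / 2                        # symmetric, like A - A~
k = 8                                    # even
# ||M||^k = max |eig|^k <= sum of eig^k = Tr(M^k), since eig^k >= 0 for even k.
assert np.linalg.norm(M, 2) ** k <= np.trace(np.linalg.matrix_power(M, k)) + 1e-6
```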

Analysis by Trace

Main Lemma: a bound on the expected trace E[Tr((A - A~)^k)].

Proof of Theorem: apply Markov's inequality to Tr((A - A~)^k), then take the k-th root.

Expected Trace

Tr((A - A~)^k) expands as a sum over closed walks v_0, v_1, ..., v_k = v_0 of the products of the entries (A - A~)(v_i, v_{i+1}).

[Figure: a closed walk on vertices 1, 2, 3, 4]

Most terms are zero in expectation, because E[(A - A~)(i,j)] = 0 and distinct edges are sampled independently. So the sum is zero unless each edge appears at least twice.

Will code such walks to count them.

Coding walks

For a sequence v_0, v_1, ..., v_k in which each edge appears at least twice:

S = {i : the edge (v_{i-1}, v_i) was not used before, in either direction}

For i not in S, sigma(i) = the index of the time the edge was used before, in either direction.

Coding walks: Example

[Figure: a closed walk on the graph with vertices 1, 2, 3, 4]

step: 0, 1, 2, 3, 4, 5, 6, 7, 8
vert: 1, 2, 3, 4, 2, 3, 4, 2, 1

S = {1, 2, 3, 4}: the edges first used at steps 1 through 4 are (1,2), (2,3), (3,4), (4,2).

sigma: step 5 reuses the edge of step 2, step 6 that of step 3, step 7 that of step 4, and step 8 that of step 1.
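The coding map in code (a sketch; it reproduces the S and sigma of the example above, recording the first prior use of each edge):

```python
def code_walk(verts):
    """Given a closed walk v_0..v_k, return S (steps using a new edge) and
    sigma (for each other step, the earlier step whose edge it reuses)."""
    S, sigma, first_use = [], {}, {}
    for i in range(1, len(verts)):
        e = frozenset((verts[i - 1], verts[i]))   # edge, in either direction
        if e not in first_use:
            S.append(i)
            first_use[e] = i
        else:
            sigma[i] = first_use[e]               # index of the previous use
    return S, sigma

# The walk from the slide: vert 1, 2, 3, 4, 2, 3, 4, 2, 1.
print(code_walk([1, 2, 3, 4, 2, 3, 4, 2, 1]))
# ([1, 2, 3, 4], {5: 2, 6: 3, 7: 4, 8: 1})
```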

Valid codes (S, sigma)

For each i in S: v_i must be a neighbor of v_{i-1} whose edge has probability of being chosen < 1, so that (A - A~)(v_{i-1}, v_i) can be non-zero.

For each i not in S: the walk must take the edge indicated by sigma(i), that is, the edge used at step sigma(i).

Expected Trace from Code

There are a bounded number of ways to choose the walk given its code (S, sigma); summing this bound over all valid codes bounds the expected trace.

Random Sampling Theorem

Theorem (recap): the sampled A~ is close to A in norm, which is useful for graphs of high conductance.

Conclusion: random sampling sparsifies graphs of high conductance.

Graph Partitioning Algorithms

SDP/LP: too slow.

Spectral: finds one cut quickly, but it can be unbalanced, requiring many runs.

Multilevel (Chaco/Metis): can't analyze; misses small sparse cuts.

New Alg: based on truncated random walks. Approximates the optimal balance, can be used to decompose, and has nearly-linear run time.

Lazy Random Walk

At each step: stay put with prob 1/2; otherwise, move to a neighbor according to the edge weights.

Diffusing probability mass: keep 1/2 for self, distribute the rest among the neighbors. Or, with self-loops: distribute evenly over the edges.

[Figure: a 4-vertex weighted graph with degrees 1, 3, 2, 2; the mass starts at the degree-1 vertex]

start: (1, 0, 0, 0)
step 1: (1/2, 1/2, 0, 0)
step 2: (1/3, 1/2, 1/12, 1/12)
step 3: (1/4, 11/24, 7/48, 7/48)
in the limit: (1/8, 3/8, 2/8, 2/8)

In the limit, the probability mass is proportional to degree.
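The diffusion above, reproduced in code (the 4-vertex graph with degrees 1, 3, 2, 2 is the one with edges (1,2), (2,3), (2,4), (3,4); indices below are 0-based):

```python
import numpy as np

A = np.zeros((4, 4))
for i, j in [(0, 1), (1, 2), (1, 3), (2, 3)]:
    A[i, j] = A[j, i] = 1.0
d = A.sum(axis=1)                  # degrees 1, 3, 2, 2
W = 0.5 * (np.eye(4) + A / d)      # lazy walk matrix, acting on distributions

p = np.array([1.0, 0, 0, 0])
for step in range(3):
    p = W @ p
    print(p)   # 1/2,1/2,0,0  then  1/3,1/2,1/12,1/12  then  1/4,11/24,7/48,7/48
for step in range(500):
    p = W @ p
print(p, d / d.sum())              # limit (1/8, 3/8, 2/8, 2/8): mass ~ degree
```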

Why lazy?

Otherwise, there might be no limit: on a single edge, all the mass oscillates between the two endpoints forever.

Why so lazy as to keep 1/2 of the mass? It ties the walk to diagonally dominant matrices: the walk matrix is W = (1/2)(I + A D^{-1}), where A(i,j) = the weight of the edge from i to j, D(i,i) = the degree of node i, and W(i,j) = the probability of going from j to i.

Rate of convergence

Cheeger's inequality: the convergence rate is related to the conductance.

If the conductance is low, convergence is slow: if we start uniform on S, the probability of leaving at each step is about Phi(S), so after on the order of 1/Phi(S) steps, at least 3/4 of the mass is still in S.

Lovász-Simonovits Theorem

If convergence is slow, then there is a cut of low conductance.

And the cut can be found from the highest-probability nodes.

[Figure: node probabilities .137, .135, .134, .129, .112, .094, with the cut separating the high-probability nodes from the rest]

Lovász-Simonovits Theorem

From now on, every node has the same degree, d.

For all vertex sets S, and all t <= T, the probability mass in S at step t is controlled by a curve defined over the k vertices with the most probability at step t.

Lovász-Simonovits Theorem

If we start from a node in a set S of small conductance, the walk will output a set of small conductance.

Extension: the output is mostly contained in S.

(Again, the candidate sets are the k vertices with the most probability at step t.)

Speed of Lovász-Simonovits

Want run-time proportional to the number of vertices removed: local clustering on a massive graph.

If we want a cut of conductance phi, it suffices to run for a number of steps polynomial in 1/phi.

Problem: most vertices can have non-zero mass when the cut is found.

Speeding up Lovász-Simonovits

Round all small entries of the probability vector to zero.

Algorithm Nibble. Input: a start vertex v and a target set size k. Start: all mass on v. Step: diffuse, then map all values below a truncation threshold to zero.

Theorem: if v is in a set of small conductance and size k, the output is a set of small conductance that mostly overlaps it.
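A sketch of the truncated diffusion at the heart of Nibble (the threshold, step count, and sweep over prefixes are simplified here; the real algorithm sets them from the target size and conductance):

```python
import numpy as np

def nibble_sketch(A, v, eps, steps):
    """Truncated lazy diffusion from v; returns vertices ordered for the sweep."""
    d = A.sum(axis=1)
    p = np.zeros(len(d))
    p[v] = 1.0
    for _ in range(steps):
        p = 0.5 * (p + A @ (p / d))     # one lazy-walk step
        p[p < eps] = 0.0                # round small entries to zero: the speedup
    order = np.argsort(-p / d)          # sweep by probability relative to degree
    return order[p[order] > 0]          # only vertices that still hold mass
```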

The Potential Function

For integer k: I_t(k) = the sum of the k largest probabilities at step t. Linear in between these points. Concave: the slopes decrease.

Easy Inequality: for all k, I_t(k) <= I_{t-1}(k).

Fancy Inequality: I_t(k) is at most the average of I_{t-1} at two points on either side of k, separated according to the conductance of the level set. Chords of the curve that make little progress force a dominating curve that does make progress.
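The potential curve in code (using the probabilities from the picture a few slides back), with a check that its slopes decrease:

```python
import numpy as np

def potential_curve(p):
    """I(k) = sum of the k largest probabilities, for k = 0..n."""
    return np.concatenate(([0.0], np.cumsum(np.sort(p)[::-1])))

p = np.array([0.137, 0.135, 0.134, 0.129, 0.112, 0.094])
I = potential_curve(p)
slopes = np.diff(I)
assert (np.diff(slopes) <= 1e-12).all()   # concave: slopes decrease
```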

Proof of Easy Inequality

[Figure: the mass at time t-1 flowing to the mass at time t]

Order the vertices by probability mass.

If the top k at time t-1 only connect to the top k at time t, we get equality. Otherwise, some mass leaves, and we get inequality.

External Edges from Self Loops

Lemma: for every set S, and every set R of the same size, at least a Phi(S) fraction of the edges from S don't hit R.

[Figure: tight example with Phi(S) = 1/2, showing overlapping sets S and R]

Proof: when R = S, this holds by definition. Each vertex in R - S can absorb d edges from S. But each vertex of S - R has d self-loops that do not go to R.

Proof of Fancy Inequality

[Figure: mass moving from time t-1 to time t, classified as top / in / out]

I_t(k) = top + in <= top + (in + out)/2 = top/2 + (top + in + out)/2

Local Clustering

Theorem: if S is a set of small conductance, and v is a random vertex of S, then the output is a set of small conductance, mostly in S, computed in time proportional to the size of the output.

Can it be done for all conductances?

Experimental code: ClusTree

1. Cluster, crudely

2. Make trees in the clusters

3. Add edges between the trees, optimally

No recursion on reduced matrices.

Implementation: ClusTree, in Java; its timing is not included below, and could increase the total time by 20%.

Comparisons (PCG): Cholesky with drop tolerance, incomplete Cholesky, Vaidya

TAUCS [Chen, Rotkin, Toledo]

Orderings: amd, genmmd, Metis, RCM

Machine: Intel Xeon 3.06 GHz, 512 KB L2 cache, 1 MB L3 cache

Test: 2D grid, Neumann boundary; run to residual error 10^-8

[Plot: 2D Grid, Neumann. x-axis: num nodes, up to 2 x 10^6; y-axis: seconds, 0 to 120. Curves: pcg/ic/genmmd, direct/genmmd, Vaidya, clusTree]

[Plot: 2D Grid, Dirichlet. x-axis: num nodes, up to 2 x 10^6; y-axis: seconds, 0 to 100. Curves: direct/genmmd, Vaidya, pcg/mic/rcm, clusTree]

Impact of boundary conditions: 2D grid, Dirichlet vs. 2D grid, Neumann.

[Plot: 2D Unstructured Delaunay Mesh, Neumann. x-axis: num nodes, up to 15 x 10^5; y-axis: seconds, 0 to 120. Curves: Vaidya, direct/Metis, pcg/ic/genmmd, clusTree]

2D Unstructured Delaunay Mesh

[Plot: 2D Unstructured Delaunay Mesh, Dirichlet. x-axis: num nodes, up to 15 x 10^5; y-axis: seconds, 0 to 120. Curves: Vaidya, direct/Metis, pcg/mic/genmmd, clusTree]

Impact of boundary conditions: Dirichlet vs. Neumann.

Future Work

Practical Local Clustering

More Sparsification

Other physical problems:

Cheeger's inequality

Implications for combinatorics, and spectral hypergraph theory

To learn more

"Nearly-Linear Time Algorithms for Graph Partitioning, Sparsification, and Solving Linear Systems" (STOC '04, arXiv). It will be split into two papers: numerical and combinatorial.

My lecture notes for Spectral Graph Theory and its Applications.