
Sparsification of Graphs and Matrices

Daniel A. Spielman

Yale University

joint work with

Joshua Batson (MIT), Nikhil Srivastava (MSR), Shang-Hua Teng (USC)

HUJI, May 21, 2014

Objective of Sparsification:

Approximate any (weighted) graph by a sparse weighted graph.

Spanners - preserve distances [Chew '89]

Cut-sparsifiers - preserve the weight of the edges leaving every set S ⊆ V [Benczur-Karger '96]

(figure: a set S and the edges crossing to its complement)

Spectral Sparsification [S-Teng]

Approximate any (weighted) graph by a sparse weighted graph.

Graph: $G = (V, E, w)$

Laplacian quadratic form:

$x^T L_G x = \sum_{(u,v) \in E} w_{u,v} (x(u) - x(v))^2$

Laplacian Quadratic Form, examples

All edge-weights are 1. The form is the sum of squares of differences across edges:

$x^T L_G x = \sum_{(u,v) \in E} (x(u) - x(v))^2$

(figure: a small example graph with x-values 0, 1, 0, 1, 1 at the vertices; each edge contributes the square of the difference of its endpoint values)

Laplacian Quadratic Form, examples

When x is the characteristic vector of a set S, the form sums the weights of the edges on the boundary of S:

$x^T L_G x = \sum_{(u,v) \in E,\ u \in S,\ v \notin S} w_{u,v}$

(figure: a graph with vertices labeled 0 and 1; S is the set of vertices labeled 1)
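To make the two examples concrete, here is a minimal numpy sketch (the 5-vertex cycle and the set S are my own choices, not from the slides):

```python
import numpy as np

# A made-up example graph: 5 vertices on a cycle, all edge-weights 1.
edges = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 0)]
weights = [1.0] * len(edges)

def quadratic_form(x, edges, weights):
    # x^T L_G x = sum over edges (u,v) of w_uv * (x(u) - x(v))^2
    return sum(w * (x[u] - x[v]) ** 2 for (u, v), w in zip(edges, weights))

# Characteristic vector of S = {0, 1}: the form counts the boundary edges.
x = np.array([1.0, 1.0, 0.0, 0.0, 0.0])
print(quadratic_form(x, edges, weights))  # 2.0: edges (1,2) and (4,0) leave S
```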

Learning on Graphs (Zhu-Ghahramani-Lafferty '03)

Infer values of a function at all vertices from known values at a few vertices.

Minimize $\sum_{(a,b) \in E} (x(a) - x(b))^2$

subject to the known values.

(figure: a graph with known values 0 and 1 at two vertices; the minimizer assigns values such as 0.5, 0.625, and 0.375 to the rest)
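A minimal sketch of this inference, assuming only numpy and a made-up 4-cycle with two labeled vertices: minimizing the quadratic form with some entries fixed reduces to one linear solve on the unknown block.

```python
import numpy as np

# Hypothetical graph: a 4-cycle, given by its Laplacian.
edges = [(0, 1), (1, 2), (2, 3), (3, 0)]
n = 4
L = np.zeros((n, n))
for u, v in edges:
    L[u, u] += 1; L[v, v] += 1
    L[u, v] -= 1; L[v, u] -= 1

known = {0: 0.0, 2: 1.0}                      # the labeled vertices
U = [i for i in range(n) if i not in known]   # the unknown vertices

# Minimizing sum (x(a)-x(b))^2 with fixed entries gives the linear system
#   L[U,U] x[U] = -L[U,known] x[known]
b = -L[np.ix_(U, list(known))] @ np.array(list(known.values()))
x = np.zeros(n)
for i, val in known.items():
    x[i] = val
x[U] = np.linalg.solve(L[np.ix_(U, U)], b)
print(x)  # each inferred value is the average of its neighbors: [0, 0.5, 1, 0.5]
```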

Graphs as Resistor Networks

View edges as resistors connecting vertices. Apply voltages at some vertices; measure the induced voltages and current flow.

The induced voltages minimize $\sum_{(a,b) \in E} (v(a) - v(b))^2$ subject to the voltages fixed by the battery.

(figure: a network with 1V and 0V applied at two vertices; induced voltages 0.5V, 0.625V, 0.375V appear at the others)

Graphs as Resistor Networks

Effective resistance between s and t = the potential difference needed to drive one unit of current from s to t.

(figures: two small networks of unit resistors with the potentials induced by a unit s-t flow; one has $R_{\mathrm{eff}}(s,t) = 2$, the other $R_{\mathrm{eff}}(s,t) = 1.5$)
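Effective resistance also has a closed form via the pseudoinverse, $R_{\mathrm{eff}}(s,t) = (\delta_s - \delta_t)^T L_G^{+} (\delta_s - \delta_t)$: a standard identity, not stated on the slides. A small numpy sketch (the series path is my own example):

```python
import numpy as np

def laplacian(n, edges, weights):
    L = np.zeros((n, n))
    for (u, v), w in zip(edges, weights):
        L[u, u] += w; L[v, v] += w
        L[u, v] -= w; L[v, u] -= w
    return L

def effective_resistance(L, s, t):
    # R_eff(s,t) = (delta_s - delta_t)^T L^+ (delta_s - delta_t)
    b = np.zeros(L.shape[0]); b[s], b[t] = 1.0, -1.0
    return b @ np.linalg.pinv(L) @ b

# Path s - m - t with two unit resistors in series: resistances add.
L = laplacian(3, [(0, 1), (1, 2)], [1.0, 1.0])
print(effective_resistance(L, 0, 2))  # 2.0
```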

Laplacian Matrices

$x^T L_G x = \sum_{(u,v) \in E} w_{u,v} (x(u) - x(v))^2$

$L_G = \sum_{(u,v) \in E} w_{u,v} L_{u,v}$

E.g. $L_{1,2} = \begin{pmatrix} 1 & -1 \\ -1 & 1 \end{pmatrix} = \begin{pmatrix} 1 \\ -1 \end{pmatrix} \begin{pmatrix} 1 & -1 \end{pmatrix}$

A sum of outer products:

$L_G = \sum_{(u,v) \in E} w_{u,v} \, b_{u,v} b_{u,v}^T$, where $b_{u,v} = \delta_u - \delta_v$

Positive semidefinite. If G is connected, the nullspace is Span(1).
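A quick numerical check of the outer-product form (a sketch; the weighted triangle is my own example):

```python
import numpy as np

n = 3
edges, weights = [(0, 1), (1, 2), (0, 2)], [1.0, 2.0, 3.0]

# L_G = sum over edges of w_uv * b_uv b_uv^T, with b_uv = delta_u - delta_v
L = np.zeros((n, n))
for (u, v), w in zip(edges, weights):
    b = np.zeros(n); b[u], b[v] = 1.0, -1.0
    L += w * np.outer(b, b)

print(np.linalg.eigvalsh(L))  # all eigenvalues >= 0: positive semidefinite
print(L @ np.ones(n))         # the all-ones vector is in the nullspace
```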

Inequalities on Graphs and Matrices

For graphs $G = (V,E,w)$ and $H = (V,F,z)$, and matrices $M$ and $\tilde{M}$:

$M \preccurlyeq \tilde{M}$ if $x^T M x \le x^T \tilde{M} x$ for all $x$

$G \preccurlyeq H$ if $L_G \preccurlyeq L_H$

$G \preccurlyeq kH$ if $L_G \preccurlyeq k L_H$

Approximations of Graphs and Matrices

For graphs $G = (V,E,w)$ and $H = (V,F,z)$:

$M \approx_\epsilon \tilde{M}$ if $\frac{1}{1+\epsilon} M \preccurlyeq \tilde{M} \preccurlyeq (1+\epsilon) M$

$G \approx_\epsilon H$ if $L_G \approx_\epsilon L_H$

That is, for all $x$:

$\frac{1}{1+\epsilon} \le \frac{\sum_{(u,v) \in F} z_{u,v} (x(u) - x(v))^2}{\sum_{(u,v) \in E} w_{u,v} (x(u) - x(v))^2} \le 1+\epsilon$
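To make the definition concrete, a numpy sketch (my own helper, assuming both graphs are connected on the same vertex set) that computes the smallest such ε:

```python
import numpy as np

def psd_pinv_sqrt(A, tol=1e-9):
    # Pseudoinverse square root of a symmetric PSD matrix.
    w, V = np.linalg.eigh(A)
    s = np.array([1.0 / np.sqrt(x) if x > tol else 0.0 for x in w])
    return V @ np.diag(s) @ V.T

def approx_epsilon(LG, LH):
    # Smallest eps with LG/(1+eps) <= LH <= (1+eps) LG: the nonzero
    # eigenvalues of LG^{+/2} LH LG^{+/2} must lie in [1/(1+eps), 1+eps].
    P = psd_pinv_sqrt(LG)
    vals = np.linalg.eigvalsh(P @ LH @ P)
    vals = vals[vals > 1e-9]  # drop the shared nullspace direction (span of 1)
    return max(vals.max() - 1.0, 1.0 / vals.min() - 1.0)
```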

Implications of Approximation

If $G \approx_\epsilon H$:

$L_H$ and $L_G$ have similar eigenvalues.

$L_G^+ \approx_\epsilon L_H^+$, so solutions to systems of linear equations are similar.

Boundaries of sets are similar.

Effective resistances are similar.

Spectral Sparsification [S-Teng]

For an input graph G with n vertices, find a sparse graph H having $O(n)$ edges so that $G \approx_\epsilon H$.

Why? Solving linear equations in Laplacian matrices: sparsification is a key part of the nearly-linear time algorithm, with uses in learning on graphs, maxflow, PDEs, ...

Sparsifiers preserve eigenvectors, eigenvalues, and electrical properties; they generalize expanders; and they give certifiable cut-sparsifiers.

Approximations of Complete Graphs are Expanders

Expanders: d-regular graphs on n vertices (n grows, d fixed) in which every set of vertices has a large boundary; random walks mix quickly; incredibly useful.

Weak expanders: eigenvalues bounded away from 0. Strong expanders: all eigenvalues near d.

Example: Approximating a Complete Graph

For G the complete graph on n vertices, all non-zero eigenvalues of $L_G$ are n. For $x \perp 1$ with $\|x\| = 1$: $x^T L_G x = n$.

For H a d-regular strong expander, all non-zero eigenvalues of $L_H$ are close to d. For $x \perp 1$ with $\|x\| = 1$: $x^T L_H x \sim d$.

So $\frac{n}{d} H$ is a good approximation of G.
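The first claim is a two-line check in numpy (my own illustration, using $L_{K_n} = nI - \mathbf{1}\mathbf{1}^T$):

```python
import numpy as np

n = 6
L = n * np.eye(n) - np.ones((n, n))      # Laplacian of the complete graph K_n
print(np.round(np.linalg.eigvalsh(L)))   # one 0, and n-1 eigenvalues equal to n
```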

Best Approximations of Complete Graphs

Ramanujan Expanders [Margulis, Lubotzky-Phillips-Sarnak]:

$d - 2\sqrt{d-1} \le \lambda(L_H) \le d + 2\sqrt{d-1}$ for all non-zero eigenvalues.

Cannot do better if n grows while d is fixed [Alon-Boppana].

Can we approximate every graph this well?

Example: Dumbbell

G: two complete graphs on n vertices, joined by a single edge of weight 1.

H: on each side, a d-regular Ramanujan graph scaled by n/d; keep the middle edge.

(figure: $K_n$ - edge - $K_n$, and the sparsified version)

Decompose $G = G_1 + G_2 + G_3$ and $H = H_1 + H_2 + H_3$: the two complete graphs ($G_1$, $G_3$), the middle edge ($G_2 = H_2$), and their counterparts in H. Then

$G_1 \preccurlyeq (1+\epsilon) H_1$, $G_2 = H_2$, $G_3 \preccurlyeq (1+\epsilon) H_3$

imply $G \preccurlyeq (1+\epsilon) H$.

Cut-Sparsifiers [Benczur-Karger '96]

For every G, there is an H with $O(n \log n / \epsilon^2)$ edges such that for all $x \in \{0,1\}^n$:

$\frac{1}{1+\epsilon} \le \frac{x^T L_H x}{x^T L_G x} \le 1+\epsilon$

Cut-approximation is different

(figure: vertices 0, 1, 2, 3, ..., m in a path, with k parallel unit edges between consecutive vertices and one long edge from 0 to m)

For the test vector $x(i) = i$:

$x^T L_G x = km + m^2$

Every cut is crossed by at least k short edges, so a cut-sparsifier can afford to drop the long edge; but the long edge contributes the $m^2$ term above, so a spectral sparsifier needs the long edge if $k < m$.

Main Theorems for Graphs

For every $G = (V,E,w)$, there is an $H = (V,F,z)$ with

$|F| \le |V| \, (2+\epsilon)^2/\epsilon^2$ and $G \approx_\epsilon H$.

This is within a factor of 2 of the Ramanujan bound.

By careful random sampling one gets a sparsifier with $O(n \log n / \epsilon^2)$ edges (S-Srivastava '08), in time $O(|E| \log^2 |V| \log(1/\epsilon))$ (Koutis-Levin-Peng '12).

Sparsification by Random Sampling

Assign a probability $p_{u,v}$ to each edge (u,v). Include edge (u,v) in H with probability $p_{u,v}$; if it is included, give it weight $w_{u,v}/p_{u,v}$. Then

$E\left[L_H\right] = \sum_{(u,v) \in E} p_{u,v} \, (w_{u,v}/p_{u,v}) \, L_{u,v} = L_G$

Sparsification by Random Sampling

Choose $p_{u,v}$ to be $w_{u,v}$ times the effective resistance between u and v. Low resistance between u and v means there are many alternate routes for current to flow, so the edge is not critical. The proof is by random matrix concentration bounds (Rudelson, Ahlswede-Winter, Tropp, etc.).
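A compact sketch of this sampling scheme (assumptions: dense numpy linear algebra, exact resistances from the pseudoinverse, and q draws with replacement rather than the independent inclusions described above; real implementations approximate the resistances):

```python
import numpy as np

def sample_sparsifier(n, edges, weights, q, seed=0):
    rng = np.random.default_rng(seed)
    # Build L_G and the edge vectors b_e.
    L = np.zeros((n, n))
    B = []
    for (u, v), w in zip(edges, weights):
        b = np.zeros(n); b[u], b[v] = 1.0, -1.0
        B.append(b)
        L += w * np.outer(b, b)
    # p_e proportional to w_e * R_eff(u,v); for connected G these sum to n-1.
    Lp = np.linalg.pinv(L)
    scores = np.array([w * (b @ Lp @ b) for b, w in zip(B, weights)])
    p = scores / scores.sum()
    # q draws with replacement, each reweighted by w_e / (q p_e), so E[L_H] = L_G.
    new_w = np.zeros(len(edges))
    for e in rng.choice(len(edges), size=q, p=p):
        new_w[e] += weights[e] / (q * p[e])
    return [(edges[e], new_w[e]) for e in np.flatnonzero(new_w)]
```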

Matrix Sparsification

Given $M = B^T B = \sum_e b_e b_e^T$, find $\tilde{M} = \sum_e s_e b_e b_e^T$ with most $s_e = 0$, so that

$\frac{1}{1+\epsilon} M \preccurlyeq \tilde{M} \preccurlyeq (1+\epsilon) M$

Main Theorem (Batson-S-Srivastava)

For $M = \sum_e b_e b_e^T$, there exist $s_e$, at most $n(2+\epsilon)^2/\epsilon^2$ of them non-zero, so that for $\tilde{M} = \sum_e s_e b_e b_e^T$:

$M \approx_\epsilon \tilde{M}$

Simplification of Matrix Sparsification

$\frac{1}{1+\epsilon} M \preccurlyeq \tilde{M} \preccurlyeq (1+\epsilon) M$

is equivalent to

$\frac{1}{1+\epsilon} I \preccurlyeq M^{-1/2} \tilde{M} M^{-1/2} \preccurlyeq (1+\epsilon) I$

Set $v_e = M^{-1/2} b_e$. Then $\sum_e v_e v_e^T = I$, a "decomposition of the identity": $\sum_e \langle u, v_e \rangle^2 = \|u\|^2$ for every u.

We need $\sum_e s_e v_e v_e^T \approx_\epsilon I$.

Random sampling sets $p_e = \|v_e\|^2$.
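A numpy sketch of the whitening step (a random instance of my own; assumes M is positive definite):

```python
import numpy as np

rng = np.random.default_rng(1)
n, m = 4, 20
B = rng.standard_normal((m, n))
M = B.T @ B                        # M = sum of outer products b_e b_e^T

# Whiten: v_e = M^{-1/2} b_e, so that sum_e v_e v_e^T = I.
w, V = np.linalg.eigh(M)
M_inv_sqrt = V @ np.diag(w ** -0.5) @ V.T
Vecs = B @ M_inv_sqrt              # row e is v_e^T
print(np.allclose(Vecs.T @ Vecs, np.eye(n)))  # True: decomposition of identity
```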

What happens when we add a vector? Interlacing.

More precisely: let $p_A$ be the characteristic polynomial of A. Rank-one update:

$p_{A + vv^T}(x) = \left(1 + \sum_i \frac{\langle v, u_i \rangle^2}{\lambda_i - x}\right) p_A(x)$

where $A u_i = \lambda_i u_i$. The new eigenvalues are the zeros of this polynomial; they interlace the $\lambda_i$.
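The identity is easy to sanity-check numerically (numpy sketch; the random A, v, and the test point x = 0.7 are my own choices):

```python
import numpy as np

rng = np.random.default_rng(2)
n = 5
A = rng.standard_normal((n, n)); A = (A + A.T) / 2   # random symmetric A
v = rng.standard_normal(n)
lam, U = np.linalg.eigh(A)                           # A u_i = lam_i u_i

def charpoly(M, x):
    return np.linalg.det(M - x * np.eye(len(M)))

x = 0.7  # any point that is not an eigenvalue of A
lhs = charpoly(A + np.outer(v, v), x)
rhs = (1 + np.sum((U.T @ v) ** 2 / (lam - x))) * charpoly(A, x)
print(np.isclose(lhs, rhs))  # True
```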

Physical model of interlacing

The $\lambda_i$ are positive unit charges resting at barriers on a slope. $u_i$ is the i-th eigenvector, $v$ is the added vector, and $\langle v, u_i \rangle^2$ is the charge placed on barrier i.

Barriers repel the eigenvalues (inverse-law repulsion); gravity pulls them back down the slope.

Examples

Ex 1: All weight on $u_1$ ($v = u_1$). The eigenvalue is pushed up against the next barrier; gravity keeps the others resting on their barriers.

Ex 2: Equal weight on $u_1$ and $u_2$.

Ex 3: Equal weight on all of $u_1, u_2, \ldots, u_n$: charge $+1/n$ on every barrier.

Adding a random $v_e$

Because the $v_e$ form a decomposition of the identity, $E_e\left[\langle v_e, u_i \rangle^2\right] = 1/m$, so

$E_e\left[p_{A + v_e v_e^T}(x)\right] = \left(1 + \frac{1}{m} \sum_i \frac{1}{\lambda_i - x}\right) p_A(x) = p_A(x) - \frac{1}{m} \frac{d}{dx} \, p_A(x)$

Many random $v_e$

$E_e\left[p_{A + v_e v_e^T}(x)\right] = \left(1 - \frac{1}{m} \frac{d}{dx}\right) p_A(x)$

$E_{e_1,\ldots,e_k}\left[p_{v_{e_1} v_{e_1}^T + \cdots + v_{e_k} v_{e_k}^T}(x)\right] = \left(1 - \frac{1}{m} \frac{d}{dx}\right)^k x^n$

This is an associated Laguerre polynomial! For $k = n/\epsilon^2$, its roots lie between $\frac{(1-\epsilon)^2 n}{m \epsilon^2}$ and $\frac{(1+\epsilon)^2 n}{m \epsilon^2}$.
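The expected polynomial is easy to experiment with (numpy sketch; the parameters n, m, ε are my own, and the printed window is the bound quoted above):

```python
import numpy as np

def expected_poly_roots(n, m, k):
    # Apply (1 - (1/m) d/dx) k times to x^n, using numpy's coefficient
    # convention (highest degree first), then take roots.
    p = np.zeros(n + 1); p[0] = 1.0                  # coefficients of x^n
    for _ in range(k):
        p = p - (1.0 / m) * np.concatenate(([0.0], np.polyder(p)))
    return np.sort(np.roots(p))

n, m, eps = 6, 60, 1.0
roots = expected_poly_roots(n, m, int(n / eps**2))
print(roots.min(), roots.max())
# Compare with the predicted window [(1-eps)^2, (1+eps)^2] * n/(m*eps^2):
print((1 - eps)**2 * n / (m * eps**2), (1 + eps)**2 * n / (m * eps**2))
```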

Matrix Sparsification Proof Sketch

Have $\sum_e v_e v_e^T = I$. Want $\sum_e s_e v_e v_e^T \approx_\epsilon I$.

Will do this with $|\{e : s_e \ne 0\}| \le 6n$ and all eigenvalues between 1 and 13, giving $\epsilon \approx 2.6$.

Broad outline: moving barriers

Step 1

Start with all eigenvalues at 0 and barriers at -n and n. Move the lower barrier up by 1/3 and the upper barrier up by 2: they now sit at -n + 1/3 (a tighter constraint) and n + 2 (a looser constraint).

Step i+1

At each step, move the lower barrier up by 1/3 and the upper barrier up by 2, and add a rank-one term that keeps all eigenvalues between the barriers.

Step 6n

After 6n steps the barriers have moved from -n and n to n and 13n, so all eigenvalues lie between n and 13n: a 2.6-approximation with 6n vectors.

Problem

We need to show that an appropriate vector to add always exists. Interlacing alone is not strong enough for induction:

If there are many small eigenvalues, we can only move one of them.

Bunched large eigenvalues repel the highest one.

The Lower Barrier Potential Function

$\Phi_\ell(A) = \sum_i \frac{1}{\lambda_i - \ell} = \mathrm{Tr}\left((A - \ell I)^{-1}\right)$

$\Phi_\ell(A) \le 1 \implies \lambda_{\min}(A) \ge \ell + 1$

More generally: no $\lambda_i$ within distance 1 of the barrier, no 2 of them within distance 2, no 3 within distance 3, ..., no k within distance k.

The Upper Barrier Potential Function

$\Phi^u(A) = \sum_i \frac{1}{u - \lambda_i} = \mathrm{Tr}\left((uI - A)^{-1}\right)$
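Both potentials are one-liners (numpy sketch; the diagonal A and the barrier positions are arbitrary illustrations):

```python
import numpy as np

def upper_potential(A, u):
    # Phi^u(A) = Tr((uI - A)^{-1}); meaningful only when u > lambda_max(A)
    return np.trace(np.linalg.inv(u * np.eye(len(A)) - A))

def lower_potential(A, l):
    # Phi_l(A) = Tr((A - lI)^{-1}); meaningful only when l < lambda_min(A)
    return np.trace(np.linalg.inv(A - l * np.eye(len(A))))

A = np.diag([1.0, 2.0, 3.0])
print(upper_potential(A, 4.0))   # 1/3 + 1/2 + 1 = 1.833...
print(lower_potential(A, 0.0))   # 1 + 1/2 + 1/3 = 1.833...
```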

The Beginning

A = 0: all n eigenvalues sit at 0, with barriers at -n and n. Both potentials start at $n \cdot \frac{1}{n} = 1$.

Step i+1

Lemma. We can always choose some $s_e v_e v_e^T$ to add so that neither potential increases when the barriers move by +1/3 and +2.

Step 6n

All eigenvalues lie between n and 13n: a 2.6-approximation with 6n vectors.

Goal

Prove the Lemma: some $s_e v_e v_e^T$ can always be chosen so that the potentials do not increase as the barriers move by +1/3 and +2.

Upper Barrier Update

Add $svv^T$ and set $u' = u + 2$. By the Sherman-Morrison formula:

$\Phi^{u'}(A + svv^T) = \mathrm{Tr}\left((u'I - A - svv^T)^{-1}\right) = \Phi^{u'}(A) + \frac{s \, v^T (u'I - A)^{-2} v}{1 - s \, v^T (u'I - A)^{-1} v}$

Need $\Phi^{u'}(A + svv^T) \le \Phi^u(A)$.

How much of $vv^T$ can we add?

Rearranging: $\Phi^{u'}(A + svv^T) \le \Phi^u(A)$ iff

$\frac{1}{s} \ge v^T \left( \frac{(u'I - A)^{-2}}{\Phi^u(A) - \Phi^{u'}(A)} + (u'I - A)^{-1} \right) v$

Write this as $1 \ge s \, v^T U_A v$.

Lower Barrier

Similarly: $\Phi_{\ell'}(A + svv^T) \le \Phi_\ell(A)$ iff

$1 \le s \, v^T \left( \frac{(A - \ell' I)^{-2}}{\Phi_{\ell'}(A) - \Phi_\ell(A)} - (A - \ell' I)^{-1} \right) v$

Write this as $1 \le s \, v^T L_A v$.

Goal

Show that we can always add some vector $svv^T$ while respecting both barriers (+1/3 and +2):

Need $s \, v^T U_A v \le 1 \le s \, v^T L_A v$.

Two expectations

Need: $s \, v^T U_A v \le 1 \le s \, v^T L_A v$

Can show: $E_e\left[v_e^T U_A v_e\right] \le \frac{3}{2m}$ and $E_e\left[v_e^T L_A v_e\right] \ge \frac{2}{m}$

So: $E_e\left[v_e^T U_A v_e\right] \le E_e\left[v_e^T L_A v_e\right]$

Hence there exists an e with $v_e^T U_A v_e \le v_e^T L_A v_e$, and an s that puts $1/s$ between them.

Bounding expectations

$v^T U_A v = \mathrm{Tr}\left(U_A v v^T\right)$

$E_e\left[\mathrm{Tr}\left(U_A v_e v_e^T\right)\right] = \mathrm{Tr}\left(U_A \, E_e\left[v_e v_e^T\right]\right) = \mathrm{Tr}\left(U_A\right)/m$

using $E_e\left[v_e v_e^T\right] = I/m$.

Bounding expectations

$\mathrm{Tr}\left(U_A\right) = \frac{\mathrm{Tr}\left((u'I - A)^{-2}\right)}{\Phi^u(A) - \Phi^{u'}(A)} + \mathrm{Tr}\left((u'I - A)^{-1}\right)$

The second term is $\Phi^{u'}(A) \le \Phi^u(A) \le 1$, as the barrier function is monotone decreasing in u.

The numerator of the first term is the derivative of the barrier function; as the barrier function is convex, the first term is at most $\frac{1}{u' - u} = \frac{1}{2}$.

So $\mathrm{Tr}(U_A) \le 1/2 + 1 = 3/2$.

Similarly, $\mathrm{Tr}\left(L_A\right) \ge \frac{1}{\ell' - \ell} - 1 = \frac{1}{1/3} - 1 = 2$.

Step i+1

These bounds prove the Lemma: some $s_e v_e v_e^T$ can always be added so that neither potential increases as the barriers move by +1/3 and +2.
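Putting the pieces together, here is a deliberately naive numpy sketch of the whole barrier algorithm. The step sizes 1/3 and 2, the potentials, and the $U_A$/$L_A$ tests are exactly as on the slides; the linear scan over vectors and the choice $t = 2/(v^T U_A v + v^T L_A v)$ are my own concrete instantiation of "an appropriate vector always exists".

```python
import numpy as np

def bss_sparsify(V):
    """Sketch of the Batson-Spielman-Srivastava barrier algorithm.

    V: an m x n array whose rows v_e satisfy sum_e v_e v_e^T = I.
    Runs 6n steps with barrier shifts +1/3 and +2, so the returned s has
    at most 6n nonzeros and sum_e s_e v_e v_e^T has its eigenvalues
    (roughly) between n and 13n.
    """
    m, n = V.shape
    A = np.zeros((n, n))
    s = np.zeros(m)
    l, u = -float(n), float(n)
    I = np.eye(n)
    for _ in range(6 * n):
        l, u = l + 1.0 / 3.0, u + 2.0          # move the barriers
        Xu = np.linalg.inv(u * I - A)          # (u'I - A)^{-1}
        Xl = np.linalg.inv(A - l * I)          # (A - l'I)^{-1}
        # Potential decreases bought by the shifts:
        # Phi^u(A) - Phi^{u'}(A) > 0 and Phi_{l'}(A) - Phi_l(A) > 0.
        du = np.trace(np.linalg.inv((u - 2.0) * I - A)) - np.trace(Xu)
        dl = np.trace(Xl) - np.trace(np.linalg.inv(A - (l - 1.0 / 3.0) * I))
        UA = Xu @ Xu / du + Xu                 # upper-barrier quadratic form
        LA = Xl @ Xl / dl - Xl                 # lower-barrier quadratic form
        for e in range(m):
            v = V[e]
            uq, lq = v @ UA @ v, v @ LA @ v
            if uq <= lq:                       # the Lemma guarantees such an e
                t = 2.0 / (uq + lq)            # any t with uq <= 1/t <= lq
                s[e] += t
                A += t * np.outer(v, v)
                break
    return s
```

Each step costs a few dense inverses, so this is for understanding rather than performance; the point is that the potentials never increase while the barriers march from -n and n up to n and 13n.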

Twice-Ramanujan Sparsifiers

Fixing roughly dn steps and tightening the parameters gives ratio

$\frac{\lambda_{\max}(A)}{\lambda_{\min}(A)} \le \frac{d + 1 + 2\sqrt{d}}{d + 1 - 2\sqrt{d}}$

Less than twice as many edges as used by a Ramanujan expander of the same quality.

Open Questions

The Ramanujan bound: can it be met? Do the vectors that come from graphs have special properties?

Faster algorithms: is a union of random Hamiltonian cycles a good sparsifier?