Compact Representations of Separable Graphs

Compact Representations of Separable Graphs

From a paper of the same title submitted to SODA by:

Dan Blandford and Guy Blellochand Ian Kash

Standard Graph Representations

• Standard Adjacency List- |V| + 2|E| words = O(|E|lg|V|) bits- |V| + |E| words if stored as an array

• For arbitrary graphs, lower bound isO(|E| lg (|V|2/|E|)) bits, which adjacency list meets for sparse graphs, i.e. |E| = O(|V|)

Representations for Planar Graphs that use O(|V|) bits

• No Query Support- Turan [1984], First to come up with an O(|V|) bit compression scheme- Keeler and Westbrook [1995], Improved constants- Deo and Litow [1998], First use of separators- He, Kao and Lu [2000], Optimal high order term

• Query Support- Jacobson [1989], O( lg |V|) adjacency queries- Munro and Raman [1997], O(1) adjacency queries- Based on page embeddings and balanced parentheses, so not generally applicable

Graph Separators

A Vertex Separator An Edge Separator

Formally, a class of Graphs is separable if:

1) Every member has a cut set of size βf(|V|) for some β > 0 such that

2) Each component has at most αn vertices for some α < 1

We concern ourselves with the case f(|V|) = |V|c, 0 < c < 1

Classes of Graphs With Small Separators

Many Interesting and Useful Classes- Planar (and all bounded genus)- Telephone / Power / Road networks- 3D Meshes (Miller et al. [1997])- Link Structure of Web

Results

Theory for both edge and vertex separators• Compress Graphs to O(|V|) bits• Meets Lower Bound for Query Times

- Degree Queries – O(1)- Adjacency Queries – O(1)- Neighbor Queries – O(D), D = Degree

Implementation for edge separators

Main Ideas

1) Build separator tree2) Order based on tree3) Compress using difference encoded

adjacency table4) Use root find tree*

* Vertex separators only

A Graph and Its Separator Tree

1

24

3

1

Final Numbering: 1 2 3 4

11

111 3

2

1

4

4 1

2

3

3 2

4

Difference Coded Adjacency Table

• Instead of storing absolute numbers, store offsets

• i.e.: 1,1,3,2,1 instead of 1,2,5,7,8• Makes numbers stored smaller• By itself, it offers no advantage

Gamma Code• Variable length encoding

with 2 parts• 1st Part: Unary code of

length of 2nd part• 2nd Part: Number in binary• Encoding n takes

2└lg(n+1)┘=O(lg(n)) bits• There are other codes

with similar properties (Ex: Delta Code)

Number Encoding1 002 013 10004 10015 10106 10117 1100008 1100019 110010

Theorem: Using a difference coded adjacency table this ordering

requires O(|V|) bits

u

u v

v

Sketch of Proof

• The maximum distance between the numbers of u and v is the number of vertices in their Least Common Ancestor in the separator tree (n)

• With Gamma Code this takes O(lg(n)) bits• Recurrence: S(n) ≤ 2S(.5n) + O(nc lg(n))• As a geometrically decreasing recurrence,

this solves to S(|V|) = O(|V|)

Decoding Vertex i in O(1) time

• We can decode lg(|V|) bits at a time through table lookup

• An Edge Takes O(lg(|V|)) bits, and the degree of a vertex for a graph with good edge separators is constant

• We decode the entire vertex using a constant number of table lookups

Vertex i-1 Degree Edge 2 Edge 3Edge 1 Vertex i+1

Pointer to Vertex i

Problem!

• This assumes we have a pointer to vertex i• Requires |V| pointers of lg |V| bits each =

O(|V| lg|V|), but the graph uses only O(|V|)• Can afford a pointer to a BLOCK of lg |V|

vertices O(lg |V| * |V| / lg |V|) = O(|V|)• Can afford a pointer to a SUBBLOCK of at

least k lg |V| bits• Can afford extra 2 lg |V| bits per block

Getting a Pointer to Vertex i in O(1)

Next BlockPrevious Block 100110 Pointer to Block Pointer to Offsets

Prev. Subblocks Next SubblocksOffset for 3rd SubblockOffset for 2nd Subblock

Pointer to Block (calculate as i / lg (|V|)

Flags to determine subblocks. A 1 is the start of a subblock

The Process:

1) Calculate block number

2) Identify subblock number

3) Get subblock offset

4) Add offset to pointer

5) Decode any Previous vertices (at most k lg |V| bits)

Graphs Used For Testing|V| |E| Max D Desc.

3dGrid 262144 2044102 20 Large 3d Grid

Airfoil 23560 461936 20 3d Mesh of an Airfoil

Crack 10240 60628 82d Triangle Mesh of a

Cracked Plate

Pitt 18234 49566 6 Map of Pittsburgh

In Link 247428 1166146 3205Links to

WebPages (from TREC)

Out Link 247428 1166146 800Links from WebPages

(from TREC)

Experimental Space Results

All #s in bits

Used per edge

Overhead per edge

Used per vertex

Overhead per Vertex

3dGrid 9.38 0.94 73.13 7.35

Airfoil 5.34 0.46 104.66 8.98

Crack 5.90 0.92 34.93 5.42

Pitt 6.45 1.42 17.54 3.87

In Link 5.64 1.01 26.58 4.75

Out Link 8.78 1.23 41.38 5.79

Experimental Time Results

Results on3D Grid

Depth First Search time (seconds)

Breadth First Search time (seconds)

Our Implementation 0.89 0.86

Standard Adjacency List 0.33 0.31

Array Adjacency List 0.17 0.12

Conclusions

• Our Method uses O(|V|) bits to compress a graph with good separators

• On test graphs with good separators, it used less than 12 bits per edge, including overhead

• It meets the lower bounds for queries• With unoptimized table lookup it takes 3 times as

long to search as a standard adjacency list representation and 6 times as long as an array based adjacency list

Future Work

• Test decoding speed on more graphs• Optimize decoding• Allow addition / deletion of edges• More Efficient Separator Generator• Implement Vertex Separators

Planar Embeddings and Parentheses

• Create a Page Embedding

• Represent vertices and edges as parentheses- () for a vertex- ( for start of an edge- ) close for end- Figure on left:

()((())(()))((())(()))• No good page embedding

algorithm for arbitrary graphs

Compact Representations of Separable Graphs

Documents

Transcript of Compact Representations of Separable Graphs