
Thesis Defense: Large-Scale Graph Computation on Just a PC

Aapo Kyrölä (akyrola@cs.cmu.edu)

Thesis Committee:
Carlos Guestrin (University of Washington & CMU)
Guy Blelloch (CMU)
Alex Smola (CMU)
Dave Andersen (CMU)
Jure Leskovec (Stanford)

2

Research Fields

(figure) This thesis sits at the intersection of several research fields: machine learning and data mining (graph analysis and mining) provide the motivation and applications, while the research contributions lie in external-memory algorithms research, systems research, and databases.

3

Why Graphs?

Large-Scale Graph Computation on Just a PC

4

Big Data with Structure: Big Graph

(examples) social graph, follow-graph, consumer-products graph, user-movie ratings graph, DNA interaction graph, WWW link graph, communication networks (but "only 3 hops")

5

Why on a single machine?

Can't we just use the Cloud?

Large-Scale Graph Computation on Just a PC

6

Why use a cluster?

Two reasons:
1. One computer cannot handle my graph problem in a reasonable time.
2. I need to solve the problem very fast.

7

Why use a cluster?

Two reasons:
1. One computer cannot handle my graph problem in a reasonable time.
2. I need to solve the problem very fast.

Our work expands the space of feasible problems on one machine (PC): our experiments use the same graphs as, or bigger graphs than, previous papers on distributed graph computation (+ we can do the Twitter graph on a laptop).

Our work raises the bar on the required performance for a "complicated" system.

8

Benefits of single machine systems

Assuming it can handle your big problems…

1. Programmer productivity
– Global state, debuggers…

2. Inexpensive to install, administer; less power.

3. Scalability
– Use a cluster of single-machine systems to solve many tasks in parallel. Idea: trade latency for throughput.

9

Computing on Big Graphs

Large-Scale Graph Computation on Just a PC

10

Big Graphs = Big Data

Data size: 140 billion connections ≈ 1 TB. Not a problem!

Computation: hard to scale.

Twitter network visualization by Akshay Java 2009

11

Research Goal

Compute on graphs with billions of edges in a reasonable time on a single PC

– Reasonable = close to numbers previously reported for distributed systems in the literature.

Experiment PC: Mac Mini (2012)

12

Terminology

• (Analytical) Graph Computation
– Whole graph is processed, typically for several iterations; vertex-centric computation.
– Examples: Belief Propagation, Pagerank, Community detection, Triangle Counting, Matrix Factorization, Machine Learning…

• Graph Queries (database)
– Selective graph queries (compare to SQL queries).
– Traversals: shortest-path, friends-of-friends…

13

(figure: overview of the thesis work)
GraphChi (Parallel Sliding Windows): batch computation, incremental computation, evolving graphs; DrunkardMob: parallel random walk simulation; GraphChi^2.
GraphChi-DB (Partitioned Adjacency Lists): online graph updates, edge and vertex properties.
Graph Computation: Triangle Counting, Item-Item Similarity, PageRank, SALSA, HITS, Graph Contraction, Minimum Spanning Forest, k-Core, Weakly/Strongly Connected Components, Community Detection, Label Propagation, Multi-BFS, Matrix Factorization, Loopy Belief Propagation, Co-EM.
Graph Queries: Shortest Path, Friends-of-Friends, Induced Subgraphs, Neighborhood query, Link prediction, Graph sampling, Graph traversal.

Thesis statement

The Parallel Sliding Windows algorithm and the Partitioned Adjacency Lists data structure enable computation on very large graphs in external memory, on just a personal computer.

14

DISK-BASED GRAPH COMPUTATION

15

(figure: the overview diagram again, here with the GraphChi / graph-computation half in focus: Parallel Sliding Windows, batch computation, evolving graphs, and the graph-computation applications listed on the previous overview)

16

Computational Model

• Graph G = (V, E)
– directed edges: e = (source, destination)
– each edge and vertex associated with a value (user-defined type)
– vertex and edge values can be modified
• (structure modification also supported)

(figure: vertices A and B connected by edge e, with data attached to every vertex and edge)

Terms: e is an out-edge of A, and an in-edge of B.

17


Vertex-centric Programming

• "Think like a vertex"
• Popularized by the Pregel and GraphLab projects
– Historically: systolic computation and the Connection Machine

MyFunc(vertex):  # modify neighborhood

function Pagerank(vertex):
    insum = sum(edge.value for edge in vertex.inedges)
    vertex.value = 0.15 + 0.85 * insum
    foreach edge in vertex.outedges:
        edge.value = vertex.value / vertex.num_outedges

18

Computational Setting

Constraints:
A. Not enough memory to store the whole graph in memory, nor all the vertex values.
B. Enough memory to store one vertex and its edges, with associated values.

19

The Main Challenge of Disk-based Graph Computation

Random Access

~100K reads / sec (commodity); ~1M reads / sec (high-end arrays)

<< 5-10 M random edges / sec needed to achieve "reasonable performance"

20

Random Access Problem

(figure: a file of edge-values holding A's in-edges and out-edges and B's in-edges and out-edges; processing it sequentially forces random reads and random writes for the other direction)

Moral: you can either access in-edges or out-edges sequentially, but not both!

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

• PSW processes the graph one sub-graph at a time.
• In one iteration, the whole graph is processed.
– And typically the next iteration is started.

1. Load
2. Compute
3. Write
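An editor's sketch of this iteration structure in Python (the graph_on_disk object and its methods are placeholders for illustration, not the GraphChi API; each load and write in the real system is one large sequential disk transfer):

    def run_psw(graph_on_disk, num_iterations, update):
        for it in range(num_iterations):
            for interval in graph_on_disk.intervals():      # one sub-graph at a time
                subgraph = graph_on_disk.load(interval)      # 1. Load
                for vertex in subgraph.vertices():           # 2. Compute (user-defined update)
                    update(vertex)
                graph_on_disk.write(interval, subgraph)      # 3. Write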

23

PSW: Shards and Intervals

• Vertices are numbered from 1 to n.
– P intervals
– sub-graph = interval of vertices

(figure: interval(1), interval(2), …, interval(P) over vertex ids 1 … 100 … 700 … 10000, one shard per interval; each edge (source, destination) is placed by «partition-by» destination; phases: 1. Load, 2. Compute, 3. Write)

In shards, edges are sorted by source.
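A small hedged sketch of that partitioning rule in Python (the function and interval list are illustrative; real GraphChi preprocessing works on edge files and picks interval boundaries that balance shard sizes):

    def create_shards(edges, intervals):
        """edges: list of (src, dst); intervals: list of (lo, hi) vertex-id ranges."""
        shards = [[] for _ in intervals]
        for src, dst in edges:
            for p, (lo, hi) in enumerate(intervals):
                if lo <= dst <= hi:          # an edge goes to the shard of its destination
                    shards[p].append((src, dst))
                    break
        for shard in shards:
            shard.sort()                     # inside a shard, edges are sorted by source id
        return shards

    intervals = [(1, 100), (101, 700), (701, 1000), (1001, 10000)]
    print(create_shards([(5, 3), (50, 200), (3, 999), (800, 4), (2, 150)], intervals))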

24

Example Layout

(figure) Shard 1 … Shard 4, one per vertex interval: vertices 1-100, 101-700, 701-1000, 1001-10000. Shard 1 holds the in-edges for vertices 1-100, sorted by source id.

Shards are small enough to fit in memory; shard sizes are balanced.

Shard = in-edges for an interval of vertices, sorted by source id.

1 Load

2 Compute

3 Write

25

PSW: Loading a Sub-graph

(figure) Shards 1-4 for vertex intervals 1-100, 101-700, 701-1000, 1001-10000; shard 1 holds the in-edges for vertices 1-100, sorted by source id.

Load the sub-graph for vertices 1-100: load all of their in-edges (shard 1) into memory.

What about out-edges? They are arranged in sequence in the other shards.

1 Load

2 Compute

3 Write

26

PSW: Loading a Sub-graph

(figure) Now load the sub-graph for vertices 101-700: all of their in-edges come from shard 2, loaded into memory, while their out-edge blocks are read from a sliding window over each of the other shards and kept in memory.

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P² random accesses on one full pass.

Works well on both SSD and magnetic hard disks!

28

"GAUSS-SEIDEL" / ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work: Julian Shun

29

Synchronous vs Gauss-Seidel

• Bulk-Synchronous Parallel (Jacobi iterations)
– Updates see neighbors' values from the previous iteration. [Most systems are synchronous]

• Asynchronous (Gauss-Seidel iterations)
– Updates see the most recent values.
– GraphLab is asynchronous.
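The difference shows up on the chain example used on the following slides; this is a small illustrative Python sketch (not from the thesis) of one synchronous step versus one left-to-right Gauss-Seidel sweep of minimum-label propagation:

    def jacobi_step(labels):
        """Every vertex reads its neighbors' values from the previous iteration."""
        n = len(labels)
        return [min([labels[i]] +
                    ([labels[i - 1]] if i > 0 else []) +
                    ([labels[i + 1]] if i < n - 1 else []))
                for i in range(n)]

    def gauss_seidel_sweep(labels):
        """Updates are applied in place, so later vertices already see fresh values."""
        n = len(labels)
        for i in range(n):
            if i > 0:
                labels[i] = min(labels[i], labels[i - 1])
            if i < n - 1:
                labels[i] = min(labels[i], labels[i + 1])
        return labels

    print(jacobi_step([1, 2, 5, 3, 4]))         # [1, 1, 2, 3, 3]: one hop per iteration
    print(gauss_seidel_sweep([1, 2, 5, 3, 4]))  # [1, 1, 1, 1, 1]: the minimum sweeps through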

30

PSW runs Gauss-Seidel

(figure) Same layout as before: the sub-graph for vertices 101-700 is loaded from shard 2, with out-edge blocks in memory from the sliding windows over the other shards. Fresh values are already available in the "window" written by the previous phase.

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires on the order of graph-diameter many iterations to propagate the minimum label.

Each vertex chooses the minimum label of its neighbors.

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel: expected number of iterations with a random schedule on a chain graph
= (N − 1) / (e − 1) ≈ 60% of the synchronous case.

Each vertex chooses the minimum label of its neighbors.

33

Open theoretical question

Label Propagation (side length = 100)

Iterations, Synchronous vs. PSW Gauss-Seidel (average, random schedule):
199 vs. ~57; 100 vs. ~29; 298 vs. ~52.

Natural graphs (web, social): graph diameter − 1 vs. ~0.6 x diameter.

Joint work: Julian Shun

Chapter 6

34

PSW & External Memory Algorithms Research

• PSW is a new technique for implementing many fundamental graph algorithms.
– Especially simple (compared to previous work) for directed graph problems: PSW handles both in- and out-edges.

• We propose a new graph contraction algorithm based on PSW.
– Minimum Spanning Forest & Connected Components
• … utilizing the Gauss-Seidel "acceleration".

Joint work: Julian Shun

SEA 2014; Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluation:
• HD vs. SSD
• Striping data across multiple hard drives
• Comparison to an in-memory version
• Bottleneck analysis
• Effect of the number of shards
• Block size and performance

37

GraphChi

• C++ implementation: 8,000 lines of code
– Java implementation also available

• Several optimizations to PSW (see paper)

Source code and examples: http://github.com/graphchi

38

Experiment Setting

• Mac Mini (Apple Inc.)
– 8 GB RAM
– 256 GB SSD, 1 TB hard drive
– Intel Core i5, 2.5 GHz

• Experiment graphs:

Graph          Vertices   Edges   P (shards)   Preprocessing
live-journal   4.8M       69M     3            0.5 min
netflix        0.5M       99M     20           1 min
twitter-2010   42M        1.5B    20           2 min
uk-2007-05     106M       3.7B    40           31 min
uk-union       133M       5.4B    50           33 min
yahoo-web      1.4B       6.6B    50           37 min

39

Comparison to Existing Systems

Notes: comparison results do not include time to transfer the data to the cluster, preprocessing, or the time to load the graph from disk. GraphChi computes asynchronously, while all but GraphLab compute synchronously.

See the paper for more comparisons.

(charts: runtime in minutes)
– PageRank on Twitter-2010 (1.5B edges): Spark, 50 machines vs. GraphChi, Mac Mini
– WebGraph Belief Propagation (U. Kang et al.) on Yahoo-web (6.7B edges): Pegasus / Hadoop, 100 machines vs. GraphChi, Mac Mini
– Matrix Factorization (Alternating Least Squares) on Netflix (0.99B edges): GraphLab v1, 8 cores vs. GraphChi, Mac Mini
– Triangle Counting on twitter-2010 (1.5B edges): Hadoop, 1636 machines vs. GraphChi, Mac Mini

On a Mac Mini, GraphChi can solve as big problems as existing large-scale systems, with comparable performance.

40

PowerGraph Comparison

• PowerGraph / GraphLab 2 outperforms previous systems by a wide margin on natural graphs.
• With 64x more machines, 512x more CPUs:
– Pagerank: 40x faster than GraphChi
– Triangle counting: 30x faster than GraphChi

(OSDI'12)

GraphChi has good performance per CPU.

(figure: GraphLab 2 vs. GraphChi)

41

In-memory Comparison

• Comparison to state-of-the-art: 5 iterations of Pagerank on Twitter (1.5B edges).
• Total runtime comparison to 1-shard GraphChi, with initial load + output write taken into account.

GraphChi (Mac Mini, SSD): 790 secs
Ligra (J. Shun, Blelloch), 40-core Intel E7-8870: 15 secs
Ligra (J. Shun, Blelloch), 8-core Xeon 5550: 230 s + preprocessing 144 s
PSW, in-memory version, 700 shards (see Appendix), 8-core Xeon 5550: 100 s + preprocessing 210 s

However, sometimes a better algorithm is available for in-memory than for external-memory / distributed computation.

42

Scalability Input Size [SSD]

• Throughput: number of edges processed / second.

Conclusion: the throughput remains roughly constant when the graph size is increased.

GraphChi with a hard drive is ~2x slower than with an SSD (if the computational cost is low).

(chart: PageRank throughput on Mac Mini, SSD; performance in edges/sec, up to ~2.5e7, vs. graph size, up to ~7e9 edges, for domain, twitter-2010, uk-2007-05, uk-union, and yahoo-web)

Paper: scalability of other applications.

43

GRAPHCHI-DB (new work)

44

(figure: the overview diagram again, now turning to GraphChi-DB (Partitioned Adjacency Lists): online graph updates, edge and vertex properties, and the graph-query applications such as shortest path, friends-of-friends, induced subgraphs, neighborhood queries, link prediction, graph sampling and traversal)

45

Research Questions

• What if there is a lot of metadata associated with edges and vertices?
• How to do graph queries efficiently, while retaining computational capabilities?
• How to add edges to the graph efficiently?

Can we design a graph database based on GraphChi?

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problems:
• Poor performance with data >> memory
• No/weak support for analytical computation

2) Relational / key-value databases as graph storage

Problems:
• Large indices
• In-edge / out-edge dilemma
• No/weak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review: Edges in Shards

(figure) Shard 1, Shard 2, Shard 3, …, Shard P over vertex-id ranges 0-100, 101-700, …, N; each shard is sorted by source_id.

Edge = (src, dst). Partition by dst.

49

Shard Structure (Basic)

Source   Destination
1        8
1        193
1        76420
3        12
3        872
7        193
7        212
7        89139
…        …

50

Shard Structure (Basic)

Pointer-array:
Source   File offset
1        0
3        3
7        5
…        …

Edge-array (destinations):
8, 193, 76420, 12, 872, 193, 212, 89139, …

Compressed Sparse Row (CSR)

Problem 1: How to find the in-edges of a vertex quickly?
Note: we know the shard, but inside it the in-edges of a vertex are in "random" order (the shard is sorted by source, not destination).
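A minimal sketch of the CSR-style out-edge lookup inside one shard (Python; the arrays are toy stand-ins for the pointer-array and edge-array, not GraphChi-DB's on-disk format):

    import bisect

    sources = [1, 3, 7]                  # sorted source ids that have edges in this shard
    offsets = [0, 3, 5]                  # offset of each source's first edge in the edge-array
    edge_array = [8, 193, 76420,         # destinations of source 1
                  12, 872,               # destinations of source 3
                  193, 212, 89139]       # destinations of source 7

    def out_edges_in_shard(src):
        i = bisect.bisect_left(sources, src)
        if i == len(sources) or sources[i] != src:
            return []                    # this shard holds no out-edges of src
        end = offsets[i + 1] if i + 1 < len(sources) else len(edge_array)
        return edge_array[offsets[i]:end]

    print(out_edges_in_shard(3))         # [12, 872]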

51

PAL In-edge Linkage

(figure: the same pointer-array and edge-array as on the previous slide)

52

PAL In-edge Linkage

Pointer-array:
Source   File offset
1        0
3        3
7        5
…        …

Edge-array (destination, link to next in-edge of the same destination):
8: 3339;  193: 3;  76420: 1092;  12: 289;  872: 40;  193: 2002;  212: 12;  89139: 22;  …

+ Index to the first in-edge of each vertex in the interval.

Augmented linked list for in-edges.

Problem 2: How to find out-edges quickly?
Note: sorted inside a shard, but partitioned across all shards.
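A hedged sketch of how those links are followed (Python; the link values below are made up for the example and are not the numbers shown in the figure):

    edge_src = [1, 1, 1, 3, 3, 7, 7, 7]                   # source of each edge, as above
    edge_dst = [8, 193, 76420, 12, 872, 193, 212, 89139]  # destination of each edge
    next_in  = [-1, 5, -1, -1, -1, -1, -1, -1]            # offset of next in-edge of same dst, -1 = end
    first_in = {8: 0, 193: 1, 76420: 2, 12: 3, 872: 4, 212: 6, 89139: 7}  # first in-edge per vertex

    def in_edges(dst):
        """Collect the sources of dst by walking its linked list of in-edges."""
        result, offset = [], first_in.get(dst, -1)
        while offset != -1:
            result.append(edge_src[offset])
            offset = next_in[offset]
        return result

    print(in_edges(193))   # [1, 7]: both in-edges found without scanning the shard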

53

PAL: Out-edge Queries

(figure: the pointer-array [source, file offset] and the edge-array [destination, next-in-offset] from the previous slides)

Problem: the pointer-array can be big, O(V), and binary search on disk is slow.

Option 1: Sparse index (in memory); the figure shows sampled entries 256, 512, 1024, …
Option 2: Delta-coding with a unary code (Elias-Gamma), kept completely in memory.
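For Option 2, a small sketch of Elias-Gamma coding applied to the gaps between consecutive source ids (Python; the slide does not give GraphChi-DB's exact bit layout, so this only illustrates the idea of keeping the pointer-array compressed in memory):

    def elias_gamma(n):
        """Unary length prefix followed by the binary form of a positive integer."""
        b = bin(n)[2:]
        return "0" * (len(b) - 1) + b

    def encode_gaps(sorted_ids):
        bits, prev = [], 0
        for x in sorted_ids:
            bits.append(elias_gamma(x - prev))   # gaps in a sorted id list are >= 1
            prev = x
        return "".join(bits)

    print(encode_gaps([1, 3, 7]))   # gaps 1, 2, 4 -> '1' + '010' + '00100'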

54

Experiment: Indices

Median latency, Twitter graph, 1.5B edges.

55

Queries: I/O costs

In-edge query: only one shard.
Out-edge query: each shard that has edges.

Trade-off: more shards means better locality for in-edge queries, worse for out-edge queries.

Edge Data & Searches

(figure) Shard X: the adjacency file plus a separate column file per edge attribute, e.g. 'weight' [float], 'timestamp' [long], 'belief' (factor); edge (i) sits at the same position i in every column.

No foreign key is required to find edge data (fixed size).

Note: vertex values are stored similarly.

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merging requires loading the existing shard from disk; every edge is rewritten on each merge.

Does not scale: number of rewrites ≈ data size / buffer size = O(E).

60

Log-Structured Merge-tree (LSM)
Ref: O'Neil, Cheng et al. (1996)

(figure) New edges go to the top shards with in-memory buffers (LEVEL 1, youngest; one shard per interval 1 … P). Full shards are merged downstream into LEVEL 2 (shards covering intervals 1-4, …, (P-4)-P) and eventually into LEVEL 3 (oldest; intervals 1-16, …).

In-edge query: one shard on each level.
Out-edge query: all shards.
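A toy sketch of the downstream merge (Python; plain sorted lists stand in for shard files, and the capacity and growth factor are invented for the example):

    BUFFER_CAPACITY = 4
    GROWTH = 4                       # each level holds GROWTH times more than the previous one
    levels = [[], [], []]            # level 0 = youngest; a real level is a set of P shards
    buffer = []

    def insert_edge(edge):
        buffer.append(edge)
        if len(buffer) >= BUFFER_CAPACITY:
            flush(0, sorted(buffer))
            buffer.clear()

    def flush(level, run):
        merged = sorted(levels[level] + run)      # merge with what is already on this level
        if level + 1 < len(levels) and len(merged) > BUFFER_CAPACITY * GROWTH ** (level + 1):
            levels[level] = []
            flush(level + 1, merged)              # push the merged run further downstream
        else:
            levels[level] = merged

    for e in [(5, 3), (1, 2), (9, 7), (2, 8), (4, 4)]:
        insert_edge(e)
    print(levels, buffer)            # four edges flushed to level 0, one still buffered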

61

Experiment Ingest

(chart: number of edges ingested, up to 1.5e9, vs. time in seconds, up to ~16,000, for GraphChi-DB with LSM vs. GraphChi-No-LSM)

62

Advantages of PAL

• Only sparse and implicit indices
– The pointer-array usually fits in RAM with Elias-Gamma coding; small database size.

• Columnar data model
– Load only the data you need.
– Graph structure is separate from data.
– Property graph model.

• Great insertion throughput with LSM
– The tree can be adjusted to match the workload.

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala
• Queries & computation
• Online database

All experiments shown in this talk were done on a Mac Mini (8 GB RAM, SSD).

Source code and examples: http://github.com/graphchi

65

Comparison Database Size

Baseline: 4 + 4 bytes / edge.

(chart: database file size for the twitter-2010 graph, 1.5B edges; compared: MySQL (data + indices), Neo4j, GraphChi-DB, and the baseline; axis 0-70)

66

Comparison Ingest

System                  Time to ingest 1.5B edges
GraphChi-DB (ONLINE)    1 hour 45 mins
Neo4j (batch)           45 hours
MySQL (batch)           3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes.

Comparison: Friends-of-Friends Query

See thesis for the shortest-path comparison.

(charts: latency percentiles over 100K random queries)

Small graph (68M edges), 50th percentile: GraphChi-DB 0.379 ms vs. Neo4j 0.127 ms.
Small graph (68M edges), 99th percentile: GraphChi-DB 6.653 ms vs. Neo4j 8.078 ms.
Big graph (1.5B edges), 50th percentile: GraphChi-DB 22.4 ms, Neo4j 759.8 ms, MySQL 5.9 ms.
Big graph (1.5B edges), 99th percentile: GraphChi-DB 1264 ms, Neo4j 1631 ms, MySQL 4776 ms.

68

LinkBench: Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
– But only single-hop queries ("friends")
– 8 different operations, mixed workload
– Best performance with 64 parallel threads

• Each edge and vertex has:
– Version, timestamp, type, random string payload

                           GraphChi-DB (Mac Mini)   MySQL + FB patch, server, SSD-array, 144 GB RAM
Edge-update (95p)          22 ms                    25 ms
Edge-get-neighbors (95p)   18 ms                    9 ms
Avg throughput             2,487 req/s              11,029 req/s
Database size              350 GB                   1.4 TB

See full results in the thesis.

69

Summary of Experiments

• Efficient for mixed read/write workloads
– See the Facebook LinkBench experiments in the thesis.
– The LSM-tree trades off read performance (but can be adjusted).

• State-of-the-art performance for graphs that are much larger than RAM
– Neo4j's linked-list data structure is good for RAM.
– DEX performs poorly in practice.

More experiments in the thesis.

70

Discussion

Greater Impact / Hindsight

Future Research Questions

71

GREATER IMPACT

72

Impact: "Big Data" Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar).
– Two major direct descendant papers in top conferences:
• X-Stream, SOSP 2013
• TurboGraph, KDD 2013

• Challenging the mainstream:
– You can do a lot on just a PC; focus on the right data structures and computational models.

73

Impact: Users

• The GraphChi variants (C++, Java, -DB) have gained a lot of users.
– Currently ~50 unique visitors / day.

• Enables 'everyone' to tackle big graph problems.
– Especially the recommender toolkit (by Danny Bickson) has been very popular.
– Typical users: students, non-systems researchers, small companies…

74

Impact: Users (cont.)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research: it is too expensive.

Using GraphChi, I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research."

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for?

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models.

1. Changing the value of an edge (both in- and out-edges).
2. Computation processes the whole graph, or most of it, on each iteration.
3. Random access to all of a vertex's edges.
– Vertex-centric vs. edge-centric.
4. Async / Gauss-Seidel execution.

(figure: edge between vertices u and v)

77

GraphChi Not Good For

• Very large vertex state.
• Traversals and two-hop dependencies.
– Or dynamic scheduling (such as Splash BP).
• High-diameter graphs, such as planar graphs.
– Unless the computation itself has short-range interactions.
• Very large number of iterations.
– Neural networks.
– LDA with Collapsed Gibbs sampling.
• No support for implicit graph structure.

+ Single-PC performance is limited.

78

(figure: the full overview diagram once more, summarizing everything built on PSW and PAL: GraphChi, GraphChi-DB, DrunkardMob, GraphChi^2, the graph-computation applications, and the graph-query applications listed earlier)

Versatility of PSW and PAL

79

Future Research Directions

• Distributed setting
1. Distributed PSW (one shard / node)?
   1. PSW is inherently sequential.
   2. Low bandwidth in the Cloud.
2. Co-operating GraphChi(-DB)'s connected with a Parameter Server?

• New graph programming models and tools
– Vertex-centric programming is sometimes too local; for example, two-hop interactions and many traversals are cumbersome.
– Abstractions for learning graph structure? Implicit graphs?
– Hard to debug, especially async. Better tools needed.

• Graph-aware optimizations to GraphChi-DB
– Buffer management
– Smart caching
– Learning the configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY?

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM.
– Good: works with very little memory.
– Bad: does not benefit from more memory.

• Simple trick: cache some data.

• Many graph algorithms have O(V) state.
– The update function accesses neighbor vertex state.
• Standard PSW: 'broadcast' the vertex value via edges.
• Semi-external: store vertex values in memory.
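A small sketch of the semi-external idea (Python; illustrative only, with an assumed min-label style update): the per-vertex state is a plain O(V) array in RAM, and only the edges are streamed from disk.

    from array import array

    num_vertices = 1_000_000
    value = array("f", range(num_vertices))   # O(V) state: ~4 MB of vertex values kept in RAM

    def apply_edge(src, dst):
        # Neighbor state is read straight from the in-memory array instead of
        # being "broadcast" through edge values as in standard PSW.
        if value[src] < value[dst]:
            value[dst] = value[src]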

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
– But not enough to store the whole graph.

(figure: the graph and the PSW working memory live on disk; the computational state, e.g. Computation 1 state and Computation 2 state, lives in RAM)

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
– Store billions of random-walk states in RAM.

• Multiple Breadth-First Searches
– Analyze neighborhood sizes by starting hundreds of random BFSes.

• Compute many different recommender algorithms in parallel (or with different parameterizations)
– See Mayank Mohta and Shu-Hao Yu's Master's project.

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin) – UAI 2010
Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors) – VLDB 2012
GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch) – OSDI 2012
DrunkardMob: Billions of Random Walks on Just a PC – ACM RecSys 2013
Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch) – SEA 2014
GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin) – submitted, arXiv
Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin) – ICML 2011

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation: GraphChi.
• Extended PSW to design Partitioned Adjacency Lists, to build a scalable graph database: GraphChi-DB.
• Proposed DrunkardMob for simulating billions of random walks in parallel.
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms: a new approach for external-memory graph algorithms research.

Thank You

87

ADDITIONAL SLIDES

88

Economics

                    GraphChi (40 Mac Minis)   PowerGraph (64 EC2 cc1.4xlarge)
Investment          67,320 $                  -
Operating costs:
  Per node / hour   0.03 $                    1.30 $
  Cluster / hour    1.19 $                    52.00 $
  Daily             28.56 $                   1,248.00 $

It takes about 56 days to recoup the Mac Mini investment.

Assumptions: Mac Mini 85 W (typical servers: 500-1000 W); most expensive US energy, 35c / kWh.

Equal-throughput configurations (based on OSDI'12).
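A quick check of the payback figure, using only the numbers in the table above:

    investment  = 67_320.00   # 40 Mac Minis, USD
    daily_minis = 28.56       # operating cost per day, USD
    daily_ec2   = 1_248.00    # equal-throughput EC2 cluster per day, USD
    print(investment / (daily_ec2 - daily_minis))   # ~55 days, matching the quoted ~56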

89

PSW for In-memory Computation

• External-memory setting
– Slow memory = hard disk / SSD
– Fast memory = RAM

• In-memory
– Slow = RAM
– Fast = CPU caches

(figure: small (fast) memory of size M, large (slow) memory of 'unlimited' size, block transfers of size B between them, CPU attached to the fast memory)

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

Julian Shun, Guy Blelloch: PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
– But needs twice the amount of space.
• Async / Gauss-Seidel helps with high-diameter graphs.
• Some algorithms converge much better asynchronously.
– Loopy BP: see Gonzalez et al. (2009)
– Also Bertsekas & Tsitsiklis, Parallel and Distributed Computation (1989)
• Asynchronous is sometimes difficult to reason about and debug.
• Asynchronous can be used to implement BSP.

I/O Complexity

• See the paper for a theoretical analysis in the Aggarwal-Vitter I/O model.
– Worst case is only 2x the best case.

• Intuition:

(figure: intervals 1 … P and shards 1 … P over vertex ids 1 … |V|, with example vertices v1 and v2)

An edge inside one interval is loaded from disk only once per iteration; an edge spanning two intervals is loaded twice per iteration.

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise they would require too many iterations.
• Per-iteration cost is not much affected.
• Can we optimize partitioning?
– Could help thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort.
– Likely too expensive to do on a single PC.

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., the WebGraph Framework) can compress edges to 3-4 bits/edge (web), ~10 bits/edge (social).
– But they require graph partitioning, which requires a lot of memory.
– Compression of large graphs can take days (personal communication).
• Compression is problematic for evolving graphs and associated data.
• GraphChi can be used to compress graphs.
– Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases.
– GOOD, GraphDB, HyperGraphDB…
– Focus was on modeling; graph storage on top of a relational DB or key-value store.
• RDF databases
– Most do not use graph storage, but store triples as relations + use indexing.
• Modern solutions have proposed graph-specific storage:
– Neo4j: doubly linked list
– TurboGraph: adjacency list chopped into pages
– DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont.)

• GraphChi load time 9 hours; FB's 12 hours.
• GraphChi database about 250 GB; FB > 1.4 terabytes.
– However, about 100 GB is explained by different variable-data (payload) sizes.
• Facebook: MySQL via JDBC; GraphChi: embedded.
– But MySQL is native code, GraphChi-DB is Scala (JVM).
• Important: CPU-bound bottleneck in sorting the results for high-degree vertices.

LinkBench: GraphChi-DB performance vs. size of DB

(chart: requests / sec, 0-10,000, vs. number of nodes, 1e7 to 1e9)

Why not even faster when everything fits in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory extension. [SSDAlloc, NSDI'11]
   – Too many small objects; need millions / sec.
2. Compress the graph structure to fit into RAM. [WebGraph framework]
   – Associated values do not compress well and are mutated.
3. Cluster the graph and handle each cluster separately in RAM.
   – Expensive; the number of inter-cluster edges is big.
4. Caching of hot nodes.
   – Unpredictable performance.

Number of Shards

• If P is in the "dozens", there is not much effect on performance.

Multiple hard drives (RAID-ish)

• GraphChi supports striping shards across multiple disks: parallel I/O.

(chart: seconds / iteration for Pagerank and Connected Components with 1, 2, and 3 hard drives; y-axis 0-3000)

Experiment on a 16-core AMD server (from year 2007).

Bottlenecks

(chart: Connected Components on Mac Mini SSD, runtime with 1, 2, and 4 threads, split into disk I/O, graph construction, and executing updates; y-axis 0-2500 seconds)

• The cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD.
– Graph construction requires a lot of random access in RAM; memory bandwidth becomes a bottleneck.

Bottlenecks: Multicore

Experiment on a MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution.
• GraphChi saturates SSD I/O with 2 threads.

(charts: runtime in seconds, split into loading and computation, vs. number of threads (1, 2, 4), for Matrix Factorization (ALS), y-axis 0-160, and Connected Components, y-axis 0-1000)

104

In-memory vs Disk

105

Experiment: Query latency

See thesis for an I/O cost analysis of in/out queries.

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S.
• Very fast in GraphChi-DB.
– Sufficient to query for out-edges.
– Parallelizes well: multi-out-edge-query.
• Can be used for statistical graph analysis.
– Sample induced neighborhoods, induced FoF neighborhoods from the graph.
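A hedged sketch of that query pattern (Python; query_out_edges is a hypothetical stand-in for the database's out-edge query, here backed by a toy dictionary):

    def induced_subgraph(S, query_out_edges):
        """Keep exactly the edges whose endpoints are both in S."""
        S = set(S)
        return [(v, w) for v in S for w in query_out_edges(v) if w in S]

    adjacency = {1: [2, 5], 2: [3], 3: [1], 5: [6]}
    print(induced_subgraph({1, 2, 3}, lambda v: adjacency.get(v, [])))
    # edges (1, 2), (2, 3) and (3, 1), in some order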

Vertices / Nodes

• Vertices are partitioned similarly to edges.
– Similar "data shards" for columns.
• Lookup / update of vertex data is O(1).
• No merge tree here; vertex files are "dense".
– A sparse structure could be supported.

(figure: vertex-id ranges 1 … a, a+1 … 2a, …, up to Max-id)

ID-mapping

• Vertex IDs are mapped to internal IDs to balance the shards.
– Interval length: constant a.

(figure: original IDs 0, 1, 2, …, 255 map to internal IDs 0, a, 2a, …; original IDs 256, 257, 258, …, 511 map to 1, a+1, 2a+1, …)
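One way to read the figure is the round-robin map below (Python); the formula is this editor's interpretation of the slide, not taken from the GraphChi-DB source, with P the number of intervals and a the interval length:

    def to_internal(orig, P, a):
        """Spread consecutive original ids across the P intervals."""
        return (orig % P) * a + orig // P

    def to_original(internal, P, a):
        return (internal % a) * P + internal // a

    P, a = 256, 1000
    print(to_internal(0, P, a), to_internal(1, P, a), to_internal(256, P, a))  # 0 1000 1
    assert to_original(to_internal(4711, P, a), P, a) == 4711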

109

What if we have a Cluster?

(figure: the same setup as before, the graph and PSW working memory on disk and the computation states in RAM, on each machine of the cluster)

Trade latency for throughput!

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications…
2. … which is caused by a lack of good data available to academics: big graphs with metadata.
– Industry co-operation? But a problem with reproducibility.
– Also, it is hard to ask good questions about graphs (especially with just the structure).
3. Too much focus on performance. More important to enable "extracting value".

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())


Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java, 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems (this talk): "paging" from disk is costly.


Random walks in GraphChi

• DrunkardMob algorithm
– Reverse thinking:

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk.

Page 2: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

2

Research Fields

This thesis

Machine Learning

Data Mining

Graph analysis

and mining

External memory

algorithms research

Systems research

Databases

Motivation and applications

Research contributions

3

Why Graphs

Large-Scale Graph Computation on Just a PC

4

BigData with Structure BigGraph

social graph social graph follow-graph consumer-products graph

user-movie ratings graph

DNA interaction

graph

WWWlink graph

Communication networks (but ldquoonly 3 hopsrdquo)

5

Why on a single machine

Canrsquot we just use the Cloud

Large-Scale Graph Computation on a Just a PC

6

Why use a cluster

Two reasons1 One computer cannot handle my graph problem

in a reasonable time

2 I need to solve the problem very fast

7

Why use a cluster

Two reasons1 One computer cannot handle my graph problem

in a reasonable time

2 I need to solve the problem very fast

Our work expands the space of feasible problems on one machine (PC)- Our experiments use the same graphs or bigger than previous papers on distributed graph computation (+ we can do Twitter graph on a laptop)

Our work raises the bar on required performance for a ldquocomplicatedrdquo system

8

Benefits of single machine systems

Assuming it can handle your big problemshellip1 Programmer productivityndash Global state debuggershellip

2 Inexpensive to install administer less power

3 Scalabilityndash Use cluster of single-machine systems

to solve many tasks in parallel Idea Trade latency

for throughputlt 32K bitssec

9

Computing on Big Graphs

Large-Scale Graph Computation on Just a PC

10

Big Graphs = Big Data

Data size 140 billion connections

asymp 1 TB

Not a problem

Computation

Hard to scale

Twitter network visualization by Akshay Java 2009

11

Research Goal

Compute on graphs with billions of edges in a reasonable time on a single PC

ndash Reasonable = close to numbers previously reported for distributed systems in the literature

Experiment PC Mac Mini (2012)

12

Terminology

bull (Analytical) Graph ComputationndashWhole graph is processed typically for

several iterations vertex-centric computation

ndash Examples Belief Propagation Pagerank Community detection Triangle Counting Matrix Factorization Machine Learninghellip

bull Graph Queries (database)ndash Selective graph queries (compare to SQL

queries)ndash Traversals shortest-path friends-of-

friendshellip

13

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Thesis statement

The Parallel Sliding Windows algorithm and the Partitioned Adjacency Lists data structure enable

computation on very large graphs in external memory on just a personal computer

14

DISK-BASED GRAPH COMPUTATION

15

GraphChi (Parallel Sliding Windows)

Batchcomp

Evolving graph

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Matrix Factorization

Loopy Belief PropagationCo-EM

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

DrunkardMob Parallel Random walk simulation

GraphChi^2

Incremental comp

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Graph sampling

Graph Queries

Graph traversal

16

Computational Model

bull Graph G = (V E) ndash directed edges e =

(source destination)ndash each edge and vertex

associated with a value (user-defined type)

ndash vertex and edge values can be modifiedbull (structure modification also

supported) Data

Data Data

Data

DataDataData

DataData

Data

A Be

Terms e is an out-edge of A and in-edge of B

17

Data

Data Data

Data

DataDataData

Data

Data

Data

Vertex-centric Programming

bull ldquoThink like a vertexrdquobull Popularized by the Pregel and GraphLab

projectsndash Historically systolic computation and the Connection Machine

MyFunc(vertex) modify neighborhood Data

Data Data

Data

Data

function Pagerank(vertex) insum = sum(edgevalue for edge in vertexinedges) vertexvalue = 085 + 015 insum foreach edge in vertexoutedges edgevalue = vertexvalue vertexnum_outedges

18

Computational Setting

ConstraintsA Not enough memory to store the whole

graph in memory nor all the vertex values

B Enough memory to store one vertex and its edges w associated values

19

The Main Challenge of Disk-based Graph Computation

Random Access

~ 100K reads sec (commodity)~ 1M reads sec (high-end arrays)

ltlt 5-10 M random edges sec to achieve ldquoreasonable performancerdquo

20

Random Access Problem

File edge-values

A in-edges A out-edges

A B

B in-edges B out-edges

x

Moral You can either access in- or out-edges sequentially but not both

Random write

Random readProcessing sequentially

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

bull PSW processes the graph one sub-graph a time

bull In one iteration the whole graph is processedndash And typically next iteration is started

1 Load

2 Compute

3 Write

23

bull Vertices are numbered from 1 to nndash P intervalsndash sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 10000100 700

1 Load

2 Compute

3 Write

ltltpartition-bygtgt

source destinationedge

In shards edges sorted by source

24

Example Layout

Shard 1

Shards small enough to fit in memory balance size of shards

Shard in-edges for interval of vertices sorted by source-id

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Shard 2 Shard 3 Shard 4Shard 1

1 Load

2 Compute

3 Write

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for?

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models
1. Changing value of an edge (both in- and out)
2. Computation: process whole or most of the graph on each iteration
3. Random access to all of a vertex's edges
– Vertex-centric vs. edge-centric
4. Async / Gauss-Seidel execution

[Diagram: edge between vertices u and v]

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
– Or dynamic scheduling (such as Splash BP)
• High-diameter graphs, such as planar graphs
– Unless the computation itself has short-range interactions
• Very large number of iterations
– Neural networks
– LDA with Collapsed Gibbs sampling
• No support for implicit graph structure

+ Single PC performance is limited

78

Versatility of PSW and PAL

[Diagram: GraphChi (Parallel Sliding Windows) and GraphChi-DB (Partitioned Adjacency Lists), with batch, incremental and evolving-graph computation, online graph updates, DrunkardMob parallel random walk simulation, and GraphChi^2. Graph computation: Triangle Counting, Item-Item Similarity, PageRank, SALSA, HITS, Graph Contraction, Minimum Spanning Forest, k-Core, Weakly/Strongly Connected Components, Community Detection, Label Propagation, Multi-BFS, Matrix Factorization, Loopy Belief Propagation, Co-EM. Graph queries: Shortest Path, Friends-of-Friends, Induced Subgraphs, Neighborhood query, Edge and vertex properties, Link prediction, Graph sampling, Graph traversal.]

79

Future Research Directions

• Distributed setting
1. Distributed PSW (one shard / node)
   1. PSW is inherently sequential
   2. Low bandwidth in the Cloud
2. Co-operating GraphChi(-DB)'s connected with a Parameter Server
• New graph programming models and tools
– Vertex-centric programming sometimes too local: for example, two-hop interactions and many traversals are cumbersome
– Abstractions for learning graph structure, implicit graphs
– Hard to debug, especially async; better tools needed
• Graph-aware optimizations to GraphChi-DB
– Buffer management
– Smart caching
– Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY?

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
– Good: works with very little memory
– Bad: does not benefit from more memory
• Simple trick: cache some data
• Many graph algorithms have O(V) state
– Update function accesses neighbor vertex state
  • Standard PSW: 'broadcast' vertex value via edges
  • Semi-external: store vertex values in memory (see the sketch below)
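As a concrete illustration of the semi-external idea, here is a sketch (illustrative only; stream_shard_edges is a hypothetical helper that streams (source, destination) pairs from the on-disk shards, not part of GraphChi's API): the per-vertex labels of a connected-components computation fit in one in-memory array, while the O(E) edge data stays on disk.

import numpy as np

def semi_external_min_label(num_vertices, shard_files, max_iters=20):
    # O(V) state in RAM: one label per vertex; edges are streamed from disk.
    labels = np.arange(num_vertices, dtype=np.int64)
    for _ in range(max_iters):
        changed = False
        for src, dst in stream_shard_edges(shard_files):   # hypothetical helper
            m = min(labels[src], labels[dst])
            if labels[src] != m:
                labels[src] = m; changed = True            # in-place update, Gauss-Seidel style
            if labels[dst] != m:
                labels[dst] = m; changed = True
        if not changed:
            break      # converged: labels identify weakly connected components
    return labels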

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
– But not enough to store the whole graph

[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state) in RAM]

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
– Store billions of random walk states in RAM
• Multiple Breadth-First-Searches
– Analyze neighborhood sizes by starting hundreds of random BFSes (see the sketch below)
• Compute in parallel many different recommender algorithms (or with different parameterizations)
– See Mayank Mohta, Shu-Hao Yu's Master's project
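For the multi-BFS example, a sketch along the same lines (same hypothetical stream_shard_edges helper as above; not GraphChi code): one boolean per (vertex, BFS source) is the O(V) state kept in RAM, and each pass over the on-disk edges advances all BFS frontiers by one hop.

import numpy as np

def multi_bfs_reach(num_vertices, shard_files, sources, num_hops=3):
    # reached[v, k] is True once vertex v is within num_hops of sources[k].
    reached = np.zeros((num_vertices, len(sources)), dtype=bool)
    for k, s in enumerate(sources):
        reached[s, k] = True
    for _ in range(num_hops):
        for src, dst in stream_shard_edges(shard_files):   # hypothetical helper
            reached[dst] |= reached[src]                   # advance every BFS at once
    return reached.sum(axis=0)                             # neighborhood size per source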

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin) - UAI 2010

Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors) - VLDB 2012

GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch) - OSDI 2012

DrunkardMob: Billions of Random Walks on Just a PC - ACM RecSys 2013

Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch) - SEA 2014

GraphChi-DB: Simple Design for a Scalable Graph Database - on Just a PC (with C. Guestrin) - (submitted, arXiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin) - ICML 2011

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external memory graph computation: GraphChi
• Extended PSW to design Partitioned Adjacency Lists, to build a scalable graph database: GraphChi-DB
• Proposed DrunkardMob for simulating billions of random walks in parallel
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms: a new approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                   GraphChi (40 Mac Minis)    PowerGraph (64 EC2 cc1.4xlarge)
Investments        67,320 $                   -
Operating costs:
  Per node / hour  0.03 $                     1.30 $
  Cluster / hour   1.19 $                     52.00 $
  Daily            28.56 $                    1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions:
- Mac Mini: 85 W (typical servers: 500-1000 W)
- Most expensive US energy: 35c / kWh

Equal-throughput configurations (based on OSDI'12)
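As a quick sanity check (my arithmetic from the table above, not from the slides), the payback time is the investment divided by the difference in daily operating cost, which is consistent with the roughly 56 days quoted:

\[
\frac{67{,}320\ \$}{(1{,}248.00 - 28.56)\ \$/\text{day}} \approx 55\ \text{days}
\]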

89

PSW for In-memory Computation

• External memory setting
– Slow memory = hard disk / SSD
– Fast memory = RAM
• In-memory
– Slow = RAM
– Fast = CPU caches

[Diagram: small (fast) memory of size M attached to the CPU; large (slow) memory of size 'unlimited'; data moves in block transfers of size B]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra*, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including preprocessing steps of both systems.

* Julian Shun, Guy Blelloch, PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
– But needs twice the amount of space
• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
– Loopy BP, see Gonzalez et al. (2009)
– Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)
• Asynchronous sometimes difficult to reason about and debug
• Asynchronous can be used to implement BSP

I/O Complexity

• See the paper for a theoretical analysis in the Aggarwal-Vitter I/O model
– Worst case is only 2x the best case
• Intuition:

[Diagram: intervals 1..P and shards 1..P over vertices v1, v2, …, |V|]

An edge inside one interval is loaded from disk only once per iteration; an edge spanning two intervals is loaded twice per iteration.
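Stated as a rough formula (a sketch consistent with the slide's intuition, not the paper's exact theorem; B is the block size and P the number of intervals/shards):

\[
\frac{|E|}{B} \;\lesssim\; \text{edge-block transfers per iteration} \;\lesssim\; \frac{2|E|}{B}, \quad \text{plus } O(P^2) \text{ non-sequential reads from the sliding windows.}
\]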

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise too many iterations would be required
• Per-iteration cost not much affected
• Can we optimize partitioning?
– Could help thanks to Gauss-Seidel (faster convergence inside "groups"), topological sort
– Likely too expensive to do on a single PC

94

Graph Compression: Would it Help?

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
– But they require graph partitioning, which requires a lot of memory
– Compression of large graphs can take days (personal communication)
• Compression problematic for evolving graphs and associated data
• GraphChi can be used to compress graphs
– Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• 1990s and 2000s saw interest in object-oriented and graph databases
– GOOD, GraphDB, HyperGraphDB…
– Focus was on modeling, graph storage on top of relational DB or key-value store
• RDF databases
– Most do not use graph storage but store triples as relations + use indexing
• Modern solutions have proposed graph-specific storage
– Neo4j: doubly linked list
– TurboGraph: adjacency list chopped into pages
– DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time 9 hours, FB's 12 hours
• GraphChi database about 250 GB, FB > 1.4 terabytes
– However, about 100 GB is explained by different variable data (payload) size
• Facebook: MySQL via JDBC; GraphChi: embedded
– But MySQL is native code, GraphChi-DB is Scala (JVM)
• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance vs. size of DB

[Chart: requests / sec (0 to 10,000) as a function of the number of nodes (1.0E+07 to 1.0E+09).]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory-extension [SSDAlloc, NSDI'11]
   – Too many small objects; need millions / sec
2. Compress the graph structure to fit into RAM [WebGraph framework]
   – Associated values do not compress well, and are mutated
3. Cluster the graph and handle each cluster separately in RAM
   – Expensive! The number of inter-cluster edges is big
4. Caching of hot nodes
   – Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance

Multiple hard-drives ("RAID-ish")

• GraphChi supports striping shards to multiple disks: parallel I/O

[Bar chart: seconds / iteration (0 to 3000) for Pagerank and Connected Components, with 1, 2, and 3 hard drives.]

Experiment on a 16-core AMD server (from year 2007).

Bottlenecks

[Stacked bar chart: runtime (0 to 2500 seconds) of Connected Components on Mac Mini / SSD with 1, 2, and 4 threads, broken into Disk IO, Graph construction, and Exec. updates.]

• Cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
– Graph construction requires a lot of random access in RAM; memory bandwidth becomes a bottleneck

Bottlenecks: Multicore

Experiment on MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

[Bar charts: runtime (seconds), split into Loading and Computation, for 1, 2, and 4 threads. Left: Matrix Factorization (ALS); right: Connected Components.]

104

In-memory vs Disk

105

Experiment: Query latency

See thesis for I/O cost analysis of in/out queries.

106

Example: Induced Subgraph Queries

• Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S
• Very fast in GraphChi-DB
– Sufficient to query for out-edges (see the sketch below)
– Parallelizes well: multi-out-edge-query
• Can be used for statistical graph analysis
– Sample induced neighborhoods, induced FoF neighborhoods from graph
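A minimal sketch of such a query (using the same hypothetical out_neighbors API as in the friends-of-friends example earlier, not the actual GraphChi-DB interface):

def induced_subgraph(db, S):
    # Keep only edges with both endpoints in S; one out-edge query per vertex in S.
    S = set(S)
    edges = []
    for v in S:
        for w in db.out_neighbors(v):
            if w in S:
                edges.append((v, w))
    return edges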

Vertices / Nodes

• Vertices are partitioned similarly as edges
– Similar "data shards" for columns
• Lookup / update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
– Sparse structure could be supported

[Diagram: vertex id ranges 1..a, a+1..2a, …, up to Max-id]

ID-mapping

• Vertex IDs mapped to internal IDs to balance shards
– Interval length constant a

[Diagram: original IDs 0, 1, 2, …, 255 map to internal IDs 0, a, 2a, …; original IDs 256, 257, 258, …, 511 map to internal IDs 1, a+1, 2a+1, …; a sketch of such a mapping follows below.]
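A sketch of one mapping consistent with the diagram (my reading, not necessarily GraphChi-DB's exact scheme): consecutive original IDs are dealt round-robin across the intervals of length a, so every interval, and hence every shard, receives roughly the same number of vertices.

def to_internal(original_id, num_intervals, a):
    # Original IDs 0, 1, 2, ... go to internal IDs 0, a, 2a, ... (round-robin over intervals).
    interval = original_id % num_intervals
    offset = original_id // num_intervals
    return interval * a + offset

def to_original(internal_id, num_intervals, a):
    interval, offset = divmod(internal_id, a)
    return offset * num_intervals + interval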

109

What if we have a Cluster?

[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state) in RAM.]

Trade latency for throughput!

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. … which is caused by lack of good data available for academics: big graphs with metadata
– Industry co-operation? But problem with reproducibility
– Also, it is hard to ask good questions about graphs (especially with just structure)
3. Too much focus on performance. More important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems: "paging" from disk is costly.

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
– Reverse thinking:

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk!

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM), Ref: O'Neil, Cheng et al. (1996)
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 3: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

3

Why Graphs

Large-Scale Graph Computation on Just a PC

4

BigData with Structure BigGraph

social graph social graph follow-graph consumer-products graph

user-movie ratings graph

DNA interaction

graph

WWWlink graph

Communication networks (but ldquoonly 3 hopsrdquo)

5

Why on a single machine

Canrsquot we just use the Cloud

Large-Scale Graph Computation on a Just a PC

6

Why use a cluster

Two reasons1 One computer cannot handle my graph problem

in a reasonable time

2 I need to solve the problem very fast

7

Why use a cluster

Two reasons1 One computer cannot handle my graph problem

in a reasonable time

2 I need to solve the problem very fast

Our work expands the space of feasible problems on one machine (PC)- Our experiments use the same graphs or bigger than previous papers on distributed graph computation (+ we can do Twitter graph on a laptop)

Our work raises the bar on required performance for a ldquocomplicatedrdquo system

8

Benefits of single machine systems

Assuming it can handle your big problemshellip1 Programmer productivityndash Global state debuggershellip

2 Inexpensive to install administer less power

3 Scalabilityndash Use cluster of single-machine systems

to solve many tasks in parallel Idea Trade latency

for throughputlt 32K bitssec

9

Computing on Big Graphs

Large-Scale Graph Computation on Just a PC

10

Big Graphs = Big Data

Data size 140 billion connections

asymp 1 TB

Not a problem

Computation

Hard to scale

Twitter network visualization by Akshay Java 2009

11

Research Goal

Compute on graphs with billions of edges in a reasonable time on a single PC

ndash Reasonable = close to numbers previously reported for distributed systems in the literature

Experiment PC Mac Mini (2012)

12

Terminology

bull (Analytical) Graph ComputationndashWhole graph is processed typically for

several iterations vertex-centric computation

ndash Examples Belief Propagation Pagerank Community detection Triangle Counting Matrix Factorization Machine Learninghellip

bull Graph Queries (database)ndash Selective graph queries (compare to SQL

queries)ndash Traversals shortest-path friends-of-

friendshellip

13

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Thesis statement

The Parallel Sliding Windows algorithm and the Partitioned Adjacency Lists data structure enable

computation on very large graphs in external memory on just a personal computer

14

DISK-BASED GRAPH COMPUTATION

15

GraphChi (Parallel Sliding Windows)

Batchcomp

Evolving graph

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Matrix Factorization

Loopy Belief PropagationCo-EM

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

DrunkardMob Parallel Random walk simulation

GraphChi^2

Incremental comp

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Graph sampling

Graph Queries

Graph traversal

16

Computational Model

bull Graph G = (V E) ndash directed edges e =

(source destination)ndash each edge and vertex

associated with a value (user-defined type)

ndash vertex and edge values can be modifiedbull (structure modification also

supported) Data

Data Data

Data

DataDataData

DataData

Data

A Be

Terms e is an out-edge of A and in-edge of B

17

Data

Data Data

Data

DataDataData

Data

Data

Data

Vertex-centric Programming

bull ldquoThink like a vertexrdquobull Popularized by the Pregel and GraphLab

projectsndash Historically systolic computation and the Connection Machine

MyFunc(vertex) modify neighborhood Data

Data Data

Data

Data

function Pagerank(vertex) insum = sum(edgevalue for edge in vertexinedges) vertexvalue = 085 + 015 insum foreach edge in vertexoutedges edgevalue = vertexvalue vertexnum_outedges

18

Computational Setting

ConstraintsA Not enough memory to store the whole

graph in memory nor all the vertex values

B Enough memory to store one vertex and its edges w associated values

19

The Main Challenge of Disk-based Graph Computation

Random Access

~ 100K reads sec (commodity)~ 1M reads sec (high-end arrays)

ltlt 5-10 M random edges sec to achieve ldquoreasonable performancerdquo

20

Random Access Problem

File edge-values

A in-edges A out-edges

A B

B in-edges B out-edges

x

Moral You can either access in- or out-edges sequentially but not both

Random write

Random readProcessing sequentially

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

bull PSW processes the graph one sub-graph a time

bull In one iteration the whole graph is processedndash And typically next iteration is started

1 Load

2 Compute

3 Write

23

bull Vertices are numbered from 1 to nndash P intervalsndash sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 10000100 700

1 Load

2 Compute

3 Write

ltltpartition-bygtgt

source destinationedge

In shards edges sorted by source

24

Example Layout

Shard 1

Shards small enough to fit in memory balance size of shards

Shard in-edges for interval of vertices sorted by source-id

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Shard 2 Shard 3 Shard 4Shard 1

1 Load

2 Compute

3 Write

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 4: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

4

BigData with Structure BigGraph

social graph social graph follow-graph consumer-products graph

user-movie ratings graph

DNA interaction

graph

WWWlink graph

Communication networks (but ldquoonly 3 hopsrdquo)

5

Why on a single machine

Canrsquot we just use the Cloud

Large-Scale Graph Computation on a Just a PC

6

Why use a cluster

Two reasons1 One computer cannot handle my graph problem

in a reasonable time

2 I need to solve the problem very fast

7

Why use a cluster

Two reasons1 One computer cannot handle my graph problem

in a reasonable time

2 I need to solve the problem very fast

Our work expands the space of feasible problems on one machine (PC)- Our experiments use the same graphs or bigger than previous papers on distributed graph computation (+ we can do Twitter graph on a laptop)

Our work raises the bar on required performance for a ldquocomplicatedrdquo system

8

Benefits of single machine systems

Assuming it can handle your big problemshellip1 Programmer productivityndash Global state debuggershellip

2 Inexpensive to install administer less power

3 Scalabilityndash Use cluster of single-machine systems

to solve many tasks in parallel Idea Trade latency

for throughputlt 32K bitssec

9

Computing on Big Graphs

Large-Scale Graph Computation on Just a PC

10

Big Graphs = Big Data

Data size 140 billion connections

asymp 1 TB

Not a problem

Computation

Hard to scale

Twitter network visualization by Akshay Java 2009

11

Research Goal

Compute on graphs with billions of edges in a reasonable time on a single PC

ndash Reasonable = close to numbers previously reported for distributed systems in the literature

Experiment PC Mac Mini (2012)

12

Terminology

bull (Analytical) Graph ComputationndashWhole graph is processed typically for

several iterations vertex-centric computation

ndash Examples Belief Propagation Pagerank Community detection Triangle Counting Matrix Factorization Machine Learninghellip

bull Graph Queries (database)ndash Selective graph queries (compare to SQL

queries)ndash Traversals shortest-path friends-of-

friendshellip

13

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Thesis statement

The Parallel Sliding Windows algorithm and the Partitioned Adjacency Lists data structure enable

computation on very large graphs in external memory on just a personal computer

14

DISK-BASED GRAPH COMPUTATION

15

GraphChi (Parallel Sliding Windows)

Batchcomp

Evolving graph

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Matrix Factorization

Loopy Belief PropagationCo-EM

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

DrunkardMob Parallel Random walk simulation

GraphChi^2

Incremental comp

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Graph sampling

Graph Queries

Graph traversal

16

Computational Model

bull Graph G = (V E) ndash directed edges e =

(source destination)ndash each edge and vertex

associated with a value (user-defined type)

ndash vertex and edge values can be modifiedbull (structure modification also

supported) Data

Data Data

Data

DataDataData

DataData

Data

A Be

Terms e is an out-edge of A and in-edge of B

17

Data

Data Data

Data

DataDataData

Data

Data

Data

Vertex-centric Programming

bull ldquoThink like a vertexrdquobull Popularized by the Pregel and GraphLab

projectsndash Historically systolic computation and the Connection Machine

MyFunc(vertex) modify neighborhood Data

Data Data

Data

Data

function Pagerank(vertex) insum = sum(edgevalue for edge in vertexinedges) vertexvalue = 085 + 015 insum foreach edge in vertexoutedges edgevalue = vertexvalue vertexnum_outedges

18

Computational Setting

ConstraintsA Not enough memory to store the whole

graph in memory nor all the vertex values

B Enough memory to store one vertex and its edges w associated values

19

The Main Challenge of Disk-based Graph Computation

Random Access

~ 100K reads sec (commodity)~ 1M reads sec (high-end arrays)

ltlt 5-10 M random edges sec to achieve ldquoreasonable performancerdquo

20

Random Access Problem

File edge-values

A in-edges A out-edges

A B

B in-edges B out-edges

x

Moral You can either access in- or out-edges sequentially but not both

Random write

Random readProcessing sequentially

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

bull PSW processes the graph one sub-graph a time

bull In one iteration the whole graph is processedndash And typically next iteration is started

1 Load

2 Compute

3 Write

23

bull Vertices are numbered from 1 to nndash P intervalsndash sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 10000100 700

1 Load

2 Compute

3 Write

ltltpartition-bygtgt

source destinationedge

In shards edges sorted by source

24

Example Layout

Shards are small enough to fit in memory; balance the size of the shards.
Shard = the in-edges for an interval of vertices, sorted by source-id.

[Figure: Shard 1 holds vertices 1…100, Shard 2 vertices 101…700, Shard 3 vertices 701…1000, Shard 4 vertices 1001…10000; each shard contains the in-edges for its interval, sorted by source_id]

1 Load

2 Compute

3 Write

25

PSW: Loading Sub-graph

Load the subgraph for vertices 1…100: load all of Shard 1's in-edges into memory.
What about out-edges? They are arranged in sequence in the other shards (Shards 2, 3, 4).

[Figure: Shard 1 (in-edges for vertices 1…100, sorted by source_id) is loaded fully; a window at the start of Shards 2-4 supplies the out-edges of vertices 1…100]

1 Load

2 Compute

3 Write

26

PSW: Loading Sub-graph (next interval)

Load the subgraph for vertices 101…700: load all of Shard 2's in-edges into memory.
Out-edge blocks for this interval are read into memory from the other shards.

[Figure: the intervals 1…100, 101…700, 701…1000, 1001…10000 as before; the window over each shard slides forward as the computation moves to the next interval]

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and P large writes for each interval
= P² random accesses over one full pass.

Works well on both SSD and magnetic hard disks! (See the PSW loop sketch below.)

28
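A minimal, self-contained sketch of the PSW load / compute / write schedule. This is a toy in-memory illustration, not GraphChi's actual API; the shard layout, helper names and the update signature are assumptions.

from collections import defaultdict

# Each "shard" p holds the in-edges of interval p, sorted by source id.
def make_shards(edges, intervals):
    """edges: list of [src, dst, value]; intervals: list of (lo, hi) vertex ranges."""
    shards = [[] for _ in intervals]
    for src, dst, val in edges:
        p = next(i for i, (lo, hi) in enumerate(intervals) if lo <= dst <= hi)
        shards[p].append([src, dst, val])
    for s in shards:
        s.sort(key=lambda e: e[0])          # inside a shard, edges sorted by source
    return shards

def psw_iteration(shards, intervals, update):
    for p, (lo, hi) in enumerate(intervals):
        # 1. Load: interval p's in-edges = the whole shard p; its out-edges =
        #    one contiguous "window" of every shard (edges whose src falls in [lo, hi]).
        in_edges, out_edges = defaultdict(list), defaultdict(list)
        for e in shards[p]:
            in_edges[e[1]].append(e)
        for shard in shards:
            for e in shard:                  # a contiguous range, since sorted by src
                if lo <= e[0] <= hi:
                    out_edges[e[0]].append(e)
        # 2. Compute: vertex-centric updates see and modify edge values in place.
        for v in range(lo, hi + 1):
            update(v, in_edges[v], out_edges[v])
        # 3. Write: in the real system the modified blocks are written back to disk;
        #    here the edge lists are shared objects, so the updates are already visible.

With update set to, for example, a Pagerank-style function like the one on the earlier slide, one call to psw_iteration performs one full pass over the graph.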

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

• Bulk-Synchronous Parallel (Jacobi iterations)
  – Updates see neighbors' values from the previous iteration [Most systems are synchronous]
• Asynchronous (Gauss-Seidel iterations)
  – Updates see the most recent values
  – GraphLab is asynchronous

30

PSW runs Gauss-Seidel

[Figure: loading the subgraph for vertices 101…700; Shard 2's in-edges are fully in memory, and the out-edge blocks of the other shards are in memory via sliding windows]

Fresh values in this "window" come from the previous phase.

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph-diameter-many iterations to propagate the minimum label.

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel: expected number of iterations on a random schedule, on a chain graph,
= (N - 1) / (e - 1) ≈ 60% of synchronous

Each vertex chooses minimum label of neighbor

33
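A small self-contained experiment (a toy in Python, not GraphChi code) that shows the effect on a chain of 100 vertices: synchronous (Jacobi) min-label propagation needs about N sweeps, while a Gauss-Seidel sweep that happens to go in the propagation direction finishes in a single pass; a random schedule, as stated above, lands in between at roughly (N - 1) / (e - 1).

def jacobi_iterations(labels):
    labels, n, iters = labels[:], len(labels), 0
    while True:
        new = [min([labels[i]] + ([labels[i - 1]] if i > 0 else []) +
                   ([labels[i + 1]] if i < n - 1 else [])) for i in range(n)]
        iters += 1
        if new == labels:          # converged: no label changed
            return iters
        labels = new

def gauss_seidel_iterations(labels):
    labels, n, iters = labels[:], len(labels), 0
    while True:
        changed = False
        for i in range(n):         # in-order sweep: uses freshly updated values
            m = min([labels[i]] + ([labels[i - 1]] if i > 0 else []) +
                    ([labels[i + 1]] if i < n - 1 else []))
            if m != labels[i]:
                labels[i], changed = m, True
        iters += 1
        if not changed:
            return iters

chain = list(range(1, 101))        # labels 1..100 on a chain; the minimum sits at one end
print(jacobi_iterations(chain), gauss_seidel_iterations(chain))   # ~100 vs. 2 sweeps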

Open theoretical question

Label Propagation (side length = 100)

                                   Synchronous           PSW Gauss-Seidel (average, random schedule)
                                   199                   ~57
                                   100                   ~29
                                   298 iterations        ~52
Natural graphs (web, social):      graph diameter - 1    ~0.6 × diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

• PSW is a new technique for implementing many fundamental graph algorithms
  – Especially simple (compared to previous work) for directed graph problems: PSW handles both in- and out-edges
• We propose a new graph contraction algorithm based on PSW
  – Minimum Spanning Forest & Connected Components
• … utilizing the Gauss-Seidel "acceleration"

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluation:
• HD vs. SSD
• Striping data across multiple hard drives
• Comparison to an in-memory version
• Bottlenecks analysis
• Effect of the number of shards
• Block size and performance

37

GraphChi

• C++ implementation: 8,000 lines of code
  – Java implementation also available
• Several optimizations to PSW (see paper)

Source code and examples: http://github.com/graphchi

38

Experiment Setting

• Mac Mini (Apple Inc.)
  – 8 GB RAM
  – 256 GB SSD, 1 TB hard drive
  – Intel Core i5, 2.5 GHz
• Experiment graphs:

Graph          Vertices   Edges   P (shards)   Preprocessing
live-journal   4.8M       69M     3            0.5 min
netflix        0.5M       99M     20           1 min
twitter-2010   42M        1.5B    20           2 min
uk-2007-05     106M       3.7B    40           31 min
uk-union       133M       5.4B    50           33 min
yahoo-web      1.4B       6.6B    50           37 min

39

Comparison to Existing Systems

Notes: comparison results do not include the time to transfer the data to the cluster, preprocessing, or the time to load the graph from disk. GraphChi computes asynchronously, while all but GraphLab compute synchronously.

See the paper for more comparisons.

[Charts: runtime in minutes, GraphChi on a Mac Mini vs. distributed systems]
• Matrix Factorization (Alt. Least Sqr.), Netflix (0.99B edges): GraphLab v1 (8 cores) vs. GraphChi (Mac Mini); axis 0-12 minutes
• PageRank, Twitter-2010 (1.5B edges): Spark (50 machines) vs. GraphChi (Mac Mini); axis 0-14 minutes
• WebGraph Belief Propagation (U. Kang et al.), Yahoo-web (6.7B edges): Pegasus / Hadoop (100 machines) vs. GraphChi (Mac Mini); axis 0-30 minutes
• Triangle Counting, twitter-2010 (1.5B edges): Hadoop (1636 machines) vs. GraphChi (Mac Mini); axis 0-450 minutes

On a Mac Mini, GraphChi can solve as big problems as existing large-scale systems, with comparable performance.

40

PowerGraph Comparison
• PowerGraph / GraphLab 2 outperforms previous systems by a wide margin on natural graphs
• With 64× more machines, 512× more CPUs:
  – Pagerank: 40x faster than GraphChi
  – Triangle counting: 30x faster than GraphChi

(OSDI'12)

GraphChi has good performance per CPU.

[Chart: GraphLab 2 vs. GraphChi]

41

In-memory Comparison

• Total runtime comparison to 1-shard GraphChi, with initial load + output write taken into account
• Comparison to state-of-the-art: 5 iterations of Pagerank, Twitter (1.5B edges)

  GraphChi, Mac Mini, SSD:                              790 secs
  Ligra (J. Shun, Blelloch), 40-core Intel E7-8870:     15 secs

However, sometimes a better algorithm is available for in-memory than for external-memory / distributed settings:

  Ligra (J. Shun, Blelloch), 8-core Xeon 5550:                            230 s + preproc 144 s
  PSW, in-mem version, 700 shards (see Appendix), 8-core Xeon 5550:       100 s + preproc 210 s

42

Scalability Input Size [SSD]

• Throughput: number of edges processed / second

Conclusion: the throughput remains roughly constant when the graph size is increased.
GraphChi with a hard drive is ~2x slower than with an SSD (if the computational cost is low).

[Chart: PageRank throughput on a Mac Mini with SSD; performance (edges/sec, up to ~2.5e7) vs. graph size (up to ~7e9 edges) for the graphs domain, twitter-2010, uk-2007-05, uk-union, yahoo-web]

Paper: scalability of other applications.

43

GRAPHCHI-DB: New work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

• What if there is a lot of metadata associated with edges and vertices?
• How to do graph queries efficiently, while retaining computational capabilities?
• How to add edges efficiently to the graph?

Can we design a graph database based on GraphChi?

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problems:
• Poor performance with data >> memory
• No/weak support for analytical computation

2) Relational / key-value databases as graph storage

Problems:
• Large indices
• In-edge / out-edge dilemma
• No/weak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

[Figure: Shard 1, Shard 2, Shard 3, …, Shard P over vertex ranges 0…100, 101…700, …, N; inside each shard, edges are sorted by source_id]

Edge = (src, dst). Partition by dst.

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

…         …

50

Shard Structure (Basic)

Destination: 8, 193, 76420, 12, 872, 193, 212, 89139, …          (Edge-array)

Source   File offset
1        0
3        3
7        5
…        …
(Pointer-array)

Compressed Sparse Row (CSR)

Problem 1: How to find the in-edges of a vertex quickly?
Note: We know the shard, but its edges are in random order with respect to destination (they are sorted by source).

51
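A small sketch of how the pointer-array (the CSR offsets) can be built from a shard whose edges are already sorted by source. This is illustrative only; field names are assumptions, not GraphChi-DB's format.

def build_pointer_array(edges_sorted_by_src):
    """edges_sorted_by_src: list of (src, dst). Returns [(src, first_offset), ...]."""
    pointers = []
    for offset, (src, _dst) in enumerate(edges_sorted_by_src):
        if not pointers or pointers[-1][0] != src:
            pointers.append((src, offset))   # offset of this source's first edge
    return pointers

edges = [(1, 8), (1, 193), (1, 76420), (3, 12), (3, 872), (7, 193), (7, 212), (7, 89139)]
print(build_pointer_array(edges))            # [(1, 0), (3, 3), (7, 5)], as in the slide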

PAL In-edge Linkage

Destination: 8, 193, 76420, 12, 872, 193, 212, 89139, …          (Edge-array)

Source   File offset
1        0
3        3
7        5
…        …
(Pointer-array)

52

PAL In-edge Linkage

Destination   Link
8             3339
193           3
76420         1092
12            289
872           40
193           2002
212           12
89139         22
…             …
(Edge-array)

Source   File offset
1        0
3        3
7        5
…        …
(Pointer-array)

+ Index to the first in-edge for each vertex in the interval.
Augmented linked list for in-edges: each edge stores a link to the next in-edge of the same destination (see the sketch below).

Problem 2: How to find out-edges quickly?
Note: Sorted inside a shard, but partitioned across all shards.

53
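A sketch of how an in-edge query can follow this augmented linked list inside one shard, assuming an in-memory index that gives the offset of the first in-edge of each vertex; the names first_in_offset and next_in_offset, and the toy values, are illustrative only.

# edge_array[i] = (destination, next_in_offset); next_in_offset = -1 ends the chain.
def in_edges(edge_array, first_in_offset, vertex):
    offset = first_in_offset.get(vertex, -1)   # index to the first in-edge (kept in memory)
    result = []
    while offset != -1:
        dst, nxt = edge_array[offset]
        assert dst == vertex                   # every edge on the chain shares the destination
        result.append(offset)                  # offsets identify edges; data columns share them
        offset = nxt
    return result

# Toy shard with two in-edges for vertex 193, chained 5 -> 1 -> end.
edge_array = [(8, -1), (193, -1), (76420, -1), (12, -1), (872, -1), (193, 1), (212, -1), (89139, -1)]
first_in = {8: 0, 193: 5, 76420: 2, 12: 3, 872: 4, 212: 6, 89139: 7}
print(in_edges(edge_array, first_in, 193))     # [5, 1]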

Source   File offset
1        0
3        3
7        5
…        …
(Pointer-array)

PAL Out-edge Queries

Problem: the pointer-array can be big, O(V), and binary search on disk is slow.

[Figure: edge-array with (Destination, Next-in-offset) columns as before; a sparse index keeps only some pointer entries, e.g. at positions 256, 512, 1024, …]

Option 1: Sparse index (in-memory).
Option 2: Delta-coding with a unary code (Elias-Gamma); completely in-memory. (See the out-edge query sketch below.)

54
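A sketch of Option 1 for locating a vertex's out-edges within one shard: binary-search a sparse, in-memory sample of the pointer-array, then scan a short range of the full pointer-array. The function names and the sampling step k are assumptions for illustration, not GraphChi-DB's exact scheme.

import bisect

# pointer_array: full list of (source, first_offset), sorted by source (conceptually on disk).
# sparse_index: every k-th entry of it, kept in memory.
def out_edge_range(pointer_array, sparse_index, vertex, k):
    sources = [s for s, _ in sparse_index]
    i = max(bisect.bisect_right(sources, vertex) - 1, 0)    # nearest sampled source <= vertex
    lo = i * k                                               # that sample's position in the full array
    for j in range(lo, min(lo + k, len(pointer_array))):     # short scan instead of a full binary search
        src, first = pointer_array[j]
        if src == vertex:
            # out-edges of `vertex` in this shard: offsets [first, next source's first offset)
            end = pointer_array[j + 1][1] if j + 1 < len(pointer_array) else None
            return (first, end)
    return None                                              # vertex has no out-edges in this shard

pointer_array = [(1, 0), (3, 3), (7, 5), (9, 8), (12, 9)]
sparse_index = pointer_array[::2]                            # keep every 2nd pointer in RAM
print(out_edge_range(pointer_array, sparse_index, 7, 2))     # (5, 8)

A full out-edge query repeats this in every shard, since a vertex's out-edges are spread across all of them; this is the trade-off described on the next slide.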

Experiment Indices

Median latency, Twitter graph, 1.5B edges.

55

Queries: I/O costs
In-edge query: only one shard.
Out-edge query: each shard that has edges.
Trade-off: more shards means better locality for in-edge queries, worse for out-edge queries.

Edge Data amp Searches

[Figure: Shard X consists of an adjacency file plus separate column files for edge data: 'weight' [float], 'timestamp' [long], 'belief' (factor); edge (i) is found at the same position i in each column]

No foreign key is required to find edge data (fixed size).
Note: vertex values are stored similarly.

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merging requires loading the existing shard from disk, and every edge is rewritten on each merge.
Does not scale: the number of rewrites per edge is proportional to data size / buffer size = O(E).

60

Log-Structured Merge-tree

(LSM). Ref: O'Neil, Cheng et al. (1996)

[Figure: LEVEL 1 (youngest) holds the top shards with in-memory buffers, one per interval (interval 1 … interval P); new edges enter here. LEVEL 2 holds on-disk shards covering intervals 1-4, …, (P-4)-P. LEVEL 3 (oldest) holds shards covering intervals 1-16, and so on. Merges flow downstream.]

In-edge query: one shard on each level.
Out-edge query: all shards. (See the LSM ingest sketch below.)

61
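A compact sketch of the log-structured merge idea used for ingest. The buffer size, fan-out of 4 and the merge policy here are illustrative assumptions, not GraphChi-DB's exact parameters.

# Simplified LSM ingest: new edges go to an in-memory buffer; when a level
# fills up, its (sorted) edges are merged downstream into the next level.
class SimpleLSM:
    def __init__(self, buffer_capacity=4, fanout=4, levels=3):
        self.buffer_capacity = buffer_capacity
        self.fanout = fanout
        # level 0 = in-memory buffer; deeper levels hold ever larger sorted runs
        self.levels = [[] for _ in range(levels)]

    def add_edge(self, src, dst):
        self.levels[0].append((src, dst))
        if len(self.levels[0]) >= self.buffer_capacity:
            self._merge_down(0)

    def _merge_down(self, i):
        if i + 1 == len(self.levels):
            return                                   # the oldest level just keeps growing
        self.levels[i + 1].extend(self.levels[i])
        self.levels[i + 1].sort()                    # on disk this is a sequential rewrite
        self.levels[i] = []
        # each level holds roughly fanout times more edges than the one above it
        if len(self.levels[i + 1]) >= self.buffer_capacity * (self.fanout ** (i + 1)):
            self._merge_down(i + 1)

    def edges(self):                                 # a query must consult every level
        return [e for level in self.levels for e in level]

lsm = SimpleLSM()
for e in [(1, 2), (3, 4), (2, 1), (5, 6), (1, 7)]:
    lsm.add_edge(*e)
print(lsm.levels)

Because an edge is only rewritten when its level is merged downstream, the number of rewrites per edge grows logarithmically rather than linearly with the data size, which is what the ingest experiment below measures.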

Experiment Ingest

[Chart: edges ingested (up to 1.5e9) vs. time in seconds (up to ~16,000), for GraphChi-DB with LSM and GraphChi-No-LSM]

62

Advantages of PAL

• Only sparse and implicit indices
  – The pointer-array usually fits in RAM with Elias-Gamma coding; small database size
• Columnar data model
  – Load only the data you need
  – Graph structure is separate from data
  – Property graph model
• Great insertion throughput with LSM
  – The tree can be adjusted to match the workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala
• Queries & Computation
• Online database

All experiments shown in this talk were done on a Mac Mini (8 GB RAM, SSD).

Source code and examples: http://github.com/graphchi

65

Comparison Database Size

Baseline: 4 + 4 bytes / edge.

[Chart: database file size for the twitter-2010 graph (1.5B edges), axis 0-70, comparing MySQL (data + indices), Neo4j, GraphChi-DB, and the baseline]

66

Comparison Ingest

System                  Time to ingest 1.5B edges
GraphChi-DB (ONLINE)    1 hour 45 mins
Neo4j (batch)           45 hours
MySQL (batch)           3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes.

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

[Charts: latency percentiles over 100K random queries]

Small graph (68M edges):
  50th percentile (ms):  GraphChi-DB 0.379,  Neo4j 0.127
  99th percentile (ms):  GraphChi-DB 6.653,  Neo4j 8.078

Big graph (1.5B edges):
  50th percentile (ms):  GraphChi-DB 22.4,   Neo4j 759.8,  MySQL 5.9
  99th percentile (ms):  GraphChi-DB 1264,   Neo4j 1631,   MySQL 4776

68

LinkBench Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
  – But only single-hop queries ("friends")
  – 8 different operations, mixed workload
  – Best performance with 64 parallel threads
• Each edge and vertex has:
  – Version, timestamp, type, random string payload

                           GraphChi-DB (Mac Mini)    MySQL + FB patch, server, SSD-array, 144 GB RAM
Edge-update (95p)          22 ms                     25 ms
Edge-get-neighbors (95p)   18 ms                     9 ms
Avg throughput             2,487 req/s               11,029 req/s
Database size              350 GB                    1.4 TB

See full results in the thesis

69

Summary of Experiments

• Efficient for mixed read/write workloads
  – See the Facebook LinkBench experiments in the thesis
  – The LSM-tree trades off read performance (but can be adjusted)
• State-of-the-art performance for graphs that are much larger than RAM
  – Neo4J's linked-list data structure is good for RAM
  – DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)
  – Two major direct descendant papers in top conferences:
    • X-Stream: SOSP 2013
    • TurboGraph: KDD 2013
• Challenging the mainstream
  – You can do a lot on just a PC: focus on the right data structures and computational models

73

Impact Users

• GraphChi's implementations (C++, Java, -DB) have gained a lot of users
  – Currently ~50 unique visitors / day
• Enables 'everyone' to tackle big graph problems
  – Especially the recommender toolkit (by Danny Bickson) has been very popular
  – Typical users: students, non-systems researchers, small companies…

74

Impact Users (cont)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research: it is too expensive.
Using GraphChi, I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research. (…)"

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models
1. Changing the value of an edge (both in- and out-)
2. Computation: process the whole (or most of the) graph on each iteration
3. Random access to all of a vertex's edges
  – Vertex-centric vs. edge-centric
4. Async / Gauss-Seidel execution

[Figure: edge between vertices u and v]

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
  – Or dynamic scheduling (such as Splash BP)
• High-diameter graphs, such as planar graphs
  – Unless the computation itself has short-range interactions
• Very large number of iterations
  – Neural networks
  – LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

• Distributed setting
  1. Distributed PSW (one shard / node)?
     1. PSW is inherently sequential
     2. Low bandwidth in the Cloud
  2. Co-operating GraphChi(-DB)'s connected with a Parameter Server
• New graph programming models and tools
  – Vertex-centric programming is sometimes too local: for example, two-hop interactions and many traversals are cumbersome
  – Abstractions for learning graph structure? Implicit graphs?
  – Hard to debug, especially async; better tools needed
• Graph-aware optimizations to GraphChi-DB
  – Buffer management
  – Smart caching
  – Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
  – Good: works with very little memory
  – Bad: does not benefit from more memory
• Simple trick: cache some data
• Many graph algorithms have O(V) state
  – The update function accesses neighbor vertex state
    • Standard PSW: 'broadcast' the vertex value via edges
    • Semi-external: store vertex values in memory

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
  – But not enough to store the whole graph

[Figure: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, …) in RAM]

83
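A sketch of the semi-external trick: keep the O(V) vertex-value array in RAM while the edges are still streamed from disk, shard by shard. The file format and helper names below are illustrative assumptions, not GraphChi's on-disk format.

import struct

# Semi-external PageRank-style pass: vertex values live in a RAM array,
# edges are streamed from shard files as packed (src, dst) uint32 pairs.
def semi_external_pass(shard_files, num_vertices, damping=0.85):
    values = [1.0] * num_vertices                    # O(V) state held in memory
    sums = [0.0] * num_vertices
    degree = [0] * num_vertices
    for path in shard_files:                         # first sweep: out-degrees
        with open(path, "rb") as f:
            while chunk := f.read(8):
                src, _dst = struct.unpack("<II", chunk)
                degree[src] += 1
    for path in shard_files:                         # second sweep: accumulate contributions
        with open(path, "rb") as f:
            while chunk := f.read(8):
                src, dst = struct.unpack("<II", chunk)
                sums[dst] += values[src] / max(degree[src], 1)
    return [(1 - damping) + damping * s for s in sums]

Because neighbor values are read from the in-memory array, there is no need to 'broadcast' them over edge files, which is exactly the semi-external variant described above.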

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
  – Store billions of random walk states in RAM
• Multiple Breadth-First Searches
  – Analyze neighborhood sizes by starting hundreds of random BFSes
• Compute in parallel many different recommender algorithms (or with different parameterizations)
  – See Mayank Mohta and Shu-Hao Yu's Master's project

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin) – UAI 2010

Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors) – VLDB 2012

GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch) – OSDI 2012

DrunkardMob: Billions of Random Walks on Just a PC – ACM RecSys 2013

Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch) – SEA 2014

GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin) – (submitted, arXiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin) – ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation: GraphChi
• Extended PSW to design Partitioned Adjacency Lists, to build a scalable graph database: GraphChi-DB
• Proposed DrunkardMob for simulating billions of random walks in parallel
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms: a new approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                     GraphChi (40 Mac Minis)    PowerGraph (64 EC2 cc1.4xlarge)
Investments          67,320 $                   -
Operating costs:
  Per node / hour    0.03 $                     1.30 $
  Cluster / hour     1.19 $                     52.00 $
  Daily              28.56 $                    1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions:
- Mac Mini: 85 W (typical servers: 500-1000 W)
- Most expensive US energy: 35c / kWh

Equal-throughput configurations (based on OSDI'12).

89
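As a quick sanity check on the payback figure (simple arithmetic on the numbers above): the daily operating-cost difference is about 1,248.00 - 28.56 ≈ 1,219 $, and 67,320 $ / 1,219 $ per day ≈ 55 days, i.e. roughly the quoted 56 days.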

PSW for In-memory Computation

• External-memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM
• In-memory
  – Slow = RAM
  – Fast = CPU caches

[Figure: external-memory model; small (fast) memory of size M, large (slow) memory of size 'unlimited', block transfers of size B between them, CPU attached to the fast memory]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.
(Ligra: Julian Shun, Guy Blelloch, PPoPP 2013)

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space
• Async / G-S helps with high-diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP: see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

• See the paper for a theoretical analysis in the Aggarwal-Vitter I/O model
  – Worst case is only 2x the best case
• Intuition:

[Figure: vertices 1…|V| split into interval(1), interval(2), …, interval(P), with shard(1), shard(2), …, shard(P); an example edge (v1, v2)]

An edge within a single interval is loaded from disk only once per iteration.
An edge spanning two intervals is loaded twice per iteration.

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise they would require too many iterations
• Per-iteration cost is not much affected
• Can we optimize partitioning?
  – Could help thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression Would it help

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)
• Compression is problematic for evolving graphs and associated data
• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al., 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases
  – GOOD, GraphDB, HyperGraphDB…
  – Focus was on modeling; graph storage was built on top of a relational DB or key-value store
• RDF databases
  – Most do not use graph storage but store triples as relations + use indexing
• Modern solutions have proposed graph-specific storage
  – Neo4j: doubly linked list
  – TurboGraph: adjacency list chopped into pages
  – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time 9 hours; FB's 12 hours
• GraphChi database about 250 GB; FB > 1.4 terabytes
  – However, about 100 GB is explained by different variable-data (payload) sizes
• Facebook: MySQL via JDBC; GraphChi embedded
  – But MySQL is native code, GraphChi-DB is Scala (JVM)
• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance / size of DB

[Chart: requests / sec (up to ~10,000) vs. number of nodes (1.0e7, 1.0e8, 2.0e8, 1.0e9)]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory extension [SSDAlloc, NSDI'11]
   → Too many small objects; need millions / sec
2. Compress the graph structure to fit into RAM [WebGraph framework]
   → Associated values do not compress well and are mutated
3. Cluster the graph and handle each cluster separately in RAM
   → Expensive: the number of inter-cluster edges is big
4. Caching of hot nodes
   → Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance.

Multiple hard drives (RAID-ish)

• GraphChi supports striping shards to multiple disks: parallel I/O.

[Chart: seconds / iteration (up to ~3000) for Pagerank and Connected Components, with 1, 2, and 3 hard drives]

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

[Chart: Connected Components on a Mac Mini with SSD; runtime (up to ~2500 s) with 1, 2, and 4 threads, broken down into Disk I/O, Graph construction, and Exec. updates]

• The cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM; memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on a MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime in seconds vs. number of threads (1, 2, 4), split into Loading and Computation, for Matrix Factorization (ALS, up to ~160 s) and Connected Components (up to ~1000 s)]

104

In-memory vs Disk

105

Experiment Query latency

See the thesis for an I/O cost analysis of in/out queries.

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S
• Very fast in GraphChi-DB
  – Sufficient to query for out-edges (see the sketch below)
  – Parallelizes well: multi-out-edge query
• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from the graph
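A sketch of why out-edge queries suffice for induced subgraphs, over a toy in-memory out-edge map (in GraphChi-DB this would be the multi-out-edge query instead; names here are illustrative).

# Induced subgraph of vertex set S: query the out-edges of every v in S and keep
# only those whose endpoint is also in S. No in-edge queries are needed, because
# every edge of the induced subgraph has its source in S.
def induced_subgraph(out_edges, S):
    S = set(S)
    return [(v, w) for v in S for w in out_edges.get(v, []) if w in S]

out_edges = {1: [2, 3, 9], 2: [3], 3: [1, 4], 4: [5]}
print(induced_subgraph(out_edges, {1, 2, 3}))   # edges among {1,2,3}: (1,2), (1,3), (2,3), (3,1)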

Vertices / Nodes
• Vertices are partitioned similarly as edges
  – Similar "data shards" for the columns
• Lookup / update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
  – A sparse structure could be supported

[Figure: vertex id space divided into intervals 1…a, a+1…2a, …, up to Max-id]

ID-mapping
• Vertex IDs are mapped to internal IDs to balance the shards
  – Interval length: constant a

[Figure: internal ids 0, 1, 2, …; a, a+1, a+2, …; 2a, 2a+1, …; original IDs 0…255 map into the first interval, 256…511 into the second, and so on, one 256-id block per interval in round-robin order]

109
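One simple way to realize such a balancing translation, given purely as an illustrative assumption (the exact mapping used by GraphChi-DB may differ): consecutive blocks of 256 original IDs are dealt round-robin to the P intervals of length a, so that low, "hot" IDs spread across all shards.

BLOCK = 256

def to_internal(orig_id, a, P):
    """Map an original vertex id to an internal id; interval length a, P intervals."""
    block, pos = divmod(orig_id, BLOCK)
    interval = block % P                      # round-robin the 256-id blocks over intervals
    slot = (block // P) * BLOCK + pos         # where inside that interval the block lands
    return interval * a + slot

def to_original(internal_id, a, P):
    interval, slot = divmod(internal_id, a)
    block_in_interval, pos = divmod(slot, BLOCK)
    return (block_in_interval * P + interval) * BLOCK + pos

a, P = 1 << 20, 4
for v in (0, 255, 256, 511, 1000000):
    assert to_original(to_internal(v, a, P), a, P) == v   # the mapping is invertible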

What if we have a Cluster

[Figure: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, …) in RAM]

Trade latency for throughput.

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. … which is caused by the lack of good data available to academics: big graphs with metadata
  – Industry co-operation? But a problem with reproducibility
  – Also, it is hard to ask good questions about graphs (especially with just the structure)
3. Too much focus on performance. More important to enable "extracting value".

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems: each hop across a partition boundary is costly.
Disk-based "single-machine" graph systems: "paging" from disk is costly.

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
  – Reverse thinking:

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note Need to store only current position of each walk
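A self-contained toy version of the same reverse-thinking idea: walks are bucketed by the interval containing their current vertex, so each step only needs one interval's out-edges in memory at a time. All names here are illustrative, not the DrunkardMob API.

import random

def drunkardmob_step(walks, out_edges, intervals):
    buckets = [[] for _ in intervals]
    for w, v in enumerate(walks):                       # snapshot: walk w sits at vertex v
        p = next(i for i, (lo, hi) in enumerate(intervals) if lo <= v <= hi)
        buckets[p].append(w)
    for p, (lo, hi) in enumerate(intervals):            # process one interval at a time
        subgraph = {v: out_edges.get(v, []) for v in range(lo, hi + 1)}
        for w in buckets[p]:
            nbrs = subgraph[walks[w]]
            if nbrs:
                walks[w] = random.choice(nbrs)          # only the current position is stored

out_edges = {0: [1, 2], 1: [3], 2: [0, 3], 3: [1]}
walks = [0, 0, 2, 3, 1]                                 # five walks, identified by index
intervals = [(0, 1), (2, 3)]
for _ in range(10):
    drunkardmob_step(walks, out_edges, intervals)
print(walks)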

Page 5: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

5

Why on a single machine

Canrsquot we just use the Cloud

Large-Scale Graph Computation on a Just a PC

6

Why use a cluster

Two reasons1 One computer cannot handle my graph problem

in a reasonable time

2 I need to solve the problem very fast

7

Why use a cluster

Two reasons1 One computer cannot handle my graph problem

in a reasonable time

2 I need to solve the problem very fast

Our work expands the space of feasible problems on one machine (PC)- Our experiments use the same graphs or bigger than previous papers on distributed graph computation (+ we can do Twitter graph on a laptop)

Our work raises the bar on required performance for a ldquocomplicatedrdquo system

8

Benefits of single machine systems

Assuming it can handle your big problemshellip1 Programmer productivityndash Global state debuggershellip

2 Inexpensive to install administer less power

3 Scalabilityndash Use cluster of single-machine systems

to solve many tasks in parallel Idea Trade latency

for throughputlt 32K bitssec

9

Computing on Big Graphs

Large-Scale Graph Computation on Just a PC

10

Big Graphs = Big Data

Data size 140 billion connections

asymp 1 TB

Not a problem

Computation

Hard to scale

Twitter network visualization by Akshay Java 2009

11

Research Goal

Compute on graphs with billions of edges in a reasonable time on a single PC

ndash Reasonable = close to numbers previously reported for distributed systems in the literature

Experiment PC Mac Mini (2012)

12

Terminology

bull (Analytical) Graph ComputationndashWhole graph is processed typically for

several iterations vertex-centric computation

ndash Examples Belief Propagation Pagerank Community detection Triangle Counting Matrix Factorization Machine Learninghellip

bull Graph Queries (database)ndash Selective graph queries (compare to SQL

queries)ndash Traversals shortest-path friends-of-

friendshellip

13

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Thesis statement

The Parallel Sliding Windows algorithm and the Partitioned Adjacency Lists data structure enable

computation on very large graphs in external memory on just a personal computer

14

DISK-BASED GRAPH COMPUTATION

15

GraphChi (Parallel Sliding Windows)

Batchcomp

Evolving graph

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Matrix Factorization

Loopy Belief PropagationCo-EM

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

DrunkardMob Parallel Random walk simulation

GraphChi^2

Incremental comp

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Graph sampling

Graph Queries

Graph traversal

16

Computational Model

bull Graph G = (V E) ndash directed edges e =

(source destination)ndash each edge and vertex

associated with a value (user-defined type)

ndash vertex and edge values can be modifiedbull (structure modification also

supported) Data

Data Data

Data

DataDataData

DataData

Data

A Be

Terms e is an out-edge of A and in-edge of B

17

Data

Data Data

Data

DataDataData

Data

Data

Data

Vertex-centric Programming

bull ldquoThink like a vertexrdquobull Popularized by the Pregel and GraphLab

projectsndash Historically systolic computation and the Connection Machine

MyFunc(vertex) modify neighborhood Data

Data Data

Data

Data

function Pagerank(vertex) insum = sum(edgevalue for edge in vertexinedges) vertexvalue = 085 + 015 insum foreach edge in vertexoutedges edgevalue = vertexvalue vertexnum_outedges

18

Computational Setting

ConstraintsA Not enough memory to store the whole

graph in memory nor all the vertex values

B Enough memory to store one vertex and its edges w associated values

19

The Main Challenge of Disk-based Graph Computation

Random Access

~ 100K reads sec (commodity)~ 1M reads sec (high-end arrays)

ltlt 5-10 M random edges sec to achieve ldquoreasonable performancerdquo

20

Random Access Problem

File edge-values

A in-edges A out-edges

A B

B in-edges B out-edges

x

Moral You can either access in- or out-edges sequentially but not both

Random write

Random readProcessing sequentially

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

bull PSW processes the graph one sub-graph a time

bull In one iteration the whole graph is processedndash And typically next iteration is started

1 Load

2 Compute

3 Write

23

bull Vertices are numbered from 1 to nndash P intervalsndash sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 10000100 700

1 Load

2 Compute

3 Write

ltltpartition-bygtgt

source destinationedge

In shards edges sorted by source

24

Example Layout

Shard 1

Shards small enough to fit in memory balance size of shards

Shard in-edges for interval of vertices sorted by source-id

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Shard 2 Shard 3 Shard 4Shard 1

1 Load

2 Compute

3 Write

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time: 9 hours; FB's: 12 hours
• GraphChi database about 250 GB; FB's > 1.4 terabytes
  – However, about 100 GB is explained by different variable data (payload) sizes

• Facebook: MySQL via JDBC; GraphChi-DB embedded
  – But MySQL is native code, GraphChi-DB is Scala (JVM)

• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance vs. size of DB

[Chart: requests/sec (0-10,000) as a function of the number of nodes (1.00E+07, 1.00E+08, 2.00E+08, 1.00E+09).]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory-extension [SSDAlloc, NSDI'11]
   – Problem: too many small objects; would need millions of accesses/sec

2. Compress the graph structure to fit into RAM [WebGraph framework]
   – Problem: associated values do not compress well and are mutated

3. Cluster the graph and handle each cluster separately in RAM
   – Problem: expensive; the number of inter-cluster edges is big

4. Caching of hot nodes
   – Problem: unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance.

Multiple hard-drives (RAID-ish)

• GraphChi supports striping shards to multiple disks: parallel I/O.

[Chart: seconds per iteration (0-3000) for Pagerank and Connected Components with 1, 2, and 3 hard drives. Experiment on a 16-core AMD server (from year 2007).]

Bottlenecks

[Chart: Connected Components on Mac Mini SSD - runtime (0-2500 seconds) with 1, 2, and 4 threads, broken down into disk I/O, graph construction, and executing updates.]

• Cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM; memory bandwidth becomes a bottleneck

Bottlenecks: Multicore

Experiment on MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution

• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime in seconds vs. number of threads (1, 2, 4) for Matrix Factorization (ALS) (0-160 s) and Connected Components (0-1000 s), split into loading and computation time.]

104

In-memory vs Disk

105

Experiment Query latency

See thesis for I/O cost analysis of in- and out-edge queries

106

Example: Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB
  – Sufficient to query for out-edges (see the sketch below)
  – Parallelizes well: multi-out-edge-query

• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from the graph
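A minimal sketch of the idea in Python; the DictGraph class and its out_edges method are hypothetical stand-ins for the GraphChi-DB query interface, not its actual API:

  class DictGraph:
      # Hypothetical stand-in for a graph store that answers out-edge queries.
      def __init__(self, edges):
          self.out = {}
          for u, v in edges:
              self.out.setdefault(u, []).append(v)
      def out_edges(self, u):
          return self.out.get(u, [])

  def induced_subgraph(db, S):
      # Only out-edge queries are needed: every edge (u, v) with both
      # endpoints in S is discovered while scanning u's out-edges.
      S = set(S)
      return [(u, v) for u in S for v in db.out_edges(u) if v in S]

  g = DictGraph([(1, 2), (2, 3), (3, 1), (1, 4)])
  print(induced_subgraph(g, {1, 2, 3}))   # [(1, 2), (2, 3), (3, 1)] in some order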

Vertices / Nodes

• Vertices are partitioned similarly to edges
  – Similar "data shards" for columns
• Lookup/update of vertex data is O(1) (see the sketch below)
• No merge tree here: vertex files are "dense"
  – Sparse structure could be supported

[Figure: dense vertex files partitioned at internal IDs 1, ..., a, a+1, ..., 2a, ..., Max-id.]
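A minimal sketch of why the dense layout gives constant-time access; the 8-byte value size and the function name are illustrative assumptions, not the GraphChi-DB file format:

  VALUE_SIZE = 8  # bytes per vertex value; an illustrative fixed-size type

  def vertex_value_offset(internal_id, interval_start):
      # The per-interval vertex file is dense, so a value is located by plain
      # offset arithmetic - an O(1) lookup/update with no index or merge tree.
      return (internal_id - interval_start) * VALUE_SIZE

  print(vertex_value_offset(internal_id=1_000_123, interval_start=1_000_000))  # 984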

ID-mapping

• Vertex IDs are mapped to internal IDs to balance shards
  – Interval length is a constant a

[Figure: mapping table from original to internal IDs - original IDs 0, 1, 2, ..., 255 map to internal IDs 0, 1, 2, ...; original IDs 256, 257, 258, ..., 511 map to a, a+1, a+2, ...; the next block maps to 2a, 2a+1, ..., and so on.]

109

What if we have a Cluster?

[Figure: the graph and PSW working memory stay on disk; the computational state of many parallel computations (Computation 1 state, Computation 2 state, ...) is kept in RAM.]

Trade latency for throughput.

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. ... which is caused by the lack of good data available to academics: big graphs with metadata
   – Industry co-operation? But there is a problem with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just the structure)

3. Too much focus on performance. More important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

  parfor walk in walks:
      for i = 1 to numsteps:
          vertex = walk.atVertex()
          walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems: "paging" from disk is costly.

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm - reverse thinking:

  ForEach interval p:
      walkSnapshot = getWalksForInterval(p)
      ForEach vertex in interval(p):
          mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
          ForEach walk in mywalks:
              walkManager.addHop(walk, vertex.randomNeighbor())

Note: need to store only the current position of each walk.
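A self-contained toy version of the idea in Python (illustrative only; the real DrunkardMob keeps billions of walk positions in compact arrays and loads intervals with PSW):

  import random

  def drunkardmob_toy(adj, starts, num_hops, num_intervals):
      # Advance all walks num_hops steps, processing vertices interval by interval.
      n = len(adj)
      interval_of = lambda v: v * num_intervals // n
      walk_pos = list(starts)                   # only the current position per walk
      for _ in range(num_hops):
          # Snapshot the walks per interval at the start of the pass, mirroring
          # getWalksForInterval(p), so each walk takes exactly one hop per pass.
          by_interval = {}
          for i, v in enumerate(walk_pos):
              by_interval.setdefault(interval_of(v), []).append(i)
          for p in range(num_intervals):
              for i in by_interval.get(p, []):
                  v = walk_pos[i]
                  if adj[v]:
                      walk_pos[i] = random.choice(adj[v])
      return walk_pos

  # Tiny example graph: a directed cycle of 8 vertices.
  adj = [[(v + 1) % 8] for v in range(8)]
  print(drunkardmob_toy(adj, starts=[0, 2, 4], num_hops=5, num_intervals=2))  # [5, 7, 1]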

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW & External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref O'Neil, Cheng et al (1996)
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact "Big Data" Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 6: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

6

Why use a cluster

Two reasons1 One computer cannot handle my graph problem

in a reasonable time

2 I need to solve the problem very fast

7

Why use a cluster

Two reasons1 One computer cannot handle my graph problem

in a reasonable time

2 I need to solve the problem very fast

Our work expands the space of feasible problems on one machine (PC)- Our experiments use the same graphs or bigger than previous papers on distributed graph computation (+ we can do Twitter graph on a laptop)

Our work raises the bar on required performance for a ldquocomplicatedrdquo system

8

Benefits of single machine systems

Assuming it can handle your big problemshellip1 Programmer productivityndash Global state debuggershellip

2 Inexpensive to install administer less power

3 Scalabilityndash Use cluster of single-machine systems

to solve many tasks in parallel Idea Trade latency

for throughputlt 32K bitssec

9

Computing on Big Graphs

Large-Scale Graph Computation on Just a PC

10

Big Graphs = Big Data

Data size 140 billion connections

asymp 1 TB

Not a problem

Computation

Hard to scale

Twitter network visualization by Akshay Java 2009

11

Research Goal

Compute on graphs with billions of edges in a reasonable time on a single PC

ndash Reasonable = close to numbers previously reported for distributed systems in the literature

Experiment PC Mac Mini (2012)

12

Terminology

bull (Analytical) Graph ComputationndashWhole graph is processed typically for

several iterations vertex-centric computation

ndash Examples Belief Propagation Pagerank Community detection Triangle Counting Matrix Factorization Machine Learninghellip

bull Graph Queries (database)ndash Selective graph queries (compare to SQL

queries)ndash Traversals shortest-path friends-of-

friendshellip

13

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Thesis statement

The Parallel Sliding Windows algorithm and the Partitioned Adjacency Lists data structure enable

computation on very large graphs in external memory on just a personal computer

14

DISK-BASED GRAPH COMPUTATION

15

GraphChi (Parallel Sliding Windows)

Batchcomp

Evolving graph

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Matrix Factorization

Loopy Belief PropagationCo-EM

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

DrunkardMob Parallel Random walk simulation

GraphChi^2

Incremental comp

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Graph sampling

Graph Queries

Graph traversal

16

Computational Model

bull Graph G = (V E) ndash directed edges e =

(source destination)ndash each edge and vertex

associated with a value (user-defined type)

ndash vertex and edge values can be modifiedbull (structure modification also

supported) Data

Data Data

Data

DataDataData

DataData

Data

A Be

Terms e is an out-edge of A and in-edge of B

17

Data

Data Data

Data

DataDataData

Data

Data

Data

Vertex-centric Programming

bull ldquoThink like a vertexrdquobull Popularized by the Pregel and GraphLab

projectsndash Historically systolic computation and the Connection Machine

MyFunc(vertex) modify neighborhood Data

Data Data

Data

Data

function Pagerank(vertex) insum = sum(edgevalue for edge in vertexinedges) vertexvalue = 085 + 015 insum foreach edge in vertexoutedges edgevalue = vertexvalue vertexnum_outedges

18

Computational Setting

ConstraintsA Not enough memory to store the whole

graph in memory nor all the vertex values

B Enough memory to store one vertex and its edges w associated values

19

The Main Challenge of Disk-based Graph Computation

Random Access

~ 100K reads sec (commodity)~ 1M reads sec (high-end arrays)

ltlt 5-10 M random edges sec to achieve ldquoreasonable performancerdquo

20

Random Access Problem

File edge-values

A in-edges A out-edges

A B

B in-edges B out-edges

x

Moral You can either access in- or out-edges sequentially but not both

Random write

Random readProcessing sequentially

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

bull PSW processes the graph one sub-graph a time

bull In one iteration the whole graph is processedndash And typically next iteration is started

1 Load

2 Compute

3 Write

23

bull Vertices are numbered from 1 to nndash P intervalsndash sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 10000100 700

1 Load

2 Compute

3 Write

ltltpartition-bygtgt

source destinationedge

In shards edges sorted by source

24

Example Layout

Shard 1

Shards small enough to fit in memory balance size of shards

Shard in-edges for interval of vertices sorted by source-id

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Shard 2 Shard 3 Shard 4Shard 1

1 Load

2 Compute

3 Write

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 7: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

7

Why use a cluster

Two reasons1 One computer cannot handle my graph problem

in a reasonable time

2 I need to solve the problem very fast

Our work expands the space of feasible problems on one machine (PC)- Our experiments use the same graphs or bigger than previous papers on distributed graph computation (+ we can do Twitter graph on a laptop)

Our work raises the bar on required performance for a ldquocomplicatedrdquo system

8

Benefits of single machine systems

Assuming it can handle your big problemshellip1 Programmer productivityndash Global state debuggershellip

2 Inexpensive to install administer less power

3 Scalabilityndash Use cluster of single-machine systems

to solve many tasks in parallel Idea Trade latency

for throughputlt 32K bitssec

9

Computing on Big Graphs

Large-Scale Graph Computation on Just a PC

10

Big Graphs = Big Data

Data size 140 billion connections

asymp 1 TB

Not a problem

Computation

Hard to scale

Twitter network visualization by Akshay Java 2009

11

Research Goal

Compute on graphs with billions of edges in a reasonable time on a single PC

ndash Reasonable = close to numbers previously reported for distributed systems in the literature

Experiment PC Mac Mini (2012)

12

Terminology

bull (Analytical) Graph ComputationndashWhole graph is processed typically for

several iterations vertex-centric computation

ndash Examples Belief Propagation Pagerank Community detection Triangle Counting Matrix Factorization Machine Learninghellip

bull Graph Queries (database)ndash Selective graph queries (compare to SQL

queries)ndash Traversals shortest-path friends-of-

friendshellip

13

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Thesis statement

The Parallel Sliding Windows algorithm and the Partitioned Adjacency Lists data structure enable

computation on very large graphs in external memory on just a personal computer

14

DISK-BASED GRAPH COMPUTATION

15

GraphChi (Parallel Sliding Windows)

Batchcomp

Evolving graph

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Matrix Factorization

Loopy Belief PropagationCo-EM

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

DrunkardMob Parallel Random walk simulation

GraphChi^2

Incremental comp

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Graph sampling

Graph Queries

Graph traversal

16

Computational Model

bull Graph G = (V E) ndash directed edges e =

(source destination)ndash each edge and vertex

associated with a value (user-defined type)

ndash vertex and edge values can be modifiedbull (structure modification also

supported) Data

Data Data

Data

DataDataData

DataData

Data

A Be

Terms e is an out-edge of A and in-edge of B

17

Data

Data Data

Data

DataDataData

Data

Data

Data

Vertex-centric Programming

bull ldquoThink like a vertexrdquobull Popularized by the Pregel and GraphLab

projectsndash Historically systolic computation and the Connection Machine

MyFunc(vertex) modify neighborhood Data

Data Data

Data

Data

function Pagerank(vertex) insum = sum(edgevalue for edge in vertexinedges) vertexvalue = 085 + 015 insum foreach edge in vertexoutedges edgevalue = vertexvalue vertexnum_outedges

18

Computational Setting

ConstraintsA Not enough memory to store the whole

graph in memory nor all the vertex values

B Enough memory to store one vertex and its edges w associated values

19

The Main Challenge of Disk-based Graph Computation

Random Access

~ 100K reads sec (commodity)~ 1M reads sec (high-end arrays)

ltlt 5-10 M random edges sec to achieve ldquoreasonable performancerdquo

20

Random Access Problem

File edge-values

A in-edges A out-edges

A B

B in-edges B out-edges

x

Moral You can either access in- or out-edges sequentially but not both

Random write

Random readProcessing sequentially

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

bull PSW processes the graph one sub-graph a time

bull In one iteration the whole graph is processedndash And typically next iteration is started

1 Load

2 Compute

3 Write

23

bull Vertices are numbered from 1 to nndash P intervalsndash sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 10000100 700

1 Load

2 Compute

3 Write

ltltpartition-bygtgt

source destinationedge

In shards edges sorted by source

24

Example Layout

Shard 1

Shards small enough to fit in memory balance size of shards

Shard in-edges for interval of vertices sorted by source-id

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Shard 2 Shard 3 Shard 4Shard 1

1 Load

2 Compute

3 Write

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL: In-edge Linkage

Pointer-array (Source, File offset):
1  0
3  3
7  5
…  …

Edge-array (Destination, Link):
8      3339
193    3
76420  1092
12     289
872    40
193    2002
212    12
89139  22
…

+ Index to the first in-edge for each vertex in the interval.
Augmented linked list for in-edges.

Problem 2: How to find out-edges quickly?
Note: Sorted inside a shard, but partitioned across all shards.

53
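To make the linkage concrete, here is a minimal Python sketch of one shard; it is illustrative only (class and field names are invented, not GraphChi-DB's code or file format): a CSR-style pointer-array gives each source vertex its out-edge range inside the shard, and a parallel 'link' column chains together the in-edges of each destination, starting from a per-vertex index of the first in-edge.

# Minimal sketch of one PAL shard (illustrative; not the real on-disk format).
# Edges are sorted by source; 'link' holds the position of the next edge that
# shares the same destination (-1 terminates the chain).
class ShardSketch:
    def __init__(self, edges):
        # edges: list of (src, dst) sorted by src
        self.dst = [d for (_, d) in edges]
        self.link = [-1] * len(edges)
        # CSR-style pointer-array: src -> offset of its first edge in this shard
        self.first_out = {}
        self.num_out = {}
        for i, (s, _) in enumerate(edges):
            self.first_out.setdefault(s, i)
            self.num_out[s] = self.num_out.get(s, 0) + 1
        # Index to the first in-edge of each destination, plus the chain links
        self.first_in = {}
        last_pos = {}
        for i, d in enumerate(self.dst):
            if d in last_pos:
                self.link[last_pos[d]] = i
            else:
                self.first_in[d] = i
            last_pos[d] = i

    def out_edges(self, src):
        # Out-edges of 'src' stored in this shard: a contiguous range.
        start = self.first_out.get(src)
        if start is None:
            return []
        return self.dst[start:start + self.num_out[src]]

    def in_edge_positions(self, dst):
        # Follow the in-edge chain of 'dst' inside this shard.
        pos = self.first_in.get(dst, -1)
        while pos != -1:
            yield pos
            pos = self.link[pos]

# Example with the edges from the slide:
edges = [(1, 8), (1, 193), (1, 76420), (3, 12), (3, 872),
         (7, 193), (7, 212), (7, 89139)]
shard = ShardSketch(edges)
print(shard.out_edges(7))                  # [193, 212, 89139]
print(list(shard.in_edge_positions(193)))  # [1, 5] -> edges (1,193) and (7,193)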

PAL: Out-edge Queries

Pointer-array (Source, File offset):
1  0
3  3
7  5
…  …
Problem: can be big, O(V). Binary search on disk is slow.

Edge-array (Destination, Next-in-offset):
8      3339
193    3
76420  1092
12     289
872    40
193    2002
212    12
89139  22
…

Option 1: Sparse index (in-mem): 256, 512, 1024, …
Option 2: Delta-coding with unary code (Elias-Gamma). Completely in-memory.

54
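Option 1 can be sketched as follows; this is a hedged illustration with invented names, not the benchmarked implementation: keep only every k-th (source, file offset) entry of the pointer-array in RAM, binary-search that sparse index, then scan at most k on-disk entries to locate the queried source.

import bisect

# Illustrative sparse index over the pointer-array (Option 1).
# Only every K-th entry is kept in RAM; the full pointer-array stays on disk.
pointer_array = [(1, 0), (3, 3), (7, 5), (9, 9), (12, 14), (15, 20), (21, 27)]
K = 2

sparse_index = pointer_array[::K]            # in-memory sample
sparse_keys = [src for (src, _) in sparse_index]

def read_pointer_entries(start, count):
    # Stand-in for a disk read of 'count' pointer-array entries.
    return pointer_array[start:start + count]

def find_out_offset(src):
    # Return the file offset of src's first out-edge in this shard, or None.
    i = bisect.bisect_right(sparse_keys, src) - 1   # last sparse entry <= src
    if i < 0:
        return None
    for s, offset in read_pointer_entries(i * K, K):  # scan at most K entries
        if s == src:
            return offset
    return None

print(find_out_offset(7))    # 5
print(find_out_offset(12))   # 14
print(find_out_offset(2))    # None (vertex 2 has no out-edges in this shard)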

Experiment: Indices

Median latency, Twitter graph, 1.5B edges.

55

Queries: I/O costs
In-edge query: only one shard.

Out-edge query: each shard that has edges.

Trade-off: more shards gives better locality for in-edge queries, worse for out-edge queries.

Edge Data & Searches

Shard X: adjacency; 'weight' [float]; 'timestamp' [long]; 'belief' (factor)

Edge (i)

No foreign key required to find edge data (fixed size).

Note: vertex values stored similarly.

57
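The "no foreign key" point can be illustrated with a small sketch (an invented layout, assuming fixed-size records): each edge property is a separate column, and edge i's data is addressed directly by its position i in the shard, so the adjacency structure never stores pointers into the data columns.

import array

# Illustrative columnar edge-data sketch: one fixed-size column per property.
# Edge i in a shard owns slot i of every column -- no foreign keys needed.
num_edges = 8
weights = array.array('f', [0.0] * num_edges)     # 'weight'    [float]
timestamps = array.array('q', [0] * num_edges)    # 'timestamp' [long]

def set_edge_data(i, weight, ts):
    weights[i] = weight
    timestamps[i] = ts

def get_edge_weight(i):
    # Load only the column you need (columnar model).
    return weights[i]

set_edge_data(5, 0.75, 1396000000)
print(get_edge_weight(5))    # 0.75
print(timestamps[5])         # 1396000000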

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading the existing shard from disk. Each edge will always be rewritten on a merge.

Does not scale: number of rewrites ~ data size / buffer size = O(E).

60

Log-Structured Merge-tree (LSM)
Ref: O'Neil, Cheng et al. (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1, interval 2, interval 3, interval 4, ..., interval P-1, interval P

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query: one shard on each level.

Out-edge query: all shards.

61
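As a rough sketch of the log-structured merge idea (simplified to a single partition interval, with invented names; not GraphChi-DB's actual buffer or shard code): new edges go to an in-memory buffer, a full buffer is sorted and written as a new run on the youngest level, and when a level collects too many runs they are merged downstream into the next level, so each edge is rewritten roughly once per level instead of once per buffer flush.

import random

# Simplified log-structured merge sketch (one interval, invented names).
# Runs are sorted lists of (src, dst); a level holds at most FANOUT runs.
FANOUT = 4
BUFFER_CAPACITY = 1000

buffer = []            # in-memory buffer
levels = [[], [], []]  # levels[0] = youngest on-disk runs

def add_edge(src, dst):
    buffer.append((src, dst))
    if len(buffer) >= BUFFER_CAPACITY:
        flush_buffer()

def flush_buffer():
    global buffer
    levels[0].append(sorted(buffer))
    buffer = []
    compact(0)

def compact(level):
    # Downstream merge: too many runs -> merge them into one run a level down.
    if len(levels[level]) < FANOUT:
        return
    merged = sorted(edge for run in levels[level] for edge in run)
    levels[level] = []
    if level + 1 < len(levels):
        levels[level + 1].append(merged)
        compact(level + 1)
    else:
        levels[level].append(merged)   # oldest level keeps one big run

def in_edges(dst):
    # A query must consult the buffer and the runs on every level.
    hits = [e for e in buffer if e[1] == dst]
    for level in levels:
        for run in level:
            hits.extend(e for e in run if e[1] == dst)
    return hits

for _ in range(10000):
    add_edge(random.randrange(100000), random.randrange(100000))
print(sum(len(run) for level in levels for run in level) + len(buffer))  # 10000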

Experiment: Ingest

[Chart: edges ingested (up to 1.5E+09) vs. time (seconds), GraphChi-DB with LSM vs. GraphChi-No-LSM]

62

Advantages of PAL

• Only sparse and implicit indices
  – Pointer-array usually fits in RAM with Elias-Gamma encoding
  – Small database size
• Columnar data model
  – Load only the data you need
  – Graph structure is separate from data
  – Property graph model
• Great insertion throughput with LSM
  – Tree can be adjusted to match the workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala
• Queries & Computation
• Online database

All experiments shown in this talk done on a Mac Mini (8 GB, SSD).

Source code and examples: http://github.com/graphchi

65

Comparison: Database Size

[Bar chart: database file size for the twitter-2010 graph (1.5B edges) – baseline (4 + 4 bytes / edge), GraphChi-DB, Neo4j, MySQL (data + indices)]

66

Comparison: Ingest

System: Time to ingest 1.5B edges

GraphChi-DB (ONLINE): 1 hour 45 mins

Neo4j (batch): 45 hours

MySQL (batch): 3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes.

Comparison: Friends-of-Friends Query

See thesis for shortest-path comparison.

[Bar charts: latency percentiles over 100K random queries, in milliseconds.
Small graph (68M edges): GraphChi-DB vs. Neo4j – 50th percentile 0.379 vs. 0.127 ms; 99th percentile 6.653 vs. 8.078 ms.
Big graph (1.5B edges): GraphChi-DB vs. Neo4j vs. MySQL – 99th percentile 1264, 1631, 4776 ms; a 50th-percentile panel compares the same three systems.]

68
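For reference, a friends-of-friends query is just a two-hop out-edge expansion; a minimal sketch follows (invented query function, not the benchmarked implementation):

# Two-hop "friends-of-friends" sketch (illustrative only).
out_edges = {1: [2, 3], 2: [4], 3: [4, 5], 4: [6]}

def query_out_edges(v):
    return out_edges.get(v, [])

def friends_of_friends(v):
    friends = set(query_out_edges(v))
    fof = set()
    for f in friends:                 # one out-edge query per friend
        fof.update(query_out_edges(f))
    return fof - friends - {v}

print(sorted(friends_of_friends(1)))   # [4, 5]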

LinkBench: Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
  – But only single-hop queries ("friends")
  – 8 different operations, mixed workload
  – Best performance with 64 parallel threads
• Each edge and vertex has:
  – Version, timestamp, type, random string payload

                          GraphChi-DB (Mac Mini)   MySQL + FB patch, server, SSD-array, 144 GB RAM
Edge-update (95p):        22 ms                    25 ms
Edge-get-neighbors (95p): 18 ms                    9 ms
Avg throughput:           2,487 req/s              11,029 req/s
Database size:            350 GB                   1.4 TB

See full results in the thesis

69

Summary of Experiments

• Efficient for mixed read/write workload
  – See Facebook LinkBench experiments in thesis
  – LSM-tree: trade-off in read performance (but can adjust)
• State-of-the-art performance for graphs that are much larger than RAM
  – Neo4J's linked-list data structure good for RAM
  – DEX performs poorly in practice

More experiments in the thesis.

70

Discussion

Greater Impact / Hindsight

Future Research Questions

71

GREATER IMPACT

72

Impact: "Big Data" Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)
  – Two major direct descendant papers in top conferences:
    • X-Stream: SOSP 2013
    • TurboGraph: KDD 2013
• Challenging the mainstream
  – You can do a lot on just a PC: focus on the right data structures and computational models

73

Impact: Users

• GraphChi's implementations (C++, Java, -DB) have gained a lot of users
  – Currently ~50 unique visitors / day
• Enables 'everyone' to tackle big graph problems
  – Especially the recommender toolkit (by Danny Bickson) has been very popular
  – Typical users: students, non-systems researchers, small companies…

74

Impact: Users (cont.)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research – it is too expensive.

Using GraphChi I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research."

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for?

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models

1. Changing the value of an edge (both in- and out-edges)
2. Computation processes the whole, or most of, the graph on each iteration
3. Random access to all of a vertex's edges
   – Vertex-centric vs. edge-centric
4. Async / Gauss-Seidel execution

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
  – Or dynamic scheduling (such as Splash BP)
• High-diameter graphs, such as planar graphs
  – Unless the computation itself has short-range interactions
• Very large number of iterations
  – Neural networks
  – LDA with Collapsed Gibbs sampling
• No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

• Distributed setting
  1. Distributed PSW (one shard / node)?
     1. PSW is inherently sequential
     2. Low bandwidth in the Cloud
  2. Co-operating GraphChi(-DB)'s connected with a Parameter Server
• New graph programming models and tools
  – Vertex-centric programming sometimes too local: for example, two-hop interactions and many traversals are cumbersome
  – Abstractions for learning graph structure? Implicit graphs?
  – Hard to debug, especially async; better tools needed
• Graph-aware optimizations to GraphChi-DB
  – Buffer management
  – Smart caching
  – Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
  – Good: works with very little memory
  – Bad: does not benefit from more memory

• Simple trick: cache some data

• Many graph algorithms have O(V) state
  – Update function accesses neighbor vertex state
    • Standard PSW: 'broadcast' vertex value via edges
    • Semi-external: store vertex values in memory

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
  – But not enough to store the whole graph

[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, ...) in RAM]

83
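A minimal sketch of the semi-external idea (illustrative only, standard 0.15/0.85 PageRank damping, not GraphChi's shard-by-shard PSW execution): keep one value per vertex in RAM and stream the edges from disk, so the O(V) state is resident while the O(E) structure is not.

# Semi-external PageRank sketch: O(V) vertex values in RAM, edges streamed.
def stream_edges():
    # Stand-in for streaming (src, dst) pairs from disk, one pass per call.
    yield from [(0, 1), (0, 2), (1, 2), (2, 0), (3, 2)]

num_vertices = 4
out_degree = [0] * num_vertices
for src, _ in stream_edges():
    out_degree[src] += 1

rank = [1.0] * num_vertices          # in-memory O(V) state
for _ in range(20):
    incoming = [0.0] * num_vertices
    for src, dst in stream_edges():  # one sequential pass over the edges
        incoming[dst] += rank[src] / out_degree[src]
    rank = [0.15 + 0.85 * s for s in incoming]

print([round(r, 3) for r in rank])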

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
  – Store billions of random walk states in RAM
• Multiple Breadth-First Searches
  – Analyze neighborhood sizes by starting hundreds of random BFSes
• Compute in parallel many different recommender algorithms (or with different parameterizations)
  – See Mayank Mohta and Shu-Hao Yu's Master's project

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin) – UAI 2010

Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors) – VLDB 2012

GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch) – OSDI 2012

DrunkardMob: Billions of Random Walks on Just a PC – ACM RecSys 2013

Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch) – SEA 2014

GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin) – submitted, arXiv

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin) – ICML 2011

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation: GraphChi
• Extended PSW to design Partitioned Adjacency Lists, to build a scalable graph database: GraphChi-DB
• Proposed DrunkardMob for simulating billions of random walks in parallel
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms: a new approach for external-memory graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                     GraphChi (40 Mac Minis)   PowerGraph (64 EC2 cc1.4xlarge)
Investments:         67,320 $                  -
Operating costs:
  Per node / hour:   0.03 $                    1.30 $
  Cluster / hour:    1.19 $                    52.00 $
  Daily:             28.56 $                   1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions:
- Mac Mini: 85 W (typical servers: 500-1000 W)
- Most expensive US energy: 35c / kWh

Equal-throughput configurations (based on OSDI '12).

89

PSW for In-memory Computation

• External memory setting:
  – Slow memory = hard disk / SSD
  – Fast memory = RAM
• In-memory:
  – Slow = RAM
  – Fast = CPU caches

[Diagram: CPU – small (fast) memory of size M – block transfers of size B – large (slow) memory of 'unlimited' size]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

(Ligra: Julian Shun, Guy Blelloch, PPoPP 2013)

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space
• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP: see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)
• Asynchronous is sometimes difficult to reason about and debug
• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in the Aggarwal-Vitter I/O model
  – Worst case only 2x best case
• Intuition:

[Diagram: vertices 1 ... |V| split into interval(1), interval(2), ..., interval(P), with shard(1), shard(2), ..., shard(P). An edge (v1) whose endpoints are in the same interval is loaded from disk only once / iteration; an edge (v2) spanning two intervals is loaded twice / iteration.]

93
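A back-of-the-envelope version of this intuition, written as a hedged sketch that follows the slide rather than the paper's exact theorem (B = block size, E = number of edges, P = number of intervals):

\[ \frac{E}{B} \;\le\; \text{blocks read per iteration} \;\lesssim\; \frac{2E}{B} \;+\; \Theta(P^2) \]

An edge with both endpoints in one interval is read once (from its own shard), an edge spanning two intervals is read a second time through a sliding window, and opening one window per shard per interval accounts for the Θ(P²) non-sequential reads; writes behave symmetrically, which is why the worst case is only about 2x the best case.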

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise they would require too many iterations
• Per-iteration cost is not much affected
• Can we optimize partitioning?
  – Could help thanks to Gauss-Seidel (faster convergence inside "groups"), topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression Would it help

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)
• Compression is problematic for evolving graphs and associated data
• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases
  – GOOD, GraphDB, HyperGraphDB…
  – Focus was on modeling; graph storage on top of a relational DB or key-value store
• RDF databases
  – Most do not use graph storage but store triples as relations + use indexing
• Modern solutions have proposed graph-specific storage
  – Neo4j: doubly linked list
  – TurboGraph: adjacency list chopped into pages
  – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time: 9 hours; FB's: 12 hours
• GraphChi database about 250 GB; FB > 1.4 terabytes
  – However, about 100 GB is explained by different variable data (payload) size
• Facebook: MySQL via JDBC; GraphChi: embedded
  – But MySQL is native code, GraphChi-DB is Scala (JVM)
• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance / size of DB

[Chart: requests / sec vs. number of nodes (1.0E+07 to 1.0E+09)]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory-extension [SSDAlloc, NSDI'11]
   – Too many small objects; need millions / sec
2. Compress the graph structure to fit into RAM [WebGraph framework]
   – Associated values do not compress well and are mutated
3. Cluster the graph and handle each cluster separately in RAM
   – Expensive! The number of inter-cluster edges is big
4. Caching of hot nodes
   – Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance

Multiple hard drives (RAID-ish)

• GraphChi supports striping shards to multiple disks: parallel I/O

[Chart: seconds / iteration for Pagerank and Connected components with 1, 2 and 3 hard drives]

Experiment on a 16-core AMD server (from year 2007).

Bottlenecks

[Chart: Connected Components on Mac Mini (SSD) – runtime with 1, 2 and 4 threads, broken down into disk I/O, graph construction and executing updates]

• The cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM: memory bandwidth becomes a bottleneck

Bottlenecks / Multicore

Experiment on a MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime (seconds) split into loading and computation vs. number of threads (1, 2, 4), for Matrix Factorization (ALS) and Connected Components]

104

In-memory vs Disk

105

Experiment: Query latency

See thesis for I/O cost analysis of in- and out-queries.

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S
• Very fast in GraphChi-DB
  – Sufficient to query for out-edges (see the sketch below)
  – Parallelizes well: multi-out-edge-query
• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from the graph
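A small sketch of why out-edge queries suffice (illustrative Python with an invented query function, not GraphChi-DB's API): issue one out-edge query per vertex in S and keep only the targets that are also in S.

# Induced subgraph via out-edge queries only (illustrative sketch).
out_edges = {
    1: [2, 3, 9],
    2: [3],
    3: [1, 7],
    7: [2],
}

def query_out_edges(v):
    return out_edges.get(v, [])

def induced_subgraph(S):
    S = set(S)
    # One out-edge query per vertex in S (these parallelize well);
    # an edge (u, v) belongs to the induced subgraph iff both u and v are in S.
    return [(u, v) for u in sorted(S) for v in query_out_edges(u) if v in S]

print(induced_subgraph([1, 2, 3]))   # [(1, 2), (1, 3), (2, 3), (3, 1)]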

Vertices / Nodes

• Vertices are partitioned similarly to edges
  – Similar "data shards" for columns
• Lookup/update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
  – Sparse structure could be supported

[Diagram: vertex id ranges 1 ... a, a+1 ... 2a, ..., Max-id]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance shards
  – Interval length constant a

[Table: original IDs 0, 1, 2, ..., 255 map to internal IDs 0, 1, 2, ...; original IDs 256, 257, 258, ..., 511 map to a, a+1, a+2, ...; the next block maps to 2a, 2a+1, ...]

109
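The exact mapping is not spelled out on the slide; the sketch below assumes a block-striped scheme (block size 256, interval length a) purely for illustration, so that consecutive original IDs are spread across intervals and the shards stay balanced.

# Illustrative ID-mapping sketch (assumed scheme; block size 256, interval length a).
BLOCK = 256
a = 1 << 20          # interval length constant (example value)

def to_internal(orig_id):
    # Block b of 256 consecutive original IDs lands at the start of interval b.
    block, offset = divmod(orig_id, BLOCK)
    return block * a + offset

def to_original(internal_id):
    block, offset = divmod(internal_id, a)
    return block * BLOCK + offset

assert to_internal(0) == 0 and to_internal(255) == 255
assert to_internal(256) == a and to_internal(257) == a + 1
assert to_original(to_internal(123456)) == 123456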

What if we have a Cluster

[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, ...) in RAM]

Trade latency for throughput.

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. … which is caused by a lack of good data available to academics: big graphs with metadata
   – Industry co-operation? But problems with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just the structure)
3. Too much focus on performance. More important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java, 2009.

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems: "paging" from disk is costly. (This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm – reverse thinking:

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk.
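A runnable toy version of the same idea (illustrative, invented names; the real DrunkardMob keeps billions of walk positions in compact arrays and loads sub-graphs with PSW): group the current walk positions by vertex interval, visit one interval at a time, and advance every walk that currently sits in that interval by one hop.

import random
from collections import defaultdict

# Toy DrunkardMob-style sketch: advance walks interval-by-interval so that
# only one sub-graph needs to be "loaded" at a time (illustrative only).
out_edges = {v: [(v + 1) % 100, (v + 7) % 100] for v in range(100)}
INTERVAL_SIZE = 25   # vertex intervals 0-24, 25-49, 50-74, 75-99

walks = {w: random.randrange(100) for w in range(1000)}   # walk id -> position

def run_iteration():
    by_interval = defaultdict(list)
    for w, v in walks.items():
        by_interval[v // INTERVAL_SIZE].append(w)
    for p in sorted(by_interval):
        # "Load" interval p's sub-graph, then hop every walk sitting in it.
        for w in by_interval[p]:
            v = walks[w]
            walks[w] = random.choice(out_edges[v])

for _ in range(10):    # 10 hops per walk
    run_iteration()
print(len(walks), "walks advanced; only each walk's current position is stored")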

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" / Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW & External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM), Ref: O'Neil, Cheng et al. (1996)
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact: "Big Data" Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 9: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

9

Computing on Big Graphs

Large-Scale Graph Computation on Just a PC

10

Big Graphs = Big Data

Data size 140 billion connections

asymp 1 TB

Not a problem

Computation

Hard to scale

Twitter network visualization by Akshay Java 2009

11

Research Goal

Compute on graphs with billions of edges in a reasonable time on a single PC

ndash Reasonable = close to numbers previously reported for distributed systems in the literature

Experiment PC Mac Mini (2012)

12

Terminology

bull (Analytical) Graph ComputationndashWhole graph is processed typically for

several iterations vertex-centric computation

ndash Examples Belief Propagation Pagerank Community detection Triangle Counting Matrix Factorization Machine Learninghellip

bull Graph Queries (database)ndash Selective graph queries (compare to SQL

queries)ndash Traversals shortest-path friends-of-

friendshellip

13

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Thesis statement

The Parallel Sliding Windows algorithm and the Partitioned Adjacency Lists data structure enable

computation on very large graphs in external memory on just a personal computer

14

DISK-BASED GRAPH COMPUTATION

15

GraphChi (Parallel Sliding Windows)

Batchcomp

Evolving graph

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Matrix Factorization

Loopy Belief PropagationCo-EM

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

DrunkardMob Parallel Random walk simulation

GraphChi^2

Incremental comp

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Graph sampling

Graph Queries

Graph traversal

16

Computational Model

• Graph G = (V, E)
  – directed edges: e = (source, destination)
  – each edge and vertex is associated with a value (user-defined type)
  – vertex and edge values can be modified
    • (structure modification also supported)

[Figure: vertices and edges, each carrying a Data value]

A --e--> B

Terms: e is an out-edge of A and an in-edge of B

17


Vertex-centric Programming

• "Think like a vertex"
• Popularized by the Pregel and GraphLab projects
  – Historically: systolic computation and the Connection Machine

MyFunc(vertex): { modify neighborhood }

function Pagerank(vertex):
    insum = sum(edge.value for edge in vertex.inedges)
    vertex.value = 0.85 + 0.15 * insum
    foreach edge in vertex.outedges:
        edge.value = vertex.value / vertex.num_outedges

18

Computational Setting

Constraints:

A. Not enough memory to store the whole graph in memory, nor all the vertex values

B. Enough memory to store one vertex and its edges, with associated values

19

The Main Challenge of Disk-based Graph Computation

Random Access

~100K reads/sec (commodity); ~1M reads/sec (high-end arrays)

<< the 5-10 M random edge accesses/sec needed to achieve "reasonable performance"

20

Random Access Problem

File: edge-values

A in-edges | A out-edges | … | B in-edges | B out-edges    (x: value of the edge from A to B)

Moral: You can either access in- or out-edges sequentially, but not both

Processing sequentially therefore causes a random read or a random write for the other direction

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

• PSW processes the graph one sub-graph at a time

• In one iteration, the whole graph is processed
  – And typically the next iteration is started

1 Load

2 Compute

3 Write

23
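A minimal Python sketch of the three phases (an illustrative toy version: in-memory lists stand in for the on-disk shards, and update is the user's vertex function; this is not GraphChi's actual code):

    def psw_iteration(intervals, shards, update):
        # intervals[p] = set of vertex ids in interval p; shards[p] = in-edges of interval p,
        # each edge a dict {"src": ..., "dst": ..., "value": ...}, kept sorted by src on disk.
        P = len(intervals)
        for p in range(P):
            # 1. Load: memory shard p (all in-edges of interval p) plus a sliding window
            #    of every other shard, containing the out-edges of interval p.
            in_edges = shards[p]
            out_edges = [e for q in range(P) if q != p
                           for e in shards[q] if e["src"] in intervals[p]]
            # 2. Compute: run the update function on each vertex of the interval.
            for v in intervals[p]:
                update(v,
                       [e for e in in_edges if e["dst"] == v],
                       [e for e in out_edges if e["src"] == v])
            # 3. Write: modified blocks are written back to disk (a no-op for lists).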

• Vertices are numbered from 1 to n
  – P intervals
  – sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 … 100 | 101 … 700 | … | … 10000

1 Load

2 Compute

3 Write

<<partition-by>>

edge: source → destination

In shards edges sorted by source

24
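To make the partitioning rule concrete, a small sketch (the interval boundaries are those of the example layout; the function name is ours, not GraphChi's):

    import bisect

    INTERVAL_UPPER_BOUNDS = [100, 700, 1000, 10000]   # P = 4 intervals

    def shard_for_edge(src, dst):
        # An edge goes to the shard of the interval that contains its destination;
        # within the shard, edges are stored sorted by source.
        return bisect.bisect_left(INTERVAL_UPPER_BOUNDS, dst)

    print(shard_for_edge(5, 650))   # 1: destination 650 falls in interval 101..700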

Example Layout

Shard 1

Shards small enough to fit in memory; balance the size of the shards

Shard: in-edges for an interval of vertices, sorted by source-id

in-edges for vertices 1..100, sorted by source_id

Vertices 1..100 | Vertices 101..700 | Vertices 701..1000 | Vertices 1001..10000

Shard 1 | Shard 2 | Shard 3 | Shard 4

1 Load

2 Compute

3 Write

25

Vertices 1..100 | Vertices 101..700 | Vertices 701..1000 | Vertices 1001..10000

Load all in-edges in memory

Load subgraph for vertices 1..100

What about out-edges? Arranged in sequence in the other shards

Shard 1 | Shard 2 | Shard 3 | Shard 4

PSW Loading Sub-graph

Shard 1: in-edges for vertices 1..100, sorted by source_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edges in memory

Load subgraph for vertices 101..700

Shard 1 | Shard 2 | Shard 3 | Shard 4

PSW Loading Sub-graph

Vertices 1..100 | Vertices 101..700 | Vertices 701..1000 | Vertices 1001..10000

Out-edge blocks in memory

in-edges for vertices 1..100, sorted by source_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P² random accesses on one full pass

Works well on both SSD and magnetic hard disks


28
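Stated as a formula (this just restates the count above in external-memory style; nothing new is claimed):

\[
\underbrace{P}_{\text{intervals}} \times \underbrace{P}_{\text{large block reads/writes per interval}} \;=\; P^{2}\ \text{non-sequential accesses per full pass},
\qquad \text{total data moved per pass} = O(|E|).
\]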

"GAUSS-SEIDEL" ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

• Bulk-Synchronous Parallel (Jacobi iterations)
  – Updates see neighbors' values from the previous iteration [Most systems are synchronous]

• Asynchronous (Gauss-Seidel iterations)
  – Updates see the most recent values
  – GraphLab is asynchronous

30

Shard 1

Load all in-edges in memory

Load subgraph for vertices 101..700

Shard 1 | Shard 2 | Shard 3 | Shard 4

PSW runs Gauss-Seidel

Vertices 1..100 | Vertices 101..700 | Vertices 701..1000 | Vertices 1001..10000

Out-edge blocks in memory

in-edges for vertices 1..100, sorted by source_id

Fresh values in this "window" from the previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires diameter-many iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N − 1) / (e − 1) ≈ 60% of synchronous

Each vertex chooses minimum label of neighbor

33
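A tiny self-contained simulation of the minimum-label example above, contrasting a synchronous (Jacobi) sweep with an asynchronous (Gauss-Seidel) sweep on a chain (illustrative only, not GraphChi code):

    import random

    def neighbors(i, n):
        return [j for j in (i - 1, i + 1) if 0 <= j < n]

    def jacobi_sweeps(labels):
        # Synchronous: every vertex sees the previous iteration's values.
        labels, sweeps = list(labels), 0
        while len(set(labels)) > 1:
            labels = [min([labels[i]] + [labels[j] for j in neighbors(i, len(labels))])
                      for i in range(len(labels))]
            sweeps += 1
        return sweeps

    def gauss_seidel_sweeps(labels, seed=0):
        # Asynchronous: vertices are visited in random order and see the freshest values.
        random.seed(seed)
        labels, sweeps = list(labels), 0
        while len(set(labels)) > 1:
            for i in random.sample(range(len(labels)), len(labels)):
                labels[i] = min([labels[i]] + [labels[j] for j in neighbors(i, len(labels))])
            sweeps += 1
        return sweeps

    print(jacobi_sweeps([1, 2, 5, 3, 4]))        # 4 sweeps, as in the synchronous figure
    print(gauss_seidel_sweeps([1, 2, 4, 3, 5]))  # typically fewer sweeps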

Open theoretical question

Label Propagation (side length = 100)

Synchronous vs. PSW Gauss-Seidel (average, random schedule), in iterations:
199 vs ~57
100 vs ~29
298 vs ~52
Natural graphs (web, social): graph diameter − 1  vs  ~0.6 × diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

• PSW is a new technique for implementing many fundamental graph algorithms
  – Especially simple (compared to previous work) for directed graph problems: PSW handles both in- and out-edges

• We propose a new graph contraction algorithm based on PSW
  – Minimum-Spanning Forest & Connected Components

• … utilizing the Gauss-Seidel "acceleration"

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluation:
• HD vs SSD
• Striping data across multiple hard drives
• Comparison to an in-memory version
• Bottlenecks analysis
• Effect of the number of shards
• Block size and performance

37

GraphChi

• C++ implementation: 8,000 lines of code
  – Java implementation also available

• Several optimizations to PSW (see paper)

Source code and examples: http://github.com/graphchi

38

Experiment Setting

• Mac Mini (Apple Inc.)
  – 8 GB RAM
  – 256 GB SSD, 1 TB hard drive
  – Intel Core i5, 2.5 GHz

bull Experiment graphs

Graph | Vertices | Edges | P (shards) | Preprocessing

live-journal | 4.8M | 69M | 3 | 0.5 min

netflix | 0.5M | 99M | 20 | 1 min

twitter-2010 | 42M | 1.5B | 20 | 2 min

uk-2007-05 | 106M | 3.7B | 40 | 31 min

uk-union | 133M | 5.4B | 50 | 33 min

yahoo-web | 1.4B | 6.6B | 50 | 37 min

39

Comparison to Existing Systems

Notes: comparison results do not include the time to transfer the data to the cluster, preprocessing, or the time to load the graph from disk. GraphChi computes asynchronously, while all systems but GraphLab compute synchronously.

See the paper for more comparisons.

[Bar charts: runtime in minutes, GraphChi on a Mac Mini vs. distributed systems]
• PageRank, Twitter-2010 (1.5B edges): Spark (50 machines) vs GraphChi (Mac Mini)
• WebGraph Belief Propagation (U. Kang et al.), Yahoo-web (6.7B edges): Pegasus / Hadoop (100 machines) vs GraphChi (Mac Mini)
• Matrix Factorization (Alt. Least Sqr.), Netflix (99M edges): GraphLab v1 (8 cores) vs GraphChi (Mac Mini)
• Triangle Counting, twitter-2010 (1.5B edges): Hadoop (1636 machines) vs GraphChi (Mac Mini)

On a Mac Mini, GraphChi can solve as big problems as existing large-scale systems, with comparable performance.

40

PowerGraph Comparison

• PowerGraph / GraphLab 2 outperforms previous systems by a wide margin on natural graphs

• With 64x more machines, 512x more CPUs:
  – Pagerank: 40x faster than GraphChi
  – Triangle counting: 30x faster than GraphChi

OSDI '12

GraphChi has good performance / CPU

[Chart: PowerGraph (GraphLab 2) vs GraphChi]

41

In-memory Comparison

• Comparison to state-of-the-art: 5 iterations of Pagerank on Twitter (1.5B edges)

GraphChi, Mac Mini (SSD): 790 secs
Ligra (J. Shun, Blelloch), 40-core Intel E7-8870: 15 secs

However, sometimes a better algorithm is available for in-memory than for external-memory / distributed settings.

• Total runtime comparison to 1-shard GraphChi, with initial load + output write taken into account:

Ligra (J. Shun, Blelloch), 8-core Xeon 5550: 230 s + preproc 144 s
PSW, in-mem version, 700 shards (see Appendix), 8-core Xeon 5550: 100 s + preproc 210 s

42

Scalability Input Size [SSD]

• Throughput: number of edges processed / second

Conclusion: the throughput remains roughly constant when the graph size is increased.

GraphChi with a hard drive is ~2x slower than with an SSD (if the computational cost is low).

[Plot: PageRank throughput (edges/sec) vs. graph size, Mac Mini SSD; graphs: domain, twitter-2010, uk-2007-05, uk-union, yahoo-web]

Paper: scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

• What if there is a lot of metadata associated with edges and vertices?

• How to do graph queries efficiently, while retaining computational capabilities?

• How to add edges efficiently to the graph?

Can we design a graph database based on GraphChi?

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problems:
• Poor performance with data >> memory
• No/weak support for analytical computation

2) Relational / key-value databases as graph storage

Problems:
• Large indices
• In-edge / out-edge dilemma
• No/weak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1: sorted by source_id

Shard 1 | Shard 2 | Shard 3 | … | Shard P

0 … 100 | 101 … 700 | … | N

Edge = (src, dst). Partition by dst.

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find the in-edges of a vertex quickly?

Note: We know the shard, but the edges are in random order within it.

Pointer-array | Edge-array

51
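For intuition, a toy version of the pointer-array / edge-array lookup (plain Python lists instead of on-disk files; the numbers are those of the example shard above):

    import bisect

    pointers = [(1, 0), (3, 3), (7, 5)]    # (source_id, offset of its first edge), sorted by source
    edges = [8, 193, 76420, 12, 872, 193, 212, 89139]   # destination column of the edge-array

    def out_edges_in_shard(src):
        ids = [s for s, _ in pointers]
        i = bisect.bisect_left(ids, src)
        if i == len(ids) or ids[i] != src:
            return []                       # src has no edges stored in this shard
        start = pointers[i][1]
        end = pointers[i + 1][1] if i + 1 < len(pointers) else len(edges)
        return edges[start:end]

    print(out_edges_in_shard(3))            # [12, 872]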

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly?

Note: Sorted inside a shard, but partitioned across all shards

Augmented linked list for in-edges

53
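A toy illustration of the in-edge chain (array indices stand in for file offsets; the real shard packs the link next to each edge record):

    # Each edge-array entry: (destination, link to the next in-edge of the same destination, or -1).
    edge_array = [(8, -1), (193, 5), (76420, -1), (12, -1), (872, -1), (193, -1), (212, -1), (89139, -1)]
    first_in_edge = {8: 0, 12: 3, 193: 1}   # per-interval index to the first in-edge of each vertex

    def in_edges(vertex):
        result, i = [], first_in_edge.get(vertex, -1)
        while i != -1:
            result.append(i)        # the offset also locates the edge's column data
            i = edge_array[i][1]    # follow the chain to the next in-edge
        return result

    print(in_edges(193))            # [1, 5]: both edges pointing to vertex 193 in this shard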

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem: the pointer-array can be big, O(V). Binary search on disk is slow.

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1: Sparse index (in-memory)

Option 2: Delta-coding with a unary code (Elias-Gamma); completely in-memory

54
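For intuition, the textbook Elias-gamma code applied to the gaps between successive source-ids (a sketch of Option 2; GraphChi-DB's actual encoder may differ in details):

    def elias_gamma_encode(n):           # n >= 1
        b = bin(n)[2:]                   # binary representation without leading zeros
        return "0" * (len(b) - 1) + b    # unary length prefix + the value itself

    def elias_gamma_decode(bits):
        zeros = 0
        while bits[zeros] == "0":
            zeros += 1
        value = int(bits[zeros:2 * zeros + 1], 2)
        return value, bits[2 * zeros + 1:]

    deltas = [1, 2, 4]                   # gaps between sorted source-ids 1, 3, 7
    stream = "".join(elias_gamma_encode(d) for d in deltas)
    print(stream)                        # "101000100"
    out, rest = [], stream
    while rest:
        v, rest = elias_gamma_decode(rest)
        out.append(v)
    print(out)                           # [1, 2, 4]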

Experiment Indices

Median latency, Twitter graph, 1.5B edges

55

Queries: I/O costs

In-edge query: only one shard
Out-edge query: each shard that has edges

Trade-off: more shards gives better locality for in-edge queries, but worse for out-edge queries

Edge Data & Searches

Shard X: adjacency | 'weight' [float] | 'timestamp' [long] | 'belief' (factor)

Edge (i)

No foreign key is required to find the edge data (fixed-size records)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading the existing shard from disk: every edge is rewritten on each merge.

Does not scale: number of rewrites = data size / buffer size = O(E)

60

Log-Structured Merge-tree (LSM)

Ref: O'Neil, Cheng et al. (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 | interval 2 | interval 3 | interval 4 | … | interval P-1 | interval P

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query: one shard on each level

Out-edge query: all shards

61
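A simplified sketch of the log-structured merge idea behind ingest (illustrative only; real GraphChi-DB merges per-interval shard files on disk, here plain Python lists stand in for them):

    # New edges accumulate in an in-memory buffer; a full buffer is flushed as a sorted
    # run and merged downstream, so levels hold exponentially more edges and an edge is
    # rewritten only O(log E) times instead of O(E / buffer_size) times.
    BUFFER_CAPACITY = 4

    def flush(buffer, levels):
        run = sorted(buffer)              # edges sorted as in a shard
        buffer.clear()
        for i, level in enumerate(levels):
            if not level:                 # free slot on this level: place the run here
                levels[i] = run
                return
            run = sorted(level + run)     # downstream merge with the existing run
            levels[i] = []
        levels.append(run)                # everything was full: grow a new, oldest level

    buffer, levels = [], []
    for edge in [(3, 12), (1, 8), (7, 193), (1, 193), (3, 872), (7, 212), (1, 76420), (7, 89139)]:
        buffer.append(edge)
        if len(buffer) == BUFFER_CAPACITY:
            flush(buffer, levels)
    print([len(l) for l in levels])       # e.g. [0, 8]: level sizes grow geometrically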

Experiment Ingest

[Plot: edges inserted (up to 1.5e9) vs. time in seconds, GraphChi-DB with LSM vs. GraphChi-No-LSM]

62

Advantages of PAL

• Only sparse and implicit indices
  – The pointer-array usually fits in RAM with Elias-Gamma coding. Small database size.

• Columnar data model
  – Load only the data you need
  – Graph structure is separate from the data
  – Property graph model

• Great insertion throughput with LSM
  – The tree can be adjusted to match the workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala
• Queries & Computation
• Online database

All experiments shown in this talk were done on a Mac Mini (8 GB, SSD)

Source code and examples: http://github.com/graphchi

65

Comparison Database Size

Baseline: 4 + 4 bytes / edge

Compared: MySQL (data + indices), Neo4j, GraphChi-DB, BASELINE

[Bar chart: database file size for the twitter-2010 graph (1.5B edges), scale 0-70]

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch): 3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

[Bar charts: friends-of-friends query latency, in milliseconds]
Small graph, 50th percentile: GraphChi-DB 0.379 ms, Neo4j 0.127 ms
Small graph, 99th percentile: GraphChi-DB 6.653 ms, Neo4j 8.078 ms
Big graph, 50th percentile (scale 0-200 ms): GraphChi-DB, Neo4j, MySQL
Big graph, 99th percentile: GraphChi-DB 1264 ms, Neo4j 1631 ms, MySQL 4776 ms

Latency percentiles over 100K random queries

68M edges

1.5B edges

68

LinkBench Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
  – But only single-hop queries ("friends")
  – 8 different operations, mixed workload
  – Best performance with 64 parallel threads

• Each edge and vertex has:
  – Version, timestamp, type, random string payload

                          GraphChi-DB (Mac Mini)   MySQL + FB patch (server: SSD-array, 144 GB RAM)
Edge-update (95p)         22 ms                    25 ms
Edge-get-neighbors (95p)  18 ms                    9 ms
Avg throughput            2,487 req/s              11,029 req/s
Database size             350 GB                   1.4 TB

See full results in the thesis

69

Summary of Experiments

• Efficient for a mixed read/write workload
  – See the Facebook LinkBench experiments in the thesis
  – LSM-tree trades off read performance (but can be adjusted)

• State-of-the-art performance for graphs that are much larger than RAM
  – Neo4j's linked-list data structure is good for RAM
  – DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)
  – Two major direct descendant papers in top conferences:
    • X-Stream (SOSP 2013)
    • TurboGraph (KDD 2013)

• Challenging the mainstream
  – You can do a lot on just a PC: focus on the right data structures and computational models

73

Impact Users

• GraphChi's implementations (C++, Java, -DB) have gained a lot of users
  – Currently ~50 unique visitors / day

• Enables 'everyone' to tackle big graph problems
  – Especially the recommender toolkit (by Danny Bickson) has been very popular
  – Typical users: students, non-systems researchers, small companies…

74

Impact Users (cont)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research - it is too expensive.

Using GraphChi I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research."

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1. Changing the value of an edge (both in- and out-edges)

2. Computation processes the whole graph, or most of it, on each iteration

3. Random access to all of a vertex's edges
  – Vertex-centric vs. edge-centric

4. Async / Gauss-Seidel execution

u v

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
  – Or dynamic scheduling (such as Splash BP)

• High-diameter graphs, such as planar graphs
  – Unless the computation itself has short-range interactions

• Very large number of iterations
  – Neural networks
  – LDA with Collapsed Gibbs sampling

• No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

• Distributed setting
  1. Distributed PSW (one shard / node)
     1. PSW is inherently sequential
     2. Low bandwidth in the Cloud
  2. Co-operating GraphChi(-DB)'s connected with a Parameter Server

• New graph programming models and tools
  – Vertex-centric programming is sometimes too local: for example, two-hop interactions and many traversals are cumbersome
  – Abstractions for learning graph structure; implicit graphs
  – Hard to debug, especially async; better tools needed

• Graph-aware optimizations to GraphChi-DB
  – Buffer management
  – Smart caching
  – Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
  – Good: works with very little memory
  – Bad: does not benefit from more memory

• Simple trick: cache some data

• Many graph algorithms have O(V) state
  – The update function accesses neighbor vertex state
    • Standard PSW: 'broadcast' the vertex value via edges
    • Semi-external: store vertex values in memory

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
  – But not enough to store the whole graph

[Diagram: graph and PSW working memory on disk; computational state (Computation 1 state, Computation 2 state) in RAM]

83
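A minimal sketch of the semi-external trick (illustrative; the attribute names on the vertex object are placeholders, not GraphChi's API): the O(V) vertex values live in an in-memory array indexed by vertex id, while the edges are still streamed from disk by PSW.

    import numpy as np

    NUM_VERTICES = 4_200_000                                  # e.g. twitter-2010 scale
    values = np.full(NUM_VERTICES, 1.0 / NUM_VERTICES,
                     dtype=np.float32)                        # O(V) state in RAM (~17 MB)

    def update(vertex):
        # Called by the PSW loop while edges are streamed from disk; neighbor state is
        # read directly from the in-memory array instead of being broadcast via edges.
        insum = sum(values[src] for src in vertex.in_neighbor_ids)   # placeholder attribute
        values[vertex.id] = 0.15 + 0.85 * insum                      # Pagerank-style combine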

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
  – Store billions of random-walk states in RAM

• Multiple Breadth-First Searches
  – Analyze neighborhood sizes by starting hundreds of random BFSes

• Compute many different recommender algorithms in parallel (or with different parameterizations)
  – See Mayank Mohta and Shu-Hao Yu's Master's project

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                 GraphChi (40 Mac Minis)   PowerGraph (64 EC2 cc1.4xlarge)

Investments      67,320 $                  -

Operating costs:

Per node, hour   0.03 $                    1.30 $

Cluster, hour    1.19 $                    52.00 $

Daily            28.56 $                   1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions:
- Mac Mini: 85 W (typical servers: 500-1000 W)
- Most expensive US energy: 35c / kWh

Equal-throughput configurations (based on OSDI '12)

89
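As a sanity check on the payback figure (using the decimal readings restored above, which are an interpretation of the transcript):

\[
\frac{\$67{,}320}{\$1{,}248.00/\text{day} - \$28.56/\text{day}}
 \;=\; \frac{67{,}320}{1{,}219.44} \;\approx\; 55.2 \ \text{days} \;\approx\; 56 \ \text{days}.
\]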

PSW for In-memory Computation

• External memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM

• In-memory
  – Slow = RAM
  – Fast = CPU caches

[Diagram: CPU, small (fast) memory of size M, large (slow) memory of 'unlimited' size, block transfers of size B]

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space

• Async / G-S helps with high-diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP: see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

• See the paper for a theoretical analysis in Aggarwal-Vitter's I/O model
  – Worst case only 2x the best case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 … |V|   (v1, v2)

An edge inside one interval is loaded from disk only once per iteration.

An edge spanning two intervals is loaded twice per iteration.

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise they would require too many iterations

• Per-iteration cost is not much affected

• Can we optimize partitioning?
  – Could help thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression Would it help

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits/edge (web), ~10 bits/edge (social)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)

• Compression is problematic for evolving graphs and associated data

• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases
  – GOOD, GraphDB, HyperGraphDB…
  – Focus was on modeling; graph storage on top of a relational DB or key-value store

• RDF databases
  – Most do not use graph storage, but store triples as relations + use indexing

• Modern solutions have proposed graph-specific storage
  – Neo4j: doubly linked list
  – TurboGraph: adjacency list chopped into pages
  – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

[Plot: requests/sec (0-10,000) vs. number of nodes (1e7 to 1e9)]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory-extension [SSDAlloc, NSDI '11]
   → Too many small objects; need millions / sec

2. Compress the graph structure to fit into RAM [WebGraph framework]
   → Associated values do not compress well, and are mutated

3. Cluster the graph and handle each cluster separately in RAM
   → Expensive: the number of inter-cluster edges is big

4. Caching of hot nodes
   → Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks: parallel I/O

[Bar chart: seconds / iteration (0-3000) for Pagerank and Connected components, with 1, 2, and 3 hard drives]

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

[Bar chart: runtime breakdown (Disk IO, Graph construction, Exec. updates) with 1, 2, and 4 threads; Connected Components on Mac Mini, SSD]

• The cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM: memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

• Computationally intensive applications benefit substantially from parallel execution

• GraphChi saturates SSD I/O with 2 threads

[Bar charts: runtime in seconds (Loading vs. Computation) vs. number of threads (1, 2, 4), for Matrix Factorization (ALS) and Connected Components]

104

In-memory vs Disk

105

Experiment Query latency

See the thesis for an I/O cost analysis of in/out queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB
  – Sufficient to query for out-edges
  – Parallelizes well: multi-out-edge-query

• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from the graph

Vertices / Nodes

• Vertices are partitioned similarly as edges
  – Similar "data shards" for columns

• Lookup / update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
  – Sparse structure could be supported

[Diagram: internal vertex-id space 1 … a, a+1 … 2a, …, Max-id]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance the shards
  – Interval length: a constant a

[Table: original IDs 0, 1, 2, …, 255 map to internal IDs 0, 1, 2, …; original IDs 256, 257, 258, …, 511 map to a, a+1, a+2, …; the next block to 2a, 2a+1, …]

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2. … which is caused by the lack of good data available to academics: big graphs with metadata
  – Industry co-operation, but a problem with reproducibility
  – Also, it is hard to ask good questions about graphs (especially with just the structure)

3. Too much focus on performance; more important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems: each hop across a partition boundary is costly

Disk-based "single-machine" graph systems: "paging" from disk is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
  – Reverse thinking

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk

Page 10: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

10

Big Graphs = Big Data

Data size 140 billion connections

asymp 1 TB

Not a problem

Computation

Hard to scale

Twitter network visualization by Akshay Java 2009

11

Research Goal

Compute on graphs with billions of edges in a reasonable time on a single PC

ndash Reasonable = close to numbers previously reported for distributed systems in the literature

Experiment PC Mac Mini (2012)

12

Terminology

bull (Analytical) Graph ComputationndashWhole graph is processed typically for

several iterations vertex-centric computation

ndash Examples Belief Propagation Pagerank Community detection Triangle Counting Matrix Factorization Machine Learninghellip

bull Graph Queries (database)ndash Selective graph queries (compare to SQL

queries)ndash Traversals shortest-path friends-of-

friendshellip

13

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Thesis statement

The Parallel Sliding Windows algorithm and the Partitioned Adjacency Lists data structure enable

computation on very large graphs in external memory on just a personal computer

14

DISK-BASED GRAPH COMPUTATION

15

GraphChi (Parallel Sliding Windows)

Batchcomp

Evolving graph

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Matrix Factorization

Loopy Belief PropagationCo-EM

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

DrunkardMob Parallel Random walk simulation

GraphChi^2

Incremental comp

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Graph sampling

Graph Queries

Graph traversal

16

Computational Model

bull Graph G = (V E) ndash directed edges e =

(source destination)ndash each edge and vertex

associated with a value (user-defined type)

ndash vertex and edge values can be modifiedbull (structure modification also

supported) Data

Data Data

Data

DataDataData

DataData

Data

A Be

Terms e is an out-edge of A and in-edge of B

17

Data

Data Data

Data

DataDataData

Data

Data

Data

Vertex-centric Programming

bull ldquoThink like a vertexrdquobull Popularized by the Pregel and GraphLab

projectsndash Historically systolic computation and the Connection Machine

MyFunc(vertex) modify neighborhood Data

Data Data

Data

Data

function Pagerank(vertex) insum = sum(edgevalue for edge in vertexinedges) vertexvalue = 085 + 015 insum foreach edge in vertexoutedges edgevalue = vertexvalue vertexnum_outedges

18

Computational Setting

ConstraintsA Not enough memory to store the whole

graph in memory nor all the vertex values

B Enough memory to store one vertex and its edges w associated values

19

The Main Challenge of Disk-based Graph Computation

Random Access

~ 100K reads sec (commodity)~ 1M reads sec (high-end arrays)

ltlt 5-10 M random edges sec to achieve ldquoreasonable performancerdquo

20

Random Access Problem

File edge-values

A in-edges A out-edges

A B

B in-edges B out-edges

x

Moral You can either access in- or out-edges sequentially but not both

Random write

Random readProcessing sequentially

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

bull PSW processes the graph one sub-graph a time

bull In one iteration the whole graph is processedndash And typically next iteration is started

1 Load

2 Compute

3 Write

23

bull Vertices are numbered from 1 to nndash P intervalsndash sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 10000100 700

1 Load

2 Compute

3 Write

ltltpartition-bygtgt

source destinationedge

In shards edges sorted by source

24

Example Layout

Shard 1

Shards small enough to fit in memory balance size of shards

Shard in-edges for interval of vertices sorted by source-id

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Shard 2 Shard 3 Shard 4Shard 1

1 Load

2 Compute

3 Write

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

• External-memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM

• In-memory
  – Slow = RAM
  – Fast = CPU caches

[Figure: the classic external-memory model: a small (fast) memory of size M, a
large (slow) memory of 'unlimited' size, and the CPU moving data between them in
block transfers of size B]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

Julian Shun, Guy Blelloch, PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space (see the sketch below)

• Async / Gauss-Seidel helps with high-diameter graphs

• Some algorithms converge much better asynchronously
  – Loopy BP: see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)

• Asynchronous is sometimes difficult to reason about and debug

• Asynchronous can be used to implement BSP

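To make the space remark above concrete, a toy Python contrast of the two schedules on minimum-label propagation over a 5-vertex chain (an illustration only; the names and the single-sweep driver are made up for this example). The synchronous sweep must keep a second O(V) array of old values, while the Gauss-Seidel sweep updates one array in place and immediately reuses fresh values; note that how much the in-place sweep gains depends on the sweep order.

    edges = [(i, i + 1) for i in range(4)]        # chain 0-1-2-3-4
    labels = [0, 1, 2, 3, 4]

    def jacobi_sweep(lab):
        new = list(lab)                           # second O(V) array needed
        for u, v in edges:
            new[u] = min(new[u], lab[v])          # reads only old values
            new[v] = min(new[v], lab[u])
        return new

    def gauss_seidel_sweep(lab):
        for u, v in edges:                        # in place: one O(V) array,
            m = min(lab[u], lab[v])               # fresh values visible at once
            lab[u] = lab[v] = m
        return lab

    print(jacobi_sweep(labels))        # [0, 0, 1, 2, 3]: min advances one hop
    print(gauss_seidel_sweep(labels))  # [0, 0, 0, 0, 0]: done in one sweep here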
IO Complexity

• See the paper for a theoretical analysis in Aggarwal and Vitter's I/O model
  – Worst case is only 2x the best case

• Intuition:

[Figure: the vertex range 1..|V| split into interval(1), interval(2), ..., interval(P),
each with its shard(1), shard(2), ..., shard(P); example vertices v1 and v2]

An edge within a single interval is loaded from disk only once per iteration;
an edge spanning two intervals is loaded twice per iteration.

93
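Stated as a rough formula (a restatement of the intuition above, not the exact theorem; see the thesis for the precise Aggarwal-Vitter analysis): counting one read and one write-back of every edge value per iteration, and charging edges that span two intervals twice, one full PSW iteration over edge data stored in blocks of size B costs

    2|E| / B  ≤  block transfers per iteration  ≤  4|E| / B,
    plus Θ(P²) non-sequential seeks,

so the worst case is only a factor of two above the best case, matching the bullet above.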

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise they would require too many iterations

• Per-iteration cost is not much affected

• Can we optimize partitioning?
  – Could help thanks to Gauss-Seidel (faster convergence inside "groups"), topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., WebGraph framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)

• Compression is problematic for evolving graphs and associated data

• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases
  – GOOD, GraphDB, HyperGraphDB, ...
  – Focus was on modeling; graph storage on top of a relational DB or key-value store

• RDF databases
  – Most do not use graph storage but store triples as relations + use indexing

• Modern solutions have proposed graph-specific storage
  – Neo4j: doubly linked list
  – TurboGraph: adjacency list chopped into pages
  – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time: 9 hours; FB's: 12 hours

• GraphChi database: about 250 GB; FB: > 14 terabytes
  – However, about 100 GB is explained by different variable data (payload) size

• Facebook: MySQL via JDBC; GraphChi: embedded
  – But MySQL is native code, GraphChi-DB is Scala (JVM)

• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance vs. size of DB

[Figure: requests / sec (0 to 10,000) as a function of the number of nodes
(1.00E+07 to 1.00E+09)]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

Each option is paired below with its main drawback:

1. Use SSD as a memory-extension [SSDAlloc, NSDI'11]
   – Too many small objects; need millions / sec

2. Compress the graph structure to fit into RAM [WebGraph framework]
   – Associated values do not compress well, and are mutated

3. Cluster the graph and handle each cluster separately in RAM
   – Expensive: the number of inter-cluster edges is big

4. Caching of hot nodes
   – Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks: parallel I/O

[Figure: seconds / iteration (0 to 3000) for Pagerank and Connected components,
with 1, 2, and 3 hard drives]

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

[Figure: runtime (0 to 2500 s) broken into Disk IO, Graph construction, and Exec
updates, for 1, 2, and 4 threads; Connected Components on Mac Mini SSD]

• Cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM: memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

• Computationally intensive applications benefit substantially from parallel execution

• GraphChi saturates SSD I/O with 2 threads

[Figure: runtime in seconds, split into Loading and Computation, versus the number
of threads (1, 2, 4); left: Matrix Factorization (ALS), right: Connected Components]

104

In-memory vs Disk

105

Experiment Query latency

See thesis for I/O cost analysis of in/out queries

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB
  – Sufficient to query for out-edges (see the sketch below)
  – Parallelizes well: multi-out-edge-query

• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from the graph

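A sketch of the out-edge-only strategy (hypothetical Python; out_edges(v) stands in for a GraphChi-DB out-edge query and is an assumption for illustration): since every edge of the induced subgraph leaves a vertex of S, it is enough to query the out-edges of each vertex in S and keep the endpoints that are also in S.

    def induced_subgraph(S, out_edges):
        members = set(S)                      # the vertex set S
        sub_edges = []
        for v in sorted(members):             # one out-edge query per vertex;
            for dst in out_edges(v):          # these queries parallelize well
                if dst in members:            # keep edges with both ends in S
                    sub_edges.append((v, dst))
        return sub_edges

    # toy usage with an in-memory adjacency map standing in for the database
    adj = {0: [1, 5], 1: [2], 2: [0], 5: [2]}
    print(induced_subgraph({0, 1, 2}, lambda v: adj.get(v, [])))
    # -> [(0, 1), (1, 2), (2, 0)]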
Vertices / Nodes

• Vertices are partitioned similarly as edges
  – Similar "data shards" for columns

• Lookup / update of vertex data is O(1)

• No merge tree here: vertex files are "dense"
  – Sparse structure could be supported

[Figure: vertex-id axis 1, ..., a, a+1, ..., 2a, ..., Max-id]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance shards (one possible reconstruction of the mapping is sketched below)
  – Interval length: constant a

[Figure: original IDs 0, 1, 2, ..., 255 map to internal IDs 0, 1, 2, ...;
original IDs 256, 257, 258, ..., 511 map to internal IDs a, a+1, a+2, ...;
the next block to 2a, 2a+1, ..., and so on]

109
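One possible reconstruction of the mapping pictured above (an assumption for illustration, not necessarily the exact scheme used in GraphChi-DB): deal consecutive blocks of 256 original IDs round-robin over the P intervals, so that a dense range of original IDs is spread evenly across shards; with interval length a, block b of original IDs lands at offset (b div P) * 256 inside interval (b mod P).

    # Hypothetical reconstruction of the balancing ID map (assumed scheme:
    # blocks of 256 consecutive original ids dealt round-robin over P
    # intervals of length a).
    BLOCK = 256

    def to_internal(orig, P, a):
        b, off = divmod(orig, BLOCK)          # block index, offset inside block
        return (b % P) * a + (b // P) * BLOCK + off

    def to_original(internal, P, a):
        interval, pos = divmod(internal, a)   # which interval, position in it
        rnd, off = divmod(pos, BLOCK)
        return (rnd * P + interval) * BLOCK + off

    # round-trip check on a toy configuration: 4 intervals of length 1024
    P, a = 4, 1024
    assert all(to_original(to_internal(i, P, a), P, a) == i for i in range(4096))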

What if we have a Cluster

[Figure: as in the semi-external setting, each machine keeps the graph and the PSW
working memory on disk and the computational state (Computation 1 state,
Computation 2 state, ...) in RAM]

Trade latency for throughput

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. ... which is caused by lack of good data available to academics: big graphs with metadata
   – Industry co-operation? But problem with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just the structure)

3. Too much focus on performance. More important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk a time (multiple in parallel, of course):

  parfor walk in walks:
      for i = 1 to numsteps:
          vertex = walk.atVertex()
          walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

Distributed graph systems:
- Each hop across a partition boundary is costly

Disk-based "single-machine" graph systems:
- "Paging" from disk is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
  – Reverse thinking:

  ForEach interval p:
      walkSnapshot = getWalksForInterval(p)
      ForEach vertex in interval(p):
          mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
          ForEach walk in mywalks:
              walkManager.addHop(walk, vertex.randomNeighbor())

Note: need to store only the current position of each walk.

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW & External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact: "Big Data" Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 11: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

11

Research Goal

Compute on graphs with billions of edges in a reasonable time on a single PC

ndash Reasonable = close to numbers previously reported for distributed systems in the literature

Experiment PC Mac Mini (2012)

12

Terminology

bull (Analytical) Graph ComputationndashWhole graph is processed typically for

several iterations vertex-centric computation

ndash Examples Belief Propagation Pagerank Community detection Triangle Counting Matrix Factorization Machine Learninghellip

bull Graph Queries (database)ndash Selective graph queries (compare to SQL

queries)ndash Traversals shortest-path friends-of-

friendshellip

13

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Thesis statement

The Parallel Sliding Windows algorithm and the Partitioned Adjacency Lists data structure enable

computation on very large graphs in external memory on just a personal computer

14

DISK-BASED GRAPH COMPUTATION

15

GraphChi (Parallel Sliding Windows)

Batchcomp

Evolving graph

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Matrix Factorization

Loopy Belief PropagationCo-EM

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

DrunkardMob Parallel Random walk simulation

GraphChi^2

Incremental comp

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Graph sampling

Graph Queries

Graph traversal

16

Computational Model

bull Graph G = (V E) ndash directed edges e =

(source destination)ndash each edge and vertex

associated with a value (user-defined type)

ndash vertex and edge values can be modifiedbull (structure modification also

supported) Data

Data Data

Data

DataDataData

DataData

Data

A Be

Terms e is an out-edge of A and in-edge of B

17

Data

Data Data

Data

DataDataData

Data

Data

Data

Vertex-centric Programming

bull ldquoThink like a vertexrdquobull Popularized by the Pregel and GraphLab

projectsndash Historically systolic computation and the Connection Machine

MyFunc(vertex) modify neighborhood Data

Data Data

Data

Data

function Pagerank(vertex) insum = sum(edgevalue for edge in vertexinedges) vertexvalue = 085 + 015 insum foreach edge in vertexoutedges edgevalue = vertexvalue vertexnum_outedges

18

Computational Setting

ConstraintsA Not enough memory to store the whole

graph in memory nor all the vertex values

B Enough memory to store one vertex and its edges w associated values

19

The Main Challenge of Disk-based Graph Computation

Random Access

~ 100K reads sec (commodity)~ 1M reads sec (high-end arrays)

ltlt 5-10 M random edges sec to achieve ldquoreasonable performancerdquo

20

Random Access Problem

File edge-values

A in-edges A out-edges

A B

B in-edges B out-edges

x

Moral You can either access in- or out-edges sequentially but not both

Random write

Random readProcessing sequentially

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

bull PSW processes the graph one sub-graph a time

bull In one iteration the whole graph is processedndash And typically next iteration is started

1 Load

2 Compute

3 Write

23

bull Vertices are numbered from 1 to nndash P intervalsndash sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 10000100 700

1 Load

2 Compute

3 Write

ltltpartition-bygtgt

source destinationedge

In shards edges sorted by source

24

Example Layout

Shard 1

Shards small enough to fit in memory balance size of shards

Shard in-edges for interval of vertices sorted by source-id

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Shard 2 Shard 3 Shard 4Shard 1

1 Load

2 Compute

3 Write

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 12: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

12

Terminology

bull (Analytical) Graph ComputationndashWhole graph is processed typically for

several iterations vertex-centric computation

ndash Examples Belief Propagation Pagerank Community detection Triangle Counting Matrix Factorization Machine Learninghellip

bull Graph Queries (database)ndash Selective graph queries (compare to SQL

queries)ndash Traversals shortest-path friends-of-

friendshellip

13

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Thesis statement

The Parallel Sliding Windows algorithm and the Partitioned Adjacency Lists data structure enable

computation on very large graphs in external memory on just a personal computer

14

DISK-BASED GRAPH COMPUTATION

15

GraphChi (Parallel Sliding Windows)

Batchcomp

Evolving graph

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Matrix Factorization

Loopy Belief PropagationCo-EM

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

DrunkardMob Parallel Random walk simulation

GraphChi^2

Incremental comp

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Graph sampling

Graph Queries

Graph traversal

16

Computational Model

bull Graph G = (V E) ndash directed edges e =

(source destination)ndash each edge and vertex

associated with a value (user-defined type)

ndash vertex and edge values can be modifiedbull (structure modification also

supported) Data

Data Data

Data

DataDataData

DataData

Data

A Be

Terms e is an out-edge of A and in-edge of B

17

Data

Data Data

Data

DataDataData

Data

Data

Data

Vertex-centric Programming

bull ldquoThink like a vertexrdquobull Popularized by the Pregel and GraphLab

projectsndash Historically systolic computation and the Connection Machine

MyFunc(vertex) modify neighborhood Data

Data Data

Data

Data

function Pagerank(vertex) insum = sum(edgevalue for edge in vertexinedges) vertexvalue = 085 + 015 insum foreach edge in vertexoutedges edgevalue = vertexvalue vertexnum_outedges

18

Computational Setting

ConstraintsA Not enough memory to store the whole

graph in memory nor all the vertex values

B Enough memory to store one vertex and its edges w associated values

19

The Main Challenge of Disk-based Graph Computation

Random Access

~ 100K reads sec (commodity)~ 1M reads sec (high-end arrays)

ltlt 5-10 M random edges sec to achieve ldquoreasonable performancerdquo

20

Random Access Problem

File edge-values

A in-edges A out-edges

A B

B in-edges B out-edges

x

Moral You can either access in- or out-edges sequentially but not both

Random write

Random readProcessing sequentially

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

bull PSW processes the graph one sub-graph a time

bull In one iteration the whole graph is processedndash And typically next iteration is started

1 Load

2 Compute

3 Write

23

bull Vertices are numbered from 1 to nndash P intervalsndash sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 10000100 700

1 Load

2 Compute

3 Write

ltltpartition-bygtgt

source destinationedge

In shards edges sorted by source

24

Example Layout

Shard 1

Shards small enough to fit in memory balance size of shards

Shard in-edges for interval of vertices sorted by source-id

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Shard 2 Shard 3 Shard 4Shard 1

1 Load

2 Compute

3 Write

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Compressed Sparse Row (CSR):

Pointer-array (Source | File offset): 1 | 0, 3 | 3, 7 | 5, ...
Edge-array (Destination): 8, 193, 76420, 12, 872, 193, 212, 89139, ...

Problem 1: How to find the in-edges of a vertex quickly?
Note: We know the shard, but within it the destinations appear in random order (edges are sorted by source, not destination).

51
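A minimal Python sketch of this basic layout, using in-memory lists in place of the on-disk files; the (source, offset) pairs and destination list mirror the pointer-array and edge-array above.

# In-memory sketch of the basic CSR shard layout: out-edges inside one shard are a contiguous slice.
def build_csr(shard_edges):
    # shard_edges: (src, dst) pairs already sorted by src, as inside a shard
    pointer_array, edge_array, prev_src = [], [], None
    for src, dst in shard_edges:
        if src != prev_src:
            pointer_array.append((src, len(edge_array)))
            prev_src = src
        edge_array.append(dst)
    return pointer_array, edge_array

def out_edges_in_shard(pointer_array, edge_array, v):
    for i, (src, offset) in enumerate(pointer_array):
        if src == v:
            end = pointer_array[i + 1][1] if i + 1 < len(pointer_array) else len(edge_array)
            return edge_array[offset:end]
    return []

ptrs, dsts = build_csr([(1, 8), (1, 193), (1, 76420), (3, 12), (3, 872), (7, 193), (7, 212), (7, 89139)])
# ptrs == [(1, 0), (3, 3), (7, 5)] as on the slide; out_edges_in_shard(ptrs, dsts, 3) == [12, 872]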

PAL In-edge Linkage

Pointer-array (Source | File offset): 1 | 0, 3 | 3, 7 | 5, ...
Edge-array (Destination): 8, 193, 76420, 12, 872, 193, 212, 89139, ...

52

PAL In-edge Linkage

Pointer-array (Source | File offset): 1 | 0, 3 | 3, 7 | 5, ...
Edge-array (Destination | Link): 8 | 3339, 193 | 3, 76420 | 1092, 12 | 289, 872 | 40, 193 | 2002, 212 | 12, 89139 | 22, ...

+ Index to the first in-edge for each vertex in the interval.

Augmented linked list for in-edges.

Problem 2: How to find out-edges quickly?
Note: Sorted inside a shard, but partitioned across all shards.

53
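The linkage can be sketched in Python as follows; this is an in-memory stand-in for the on-disk arrays (the real format stores file offsets rather than list positions).

NO_MORE = -1

def build_in_edge_links(shard_edges):
    # shard_edges: (src, dst) pairs sorted by src, so destinations appear in "random" order
    edge_array = [[dst, NO_MORE] for _, dst in shard_edges]   # (destination, link-to-next-in-edge)
    first_in_edge, last_seen = {}, {}
    for pos, (_, dst) in enumerate(shard_edges):
        if dst in last_seen:
            edge_array[last_seen[dst]][1] = pos               # chain the previous in-edge to this one
        else:
            first_in_edge[dst] = pos                          # index to the first in-edge
        last_seen[dst] = pos
    return edge_array, first_in_edge

def source_of(pointer_array, edge_pos):
    # recover an edge's source from the pointer-array: the largest source whose offset <= edge_pos
    src = None
    for s, offset in pointer_array:
        if offset <= edge_pos:
            src = s
    return src

def in_edges_in_shard(edge_array, first_in_edge, pointer_array, v):
    sources, pos = [], first_in_edge.get(v, NO_MORE)
    while pos != NO_MORE:
        sources.append(source_of(pointer_array, pos))
        pos = edge_array[pos][1]
    return sources

edges = [(1, 8), (1, 193), (1, 76420), (3, 12), (3, 872), (7, 193), (7, 212), (7, 89139)]
ptrs = [(1, 0), (3, 3), (7, 5)]
links, first_in = build_in_edge_links(edges)
# in_edges_in_shard(links, first_in, ptrs, 193) == [1, 7]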

PAL Out-edge Queries

Pointer-array (Source | File offset): 1 | 0, 3 | 3, 7 | 5, ...
Edge-array (Destination | Next-in-offset): 8 | 3339, 193 | 3, 76420 | 1092, 12 | 289, 872 | 40, 193 | 2002, 212 | 12, 89139 | 22, ...

Problem: the pointer-array can be big -- O(V). Binary search on disk is slow.

Option 1: Sparse index (in-memory): 256, 512, 1024, ...
Option 2: Delta-coding with unary code (Elias-Gamma). Completely in-memory.

54
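Option 2 works because the sources in the pointer-array are strictly increasing, so only the gaps need to be stored. A small Python sketch of Elias-Gamma delta-coding (illustrative; not GraphChi-DB's actual encoder):

def elias_gamma_encode(values):
    bits = []
    for v in values:                          # every v must be >= 1
        b = bin(v)[2:]                        # binary, most significant bit first
        bits.append("0" * (len(b) - 1) + b)   # unary length prefix followed by the binary digits
    return "".join(bits)

def elias_gamma_decode(bits):
    values, i = [], 0
    while i < len(bits):
        zeros = 0
        while bits[i] == "0":
            zeros, i = zeros + 1, i + 1
        values.append(int(bits[i:i + zeros + 1], 2))
        i += zeros + 1
    return values

def compress_sources(sources):
    # delta-code a strictly increasing sequence; +1 on the first value keeps every delta >= 1
    deltas = [sources[0] + 1] + [b - a for a, b in zip(sources, sources[1:])]
    return elias_gamma_encode(deltas)

def decompress_sources(bits):
    out, acc = [], -1
    for d in elias_gamma_decode(bits):
        acc += d
        out.append(acc)
    return out

packed = compress_sources([1, 3, 7, 12, 4000])
assert decompress_sources(packed) == [1, 3, 7, 12, 4000]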

Experiment Indices

Median latency, Twitter graph (1.5B edges).

55

Queries: I/O costs

In-edge query: only one shard.
Out-edge query: each shard that has edges.

Trade-off: more shards means better locality for in-edge queries, but worse for out-edge queries.

Edge Data & Searches

Shard X: adjacency + edge-data columns, e.g. 'weight' [float], 'timestamp' [long], 'belief' (factor).

Edge (i): no foreign key is required to find the edge data (fixed size per column entry).

Note: vertex values are stored similarly.

57
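A Python sketch of the idea, with assumed file names and formats (not the real GraphChi-DB file layout): each edge property is a column file of fixed-size entries, so edge i's value is located by offset arithmetic alone.

import struct

class EdgeColumn:
    # One file per edge property; entries are fixed-size, so edge i sits at offset i * size.
    def __init__(self, path, fmt):          # e.g. fmt="f" for 'weight', "q" for 'timestamp'
        self.path, self.fmt = path, fmt
        self.size = struct.calcsize(fmt)

    def get(self, edge_idx):
        with open(self.path, "rb") as f:
            f.seek(edge_idx * self.size)
            return struct.unpack(self.fmt, f.read(self.size))[0]

    def set(self, edge_idx, value):
        with open(self.path, "r+b") as f:
            f.seek(edge_idx * self.size)
            f.write(struct.pack(self.fmt, value))

# Usage: a 'weight' column for three edges; read edge 1's weight back without any index lookup.
with open("weight.col", "wb") as f:
    f.write(struct.pack("fff", 0.5, 1.25, 3.0))
assert EdgeColumn("weight.col", "f").get(1) == 1.25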

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading the existing shard from disk; each edge is rewritten on every merge.

Does not scale: number of rewrites = data size / buffer size = O(E).

60

Log-Structured Merge-tree (LSM)
Ref: O'Neil, Cheng et al. (1996)

LEVEL 1 (youngest): top shards with in-memory buffers; new edges enter here. Intervals 1, 2, 3, ..., P-1, P.
LEVEL 2: on-disk shards covering intervals 1--4, ..., (P-4)--P.
LEVEL 3 (oldest): on-disk shards covering intervals 1--16, ...

Downstream merge moves edges to the older levels.

In-edge query: one shard on each level.
Out-edge query: all shards.

61
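A toy Python rendition of the log-structured merge idea; the buffer size, fan-out and the flat sorted runs are illustrative simplifications (the real structure merges interval-partitioned shards).

BUFFER_CAP = 4      # edges per in-memory buffer before a flush (tiny, for illustration)
FANOUT = 4          # how many level-k runs merge into one level-(k+1) run

class LSMEdges:
    def __init__(self):
        self.buffer = []              # youngest level: in-memory, unsorted
        self.levels = [[], [], []]    # older levels: lists of sorted runs ("shards")

    def add_edge(self, src, dst):
        self.buffer.append((src, dst))
        if len(self.buffer) >= BUFFER_CAP:
            self._flush()

    def _flush(self):
        self.levels[0].append(sorted(self.buffer))
        self.buffer = []
        for k in range(len(self.levels) - 1):         # downstream merge of any full level
            if len(self.levels[k]) >= FANOUT:
                merged = sorted(e for run in self.levels[k] for e in run)
                self.levels[k + 1].append(merged)
                self.levels[k] = []

    def out_edges(self, v):
        # out-edge queries must look at the buffer and at every run on every level
        hits = [d for s, d in self.buffer if s == v]
        for level in self.levels:
            for run in level:
                hits += [d for s, d in run if s == v]
        return hits

db = LSMEdges()
for e in [(1, 2), (3, 4), (1, 5), (2, 3), (1, 9)]:
    db.add_edge(*e)
print(db.out_edges(1))   # [9, 2, 5]: one hit still in the buffer, two in a flushed run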

Experiment Ingest

[Chart: edges ingested (up to 1.5B) vs. time in seconds (up to ~16,000), GraphChi-DB with LSM vs. GraphChi-No-LSM]

62

Advantages of PAL

• Only sparse and implicit indices
– Pointer-array usually fits in RAM with Elias-Gamma coding
– Small database size

• Columnar data model
– Load only the data you need
– Graph structure is separate from data
– Property graph model

• Great insertion throughput with LSM
– Tree can be adjusted to match the workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala
• Queries & Computation
• Online database

All experiments shown in this talk were done on a Mac Mini (8 GB RAM, SSD).

Source code and examples: http://github.com/graphchi

65

Comparison Database Size

[Chart: database file size (twitter-2010 graph, 1.5B edges) for MySQL (data + indices), Neo4j, GraphChi-DB, and a baseline of 4 + 4 bytes / edge]

66

Comparison Ingest

System | Time to ingest 1.5B edges
GraphChi-DB (ONLINE) | 1 hour 45 mins
Neo4j (batch) | 45 hours
MySQL (batch) | 3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes.

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

[Bar charts: latency percentiles (milliseconds) over 100K random queries.
Small graph (68M edges), 50th percentile: GraphChi-DB 0.379 ms, Neo4j 0.127 ms.
Small graph (68M edges), 99th percentile: GraphChi-DB 6.653 ms, Neo4j 8.078 ms.
Big graph (1.5B edges), 50th percentile: GraphChi-DB vs. Neo4j vs. MySQL.
Big graph (1.5B edges), 99th percentile: GraphChi-DB 1264 ms, Neo4j 1631 ms, MySQL 4776 ms.]

68
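For reference, the friends-of-friends query itself is just two rounds of out-neighbor queries; a Python sketch against a hypothetical out_neighbors accessor (not the GraphChi-DB API):

# Friends-of-friends = vertices exactly two hops away, excluding v and its direct friends.
def friends_of_friends(out_neighbors, v):
    friends = set(out_neighbors(v))
    fof = set()
    for f in friends:                       # one out-edge query per friend
        fof.update(out_neighbors(f))
    return fof - friends - {v}

adj = {1: [2, 3], 2: [4], 3: [4, 5]}
print(friends_of_friends(lambda u: adj.get(u, []), 1))   # {4, 5}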

LinkBench: Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
– But only single-hop queries ("friends")
– 8 different operations, mixed workload
– Best performance with 64 parallel threads

• Each edge and vertex has:
– Version, timestamp, type, random string payload

Metric | GraphChi-DB (Mac Mini) | MySQL + FB patch (server, SSD-array, 144 GB RAM)
Edge-update (95p) | 22 ms | 25 ms
Edge-get-neighbors (95p) | 18 ms | 9 ms
Avg throughput | 2,487 req/s | 11,029 req/s
Database size | 350 GB | 1.4 TB

See full results in the thesis

69

Summary of Experiments

• Efficient for mixed read/write workload
– See Facebook LinkBench experiments in the thesis
– LSM-tree trades off read performance (but can adjust)

• State-of-the-art performance for graphs that are much larger than RAM
– Neo4j's linked-list data structure is good for RAM
– DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater Impact / Hindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)
– Two major direct descendant papers in top conferences:
• X-Stream: SOSP 2013
• TurboGraph: KDD 2013

• Challenging the mainstream
– You can do a lot on just a PC: focus on the right data structures and computational models

73

Impact Users

• GraphChi's implementations (C++, Java, -DB) have gained a lot of users
– Currently ~50 unique visitors / day

• Enables 'everyone' to tackle big graph problems
– Especially the recommender toolkit (by Danny Bickson) has been very popular
– Typical users: students, non-systems researchers, small companies...

74

Impact Users (cont)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research: it is too expensive.

Using GraphChi, I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research."

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models

1. Changing the value of an edge (both in- and out-edges)
2. Computation processes the whole graph, or most of it, on each iteration
3. Random access to all of a vertex's edges
– Vertex-centric vs. edge-centric
4. Async / Gauss-Seidel execution

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
– Or dynamic scheduling (such as Splash BP)

• High-diameter graphs, such as planar graphs
– Unless the computation itself has short-range interactions

• Very large number of iterations
– Neural networks
– LDA with Collapsed Gibbs sampling

• No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows) and GraphChi-DB (Partitioned Adjacency Lists)

Batch comp. | Evolving graph | Online graph updates | Incremental comp.
DrunkardMob: Parallel Random walk simulation | GraphChi^2

Graph Computation: Triangle Counting, Item-Item Similarity, PageRank, SALSA, HITS, Graph Contraction, Minimum Spanning Forest, k-Core, Weakly Connected Components, Strongly Connected Components, Community Detection, Label Propagation, Multi-BFS, Matrix Factorization, Loopy Belief Propagation, Co-EM

Graph Queries: Shortest Path, Friends-of-Friends, Induced Subgraphs, Neighborhood query, Edge and vertex properties, Link prediction, Graph sampling, Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

• Distributed setting
1. Distributed PSW (one shard / node)
- Problem 1: PSW is inherently sequential
- Problem 2: low bandwidth in the Cloud
2. Co-operating GraphChi(-DB)'s connected with a Parameter Server

• New graph programming models and tools
– Vertex-centric programming is sometimes too local: for example, two-hop interactions and many traversals are cumbersome
– Abstractions for learning graph structure; implicit graphs
– Hard to debug, especially async; better tools needed

• Graph-aware optimizations to GraphChi-DB
– Buffer management
– Smart caching
– Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
– Good: works with very little memory
– Bad: does not benefit from more memory

• Simple trick: cache some data

• Many graph algorithms have O(V) state
– The update function accesses neighbor vertex state
• Standard PSW: 'broadcast' the vertex value via edges
• Semi-external: store vertex values in memory

82
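A Python sketch of the semi-external idea for PageRank (illustrative, not the GraphChi API; shards are passed in as plain edge lists, and the damping constants are the usual ones rather than necessarily those used in the thesis):

import numpy as np

def pagerank_semi_external(num_vertices, shards, iterations=5, d=0.85):
    # shards: iterable of edge lists standing in for the on-disk shard files
    out_degree = np.zeros(num_vertices)
    for shard in shards:                    # first streaming pass: out-degrees
        for src, dst in shard:
            out_degree[src] += 1
    ranks = np.full(num_vertices, 1.0)      # the O(V) state that stays in RAM
    for _ in range(iterations):
        sums = np.zeros(num_vertices)
        for shard in shards:                # stream the edges; no in-memory graph needed
            for src, dst in shard:
                sums[dst] += ranks[src] / out_degree[src]
        ranks = (1 - d) + d * sums
    return ranks

print(pagerank_semi_external(4, [[(0, 1), (1, 2)], [(2, 3), (3, 0)]]))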

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
– But not enough to store the whole graph

[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, ...) in RAM]

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
– Store billions of random walk states in RAM

• Multiple Breadth-First Searches
– Analyze neighborhood sizes by starting hundreds of random BFSes

• Compute in parallel many different recommender algorithms (or with different parameterizations)
– See Mayank Mohta and Shu-Hao Yu's Master's project

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin) - UAI 2010
Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors) - VLDB 2012
GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch) - OSDI 2012
DrunkardMob: Billions of Random Walks on Just a PC - ACM RecSys 2013
Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch) - SEA 2014
GraphChi-DB: Simple Design for a Scalable Graph Database - on Just a PC (with C. Guestrin) - submitted (arXiv)
Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin) - ICML 2011

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external memory graph computation: GraphChi

• Extended PSW to design Partitioned Adjacency Lists, to build a scalable graph database: GraphChi-DB

• Proposed DrunkardMob for simulating billions of random walks in parallel

• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms: a new approach for external-memory graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

| GraphChi (40 Mac Minis) | PowerGraph (64 EC2 cc1.4xlarge)
Investments | $67,320 | -
Operating costs:
Per node / hour | $0.03 | $1.30
Cluster / hour | $1.19 | $52.00
Daily | $28.56 | $1,248.00

It takes about 56 days to recoup the Mac Mini investments.

Assumptions:
- Mac Mini: 85 W (typical servers: 500-1000 W)
- Most expensive US energy: 35c / kWh
- Equal-throughput configurations (based on OSDI'12)

89

PSW for In-memory Computation

• External memory setting
– Slow memory = hard disk / SSD
– Fast memory = RAM

• In-memory
– Slow = RAM
– Fast = CPU caches

[Diagram: small (fast) memory of size M; large (slow) memory of 'unlimited' size; block transfers of size B between them and the CPU]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW slightly outperforms Ligra, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
– But needs twice the amount of space

• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
– Loopy BP: see Gonzalez et al. (2009)
– Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)

• Asynchronous is sometimes difficult to reason about and debug
• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in Aggarwal and Vitter's I/O model
– Worst case is only 2x the best case

• Intuition:

[Diagram: shard(1), shard(2), ..., shard(P) over interval(1), interval(2), ..., interval(P) of vertices 1..|V|; an edge (v1, v2)]

An edge within one interval is loaded from disk only once per iteration.
An edge spanning two intervals is loaded twice per iteration.

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; a high diameter would require too many iterations

• Per-iteration cost is not much affected by graph structure

• Can we optimize partitioning?
– Could help, thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort
– Likely too expensive to do on a single PC

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
– But they require graph partitioning, which needs a lot of memory
– Compression of large graphs can take days (personal communication)

• Compression is problematic for evolving graphs and associated data

• GraphChi can be used to compress graphs
– Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases
– GOOD, GraphDB, HyperGraphDB, ...
– Focus was on modeling; graph storage on top of a relational DB or key-value store

• RDF databases
– Most do not use graph storage, but store triples as relations + use indexing

• Modern solutions have proposed graph-specific storage
– Neo4j: doubly linked list
– TurboGraph: adjacency list chopped into pages
– DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time: 9 hours; FB's: 12 hours
• GraphChi database about 250 GB; FB's > 1.4 terabytes
– However, about 100 GB is explained by different variable-data (payload) size

• Facebook: MySQL via JDBC; GraphChi: embedded
– But MySQL is native code, GraphChi-DB is Scala (JVM)

• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance vs. size of DB

[Chart: LinkBench requests / sec as a function of the number of nodes (1.0E+07 to 1.0E+09)]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory-extension [SSDAlloc, NSDI'11] -> Too many small objects; need millions / sec.
2. Compress the graph structure to fit into RAM [WebGraph framework] -> Associated values do not compress well and are mutated.
3. Cluster the graph and handle each cluster separately in RAM -> Expensive; the number of inter-cluster edges is big.
4. Caching of hot nodes -> Unpredictable performance.

Number of Shards

• If P is in the "dozens", there is not much effect on performance.

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks: parallel I/O.

[Chart: seconds / iteration for Pagerank and Connected components with 1, 2, and 3 hard drives. Experiment on a 16-core AMD server (from year 2007).]

Bottlenecks

[Chart: runtime breakdown (Disk IO, Graph construction, Exec. updates) for 1, 2, and 4 threads; Connected Components on Mac Mini SSD]

• The cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
– Graph construction requires a lot of random access in RAM: memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution.
• GraphChi saturates SSD I/O with 2 threads.

[Charts: runtime in seconds, split into loading and computation, for 1, 2, and 4 threads; Matrix Factorization (ALS) and Connected Components]

104

In-memory vs Disk

105

Experiment Query latency

See thesis for I/O cost analysis of in/out queries.

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S.

• Very fast in GraphChi-DB
– Sufficient to query for out-edges
– Parallelizes well: multi-out-edge-query

• Can be used for statistical graph analysis
– Sample induced neighborhoods, induced FoF neighborhoods from the graph
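A Python sketch of why out-edge queries suffice (the out_neighbors accessor is a stand-in, not the GraphChi-DB API): every edge of the induced subgraph is an out-edge of some vertex in S.

def induced_subgraph(out_neighbors, S):
    # one out-edge query per vertex of S; edges whose other endpoint is also in S are kept
    S = set(S)
    return [(v, w) for v in S for w in out_neighbors(v) if w in S]

adj = {1: [2, 3], 2: [3], 3: [1, 4]}
print(induced_subgraph(lambda u: adj.get(u, []), {1, 2, 3}))   # [(1, 2), (1, 3), (2, 3), (3, 1)]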

Vertices / Nodes
• Vertices are partitioned similarly to edges
– Similar "data shards" for columns

• Lookup / update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
– Sparse structure could be supported

[Diagram: vertex ID space 1 ... a, a+1 ... 2a, ..., Max-id]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance the shards
– Interval length is a constant a

[Diagram: original IDs 0, 1, 2, ..., 255 map to internal IDs 0, 1, 2, ...; original IDs 256, 257, 258, ..., 511 map to internal IDs a, a+1, a+2, ...; the next block maps to 2a, 2a+1, ...]

109
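One block-interleaved mapping consistent with the picture above can be sketched as follows; this is an assumption about the scheme, not the exact GraphChi-DB formula.

BLOCK = 256   # consecutive original IDs are assigned in blocks of 256

def to_internal(original_id, a, num_intervals):
    # a: interval length (assumed a multiple of BLOCK); blocks are dealt round-robin to intervals
    block, offset = divmod(original_id, BLOCK)
    interval = block % num_intervals          # which interval this block lands in
    slot = block // num_intervals             # how many blocks that interval already holds
    return interval * a + slot * BLOCK + offset

def to_original(internal_id, a, num_intervals):
    interval, within = divmod(internal_id, a)
    slot, offset = divmod(within, BLOCK)
    return (slot * num_intervals + interval) * BLOCK + offset

# With interval length a = 1024 and 4 intervals: original ID 256 maps to the start of the
# second interval (internal ID a), matching the picture, and the mapping is invertible.
assert to_internal(256, 1024, 4) == 1024
assert to_original(to_internal(511, 1024, 4), 1024, 4) == 511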

What if we have a Cluster

[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, ...) in RAM]

Trade latency for throughput.

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. ... which is caused by the lack of good data available to academics: big graphs with metadata
– Industry co-operation? But problem with reproducibility
– Also, it is hard to ask good questions about graphs (especially with just the structure)

3. Too much focus on performance. More important to enable "extracting value."

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems: each hop across a partition boundary is costly.
Disk-based "single-machine" graph systems: "paging" from disk is costly.

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm: reverse thinking

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk!
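A toy, fully in-memory Python rendition of this bucketing idea (the real system streams each interval's sub-graph from disk and keeps only the walks' current positions in RAM; the names and parameters here are illustrative):

import random

def drunkardmob(out_neighbors, num_vertices, num_walks, num_steps, num_intervals=4):
    interval_of = lambda v: v * num_intervals // num_vertices
    walks = [random.randrange(num_vertices) for _ in range(num_walks)]   # state = current vertex only
    for _ in range(num_steps):
        for p in range(num_intervals):
            # snapshot of the walks currently parked in interval p (its sub-graph would be loaded now)
            snapshot = [i for i, v in enumerate(walks) if interval_of(v) == p]
            for i in snapshot:
                neighbors = out_neighbors(walks[i])
                if neighbors:
                    walks[i] = random.choice(neighbors)   # hop; the walk is re-bucketed automatically
    return walks

adj = {0: [1], 1: [2], 2: [3], 3: [0]}
print(drunkardmob(lambda v: adj.get(v, []), 4, num_walks=10, num_steps=5))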

Page 14: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

14

DISK-BASED GRAPH COMPUTATION

15

GraphChi (Parallel Sliding Windows)

Batchcomp

Evolving graph

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Matrix Factorization

Loopy Belief PropagationCo-EM

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

DrunkardMob Parallel Random walk simulation

GraphChi^2

Incremental comp

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Graph sampling

Graph Queries

Graph traversal

16

Computational Model

bull Graph G = (V E) ndash directed edges e =

(source destination)ndash each edge and vertex

associated with a value (user-defined type)

ndash vertex and edge values can be modifiedbull (structure modification also

supported) Data

Data Data

Data

DataDataData

DataData

Data

A Be

Terms e is an out-edge of A and in-edge of B

17

Data

Data Data

Data

DataDataData

Data

Data

Data

Vertex-centric Programming

bull ldquoThink like a vertexrdquobull Popularized by the Pregel and GraphLab

projectsndash Historically systolic computation and the Connection Machine

MyFunc(vertex) modify neighborhood Data

Data Data

Data

Data

function Pagerank(vertex) insum = sum(edgevalue for edge in vertexinedges) vertexvalue = 085 + 015 insum foreach edge in vertexoutedges edgevalue = vertexvalue vertexnum_outedges

18

Computational Setting

ConstraintsA Not enough memory to store the whole

graph in memory nor all the vertex values

B Enough memory to store one vertex and its edges w associated values

19

The Main Challenge of Disk-based Graph Computation

Random Access

~ 100K reads sec (commodity)~ 1M reads sec (high-end arrays)

ltlt 5-10 M random edges sec to achieve ldquoreasonable performancerdquo

20

Random Access Problem

File: edge-values. A's in-edges, A's out-edges, B's in-edges and B's out-edges lie in different parts of the file, so processing one side sequentially forces random reads and random writes on the other.

Moral: You can either access in-edges or out-edges sequentially, but not both.

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

• PSW processes the graph one sub-graph at a time
• In one iteration, the whole graph is processed
  – And typically, the next iteration is started

1 Load

2 Compute

3 Write

23

PSW: Shards and Intervals

• Vertices are numbered from 1 to n
  – P intervals
  – sub-graph = interval of vertices

interval(1)   interval(2)   ...   interval(P)     (example boundaries: 1 ... 100 ... 700 ... 10000)
shard(1)      shard(2)      ...   shard(P)

1. Load
2. Compute
3. Write

Each edge (source, destination) is <<partitioned by destination>> into a shard; in shards, edges are sorted by source.

24

Example Layout

Shard 1: in-edges for vertices 1..100, sorted by source_id.

Shards are small enough to fit in memory; the sizes of the shards are balanced.
A shard stores the in-edges for an interval of vertices, sorted by source-id.

Intervals: vertices 1..100 (Shard 1), 101..700 (Shard 2), 701..1000 (Shard 3), 1001..10000 (Shard 4).

1 Load

2 Compute

3 Write

25

PSW: Loading Sub-graph

Load the subgraph for vertices 1..100: all of their in-edges are loaded into memory from Shard 1 (in-edges for vertices 1..100, sorted by source_id).

What about out-edges? They are arranged in sequence in the other shards (Shards 2, 3, 4: intervals 101..700, 701..1000, 1001..10000).

1 Load

2 Compute

3 Write

26

PSW: Loading Sub-graph (2)

Load the subgraph for vertices 101..700: all of their in-edges are loaded into memory from Shard 2, while the corresponding out-edge blocks of Shards 1, 3 and 4 are kept in memory through the sliding windows.

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and P large writes for each interval
=> P^2 random accesses on one full pass

Works well on both SSD and magnetic hard disks!
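As an illustration of the shard layout and of what one Load/Compute/Write pass touches, here is a purely in-memory toy sketch (Python lists stand in for shard files; none of the names are GraphChi's API). It shows the key property: because each shard is sorted by source, the out-edges of an interval form one contiguous block per shard, which real PSW reads and writes as a single window.

```python
def build_shards(edges, intervals):
    """Shard p stores the in-edges of interval p (edges whose destination falls in it), sorted by source."""
    return [sorted(e for e in edges if lo <= e[1] <= hi) for lo, hi in intervals]

def psw_pass(shards, intervals):
    """One full pass: for interval p, in-edges come from shard p, out-edges from a window of every shard."""
    for p, (lo, hi) in enumerate(intervals):
        in_edges = shards[p]                                            # memory-shard, loaded fully
        out_edges = [e for s in shards for e in s if lo <= e[0] <= hi]  # contiguous block in each shard
        yield p, in_edges, out_edges

edges = [(1, 3), (2, 1), (3, 4), (4, 2), (1, 4)]
intervals = [(1, 2), (3, 4)]
for p, ins, outs in psw_pass(build_shards(edges, intervals), intervals):
    print("interval", p, "in-edges:", ins, "out-edges:", outs)
```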

28

"GAUSS-SEIDEL" ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

• Bulk-Synchronous Parallel (Jacobi iterations)
  – Updates see neighbors' values from the previous iteration [Most systems are synchronous]
• Asynchronous (Gauss-Seidel iterations)
  – Updates see the most recent values
  – GraphLab is asynchronous

30

PSW runs Gauss-Seidel

Load the subgraph for vertices 101..700: all of their in-edges come from Shard 2 (loaded fully into memory), while the out-edge blocks of Shards 1, 3 and 4 are in memory through the sliding windows. Fresh values appear in this "window" from the previous phase, so the updates already see them.

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires as many iterations as the graph diameter to propagate the minimum label.

Each vertex chooses the minimum label of its neighbors.

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel: expected number of iterations on a random schedule, on a chain graph:

= (N - 1) / (e - 1)  ≈  60% of synchronous

Each vertex chooses the minimum label of its neighbors.
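To see the difference concretely, here is a small runnable comparison of the two schedules for min-label propagation on a chain (a toy sketch: the asynchronous sweep uses a fixed left-to-right order, whereas the ~60% figure on the slide is for a random schedule):

```python
def propagate(labels, asynchronous):
    """Repeated min-label sweeps over a chain; returns the number of sweeps until no change."""
    n, sweeps = len(labels), 0
    while True:
        src = labels if asynchronous else list(labels)   # Gauss-Seidel reads fresh values, Jacobi a snapshot
        changed = False
        for i in range(n):
            neighbors = [src[j] for j in (i - 1, i + 1) if 0 <= j < n]
            new = min([src[i]] + neighbors)
            if new != labels[i]:
                labels[i], changed = new, True
        sweeps += 1
        if not changed:
            return sweeps

N = 100
print("synchronous (Jacobi):       ", propagate(list(range(1, N + 1)), asynchronous=False))  # ~ graph diameter
print("asynchronous (Gauss-Seidel):", propagate(list(range(1, N + 1)), asynchronous=True))   # far fewer sweeps
```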

33

Open theoretical question

Label Propagation (side length = 100), iterations to converge:

  Synchronous: 199      PSW Gauss-Seidel (average, random schedule): ~57
  Synchronous: 100      PSW Gauss-Seidel: ~29
  Synchronous: 298      PSW Gauss-Seidel: ~52

Natural graphs (web, social): synchronous needs about (graph diameter - 1) iterations; PSW Gauss-Seidel about 0.6 x diameter.

Joint work Julian Shun

Chapter 6

34

PSW & External Memory Algorithms Research

• PSW is a new technique for implementing many fundamental graph algorithms
  – Especially simple (compared to previous work) for directed graph problems: PSW handles both in- and out-edges
• We propose a new graph contraction algorithm based on PSW
  – Minimum Spanning Forest & Connected Components
• ... utilizing the Gauss-Seidel "acceleration"

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluation:
• HD vs. SSD
• Striping data across multiple hard drives
• Comparison to an in-memory version
• Bottlenecks analysis
• Effect of the number of shards
• Block size and performance

37

GraphChi

• C++ implementation: 8,000 lines of code
  – Java implementation also available
• Several optimizations to PSW (see paper)

Source code and examples: http://github.com/graphchi

38

Experiment Setting

• Mac Mini (Apple Inc.)
  – 8 GB RAM
  – 256 GB SSD, 1 TB hard drive
  – Intel Core i5, 2.5 GHz
• Experiment graphs:

Graph          Vertices   Edges   P (shards)   Preprocessing
live-journal   4.8M       69M     3            0.5 min
netflix        0.5M       99M     20           1 min
twitter-2010   42M        1.5B    20           2 min
uk-2007-05     106M       3.7B    40           31 min
uk-union       133M       5.4B    50           33 min
yahoo-web      1.4B       6.6B    50           37 min

39

Comparison to Existing Systems

Notes: comparison results do not include the time to transfer the data to the cluster, preprocessing, or the time to load the graph from disk. GraphChi computes asynchronously, while all but GraphLab compute synchronously.

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

[Bar charts, runtime in minutes:
  Netflix, Matrix Factorization (ALS): GraphLab v1 (8 cores) vs. GraphChi (Mac Mini)
  Twitter-2010 (1.5B edges), PageRank: Spark (50 machines) vs. GraphChi (Mac Mini)
  Yahoo-web (6.7B edges), Belief Propagation: Pegasus / Hadoop (100 machines) vs. GraphChi (Mac Mini)
  Twitter-2010 (1.5B edges), Triangle Counting: Hadoop (1,636 machines) vs. GraphChi (Mac Mini)]

On a Mac Mini, GraphChi can solve as big problems as existing large-scale systems, with comparable performance.

40

PowerGraph Comparison

• PowerGraph / GraphLab 2 outperforms previous systems by a wide margin on natural graphs
• With 64x more machines, 512x more CPUs:
  – Pagerank: 40x faster than GraphChi
  – Triangle counting: 30x faster than GraphChi

OSDI '12

GraphChi has good performance per CPU.

41

In-memory Comparison

• Comparison to state-of-the-art: 5 iterations of PageRank on Twitter (1.5B edges)
• Total runtime comparison to 1-shard GraphChi, with initial load + output write taken into account

  GraphChi (Mac Mini, SSD):                            790 secs
  Ligra (J. Shun, Blelloch), 40-core Intel E7-8870:    15 secs
  Ligra (J. Shun, Blelloch), 8-core Xeon 5550:         230 s + preproc 144 s
  PSW in-mem version, 700 shards (see Appendix),
    8-core Xeon 5550:                                  100 s + preproc 210 s

However, sometimes a better algorithm is available for in-memory computation than for external-memory / distributed computation.

42

Scalability Input Size [SSD]

• Throughput: number of edges processed / second

Conclusion: the throughput remains roughly constant when the graph size is increased.
GraphChi with a hard drive is ~2x slower than with an SSD (if the computational cost is low).

[Plot: PageRank throughput (edges/sec) vs. graph size (0 to ~7x10^9 edges) on Mac Mini SSD, for domain, twitter-2010, uk-2007-05, uk-union and yahoo-web.]

Paper: scalability of other applications.

43

GRAPHCHI-DB (new work)

44

[Overview map, repeated:]

GraphChi (Parallel Sliding Windows): batch computation, incremental computation, evolving graphs.
GraphChi-DB (Partitioned Adjacency Lists): online graph updates.
DrunkardMob: parallel random walk simulation. GraphChi^2.

Graph Computation: Triangle Counting, Item-Item Similarity, PageRank, SALSA, HITS, Graph Contraction, Minimum Spanning Forest, k-Core, Weakly/Strongly Connected Components, Community Detection, Label Propagation, Multi-BFS, Matrix Factorization, Loopy Belief Propagation, Co-EM.

Graph Queries: Shortest Path, Friends-of-Friends, Induced Subgraphs, Neighborhood query, Edge and vertex properties, Link prediction, Graph sampling, Graph traversal.

45

Research Questions

• What if there is a lot of metadata associated with edges and vertices?
• How to do graph queries efficiently, while retaining the computational capabilities?
• How to add edges to the graph efficiently?

Can we design a graph database based on GraphChi?

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

   Problems:
   • Poor performance with data >> memory
   • No/weak support for analytical computation

2) Relational / key-value databases used as graph storage

   Problems:
   • Large indices
   • In-edge / out-edge dilemma
   • No/weak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1 (sorted by source_id), Shard 2, Shard 3, ..., Shard P
Interval boundaries: 0 ... 100, 101 ... 700, ..., N

Edge = (src, dst). Partition by dst.

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Pointer-array (src -> file offset): 1 -> 0, 3 -> 3, 7 -> 5, ...
Edge-array (destinations): 8, 193, 76420, 12, 872, 193, 212, 89139, ...

Compressed Sparse Row (CSR)

Problem 1: How to find the in-edges of a vertex quickly?
Note: we know the shard, but inside it the edges of a given destination are in random order.

51

PAL In-edge Linkage

Pointer-array (src -> file offset): 1 -> 0, 3 -> 3, 7 -> 5, ...
Edge-array (destinations): 8, 193, 76420, 12, 872, 193, 212, 89139, ...

52

PAL In-edge Linkage

Augmented linked list for in-edges:

Pointer-array (src -> file offset): 1 -> 0, 3 -> 3, 7 -> 5, ...
Edge-array as (Destination, Link) pairs: (8, 3339), (193, 3), (76420, 1092), (12, 289), (872, 40), (193, 2002), (212, 12), (89139, 22), ...
+ Index to the first in-edge for each vertex in the interval.

Problem 2: How to find out-edges quickly?
Note: they are sorted inside a shard, but partitioned across all shards.
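A minimal in-memory sketch of the two lookups on this layout, using the example numbers above (illustrative only; the helper names are not the GraphChi-DB API, and real shards live in files):

```python
# edge_array[i] = (destination, link), where link is the index of the next in-edge of the
# same destination inside this shard; first_in[dst] points to its first in-edge (-1 = none).

pointer_array = {1: 0, 3: 3, 7: 5}                        # source -> offset of its first out-edge
edge_array = [(8, -1), (193, 5), (76420, -1),             # out-edges of source 1
              (12, -1), (872, -1),                        # out-edges of source 3
              (193, -1), (212, -1), (89139, -1)]          # out-edges of source 7
first_in = {193: 1}                                        # per-interval index of first in-edges

def out_edges(src):
    """Out-edge query inside one shard: a contiguous range starting at pointer_array[src]."""
    sources = sorted(pointer_array)
    start = pointer_array[src]
    nxt = sources.index(src) + 1
    end = pointer_array[sources[nxt]] if nxt < len(sources) else len(edge_array)
    return [dst for dst, _ in edge_array[start:end]]

def in_edges(dst):
    """In-edge query: follow the augmented linked list from the first in-edge of dst."""
    result, i = [], first_in.get(dst, -1)
    while i != -1:
        _, link = edge_array[i]
        result.append(i)                                  # offset of one in-edge of dst in this shard
        i = link
    return result

print(out_edges(7))       # [193, 212, 89139]
print(in_edges(193))      # offsets [1, 5]
```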

53

PAL: Out-edge Queries

Pointer-array (Source -> file offset: 1 -> 0, 3 -> 3, 7 -> 5, ...) and Edge-array with (Destination, Next-in-offset) pairs, as above.

Problem: the pointer-array can be big -- O(V) -- and binary search on disk is slow.

Option 1: Sparse index (in-memory), e.g. one entry per block of edges (offsets 256, 512, 1024, ...).
Option 2: Delta-coding with unary code (Elias-Gamma); completely in-memory (see the sketch below).
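Option 2 relies on gamma-coding the gaps between consecutive entries so that the whole pointer-array fits in RAM. A small sketch of the standard Elias-Gamma code (textbook version; GraphChi-DB's exact bit layout may differ):

```python
def gamma_encode(n):                      # n >= 1: unary length prefix + binary value
    b = bin(n)[2:]                        # e.g. 9 -> '1001'
    return '0' * (len(b) - 1) + b         # '000' + '1001'

def gamma_decode(bits, pos=0):
    zeros = 0
    while bits[pos + zeros] == '0':
        zeros += 1
    value = int(bits[pos + zeros:pos + 2 * zeros + 1], 2)
    return value, pos + 2 * zeros + 1     # decoded value, next read position

sources = [1, 3, 7, 12, 30]
gaps = [sources[0]] + [b - a for a, b in zip(sources, sources[1:])]
encoded = ''.join(gamma_encode(g) for g in gaps)
print(encoded)                            # a few bits per entry

decoded, pos, acc = [], 0, 0
while pos < len(encoded):
    g, pos = gamma_decode(encoded, pos)
    acc += g
    decoded.append(acc)
print(decoded == sources)                 # True
```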

54

Experiment Indices

Median latency, Twitter graph, 1.5B edges.

55

Queries: I/O costs

In-edge query: only one shard.
Out-edge query: each shard that has edges.

Trade-off: more shards => better locality for in-edge queries, worse for out-edge queries.

Edge Data & Searches

Shard X: adjacency; 'weight' [float]; 'timestamp' [long]; 'belief' (factor).

Edge (i): no foreign key is required to find the edge data (fixed size).
Note: vertex values are stored similarly.

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

A merge requires loading the existing shard from disk, and every edge is rewritten on each merge.

Does not scale: number of rewrites = data size / buffer size = O(E).

60

Log-Structured Merge-tree (LSM)
Ref: O'Neil, Cheng et al. (1996)

LEVEL 1 (youngest): top shards with in-memory buffers, one per interval (interval 1, 2, ..., P-1, P); new edges arrive here.
LEVEL 2: on-disk shards covering groups of intervals (intervals 1--4, ..., (P-4)--P), filled by downstream merges.
LEVEL 3 (oldest): larger on-disk shards (e.g. intervals 1--16).

In-edge query: one shard on each level.
Out-edge query: all shards.
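A toy sketch of the downstream-merge idea (ignoring the per-interval partitioning and edge sorting details; lists stand in for shard files, and the capacities are made up for illustration):

```python
LEVEL_CAPACITY = [4, 16, 64]                  # made-up per-level capacities for the toy

def insert(levels, edge):
    levels[0].append(edge)                    # new edges always enter the youngest level
    for i in range(len(levels) - 1):
        if len(levels[i]) > LEVEL_CAPACITY[i]:
            levels[i + 1] = sorted(levels[i + 1] + levels[i])   # downstream merge
            levels[i] = []

levels = [[], [], []]
for e in [(s, d) for s in range(6) for d in range(3)]:
    insert(levels, e)
print([len(l) for l in levels])               # e.g. [3, 15, 0]: most edges have moved past the buffer
```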

61

Experiment Ingest

[Plot: edges ingested (0 to 1.5x10^9) vs. time (0 to 16,000 seconds), GraphChi-DB with LSM vs. GraphChi-No-LSM.]

62

Advantages of PAL

• Only sparse and implicit indices
  – The pointer-array usually fits in RAM with Elias-Gamma coding => small database size
• Columnar data model
  – Load only the data you need
  – Graph structure is separate from the data
  – Property graph model
• Great insertion throughput with LSM
  – The tree can be adjusted to match the workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala
• Queries & Computation
• Online database

All experiments shown in this talk were done on a Mac Mini (8 GB RAM, SSD).

Source code and examples: http://github.com/graphchi

65

Comparison Database Size

Baseline: 4 + 4 bytes / edge.

[Bar chart: database file size for the twitter-2010 graph (1.5B edges), scale 0-70: MySQL (data + indices), Neo4j, GraphChi-DB, BASELINE.]

66

Comparison Ingest

System                  Time to ingest 1.5B edges
GraphChi-DB (ONLINE)    1 hour 45 mins
Neo4j (batch)           45 hours
MySQL (batch)           3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes.

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

[Bar charts: friends-of-friends query latency in milliseconds, percentiles over 100K random queries.
  Small graph (68M edges): 50th and 99th percentile, GraphChi-DB vs. Neo4j.
  Big graph (1.5B edges): 50th and 99th percentile, GraphChi-DB vs. Neo4j vs. MySQL.]

68

LinkBench Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
  – But only single-hop queries ("friends")
  – 8 different operations, mixed workload
  – Best performance with 64 parallel threads
• Each edge and vertex has:
  – Version, timestamp, type, random string payload

                           GraphChi-DB (Mac Mini)    MySQL + FB patch (server: SSD-array, 144 GB RAM)
Edge-update (95p)          22 ms                     25 ms
Edge-get-neighbors (95p)   18 ms                     9 ms
Avg throughput             2,487 req/s               11,029 req/s
Database size              350 GB                    1.4 TB

See full results in the thesis

69

Summary of Experiments

• Efficient for mixed read/write workloads
  – See the Facebook LinkBench experiments in the thesis
  – The LSM-tree trades off read performance (but can be adjusted)
• State-of-the-art performance for graphs that are much larger than RAM
  – Neo4J's linked-list data structure is good for RAM
  – DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact: "Big Data" Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)
  – Two major direct descendant papers in top conferences:
    • X-Stream: SOSP 2013
    • TurboGraph: KDD 2013
• Challenging the mainstream
  – You can do a lot on just a PC: focus on the right data structures and computational models

73

Impact Users

• GraphChi's implementations (C++, Java, -DB) have gained a lot of users
  – Currently ~50 unique visitors / day
• Enables 'everyone' to tackle big graph problems
  – Especially the recommender toolkit (by Danny Bickson) has been very popular
  – Typical users: students, non-systems researchers, small companies...

74

Impact Users (cont)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research, it is too expensive.

Using GraphChi, I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research."

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models

1. Changing the value of an edge (both in- and out-edges)
2. Computation processes the whole graph, or most of it, on each iteration
3. Random access to all of a vertex's edges
   – Vertex-centric vs. edge-centric
4. Async / Gauss-Seidel execution

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
  – Or dynamic scheduling (such as Splash BP)
• High-diameter graphs, such as planar graphs
  – Unless the computation itself has short-range interactions
• Very large number of iterations
  – Neural networks
  – LDA with Collapsed Gibbs sampling
• No support for implicit graph structure

+ Single-PC performance is limited.

78

[Overview map, repeated:]

GraphChi (Parallel Sliding Windows): batch computation, incremental computation, evolving graphs.
GraphChi-DB (Partitioned Adjacency Lists): online graph updates.
DrunkardMob: parallel random walk simulation. GraphChi^2.

Graph Computation: Triangle Counting, Item-Item Similarity, PageRank, SALSA, HITS, Graph Contraction, Minimum Spanning Forest, k-Core, Weakly/Strongly Connected Components, Community Detection, Label Propagation, Multi-BFS, Matrix Factorization, Loopy Belief Propagation, Co-EM.

Graph Queries: Shortest Path, Friends-of-Friends, Induced Subgraphs, Neighborhood query, Edge and vertex properties, Link prediction, Graph sampling, Graph traversal.

Versatility of PSW and PAL

79

Future Research Directions

• Distributed setting
  1. Distributed PSW (one shard / node)?
     1. PSW is inherently sequential
     2. Low bandwidth in the Cloud
  2. Co-operating GraphChi(-DB)'s connected with a Parameter Server
• New graph programming models and tools
  – Vertex-centric programming is sometimes too local: for example, two-hop interactions and many traversals are cumbersome
  – Abstractions for learning graph structure; implicit graphs
  – Hard to debug, especially async; better tools are needed
• Graph-aware optimizations to GraphChi-DB
  – Buffer management
  – Smart caching
  – Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
  – Good: works with very little memory
  – Bad: does not benefit from more memory
• Simple trick: cache some data
• Many graph algorithms have O(V) state
  – The update function accesses neighbor vertex state
    • Standard PSW: 'broadcast' the vertex value via edges
    • Semi-external: store the vertex values in memory

82

Using RAM efficiently

• Assume that there is enough RAM to store many O(V) algorithm states in memory
  – But not enough to store the whole graph

Disk: graph + working memory (PSW). RAM: computational state (Computation 1 state, Computation 2 state, ...). A minimal sketch follows.
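A miniature of the semi-external trick, under the assumption that the O(V) state fits in RAM while edges are only streamed through (the edge generator below is a stand-in for reading shards from disk):

```python
import random

def gen_edges(n, m, seed=1):
    """Stand-in for streaming edges from shard files on disk."""
    rnd = random.Random(seed)
    for _ in range(m):
        yield rnd.randrange(n), rnd.randrange(n)

n = 100_000
in_degree = [0] * n                       # O(V) state kept fully in RAM
for src, dst in gen_edges(n, 500_000):    # edges are streamed, never stored
    in_degree[dst] += 1                   # the update only touches the in-memory array
print(max(in_degree))
```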

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
  – Store billions of random-walk states in RAM
• Multiple Breadth-First Searches
  – Analyze neighborhood sizes by starting hundreds of random BFSes
• Compute in parallel many different recommender algorithms (or the same one with different parameterizations)
  – See Mayank Mohta and Shu-Hao Yu's Master's project

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation => GraphChi
• Extended PSW to design Partitioned Adjacency Lists, to build a scalable graph database => GraphChi-DB
• Proposed DrunkardMob for simulating billions of random walks in parallel
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms => a new approach for external-memory graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                     GraphChi (40 Mac Minis)    PowerGraph (64 EC2 cc1.4xlarge)
Investment           67,320 $                   -
Operating costs:
  Per node, hour     0.03 $                     1.30 $
  Cluster, hour      1.19 $                     52.00 $
  Daily              28.56 $                    1,248.00 $

It takes about 56 days to recoup the Mac Mini investment.

Assumptions: Mac Mini 85 W (typical servers: 500-1000 W); most expensive US energy: 35c / kWh.
Equal-throughput configurations (based on OSDI '12).
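As a quick sanity check of the 56-day figure from the table above: the daily saving is 1,248.00 $ - 28.56 $ = 1,219.44 $, so the payback time is roughly 67,320 $ / 1,219.44 $ per day ≈ 55 days, i.e. about eight weeks of continuous operation.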

89

PSW for In-memory Computation

• External-memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM
• In-memory
  – Slow = RAM
  – Fast = CPU caches

[Diagram: CPU <-> small (fast) memory of size M <-> large (slow) memory of 'unlimited' size, with block transfers of size B.]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But it needs twice the amount of space
• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP, see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)
• Asynchronous execution is sometimes difficult to reason about and debug
• Asynchronous execution can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in the Aggarwal-Vitter I/O model
  – Worst case is only 2x the best case
• Intuition:

[Diagram: intervals interval(1) ... interval(P) over vertices 1 ... |V|, with shards shard(1) ... shard(P) and two example edges v1, v2.]

An edge inside one interval is loaded from disk only once per iteration.
An edge spanning two intervals is loaded twice per iteration.
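Spelling that intuition out (a rough paraphrase, not the paper's exact theorem): per full pass every edge is read once or twice and written once or twice, so the number of block transfers lies between roughly

  2|E| / B   and   4|E| / B,   plus Theta(P^2) non-sequential reads for the sliding windows,

which is why the worst case is only about 2x the best case.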

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise they would require too many iterations
• The per-iteration cost is not much affected
• Can we optimize partitioning?
  – Could help, thanks to Gauss-Seidel (faster convergence inside "groups"), topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)
• Compression is problematic for evolving graphs and associated data
• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases
  – GOOD, GraphDB, HyperGraphDB...
  – Focus was on modeling; graph storage was built on top of a relational DB or key-value store
• RDF databases
  – Most do not use graph storage, but store triples as relations + use indexing
• Modern solutions have proposed graph-specific storage
  – Neo4j: doubly linked list
  – TurboGraph: adjacency list chopped into pages
  – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time: 9 hours; FB's: 12 hours
• GraphChi database about 250 GB; FB > 1.4 terabytes
  – However, about 100 GB is explained by a different variable-data (payload) size
• Facebook: MySQL via JDBC; GraphChi: embedded
  – But MySQL is native code, GraphChi-DB is Scala (JVM)
• Important: a CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench, GraphChi-DB performance / size of DB

[Plot: requests/sec (0 to 10,000) vs. number of nodes (10^7 to 10^9).]

Why not even faster when everything is in RAM? The requests are computationally bound.

Possible Solutions

1. Use SSD as a memory-extension [SSDAlloc, NSDI '11]
   – Too many small objects; need millions / sec
2. Compress the graph structure to fit into RAM [WebGraph framework]
   – Associated values do not compress well, and are mutated
3. Cluster the graph and handle each cluster separately in RAM
   – Expensive: the number of inter-cluster edges is big
4. Caching of hot nodes
   – Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance.

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks => parallel I/O

[Bar chart: seconds / iteration (0 to 3,000) for Pagerank and Connected components with 1, 2 and 3 hard drives.]

Experiment on a 16-core AMD server (from year 2007).

Bottlenecks

[Bar chart: Connected Components on Mac Mini (SSD); runtime (0 to 2,500 s) split into Disk IO / Graph construction / Exec. updates, for 1, 2 and 4 threads.]

• The cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM => memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on a MacBook Pro with 4 cores / SSD.

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

[Bar charts: runtime in seconds, split into Loading and Computation, vs. number of threads (1, 2, 4), for Matrix Factorization (ALS) and Connected Components.]

104

In-memory vs Disk

105

Experiment Query latency

See the thesis for an I/O cost analysis of in/out queries.

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S
• Very fast in GraphChi-DB
  – Sufficient to query for out-edges (see the sketch below)
  – Parallelizes well: multi-out-edge-query
• Can be used for statistical graph analysis
  – Sample induced neighborhoods, or induced friends-of-friends neighborhoods, from the graph
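A sketch of how an induced-subgraph query reduces to out-edge queries (illustrative; out_edges below is a stand-in for GraphChi-DB's multi-out-edge query):

```python
def induced_subgraph(S, out_edges):
    """All edges (u, v) with both endpoints in S, obtained purely from out-edge queries."""
    S = set(S)
    return [(u, v) for u in S for v in out_edges(u) if v in S]

adj = {1: [2, 3], 2: [3], 3: [4], 4: [1]}                      # tiny in-memory stand-in for the DB
print(induced_subgraph({1, 2, 3}, lambda u: adj.get(u, [])))   # edges among vertices {1, 2, 3}
```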

Vertices / Nodes

• Vertices are partitioned similarly to edges
  – Similar "data shards" for columns
• Lookup/update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
  – A sparse structure could be supported

[Diagram: vertex files partitioned at 1, a, a+1, ..., 2a, ..., Max-id.]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance the shards
  – Interval length: constant a

[Table: internal IDs 0, 1, 2, ...; a, a+1, a+2, ...; 2a, 2a+1, ... against original IDs 0, 1, 2, ..., 255 and 256, 257, 258, ..., 511.]

109

What if we have a Cluster

[Diagram: graph + working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, ...) in RAM.]

Trade latency for throughput!

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. ... which is caused by the lack of good data available to academics: big graphs with metadata
   – Industry co-operation? But there is a problem with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just the structure)
3. Too much focus on performance. More important to enable "extracting value".

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i in 1..numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java, 2009.

Distributed graph systems: each hop across a partition boundary is costly.
Disk-based "single-machine" graph systems (this talk): "paging" from disk is costly.

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
  – Reverse thinking:

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: only the current position of each walk needs to be stored.
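A toy, fully in-memory rendering of that loop (in real DrunkardMob the graph does not fit in RAM and the walks are kept in compact per-interval buckets; here plain dictionaries stand in for both):

```python
import random

adj = {v: [(v + 1) % 8, (v + 3) % 8] for v in range(8)}     # tiny graph with 8 vertices
intervals = [range(0, 4), range(4, 8)]                       # two vertex intervals
walk_pos = {w: random.randrange(8) for w in range(10)}       # walk id -> current vertex

for sweep in range(5):                                       # PSW-style passes over the graph
    snapshot = dict(walk_pos)                                # walks observed at the start of the pass
    for interval in intervals:                               # "load" one interval at a time
        for w, v in snapshot.items():
            if v in interval:                                # every walk sitting in this interval
                walk_pos[w] = random.choice(adj[v])          # ... takes one hop
print(walk_pos)
```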

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 15: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

15

GraphChi (Parallel Sliding Windows)

Batchcomp

Evolving graph

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Matrix Factorization

Loopy Belief PropagationCo-EM

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

DrunkardMob Parallel Random walk simulation

GraphChi^2

Incremental comp

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Graph sampling

Graph Queries

Graph traversal

16

Computational Model

bull Graph G = (V E) ndash directed edges e =

(source destination)ndash each edge and vertex

associated with a value (user-defined type)

ndash vertex and edge values can be modifiedbull (structure modification also

supported) Data

Data Data

Data

DataDataData

DataData

Data

A Be

Terms e is an out-edge of A and in-edge of B

17

Data

Data Data

Data

DataDataData

Data

Data

Data

Vertex-centric Programming

bull ldquoThink like a vertexrdquobull Popularized by the Pregel and GraphLab

projectsndash Historically systolic computation and the Connection Machine

MyFunc(vertex) modify neighborhood Data

Data Data

Data

Data

function Pagerank(vertex) insum = sum(edgevalue for edge in vertexinedges) vertexvalue = 085 + 015 insum foreach edge in vertexoutedges edgevalue = vertexvalue vertexnum_outedges

18

Computational Setting

ConstraintsA Not enough memory to store the whole

graph in memory nor all the vertex values

B Enough memory to store one vertex and its edges w associated values

19

The Main Challenge of Disk-based Graph Computation

Random Access

~ 100K reads sec (commodity)~ 1M reads sec (high-end arrays)

ltlt 5-10 M random edges sec to achieve ldquoreasonable performancerdquo

20

Random Access Problem

File edge-values

A in-edges A out-edges

A B

B in-edges B out-edges

x

Moral You can either access in- or out-edges sequentially but not both

Random write

Random readProcessing sequentially

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

bull PSW processes the graph one sub-graph a time

bull In one iteration the whole graph is processedndash And typically next iteration is started

1 Load

2 Compute

3 Write

23

bull Vertices are numbered from 1 to nndash P intervalsndash sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 10000100 700

1 Load

2 Compute

3 Write

ltltpartition-bygtgt

source destinationedge

In shards edges sorted by source

24

Example Layout

Shard 1

Shards small enough to fit in memory balance size of shards

Shard in-edges for interval of vertices sorted by source-id

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Shard 2 Shard 3 Shard 4Shard 1

1 Load

2 Compute

3 Write

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks / Multicore

Experiment on MacBook Pro with 4 cores / SSD

• Computationally intensive applications benefit substantially from parallel execution

• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime in seconds vs. number of threads (1, 2, 4), split into loading and computation, for Matrix Factorization (ALS, 0–160 s) and Connected Components (0–1000 s)]

104

In-memory vs Disk

105

Experiment: Query Latency

See thesis for I/O cost analysis of in/out queries

106

Example: Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB
– Sufficient to query for out-edges (see the sketch below)
– Parallelizes well: multi-out-edge-query

• Can be used for statistical graph analysis
– Sample induced neighborhoods, induced FoF neighborhoods from the graph
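
A minimal sketch of why out-edge queries suffice, written in illustrative Python rather than the actual GraphChi-DB API; out_edges below is an assumed helper that returns the destination IDs of one vertex.

def induced_subgraph(S, out_edges):
    """Return all edges (u, v) with both endpoints in the vertex set S.

    S         : iterable of vertex IDs
    out_edges : function vertex_id -> iterable of destination IDs
                (hypothetical stand-in for a multi-out-edge query)
    """
    S = set(S)
    edges = []
    for u in S:                      # one out-edge query per vertex; parallelizes well
        for v in out_edges(u):
            if v in S:               # keep only edges that stay inside S
                edges.append((u, v))
    return edges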

Vertices / Nodes

• Vertices are partitioned similarly to edges
– Similar "data shards" for columns
• Lookup/update of vertex data is O(1) (see the sketch below)
• No merge tree here: vertex files are "dense"
– Sparse structure could be supported

[Figure: vertex files split into intervals 1…a, a+1…2a, …, up to Max-id]
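
Because the vertex files are dense, a lookup is just an offset computation. The following is a hedged sketch with a made-up record size, not the actual GraphChi-DB file format.

import struct

RECORD_SIZE = 4          # assumption: one 4-byte float value per vertex

def read_vertex_value(f, interval_start, internal_id):
    """O(1) lookup in a dense vertex file: seek straight to the record."""
    f.seek((internal_id - interval_start) * RECORD_SIZE)
    return struct.unpack("<f", f.read(RECORD_SIZE))[0]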

ID-mapping

• Vertex IDs are mapped to internal IDs to balance the shards
– Interval length constant a (an illustrative mapping sketch follows the figure)

[Figure: mapping between original IDs (0, 1, 2, …, 255; 256, 257, 258, …, 511; …) and internal IDs (0, 1, 2, …; a, a+1, a+2, …; 2a, 2a+1, …)]
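
One way to realize such a balancing map is to interleave consecutive original IDs across the intervals. This is only a guess at the scheme suggested by the figure, not the exact GraphChi-DB mapping; P and a stand for the number of intervals and the interval length.

def to_internal(orig_id, a, P):
    """Consecutive original IDs land in different intervals of length a,
    which keeps the shards balanced."""
    return (orig_id % P) * a + orig_id // P

def to_original(internal_id, a, P):
    """Inverse mapping, needed when reporting results back to the user."""
    interval, slot = divmod(internal_id, a)
    return slot * P + interval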

109

What if we have a Cluster

[Figure: graph working memory (PSW) on disk; several computational states (Computation 1 state, Computation 2 state, …) held in RAM]

Trade latency for throughput

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. … which is caused by the lack of good data available to academics: big graphs with metadata
– Industry co-operation? But there is a problem with reproducibility
– Also, it is hard to ask good questions about graphs (especially with just the structure)

3. Too much focus on performance? More important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course); a runnable toy version follows the pseudocode:

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())
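
A runnable toy version of this per-walk loop, for illustration only: the graph is a plain in-memory adjacency dict, not a GraphChi structure.

import random

def simulate_walks(adj, sources, numsteps):
    """adj: dict vertex -> list of neighbors (whole graph held in memory).
    Starts one walk per source and returns the final vertex of each walk."""
    positions = list(sources)
    for w in range(len(positions)):          # the slide's 'parfor' over walks
        for _ in range(numsteps):
            neighbors = adj.get(positions[w], [])
            if not neighbors:                # dead end: stop this walk
                break
            positions[w] = random.choice(neighbors)
    return positions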

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems: "paging" from disk is costly.

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm – reverse thinking (a runnable toy sketch follows below):

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk
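
The same idea as a runnable toy sketch. The names and the bucketing scheme are assumptions for illustration; only the current position of each walk is kept, and the graph is streamed one interval at a time rather than held in memory.

import random
from collections import defaultdict

def drunkardmob_step(walks_by_interval, load_interval_adj, interval_of, num_intervals):
    """Advance every walk by one hop, processing one vertex interval at a time.

    walks_by_interval : dict interval -> list of (walk_id, current_vertex)
    load_interval_adj : function interval -> dict vertex -> list of neighbors
                        (hypothetical stand-in for loading one PSW interval from disk)
    interval_of       : function vertex -> interval index
    """
    next_buckets = defaultdict(list)
    for p in range(num_intervals):
        adj = load_interval_adj(p)                 # only this interval is in memory
        for walk_id, v in walks_by_interval.get(p, []):
            neighbors = adj.get(v, [])
            nxt = random.choice(neighbors) if neighbors else v
            next_buckets[interval_of(nxt)].append((walk_id, nxt))
    return next_buckets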

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW & External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data & Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref: O'Neil, Cheng et al. (1996)
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact "Big Data" Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 17: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

17

Data

Data Data

Data

DataDataData

Data

Data

Data

Vertex-centric Programming

bull ldquoThink like a vertexrdquobull Popularized by the Pregel and GraphLab

projectsndash Historically systolic computation and the Connection Machine

MyFunc(vertex) modify neighborhood Data

Data Data

Data

Data

function Pagerank(vertex) insum = sum(edgevalue for edge in vertexinedges) vertexvalue = 085 + 015 insum foreach edge in vertexoutedges edgevalue = vertexvalue vertexnum_outedges

18

Computational Setting

ConstraintsA Not enough memory to store the whole

graph in memory nor all the vertex values

B Enough memory to store one vertex and its edges w associated values

19

The Main Challenge of Disk-based Graph Computation

Random Access

~ 100K reads sec (commodity)~ 1M reads sec (high-end arrays)

ltlt 5-10 M random edges sec to achieve ldquoreasonable performancerdquo

20

Random Access Problem

File edge-values

A in-edges A out-edges

A B

B in-edges B out-edges

x

Moral You can either access in- or out-edges sequentially but not both

Random write

Random readProcessing sequentially

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

bull PSW processes the graph one sub-graph a time

bull In one iteration the whole graph is processedndash And typically next iteration is started

1 Load

2 Compute

3 Write

23

bull Vertices are numbered from 1 to nndash P intervalsndash sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 10000100 700

1 Load

2 Compute

3 Write

ltltpartition-bygtgt

source destinationedge

In shards edges sorted by source

24

Example Layout

Shard 1

Shards small enough to fit in memory balance size of shards

Shard in-edges for interval of vertices sorted by source-id

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Shard 2 Shard 3 Shard 4Shard 1

1 Load

2 Compute

3 Write

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
– Store billions of random walk states in RAM
• Multiple Breadth-First Searches (see the sketch below)
– Analyze neighborhood sizes by starting hundreds of random BFSes
• Compute in parallel many different recommender algorithms (or with different parameterizations)
– See Mayank Mohta & Shu-Hao Yu's Master's project

84
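As an illustration of keeping many O(V) states in RAM, a minimal multi-BFS sketch in Java (illustrative code, not the actual GraphChi program): each vertex carries one bit per BFS source, so up to 64 searches advance together in each pass over the edges; in GraphChi the inner edge loop would be one PSW pass over the shards.

class MultiBFS {
    // outEdges[v] = out-neighbors of v; sources = start vertices (at most 64 here)
    static long[] run(int[][] outEdges, int[] sources, int maxRounds) {
        int n = outEdges.length;
        long[] reached = new long[n];                     // bit i set = vertex reached by BFS i
        for (int i = 0; i < sources.length; i++) reached[sources[i]] |= 1L << i;
        for (int round = 0; round < maxRounds; round++) {
            boolean changed = false;
            for (int v = 0; v < n; v++) {                 // in GraphChi: one pass over all shards
                for (int w : outEdges[v]) {
                    long add = reached[v] & ~reached[w];  // sources that reach v but not yet w
                    if (add != 0) { reached[w] |= add; changed = true; }
                }
            }
            if (!changed) break;
        }
        return reached;   // neighborhood size of BFS i = number of vertices with bit i set
    }
}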

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin). UAI 2010
Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors). VLDB 2012
GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch). OSDI 2012
DrunkardMob: Billions of Random Walks on Just a PC. ACM RecSys 2013
Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch). SEA 2014
GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin). (submitted, arXiv)
Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin). ICML 2011

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation → GraphChi
• Extended PSW to design Partitioned Adjacency Lists, used to build a scalable graph database → GraphChi-DB
• Proposed DrunkardMob for simulating billions of random walks in parallel
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms → new approach for external-memory graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                      GraphChi (40 Mac Minis)    PowerGraph (64 EC2 cc1.4xlarge)
Investments           67,320 $                   -
Operating costs:
  Per node / hour     0.03 $                     1.30 $
  Cluster / hour      1.19 $                     52.00 $
  Daily               28.56 $                    1,248.00 $

It takes about 56 days to recoup the Mac Mini investments (savings ≈ 1,248.00 $ - 28.56 $ ≈ 1,219 $ / day; 67,320 $ / 1,219 $ ≈ 55 days).

Assumptions:
- Mac Mini: 85 W (typical servers 500-1000 W)
- Most expensive US energy: 35c / kWh
- Equal-throughput configurations (based on OSDI'12)

89

PSW for In-memory Computation

• External memory setting
– Slow memory = hard disk / SSD
– Fast memory = RAM

• In-memory
– Slow = RAM
– Fast = CPU caches

[Diagram: small (fast) memory of size M; large (slow) memory of size 'unlimited'; block transfers of size B between them and the CPU]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra*, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

* Julian Shun, Guy Blelloch, PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
– But needs twice the amount of space
• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
– Loopy BP, see Gonzalez et al. (2009)
– Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)
• Asynchronous is sometimes difficult to reason about and debug
• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in the Aggarwal-Vitter I/O model
– Worst-case only 2x best-case

• Intuition:

[Diagram: vertex range 1 … |V| split into interval(1) … interval(P) with shard(1) … shard(P); two example edges, one inside a single interval and one spanning two intervals]

– An edge within one interval is loaded from disk only once per iteration
– An edge spanning two intervals is loaded twice per iteration

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter – otherwise they would require too many iterations
• Per-iteration cost is not much affected
• Can we optimize partitioning?
– Could help thanks to Gauss-Seidel (faster convergence inside "groups"), topological sort
– Likely too expensive to do on a single PC

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
– But they require graph partitioning, which requires a lot of memory
– Compression of large graphs can take days (personal communication)
• Compression is problematic for evolving graphs and associated data
• GraphChi can be used to compress graphs!
– Layered Label Propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• 1990s and 2000s saw interest in object-oriented and graph databases
– GOOD, GraphDB, HyperGraphDB…
– Focus was on modeling; graph storage on top of a relational DB or key-value store
• RDF databases
– Most do not use graph storage but store triples as relations + use indexing
• Modern solutions have proposed graph-specific storage
– Neo4j: doubly linked list
– TurboGraph: adjacency list chopped into pages
– DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time: 9 hours; FB's: 12 hours
• GraphChi database about 250 GB; FB > 1.4 terabytes
– However, about 100 GB is explained by different variable data (payload) size
• Facebook: MySQL via JDBC; GraphChi: embedded
– But MySQL is native code, GraphChi-DB Scala (JVM)
• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance / Size of DB

[Chart: requests / sec (0 to 10,000) as a function of the number of nodes (1.0E+07, 1.0E+08, 2.0E+08, 1.0E+09)]

Why not even faster when everything is in RAM? → computationally bound requests

Possible Solutions

1. Use SSD as a memory extension? [SSDAlloc, NSDI'11]
   → Too many small objects; need millions / sec
2. Compress the graph structure to fit into RAM? [WebGraph framework]
   → Associated values do not compress well, and are mutated
3. Cluster the graph and handle each cluster separately in RAM?
   → Expensive! The number of inter-cluster edges is big
4. Caching of hot nodes?
   → Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance

Multiple hard drives ("RAID-ish")

• GraphChi supports striping shards to multiple disks → parallel I/O

[Chart: seconds / iteration (0 to 3,000) for Pagerank and Connected components with 1, 2, and 3 hard drives]

Experiment on a 16-core AMD server (from year 2007).

Bottlenecks

[Chart: runtime (0 to 2,500 s) broken into Disk IO, Graph construction, and Exec. updates for 1, 2, and 4 threads; Connected Components on Mac Mini (SSD)]

• Cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
– Graph construction requires a lot of random access in RAM → memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime in seconds, split into Loading and Computation, vs. number of threads (1, 2, 4) for Matrix Factorization (ALS, 0-160 s) and Connected Components (0-1,000 s)]

104

In-memory vs Disk

105

Experiment Query latency

See the thesis for an I/O cost analysis of in/out queries.

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S
• Very fast in GraphChi-DB
– Sufficient to query for out-edges (see the sketch below)
– Parallelizes well: multi-out-edge-query
• Can be used for statistical graph analysis
– Sample induced neighborhoods, induced FoF neighborhoods from the graph
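A sketch of why out-edge queries suffice (the Database interface below is hypothetical, not GraphChi-DB's real API): every edge of the induced subgraph leaves some vertex of S, so querying the out-edges of each vertex in S and filtering by membership recovers all of them.

import java.util.ArrayList;
import java.util.List;
import java.util.Set;

class InducedSubgraphQuery {
    interface Database { long[] outNeighbors(long vertexId); }   // hypothetical out-edge query

    static List<long[]> induced(Database db, Set<Long> s) {
        List<long[]> edges = new ArrayList<>();
        for (long v : s) {        // in GraphChi-DB this could be issued as one multi-out-edge query
            for (long w : db.outNeighbors(v)) {
                if (s.contains(w)) edges.add(new long[]{v, w});   // keep edges with both endpoints in S
            }
        }
        return edges;
    }
}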

Vertices / Nodes

• Vertices are partitioned similarly to edges
– Similar "data shards" for columns
• Lookup / update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
– Sparse structure could be supported

[Diagram: vertex id space split into intervals 1 … a, a+1 … 2a, …, up to Max-id]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance shards
– Interval length constant a (one possible mapping is sketched below)

[Diagram: internal ID intervals 0,1,2,…; a,a+1,a+2,…; 2a,2a+1,…; original IDs 0 1 2 … 255 and 256 257 258 … 511 shown mapped round-robin across the intervals]

109
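One way to realize such a balanced mapping is to deal original IDs round-robin across the intervals. The formula below is an assumption made for illustration (the thesis defines the exact scheme), but it reproduces the pattern in the figure: with 256 intervals, originals 0, 1, 2, … map to internal IDs 0, a, 2a, …, and originals 256, 257, … to 1, a+1, ….

class IdMapping {
    final long numIntervals;    // number of intervals (256 in the figure) -- illustrative constant
    final long intervalLength;  // the constant a

    IdMapping(long numIntervals, long intervalLength) {
        this.numIntervals = numIntervals;
        this.intervalLength = intervalLength;
    }
    long toInternal(long originalId) {              // assumed round-robin scheme
        long interval = originalId % numIntervals;  // which interval the vertex lands in
        long offset   = originalId / numIntervals;  // its position inside that interval
        return interval * intervalLength + offset;
    }
    long toOriginal(long internalId) {              // inverse of the mapping above
        long interval = internalId / intervalLength;
        long offset   = internalId % intervalLength;
        return offset * numIntervals + interval;
    }
}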

What if we have a Cluster

[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state) in RAM]

Trade latency for throughput!

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. … which is caused by a lack of good data available to academics: big graphs with metadata
– Industry co-operation → but problems with reproducibility
– Also, it is hard to ask good questions about graphs (especially with just the structure)
3. Too much focus on performance → more important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())
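The same loop as a runnable Java sketch (plain arrays stand in for the walk and graph objects of the pseudocode; the names are illustrative):

import java.util.concurrent.ThreadLocalRandom;
import java.util.stream.IntStream;

class InMemoryWalks {
    // outEdges[v] = out-neighbors of v; walkAt[w] = current vertex of walk w
    static void run(int[][] outEdges, int[] walkAt, int numSteps) {
        IntStream.range(0, walkAt.length).parallel().forEach(w -> {   // 'parfor walk in walks'
            for (int i = 0; i < numSteps; i++) {
                int[] nbrs = outEdges[walkAt[w]];
                if (nbrs.length == 0) break;                          // dead end: stop this walk
                walkAt[w] = nbrs[ThreadLocalRandom.current().nextInt(nbrs.length)];
            }
        });
    }
}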

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java, 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems: "paging" from disk is costly.

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm – reverse thinking:

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: need to store only the current position of each walk!
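A hedged sketch of the same reversed iteration in Java (illustrative data structures, not DrunkardMob's actual implementation): walks are bucketed by the interval that contains their current vertex, so while PSW has interval p's subgraph in memory, every walk currently sitting in that interval takes its next hop.

import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ThreadLocalRandom;

class DrunkardMobSketch {
    // walkAt[w] = current vertex of walk w; outEdges stands in for the loaded interval's subgraph
    static void iteration(int[][] outEdges, int[] walkAt, int intervalLength) {
        int numIntervals = (outEdges.length + intervalLength - 1) / intervalLength;
        // Snapshot: group walk ids by the interval of their current position.
        List<List<Integer>> byInterval = new ArrayList<>();
        for (int p = 0; p < numIntervals; p++) byInterval.add(new ArrayList<>());
        for (int w = 0; w < walkAt.length; w++)
            byInterval.get(walkAt[w] / intervalLength).add(w);

        for (int p = 0; p < numIntervals; p++) {           // PSW would load interval p from disk here
            for (int w : byInterval.get(p)) {
                int[] nbrs = outEdges[walkAt[w]];
                if (nbrs.length == 0) continue;             // dead end: the walk stays put
                walkAt[w] = nbrs[ThreadLocalRandom.current().nextInt(nbrs.length)];  // addHop
            }
        }
    }
}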

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW & External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM), Ref: O'Neil, Cheng et al. (1996)
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact "Big Data" Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 19: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

19

The Main Challenge of Disk-based Graph Computation

Random Access

~ 100K reads sec (commodity)~ 1M reads sec (high-end arrays)

ltlt 5-10 M random edges sec to achieve ldquoreasonable performancerdquo

20

Random Access Problem

File edge-values

A in-edges A out-edges

A B

B in-edges B out-edges

x

Moral You can either access in- or out-edges sequentially but not both

Random write

Random readProcessing sequentially

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

bull PSW processes the graph one sub-graph a time

bull In one iteration the whole graph is processedndash And typically next iteration is started

1 Load

2 Compute

3 Write

23

bull Vertices are numbered from 1 to nndash P intervalsndash sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 10000100 700

1 Load

2 Compute

3 Write

ltltpartition-bygtgt

source destinationedge

In shards edges sorted by source

24

Example Layout

Shard 1

Shards small enough to fit in memory balance size of shards

Shard in-edges for interval of vertices sorted by source-id

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Shard 2 Shard 3 Shard 4Shard 1

1 Load

2 Compute

3 Write

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench: Online Graph DB Benchmark by Facebook

• Concurrent read/write workload – But only single-hop queries ("friends") – 8 different operations, mixed workload – Best performance with 64 parallel threads

• Each edge and vertex has – Version, timestamp, type, random string payload

GraphChi-DB (Mac Mini) vs. MySQL + FB patch (server: SSD-array, 144 GB RAM)

Edge-update (95p): 22 ms vs. 25 ms

Edge-get-neighbors (95p): 18 ms vs. 9 ms

Avg throughput: 2,487 req/s vs. 11,029 req/s

Database size: 350 GB vs. 1.4 TB

See full results in the thesis

69

Summary of Experiments

• Efficient for mixed read/write workload – See Facebook LinkBench experiments in the thesis – LSM-tree trade-off: read performance (but can adjust)

• State-of-the-art performance for graphs that are much larger than RAM – Neo4j's linked-list data structure good for RAM – DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater Impact / Hindsight

Future Research Questions

71

GREATER IMPACT

72

Impact: "Big Data" Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

– Two major direct descendant papers in top conferences: • X-Stream: SOSP 2013 • TurboGraph: KDD 2013

• Challenging the mainstream – You can do a lot on just a PC: focus on the right data structures and computational models

73

Impact: Users

• GraphChi's (C++, Java, -DB) have gained a lot of users – Currently ~50 unique visitors / day

• Enables 'everyone' to tackle big graph problems – Especially the recommender toolkit (by Danny Bickson) has been very popular – Typical users: students, non-systems researchers, small companies…

74

Impact: Users (cont.)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research - it is too expensive.

Using GraphChi I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research ()"

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for?

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models

1. Changing the value of an edge (both in- and out-)

2. Computation processes the whole or most of the graph on each iteration

3. Random access to all of a vertex's edges – Vertex-centric vs. edge-centric

4. Async / Gauss-Seidel execution

u v

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies – Or dynamic scheduling (such as Splash BP)

• High-diameter graphs, such as planar graphs – Unless the computation itself has short-range interactions

• Very large number of iterations – Neural networks – LDA with Collapsed Gibbs sampling

• No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle Counting / Item-Item Similarity

PageRank / SALSA / HITS

Graph Contraction / Minimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected Components / Strongly Connected Components

Community Detection / Label Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief Propagation / Co-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

• Distributed Setting 1. Distributed PSW (one shard / node)

1. PSW is inherently sequential 2. Low bandwidth in the Cloud

2. Co-operating GraphChi(-DB)'s connected with a Parameter Server

• New graph programming models and tools – Vertex-centric programming sometimes too local: for example, two-hop interactions and many traversals are cumbersome – Abstractions for learning graph structure? Implicit graphs? – Hard to debug, especially async. Better tools needed

• Graph-aware optimizations to GraphChi-DB – Buffer management – Smart caching – Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The IO performance of PSW is only weakly affected by the amount of RAM – Good: works with very little memory – Bad: does not benefit from more memory

• Simple trick: cache some data

• Many graph algorithms have O(V) state – Update function accesses neighbor vertex state • Standard PSW: 'broadcast' vertex value via edges • Semi-external: store vertex values in memory (a minimal sketch follows below)

82
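A minimal sketch of the semi-external trick, assuming a plain-text edge list on disk with one "src dst" pair per line (not GraphChi's binary shard format): the O(V) per-vertex state lives in an in-memory array, while the O(E) edges are only ever streamed.

```python
import array

def weakly_connected_components(edge_file, num_vertices):
    label = array.array("l", range(num_vertices))   # O(V) state kept in RAM
    changed = True
    while changed:
        changed = False
        with open(edge_file) as f:                  # edges are streamed, never held in RAM
            for line in f:
                src, dst = map(int, line.split())
                lo = min(label[src], label[dst])
                if label[src] != lo or label[dst] != lo:
                    label[src] = lo                 # update visible immediately
                    label[dst] = lo                 # (Gauss-Seidel flavor)
                    changed = True
    return label
```

For example, 100M vertices need only a few hundred megabytes of labels, even when the edge file is far larger than RAM.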

Using RAM efficiently

• Assume that there is enough RAM to store many O(V) algorithm states in memory – But not enough to store the whole graph

Graph working memory (PSW): disk

computational state:

Computation 1 state / Computation 2 state

RAM

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5) – Store billions of random-walk states in RAM

• Multiple Breadth-First Searches – Analyze neighborhood sizes by starting hundreds of random BFSes

• Compute in parallel many different recommender algorithms (or with different parameterizations) – See Mayank Mohta & Shu-Hao Yu's Master's project

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin)

UAI 2010

Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch)

OSDI 2012

DrunkardMob: Billions of Random Walks on Just a PC. ACM RecSys 2013

Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch)

SEA 2014

GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin)

(submitted, arXiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external memory graph computation: GraphChi

• Extended PSW to design Partitioned Adjacency Lists, to build a scalable graph database: GraphChi-DB

• Proposed DrunkardMob for simulating billions of random walks in parallel

• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms: new approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) vs. PowerGraph (64 EC2 cc1.4xlarge)

Investments: 67,320 $ vs. -

Operating costs:

Per node, hour: 0.03 $ vs. 1.30 $

Cluster, hour: 1.19 $ vs. 52.00 $

Daily: 28.56 $ vs. 1,248.00 $

It takes about 56 days to recoup the Mac Mini investments

Assumptions: - Mac Mini: 85 W (typical servers 500-1000 W) - Most expensive US energy: 35c / kWh

Equal-throughput configurations (based on OSDI'12)

89

PSW for In-memory Computation

• External memory setting – Slow memory = hard disk / SSD – Fast memory = RAM

• In-memory – Slow = RAM – Fast = CPU caches

Small (fast) Memory, size M

Large (slow) Memory, size 'unlimited'

B

CPU

block transfers

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems

Julian Shun, Guy Blelloch. PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel – But needs twice the amount of space

• Async / G-S helps with high-diameter graphs
• Some algorithms converge much better asynchronously – Loopy BP, see Gonzalez et al. (2009) – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)

• Asynchronous sometimes difficult to reason about and debug

• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in the Aggarwal-Vitter I/O model – Worst-case only 2x best-case

• Intuition:

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once / iteration

Edge spanning intervals is loaded twice / iteration

93
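A back-of-the-envelope count matching this intuition (a sketch in the block-transfer model, not the paper's exact statement): with block size B, every edge value is read once or twice and written back the same number of times per iteration, so

```latex
\[
  \frac{2|E|}{B} \;\le\; \text{block transfers per iteration} \;\le\; \frac{4|E|}{B},
  \qquad \text{plus } \Theta(P^2) \text{ non-sequential seeks,}
\]
```

which is why the worst case is only about 2x the best case.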

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise they would require too many iterations

• Per-iteration cost not much affected

• Can we optimize partitioning? – Could help thanks to Gauss-Seidel (faster convergence inside "groups"): topological sort – Likely too expensive to do on a single PC

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social) – But require graph partitioning: requires a lot of memory – Compression of large graphs can take days (personal communication)

• Compression problematic for evolving graphs, and associated data

• GraphChi can be used to compress graphs – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• 1990s, 2000s saw interest in object-oriented and graph databases – GOOD, GraphDB, HyperGraphDB… – Focus was on modeling; graph storage on top of relational DB or key-value store

• RDF databases – Most do not use graph storage but store triples as relations + use indexing

• Modern solutions have proposed graph-specific storage – Neo4j: doubly linked list – TurboGraph: adjacency list chopped into pages – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont.)

• GraphChi load time: 9 hours, FB's: 12 hours
• GraphChi database about 250 GB, FB > 1.4 terabytes – However, about 100 GB explained by different variable data (payload) size

• Facebook: MySQL via JDBC; GraphChi: embedded – But MySQL is native code, GraphChi-DB Scala (JVM)

• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance / Size of DB

[Chart: requests / sec (0 to 10,000) as a function of the number of nodes (1.0E+07 to 1.0E+09)]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory-extension [SSDAlloc, NSDI'11] – Too many small objects: need millions / sec

2. Compress the graph structure to fit into RAM [WebGraph framework] – Associated values do not compress well, and are mutated

3. Cluster the graph and handle each cluster separately in RAM – Expensive! The number of inter-cluster edges is big

4. Caching of hot nodes – Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks: parallel IO

[Chart: seconds / iteration (0 to 3,000) for Pagerank and Connected components, with 1, 2, and 3 hard drives]

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

[Chart: runtime in seconds (0 to 2,500) for 1, 2, and 4 threads, broken down into Disk IO, Graph construction, and Exec. updates]

Connected Components on Mac Mini SSD

• Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSD – Graph construction requires a lot of random access in RAM: memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores, SSD

• Computationally intensive applications benefit substantially from parallel execution

• GraphChi saturates SSD IO with 2 threads

[Charts: runtime in seconds vs. number of threads (1, 2, 4), split into Loading and Computation, for Matrix Factorization (ALS) (0 to 160 s) and Connected Components (0 to 1,000 s)]

104

In-memory vs Disk

105

Experiment: Query latency

See thesis for IO cost analysis of in/out queries

106

Example Induced Subgraph Queries

• Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB – Sufficient to query for out-edges – Parallelizes well: multi-out-edge-query (a small sketch follows below)

• Can be used for statistical graph analysis – Sample induced neighborhoods, induced FoF neighborhoods from graph
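A small sketch of that idea; the out_edges callback name is hypothetical and stands in for a (multi-)out-edge query against the database. We query the out-edges of every vertex in S and keep only those whose destination is also in S.

```python
def induced_subgraph(S, out_edges):
    """Return all edges with both endpoints in S, using out-edge queries only."""
    s = set(S)
    result = []
    for v in s:                           # one out-edge query per vertex; parallelizable
        for src, dst in out_edges(v):
            if dst in s:                  # both endpoints inside S -> keep the edge
                result.append((src, dst))
    return result

# Toy adjacency standing in for the database:
toy = {1: [(1, 2), (1, 5)], 2: [(2, 3)], 5: [(5, 1)]}
print(sorted(induced_subgraph({1, 2, 5}, lambda v: toy.get(v, []))))
# [(1, 2), (1, 5), (5, 1)] -- the edge (2, 3) is dropped because 3 is not in S
```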

Vertices / Nodes

• Vertices are partitioned similarly as edges – Similar "data shards" for columns

• Lookup/update of vertex data is O(1)
• No merge tree here: vertex files are "dense" – Sparse structure could be supported

[Diagram: vertex files laid out over internal IDs 1 … a, a+1 … 2a, …, Max-id]

ID-mapping

• Vertex IDs mapped to internal IDs to balance shards – Interval length constant a (an illustrative mapping is sketched below)

[Diagram: original IDs 0, 1, 2, …, 255 and 256, 257, 258, …, 511 mapped to internal IDs 0, 1, 2, …; a, a+1, a+2, …; 2a, 2a+1, …]

109
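Purely as an illustration of such a balancing map (the round-robin formula below is an assumption for the sketch, not necessarily the exact scheme of the figure): consecutive blocks of original IDs are spread across the P intervals of constant length a, and the map stays invertible.

```python
def to_internal(orig, a, P):
    """Interval = orig % P, slot inside that interval = orig // P (requires orig < a * P)."""
    return (orig % P) * a + orig // P

def to_original(internal, a, P):
    return (internal % a) * P + internal // a

a, P = 1 << 20, 16
for orig in (0, 1, 2, 255, 256, 511):
    assert to_original(to_internal(orig, a, P), a, P) == orig
```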

What if we have a Cluster?

Graph working memory (PSW) disk

computational state

Computation 1 state / Computation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. … which is caused by lack of good data available for academics: big graphs with metadata – Industry co-operation? But problem with reproducibility – Also, it is hard to ask good questions about graphs (especially with just structure)

3. Too much focus on performance. More important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
  for i = 1 to numsteps:
    vertex = walk.atVertex()
    walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

Distributed graph systems: - Each hop across a partition boundary is costly

Disk-based "single-machine" graph systems: - "Paging" from disk is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm – Reverse thinking

ForEach interval p:
  walkSnapshot = getWalksForInterval(p)
  ForEach vertex in interval(p):
    mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
    ForEach walk in mywalks:
      walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk
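An expanded, runnable toy version of the loop above, purely illustrative and entirely in memory; the real system streams each interval's sub-graph from disk while only the walks' current positions stay in RAM.

```python
import random
from collections import defaultdict

def drunkardmob(out_edges, intervals, walks, num_steps):
    """out_edges: vertex -> list of neighbors; intervals: list of vertex lists;
    walks: walk_id -> current vertex. Advances every walk num_steps hops."""
    interval_of = {v: p for p, verts in enumerate(intervals) for v in verts}
    for _ in range(num_steps):
        # Reverse thinking: bucket walks by the interval of the vertex they sit on.
        buckets = defaultdict(list)
        for walk_id, v in walks.items():
            buckets[interval_of[v]].append(walk_id)
        for p in range(len(intervals)):              # "load" one interval at a time
            for walk_id in buckets[p]:
                v = walks[walk_id]
                neighbors = out_edges.get(v, [])
                if neighbors:
                    walks[walk_id] = random.choice(neighbors)   # store only the position
    return walks

graph = {1: [2, 3], 2: [3], 3: [1]}
print(drunkardmob(graph, intervals=[[1, 2], [3]],
                  walks={w: 1 for w in range(5)}, num_steps=10))
```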

Page 20: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

20

Random Access Problem

File edge-values

A in-edges A out-edges

A B

B in-edges B out-edges

x

Moral You can either access in- or out-edges sequentially but not both

Random write

Random readProcessing sequentially

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

bull PSW processes the graph one sub-graph a time

bull In one iteration the whole graph is processedndash And typically next iteration is started

1 Load

2 Compute

3 Write

23

bull Vertices are numbered from 1 to nndash P intervalsndash sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 10000100 700

1 Load

2 Compute

3 Write

ltltpartition-bygtgt

source destinationedge

In shards edges sorted by source

24

Example Layout

Shard 1

Shards small enough to fit in memory balance size of shards

Shard in-edges for interval of vertices sorted by source-id

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Shard 2 Shard 3 Shard 4Shard 1

1 Load

2 Compute

3 Write

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 21: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

21

Our Solution

Parallel Sliding Windows (PSW)

22

Parallel Sliding Windows Phases

bull PSW processes the graph one sub-graph a time

bull In one iteration the whole graph is processedndash And typically next iteration is started

1 Load

2 Compute

3 Write

23

bull Vertices are numbered from 1 to nndash P intervalsndash sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 10000100 700

1 Load

2 Compute

3 Write

ltltpartition-bygtgt

source destinationedge

In shards edges sorted by source

24

Example Layout

Shard 1

Shards small enough to fit in memory balance size of shards

Shard in-edges for interval of vertices sorted by source-id

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Shard 2 Shard 3 Shard 4Shard 1

1 Load

2 Compute

3 Write

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

(Same shard as before. Pointer-array of (source, file offset) pairs: (1, 0), (3, 3), (7, 5), …; edge-array of destinations: 8, 193, 76420, 12, 872, 193, 212, 89139, …)

52

PAL In-edge Linkage

Edge-array, augmented with a link per edge (offset of the next in-edge of the same destination vertex):

Destination   Link
8             3339
193           3
76420         1092
12            289
872           40
193           2002
212           12
89139         22
…             …

Pointer-array as before:

Source   File offset
1        0
3        3
7        5
…        …

+ Index to the first in-edge of each vertex in the interval.

Augmented linked list for in-edges (a small traversal sketch follows below).

Problem 2: How to find out-edges quickly?
Note: sorted inside a shard, but partitioned across all shards.
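A small Scala sketch of following the in-edge chain inside one shard; the array names and the sourceOf helper are illustrative, not GraphChi-DB's actual API:

  // destinations(k) : destination of edge k in this shard's edge-array
  // nextInOffset(k) : offset of the next in-edge of the same destination, or -1
  // firstInEdge     : offset of a vertex's first in-edge in this shard, -1 if none
  // sourceOf        : recovers an edge's source from the pointer-array (assumed helper)
  class InEdgeChains(destinations: Array[Long],
                     nextInOffset: Array[Int],
                     firstInEdge: Map[Long, Int],
                     sourceOf: Int => Long) {

    // Sources of all in-edges of `vertex` stored in this shard.
    def inNeighbors(vertex: Long): List[Long] = {
      var result = List.empty[Long]
      var k = firstInEdge.getOrElse(vertex, -1)
      while (k >= 0) {
        result ::= sourceOf(k)   // source id recovered via the pointer-array
        k = nextInOffset(k)      // hop to the next in-edge of the same vertex
      }
      result
    }
  }

The extra link costs one offset per edge, but it keeps an in-edge query confined to a single shard per level.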

53

PAL Out-edge Queries

Pointer-array:

Source   File offset
1        0
3        3
7        5
…        …

Problem: the pointer-array can be big, O(V), and binary search on disk is slow.

Edge-array as (Destination, Next-in-offset) pairs: (8, 3339), (193, 3), (76420, 1092), (12, 289), (872, 40), (193, 2002), (212, 12), (89139, 22), …

Option 1: Sparse index over the pointer-array, kept in memory (diagram shows sampled entries 256, 512, 1024, …).
Option 2: Delta-coding with unary code (Elias-Gamma); the whole array then fits in memory.
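A minimal sketch of Option 1, assuming a hypothetical SparsePointerIndex that keeps every Nth pointer-array entry in RAM (Option 2 instead gamma-codes the gaps between consecutive source ids so the full array stays in memory):

  // Sketch of a sparse in-memory index over the on-disk pointer-array (illustrative API).
  class SparsePointerIndex(samples: Array[Long],   // every Nth source id, sorted
                           samplePos: Array[Long], // file position of that pointer-array entry
                           stride: Int) {

    // File position from which at most `stride` pointer-array entries have to be
    // scanned sequentially to locate `src` (or learn it stores no edges here).
    def scanStart(src: Long): Long = {
      var i = java.util.Arrays.binarySearch(samples, src)
      if (i < 0) i = -i - 2            // insertion point - 1 = last sample <= src
      if (i < 0) samplePos(0) else samplePos(i)
    }
  }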

54

Experiment: Indices

Median latency, Twitter graph (1.5B edges).

55

Queries: I/O costs

In-edge query: only one shard.
Out-edge query: each shard that has edges.

Trade-off: more shards gives better locality for in-edge queries, worse for out-edge queries.

Edge Data & Searches

Shard X: adjacency plus separate columns per property, e.g. 'weight' [float], 'timestamp' [long], 'belief' (factor).

Edge (i): no foreign key is required to find the edge data (fixed size per column entry).

Note: vertex values are stored similarly.
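A sketch of the columnar idea, assuming one fixed-width file per edge column (names are illustrative): the i-th edge of a shard owns the i-th slot of every column file, so no foreign key or index is needed:

  import java.io.RandomAccessFile

  // Fixed-width edge column addressed directly by edge index (illustrative sketch).
  class FloatColumn(path: String) {
    private val file = new RandomAccessFile(path, "r")

    def read(edgeIndex: Long): Float = {
      file.seek(edgeIndex * 4L)   // 4 bytes per float value
      file.readFloat()
    }
  }

  // e.g. new FloatColumn("shard7.weight").read(i) would return the 'weight' of edge i of shard 7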

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading the existing shard from disk. Each edge is always rewritten on a merge.

Does not scale: number of rewrites = data size / buffer size = O(E).

60

Log-Structured Merge-tree (LSM). Ref: O'Neil, Cheng et al. (1996)

LEVEL 1 (youngest): top shards with in-memory buffers, one per interval (interval 1, interval 2, interval 3, interval 4, …, interval P-1, interval P). New edges enter here.
LEVEL 2: on-disk shards covering wider interval ranges (intervals 1..4, …, intervals (P-4)..P).
LEVEL 3 (oldest): still wider ranges (intervals 1..16, …).
Downstream merge: when a level fills, its shards are merged into the next, older level.

In-edge query: one shard on each level.
Out-edge query: all shards.
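A simplified Scala sketch of the LSM idea applied to edge buffers (not the real GraphChi-DB code: sorted in-memory vectors stand in for on-disk shards, and the in-edge query does a linear scan where the real structure searches one sorted run per level):

  case class Edge(src: Long, dst: Long)

  class EdgeLSM(bufferCapacity: Int, fanout: Int = 4) {
    private var buffer = Vector.empty[Edge]            // LEVEL 1: in-memory buffer
    private var levels = Vector.empty[Vector[Edge]]    // older levels, sorted by (dst, src)

    def insert(e: Edge): Unit = {
      buffer = buffer :+ e
      if (buffer.size >= bufferCapacity) flush()
    }

    private def flush(): Unit = {
      var run = buffer.sortBy(e => (e.dst, e.src)); buffer = Vector.empty
      var i = 0
      // Downstream merge: if level i would overflow its budget, merge it into the run
      // and continue into the next, larger level.
      while (i < levels.size &&
             levels(i).size + run.size > bufferCapacity * math.pow(fanout, i + 1)) {
        run = (levels(i) ++ run).sortBy(e => (e.dst, e.src))
        levels = levels.updated(i, Vector.empty[Edge])
        i += 1
      }
      if (i < levels.size) levels = levels.updated(i, (levels(i) ++ run).sortBy(e => (e.dst, e.src)))
      else levels = levels :+ run
    }

    // In-edge query touches one run per level (linear scan here for brevity).
    def inEdges(v: Long): Seq[Edge] = (buffer ++ levels.flatten).filter(_.dst == v)
  }

Each edge is rewritten only O(log E) times instead of O(E / buffer size) times, which is what makes online ingest scale.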

61

Experiment: Ingest

[Chart: edges ingested (0 to 1.5x10^9) vs. time in seconds (0 to 16,000): GraphChi-DB with LSM vs. GraphChi-No-LSM]

62

Advantages of PAL

• Only sparse and implicit indices
– Pointer-array usually fits in RAM with Elias-Gamma coding
– Small database size

• Columnar data model
– Load only the data you need
– Graph structure is separate from data
– Property graph model

• Great insertion throughput with LSM
– Tree can be adjusted to match the workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala
• Queries & Computation
• Online database

All experiments shown in this talk were done on a Mac Mini (8 GB, SSD).

Source code and examples: http://github.com/graphchi

65

Comparison: Database Size

[Chart: database file size (twitter-2010 graph, 1.5B edges), 0–70 GB scale; systems: BASELINE (4 + 4 bytes / edge), GraphChi-DB, MySQL (data + indices), Neo4j]

66

Comparison: Ingest

System                   Time to ingest 1.5B edges
GraphChi-DB (ONLINE)     1 hour 45 mins
Neo4j (batch)            45 hours
MySQL (batch)            3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes.

Comparison: Friends-of-Friends Query

See thesis for the shortest-path comparison.

[Charts: latency percentiles (milliseconds) over 100K random queries.
Small graph (68M edges), 50th and 99th percentile: GraphChi-DB vs. Neo4j.
Big graph (1.5B edges), 50th and 99th percentile: GraphChi-DB vs. Neo4j vs. MySQL.]

68

LinkBench: Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
– But only single-hop queries ("friends")
– 8 different operations, mixed workload
– Best performance with 64 parallel threads

• Each edge and vertex has:
– Version, timestamp, type, random string payload

                          GraphChi-DB (Mac Mini)    MySQL + FB patch, server, SSD-array, 144 GB RAM
Edge-update (95p)         22 ms                     25 ms
Edge-get-neighbors (95p)  18 ms                     9 ms
Avg throughput            2,487 req/s               11,029 req/s
Database size             350 GB                    1.4 TB

See full results in the thesis.

69

Summary of Experiments

• Efficient for mixed read/write workload
– See Facebook LinkBench experiments in the thesis
– LSM-tree trades off read performance (but can adjust)

• State-of-the-art performance for graphs that are much larger than RAM
– Neo4j's linked-list data structure is good for RAM
– DEX performs poorly in practice

More experiments in the thesis.

70

Discussion

Greater Impact, Hindsight,

Future Research Questions

71

GREATER IMPACT

72

Impact: "Big Data" Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)
– Two major direct descendant papers in top conferences:
• X-Stream: SOSP 2013
• TurboGraph: KDD 2013

• Challenging the mainstream:
– You can do a lot on just a PC: focus on the right data structures and computational models

73

Impact: Users

• GraphChi's implementations (C++, Java, -DB) have gained a lot of users
– Currently ~50 unique visitors / day

• Enables 'everyone' to tackle big graph problems
– Especially the recommender toolkit (by Danny Bickson) has been very popular
– Typical users: students, non-systems researchers, small companies…

74

Impact: Users (cont.)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research, it is too expensive.

Using GraphChi I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research."

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for?

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models

1. Changing the value of an edge (both in- and out-edges)
2. Computation processes the whole graph, or most of it, on each iteration
3. Random access to all of a vertex's edges
– Vertex-centric vs. edge-centric
4. Async / Gauss-Seidel execution

[Diagram: edge (u, v)]

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
– Or dynamic scheduling (such as Splash BP)

• High-diameter graphs, such as planar graphs
– Unless the computation itself has short-range interactions

• Very large number of iterations
– Neural networks
– LDA with Collapsed Gibbs sampling

• No support for implicit graph structure

+ Single-PC performance is limited

78

[Diagram, repeated from the GraphChi-DB overview: graph computation applications supported by GraphChi (Parallel Sliding Windows) and graph query applications supported by GraphChi-DB (Partitioned Adjacency Lists)]

Versatility of PSW and PAL

79

Future Research Directions

• Distributed setting
1. Distributed PSW (one shard / node)
   1. PSW is inherently sequential
   2. Low bandwidth in the Cloud
2. Co-operating GraphChi(-DB)'s connected with a Parameter Server

• New graph programming models and tools
– Vertex-centric programming is sometimes too local: for example, two-hop interactions and many traversals are cumbersome
– Abstractions for learning graph structure? Implicit graphs?
– Hard to debug, especially async. Better tools needed.

• Graph-aware optimizations to GraphChi-DB
– Buffer management
– Smart caching
– Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY?

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
– Good: works with very little memory
– Bad: does not benefit from more memory

• Simple trick: cache some data

• Many graph algorithms have O(V) state
– Update function accesses neighbor vertex state
• Standard PSW: 'broadcast' vertex value via edges
• Semi-external: store vertex values in memory

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
– But not enough to store the whole graph

[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, …) in RAM. A minimal sketch of this pattern follows below.]
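A minimal Scala sketch of the semi-external pattern, assuming a helper edgesOf(p) that streams interval p's edges from disk; the O(V) rank and degree arrays are the only state held in RAM:

  def semiExternalPageRank(numVertices: Int,
                           outDegree: Array[Int],
                           edgesOf: Int => Iterator[(Int, Int)],  // interval -> (src, dst) edges
                           numIntervals: Int,
                           iterations: Int): Array[Float] = {
    val rank = Array.fill(numVertices)(1.0f)        // O(V) state in memory
    for (_ <- 1 to iterations) {
      val next = Array.fill(numVertices)(0.15f)
      for (p <- 0 until numIntervals; (src, dst) <- edgesOf(p)) {
        // neighbor value is read directly from the in-memory array instead of
        // being broadcast along edges as in standard PSW
        next(dst) += 0.85f * rank(src) / math.max(outDegree(src), 1)
      }
      Array.copy(next, 0, rank, 0, numVertices)
    }
    rank
  }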

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
– Store billions of random walk states in RAM

• Multiple Breadth-First Searches
– Analyze neighborhood sizes by starting hundreds of random BFSes

• Compute in parallel many different recommender algorithms (or the same one with different parameterizations)
– See Mayank Mohta & Shu-Hao Yu's Master's project

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin). UAI 2010.

Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors). VLDB 2012.

GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch). OSDI 2012.

DrunkardMob: Billions of Random Walks on Just a PC. ACM RecSys 2013.

Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch). SEA 2014.

GraphChi-DB: Simple Design for a Scalable Graph Database, on Just a PC (with C. Guestrin). (submitted, arXiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin). ICML 2011.

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation: GraphChi

• Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database: GraphChi-DB

• Proposed DrunkardMob for simulating billions of random walks in parallel

• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms: a new approach for external-memory graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                    GraphChi (40 Mac Minis)    PowerGraph (64 EC2 cc1.4xlarge)
Investments         67,320 $                   -
Operating costs:
Per node / hour     0.03 $                     1.30 $
Cluster / hour      1.19 $                     52.00 $
Daily               28.56 $                    1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions:
- Mac Mini: 85 W (typical servers: 500-1000 W)
- Most expensive US energy: 35c / kWh

Equal-throughput configurations (based on OSDI'12).

89

PSW for In-memory Computation

• External memory setting
– Slow memory = hard disk / SSD
– Fast memory = RAM

• In-memory
– Slow = RAM
– Fast = CPU caches

[Diagram: small (fast) memory of size M, large (slow) memory of 'unlimited' size, block transfers of size B between them, CPU attached to the fast memory]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra*, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

* Julian Shun, Guy Blelloch, PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
– But needs twice the amount of space

• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
– Loopy BP, see Gonzalez et al. (2009)
– Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)

• Asynchronous is sometimes difficult to reason about and debug
• Asynchronous can be used to implement BSP

I/O Complexity

• See the paper for theoretical analysis in the Aggarwal-Vitter I/O model
– Worst-case only 2x best-case

• Intuition:

[Diagram: vertices 1..|V| split into interval(1)..interval(P) with shard(1)..shard(P); example vertices v1, v2]

An edge inside one interval is loaded from disk only once / iteration.
An edge spanning two intervals is loaded twice / iteration.

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise they would require too many iterations

• Per-iteration cost is not much affected

• Can we optimize partitioning?
– Could help thanks to Gauss-Seidel (faster convergence inside "groups"), topological sort
– Likely too expensive to do on a single PC

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
– But they require graph partitioning, which requires a lot of memory
– Compression of large graphs can take days (personal communication)

• Compression is problematic for evolving graphs and associated data

• GraphChi can be used to compress graphs
– Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• 1990s, 2000s saw interest in object-oriented and graph databases
– GOOD, GraphDB, HyperGraphDB…
– Focus was on modeling; graph storage on top of a relational DB or key-value store

• RDF databases
– Most do not use graph storage but store triples as relations + use indexing

• Modern solutions have proposed graph-specific storage:
– Neo4j: doubly linked list
– TurboGraph: adjacency list chopped into pages
– DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont.)

• GraphChi load time 9 hours, FB's 12 hours
• GraphChi database about 250 GB, FB > 1.4 terabytes
– However, about 100 GB is explained by different variable data (payload) size

• Facebook: MySQL via JDBC; GraphChi: embedded
– But MySQL is native code, GraphChi-DB is Scala (JVM)

• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance / Size of DB

[Chart: requests / sec (0–10,000) vs. number of nodes (1x10^7 to 1x10^9)]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory-extension [SSDAlloc, NSDI'11]
→ Too many small objects; need millions / sec

2. Compress the graph structure to fit into RAM [WebGraph framework]
→ Associated values do not compress well, and are mutated

3. Cluster the graph and handle each cluster separately in RAM
→ Expensive: the number of inter-cluster edges is big

4. Caching of hot nodes
→ Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance.

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks: parallel I/O

[Chart: seconds / iteration (0–3000) for Pagerank and Connected components with 1, 2, and 3 hard drives]

Experiment on a 16-core AMD server (from year 2007).

Bottlenecks

[Chart: Connected Components on Mac Mini SSD; runtime (0–2500 s) with 1, 2, and 4 threads, broken into Disk IO / Graph construction / Exec. updates]

• Cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
– Graph construction requires a lot of random access in RAM: memory bandwidth becomes a bottleneck

Bottlenecks: Multicore

Experiment on MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime (seconds), split into Loading and Computation, vs. number of threads (1, 2, 4) for Matrix Factorization (ALS) and Connected Components]

104

In-memory vs Disk

105

Experiment: Query latency

See thesis for I/O cost analysis of in/out queries.

106

Example: Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB
– Sufficient to query for out-edges
– Parallelizes well: multi-out-edge-query

• Can be used for statistical graph analysis
– Sample induced neighborhoods, induced FoF neighborhoods from the graph
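A short Scala sketch of why out-edge queries suffice, assuming a hypothetical outNeighbors query function: every edge of the induced subgraph has its source in S, so querying out-edges of S and filtering the endpoints finds all of them:

  // Induced subgraph via out-edge queries only (illustrative API).
  def inducedSubgraph(s: Set[Long], outNeighbors: Long => Seq[Long]): Seq[(Long, Long)] =
    s.toSeq.flatMap { v =>
      outNeighbors(v).filter(s.contains).map(u => (v, u))  // keep only edges staying inside S
    }

The per-vertex queries are independent, which is why the multi-out-edge-query version parallelizes well.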

Vertices / Nodes

• Vertices are partitioned similarly as edges
– Similar "data shards" for columns

• Lookup / update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
– Sparse structure could be supported

[Diagram: vertex id space 1 … a, a+1 … 2a, …, Max-id split into intervals of length a]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance shards
– Interval length constant a

[Diagram: original IDs 0, 1, 2, …, 255 and 256, 257, 258, …, 511 mapped in blocks across the intervals (internal IDs 0, 1, 2, …; a, a+1, a+2, …; 2a, 2a+1, …), so the original ID space is spread evenly over the intervals]

109

What if we have a Cluster?

[Diagram: each machine runs PSW with the graph working memory on disk and several computations' state in RAM]

Trade latency for throughput.

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. … which is caused by a lack of good data available to academics: big graphs with metadata
– Industry co-operation? But problem with reproducibility
– Also, it is hard to ask good questions about graphs (especially with just structure)

3. Too much focus on performance. More important to enable "extracting value".

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems: "paging" from disk is costly.

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
– Reverse thinking:

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk.
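An illustrative Scala sketch of how little state a walk needs; packing the walk id and current vertex into one Long is an assumption made for the example, not necessarily DrunkardMob's exact encoding:

  // Illustrative packing: high 32 bits = walk id, low 32 bits = current vertex.
  def pack(walkId: Int, vertex: Int): Long = (walkId.toLong << 32) | (vertex & 0xffffffffL)
  def walkIdOf(w: Long): Int = (w >>> 32).toInt
  def vertexOf(w: Long): Int = (w & 0xffffffffL).toInt

  // At 8 bytes per walk, one billion walks fit in about 8 GB of RAM.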

Page 22: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

22

Parallel Sliding Windows Phases

bull PSW processes the graph one sub-graph a time

bull In one iteration the whole graph is processedndash And typically next iteration is started

1 Load

2 Compute

3 Write

23

bull Vertices are numbered from 1 to nndash P intervalsndash sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 10000100 700

1 Load

2 Compute

3 Write

ltltpartition-bygtgt

source destinationedge

In shards edges sorted by source

24

Example Layout

Shard 1

Shards small enough to fit in memory balance size of shards

Shard in-edges for interval of vertices sorted by source-id

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Shard 2 Shard 3 Shard 4Shard 1

1 Load

2 Compute

3 Write

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 23: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

23

bull Vertices are numbered from 1 to nndash P intervalsndash sub-graph = interval of vertices

PSW Shards and Intervals

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 10000100 700

1 Load

2 Compute

3 Write

ltltpartition-bygtgt

source destinationedge

In shards edges sorted by source

24

Example Layout

Shard 1

Shards small enough to fit in memory balance size of shards

Shard in-edges for interval of vertices sorted by source-id

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Shard 2 Shard 3 Shard 4Shard 1

1 Load

2 Compute

3 Write

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

See the paper for more comparisons.

[Charts: runtime in minutes, GraphChi (Mac Mini) vs. a distributed / parallel system]
- PageRank, Twitter-2010 (1.5B edges): GraphChi (Mac Mini) vs. Spark (50 machines)
- WebGraph Belief Propagation (U. Kang et al.), Yahoo-web (6.7B edges): GraphChi (Mac Mini) vs. Pegasus / Hadoop (100 machines)
- Matrix Factorization (Alt. Least Sqr.), Netflix (0.99B edges): GraphChi (Mac Mini) vs. GraphLab v1 (8 cores)
- Triangle Counting, twitter-2010 (1.5B edges): GraphChi (Mac Mini) vs. Hadoop (1636 machines)

On a Mac Mini, GraphChi can solve as big problems as existing large-scale systems, with comparable performance.

40

PowerGraph Comparison
• PowerGraph / GraphLab 2 outperforms previous systems by a wide margin on natural graphs (OSDI '12)
• With 64x more machines, 512x more CPUs:
- Pagerank: 40x faster than GraphChi
- Triangle counting: 30x faster than GraphChi

GraphChi has good performance / CPU.
[Chart: PowerGraph (GraphLab 2) vs. GraphChi]

41

In-memory Comparison

• Total runtime comparison to 1-shard GraphChi, with initial load + output write taken into account

GraphChi, Mac Mini - SSD: 790 secs
Ligra (J. Shun, Blelloch), 40-core Intel E7-8870: 15 secs

• Comparison to state-of-the-art
• 5 iterations of Pagerank, Twitter (1.5B edges)

However, sometimes a better algorithm is available for in-memory than for external-memory / distributed settings.

Ligra (J. Shun, Blelloch), 8-core Xeon 5550: 230 s + preproc 144 s
PSW - in-mem version, 700 shards (see Appendix), 8-core Xeon 5550: 100 s + preproc 210 s

42

Scalability Input Size [SSD]

• Throughput: number of edges processed / second

Conclusion: the throughput remains roughly constant when graph size is increased.

GraphChi with a hard drive is ~2x slower than SSD (if the computational cost is low).

[Chart: PageRank throughput (performance, edges/sec) vs. graph size (0 to 7e9 edges) on Mac Mini SSD, for domain, twitter-2010, uk-2007-05, uk-union, yahoo-web]

See the paper for scalability of other applications.

43

GRAPHCHI-DB
New work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batch comp.

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle Counting, Item-Item Similarity

PageRank, SALSA, HITS

Graph Contraction, Minimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected Components, Strongly Connected Components

Community Detection, Label Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief Propagation, Co-EM

Graph traversal

45

Research Questions

• What if there is a lot of metadata associated with edges and vertices?
• How to do graph queries efficiently while retaining computational capabilities?
• How to add edges efficiently to the graph?

Can we design a graph database based on GraphChi?

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problems:
• Poor performance with data >> memory
• No/weak support for analytical computation

2) Relational / key-value databases as graph storage

Problems:
• Large indices
• In-edge / out-edge dilemma
• No/weak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1 (sorted by source_id), Shard 2, Shard 3, ..., Shard P
Vertex id ranges: 0..100, 101..700, ..., N

Edge = (src, dst). Partition by dst.
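As a hedged illustration of the partitioning rule above (the interval boundaries and names are made up, not GraphChi's actual preprocessing code), an edge is routed to the shard whose destination-vertex interval contains dst:

#include <cstdint>
#include <vector>

// Shard p stores every edge (src, dst) whose dst falls into interval p.
// 'interval_ends' holds the last vertex id of each interval, e.g. {100, 700, 1000, 10000}.
int shard_for_edge(uint64_t dst, const std::vector<uint64_t>& interval_ends) {
    // Linear scan for clarity; a binary search works equally well.
    for (size_t p = 0; p < interval_ends.size(); ++p)
        if (dst <= interval_ends[p]) return static_cast<int>(p);
    return -1;  // dst out of range
}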

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

... ...  (columns: src, dst)

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1:
How to find the in-edges of a vertex quickly?
Note: We know the shard, but the edges are in random order.

Pointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2:
How to find out-edges quickly?
Note: Sorted inside a shard, but partitioned across all shards.

Augmented linked list for in-edges

53
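A sketch of how a shard from the last two slides could be laid out, under assumed field names (the real GraphChi-DB code differs): a sparse pointer array in CSR style, plus an edge array where each in-edge also carries a link to the next in-edge of the same destination.

#include <cstdint>
#include <vector>

// One PAL shard (simplified). Edges are sorted by source id; the pointer array is
// sparse (one entry per source that has edges in this shard), CSR style.
struct PointerEntry { uint64_t src, first_edge; };

struct EdgeEntry {
    uint64_t dst;             // destination vertex id (in this shard's interval)
    uint64_t next_in_offset;  // edge-array index of the next in-edge of the same dst
};

struct Shard {
    static constexpr uint64_t NO_EDGE = ~0ull;
    std::vector<PointerEntry> pointers;   // "pointer-array"
    std::vector<EdgeEntry>    edges;      // "edge-array"
    std::vector<uint64_t>     first_in;   // per interval vertex: head of its in-edge chain

    // In-edge query for one vertex of this interval: walk the augmented linked list.
    std::vector<uint64_t> in_edge_indices(uint64_t local_vertex) const {
        std::vector<uint64_t> idx;
        for (uint64_t e = first_in[local_vertex]; e != NO_EDGE; e = edges[e].next_in_offset)
            idx.push_back(e);  // the source of edge e is recovered via the pointer-array
        return idx;
    }
};

Because the source id is implied by the pointer array rather than stored per edge, the in-edge chain answers "who points to me?" with one shard read and no per-edge source column.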

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem: Can be big -- O(V). Binary search on disk is slow.

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1: Sparse index (in-mem)

Option 2: Delta-coding with unary code (Elias-Gamma). Completely in-memory.
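A hedged sketch of Option 1 above: keep a small in-memory sparse index over the on-disk pointer array, so an out-edge query reads and binary-searches only one small region of the shard. The index granularity and names are assumptions, not GraphChi-DB's actual structures.

#include <algorithm>
#include <cstdint>
#include <vector>

// Sparse index: every k-th pointer-array entry kept in RAM, with its file position.
struct SparseIndexEntry { uint64_t src; uint64_t file_pos; };

// Locate the pointer-array region that may contain 'src'; the query then reads only
// that region from disk and binary-searches it for the exact entry.
uint64_t region_for_source(uint64_t src, const std::vector<SparseIndexEntry>& index) {
    if (index.empty()) return 0;
    auto it = std::upper_bound(index.begin(), index.end(), src,
                               [](uint64_t s, const SparseIndexEntry& e) { return s < e.src; });
    if (it == index.begin()) return index.front().file_pos;
    return std::prev(it)->file_pos;  // start of the region to scan on disk
}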

54

Experiment Indices

Median latency, Twitter graph, 1.5B edges

55

Queries: I/O costs
In-edge query: only one shard
Out-edge query: each shard that has edges

Trade-off: more shards = better locality for in-edge queries, worse for out-edge queries.

Edge Data amp Searches

Shard X - adjacency
'weight' [float]
'timestamp' [long], 'belief' (factor)
Edge (i)

No foreign key is required to find edge data (fixed size).

Note: vertex values are stored similarly.
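Because each edge-data column is a flat file of fixed-size values, the value of edge i sits at a fixed offset; a minimal sketch under assumed names (not the actual GraphChi-DB column format):

#include <cstdint>
#include <cstdio>

// Read the value of edge 'i' from a fixed-width column file such as 'weight' [float]
// or 'timestamp' [long]. No foreign key or per-edge index is needed: the edge's
// position in the shard *is* its key into every column.
template <typename T>
T read_edge_column(std::FILE* column_file, uint64_t edge_index) {
    T value{};
    std::fseek(column_file, static_cast<long>(edge_index * sizeof(T)), SEEK_SET);
    std::fread(&value, sizeof(T), 1, column_file);
    return value;
}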

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading the existing shard from disk. Each edge is always rewritten on a merge.

Does not scale: number of rewrites ~ data size / buffer size = O(E).

60

Log-Structured Merge-tree

(LSM). Ref: O'Neil, Cheng et al. (1996)

Top shards with in-memory buffers; on-disk shards below.

LEVEL 1 (youngest): interval 1, interval 2, interval 3, interval 4, ..., interval P-1, interval P  <- new edges
LEVEL 2: intervals 1--4, ..., intervals (P-4)--P  (downstream merge)
LEVEL 3 (oldest): intervals 1--16, ...

In-edge query: one shard on each level
Out-edge query: all shards
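A simplified sketch of the LSM idea on this slide, with invented names and an in-memory stand-in for the on-disk shards: new edges land in a buffer, and full levels are merged downstream into larger ones, so each edge is rewritten only once per level instead of on every buffer flush.

#include <algorithm>
#include <cstdint>
#include <iterator>
#include <vector>

struct Edge { uint64_t src, dst; };
bool by_src(const Edge& a, const Edge& b) { return a.src < b.src; }

// One interval's shards, youngest level first. Level i holds up to base * fanout^(i+1) edges.
struct LsmShards {
    std::vector<Edge> buffer;               // level-0: in-memory edge buffer
    std::vector<std::vector<Edge>> levels;   // on-disk shards (modelled in memory here)
    size_t base = 1 << 16;
    size_t fanout = 4;

    void add_edge(Edge e) {
        buffer.push_back(e);
        if (buffer.size() >= base) flush();
    }

    void flush() {
        std::sort(buffer.begin(), buffer.end(), by_src);
        merge_into(0, buffer);
        buffer.clear();
    }

    // Merge a sorted run into level 'i'; cascade downstream when a level grows too large.
    void merge_into(size_t i, std::vector<Edge>& run) {
        if (levels.size() <= i) levels.resize(i + 1);
        std::vector<Edge> merged;
        std::merge(levels[i].begin(), levels[i].end(), run.begin(), run.end(),
                   std::back_inserter(merged), by_src);
        levels[i].swap(merged);
        size_t capacity = base;
        for (size_t k = 0; k <= i; ++k) capacity *= fanout;
        if (levels[i].size() > capacity) {   // push the whole level one step downstream
            std::vector<Edge> spill;
            spill.swap(levels[i]);
            merge_into(i + 1, spill);
        }
    }
};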

61

Experiment Ingest

[Chart: edges ingested (0 to 1.5e9) vs. time (0 to 16,000 seconds), GraphChi-DB with LSM vs. GraphChi-No-LSM]

62

Advantages of PAL

• Only sparse and implicit indices
- Pointer-array usually fits in RAM with Elias-Gamma
- Small database size

• Columnar data model
- Load only the data you need
- Graph structure is separate from data
- Property graph model

• Great insertion throughput with LSM
- Tree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala
• Queries & Computation
• Online database

All experiments shown in this talk were done on a Mac Mini (8 GB, SSD).

Source code and examples: http://github.com/graphchi

65

Comparison Database Size

Baseline: 4 + 4 bytes / edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

[Bar chart, scale 0-70]
Database file size (twitter-2010 graph, 1.5B edges)

66

Comparison Ingest

System | Time to ingest 1.5B edges
GraphChi-DB (ONLINE) | 1 hour 45 mins
Neo4j (batch) | 45 hours
MySQL (batch) | 3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes.

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

[Charts: friends-of-friends query latency (milliseconds), latency percentiles over 100K random queries]

Small graph (68M edges):
- 50th percentile: GraphChi-DB 0.379 ms, Neo4j 0.127 ms
- 99th percentile: GraphChi-DB 6.653 ms, Neo4j 8.078 ms

Big graph (1.5B edges):
- 50th percentile: GraphChi-DB 22.4 ms, Neo4j 76.0 ms, MySQL 59 ms
- 99th percentile: GraphChi-DB 1264 ms, Neo4j 1631 ms, MySQL 4776 ms

68

LinkBench Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
- But only single-hop queries ("friends")
- 8 different operations, mixed workload
- Best performance with 64 parallel threads

• Each edge and vertex has:
- Version, timestamp, type, random string payload

| GraphChi-DB (Mac Mini) | MySQL + FB patch, server, SSD-array, 144 GB RAM
Edge-update (95p) | 22 ms | 25 ms
Edge-get-neighbors (95p) | 18 ms | 9 ms
Avg throughput | 2,487 req/s | 11,029 req/s
Database size | 350 GB | 1.4 TB

See full results in the thesis

69

Summary of Experiments

• Efficient for a mixed read/write workload
- See the Facebook LinkBench experiments in the thesis
- LSM-tree trade-off: read performance (but can adjust)

• State-of-the-art performance for graphs that are much larger than RAM
- Neo4j's linked-list data structure is good for RAM
- DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater Impact / Hindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)
- Two major direct descendant papers in top conferences:
• X-Stream, SOSP 2013
• TurboGraph, KDD 2013

• Challenging the mainstream
- You can do a lot on just a PC: focus on the right data structures and computational models

73

Impact Users

• GraphChi's versions (C++, Java, -DB) have gained a lot of users
- Currently ~50 unique visitors / day

• Enables 'everyone' to tackle big graph problems
- Especially the recommender toolkit (by Danny Bickson) has been very popular
- Typical users: students, non-systems researchers, small companies...

74

Impact Users (cont)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research - it is too expensive.

Using GraphChi, I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research."

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models

1. Changing the value of an edge (both in- and out-edges)
2. Computation processes the whole or most of the graph on each iteration
3. Random access to all of a vertex's edges
- Vertex-centric vs. edge-centric
4. Async / Gauss-Seidel execution

u v

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
- Or dynamic scheduling (such as Splash BP)

• High-diameter graphs, such as planar graphs
- Unless the computation itself has short-range interactions

• Very large number of iterations
- Neural networks
- LDA with Collapsed Gibbs sampling

• No support for implicit graph structure

+ Single-PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batch comp.

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle Counting, Item-Item Similarity

PageRank, SALSA, HITS

Graph Contraction, Minimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected Components, Strongly Connected Components

Community Detection, Label Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief Propagation, Co-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

• Distributed Setting
1. Distributed PSW (one shard / node)
- PSW is inherently sequential
- Low bandwidth in the Cloud
2. Co-operating GraphChi(-DB)'s connected with a Parameter Server

• New graph programming models and tools
- Vertex-centric programming is sometimes too local: for example, two-hop interactions and many traversals are cumbersome
- Abstractions for learning graph structure; implicit graphs
- Hard to debug, especially async; better tools needed

• Graph-aware optimizations to GraphChi-DB
- Buffer management
- Smart caching
- Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
- Good: works with very little memory
- Bad: does not benefit from more memory

• Simple trick: cache some data

• Many graph algorithms have O(V) state
- The update function accesses neighbor vertex state
• Standard PSW: 'broadcast' the vertex value via edges
• Semi-external: store vertex values in memory (see the sketch below)
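A sketch of the semi-external trick: keep one float of state per vertex in RAM and stream the edges from disk, so an update never needs to read neighbor state through edge values. The file layout, names, and PageRank-style constants are assumptions, not GraphChi's API.

#include <cstdint>
#include <cstdio>
#include <vector>

struct DiskEdge { uint32_t src, dst; };  // 8 bytes / edge in the on-disk stream

// One iteration of a PageRank-style update with O(V) state held in RAM:
// 'rank' and 'degree' fit in memory even when the edge file does not.
void semi_external_iteration(std::FILE* edge_file, std::vector<float>& rank,
                             const std::vector<uint32_t>& degree) {
    std::vector<float> next(rank.size(), 0.15f);
    std::rewind(edge_file);
    DiskEdge buf[4096];
    size_t n;
    while ((n = std::fread(buf, sizeof(DiskEdge), 4096, edge_file)) > 0) {
        for (size_t i = 0; i < n; ++i) {
            const DiskEdge& e = buf[i];
            if (degree[e.src] > 0)
                next[e.dst] += 0.85f * rank[e.src] / degree[e.src];  // neighbor state read from RAM
        }
    }
    rank.swap(next);
}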

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
- But not enough to store the whole graph

Graph working memory (PSW) disk

computational state

Computation 1 state / Computation 2 state

RAM

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
- Store billions of random-walk states in RAM

• Multiple Breadth-First Searches
- Analyze neighborhood sizes by starting hundreds of random BFSes

• Compute many different recommender algorithms in parallel (or with different parameterizations)
- See Mayank Mohta and Shu-Hao Yu's Master's project

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin) - UAI 2010
Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (-- same --) - VLDB 2012
GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch) - OSDI 2012
DrunkardMob: Billions of Random Walks on Just a PC - ACM RecSys 2013
Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch) - SEA 2014
GraphChi-DB: Simple Design for a Scalable Graph Database - on Just a PC (with C. Guestrin) - (submitted, arXiv)
Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin) - ICML 2011

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation -> GraphChi
• Extended PSW to design Partitioned Adjacency Lists for building a scalable graph database -> GraphChi-DB
• Proposed DrunkardMob for simulating billions of random walks in parallel
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms -> a new approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

| GraphChi (40 Mac Minis) | PowerGraph (64 EC2 cc1.4xlarge)
Investments | 67,320 $ | -
Operating costs:
Per node / hour | 0.03 $ | 1.30 $
Cluster / hour | 1.19 $ | 52.00 $
Daily | 28.56 $ | 1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions:
- Mac Mini: 85 W (typical servers: 500-1000 W)
- Most expensive US energy: 35c / kWh

Equal-throughput configurations (based on OSDI '12).

89

PSW for In-memory Computation

• External-memory setting
- Slow memory = hard disk / SSD
- Fast memory = RAM

• In-memory
- Slow = RAM
- Fast = CPU caches

[Diagram: CPU - small (fast) memory of size M - block transfers of size B - large (slow) memory of 'unlimited' size]

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

Ligra: Julian Shun, Guy Blelloch, PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
- But needs twice the amount of space
• Async / G-S helps with high-diameter graphs
• Some algorithms converge much better asynchronously
- Loopy BP, see Gonzalez et al. (2009)
- Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)
• Asynchronous is sometimes difficult to reason about and debug
• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in the Aggarwal-Vitter I/O model
- Worst-case only 2x best-case

• Intuition:
[Diagram: vertices 1..|V| split into interval(1), interval(2), ..., interval(P) with shard(1), shard(2), ..., shard(P); example edges v1, v2]

An edge inside one interval is loaded from disk only once / iteration.
An edge spanning intervals is loaded twice / iteration.

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise they would require too many iterations
• Per-iteration cost is not much affected
• Can we optimize partitioning?
- Could help thanks to Gauss-Seidel (faster convergence inside "groups"), topological sort
- Likely too expensive to do on a single PC

94

Graph Compression Would it help

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
- But they require graph partitioning, which requires a lot of memory
- Compression of large graphs can take days (personal communication)

• Compression is problematic for evolving graphs and associated data

• GraphChi can be used to compress graphs
- Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s / 2000s saw interest in object-oriented and graph databases
- GOOD, GraphDB, HyperGraphDB...
- Focus was on modeling; graph storage on top of a relational DB or key-value store

• RDF databases
- Most do not use graph storage but store triples as relations + use indexing

• Modern solutions have proposed graph-specific storage
- Neo4j: doubly linked list
- TurboGraph: adjacency list chopped into pages
- DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time 9 hours, FB's 12 hours
• GraphChi database about 250 GB, FB > 1.4 terabytes
- However, about 100 GB is explained by different variable data (payload) size

• Facebook: MySQL via JDBC; GraphChi: embedded
- But MySQL is native code, GraphChi-DB is Scala (JVM)

• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance vs. size of DB

[Chart: requests / sec (0 to 10,000) vs. number of nodes (1e7, 1e8, 2e8, 1e9)]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory extension [SSDAlloc, NSDI '11]
-> Too many small objects; need millions / sec

2. Compress the graph structure to fit into RAM [WebGraph framework]
-> Associated values do not compress well and are mutated

3. Cluster the graph and handle each cluster separately in RAM
-> Expensive: the number of inter-cluster edges is big

4. Caching of hot nodes
-> Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance.

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks -> parallel I/O

[Chart: seconds / iteration (0 to 3000) for Pagerank and Connected components with 1, 2, and 3 hard drives]

Experiment on a 16-core AMD server (from year 2007).

Bottlenecks

[Chart: runtime (0 to 2500 seconds) with 1, 2, and 4 threads, broken into Disk IO, Graph construction, and Exec. updates]

Connected Components on Mac Mini, SSD

• The cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
- Graph construction requires a lot of random access in RAM -> memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime in seconds (split into Loading and Computation) vs. number of threads (1, 2, 4) for Matrix Factorization (ALS, 0-160 s) and Connected Components (0-1000 s)]

104

In-memory vs Disk

105

Experiment Query latency

See the thesis for I/O cost analysis of in/out queries.

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB
- Sufficient to query for out-edges (see the sketch below)
- Parallelizes well: multi-out-edge-query

• Can be used for statistical graph analysis
- Sample induced neighborhoods, induced FoF neighborhoods from the graph
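A sketch of the induced-subgraph query as described above: one out-edge query per vertex in S, keeping only edges whose endpoint is also in S. The `out_neighbors` callback stands in for GraphChi-DB's actual query API.

#include <cstdint>
#include <functional>
#include <unordered_set>
#include <utility>
#include <vector>

using VertexId = uint64_t;
using Edge = std::pair<VertexId, VertexId>;

// Induced subgraph of vertex set S: only out-edge queries are needed, one per vertex,
// and the queries are independent, so they parallelize well.
std::vector<Edge> induced_subgraph(
    const std::vector<VertexId>& S,
    const std::function<std::vector<VertexId>(VertexId)>& out_neighbors) {
    std::unordered_set<VertexId> in_set(S.begin(), S.end());
    std::vector<Edge> edges;
    for (VertexId v : S)
        for (VertexId u : out_neighbors(v))
            if (in_set.count(u)) edges.emplace_back(v, u);
    return edges;
}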

Vertices / Nodes
• Vertices are partitioned similarly to edges
- Similar "data shards" for columns
• Lookup / update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
- A sparse structure could be supported

[Diagram: vertex id ranges 1..a, a+1..2a, ..., up to Max-id]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance shards
- Interval length is a constant a

[Diagram: internal IDs 0, 1, 2, ...; a, a+1, a+2, ...; 2a, 2a+1, ...; original IDs 0, 1, 2, ..., 255 and 256, 257, 258, ..., 511 are spread across the intervals]

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 state / Computation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. ... which is caused by a lack of good data available to academics: big graphs with metadata
- Industry co-operation? But problem with reproducibility
- Also, it is hard to ask good questions about graphs (especially with just structure)

3. Too much focus on performance. More important to enable "extracting value".

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems: "paging" from disk is costly. (This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
- Reverse thinking:

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk.

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM), Ref: O'Neil, Cheng et al. (1996)
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact: "Big Data" Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 24: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

24

Example Layout

Shard 1

Shards small enough to fit in memory balance size of shards

Shard in-edges for interval of vertices sorted by source-id

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Shard 2 Shard 3 Shard 4Shard 1

1 Load

2 Compute

3 Write

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 25: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

25

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Load all in-edgesin memory

Load subgraph for vertices 1100

What about out-edges Arranged in sequence in other shards

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Shard 1

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

26

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW Loading Sub-graph

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

1 Load

2 Compute

3 Write

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

bull Bulk-Synchronous Parallel (Jacobi iterations)ndash Updates see neighborsrsquo values from

previous iteration [Most systems are synchronous]

bull Asynchronous (Gauss-Seidel iterations)ndash Updates see most recent valuesndash GraphLab is asynchronous

30

Shard 1

Load all in-edgesin memory

Load subgraph for vertices 101700

Shard 2 Shard 3 Shard 4

PSW runs Gauss-Seidel

Vertices1100

Vertices101700

Vertices7011000

Vertices100110000

Out-edge blocksin memory

in-e

dges

for v

ertic

es 1

100

sort

ed b

y so

urce

_id

Fresh values in this ldquowindowrdquo from previous phase

31

Synchronous (Jacobi)

1 2 5 3 4

1 1 2 3 3

1 1 1 2 3

1 1 1 1 2

1 1 1 1 1

Bulk-Synchronous requires graph diameter ndashmany iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

Each vertex chooses the minimum label of its neighbors.

Initial labels:     1 2 4 3 5
After iteration 1:  1 1 1 3 3
After iteration 2:  1 1 1 1 1

Gauss-Seidel expected iterations on a random schedule on a chain graph:
= (N - 1) / (e - 1) ≈ 60% of synchronous.
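A toy Python sketch of this difference (not GraphChi code; the deterministic left-to-right edge order below is chosen only for brevity, whereas the (N - 1)/(e - 1) expectation on the slide is for a random schedule):

def jacobi_pass(labels, edges):
    prev = labels[:]                         # Jacobi: reads use the previous iteration's values
    for u, v in edges:
        labels[u] = min(labels[u], prev[v])
        labels[v] = min(labels[v], prev[u])

def gauss_seidel_pass(labels, edges):
    for u, v in edges:                       # Gauss-Seidel: reads always see the freshest values
        m = min(labels[u], labels[v])
        labels[u] = labels[v] = m

def passes_to_converge(step, labels, edges):
    count = 0
    while True:
        before = labels[:]
        step(labels, edges)
        count += 1
        if labels == before:                 # the final confirming pass is counted too
            return count

chain = [(i, i + 1) for i in range(4)]       # 5-vertex chain, as on the slides
print(passes_to_converge(jacobi_pass, [1, 2, 5, 3, 4], chain))        # about graph-diameter passes
print(passes_to_converge(gauss_seidel_pass, [1, 2, 4, 3, 5], chain))  # clearly fewer passes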

33

Label Propagation (side length = 100). Open theoretical question!

[Table; the three grid-like example graphs are shown as pictures on the slide]
                               Synchronous          PSW Gauss-Seidel (average, random schedule)
Graph 1                        199                  ~57
Graph 2                        100                  ~29
Graph 3                        298                  ~52        (iterations)
Natural graphs (web, social)   graph diameter - 1   ~0.6 x diameter

Joint work: Julian Shun

Chapter 6

34

PSW & External Memory Algorithms Research

• PSW is a new technique for implementing many fundamental graph algorithms.
  – Especially simple (compared to previous work) for directed graph problems: PSW handles both in- and out-edges.

• We propose a new graph contraction algorithm based on PSW:
  – Minimum Spanning Forest & Connected Components
  – … utilizing the Gauss-Seidel "acceleration".

Joint work: Julian Shun

SEA 2014; Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluation:
• HD vs. SSD
• Striping data across multiple hard drives
• Comparison to an in-memory version
• Bottlenecks analysis
• Effect of the number of shards
• Block size and performance

37

GraphChi

• C++ implementation: 8,000 lines of code
  – Java implementation also available
• Several optimizations to PSW (see paper).

Source code and examples: http://github.com/graphchi

38

Experiment Setting

• Mac Mini (Apple Inc.)
  – 8 GB RAM
  – 256 GB SSD, 1 TB hard drive
  – Intel Core i5, 2.5 GHz

• Experiment graphs:

Graph         Vertices   Edges   P (shards)   Preprocessing
live-journal  4.8M       69M     3            0.5 min
netflix       0.5M       99M     20           1 min
twitter-2010  42M        1.5B    20           2 min
uk-2007-05    106M       3.7B    40           31 min
uk-union      133M       5.4B    50           33 min
yahoo-web     1.4B       6.6B    50           37 min

39

Comparison to Existing Systems

Notes: comparison results do not include the time to transfer the data to the cluster, preprocessing, or the time to load the graph from disk. GraphChi computes asynchronously, while all but GraphLab compute synchronously.

[Bar charts; runtime in minutes]
• PageRank, Twitter-2010 (1.5B edges): Spark (50 machines) vs. GraphChi (Mac Mini)
• WebGraph Belief Propagation (U. Kang et al.), Yahoo-web (6.7B edges): Pegasus / Hadoop (100 machines) vs. GraphChi (Mac Mini)
• Matrix Factorization (Alternating Least Squares), Netflix (0.99B edges): GraphLab v1 (8 cores) vs. GraphChi (Mac Mini)
• Triangle Counting, twitter-2010 (1.5B edges): Hadoop (1,636 machines) vs. GraphChi (Mac Mini)

See the paper for more comparisons.

On a Mac Mini: GraphChi can solve as big problems as existing large-scale systems, with comparable performance.

40

PowerGraph Comparison

• PowerGraph / GraphLab 2 outperforms previous systems by a wide margin on natural graphs. (OSDI'12)
• With 64x more machines, 512x more CPUs:
  – Pagerank: 40x faster than GraphChi
  – Triangle counting: 30x faster than GraphChi

GraphChi has good performance / CPU.

41

In-memory Comparison

• Comparison to state-of-the-art: 5 iterations of Pagerank on Twitter (1.5B edges).
• Total runtime comparison to 1-shard GraphChi, with initial load + output write taken into account.

GraphChi (Mac Mini, SSD):                                           790 secs
Ligra (J. Shun, Blelloch), 40-core Intel E7-8870:                   15 secs
Ligra (J. Shun, Blelloch), 8-core Xeon 5550:                        230 s + preproc 144 s
PSW, in-mem version, 700 shards (see Appendix), 8-core Xeon 5550:   100 s + preproc 210 s

However, sometimes a better algorithm is available for in-memory than for external-memory / distributed settings.

42

Scalability: Input Size [SSD]

• Throughput: number of edges processed / second.

Conclusion: the throughput remains roughly constant when the graph size is increased.

GraphChi with a hard drive is ~2x slower than SSD (if the computational cost is low).

[Chart: PageRank throughput (Mac Mini, SSD); x-axis: graph size, 0 to 7e9 edges; y-axis: performance, up to 2.5e7 edges/sec; data points: domain, twitter-2010, uk-2007-05, uk-union, yahoo-web]

Paper: scalability of other applications.

43

GRAPHCHI-DB: New work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle Counting, Item-Item Similarity

PageRank, SALSA, HITS

Graph Contraction, Minimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected Components, Strongly Connected Components

Community Detection, Label Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief Propagation, Co-EM

Graph traversal

45

Research Questions

• What if there is a lot of metadata associated with edges and vertices?

• How to do graph queries efficiently while retaining computational capabilities?

• How to add edges efficiently to the graph?

Can we design a graph database based on GraphChi?

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problems:
• Poor performance with data >> memory
• No / weak support for analytical computation

2) Relational / key-value databases as graph storage

Problems:
• Large indices
• In-edge / out-edge dilemma
• No / weak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review: Edges in Shards

[Slide diagram] Shard 1, Shard 2, Shard 3, …, Shard P partition the vertex-ID range (0..100, 101..700, …, N). Edge = (src, dst); edges are partitioned by dst and sorted by source_id within each shard.

49

Shard Structure (Basic)

Source   Destination
1        8
1        193
1        76420
3        12
3        872
7        193
7        212
7        89139
…        …

(src, dst)

50

Shard Structure (Basic)

Edge-array (Destination): 8, 193, 76420, 12, 872, 193, 212, 89139, …

Pointer-array (Source → File offset):
1 → 0
3 → 3
7 → 5
…

Compressed Sparse Row (CSR)

Problem 1: How to find the in-edges of a vertex quickly?
Note: we know the shard, but the edges are in random order there.
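A minimal, self-contained Python sketch of this CSR-style shard layout and an out-edge lookup within one shard (illustrative only; the layout and names are not GraphChi-DB's actual on-disk format):

import bisect

pointer_array = [(1, 0), (3, 3), (7, 5)]     # (source_id, offset of its first edge), sorted by source
edge_array = [8, 193, 76420, 12, 872, 193, 212, 89139]   # destinations, grouped by source

def out_edges_in_shard(src):
    """Destinations of src's edges that are stored in this shard."""
    sources = [s for s, _ in pointer_array]
    i = bisect.bisect_left(sources, src)
    if i == len(sources) or sources[i] != src:
        return []                            # src has no edges in this shard
    start = pointer_array[i][1]
    end = pointer_array[i + 1][1] if i + 1 < len(pointer_array) else len(edge_array)
    return edge_array[start:end]

print(out_edges_in_shard(3))                 # [12, 872]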

51

PAL In-edge Linkage

[Slide diagram; same layout as the previous slide] Edge-array of destinations (8, 193, 76420, 12, 872, 193, 212, 89139, …) and pointer-array (Source → File offset: 1 → 0, 3 → 3, 7 → 5, …).

52

PAL In-edge Linkage

Edge-array (Destination, Link): (8, 3339), (193, 3), (76420, 1092), (12, 289), (872, 40), (193, 2002), (212, 12), (89139, 22), …

Pointer-array (Source → File offset): 1 → 0, 3 → 3, 7 → 5, …

+ Index to the first in-edge for each vertex in the interval.
Augmented linked list for in-edges.

Problem 2: How to find out-edges quickly?
Note: edges are sorted inside a shard, but partitioned across all shards.
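The in-edge links can be followed like a per-shard linked list. A small illustrative sketch (the offsets below are made up for the example, and the structure is a simplification of the slide's layout):

edge_rows = [(8, None), (193, 5), (76420, None), (12, None),
             (872, None), (193, None), (212, None), (89139, None)]   # (destination, next-in-offset)
first_in_edge = {8: 0, 193: 1, 76420: 2, 12: 3, 872: 4, 212: 6, 89139: 7}  # index per vertex

def in_edge_offsets(dst):
    """Yield the offsets of dst's in-edges inside this shard."""
    offset = first_in_edge.get(dst)
    while offset is not None:
        yield offset
        offset = edge_rows[offset][1]        # jump to the next in-edge of dst

print(list(in_edge_offsets(193)))            # [1, 5]: vertex 193 has two in-edges in this shard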

53

PAL: Out-edge Queries

Pointer-array (Source → File offset: 1 → 0, 3 → 3, 7 → 5, …) and edge-array (Destination, Next-in-offset): (8, 3339), (193, 3), (76420, 1092), (12, 289), (872, 40), (193, 2002), (212, 12), (89139, 22), …

Problem: the pointer-array can be big, O(V). Binary search on disk is slow.

Option 1: Sparse index (in memory); the figure shows sample index entries 256, 512, 1024, …
Option 2: Delta-coding with a unary code (Elias-Gamma). Completely in-memory.
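Option 2 can be made concrete in a few lines: encode the gaps between consecutive source IDs with Elias-Gamma codes so the whole pointer-array fits in RAM. A sketch (assumed encoding details; GraphChi-DB's exact scheme may differ):

def elias_gamma(n):
    """Elias-Gamma code of a positive integer n: (len(bin(n)) - 1) zeros, then bin(n)."""
    b = bin(n)[2:]
    return "0" * (len(b) - 1) + b

def encode_sources(sources):
    """Delta-encode a sorted list of source IDs; each gap is encoded as gap + 1 (> 0)."""
    bits, prev = [], 0
    for s in sources:
        bits.append(elias_gamma(s - prev + 1))
        prev = s
    return "".join(bits)

print(encode_sources([1, 3, 7]))             # small gaps -> only a few bits per entry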

54

Experiment: Indices

[Chart: median latency; Twitter graph, 1.5B edges]

55

Queries: I/O costs

In-edge query: only one shard.
Out-edge query: each shard that has edges.

Trade-off: more shards → better locality for in-edge queries, worse for out-edge queries.

Edge Data & Searches

[Slide diagram] Shard X: adjacency, plus separate columns for 'weight' [float], 'timestamp' [long], 'belief' (factor). Edge (i) has the same index in every column, so no foreign key is required to find edge data (fixed size).

Note: vertex values are stored similarly.

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

A merge requires loading the existing shard from disk; each edge will always be rewritten on a merge.

Does not scale: number of rewrites = data size / buffer size = O(E).

60

Log-Structured Merge-tree (LSM)
Ref: O'Neil, Cheng et al. (1996)

[Slide diagram] New edges go to the top shards with in-memory buffers (LEVEL 1, youngest; one per interval 1, 2, 3, 4, …, P-1, P). Downstream merges combine them into larger on-disk shards at LEVEL 2 (intervals 1–4, …, (P-4)–P) and LEVEL 3 (oldest; intervals 1–16, …).

In-edge query: one shard on each level.
Out-edge query: all shards.
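A toy, self-contained sketch of the log-structured-merge idea used here (illustrative only; real GraphChi-DB shards live on disk and are merged per interval):

class ToyLSM:
    def __init__(self, buffer_capacity=4, fanout=4):
        self.buffer, self.capacity, self.fanout = [], buffer_capacity, fanout
        self.levels = []                                  # levels[i]: list of sorted runs ("shards")

    def add_edge(self, src, dst):
        self.buffer.append((src, dst))                    # new edges go to the in-memory buffer
        if len(self.buffer) >= self.capacity:
            self._flush()

    def _flush(self):
        run, self.buffer = sorted(self.buffer), []
        level = 0
        while True:
            if len(self.levels) <= level:
                self.levels.append([])
            self.levels[level].append(run)
            if len(self.levels[level]) < self.fanout:     # level not full yet, stop here
                return
            run = sorted(e for r in self.levels[level] for e in r)   # downstream merge
            self.levels[level] = []
            level += 1

db = ToyLSM()
for i in range(40):
    db.add_edge(i % 7, i)                                 # edges are rewritten only when runs merge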

61

Experiment: Ingest

[Chart: edges ingested vs. time (seconds), comparing GraphChi-DB with LSM against GraphChi-No-LSM; x-axis 0 to 16,000 seconds, y-axis 0 to 1.5e9 edges]

62

Advantages of PAL

• Only sparse and implicit indices
  – The pointer-array usually fits in RAM with Elias-Gamma encoding. Small database size.
• Columnar data model
  – Load only the data you need.
  – Graph structure is separate from the data.
  – Property graph model.
• Great insertion throughput with LSM
  – The tree can be adjusted to match the workload.

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala
• Queries & Computation
• Online database

All experiments shown in this talk were done on a Mac Mini (8 GB, SSD).

Source code and examples: http://github.com/graphchi

65

Comparison: Database Size

Baseline: 4 + 4 bytes / edge.

[Bar chart: database file size for the twitter-2010 graph (1.5B edges); systems compared: MySQL (data + indices), Neo4j, GraphChi-DB, and the baseline; axis 0 to 70]

66

Comparison: Ingest

System                  Time to ingest 1.5B edges
GraphChi-DB (ONLINE)    1 hour 45 mins
Neo4j (batch)           45 hours
MySQL (batch)           3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes.

Comparison: Friends-of-Friends Query

See the thesis for the shortest-path comparison.

Latency percentiles over 100K random queries (milliseconds):

Small graph (68M edges), 50th percentile:   GraphChi-DB 0.379    Neo4j 0.127
Small graph (68M edges), 99th percentile:   GraphChi-DB 6.653    Neo4j 8.078
Big graph (1.5B edges), 50th percentile:    GraphChi-DB 22.4     Neo4j 75.98    MySQL 59
Big graph (1.5B edges), 99th percentile:    GraphChi-DB 1264     Neo4j 1631     MySQL 4776

68

LinkBench: Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
  – But only single-hop queries ("friends").
  – 8 different operations, mixed workload.
  – Best performance with 64 parallel threads.
• Each edge and vertex has:
  – Version, timestamp, type, random string payload.

                           GraphChi-DB (Mac Mini)   MySQL + FB patch, server, SSD-array, 144 GB RAM
Edge-update (95p)          22 ms                    25 ms
Edge-get-neighbors (95p)   18 ms                    9 ms
Avg throughput             2,487 req/s              11,029 req/s
Database size              350 GB                   1.4 TB

See full results in the thesis.

69

Summary of Experiments

• Efficient for mixed read/write workload
  – See the Facebook LinkBench experiments in the thesis.
  – LSM-tree trade-off: read performance (but can adjust).
• State-of-the-art performance for graphs that are much larger than RAM
  – Neo4j's linked-list data structure is good for RAM.
  – DEX performs poorly in practice.

More experiments in the thesis.

70

Discussion

Greater Impact / Hindsight

Future Research Questions

71

GREATER IMPACT

72

Impact: "Big Data" Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar).
  – Two major direct descendant papers in top conferences:
    • X-Stream: SOSP 2013
    • TurboGraph: KDD 2013
• Challenging the mainstream:
  – You can do a lot on just a PC; focus on the right data structures and computational models.

73

Impact Users

• GraphChi's versions (C++, Java, -DB) have gained a lot of users.
  – Currently ~50 unique visitors / day.
• Enables 'everyone' to tackle big graph problems.
  – Especially the recommender toolkit (by Danny Bickson) has been very popular.
  – Typical users: students, non-systems researchers, small companies…

74

Impact Users (cont)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research: it is too expensive.

Using GraphChi I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research. ()"

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for?

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models.

1. Changing the value of an edge (both in- and out-edges).
2. Computation processes the whole graph, or most of it, on each iteration.
3. Random access to all of a vertex's edges.
   – Vertex-centric vs. edge-centric.
4. Async / Gauss-Seidel execution.

[Diagram: an edge between vertices u and v]

77

GraphChi Not Good For

• Very large vertex state.
• Traversals and two-hop dependencies.
  – Or dynamic scheduling (such as Splash BP).
• High-diameter graphs, such as planar graphs.
  – Unless the computation itself has short-range interactions.
• Very large number of iterations.
  – Neural networks.
  – LDA with Collapsed Gibbs sampling.
• No support for implicit graph structure.

+ Single-PC performance is limited.

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle Counting, Item-Item Similarity

PageRank, SALSA, HITS

Graph Contraction, Minimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected Components, Strongly Connected Components

Community Detection, Label Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief Propagation, Co-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

• Distributed setting
  1. Distributed PSW (one shard / node)?
     1. PSW is inherently sequential.
     2. Low bandwidth in the Cloud.
  2. Co-operating GraphChi(-DB)'s connected with a Parameter Server.
• New graph programming models and tools
  – Vertex-centric programming is sometimes too local: for example, two-hop interactions and many traversals are cumbersome.
  – Abstractions for learning graph structure? Implicit graphs?
  – Hard to debug, especially async. Better tools needed.
• Graph-aware optimizations to GraphChi-DB
  – Buffer management.
  – Smart caching.
  – Learning the configuration.

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM.
  – Good: works with very little memory.
  – Bad: does not benefit from more memory.
• Simple trick: cache some data.
• Many graph algorithms have O(V) state.
  – The update function accesses neighbor vertex state:
    • Standard PSW: 'broadcast' the vertex value via edges.
    • Semi-external: store vertex values in memory.
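A toy, self-contained Pagerank-style example of the semi-external variant: the O(V) value array lives in RAM, so the update function reads neighbor values directly instead of broadcasting them over edges (illustrative only, not GraphChi's API):

out_neighbors = {0: [1, 2], 1: [2], 2: [0]}              # tiny example graph
in_neighbors = {0: [2], 1: [0], 2: [0, 1]}
num_vertices = 3

value = [1.0 / num_vertices] * num_vertices              # O(V) state held in memory

def update(v):
    s = sum(value[u] / len(out_neighbors[u]) for u in in_neighbors[v])
    value[v] = 0.15 / num_vertices + 0.85 * s            # reads the freshest in-memory values

for _ in range(20):                                      # Gauss-Seidel style sweeps
    for v in range(num_vertices):
        update(v)
print(value)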

82

Using RAM efficiently

• Assume that there is enough RAM to store many O(V) algorithm states in memory.
  – But not enough to store the whole graph.

[Diagram: the graph and the PSW working memory stay on disk; the computational state (Computation 1 state, Computation 2 state, …) lives in RAM.]

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
  – Store billions of random-walk states in RAM.
• Multiple Breadth-First Searches
  – Analyze neighborhood sizes by starting hundreds of random BFSes.
• Compute in parallel many different recommender algorithms (or the same with different parameterizations).
  – See Mayank Mohta and Shu-Hao Yu's Master's project.

84

CONCLUSION

85

Summary of Published Work

• GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin). UAI 2010.
• Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors). VLDB 2012.
• GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch). OSDI 2012.
• DrunkardMob: Billions of Random Walks on Just a PC. ACM RecSys 2013.
• Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch). SEA 2014.
• GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin). (submitted, arXiv)
• Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin). ICML 2011.

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation → GraphChi.
• Extended PSW to design Partitioned Adjacency Lists, to build a scalable graph database → GraphChi-DB.
• Proposed DrunkardMob for simulating billions of random walks in parallel.
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms → a new approach for external-memory graph algorithms research.

Thank You

87

ADDITIONAL SLIDES

88

Economics

                    GraphChi (40 Mac Minis)    PowerGraph (64 EC2 cc1.4xlarge)
Investments         67,320 $                   -
Operating costs:
  Per node / hour   0.03 $                     1.30 $
  Cluster / hour    1.19 $                     52.00 $
  Daily             28.56 $                    1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions:
- Mac Mini: 85 W (typical servers: 500-1000 W)
- Most expensive US energy: 35c / kWh
- Equal-throughput configurations (based on OSDI'12)

89

PSW for In-memory Computation

• External memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM
• In-memory
  – Slow = RAM
  – Fast = CPU caches

[Diagram: CPU, small (fast) memory of size M, large (slow) memory of 'unlimited' size, block transfers of size B]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

Julian Shun, Guy Blelloch, PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel.
  – But it needs twice the amount of space.
• Async / Gauss-Seidel helps with high-diameter graphs.
• Some algorithms converge much better asynchronously.
  – Loopy BP: see Gonzalez et al. (2009).
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989).
• Asynchronous is sometimes difficult to reason about and debug.
• Asynchronous can be used to implement BSP.

I/O Complexity

• See the paper for the theoretical analysis in the Aggarwal-Vitter I/O model.
  – Worst case is only 2x the best case.
• Intuition:

[Diagram: intervals interval(1), interval(2), …, interval(P) over vertex IDs 1..|V|, with shard(1), shard(2), …, shard(P); an example edge (v1, v2)]

An intra-interval edge is loaded from disk only once per iteration.
An edge spanning two intervals is loaded twice per iteration.

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise they would require too many iterations.
• The per-iteration cost is not much affected.
• Can we optimize partitioning?
  – Could help thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort.
  – Likely too expensive to do on a single PC.

94

Graph Compression: Would it help?

• Graph compression methods (e.g., Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social).
  – But they require graph partitioning, which requires a lot of memory.
  – Compression of large graphs can take days (personal communication).
• Compression is problematic for evolving graphs and associated data.
• GraphChi can be used to compress graphs.
  – Layered label propagation (Boldi et al., 2011).

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases.
  – GOOD, GraphDB, HyperGraphDB…
  – The focus was on modeling / graph storage on top of a relational DB or key-value store.
• RDF databases
  – Most do not use graph storage but store triples as relations + use indexing.
• Modern solutions have proposed graph-specific storage.
  – Neo4j: doubly linked list.
  – TurboGraph: adjacency list chopped into pages.
  – DEX: compressed bitmaps (details not clear).

96

LinkBench

Comparison to FB (cont)

• GraphChi load time: 9 hours; FB's: 12 hours.
• GraphChi database: about 250 GB; FB: > 1.4 terabytes.
  – However, about 100 GB is explained by different variable-data (payload) size.
• Facebook: MySQL via JDBC; GraphChi: embedded.
  – But MySQL is native code, GraphChi-DB is Scala (JVM).
• Important: CPU-bound bottleneck in sorting the results for high-degree vertices.

LinkBench: GraphChi-DB performance / Size of DB

[Chart: requests / sec (0 to 10,000) vs. number of nodes (1e7 to 1e9)]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory-extension. [SSDAlloc, NSDI'11]
   → Too many small objects; need millions / sec.
2. Compress the graph structure to fit into RAM. [WebGraph framework]
   → Associated values do not compress well and are mutated.
3. Cluster the graph and handle each cluster separately in RAM.
   → Expensive; the number of inter-cluster edges is big.
4. Caching of hot nodes.
   → Unpredictable performance.

Number of Shards

• If P is in the "dozens", there is not much effect on performance.

Multiple hard drives (RAID-ish)

• GraphChi supports striping shards to multiple disks → parallel I/O.

[Chart: seconds / iteration (0 to 3000) for Pagerank and Connected Components with 1, 2, and 3 hard drives]

Experiment on a 16-core AMD server (from year 2007).

Bottlenecks

[Chart: runtime in seconds (0 to 2500) for 1, 2, and 4 threads, split into Disk IO, Graph construction, and Exec updates; Connected Components on Mac Mini, SSD]

• The cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD.
  – Graph construction requires a lot of random access in RAM → memory bandwidth becomes a bottleneck.

Bottlenecks: Multicore

Experiment on a MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution.
• GraphChi saturates SSD I/O with 2 threads.

[Charts: runtime (seconds) vs. number of threads (1, 2, 4), split into Loading and Computation, for Matrix Factorization (ALS) and Connected Components]

104

In-memory vs Disk

105

Experiment: Query latency

See the thesis for the I/O cost analysis of in/out queries.

106

Example: Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S.
• Very fast in GraphChi-DB.
  – Sufficient to query for out-edges.
  – Parallelizes well: multi-out-edge-query.
• Can be used for statistical graph analysis.
  – Sample induced neighborhoods and induced FoF neighborhoods from the graph.
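A short sketch of an induced-subgraph query built from out-edge queries only (out_edges below is a hypothetical stand-in for the database's multi-out-edge query):

def induced_subgraph(S, out_edges):
    """All edges (u, v) with both endpoints in the vertex set S."""
    S = set(S)
    return [(u, v) for u in S for v in out_edges(u) if v in S]

adj = {1: [2, 3, 9], 2: [3], 3: [7], 7: [1]}             # in-memory stand-in for the DB
print(induced_subgraph([1, 2, 3], lambda u: adj.get(u, [])))   # [(1, 2), (1, 3), (2, 3)]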

Vertices / Nodes

• Vertices are partitioned similarly to edges.
  – Similar "data shards" for columns.
• Lookup / update of vertex data is O(1).
• No merge tree here: vertex files are "dense".
  – A sparse structure could be supported.

[Diagram: vertex-ID space split into intervals 1..a, a+1..2a, …, up to Max-id]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance the shards.
  – Interval length: constant a.

[Diagram: original IDs 0, 1, 2, …, 255 map to internal IDs 0, a, 2a, …, i.e. consecutive original IDs are spread across the intervals; original IDs 256, 257, 258, …, 511 map to 1, a+1, 2a+1, …]
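A small sketch of one mapping with this round-robin pattern (an assumed formula for illustration; the exact GraphChi-DB mapping may differ):

a = 1 << 20                 # interval length (assumed constant)
num_intervals = 256         # stride between consecutive original IDs (assumed)

def to_internal(orig_id):
    return (orig_id % num_intervals) * a + (orig_id // num_intervals)

def to_original(internal_id):
    return (internal_id % a) * num_intervals + (internal_id // a)

assert to_internal(0) == 0 and to_internal(1) == a and to_internal(256) == 1
assert to_original(to_internal(12345)) == 12345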

109

What if we have a Cluster?

[Diagram: on each node, the graph and PSW working memory stay on disk while the computational state (Computation 1 state, Computation 2 state, …) is held in RAM]

Trade latency for throughput!

110

Graph Computation: Research Challenges

1. Lack of truly challenging (benchmark) applications…
2. … which is caused by a lack of good data available to academics: big graphs with metadata.
   – Industry co-operation → but a problem with reproducibility.
   – Also, it is hard to ask good questions about graphs (especially with just the structure).
3. Too much focus on performance. More important to enable "extracting value".

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i in 1..numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

[Image: Twitter network visualization by Akshay Java, 2009]

Distributed graph systems: each hop across a partition boundary is costly.
Disk-based "single-machine" graph systems: "paging" from disk is costly. (This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk
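A runnable toy version of the same idea in Python (illustrative only; the real DrunkardMob keeps billions of walk states in RAM and processes one shard's interval at a time, as described in Chapter 5):

import random

def drunkardmob(adj, num_vertices, walks_per_vertex=2, num_steps=5, interval_size=1000):
    """adj: dict vertex -> out-neighbors. Returns the final vertex of every walk."""
    position = [v for v in range(num_vertices) for _ in range(walks_per_vertex)]   # walk state only
    for _ in range(num_steps):
        snapshot = position[:]                               # walk positions at the start of the step
        for start in range(0, num_vertices, interval_size):  # process one interval at a time
            end = min(start + interval_size, num_vertices)
            for w, v in enumerate(snapshot):
                if start <= v < end:                         # this walk is currently in the interval
                    neighbors = adj.get(v, [])
                    if neighbors:
                        position[w] = random.choice(neighbors)
    return position

ring = {v: [(v + 1) % 100] for v in range(100)}              # toy graph: a directed ring
print(drunkardmob(ring, 100, walks_per_vertex=1, num_steps=3, interval_size=25)[:5])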

Page 27: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

27

Parallel Sliding Windows

Only P large reads and writes for each interval

= P2 random accesses on one full pass

Works well on both SSD and magnetic hard disks

>

28

ldquoGAUSS-SEIDELrdquo ASYNCHRONOUS

How PSW computes

Chapter 6 + Appendix

Joint work Julian Shun

29

Synchronous vs Gauss-Seidel

• Bulk-Synchronous Parallel (Jacobi iterations)
  – Updates see neighbors' values from the previous iteration. [Most systems are synchronous]
• Asynchronous (Gauss-Seidel iterations)
  – Updates see the most recent values
  – GraphLab is asynchronous

30

PSW runs Gauss-Seidel

[Figure: Shards 1–4 hold the intervals Vertices 1..100, 101..700, 701..1000, 1001..10000; each shard stores the in-edges for its interval, sorted by source_id. To load the subgraph for vertices 101..700, all of their in-edges are loaded in memory from Shard 2, while the out-edge blocks are read in memory from the sliding windows over the other shards. Fresh values in this “window” come from the previous phase.]

31

Synchronous (Jacobi)

Initial labels:    1 2 5 3 4
After iteration 1: 1 1 2 3 3
After iteration 2: 1 1 1 2 3
After iteration 3: 1 1 1 1 2
After iteration 4: 1 1 1 1 1

Bulk-Synchronous requires on the order of graph-diameter many iterations to propagate the minimum label

Each vertex chooses minimum label of neighbor

32

PSW is Asynchronous (Gauss-Seidel)

Initial labels:    1 2 4 3 5
After iteration 1: 1 1 1 3 3
After iteration 2: 1 1 1 1 1

Gauss-Seidel expected iterations on a random schedule on a chain graph:

= (N − 1) / (e − 1) ≈ 60% of synchronous

Each vertex chooses minimum label of neighbor
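To make the two execution models concrete, here is a minimal, self-contained Java sketch of minimum-label propagation on the 5-vertex chain above. It is not GraphChi code; the chain, labels, and the left-to-right Gauss-Seidel sweep order are illustrative (the slide's asynchronous example uses a random schedule instead):

// Minimum-label propagation on a chain: Jacobi (synchronous) vs. Gauss-Seidel sweeps.
import java.util.Arrays;

public class LabelPropagationChain {
    // On a chain, vertex i is connected to i-1 and i+1.
    static int minOfNeighbors(int[] labels, int i) {
        int m = labels[i];
        if (i > 0) m = Math.min(m, labels[i - 1]);
        if (i < labels.length - 1) m = Math.min(m, labels[i + 1]);
        return m;
    }

    public static void main(String[] args) {
        int[] init = {1, 2, 5, 3, 4};

        // Jacobi: every update reads the labels of the previous iteration.
        int[] jacobi = init.clone();
        for (int iter = 1; iter <= 4; iter++) {
            int[] prev = jacobi.clone();
            for (int i = 0; i < jacobi.length; i++) jacobi[i] = minOfNeighbors(prev, i);
            System.out.println("Jacobi iter " + iter + ": " + Arrays.toString(jacobi));
        }

        // Gauss-Seidel: updates read the freshest values, in place (left-to-right sweep).
        int[] gs = init.clone();
        for (int iter = 1; iter <= 2; iter++) {
            for (int i = 0; i < gs.length; i++) gs[i] = minOfNeighbors(gs, i);
            System.out.println("Gauss-Seidel iter " + iter + ": " + Arrays.toString(gs));
        }
    }
}

Running it reproduces the synchronous rows above (4 iterations to converge), while the in-place sweep converges in far fewer passes.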

33

Open theoretical question

Label Propagation – Side length = 100

Synchronous vs. PSW Gauss-Seidel (average, random schedule), iterations:
  199  vs.  ~57
  100  vs.  ~29
  298  vs.  ~52
Natural graphs (web, social): graph diameter − 1  vs.  ~0.6 × diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

• PSW is a new technique for implementing many fundamental graph algorithms
  – Especially simple (compared to previous work) for directed graph problems: PSW handles both in- and out-edges
• We propose a new graph contraction algorithm based on PSW
  – Minimum Spanning Forest & Connected Components
• … utilizing the Gauss-Seidel “acceleration”

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluation:
• HD vs. SSD
• Striping data across multiple hard drives
• Comparison to an in-memory version
• Bottlenecks analysis
• Effect of the number of shards
• Block size and performance

37

GraphChi

• C++ implementation: 8,000 lines of code
  – Java implementation also available
• Several optimizations to PSW (see paper)

Source code and examples: http://github.com/graphchi

38

Experiment Setting

• Mac Mini (Apple Inc.)
  – 8 GB RAM
  – 256 GB SSD, 1 TB hard drive
  – Intel Core i5, 2.5 GHz
• Experiment graphs:

Graph        | Vertices | Edges | P (shards) | Preprocessing
live-journal | 4.8M     | 69M   | 3          | 0.5 min
netflix      | 0.5M     | 99M   | 20         | 1 min
twitter-2010 | 42M      | 1.5B  | 20         | 2 min
uk-2007-05   | 106M     | 3.7B  | 40         | 31 min
uk-union     | 133M     | 5.4B  | 50         | 33 min
yahoo-web    | 1.4B     | 6.6B  | 50         | 37 min

39

Comparison to Existing Systems

Notes: comparison results do not include the time to transfer the data to the cluster, preprocessing, or the time to load the graph from disk. GraphChi computes asynchronously, while all but GraphLab compute synchronously.

See the paper for more comparisons.

[Bar charts (runtime in minutes) comparing GraphChi (Mac Mini) against existing systems:
• PageRank – Twitter-2010 (1.5B edges): Spark (50 machines) vs. GraphChi
• WebGraph Belief Propagation (U. Kang et al.) – Yahoo-web (6.7B edges): Pegasus / Hadoop (100 machines) vs. GraphChi
• Matrix Factorization (Alt. Least Sqr.) – Netflix (0.99B edges): GraphLab v1 (8 cores) vs. GraphChi
• Triangle Counting – twitter-2010 (1.5B edges): Hadoop (1636 machines) vs. GraphChi]

On a Mac Mini, GraphChi can solve as big problems as existing large-scale systems. Comparable performance.

40

PowerGraph Comparison
• PowerGraph / GraphLab 2 outperforms previous systems by a wide margin on natural graphs
• With 64× more machines, 512× more CPUs:
  – Pagerank: 40x faster than GraphChi
  – Triangle counting: 30x faster than GraphChi

OSDI'12

GraphChi has good performance per CPU.

41

In-memory Comparison

• Total runtime comparison to 1-shard GraphChi, with initial load + output write taken into account
  – GraphChi (Mac Mini – SSD): 790 secs
  – Ligra (J. Shun, Blelloch), 40-core Intel E7-8870: 15 secs
• Comparison to state-of-the-art: 5 iterations of Pagerank, Twitter (1.5B edges)
• However, sometimes a better algorithm is available for in-memory than for external-memory / distributed:
  – Ligra (J. Shun, Blelloch), 8-core Xeon 5550: 230 s + preproc. 144 s
  – PSW – in-mem version, 700 shards (see Appendix), 8-core Xeon 5550: 100 s + preproc. 210 s

42

Scalability Input Size [SSD]

• Throughput: number of edges processed / second

Conclusion: the throughput remains roughly constant when the graph size is increased.

GraphChi with a hard drive is ~2x slower than with an SSD (if the computational cost is low).

[Chart: PageRank throughput (edges/sec) on Mac Mini SSD vs. graph size (up to ~6.6B edges), for the graphs domain, twitter-2010, uk-2007-05, uk-union, yahoo-web.]

Paper: scalability of other applications.

43

GRAPHCHI-DB – New work

44

GraphChi (Parallel Sliding Windows)
GraphChi-DB (Partitioned Adjacency Lists)
Online graph updates
Batch comp.
Evolving graph
Incremental comp.
DrunkardMob: Parallel Random walk simulation
GraphChi^2
Triangle Counting, Item-Item Similarity
PageRank, SALSA, HITS
Graph Contraction, Minimum Spanning Forest
k-Core
Graph Computation
Shortest Path
Friends-of-Friends
Induced Subgraphs
Neighborhood query
Edge and vertex properties
Link prediction
Weakly Connected Components, Strongly Connected Components
Community Detection, Label Propagation
Multi-BFS
Graph sampling
Graph Queries
Matrix Factorization
Loopy Belief Propagation, Co-EM
Graph traversal

45

Research Questions

• What if there is a lot of metadata associated with edges and vertices?
• How to do graph queries efficiently while retaining computational capabilities?
• How to add edges efficiently to the graph?

Can we design a graph database based on GraphChi?

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problems:
• Poor performance with data >> memory
• No/weak support for analytical computation

2) Relational / key-value databases as graph storage

Problems:
• Large indices
• In-edge / out-edge dilemma
• No/weak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

[Figure: Shards 1, 2, 3, …, P partition the destination-vertex ID range (0..100, 101..700, …, N); within each shard, edges are sorted by source_id.]

Edge = (src, dst). Partition by dst.

49

Shard Structure (Basic)

Source | Destination
1      | 8
1      | 193
1      | 76420
3      | 12
3      | 872
7      | 193
7      | 212
7      | 89139
…      | …

50

Shard Structure (Basic)

Pointer-array (Source → File offset): 1 → 0, 3 → 3, 7 → 5, …
Edge-array (Destination): 8, 193, 76420, 12, 872, 193, 212, 89139, …

Compressed Sparse Row (CSR)

Problem 1: How to find the in-edges of a vertex quickly?
Note: we know the shard, but its edges are in random order with respect to the destination (they are sorted by source).
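A minimal sketch of this CSR-style shard layout in Java (arrays in memory here for clarity; on disk the same lookup is an offset seek; class and field names are illustrative, not the actual GraphChi-DB code):

// Compressed Sparse Row style shard: pointer-array gives, for each source present
// in this shard, the offset of its first edge in the edge-array; a source's edges
// end where the next source's edges begin.
public class CsrShard {
    int[] pointerSrc    = {1, 3, 7};            // sorted source ids present in this shard
    int[] pointerOffset = {0, 3, 5};            // offsets into dst[]
    int[] dst = {8, 193, 76420, 12, 872, 193, 212, 89139};

    /** Destinations of the edges whose source is 'src' (within this shard). */
    int[] outEdgesInShard(int src) {
        int k = java.util.Arrays.binarySearch(pointerSrc, src);
        if (k < 0) return new int[0];                       // source has no edges here
        int start = pointerOffset[k];
        int end = (k + 1 < pointerOffset.length) ? pointerOffset[k + 1] : dst.length;
        return java.util.Arrays.copyOfRange(dst, start, end);
    }

    public static void main(String[] args) {
        // Source 3 -> [12, 872], matching the table above.
        System.out.println(java.util.Arrays.toString(new CsrShard().outEdgesInShard(3)));
    }
}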

51

PAL In-edge Linkage

Pointer-array (Source → File offset): 1 → 0, 3 → 3, 7 → 5, …
Edge-array (Destination): 8, 193, 76420, 12, 872, 193, 212, 89139, …

52

PAL In-edge Linkage

Edge-array (Destination, Link): (8, 3339), (193, 3), (76420, 1092), (12, 289), (872, 40), (193, 2002), (212, 12), (89139, 22), …
Pointer-array (Source → File offset): 1 → 0, 3 → 3, 7 → 5, …
+ Index to the first in-edge for each vertex in the interval

Augmented linked list for in-edges.

Problem 2: How to find out-edges quickly?
Note: sorted inside a shard, but partitioned across all shards.
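A hedged sketch of how the augmented linked list answers an in-edge query: the in-memory index gives the position of a vertex's first in-edge in this shard, and each edge's link field points to the next edge with the same destination. The names, the end-of-chain sentinel, and the per-edge src array (kept here only to make the example self-contained; PAL recovers sources from the pointer-array) are assumptions, not the actual implementation:

import java.util.ArrayList;
import java.util.List;

public class InEdgeChain {
    static final int END = -1;                 // assumed end-of-chain marker (illustrative)

    int[] dst  = {8, 193, 76420, 12, 872, 193, 212, 89139};
    int[] src  = {1, 1,   1,     3,  3,   7,   7,   7};
    int[] link = {END, 5, END, END, END, END, END, END};   // edge 1 (1->193) chains to edge 5 (7->193)

    /** Sources of all in-edges of one vertex in this shard, given its first in-edge position. */
    List<Integer> inEdgeSources(int firstEdgePos) {
        List<Integer> sources = new ArrayList<>();
        for (int pos = firstEdgePos; pos != END; pos = link[pos]) {
            sources.add(src[pos]);             // follow the chain of edges sharing one destination
        }
        return sources;
    }

    public static void main(String[] args) {
        // In-edges of vertex 193 start at edge position 1 (from the in-edge index) -> [1, 7].
        System.out.println(new InEdgeChain().inEdgeSources(1));
    }
}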

53

PAL Out-edge Queries

Pointer-array (Source → File offset): 1 → 0, 3 → 3, 7 → 5, …
Problem: it can be big – O(V). Binary search on disk is slow.
Edge-array (Destination, Next-in-offset): (8, 3339), (193, 3), (76420, 1092), (12, 289), (872, 40), (193, 2002), (212, 12), (89139, 22), …

Option 1: Sparse index (in-memory), e.g. entries 256, 512, 1024, …
Option 2: Delta-coding with unary code (Elias-Gamma). Completely in-memory.
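Option 2 works by gamma-coding the gaps of the pointer-array so that it stays fully in memory. The following is a generic Elias-Gamma encoder/decoder sketch in Java (standard textbook coding of positive integers – a unary length prefix followed by the binary value – not GraphChi-DB's exact bit layout; the example gaps are hypothetical):

import java.util.ArrayList;
import java.util.List;

public class EliasGamma {
    static void encode(int value, List<Boolean> bits) {          // value >= 1
        int nBits = 32 - Integer.numberOfLeadingZeros(value);    // bits needed for 'value'
        for (int i = 0; i < nBits - 1; i++) bits.add(false);      // unary prefix: nBits-1 zeros
        for (int i = nBits - 1; i >= 0; i--)                      // then the value itself, MSB first
            bits.add(((value >> i) & 1) == 1);
    }

    static int decode(List<Boolean> bits, int[] pos) {
        int zeros = 0;
        while (!bits.get(pos[0])) { zeros++; pos[0]++; }          // count the unary prefix
        int value = 0;
        for (int i = 0; i <= zeros; i++) {                        // read zeros+1 payload bits
            value = (value << 1) | (bits.get(pos[0]) ? 1 : 0);
            pos[0]++;
        }
        return value;
    }

    public static void main(String[] args) {
        List<Boolean> bits = new ArrayList<>();
        int[] gaps = {1, 3, 7, 2};                                // e.g. gaps between offsets
        for (int g : gaps) encode(g, bits);
        int[] pos = {0};
        for (int i = 0; i < gaps.length; i++) System.out.print(decode(bits, pos) + " ");  // 1 3 7 2
    }
}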

54

Experiment Indices

Median latency, Twitter graph (1.5B edges)

55

Queries: I/O costs
In-edge query: only one shard.
Out-edge query: each shard that has edges.

Trade-off: more shards → better locality for in-edge queries, worse for out-edge queries.

Edge Data & Searches

[Figure: Shard X stores the adjacency structure plus separate columns of edge data, e.g. 'weight' [float], 'timestamp' [long], 'belief' (factor); edge (i) in the adjacency file corresponds to record (i) in each column.]

No foreign key required to find edge data (fixed size).
Note: vertex values are stored similarly.
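Because each column stores fixed-size records, the attribute of edge i lives at a directly computable file offset. A tiny sketch of that arithmetic (names are illustrative):

import java.io.IOException;
import java.io.RandomAccessFile;

public class EdgeColumn {
    /** Reads the float 'weight' of edge number edgeIndex from a column file. */
    static float readWeight(RandomAccessFile weightColumn, long edgeIndex) throws IOException {
        weightColumn.seek(edgeIndex * Float.BYTES);   // record i starts at i * recordSize
        return weightColumn.readFloat();
    }
}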

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading the existing shard from disk. Each edge will always be rewritten on a merge.

Does not scale: the number of rewrites is data size / buffer size = O(E).

60

Log-Structured Merge-tree (LSM)
Ref: O'Neil, Cheng et al. (1996)

[Figure: new edges go to LEVEL 1 (youngest) – top shards with in-memory buffers, one per interval (interval 1, interval 2, …, interval P−1, interval P). Downstream merges push edges to on-disk shards on LEVEL 2 (e.g. intervals 1–4, …, (P−4)–P) and LEVEL 3 (oldest, e.g. intervals 1–16).]

In-edge query: one shard on each level.
Out-edge query: all shards.
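A heavily simplified Java sketch of the LSM idea used here: new edges land in a small in-memory buffer; when a level grows past its capacity, its edges are merged downstream into the next, larger level, which is kept sorted by destination like the on-disk shards. Sizes, types, and names are made up for illustration, not the GraphChi-DB shard format:

import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

public class EdgeLsm {
    static class Edge { final long src, dst; Edge(long s, long d) { src = s; dst = d; } }

    static final int LEVELS = 3;
    static final int[] CAPACITY = {4, 16, Integer.MAX_VALUE};   // toy sizes
    final List<List<Edge>> levels = new ArrayList<>();

    EdgeLsm() { for (int i = 0; i < LEVELS; i++) levels.add(new ArrayList<>()); }

    void addEdge(long src, long dst) {
        levels.get(0).add(new Edge(src, dst));                   // new edges go to the youngest level
        for (int k = 0; k < LEVELS - 1 && levels.get(k).size() > CAPACITY[k]; k++) {
            levels.get(k + 1).addAll(levels.get(k));             // downstream merge
            levels.get(k + 1).sort(Comparator.comparingLong((Edge e) -> e.dst));
            levels.get(k).clear();
        }
    }

    /** In-edge query touches one (sorted) collection per level. */
    List<Edge> inEdges(long vertex) {
        List<Edge> result = new ArrayList<>();
        for (List<Edge> level : levels)
            for (Edge e : level) if (e.dst == vertex) result.add(e);
        return result;
    }
}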

61

Experiment Ingest

[Chart: edges ingested (up to 1.5 × 10^9) vs. time in seconds (0–16,000), for GraphChi-DB with LSM and GraphChi-No-LSM.]

62

Advantages of PAL

• Only sparse and implicit indices
  – Pointer-array usually fits in RAM with Elias-Gamma → small database size
• Columnar data model
  – Load only the data you need
  – Graph structure is separate from data
  – Property graph model
• Great insertion throughput with LSM
  – Tree can be adjusted to match the workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala
• Queries & Computation
• Online database

All experiments shown in this talk were done on a Mac Mini (8 GB RAM, SSD).

Source code and examples: http://github.com/graphchi

65

Comparison Database Size

Baseline: 4 + 4 bytes / edge

[Bar chart: database file size (twitter-2010 graph, 1.5B edges) for MySQL (data + indices), Neo4j, GraphChi-DB, and the BASELINE; axis 0–70.]

66

Comparison Ingest

System               | Time to ingest 1.5B edges
GraphChi-DB (ONLINE) | 1 hour 45 mins
Neo4j (batch)        | 45 hours
MySQL (batch)        | 3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes.

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

[Bar charts: latency percentiles over 100K random friends-of-friends queries.
Small graph (68M edges), 50th percentile (ms): GraphChi-DB vs. Neo4j – 0.379 vs. 0.127.
Small graph (68M edges), 99th percentile (ms): 6.653 vs. 8.078.
Big graph (1.5B edges), 50th percentile: GraphChi-DB, Neo4j, MySQL – values as labeled: 224, 7598, 59 (axis 0–200 ms).
Big graph (1.5B edges), 99th percentile (ms): 1264, 1631, 4776 (axis 0–6000).]

68

LinkBench Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
  – But only single-hop queries (“friends”)
  – 8 different operations, mixed workload
  – Best performance with 64 parallel threads
• Each edge and vertex has:
  – Version, timestamp, type, random string payload

                          GraphChi-DB (Mac Mini) | MySQL + FB patch, server, SSD-array, 144 GB RAM
Edge-update (95p)         22 ms                  | 25 ms
Edge-get-neighbors (95p)  18 ms                  | 9 ms
Avg throughput            2,487 req/s            | 11,029 req/s
Database size             350 GB                 | 1.4 TB

See full results in the thesis

69

Summary of Experiments

• Efficient for mixed read/write workload
  – See Facebook LinkBench experiments in the thesis
  – LSM-tree trade-off: read performance (but can adjust)
• State-of-the-art performance for graphs that are much larger than RAM
  – Neo4J's linked-list data structure is good for RAM
  – DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater Impact / Hindsight

Future Research Questions

71

GREATER IMPACT

72

Impact: “Big Data” Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)
  – Two major direct descendant papers in top conferences:
    • X-Stream: SOSP 2013
    • TurboGraph: KDD 2013
• Challenging the mainstream
  – You can do a lot on just a PC: focus on the right data structures, computational models

73

Impact Users

• GraphChi's implementations (C++, Java, -DB) have gained a lot of users
  – Currently ~50 unique visitors / day
• Enables 'everyone' to tackle big graph problems
  – Especially the recommender toolkit (by Danny Bickson) has been very popular
  – Typical users: students, non-systems researchers, small companies…

74

Impact Users (cont)

“I work in a [EU country] public university. I can't use a distributed computing cluster for my research – it is too expensive.

Using GraphChi, I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research.”

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models
1. Changing the value of an edge (both in- and out-)
2. Computation: process the whole or most of the graph on each iteration
3. Random access to all of a vertex's edges
   – Vertex-centric vs. edge-centric
4. Async / Gauss-Seidel execution

[Figure: an edge u – v.]

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
  – Or dynamic scheduling (such as Splash BP)
• High-diameter graphs, such as planar graphs
  – Unless the computation itself has short-range interactions
• Very large number of iterations
  – Neural networks
  – LDA with Collapsed Gibbs sampling
• No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)
GraphChi-DB (Partitioned Adjacency Lists)
Online graph updates
Batch comp.
Evolving graph
Incremental comp.
DrunkardMob: Parallel Random walk simulation
GraphChi^2
Triangle Counting, Item-Item Similarity
PageRank, SALSA, HITS
Graph Contraction, Minimum Spanning Forest
k-Core
Graph Computation
Shortest Path
Friends-of-Friends
Induced Subgraphs
Neighborhood query
Edge and vertex properties
Link prediction
Weakly Connected Components, Strongly Connected Components
Community Detection, Label Propagation
Multi-BFS
Graph sampling
Graph Queries
Matrix Factorization
Loopy Belief Propagation, Co-EM
Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

• Distributed Setting
  1. Distributed PSW (one shard / node)
     1. PSW is inherently sequential
     2. Low bandwidth in the Cloud
  2. Co-operating GraphChi(-DB)'s connected with a Parameter Server
• New graph programming models and tools
  – Vertex-centric programming is sometimes too local: for example, two-hop interactions and many traversals are cumbersome
  – Abstractions for learning graph structure; implicit graphs
  – Hard to debug, especially async. Better tools needed
• Graph-aware optimizations to GraphChi-DB
  – Buffer management
  – Smart caching
  – Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
  – Good: works with very little memory
  – Bad: does not benefit from more memory
• Simple trick: cache some data
• Many graph algorithms have O(V) state
  – The update function accesses neighbor vertex state
    • Standard PSW: 'broadcast' vertex value via edges
    • Semi-external: store vertex values in memory – a minimal sketch follows below
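A sketch of the semi-external trick: keep one O(V) float array in RAM and let the disk-based PSW update function read and write neighbor values through it, instead of broadcasting them over edges. The update-function shape and names are illustrative, not GraphChi's actual API:

// Semi-external setting: the O(V) algorithm state (one float per vertex) lives in RAM,
// while PSW streams the edges from disk.
public class SemiExternalState {
    final float[] vertexValue;                      // fits in RAM even when the graph does not

    SemiExternalState(int numVertices) {
        vertexValue = new float[numVertices];
        java.util.Arrays.fill(vertexValue, 1.0f);
    }

    /** Called for each vertex while its edges are streamed from disk by PSW. */
    void update(int vertexId, int[] inNeighbors) {
        float sum = 0;
        for (int nb : inNeighbors) sum += vertexValue[nb];   // read neighbor state directly from RAM
        vertexValue[vertexId] = 0.15f + 0.85f * sum;         // write own state to RAM
        // (Degree normalization omitted -- the point is only where the state is stored:
        //  no 'broadcast' of vertex values over edges is needed.)
    }
}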

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
  – But not enough to store the whole graph

[Figure: the graph working memory (PSW) stays on disk, while the computational state (Computation 1 state, Computation 2 state, …) is kept in RAM.]

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
  – Store billions of random walk states in RAM
• Multiple Breadth-First Searches
  – Analyze neighborhood sizes by starting hundreds of random BFSes
• Compute in parallel many different recommender algorithms (or with different parameterizations)
  – See Mayank Mohta & Shu-Hao Yu's Master's project

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin) – UAI 2010
Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (-- same --) – VLDB 2012
GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch) – OSDI 2012
DrunkardMob: Billions of Random Walks on Just a PC – ACM RecSys 2013
Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch) – SEA 2014
GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin) – (submitted, arXiv)
Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin) – ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external memory graph computation → GraphChi
• Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database → GraphChi-DB
• Proposed DrunkardMob for simulating billions of random walks in parallel
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms → a new approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                    GraphChi (40 Mac Minis) | PowerGraph (64 EC2 cc1.4xlarge)
Investments         67,320 $                | –
Operating costs:
  Per node / hour   0.03 $                  | 1.30 $
  Cluster / hour    1.19 $                  | 52.00 $
  Daily             28.56 $                 | 1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions:
– Mac Mini: 85 W (typical servers: 500–1000 W)
– Most expensive US energy: 35c / kWh

Equal-throughput configurations (based on OSDI'12)
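A quick sanity check of the “about 56 days” figure from the table above (a worked arithmetic sketch; it assumes both setups run 24 hours / day):

\frac{\text{Mac Mini investment}}{\text{daily operating-cost savings}}
  = \frac{67{,}320\,\$}{1{,}248.00\,\$ - 28.56\,\$}
  = \frac{67{,}320}{1{,}219.44} \approx 55.2 \text{ days} \approx 56 \text{ days}.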

89

PSW for In-memory Computation

• External memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM
• In-memory
  – Slow = RAM
  – Fast = CPU caches

[Figure: external-memory model – CPU, small (fast) memory of size M, large (slow) memory of 'unlimited' size, block transfers of size B.]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

Ligra: Julian Shun, Guy Blelloch – PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space
• Async / G-S helps with high-diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP: see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)
• Asynchronous is sometimes difficult to reason about and debug
• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in the Aggarwal–Vitter I/O model
  – Worst case is only 2x the best case
• Intuition:

[Figure: vertices 1..|V| are split into interval(1), interval(2), …, interval(P), with corresponding shard(1), shard(2), …, shard(P); example edges v1, v2.]

An edge inside one interval is loaded from disk only once / iteration.
An edge spanning two intervals is loaded twice / iteration.
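The intuition can be compressed into a back-of-the-envelope bound. This is my hedged paraphrase of the analysis (E = number of edges, B = block size, P = number of shards), not a quotation from the paper:

\frac{2E}{B} \;\le\; \text{block transfers per iteration} \;\le\; \frac{4E}{B},
\qquad \text{plus } \Theta(P^2) \text{ non-sequential seeks,}

since an edge inside one interval is read and written once per iteration, while an edge spanning two intervals is read and written twice.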

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise they would require too many iterations
• Per-iteration cost is not much affected
• Can we optimize partitioning?
  – Could help thanks to Gauss-Seidel (faster convergence inside “groups”); topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression Would it help

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3–4 bits / edge (web), ~10 bits / edge (social)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)
• Compression is problematic for evolving graphs and associated data
• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases
  – GOOD, GraphDB, HyperGraphDB…
  – Focus was on modeling; graph storage on top of a relational DB or key-value store
• RDF databases
  – Most do not use graph storage but store triples as relations + use indexing
• Modern solutions have proposed graph-specific storage
  – Neo4j: doubly linked list
  – TurboGraph: adjacency list chopped into pages
  – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time: 9 hours; FB's: 12 hours
• GraphChi database: about 250 GB; FB: > 1.4 terabytes
  – However, about 100 GB is explained by different variable data (payload) size
• Facebook: MySQL via JDBC; GraphChi: embedded
  – But MySQL is native code, GraphChi-DB is Scala (JVM)
• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

[Chart: LinkBench requests / sec (0–10,000) vs. number of nodes (1.00E+07 to 1.00E+09).]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory-extension [SSDAlloc, NSDI'11] → Too many small objects; need millions / sec
2. Compress the graph structure to fit into RAM [WebGraph framework] → Associated values do not compress well and are mutated
3. Cluster the graph and handle each cluster separately in RAM → Expensive: the number of inter-cluster edges is big
4. Caching of hot nodes → Unpredictable performance

Number of Shards

• If P is in the “dozens”, there is not much effect on performance.

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks → parallel I/O

[Bar chart: seconds / iteration (0–3000) for Pagerank and Connected components, with 1, 2, and 3 hard drives.]

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

[Bar chart: runtime (0–2500 s) with 1, 2, and 4 threads, split into Disk IO, Graph construction, and Exec. updates; Connected Components on Mac Mini, SSD.]

• The cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM → memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

[Bar charts: runtime in seconds vs. number of threads (1, 2, 4), split into Loading and Computation, for Matrix Factorization (ALS) (0–160 s) and Connected Components (0–1000 s).]

104

In-memory vs Disk

105

Experiment Query latency

See the thesis for an I/O cost analysis of in/out queries.

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S
• Very fast in GraphChi-DB
  – Sufficient to query for out-edges
  – Parallelizes well: multi-out-edge-query
• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from the graph

Vertices / Nodes
• Vertices are partitioned similarly as edges
  – Similar “data shards” for columns
• Lookup / update of vertex data is O(1)
• No merge tree here. Vertex files are “dense”
  – A sparse structure could be supported

[Figure: vertex ID ranges 1..a, a+1..2a, …, up to Max-id.]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance shards
  – Interval length is a constant a

[Figure: example mapping striping original IDs 0, 1, 2, …, 255 and 256, 257, 258, …, 511 across the internal ID rows 0, 1, 2, …; a, a+1, a+2, …; 2a, 2a+1, …]

109

What if we have a Cluster

[Figure: the graph working memory (PSW) stays on disk; the computational state (Computation 1 state, Computation 2 state, …) is kept in RAM.]

Trade latency for throughput

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. … which is caused by a lack of good data available to academics: big graphs with metadata
   – Industry co-operation? But a problem with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just structure)
3. Too much focus on performance. More important to enable “extracting value”.

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

Distributed graph systems: each hop across a partition boundary is costly.
Disk-based “single-machine” graph systems (this talk): “paging” from disk is costly.

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm – reverse thinking:

forEach interval p:
    walkSnapshot = getWalksForInterval(p)
    forEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        forEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: only the current position of each walk needs to be stored.

Page 29: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

29

Synchronous vs. Gauss-Seidel

• Bulk-Synchronous Parallel (Jacobi iterations) – updates see neighbors' values from the previous iteration. [Most systems are synchronous.]

• Asynchronous (Gauss-Seidel iterations) – updates see the most recent values. GraphLab is asynchronous.

30

PSW runs Gauss-Seidel

[Figure: the four shards hold vertex intervals 1..100, 101..700, 701..1000, and 1001..10000. To load the subgraph for vertices 101..700, all of their in-edges (Shard 2, sorted by source_id) are loaded into memory, while out-edge blocks are read from a sliding window of each other shard. Fresh values in this "window" come from the previous phase.]

31

Synchronous (Jacobi)

Each vertex chooses the minimum label of its neighbors. Label values on a 5-vertex chain, iteration by iteration:

1 2 5 3 4
1 1 2 3 3
1 1 1 2 3
1 1 1 1 2
1 1 1 1 1

Bulk-Synchronous requires graph-diameter-many iterations to propagate the minimum label.

32

PSW is Asynchronous (Gauss-Seidel)

Each vertex chooses the minimum label of its neighbors. Label values on the chain, iteration by iteration:

1 2 4 3 5
1 1 1 3 3
1 1 1 1 1

Gauss-Seidel expected number of iterations on a random schedule on a chain graph:

= (N - 1) / (e - 1) ≈ 60% of synchronous
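The difference between the two schedules is easy to reproduce. Below is a small illustrative sketch (plain Python, not GraphChi code; the helper names are made up) that runs minimum-label propagation on the 5-vertex chain from the slides with a Jacobi schedule and with a left-to-right Gauss-Seidel sweep:

# Illustrative sketch (not GraphChi code): minimum-label propagation on a chain,
# comparing a Jacobi (bulk-synchronous) schedule with a Gauss-Seidel schedule.

def jacobi_min_label(labels, neighbors):
    """One bulk-synchronous iteration: every update reads the previous iteration's values."""
    old = list(labels)
    return [min([old[v]] + [old[u] for u in neighbors[v]]) for v in range(len(labels))]

def gauss_seidel_min_label(labels, neighbors, order):
    """One sweep in the given order: every update reads the most recent values."""
    cur = list(labels)
    for v in order:
        cur[v] = min([cur[v]] + [cur[u] for u in neighbors[v]])
    return cur

def iterations_until_fixed(step, labels):
    it = 0
    while True:
        nxt = step(labels)
        it += 1
        if nxt == labels:
            return it
        labels = nxt

# 5-vertex chain as on the slides
chain = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
init = [1, 2, 5, 3, 4]

sync_iters = iterations_until_fixed(lambda l: jacobi_min_label(l, chain), init)
async_iters = iterations_until_fixed(
    lambda l: gauss_seidel_min_label(l, chain, order=range(5)), init)
print(sync_iters, async_iters)   # the Gauss-Seidel sweep needs far fewer passes than Jacobi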

33

Open theoretical question

Label Propagation (side length = 100) – iterations, Synchronous vs. PSW Gauss-Seidel (average over random schedules):

  199 vs. ~57
  100 vs. ~29
  298 vs. ~52

Natural graphs (web, social): graph diameter - 1 vs. ~0.6 x diameter.

Joint work: Julian Shun. Chapter 6.

34

PSW & External Memory Algorithms Research

• PSW is a new technique for implementing many fundamental graph algorithms – especially simple (compared to previous work) for directed graph problems: PSW handles both in- and out-edges.

• We propose a new graph contraction algorithm based on PSW – Minimum-Spanning Forest & Connected Components –

• … utilizing the Gauss-Seidel "acceleration".

Joint work: Julian Shun

SEA 2014; Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluation:
• HD vs. SSD
• Striping data across multiple hard drives
• Comparison to an in-memory version
• Bottlenecks analysis
• Effect of the number of shards
• Block size and performance

37

GraphChi

• C++ implementation: 8,000 lines of code – a Java implementation is also available.
• Several optimizations to PSW (see paper).

Source code and examples: http://github.com/graphchi

38

Experiment Setting

• Mac Mini (Apple Inc.): 8 GB RAM; 256 GB SSD and a 1 TB hard drive; Intel Core i5, 2.5 GHz.

• Experiment graphs:

  Graph         Vertices   Edges   P (shards)   Preprocessing
  live-journal  4.8M       69M     3            0.5 min
  netflix       0.5M       99M     20           1 min
  twitter-2010  42M        1.5B    20           2 min
  uk-2007-05    106M       3.7B    40           31 min
  uk-union      133M       5.4B    50           33 min
  yahoo-web     1.4B       6.6B    50           37 min

39

Comparison to Existing Systems

Notes: the comparison results do not include the time to transfer the data to the cluster, preprocessing, or the time to load the graph from disk. GraphChi computes asynchronously, while all but GraphLab compute synchronously. See the paper for more comparisons.

[Charts: runtime in minutes, GraphChi on a Mac Mini vs. published numbers for large-scale systems:
  - PageRank, Twitter-2010 (1.5B edges): GraphChi (Mac Mini) vs. Spark (50 machines)
  - WebGraph Belief Propagation (U. Kang et al.), Yahoo-web (6.7B edges): GraphChi (Mac Mini) vs. Pegasus / Hadoop (100 machines)
  - Matrix Factorization (Alt. Least Sqr.), Netflix (0.99B edges): GraphChi (Mac Mini) vs. GraphLab v1 (8 cores)
  - Triangle Counting, twitter-2010 (1.5B edges): GraphChi (Mac Mini) vs. Hadoop (1,636 machines)]

On a Mac Mini, GraphChi can solve as big problems as existing large-scale systems, with comparable performance.

40

PowerGraph Comparison

• PowerGraph / GraphLab 2 outperforms previous systems by a wide margin on natural graphs. (OSDI '12)
• With 64x more machines, 512x more CPUs: Pagerank 40x faster than GraphChi; Triangle counting 30x faster than GraphChi.

GraphChi has good performance per CPU.

41

In-memory Comparison

• Comparison to state-of-the-art: 5 iterations of Pagerank on Twitter (1.5B edges).
• Total runtime comparison to 1-shard GraphChi, with initial load + output write taken into account:

GraphChi, Mac Mini – SSD: 790 secs
Ligra (J. Shun, Blelloch), 40-core Intel E7-8870: 15 secs

However, sometimes a better algorithm is available in-memory than for external-memory / distributed settings:

Ligra (J. Shun, Blelloch), 8-core Xeon 5550: 230 s + preproc 144 s
PSW – in-mem version, 700 shards (see Appendix), 8-core Xeon 5550: 100 s + preproc 210 s

42

Scalability Input Size [SSD]

• Throughput: number of edges processed / second.

Conclusion: the throughput remains roughly constant when the graph size is increased. GraphChi with a hard drive is ~2x slower than with an SSD (if the computational cost is low).

[Chart: PageRank throughput (edges/sec, up to ~2.5e7) on a Mac Mini with SSD, as a function of graph size (up to ~7e9 edges), for domain, twitter-2010, uk-2007-05, uk-union, and yahoo-web. The paper reports the scalability of other applications.]

43

GRAPHCHI-DB: New work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

• What if there is a lot of metadata associated with edges and vertices?

• How to do graph queries efficiently while retaining computational capabilities?

• How to add edges efficiently to the graph?

Can we design a graph database based on GraphChi?

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problems:
• Poor performance with data >> memory
• No/weak support for analytical computation

2) Relational / key-value databases used as graph storage

Problems:
• Large indices
• In-edge / out-edge dilemma
• No/weak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

[Figure: edges partitioned into Shards 1..P by destination-vertex interval (1..100, 101..700, ..., N); within each shard, edges are sorted by source_id.]

Edge = (src, dst). Partition by dst.

49

Shard Structure (Basic)

Source   Destination
1        8
1        193
1        76420
3        12
3        872
7        193
7        212
7        89139
…        …

50

Shard Structure (Basic)

Edge-array (destinations only): 8, 193, 76420, 12, 872, 193, 212, 89139, …

Pointer-array (source, file offset): (1, 0), (3, 3), (7, 5), …

Compressed Sparse Row (CSR)

Problem 1: How to find the in-edges of a vertex quickly? Note: we know the shard, but its edges are in random order of destination.
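As a concrete picture of this layout, here is a hedged sketch (plain Python; ShardCSR is an illustrative name, not GraphChi-DB's actual data structure) of a shard stored as a pointer-array plus edge-array, using the example values above:

# Illustrative sketch (not GraphChi-DB code): a shard stored as Compressed Sparse Row.
# The pointer-array maps each source vertex to the offset of its first edge in the
# edge-array; the out-edges of a source are the slice up to the next source's offset.
import bisect

class ShardCSR:
    def __init__(self, sources, offsets, destinations):
        # sources[i] has its out-edges (within this shard) stored at
        # destinations[offsets[i] : offsets[i + 1]]
        self.sources = sources            # e.g. [1, 3, 7]
        self.offsets = offsets + [len(destinations)]
        self.destinations = destinations  # e.g. [8, 193, 76420, 12, 872, 193, 212, 89139]

    def out_edges(self, src):
        i = bisect.bisect_left(self.sources, src)
        if i == len(self.sources) or self.sources[i] != src:
            return []                     # src has no out-edges in this shard
        return self.destinations[self.offsets[i]:self.offsets[i + 1]]

shard = ShardCSR([1, 3, 7], [0, 3, 5], [8, 193, 76420, 12, 872, 193, 212, 89139])
print(shard.out_edges(3))   # [12, 872]
print(shard.out_edges(7))   # [193, 212, 89139]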

51

PAL In-edge Linkage

Edge-array (destinations): 8, 193, 76420, 12, 872, 193, 212, 89139, …

Pointer-array (source, file offset): (1, 0), (3, 3), (7, 5), …

52

PAL In-edge Linkage

Edge-array with in-edge links (destination, link): (8, 3339), (193, 3), (76420, 1092), (12, 289), (872, 40), (193, 2002), (212, 12), (89139, 22), …

Pointer-array (source, file offset): (1, 0), (3, 3), (7, 5), …

+ Index to the first in-edge for each vertex in the interval.

Augmented linked list for in-edges.

Problem 2: How to find out-edges quickly? Note: edges are sorted inside a shard, but partitioned across all shards.

53

PAL Out-edge Queries

Pointer-array (source, file offset): (1, 0), (3, 3), (7, 5), …
Problem: it can be big – O(V) – and binary search on disk is slow.

Edge-array (destination, next-in-offset): (8, 3339), (193, 3), (76420, 1092), (12, 289), (872, 40), (193, 2002), (212, 12), (89139, 22), …

Option 1: Sparse index (in-memory), e.g. sampled entries at offsets 256, 512, 1024, …
Option 2: Delta-coding with a unary code (Elias-Gamma); completely in-memory.
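Option 2 keeps the pointer-array in memory by delta-coding the offsets with Elias-Gamma codes. A minimal sketch of that encoding (plain Python; hypothetical helper names, not the thesis implementation):

# Illustrative sketch (not GraphChi-DB code): Elias-Gamma coding of the gaps
# (deltas) of an increasing offset sequence, so the pointer index can be kept
# compactly in memory. encode_gamma / decode_gamma are hypothetical helper names.

def encode_gamma(n, out):
    """Append the Elias-Gamma code of a positive integer n to the bit list `out`."""
    bits = bin(n)[2:]                 # binary representation, e.g. 5 -> '101'
    out.extend([0] * (len(bits) - 1)) # unary length prefix: len-1 zeros
    out.extend(int(b) for b in bits)  # followed by the binary digits themselves

def decode_gamma(bits, pos):
    """Decode one Elias-Gamma value starting at `pos`; return (value, next_pos)."""
    zeros = 0
    while bits[pos] == 0:
        zeros += 1
        pos += 1
    value = 0
    for _ in range(zeros + 1):
        value = (value << 1) | bits[pos]
        pos += 1
    return value, pos

def compress_offsets(offsets):
    """Delta-code a strictly increasing sequence of file offsets (gaps must be >= 1)."""
    bits, prev = [], 0
    for off in offsets:
        encode_gamma(off - prev, bits)
        prev = off
    return bits

def decompress_offsets(bits, count):
    vals, prev, pos = [], 0, 0
    for _ in range(count):
        gap, pos = decode_gamma(bits, pos)
        prev += gap
        vals.append(prev)
    return vals

offsets = [3, 5, 6, 10, 25]           # toy pointer offsets
bits = compress_offsets(offsets)
assert decompress_offsets(bits, len(offsets)) == offsets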

54

Experiment Indices

Median latency, Twitter graph, 1.5B edges.

55

Queries: IO costs

In-edge query: only one shard.
Out-edge query: each shard that has edges.

Trade-off: more shards give better locality for in-edge queries, but worse for out-edge queries.

Edge Data & Searches

[Figure: Shard X's adjacency file plus separate edge-data columns, e.g. 'weight' [float], 'timestamp' [long], 'belief' (factor). Edge (i)'s data is found at the same index in each column.]

No foreign key is required to find edge data (records are fixed size).

Note: vertex values are stored similarly.

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading the existing shard from disk – each edge is rewritten on every merge.

Does not scale: number of rewrites ~ data size / buffer size = O(E).

60

Log-Structured Merge-tree (LSM)
Ref: O'Neil, Cheng et al. (1996)

[Figure: top shards with in-memory buffers form LEVEL 1 (youngest), one per interval 1..P; new edges enter there. Downstream merges push edges to LEVEL 2 shards covering intervals 1-4, ..., (P-4)-P, and to LEVEL 3 (oldest) shards covering intervals 1-16, ...]

In-edge query: one shard on each level.
Out-edge query: all shards.
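A toy sketch of the log-structured merging idea (plain Python; sizes and names are made up, and real shards of course live on disk and are partitioned by destination interval):

# Illustrative sketch (not GraphChi-DB code): log-structured merging of edge buffers.
# New edges go to an in-memory buffer; when it fills, it is written as a sorted run on
# level 1; when a level holds too many runs they are merged into one run on the next
# level, so each edge is rewritten only O(log E) times instead of O(E / buffer) times.

BUFFER_CAP = 4      # toy sizes; a real system buffers millions of edges
RUNS_PER_LEVEL = 2

class EdgeLSM:
    def __init__(self):
        self.buffer = []            # in-memory buffer of (src, dst) pairs
        self.levels = []            # levels[i] = list of sorted runs (lists of edges)

    def add_edge(self, src, dst):
        self.buffer.append((src, dst))
        if len(self.buffer) >= BUFFER_CAP:
            self._flush()

    def _flush(self):
        run = sorted(self.buffer)
        self.buffer = []
        level = 0
        while True:
            if len(self.levels) <= level:
                self.levels.append([])
            self.levels[level].append(run)
            if len(self.levels[level]) < RUNS_PER_LEVEL:
                return
            # merge all runs of this level into a single run for the next level
            run = sorted(e for r in self.levels[level] for e in r)
            self.levels[level] = []
            level += 1

    def out_edges(self, src):
        # an out-edge query has to consult the buffer and every run on every level
        hits = [d for s, d in self.buffer if s == src]
        for level in self.levels:
            for run in level:
                hits.extend(d for s, d in run if s == src)
        return hits

db = EdgeLSM()
for e in [(1, 8), (1, 193), (3, 12), (7, 193), (3, 872), (7, 212), (1, 76420), (7, 89139)]:
    db.add_edge(*e)
print(db.out_edges(7))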

61

Experiment Ingest

[Chart: number of edges ingested (0 to 1.5B) as a function of time (0 to 16,000 seconds), for GraphChi-DB with LSM vs. GraphChi-No-LSM.]

62

Advantages of PAL

• Only sparse and implicit indices – the pointer-array usually fits in RAM with Elias-Gamma coding. Small database size.

• Columnar data model – load only the data you need; the graph structure is separate from the data; property graph model.

• Great insertion throughput with LSM – the tree can be adjusted to match the workload.

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala.
• Queries & computation.
• Online database.

All experiments shown in this talk were done on a Mac Mini (8 GB RAM, SSD).

Source code and examples: http://github.com/graphchi

65

Comparison Database Size

Baseline: 4 + 4 bytes per edge.

[Chart: database file size (twitter-2010 graph, 1.5B edges) on a 0-70 GB scale, for MySQL (data + indices), Neo4j, GraphChi-DB, and the BASELINE.]

66

Comparison Ingest

System                   Time to ingest 1.5B edges
GraphChi-DB (online)     1 hour 45 mins
Neo4j (batch)            45 hours
MySQL (batch)            3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes.

Comparison Friends-of-Friends Query

See thesis for a shortest-path comparison.

Latency percentiles over 100K random friends-of-friends queries:

Small graph (68M edges), 50th percentile: GraphChi-DB 0.379 ms, Neo4j 0.127 ms.
Small graph (68M edges), 99th percentile: GraphChi-DB 6.653 ms, Neo4j 8.078 ms.
Big graph (1.5B edges), 50th percentile: GraphChi-DB, Neo4j, and MySQL compared on a 0-200 ms scale.
Big graph (1.5B edges), 99th percentile: GraphChi-DB 1,264 ms, Neo4j 1,631 ms, MySQL 4,776 ms.

68

LinkBench Online Graph DB Benchmark by Facebook

• Concurrent read/write workload – but only single-hop queries ("friends"); 8 different operations, mixed workload; best performance with 64 parallel threads.

• Each edge and vertex has: version, timestamp, type, random string payload.

                           GraphChi-DB (Mac Mini)   MySQL + FB patch (server: SSD array, 144 GB RAM)
Edge-update (95p)          22 ms                    25 ms
Edge-get-neighbors (95p)   18 ms                    9 ms
Avg throughput             2,487 req/s              11,029 req/s
Database size              350 GB                   1.4 TB

See full results in the thesis.

69

Summary of Experiments

• Efficient for a mixed read/write workload – see the Facebook LinkBench experiments in the thesis; the LSM-tree trades off read performance (but can be adjusted).

• State-of-the-art performance for graphs that are much larger than RAM – Neo4j's linked-list data structure is good for RAM; DEX performs poorly in practice.

More experiments in the thesis.

70

Discussion

Greater Impact / Hindsight

Future Research Questions

71

GREATER IMPACT

72

Impact: "Big Data" Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar) – two major direct-descendant papers in top conferences: X-Stream (SOSP 2013) and TurboGraph (KDD 2013).

• Challenging the mainstream: you can do a lot on just a PC – focus on the right data structures and computational models.

73

Impact Users

• GraphChi's implementations (C++, Java, -DB) have gained a lot of users – currently ~50 unique visitors / day.

• Enables 'everyone' to tackle big graph problems – especially the recommender toolkit (by Danny Bickson) has been very popular. Typical users: students, non-systems researchers, small companies…

74

Impact Users (cont)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research - it is too expensive.

Using GraphChi, I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research." (…)

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for?

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models.

1. Changing the value of an edge (both in- and out-edges).
2. Computation processes the whole graph, or most of it, on each iteration.
3. Random access to all of a vertex's edges – vertex-centric vs. edge-centric.
4. Async / Gauss-Seidel execution.

77

GraphChi: Not Good For

• Very large vertex state.
• Traversals and two-hop dependencies – or dynamic scheduling (such as Splash BP).
• High-diameter graphs, such as planar graphs – unless the computation itself has short-range interactions.
• A very large number of iterations – neural networks; LDA with collapsed Gibbs sampling.
• No support for implicit graph structure.

+ Single-PC performance is limited.

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

• Distributed setting:
  1. Distributed PSW (one shard / node)? But PSW is inherently sequential, and bandwidth in the Cloud is low.
  2. Co-operating GraphChi(-DB)'s connected with a Parameter Server.

• New graph programming models and tools – vertex-centric programming is sometimes too local: for example, two-hop interactions and many traversals are cumbersome. Abstractions for learning graph structure; implicit graphs. Hard to debug, especially async – better tools needed.

• Graph-aware optimizations to GraphChi-DB – buffer management, smart caching, learning the configuration.

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The IO performance of PSW is only weakly affected by the amount of RAM – good: it works with very little memory; bad: it does not benefit from more memory.

• Simple trick: cache some data.

• Many graph algorithms have O(V) state – the update function accesses neighbor vertex state:
  • Standard PSW: 'broadcast' the vertex value via edges.
  • Semi-external: store vertex values in memory.

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory – but not enough to store the whole graph.

[Figure: the graph and the PSW working memory stay on disk; the computational state (Computation 1 state, Computation 2 state, ...) is kept in RAM.]
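A hedged sketch of the semi-external pattern (plain Python, not GraphChi's API): the per-vertex state stays in an in-memory array while edges are only ever streamed in chunks, here for a simplified Pagerank:

# Illustrative sketch (not GraphChi code) of the semi-external idea: the O(V)
# computational state (one value per vertex) lives in RAM, while the edges are
# streamed from disk one chunk at a time. Names and the chunk source are hypothetical.

def pagerank_semi_external(num_vertices, edge_chunks, iterations=4, d=0.85):
    rank = [1.0 / num_vertices] * num_vertices        # O(V) state kept in memory
    out_degree = [0] * num_vertices
    for chunk in edge_chunks():                        # one streaming pass for degrees
        for src, dst in chunk:
            out_degree[src] += 1
    for _ in range(iterations):
        incoming = [0.0] * num_vertices
        for chunk in edge_chunks():                    # edges streamed, never all in RAM
            for src, dst in chunk:
                incoming[dst] += rank[src] / out_degree[src]
        rank = [(1 - d) / num_vertices + d * x for x in incoming]
    return rank

# Toy usage: a generator standing in for sequential reads of edge chunks from disk.
def edge_chunks():
    yield [(0, 1), (0, 2), (1, 2)]
    yield [(2, 0), (3, 2)]

print(pagerank_semi_external(4, edge_chunks))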

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5) – store billions of random-walk states in RAM.

• Multiple Breadth-First Searches – analyze neighborhood sizes by starting hundreds of random BFSes.

• Compute many different recommender algorithms (or different parameterizations) in parallel – see Mayank Mohta and Shu-Hao Yu's Master's project.

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin) – UAI 2010
Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors) – VLDB 2012
GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch) – OSDI 2012
DrunkardMob: Billions of Random Walks on Just a PC – ACM RecSys 2013
Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch) – SEA 2014
GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin) – submitted (arXiv)
Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin) – ICML 2011

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation → GraphChi.

• Extended PSW to design Partitioned Adjacency Lists, to build a scalable graph database → GraphChi-DB.

• Proposed DrunkardMob for simulating billions of random walks in parallel.

• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms → a new approach for external-memory graph algorithms research.

Thank You!

87

ADDITIONAL SLIDES

88

Economics

                   GraphChi (40 Mac Minis)   PowerGraph (64 EC2 cc1.4xlarge)
Investments        67,320 $                  -
Operating costs:
  Per node / hour  0.03 $                    1.30 $
  Cluster / hour   1.19 $                    52.00 $
  Daily            28.56 $                   1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions: Mac Mini 85 W (typical servers 500-1000 W); most expensive US energy, 35c / kWh. Equal-throughput configurations (based on OSDI '12).

89

PSW for In-memory Computation

• External-memory setting: slow memory = hard disk / SSD; fast memory = RAM.

• In-memory: slow = RAM; fast = CPU caches.

[Figure: the external-memory model – a small (fast) memory of size M, a large (slow) memory of 'unlimited' size, and block transfers of size B driven by the CPU.]

Does PSW help in the in-memory setting?

90

PSW for in-memory

(Appendix) PSW slightly outperforms Ligra – the fastest in-memory graph computation system (Julian Shun, Guy Blelloch, PPoPP 2013) – on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel – but needs twice the amount of space.
• Async / Gauss-Seidel helps with high-diameter graphs.
• Some algorithms converge much better asynchronously – Loopy BP, see Gonzalez et al. (2009); also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989).
• Asynchronous execution is sometimes difficult to reason about and debug.
• Asynchronous can be used to implement BSP.

IO Complexity

• See the paper for a theoretical analysis in Aggarwal and Vitter's IO model – the worst case is only 2x the best case.

• Intuition:

[Figure: intervals interval(1)..interval(P) with shards shard(1)..shard(P) over the vertex range 1..|V|. An edge whose endpoints lie in the same interval is loaded from disk only once per iteration; an edge spanning two intervals is loaded twice per iteration.]

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter – otherwise too many iterations would be required.

• The per-iteration cost is not much affected.

• Can we optimize partitioning? It could help, thanks to Gauss-Seidel (faster convergence inside "groups") and topological sort – but it is likely too expensive to do on a single PC.

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., the WebGraph framework) can compress edges to 3-4 bits / edge (web), ~10 bits / edge (social) – but they require graph partitioning, which needs a lot of memory, and compression of large graphs can take days (personal communication).

• Compression is problematic for evolving graphs and associated data.

• GraphChi can be used to compress graphs – layered label propagation (Boldi et al. 2011).

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases – GOOD, GraphDB, HyperGraphDB, … The focus was on modeling; graph storage was built on top of a relational DB or a key-value store.

• RDF databases – most do not use graph storage but store triples as relations and use indexing.

• Modern solutions have proposed graph-specific storage: Neo4j – doubly linked lists; TurboGraph – adjacency lists chopped into pages; DEX – compressed bitmaps (details not clear).

96

LinkBench

Comparison to FB (cont)

• GraphChi load time: 9 hours; FB's: 12 hours.
• GraphChi database is about 250 GB; FB's is > 1.4 terabytes – however, about 100 GB of the difference is explained by different variable-data (payload) sizes.
• Facebook accesses MySQL via JDBC, GraphChi-DB is embedded – but MySQL is native code while GraphChi-DB is Scala (JVM).
• Important: a CPU-bound bottleneck in sorting the results for high-degree vertices.

LinkBench: GraphChi-DB performance vs. size of DB

[Chart: requests / sec (0 to 10,000) as a function of the number of nodes (1e7 to 1e9).]

Why not even faster when everything is in RAM? The requests are computationally bound.

Possible Solutions

1. Use SSD as a memory-extension [SSDAlloc, NSDI'11] – but there are too many small objects; millions of accesses per second would be needed.

2. Compress the graph structure to fit into RAM [WebGraph framework] – but the associated values do not compress well and are mutated.

3. Cluster the graph and handle each cluster separately in RAM – expensive: the number of inter-cluster edges is big.

4. Caching of hot nodes – unpredictable performance.

Number of Shards

• If P is in the "dozens", there is not much effect on performance.

Multiple hard drives (RAID-ish)

• GraphChi supports striping shards to multiple disks → parallel IO.

[Chart: seconds per iteration for Pagerank and Connected Components with 1, 2, and 3 hard drives; experiment on a 16-core AMD server (from year 2007).]

Bottlenecks

[Chart: Connected Components on Mac Mini (SSD) – runtime with 1, 2, and 4 threads, split into disk IO, graph construction, and executing updates.]

• The cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSD – graph construction requires a lot of random access in RAM; memory bandwidth becomes a bottleneck.

Bottlenecks: Multicore

Experiment on a MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution.
• GraphChi saturates the SSD IO with 2 threads.

[Charts: runtime (seconds) with 1, 2, and 4 threads, split into loading and computation, for Matrix Factorization (ALS) and Connected Components.]

104

In-memory vs Disk

105

Experiment Query latency

See the thesis for an IO cost analysis of in/out queries.

106

Example: Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S.

• Very fast in GraphChi-DB – it is sufficient to query for out-edges, and it parallelizes well (multi-out-edge query). See the sketch below.

• Can be used for statistical graph analysis – sample induced neighborhoods and induced friends-of-friends neighborhoods from the graph.
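A minimal sketch of that query strategy (plain Python; out_edge_query stands in for the database's out-edge query and is not the actual GraphChi-DB API):

# Illustrative sketch (not the GraphChi-DB API): an induced-subgraph query built from
# out-edge queries only, as described above.

def induced_subgraph(S, out_edge_query):
    """Return the edges of the subgraph induced by the vertex set S."""
    S = set(S)
    edges = []
    for src in S:                          # one out-edge query per vertex in S
        for dst in out_edge_query(src):
            if dst in S:                   # keep only edges with both endpoints in S
                edges.append((src, dst))
    return edges

# Toy usage with an in-memory adjacency map standing in for the database:
adjacency = {1: [8, 193], 3: [12, 872], 7: [193, 212], 8: [3], 12: [7]}
print(induced_subgraph({1, 3, 7, 8, 12}, lambda v: adjacency.get(v, [])))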

Vertices / Nodes

• Vertices are partitioned similarly to edges – similar "data shards" for columns.

• Lookup/update of vertex data is O(1).

• No merge tree here: vertex files are "dense" – a sparse structure could be supported.

[Figure: the internal vertex-id range 1..Max-id split into intervals of length a: 1..a, a+1..2a, ...]
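A hedged sketch of why the lookup is O(1) (plain Python; the file name and record format are made up): with dense, fixed-size records, the internal id alone determines the byte offset:

# Illustrative sketch (not GraphChi-DB code): O(1) vertex-value lookup in a dense,
# fixed-size-record vertex file. The value of the vertex with internal id v lives at
# byte offset v * RECORD_SIZE, so no index or merge tree is needed.
import struct

RECORD_SIZE = 4                      # e.g. one float value per vertex

def write_vertex(f, internal_id, value):
    f.seek(internal_id * RECORD_SIZE)
    f.write(struct.pack("f", value))

def read_vertex(f, internal_id):
    f.seek(internal_id * RECORD_SIZE)
    return struct.unpack("f", f.read(RECORD_SIZE))[0]

with open("vertex_values.bin", "w+b") as f:
    f.truncate(1000 * RECORD_SIZE)   # dense file for internal ids 0..999
    write_vertex(f, 42, 3.5)
    print(read_vertex(f, 42))        # 3.5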

ID-mapping

• Vertex IDs are mapped to internal IDs to balance the shards – the interval length is a constant a.

[Figure: blocks of original IDs (0, 1, 2, ..., 255; 256, 257, 258, ..., 511; ...) are assigned across the internal-id intervals (0, 1, 2, ...; a, a+1, a+2, ...; 2a, 2a+1, ...), spreading the vertices evenly over the shards.]

109

What if we have a Cluster?

[Figure: as in the semi-external setting – graph and PSW working memory on disk, several computation states in RAM – replicated on each machine of the cluster.]

Trade latency for throughput.

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications…

2. …which is caused by the lack of good data available to academics: big graphs with metadata. Industry co-operation helps, but there is a problem with reproducibility; also, it is hard to ask good questions about graphs (especially with just the structure).

3. Too much focus on performance. More important to enable "extracting value".

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

[Figure: Twitter network visualization by Akshay Java, 2009.]

Distributed graph systems: each hop across a partition boundary is costly.
Disk-based "single-machine" graph systems: "paging" from disk is costly.
(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm – reverse thinking:

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk.
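A small illustrative sketch of the DrunkardMob idea (plain Python; load_subgraph and intervals are hypothetical stand-ins for GraphChi's interval loading, not the actual implementation): only each walk's current vertex is stored, walks are bucketed by interval, and every walk in an interval advances one hop while that interval's subgraph is in memory:

# Illustrative sketch (not the DrunkardMob implementation): instead of walking one walk
# at a time over a disk-resident graph, keep only each walk's current vertex, bucket the
# walks by vertex interval, and advance every walk in an interval while that interval's
# subgraph is loaded (one random hop per walk per pass).
import random
from collections import defaultdict

def advance_walks(walk_positions, intervals, load_subgraph, num_passes):
    """walk_positions: list where walk_positions[w] is walk w's current vertex."""
    for _ in range(num_passes):
        # group walks by the interval of their current vertex (the 'walk snapshot')
        by_interval = defaultdict(list)
        for w, v in enumerate(walk_positions):
            by_interval[intervals(v)].append(w)
        for p, walks_here in by_interval.items():
            adjacency = load_subgraph(p)          # one sequential load per interval
            for w in walks_here:
                v = walk_positions[w]
                neighbors = adjacency.get(v)
                if neighbors:
                    walk_positions[w] = random.choice(neighbors)
    return walk_positions

# Toy usage: 2 intervals over vertices 0..3, adjacency kept in memory for illustration.
graph = {0: [1, 2], 1: [3], 2: [0, 3], 3: [1]}
positions = [0, 1, 2, 3, 0, 2]                    # six walks, one current position each
advance_walks(positions,
              intervals=lambda v: 0 if v < 2 else 1,
              load_subgraph=lambda p: {v: ns for v, ns in graph.items()
                                       if (v < 2) == (p == 0)},
              num_passes=3)
print(positions)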

Page 31: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

31

Synchronous (Jacobi)

Each vertex chooses the minimum label of its neighbors. Label states on a 5-vertex chain, one row per synchronous iteration:

1 2 5 3 4
1 1 2 3 3
1 1 1 2 3
1 1 1 1 2
1 1 1 1 1

Bulk-Synchronous requires about as many iterations as the graph diameter to propagate the minimum label.

32

PSW is Asynchronous (Gauss-Seidel)

Each vertex chooses the minimum label of its neighbors. With in-place (Gauss-Seidel) updates the same chain converges in far fewer sweeps:

1 2 4 3 5
1 1 1 3 3
1 1 1 1 1

Gauss-Seidel expected iterations on a random schedule on a chain graph:
(N − 1) / (e − 1) ≈ 60% of synchronous.
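To make the Jacobi vs. Gauss-Seidel difference concrete, here is a small Python sketch (my own illustration, not code from the thesis) that counts sweeps of min-label propagation on a chain; the Gauss-Seidel variant uses a fresh random schedule each sweep, as in the formula above.

```python
import random

# Minimal sketch (not from the thesis): min-label propagation on an N-vertex
# chain. Jacobi reads a frozen copy of the labels; Gauss-Seidel updates in
# place, visiting vertices in a fresh random order on every sweep.
def sweeps_until_converged(n, gauss_seidel, seed=0):
    rng = random.Random(seed)
    labels = list(range(1, n + 1))            # minimum label sits at vertex 0
    sweeps = 0
    while any(x != 1 for x in labels):
        src = labels if gauss_seidel else list(labels)   # Jacobi: frozen copy
        order = list(range(n))
        if gauss_seidel:
            rng.shuffle(order)                 # random schedule, as on the slide
        for v in order:
            lo = min(src[max(v - 1, 0):v + 2])  # self + chain neighbors
            labels[v] = min(labels[v], lo)
        sweeps += 1
    return sweeps

print(sweeps_until_converged(1000, gauss_seidel=False))  # ~N-1 = 999 sweeps
print(sweeps_until_converged(1000, gauss_seidel=True))   # ~(N-1)/(e-1), around 580
```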

33

Open theoretical question

Label Propagation, side length = 100.

Iterations:                      Synchronous          PSW: Gauss-Seidel (average, random schedule)
                                 199                  ~57
                                 100                  ~29
                                 298                  ~52

Natural graphs (web, social):    graph diameter − 1   ~0.6 × diameter

Joint work: Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

• PSW is a new technique for implementing many fundamental graph algorithms
  – Especially simple (compared to previous work) for directed graph problems: PSW handles both in- and out-edges
• We propose a new graph contraction algorithm based on PSW
  – Minimum Spanning Forest & Connected Components
• … utilizing the Gauss-Seidel "acceleration"

Joint work: Julian Shun

SEA 2014; Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluation:
• HD vs. SSD
• Striping data across multiple hard drives
• Comparison to an in-memory version
• Bottlenecks analysis
• Effect of the number of shards
• Block size and performance

37

GraphChi

• C++ implementation: 8,000 lines of code
  – Java implementation also available
• Several optimizations to PSW (see paper)

Source code and examples: http://github.com/graphchi

38

Experiment Setting

• Mac Mini (Apple Inc.)
  – 8 GB RAM
  – 256 GB SSD, 1 TB hard drive
  – Intel Core i5, 2.5 GHz
• Experiment graphs:

Graph         Vertices   Edges   P (shards)   Preprocessing
live-journal  4.8M       69M     3            0.5 min
netflix       0.5M       99M     20           1 min
twitter-2010  42M        1.5B    20           2 min
uk-2007-05    106M       3.7B    40           31 min
uk-union      133M       5.4B    50           33 min
yahoo-web     1.4B       6.6B    50           37 min

39

Comparison to Existing Systems

Notes: comparison results do not include time to transfer the data to the cluster, preprocessing, or the time to load the graph from disk. GraphChi computes asynchronously, while all but GraphLab compute synchronously.

See the paper for more comparisons.

[Bar charts, runtime in minutes, GraphChi (Mac Mini) vs. existing systems:
• PageRank on Twitter-2010 (1.5B edges): GraphChi (Mac Mini) vs. Spark (50 machines)
• WebGraph Belief Propagation (U. Kang et al.) on Yahoo-web (6.7B edges): GraphChi (Mac Mini) vs. Pegasus / Hadoop (100 machines)
• Matrix Factorization (Alt. Least Sqr.) on Netflix (0.99B edges): GraphChi (Mac Mini) vs. GraphLab v1 (8 cores)
• Triangle Counting on twitter-2010 (1.5B edges): GraphChi (Mac Mini) vs. Hadoop (1636 machines)]

On a Mac Mini: GraphChi can solve as big problems as existing large-scale systems. Comparable performance.

40

PowerGraph Comparison

• PowerGraph / GraphLab 2 outperforms previous systems by a wide margin on natural graphs (OSDI'12)
• With 64x more machines, 512x more CPUs:
  – Pagerank: 40x faster than GraphChi
  – Triangle counting: 30x faster than GraphChi

GraphChi has good performance / CPU.

41

In-memory Comparison

• Comparison to state-of-the-art: 5 iterations of Pagerank on Twitter (1.5B edges)
• Total runtime comparison; for 1-shard GraphChi, initial load + output write taken into account

GraphChi, Mac Mini (SSD): 790 secs
Ligra (J. Shun, Blelloch), 40-core Intel E7-8870: 15 secs
Ligra (J. Shun, Blelloch), 8-core Xeon 5550: 230 s + preproc 144 s
PSW, in-memory version, 700 shards (see Appendix), 8-core Xeon 5550: 100 s + preproc 210 s

However, sometimes a better algorithm is available for in-memory computation than for external-memory / distributed computation.

42

Scalability Input Size [SSD]

• Throughput: number of edges processed / second

[Plot: PageRank throughput on Mac Mini (SSD); x-axis graph size 0–7x10^9 edges, y-axis throughput up to 2.5x10^7 edges/sec; graphs: domain, twitter-2010, uk-2007-05, uk-union, yahoo-web]

Conclusion: the throughput remains roughly constant when the graph size is increased.

GraphChi with a hard drive is ~2x slower than with an SSD (if the computational cost is low).

Paper: scalability of other applications.

43

GRAPHCHI-DB

New work

44

[Diagram: GraphChi (Parallel Sliding Windows) and GraphChi-DB (Partitioned Adjacency Lists); online graph updates, batch comp., evolving graph, incremental comp.; DrunkardMob: parallel random walk simulation; GraphChi^2. Applications span Graph Computation and Graph Queries: Triangle Counting, Item-Item Similarity, PageRank, SALSA, HITS, Graph Contraction, Minimum Spanning Forest, k-Core, Shortest Path, Friends-of-Friends, Induced Subgraphs, Neighborhood query, Edge and vertex properties, Link prediction, Weakly / Strongly Connected Components, Community Detection, Label Propagation, Multi-BFS, Graph sampling, Matrix Factorization, Loopy Belief Propagation, Co-EM, Graph traversal]

45

Research Questions

• What if there is a lot of metadata associated with edges and vertices?
• How to do graph queries efficiently while retaining computational capabilities?
• How to add edges efficiently to the graph?

Can we design a graph database based on GraphChi?

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problems:
• Poor performance with data >> memory
• No/weak support for analytical computation

2) Relational / key-value databases as graph storage

Problems:
• Large indices
• In-edge / out-edge dilemma
• No/weak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

[Diagram: Shards 1..P partition the destination-vertex ID range (0..100, 101..700, …, N); within each shard, edges are sorted by source_id]

Edge = (src, dst). Partition by dst.

49

Shard Structure (Basic)

Source | Destination (src, dst)
1      | 8
1      | 193
1      | 76420
3      | 12
3      | 872
7      | 193
7      | 212
7      | 89139
…      | …

50

Shard Structure (Basic)

Pointer-array (Source → file offset):
Source | File offset
1      | 0
3      | 3
7      | 5
…      | …

Edge-array (Destination): 8, 193, 76420, 12, 872, 193, 212, 89139, …

Compressed Sparse Row (CSR).

Problem 1: How to find the in-edges of a vertex quickly?
Note: we know the shard, but its edges are in random order with respect to destination.
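A hedged illustration of the basic CSR lookup inside one shard, using the example values above (`pointers` and `edge_dst` are in-memory stand-ins for the on-disk structures):

```python
import bisect

# Hedged sketch (not the GraphChi-DB code): out-edges of a source vertex
# inside ONE shard stored in CSR form. `pointers` is the pointer-array of
# (source, first_edge_offset) pairs sorted by source; `edge_dst` stands in
# for the on-disk edge-array of destination IDs.
pointers = [(1, 0), (3, 3), (7, 5)]
edge_dst = [8, 193, 76420, 12, 872, 193, 212, 89139]

def out_edges_in_shard(src):
    sources = [s for s, _ in pointers]
    i = bisect.bisect_left(sources, src)
    if i == len(sources) or sources[i] != src:
        return []                                  # no out-edges in this shard
    start = pointers[i][1]
    end = pointers[i + 1][1] if i + 1 < len(pointers) else len(edge_dst)
    return edge_dst[start:end]                     # one contiguous, sequential read

print(out_edges_in_shard(3))   # [12, 872]
print(out_edges_in_shard(7))   # [193, 212, 89139]
```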

51

PAL In-edge Linkage

Pointer-array (Source → file offset): (1, 0), (3, 3), (7, 5), …
Edge-array (Destination): 8, 193, 76420, 12, 872, 193, 212, 89139, …

52

PAL In-edge Linkage

Edge-array, augmented with a link to the next in-edge of the same destination:

Destination | Link (next-in-offset)
8           | 3339
193         | 3
76420       | 1092
12          | 289
872         | 40
193         | 2002
212         | 12
89139       | 22
…           | …

Pointer-array (Source → file offset): (1, 0), (3, 3), (7, 5), …

+ Index to the first in-edge of each vertex in the interval.

Augmented linked list for in-edges.

Problem 2: How to find out-edges quickly?
Note: sorted inside a shard, but partitioned across all shards.
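A hedged sketch of how the augmented links can be followed within the single shard responsible for a destination vertex (names and the end-of-chain sentinel are my assumptions, not the actual implementation):

```python
# `first_in[dst]` gives the edge-array offset of the first in-edge of `dst`
# in this shard; each edge-array entry is (destination, next_in_offset), and
# a negative next_in_offset (an assumption here) ends the chain.
END = -1

def source_of(offset, pointers):
    # The source of an edge is implicit: it is the last entry of the sorted
    # pointer-array whose first-edge offset is <= the edge's offset.
    src = None
    for s, first in pointers:
        if first <= offset:
            src = s
        else:
            break
    return src

def in_edges(dst, first_in, edge_array, pointers):
    sources = []
    off = first_in.get(dst, END)
    while off != END:
        sources.append(source_of(off, pointers))
        _, off = edge_array[off]          # follow the "Link" column
    return sources
```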

53

PAL Out-edge Queries

Pointer-array (Source → file offset): (1, 0), (3, 3), (7, 5), …
Edge-array (Destination, next-in-offset): (8, 3339), (193, 3), (76420, 1092), (12, 289), (872, 40), (193, 2002), (212, 12), (89139, 22), …

Problem: the pointer-array can be big -- O(V). Binary search on disk is slow.

Option 1: Sparse index (in-memory), e.g. one pointer-array entry per block (offsets 256, 512, 1024, … in the diagram).
Option 2: Delta-coding with unary code (Elias-Gamma). Completely in-memory.
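For Option 2, a hedged toy sketch of the Elias-Gamma idea (my own illustration, not the thesis code): store the gaps between consecutive source IDs, each encoded as a unary length prefix followed by the binary value.

```python
# Elias-Gamma coding of the gaps of a sorted pointer-array, so that the whole
# index fits in memory. Gaps are >= 1 because source IDs strictly increase.
def gamma_encode(numbers):
    bits = []
    for n in numbers:
        b = bin(n)[2:]                           # binary, leading '1' included
        bits.append("0" * (len(b) - 1) + b)      # unary length prefix + value
    return "".join(bits)

def gamma_decode(bits):
    out, i = [], 0
    while i < len(bits):
        zeros = 0
        while bits[i] == "0":
            zeros += 1
            i += 1
        out.append(int(bits[i:i + zeros + 1], 2))
        i += zeros + 1
    return out

sources = [1, 3, 7, 12, 40]
gaps = [sources[0]] + [b - a for a, b in zip(sources, sources[1:])]
code = gamma_encode(gaps)
assert gamma_decode(code) == gaps
```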

54

Experiment Indices

Median latency, Twitter graph (1.5B edges).

55

Queries: IO costs

In-edge query: only one shard.
Out-edge query: each shard that has edges.

Trade-off: more shards means better locality for in-edge queries, worse for out-edge queries.

Edge Data & Searches

[Diagram: Shard X adjacency file with parallel column files 'weight' [float], 'timestamp' [long], 'belief' (factor); edge (i) sits at the same position i in every column]

No foreign key is required to find edge data (fixed size).

Note: vertex values are stored similarly.
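A hedged sketch of why no foreign key is needed: with fixed-width columns, edge i's data is located by arithmetic alone (file names and column widths below are illustrative assumptions, not the real layout):

```python
import struct

COLUMNS = {"weight": ("<f", 4), "timestamp": ("<q", 8)}  # struct format, bytes/row

def read_edge_value(shard_dir, column, edge_index):
    fmt, width = COLUMNS[column]
    with open(f"{shard_dir}/edata_{column}.bin", "rb") as f:
        f.seek(edge_index * width)        # fixed size => direct seek, no index
        return struct.unpack(fmt, f.read(width))[0]
```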

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading the existing shard from disk; every edge is rewritten on every merge.

Does not scale: each edge is rewritten about (data size / buffer size) = O(|E|) times.

60

Log-Structured Merge-tree (LSM)
Ref: O'Neil, Cheng et al. (1996)

[Diagram:
LEVEL 1 (youngest): top shards with in-memory buffers, one per interval (interval 1, interval 2, …, interval P); new edges arrive here.
LEVEL 2: on-disk shards, each covering a group of intervals (intervals 1--4, …, intervals (P-4)--P).
LEVEL 3 (oldest): on-disk shards covering still larger groups (intervals 1--16, …).
Downstream merge moves data from a full level into the next, larger level.]

In-edge query: one shard on each level.
Out-edge query: all shards.
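A hedged, in-memory toy of the LSM behavior sketched above (buffer capacity and fan-out are illustrative assumptions; the real system flushes sorted runs to disk shards):

```python
import heapq

# New edges go to an in-memory buffer; full buffers are flushed as sorted runs
# into level 1, and a level that exceeds its capacity is merged downstream.
class EdgeLSM:
    def __init__(self, buffer_cap=4, level_fanout=4):
        self.buffer, self.buffer_cap = [], buffer_cap
        self.levels = [[] for _ in range(3)]     # each level: list of sorted runs
        self.fanout = level_fanout

    def add_edge(self, src, dst):
        self.buffer.append((dst, src))           # shards are ordered by destination
        if len(self.buffer) >= self.buffer_cap:
            self._flush()

    def _flush(self):
        self.levels[0].append(sorted(self.buffer))
        self.buffer = []
        for lvl in range(len(self.levels) - 1):
            if len(self.levels[lvl]) > self.fanout:          # downstream merge
                merged = list(heapq.merge(*self.levels[lvl]))
                self.levels[lvl] = []
                self.levels[lvl + 1].append(merged)
```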

61

Experiment Ingest

[Plot: edges ingested vs. time (seconds); GraphChi-DB with LSM vs. GraphChi-No-LSM; x-axis 0–16,000 seconds, y-axis 0–1.5B edges]

62

Advantages of PAL

• Only sparse and implicit indices
  – Pointer-array usually fits in RAM with Elias-Gamma coding → small database size
• Columnar data model
  – Load only the data you need
  – Graph structure is separate from data
  – Property graph model
• Great insertion throughput with LSM
  – Tree can be adjusted to match the workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala
• Queries & computation
• Online database

All experiments shown in this talk were done on a Mac Mini (8 GB RAM, SSD).

Source code and examples: http://github.com/graphchi

65

Comparison Database Size

[Bar chart: database file size for the twitter-2010 graph (1.5B edges); compared: BASELINE (4 + 4 bytes / edge), GraphChi-DB, Neo4j, MySQL (data + indices); axis 0–70]

66

Comparison Ingest

System                 Time to ingest 1.5B edges
GraphChi-DB (ONLINE)   1 hour 45 mins
Neo4j (batch)          45 hours
MySQL (batch)          3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes.

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

[Bar charts: friends-of-friends query latency in milliseconds, percentiles over 100K random queries.
Small graph (68M edges), 50th percentile: GraphChi-DB 0.379 ms, Neo4j 0.127 ms.
Small graph (68M edges), 99th percentile: GraphChi-DB 6.653 ms, Neo4j 8.078 ms.
Big graph (1.5B edges), 50th percentile: GraphChi-DB 22.4 ms, Neo4j 75.98 ms, MySQL 5.9 ms.
Big graph (1.5B edges), 99th percentile: GraphChi-DB 1264 ms, Neo4j 1631 ms, MySQL 4776 ms.]

68

LinkBench Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
  – But only single-hop queries ("friends")
  – 8 different operations, mixed workload
  – Best performance with 64 parallel threads
• Each edge and vertex has: version, timestamp, type, random string payload

                          GraphChi-DB (Mac Mini)   MySQL + FB patch, server, SSD-array, 144 GB RAM
Edge-update (95p)         22 ms                    25 ms
Edge-get-neighbors (95p)  18 ms                    9 ms
Avg throughput            2,487 req/s              11,029 req/s
Database size             350 GB                   1.4 TB

See full results in the thesis

69

Summary of Experiments

• Efficient for mixed read/write workloads
  – See the Facebook LinkBench experiments in the thesis
  – LSM-tree trades off read performance (but can be adjusted)
• State-of-the-art performance for graphs that are much larger than RAM
  – Neo4j's linked-list data structure is good for RAM
  – DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater Impact / Hindsight

Future Research Questions

71

GREATER IMPACT

72

Impact: "Big Data" Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)
  – Two major direct descendant papers in top conferences:
    • X-Stream: SOSP 2013
    • TurboGraph: KDD 2013
• Challenging the mainstream
  – You can do a lot on just a PC: focus on the right data structures and computational models

73

Impact: Users

• GraphChi's implementations (C++, Java, -DB) have gained a lot of users
  – Currently ~50 unique visitors / day
• Enables 'everyone' to tackle big graph problems
  – Especially the recommender toolkit (by Danny Bickson) has been very popular
  – Typical users: students, non-systems researchers, small companies…

74

Impact: Users (cont.)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research - it is too expensive.

Using GraphChi I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research. ()"

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for?

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models

1. Changing the value of an edge (both in- and out-edges)
2. Computation processes the whole graph, or most of it, on each iteration
3. Random access to all of a vertex's edges
   – Vertex-centric vs. edge-centric
4. Async / Gauss-Seidel execution

[Diagram: edge u–v]

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
  – Or dynamic scheduling (such as Splash BP)
• High-diameter graphs, such as planar graphs
  – Unless the computation itself has short-range interactions
• Very large number of iterations
  – Neural networks
  – LDA with Collapsed Gibbs sampling
• No support for implicit graph structure

+ Single PC performance is limited

78

Versatility of PSW and PAL

[Diagram: the same PSW / PAL application map shown earlier: GraphChi (Parallel Sliding Windows) and GraphChi-DB (Partitioned Adjacency Lists), online graph updates, batch and incremental computation on evolving graphs, DrunkardMob parallel random walk simulation, GraphChi^2; Graph Computation (Triangle Counting, Item-Item Similarity, PageRank, SALSA, HITS, Graph Contraction, Minimum Spanning Forest, k-Core, Connected Components, Community Detection, Label Propagation, Multi-BFS, Matrix Factorization, Loopy Belief Propagation, Co-EM) and Graph Queries (Shortest Path, Friends-of-Friends, Induced Subgraphs, Neighborhood queries, Edge and vertex properties, Link prediction, Graph sampling, Graph traversal)]

79

Future Research Directions

• Distributed setting
  1. Distributed PSW (one shard / node)?
     1. PSW is inherently sequential
     2. Low bandwidth in the Cloud
  2. Co-operating GraphChi(-DB)'s connected with a Parameter Server
• New graph programming models and tools
  – Vertex-centric programming is sometimes too local: for example, two-hop interactions and many traversals are cumbersome
  – Abstractions for learning graph structure? Implicit graphs?
  – Hard to debug, especially async; better tools needed
• Graph-aware optimizations to GraphChi-DB
  – Buffer management
  – Smart caching
  – Learning the configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The IO performance of PSW is only weakly affected by the amount of RAM
  – Good: works with very little memory
  – Bad: does not benefit from more memory
• Simple trick: cache some data
• Many graph algorithms have O(V) state
  – The update function accesses neighbor vertex state
    • Standard PSW: 'broadcast' the vertex value via edges
    • Semi-external: store vertex values in memory

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
  – But not enough to store the whole graph

[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, …) in RAM]
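A hedged sketch of the semi-external pattern for one algorithm (Pagerank-style; `stream_subgraph` stands in for PSW's edge loading and is an assumption of this illustration):

```python
import numpy as np

# Keep one O(V) array of vertex values pinned in RAM while edges are still
# streamed from disk interval by interval, instead of broadcasting values
# along edges.
class SemiExternalPagerank:
    def __init__(self, num_vertices, d=0.85):
        self.rank = np.full(num_vertices, 1.0 / num_vertices)  # O(V) state in RAM
        self.next = np.zeros(num_vertices)
        self.d = d

    def iterate(self, intervals, stream_subgraph):
        self.next[:] = (1.0 - self.d) / len(self.rank)
        for interval in intervals:
            # stream_subgraph yields (src, dst, out_degree_of_src) from disk
            for src, dst, out_degree in stream_subgraph(interval):
                self.next[dst] += self.d * self.rank[src] / out_degree
        self.rank, self.next = self.next, self.rank
```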

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
  – Store billions of random walk states in RAM
• Multiple Breadth-First Searches
  – Analyze neighborhood sizes by starting hundreds of random BFSes
• Compute many different recommender algorithms in parallel (or with different parameterizations)
  – See Mayank Mohta and Shu-Hao Yu's Master's project

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin) – UAI 2010
Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors) – VLDB 2012
GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch) – OSDI 2012
DrunkardMob: Billions of Random Walks on Just a PC – ACM RecSys 2013
Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch) – SEA 2014
GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin) – submitted (arXiv)
Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin) – ICML 2011

First-author papers in bold; a bracket labeled THESIS on the slide marks the publications included in the thesis.

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation → GraphChi
• Extended PSW to design Partitioned Adjacency Lists for building a scalable graph database → GraphChi-DB
• Proposed DrunkardMob for simulating billions of random walks in parallel
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms → a new approach for external-memory graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                   GraphChi (40 Mac Minis)   PowerGraph (64 EC2 cc1.4xlarge)
Investments        67,320 $                  -
Operating costs:
  Per node, hour   0.03 $                    1.30 $
  Cluster, hour    1.19 $                    52.00 $
  Daily            28.56 $                   1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions:
- Mac Mini: 85 W (typical servers: 500-1000 W)
- Most expensive US energy: 35 c / kWh
- Equal-throughput configurations (based on OSDI'12)
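A quick check of the recoup figure from the daily costs above (decimals restored from the slide):

\[
\frac{\$\,67{,}320}{\$\,1{,}248.00/\text{day} \;-\; \$\,28.56/\text{day}} \;\approx\; 55\ \text{days},
\]

consistent with the roughly 56 days stated.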

89

PSW for In-memory Computation

• External memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM
• In-memory setting
  – Slow = RAM
  – Fast = CPU caches

[Diagram: CPU, small (fast) memory of size M, large (slow) memory of 'unlimited' size, connected by block transfers of size B]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW slightly outperforms Ligra, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

(Ligra: Julian Shun, Guy Blelloch, PPoPP 2013)

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space
• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP, see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Computation (1989)
• Asynchronous execution is sometimes difficult to reason about and debug
• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in Aggarwal-Vitter's I/O model
  – Worst case is only 2x the best case
• Intuition:

[Diagram: vertex ID range 1..|V| split into interval(1) … interval(P) with shards shard(1) … shard(P); example edge (v1, v2)]

An edge whose endpoints fall in the same interval is loaded from disk only once per iteration; an edge spanning two intervals is loaded twice per iteration.
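The same intuition written out (my notation, not the paper's): with block size B and a fraction f of edges whose endpoints lie in different intervals, the edge reads per iteration are

\[
\frac{(1-f)\,|E| \;+\; 2f\,|E|}{B} \;=\; (1+f)\,\frac{|E|}{B},
\qquad
\frac{|E|}{B} \;\le\; \text{reads} \;\le\; \frac{2\,|E|}{B},
\]

so the worst case (f = 1) is only 2x the best case (f = 0); the exact bounds, including writes, are in the paper.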

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise they would require too many iterations
• Per-iteration cost is not much affected
• Can we optimize partitioning?
  – Could help thanks to Gauss-Seidel (faster convergence inside "groups"), topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)
• Compression is problematic for evolving graphs and associated data
• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases
  – GOOD, GraphDB, HyperGraphDB…
  – Focus was on modeling; graph storage on top of a relational DB or key-value store
• RDF databases
  – Most do not use graph storage but store triples as relations + use indexing
• Modern solutions have proposed graph-specific storage
  – Neo4j: doubly linked list
  – TurboGraph: adjacency list chopped into pages
  – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time: 9 hours; FB's: 12 hours
• GraphChi database: about 250 GB; FB: > 1.4 terabytes
  – However, about 100 GB is explained by different variable data (payload) size
• Facebook/MySQL accessed via JDBC, GraphChi embedded
  – But MySQL is native code, GraphChi-DB is Scala (JVM)
• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance vs. size of DB

[Plot: requests / sec (0–10,000) vs. number of nodes (1.0E+07 to 1.0E+09)]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory extension [SSDAlloc, NSDI'11]
   → Too many small objects; need millions / sec.
2. Compress the graph structure to fit into RAM [WebGraph framework]
   → Associated values do not compress well and are mutated.
3. Cluster the graph and handle each cluster separately in RAM
   → Expensive: the number of inter-cluster edges is big.
4. Caching of hot nodes
   → Unpredictable performance.

Number of Shards

• If P is in the "dozens", there is not much effect on performance.

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks → parallel I/O.

[Bar chart: seconds / iteration (0–3000) for Pagerank and Connected components with 1, 2, and 3 hard drives]

Experiment on a 16-core AMD server (from year 2007).

Bottlenecks

[Stacked bar chart: runtime (0–2500 s) of Connected Components on Mac Mini (SSD) with 1, 2, and 4 threads, broken into Disk IO, Graph construction, and Exec. updates]

• The cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSD
  – Graph construction requires a lot of random access in RAM → memory bandwidth becomes a bottleneck

Bottlenecks: Multicore

Experiment on a MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD IO with 2 threads

[Bar charts: runtime (seconds) split into Loading and Computation vs. number of threads (1, 2, 4), for Matrix Factorization (ALS) (0–160 s) and Connected Components (0–1000 s)]

104

In-memory vs Disk

105

Experiment: Query latency

See thesis for IO cost analysis of in-edge / out-edge queries.

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S
• Very fast in GraphChi-DB
  – Sufficient to query for out-edges
  – Parallelizes well: multi-out-edge-query
• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from the graph
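A hedged sketch of the query (my own pseudocode-level Python; `out_edge_query` stands in for the database call):

```python
# An induced-subgraph query needs only out-edge queries, because every edge
# with both endpoints in S is an out-edge of some vertex in S.
def induced_subgraph(S, out_edge_query):
    S = set(S)
    edges = []
    for v in S:                        # parallelizes as a multi-out-edge query
        for dst in out_edge_query(v):
            if dst in S:
                edges.append((v, dst))
    return edges
```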

Vertices / Nodes

• Vertices are partitioned similarly to edges
  – Similar "data shards" for columns
• Lookup/update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
  – A sparse structure could be supported

[Diagram: vertex ID ranges 1..a, a+1..2a, …, up to Max-id]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance the shards
  – Interval length constant a

[Diagram: internal IDs 0, 1, 2, …; a, a+1, a+2, …; 2a, 2a+1, …; original IDs 0..255 map to the start of the first interval, 256..511 to the start of the second, and so on]
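A hedged sketch of one such balancing map consistent with the diagram (the block size of 256, the constants, and the helper names are assumptions of this illustration, not the actual scheme):

```python
# Consecutive blocks of BLOCK original IDs are scattered round-robin across
# P intervals, each of internal length a.
BLOCK = 256

def to_internal(orig_id, P, a):
    block, offset = divmod(orig_id, BLOCK)
    interval = block % P                      # round-robin over intervals
    slot = (block // P) * BLOCK + offset      # position inside the interval
    return interval * a + slot

def to_original(internal_id, P, a):
    interval, pos = divmod(internal_id, a)
    block_in_interval, offset = divmod(pos, BLOCK)
    return (block_in_interval * P + interval) * BLOCK + offset

assert to_original(to_internal(300, P=4, a=1000), P=4, a=1000) == 300
```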

109

What if we have a Cluster?

[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, …) in RAM]

Trade latency for throughput.

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. … which is caused by the lack of good data available to academics: big graphs with metadata
   – Industry co-operation? But problems with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just structure)
3. Too much focus on performance. More important to enable "extracting value".

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i in 1..numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems: "paging" from disk is costly. (This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm – reverse thinking:

for each interval p:
    walkSnapshot = getWalksForInterval(p)
    for each vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        for each walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk.
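A runnable toy version of the loop above (my own sketch, not the thesis code), keeping only each walk's current vertex:

```python
import random
from collections import defaultdict

# Walks are bucketed by the interval of their current vertex, so each pass
# over an interval advances every walk that currently sits there.
def drunkardmob(out_edges, intervals, walks, hops, rng=random.Random(0)):
    # walks: dict walk_id -> current vertex; out_edges: dict vertex -> [neighbors]
    for _ in range(hops):
        buckets = defaultdict(list)                    # interval index -> walk ids
        for w, v in walks.items():
            buckets[interval_of(v, intervals)].append(w)
        for p in range(len(intervals)):                # "ForEach interval p"
            for w in buckets[p]:
                v = walks[w]
                nbrs = out_edges.get(v, [])
                walks[w] = rng.choice(nbrs) if nbrs else v   # addHop
    return walks

def interval_of(v, intervals):
    for p, (lo, hi) in enumerate(intervals):
        if lo <= v <= hi:
            return p
    raise ValueError(v)
```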

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 32: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

32

PSW is Asynchronous (Gauss-Seidel)

1 2 4 3 5

1 1 1 3 3

1 1 1 1 1

Gauss-Seidel expected iterations on random schedule on a chain graph

= (N - 1) (e ndash 1) asymp 60 of synchronous

Each vertex chooses minimum label of neighbor

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 33: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

33

Open theoretical question

Label PropagationSide length = 100

Synchronous PSW Gauss-Seidel (average random schedule)

199 ~ 57

100 ~29

298

iterations

~52

Natural graphs (web social)

graph diameter - 1

~ 06 diameter

Joint work Julian Shun

Chapter 6

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

• C++ implementation: 8,000 lines of code
  – Java implementation also available
• Several optimizations to PSW (see paper)

Source code and examples: http://github.com/graphchi

38

Experiment Setting

• Mac Mini (Apple Inc.)
  – 8 GB RAM
  – 256 GB SSD, 1 TB hard drive
  – Intel Core i5, 2.5 GHz

• Experiment graphs:

Graph          Vertices   Edges   P (shards)   Preprocessing
live-journal   4.8M       69M     3            0.5 min
netflix        0.5M       99M     20           1 min
twitter-2010   42M        1.5B    20           2 min
uk-2007-05     106M       3.7B    40           31 min
uk-union       133M       5.4B    50           33 min
yahoo-web      1.4B       6.6B    50           37 min

39

Comparison to Existing Systems

Notes: comparison results do not include time to transfer the data to the cluster, preprocessing, or the time to load the graph from disk. GraphChi computes asynchronously, while all but GraphLab compute synchronously.

PageRank · WebGraph Belief Propagation (U Kang et al.) · Matrix Factorization (Alt. Least Sqr.) · Triangle Counting

See the paper for more comparisons.

[Bar charts, runtime in minutes, GraphChi (Mac Mini) vs. other systems:
 – Netflix: GraphLab v1 (8 cores) vs. GraphChi (Mac Mini)
 – Twitter-2010 (1.5B edges): Spark (50 machines) vs. GraphChi (Mac Mini)
 – Yahoo-web (6.7B edges): Pegasus / Hadoop (100 machines) vs. GraphChi (Mac Mini)
 – twitter-2010 (1.5B edges): Hadoop (1636 machines) vs. GraphChi (Mac Mini)]

On a Mac Mini: GraphChi can solve as big problems as existing large-scale systems, with comparable performance.

40

PowerGraph Comparison

• PowerGraph / GraphLab 2 outperforms previous systems by a wide margin on natural graphs (OSDI'12)
• With 64x more machines, 512x more CPUs:
  – Pagerank: 40x faster than GraphChi
  – Triangle counting: 30x faster than GraphChi

GraphChi has good performance / CPU.

[Chart: GraphLab 2 vs. GraphChi]

41

In-memory Comparison

• Comparison to state-of-the-art: 5 iterations of PageRank, Twitter graph (1.5B edges)
• Total runtime comparison to 1-shard GraphChi, with initial load + output write taken into account:

GraphChi (Mac Mini, SSD)                             790 secs
Ligra (J. Shun, Blelloch), 40-core Intel E7-8870     15 secs
Ligra (J. Shun, Blelloch), 8-core Xeon 5550          230 s + preproc 144 s
PSW – in-mem version, 700 shards (see Appendix),
  8-core Xeon 5550                                   100 s + preproc 210 s

However, sometimes a better algorithm is available for in-memory than for external-memory / distributed settings.

42

Scalability Input Size [SSD]

• Throughput: number of edges processed / second

Conclusion: the throughput remains roughly constant when the graph size is increased.

GraphChi with a hard drive is ~2x slower than with an SSD (if computational cost is low).

[Chart: PageRank throughput (edges/sec, Mac Mini, SSD) vs. graph size (0 to 7x10^9 edges), for the graphs: domain, twitter-2010, uk-2007-05, uk-union, yahoo-web.]

Paper: scalability of other applications.

43

GRAPHCHI-DB
New work

44

[Diagram: the space spanned by GraphChi (Parallel Sliding Windows) and GraphChi-DB (Partitioned Adjacency Lists), from batch computation on an evolving graph and incremental computation to online graph updates, and from Graph Computation to Graph Queries. Workloads shown: Triangle Counting, Item-Item Similarity, PageRank, SALSA, HITS, Graph Contraction, Minimum Spanning Forest, k-Core, Matrix Factorization, Loopy Belief Propagation, Co-EM, Weakly Connected Components, Strongly Connected Components, Community Detection, Label Propagation, Multi-BFS, Graph traversal, Shortest Path, Friends-of-Friends, Induced Subgraphs, Neighborhood query, Edge and vertex properties, Link prediction, Graph sampling. Also: DrunkardMob (parallel random walk simulation), GraphChi^2.]

45

Research Questions

• What if there is a lot of metadata associated with edges and vertices?
• How to do graph queries efficiently while retaining computational capabilities?
• How to add edges efficiently to the graph?

Can we design a graph database based on GraphChi?

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problems:
• Poor performance with data >> memory
• No/weak support for analytical computation

2) Relational / key-value databases as graph storage

Problems:
• Large indices
• In-edge / out-edge dilemma
• No/weak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review: Edges in Shards

[Diagram: destination-vertex intervals 0–100, 101–700, …, N with Shard 1, Shard 2, Shard 3, …, Shard P; edges in each shard sorted by source_id.]

Edge = (src, dst). Partition by dst.

49

Shard Structure (Basic)

Source   Destination
1        8
1        193
1        76420
3        12
3        872
7        193
7        212
7        89139
…        …

(src, dst)

50

Shard Structure (Basic)

Pointer-array:
Source   File offset
1        0
3        3
7        5
…        …

Edge-array (destinations): 8, 193, 76420, 12, 872, 193, 212, 89139, …

Compressed Sparse Row (CSR)

Problem 1: How to find the in-edges of a vertex quickly?
Note: we know the shard, but the edges are in random order.

51
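To make the CSR layout above concrete, here is a minimal Scala sketch (illustrative only, not GraphChi-DB's actual code; names like ShardCSR are assumptions) of one shard held as a pointer-array plus edge-array, with an out-edge lookup within the shard:

    // Minimal sketch of one shard in CSR form.
    // pointerSrc(i) is a source vertex id; pointerOffset(i) is the index of its
    // first edge in dst. A source's edges end where the next source's begin.
    final case class ShardCSR(pointerSrc: Array[Long],
                              pointerOffset: Array[Int],
                              dst: Array[Long]) {

      // Out-edges of `src` that fall into this shard's destination interval.
      def outEdges(src: Long): Array[Long] = {
        val i = java.util.Arrays.binarySearch(pointerSrc, src)
        if (i < 0) Array.empty[Long]
        else {
          val start = pointerOffset(i)
          val end   = if (i + 1 < pointerOffset.length) pointerOffset(i + 1) else dst.length
          dst.slice(start, end)
        }
      }
    }

    object ShardCSRExample extends App {
      // The example rows from the slides: sources 1, 3, 7 at offsets 0, 3, 5.
      val shard = ShardCSR(
        pointerSrc    = Array(1L, 3L, 7L),
        pointerOffset = Array(0, 3, 5),
        dst           = Array(8L, 193L, 76420L, 12L, 872L, 193L, 212L, 89139L))
      println(shard.outEdges(3L).mkString(", "))   // prints: 12, 872
    }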

PAL In-edge Linkage

(Same pointer-array and edge-array as above, before the in-edge linkage is added.)

52

PAL In-edge Linkage

Edge-array:
Destination   Link
8             3339
193           3
76420         1092
12            289
872           40
193           2002
212           12
89139         22
…

Pointer-array:
Source   File offset
1        0
3        3
7        5
…        …

+ Index to the first in-edge for each vertex in the interval.
Augmented linked list for in-edges (see the sketch below).

Problem 2: How to find out-edges quickly?
Note: sorted inside a shard, but partitioned across all shards.

53
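A sketch of how the in-edge linkage above could be followed inside one shard: each edge row stores the offset of the next edge with the same destination, and a per-interval index gives the first such offset for each vertex. This is an illustration under assumed names, not the actual GraphChi-DB implementation.

    // Sketch: in-edges of a vertex inside one shard via the augmented linked list.
    // dst(i)  = destination of edge i;
    // link(i) = index of the next edge in this shard with the same destination, or -1.
    final case class ShardWithInEdgeLinks(dst: Array[Long],
                                          link: Array[Int],
                                          firstInEdge: Map[Long, Int]) {

      // Indices of all in-edges of `vertex` stored in this shard.
      def inEdgeIndices(vertex: Long): List[Int] = {
        val out = scala.collection.mutable.ListBuffer.empty[Int]
        var idx = firstInEdge.getOrElse(vertex, -1)
        while (idx >= 0) {
          out += idx
          idx = link(idx)   // follow the chain to the next edge sharing this destination
        }
        out.toList
      }
    }

For example, if edges 1 and 5 both have destination 193, then firstInEdge(193) = 1, link(1) = 5 and link(5) = -1, so inEdgeIndices(193) returns List(1, 5).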

PAL: Out-edge Queries

Pointer-array:
Source   File offset
1        0
3        3
7        5
…        …

Problem: can be big – O(V). Binary search on disk is slow.

Edge-array:
Destination   Next-in-offset
8             3339
193           3
76420         1092
12            289
872           40
193           2002
212           12
89139         22
…

Option 1: Sparse index (in-memory) [diagram shows index entries at 256, 512, 1024, …].
Option 2: Delta-coding with unary code (Elias-Gamma) – completely in-memory.

54
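Option 1 above (a sparse in-memory index over the on-disk pointer-array) might look like the following sketch: keep every k-th (source, file position) pair in RAM, binary-search it, and then scan at most k pointer-array entries from disk. The names and the default stride are assumptions for illustration.

    // Sketch of a sparse in-memory index over the on-disk pointer-array.
    // sampleSrc(i) / sampleFilePos(i): every k-th pointer-array entry kept in RAM.
    final class SparsePointerIndex(sampleSrc: Array[Long],
                                   sampleFilePos: Array[Long],
                                   k: Int = 1024) {

      // File position from which at most k pointer entries need to be scanned
      // to locate the out-edge list of `src` in this shard.
      def scanStartFor(src: Long): Long = {
        val i    = java.util.Arrays.binarySearch(sampleSrc, src)
        val slot = if (i >= 0) i else math.max(0, -i - 2)   // last sample <= src
        sampleFilePos(slot)
      }
    }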

Experiment: Indices

[Chart: median query latency, Twitter graph, 1.5B edges.]

55

Queries: I/O costs

In-edge query: only one shard.
Out-edge query: each shard that has edges.

Trade-off: more shards → better locality for in-edge queries, worse for out-edge queries.

Edge Data & Searches

[Diagram: Shard X – adjacency, plus separate fixed-size columns: 'weight' [float], 'timestamp' [long], 'belief' (factor).]

Edge (i): no foreign key is required to find the edge data (fixed size). See the offset sketch below.

Note: vertex values are stored similarly.

57
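Because each edge attribute lives in its own fixed-width column, the byte position of edge i's value is simply i times the field width; no foreign key or per-edge index lookup is needed. A minimal sketch (file layout and names assumed for illustration):

    import java.io.RandomAccessFile

    // Sketch: read the float 'weight' of edge number edgeIndex from a per-shard column file.
    // Fixed-size fields mean the offset is just edgeIndex * 4 bytes.
    def readEdgeWeight(columnFile: RandomAccessFile, edgeIndex: Long): Float = {
      columnFile.seek(edgeIndex * 4L)   // 4 bytes per float value
      columnFile.readFloat()
    }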

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading the existing shard from disk. Each edge is always rewritten on a merge.

Does not scale: number of rewrites = data size / buffer size = O(E).

60

Log-Structured Merge-tree (LSM). Ref: O'Neil, Cheng et al. (1996)

[Diagram: LEVEL 1 (youngest) – top shards with in-memory buffers, one per interval (interval 1, interval 2, interval 3, interval 4, …, interval P-1, interval P); new edges enter here. Downstream merges feed LEVEL 2 – on-disk shards covering intervals 1–4, …, (P-4)–P – and LEVEL 3 (oldest), covering intervals 1–16, and so on.]

In-edge query: one shard on each level.
Out-edge query: all shards.

61
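The merge arrangement above can be sketched as follows: new edges land in the in-memory level; when a level grows past its budget it is merged downstream into the next, geometrically larger level, so each edge is rewritten only a logarithmic number of times instead of O(E / buffer size). This is a schematic illustration, not GraphChi-DB's actual merge code.

    // Schematic LSM levels for one interval (illustration only).
    // Each level holds (src, dst) pairs sorted by src; level capacities grow geometrically.
    final class LsmLevels(fanout: Int = 4, level1Capacity: Int = 1000000) {
      private val levels = scala.collection.mutable.ArrayBuffer(Vector.empty[(Long, Long)])

      def insert(edge: (Long, Long)): Unit = {
        levels(0) = levels(0) :+ edge
        var l = 0
        // Cascade: merge a full level downstream into the next, larger level.
        while (levels(l).size > level1Capacity * math.pow(fanout, l)) {
          if (l + 1 == levels.size) levels += Vector.empty
          levels(l + 1) = (levels(l + 1) ++ levels(l)).sortBy(_._1)
          levels(l) = Vector.empty
          l += 1
        }
      }

      // An out-edge style query must consult every level (one shard per level).
      def edgesOf(src: Long): Seq[(Long, Long)] = levels.flatMap(_.filter(_._1 == src)).toSeq
    }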

Experiment: Ingest

[Chart: edges ingested (0 to 1.5x10^9) vs. time in seconds (0 to 16,000), GraphChi-DB with LSM vs. GraphChi-No-LSM.]

62

Advantages of PAL

• Only sparse and implicit indices
  – Pointer-array usually fits in RAM with Elias-Gamma → small database size
• Columnar data model
  – Load only the data you need
  – Graph structure is separate from data
  – Property graph model
• Great insertion throughput with LSM
  – Tree can be adjusted to match the workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala
• Queries & Computation
• Online database

All experiments shown in this talk were done on a Mac Mini (8 GB, SSD).

Source code and examples: http://github.com/graphchi

65

Comparison Database Size

Baseline: 4 + 4 bytes / edge.

[Bar chart: database file size for the twitter-2010 graph (1.5B edges), scale 0–70, for MySQL (data + indices), Neo4j, GraphChi-DB, and the BASELINE.]

66
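As a sanity check on the BASELINE bar, and assuming the 1.5B edge count from the experiment table, the 4 + 4 bytes/edge baseline works out to roughly 1.5 x 10^9 edges x 8 bytes/edge = 1.2 x 10^10 bytes, i.e. about 12 GB; the measured database sizes should be read against that floor.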

Comparison Ingest

System                  Time to ingest 1.5B edges
GraphChi-DB (ONLINE)    1 hour 45 mins
Neo4j (batch)           45 hours
MySQL (batch)           3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes.

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

[Bar charts: friends-of-friends query latency in milliseconds, GraphChi-DB vs. Neo4j (and MySQL on the big graph); panels: Small graph – 50th percentile, Small graph – 99th percentile, Big graph – 50th percentile, Big graph – 99th percentile.]

Latency percentiles over 100K random queries. Small graph: 68M edges; big graph: 1.5B edges.

68
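A friends-of-friends query of the kind benchmarked above can be expressed as two rounds of out-edge queries. A minimal sketch against an assumed queryOutNeighbors function (not the GraphChi-DB API):

    // Sketch: friends-of-friends = out-neighbors of out-neighbors, excluding the
    // start vertex and its direct neighbors. queryOutNeighbors stands in for the
    // database's out-edge query (which touches each shard that has edges).
    def friendsOfFriends(v: Long, queryOutNeighbors: Long => Set[Long]): Set[Long] = {
      val friends = queryOutNeighbors(v)
      val fof     = friends.flatMap(queryOutNeighbors)   // second hop
      fof -- friends - v
    }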

LinkBench Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
  – But only single-hop queries ("friends")
  – 8 different operations, mixed workload
  – Best performance with 64 parallel threads
• Each edge and vertex has:
  – Version, timestamp, type, random string payload

                           GraphChi-DB (Mac Mini)   MySQL + FB patch (server: SSD-array, 144 GB RAM)
Edge-update (95p)          22 ms                    25 ms
Edge-get-neighbors (95p)   18 ms                    9 ms
Avg throughput             2,487 req/s              11,029 req/s
Database size              350 GB                   1.4 TB

See full results in the thesis

69

Summary of Experiments

• Efficient for mixed read/write workload
  – See Facebook LinkBench experiments in the thesis
  – LSM-tree: trade-off in read performance (but can be adjusted)
• State-of-the-art performance for graphs that are much larger than RAM
  – Neo4j's linked-list data structure is good for RAM
  – DEX performs poorly in practice

More experiments in the thesis.

70

Discussion

Greater Impact / Hindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)
  – Two major direct descendant papers in top conferences:
    • X-Stream: SOSP 2013
    • TurboGraph: KDD 2013
• Challenging the mainstream
  – You can do a lot on just a PC: focus on the right data structures and computational models

73

Impact Users

• GraphChi's implementations (C++, Java, -DB) have gained a lot of users
  – Currently ~50 unique visitors / day
• Enables 'everyone' to tackle big graph problems
  – Especially the recommender toolkit (by Danny Bickson) has been very popular
  – Typical users: students, non-systems researchers, small companies…

74

Impact Users (cont)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research – it is too expensive.

Using GraphChi, I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research."

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models

1. Changing the value of an edge (both in- and out-)
2. Computation processes the whole graph, or most of it, on each iteration
3. Random access to all of a vertex's edges
   – Vertex-centric vs. edge-centric
4. Async / Gauss-Seidel execution

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
  – Or dynamic scheduling (such as Splash BP)
• High-diameter graphs, such as planar graphs
  – Unless the computation itself has short-range interactions
• Very large number of iterations
  – Neural networks
  – LDA with Collapsed Gibbs sampling
• No support for implicit graph structure

+ Single PC performance is limited

78

Versatility of PSW and PAL

[Diagram: the same versatility map as before – GraphChi (Parallel Sliding Windows) and GraphChi-DB (Partitioned Adjacency Lists) spanning graph computation (Triangle Counting, Item-Item Similarity, PageRank, SALSA, HITS, Graph Contraction, Minimum Spanning Forest, k-Core, Matrix Factorization, Loopy Belief Propagation, Co-EM, Weakly Connected Components, Strongly Connected Components, Community Detection, Label Propagation, Multi-BFS, Graph traversal) and graph queries (Shortest Path, Friends-of-Friends, Induced Subgraphs, Neighborhood query, Edge and vertex properties, Link prediction, Graph sampling), plus online graph updates, evolving graphs, incremental computation, DrunkardMob (parallel random walk simulation) and GraphChi^2.]

79

Future Research Directions

• Distributed setting
  1. Distributed PSW (one shard / node)
     1. PSW is inherently sequential
     2. Low bandwidth in the Cloud
  2. Co-operating GraphChi(-DB)'s connected with a Parameter Server
• New graph programming models and tools
  – Vertex-centric programming is sometimes too local: for example, two-hop interactions and many traversals are cumbersome
  – Abstractions for learning graph structure? Implicit graphs?
  – Hard to debug, especially async → better tools needed
• Graph-aware optimizations to GraphChi-DB
  – Buffer management
  – Smart caching
  – Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
  – Good: works with very little memory
  – Bad: does not benefit from more memory
• Simple trick: cache some data
• Many graph algorithms have O(V) state
  – The update function accesses neighbor vertex state
    • Standard PSW: 'broadcast' the vertex value via edges
    • Semi-external: store vertex values in memory

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
  – But not enough to store the whole graph

[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, …) in RAM.]

83
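A toy sketch of the semi-external pattern above: the O(V) vertex state lives in a plain array in RAM while the edges are streamed from disk, so neighbor state is read from memory instead of being 'broadcast' along edges. Names are assumptions; the normalization and damping of a real PageRank are omitted.

    // Toy semi-external pass: vertex state in RAM, edges streamed from disk.
    // numVertices floats fit easily in memory even when the edge list does not.
    def semiExternalPass(numVertices: Int,
                         streamEdges: ((Long, Long) => Unit) => Unit): Array[Float] = {
      val value    = Array.fill(numVertices)(1.0f)   // current vertex state, O(V) in RAM
      val nextAccu = Array.fill(numVertices)(0.0f)   // accumulator for the next iteration

      // One sequential pass over all edges, supplied by the disk-based streamer.
      streamEdges { (src, dst) =>
        nextAccu(dst.toInt) += value(src.toInt)      // neighbor state read from RAM
      }
      nextAccu
    }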

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
  – Store billions of random walk states in RAM
• Multiple Breadth-First Searches
  – Analyze neighborhood sizes by starting hundreds of random BFSes
• Compute many different recommender algorithms in parallel (or with different parameterizations)
  – See Mayank Mohta and Shu-Hao Yu's Master's project

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin) – UAI 2010
Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors) – VLDB 2012
GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch) – OSDI 2012
DrunkardMob: Billions of Random Walks on Just a PC – ACM RecSys 2013
Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch) – SEA 2014
GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin) – (submitted, arXiv)
Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin) – ICML 2011

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation → GraphChi
• Extended PSW to design Partitioned Adjacency Lists, to build a scalable graph database → GraphChi-DB
• Proposed DrunkardMob for simulating billions of random walks in parallel
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms → a new approach for external-memory graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                    GraphChi (40 Mac Minis)   PowerGraph (64 EC2 cc1.4xlarge)
Investments         67,320 $                  –
Operating costs:
  Per node / hour   0.03 $                    1.30 $
  Cluster / hour    1.19 $                    52.00 $
  Daily             28.56 $                   1,248.00 $

It takes about 56 days to recoup the Mac Mini investment.

Assumptions: Mac Mini 85 W (typical servers 500–1000 W); most expensive US energy, 35c / kWh. Equal-throughput configurations (based on OSDI'12).

89

PSW for In-memory Computation

• External memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM
• In-memory
  – Slow = RAM
  – Fast = CPU caches

[Diagram: CPU – small (fast) memory of size M – block transfers of size B – large (slow) memory of 'unlimited' size.]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW (slightly) outperforms Ligra*, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

* Julian Shun, Guy Blelloch, PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space
• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP: see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)
• Asynchronous is sometimes difficult to reason about and debug
• Asynchronous can be used to implement BSP

I/O Complexity

• See the paper for a theoretical analysis in the Aggarwal-Vitter I/O model
  – Worst case is only 2x the best case
• Intuition:

[Diagram: intervals interval(1), interval(2), …, interval(P) over vertex IDs 1 … |V| with shard(1), shard(2), …, shard(P); example vertices v1, v2.]

An edge within an interval is loaded from disk only once per iteration; an edge spanning intervals is loaded twice per iteration.

93
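Under that intuition (each edge read at most twice per iteration, and written back at most twice when edge values are updated), a rough per-iteration transfer count is on the order of 4·|E|·b / B sequential block transfers (b = bytes per edge, B = block size), plus the non-sequential seeks caused by the P sliding windows (on the order of P^2 per iteration); see the paper for the precise bounds in the Aggarwal-Vitter model.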

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter: otherwise they would require too many iterations
• Per-iteration cost is not much affected
• Can we optimize the partitioning?
  – Could help thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression Would it help

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3–4 bits / edge (web), ~10 bits / edge (social)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)
• Compression is problematic for evolving graphs and associated data
• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases
  – GOOD, GraphDB, HyperGraphDB…
  – Focus was on modeling; graph storage on top of a relational DB or key-value store
• RDF databases
  – Most do not use graph storage, but store triples as relations + use indexing
• Modern solutions have proposed graph-specific storage
  – Neo4j: doubly linked list
  – TurboGraph: adjacency list chopped into pages
  – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time: 9 hours; FB's: 12 hours
• GraphChi database about 250 GB; FB > 1.4 terabytes
  – However, about 100 GB is explained by different variable data (payload) size
• Facebook: MySQL via JDBC; GraphChi: embedded
  – But MySQL is native code, GraphChi-DB is Scala (JVM)
• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance / size of DB

[Chart: requests / sec (0–10,000) vs. number of nodes (10^7 to 10^9).]

Why not even faster when everything is in RAM? → computationally bound requests.

Possible Solutions

1. Use SSD as a memory-extension [SSDAlloc, NSDI'11]
   → Too many small objects; would need millions / sec
2. Compress the graph structure to fit into RAM [WebGraph framework]
   → Associated values do not compress well and are mutated
3. Cluster the graph and handle each cluster separately in RAM
   → Expensive: the number of inter-cluster edges is big
4. Caching of hot nodes
   → Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance.

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks → parallel I/O

[Chart: seconds / iteration (0–3,000) for Pagerank and Connected components with 1, 2, and 3 hard drives.]

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

[Chart: runtime breakdown (Disk IO, Graph construction, Exec. updates) with 1, 2, and 4 threads; Connected Components on Mac Mini, SSD.]

• The cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM → memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on a MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime (seconds) split into Loading vs. Computation for 1, 2, and 4 threads; Matrix Factorization (ALS) and Connected Components.]

104

In-memory vs Disk

105

Experiment: Query latency

See thesis for the I/O cost analysis of in/out queries.

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S
• Very fast in GraphChi-DB
  – Sufficient to query for out-edges (see the sketch below)
  – Parallelizes well: multi-out-edge-query
• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from the graph
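A sketch of the induced-subgraph query described above, using only out-edge queries (queryOutNeighbors is the same assumed stand-in as in the friends-of-friends sketch earlier):

    // Sketch: the induced subgraph of vertex set s needs only out-edge queries,
    // because every edge with both endpoints in s is an out-edge of some vertex in s.
    def inducedSubgraph(s: Set[Long], queryOutNeighbors: Long => Set[Long]): Set[(Long, Long)] =
      for {
        u <- s
        v <- queryOutNeighbors(u) if s.contains(v)   // keep edges whose other endpoint is in s
      } yield (u, v)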

Vertices / Nodes

• Vertices are partitioned similarly to edges
  – Similar "data shards" for columns
• Lookup/update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
  – A sparse structure could be supported

[Diagram: vertex-ID intervals 1 … a, a+1 … 2a, …, up to Max-id.]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance the shards
  – Interval length is a constant a

[Diagram: original IDs 0, 1, 2, …, 255 and 256, 257, 258, …, 511 shown against internal IDs 0, 1, 2, …; a, a+1, a+2, …; 2a, 2a+1, …, with interval length a.]

109

What if we have a Cluster

[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, …) in RAM.]

Trade latency for throughput.

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. … which is caused by a lack of good data available to academics: big graphs with metadata
   – Industry co-operation? But a problem with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just the structure)
3. Too much focus on performance. More important to enable "extracting value".

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

    parfor walk in walks:
      for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())


Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java, 2009

Distributed graph systems: each hop across a partition boundary is costly.
Disk-based "single-machine" graph systems: "paging" from disk is costly.

(This talk)


Random walks in GraphChi

• DrunkardMob algorithm
  – Reverse thinking:

    ForEach interval p:
      walkSnapshot = getWalksForInterval(p)
      ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
          walkManager.addHop(walk, vertex.randomNeighbor())

Note: only the current position of each walk needs to be stored.
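Since only the current position of a walk is needed, a walk can be kept as one small fixed-size record. The sketch below packs a (source index, hop count, current vertex offset) triple into a single Long; this is one possible encoding for illustration, not necessarily the thesis's actual bit layout.

    // Illustrative packing of a walk's state into one Long:
    // 24 bits source index | 8 bits hop count | 32 bits current vertex offset.
    object WalkEncoding {
      def encode(sourceIdx: Int, hop: Int, vertexOffset: Int): Long =
        (sourceIdx.toLong << 40) | ((hop.toLong & 0xFF) << 32) | (vertexOffset.toLong & 0xFFFFFFFFL)

      def sourceIdx(w: Long): Int    = (w >>> 40).toInt
      def hop(w: Long): Int          = ((w >>> 32) & 0xFF).toInt
      def vertexOffset(w: Long): Int = (w & 0xFFFFFFFFL).toInt
    }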

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 34: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

34

PSW amp External Memory Algorithms Research

bull PSW is a new technique for implementing many fundamental graph algorithmsndash Especially simple (compared to previous work)

for directed graph problems PSW handles both in- and out-edges

bull We propose new graph contraction algorithm based on PSWndash Minimum-Spanning Forest amp Connected

Components

bull hellip utilizing the Gauss-Seidel ldquoaccelerationrdquo

Joint work Julian Shun

SEA 2014 Chapter 6 + Appendix

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 35: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

35

GRAPHCHI SYSTEM EVALUATION

Sneak peek

Consult the paper for a comprehensive evaluationbull HD vs SSDbull Striping data across multiple hard

drivesbull Comparison to an in-memory

versionbull Bottlenecks analysisbull Effect of the number of shardsbull Block size and performance

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle Counting, Item-Item Similarity

PageRank, SALSA, HITS

Graph Contraction, Minimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected Components, Strongly Connected Components

Community Detection, Label Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief Propagation, Co-EM

Graph traversal

45

Research Questions

• What if there is a lot of metadata associated with edges and vertices?

• How to do graph queries efficiently while retaining computational capabilities?

• How to add edges efficiently to the graph?

Can we design a graph database based on GraphChi?

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problems:
• Poor performance with data >> memory
• No/weak support for analytical computation

2) Relational / key-value databases as graph storage

Problems:
• Large indices
• In-edge / out-edge dilemma
• No/weak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

[Figure: edges divided into shards Shard 1, Shard 2, Shard 3, ..., Shard P by vertex intervals 0..100, 101..700, ..., N; within each shard, edges are sorted by source_id.]

Edge = (src, dst). Partition by dst.

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

… … (src, dst)

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

…

Source File offset

1 0

3 3

7 5

… …

src dst

Compressed Sparse Row (CSR)

Problem 1:
How to find in-edges of a vertex quickly?
Note: we know the shard, but the edges are in random order.

Pointer-array
Edge-array
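A minimal sketch of this CSR-style shard layout in Scala (illustrative names, not the actual GraphChi-DB code), using the example values from the tables above: the pointer array maps a source vertex to the offset of its first edge, and a shard-local out-edge query reads the contiguous range up to the next pointer.

case class ShardCSR(srcPointers: Vector[(Long, Int)],   // (source id, offset of its first edge)
                    edgeDst: Vector[Long]) {            // destination ids, grouped by source

  // Out-edges of `src` stored in this shard: find the pointer entry, then read
  // the contiguous slice that ends at the next source's offset.
  def outEdgesInShard(src: Long): Seq[Long] = {
    val i = srcPointers.indexWhere(_._1 == src)
    if (i < 0) Nil
    else {
      val start = srcPointers(i)._2
      val end   = if (i + 1 < srcPointers.length) srcPointers(i + 1)._2 else edgeDst.length
      edgeDst.slice(start, end)
    }
  }
}

object ShardCSRDemo extends App {
  val shard = ShardCSR(
    srcPointers = Vector((1L, 0), (3L, 3), (7L, 5)),
    edgeDst     = Vector(8L, 193L, 76420L, 12L, 872L, 193L, 212L, 89139L))
  println(shard.outEdgesInShard(3L))   // Vector(12, 872)
}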

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

…

Source File offset

1 0

3 3

7 5

… …

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

…

Source File offset

1 0

3 3

7 5

… …

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2:
How to find out-edges quickly?
Note: sorted inside a shard, but partitioned across all shards.

Augmented linked list for in-edges
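A sketch of how such an in-edge chain can be followed inside one shard (Scala; the function names are assumptions for illustration, not the real API): start from the per-vertex index entry and hop along the stored link offsets.

object InEdgeChain {
  // firstInEdge(v): offset of v's first in-edge in this shard, or -1 if none.
  // link(off):      offset of the next in-edge with the same destination, stored on the edge row.
  // src(off):       source id of the edge row at `off`.
  def inEdgeSources(firstInEdge: Long => Long,
                    link: Long => Long,
                    src: Long => Long,
                    v: Long): List[Long] = {
    val sources = scala.collection.mutable.ListBuffer[Long]()
    var off = firstInEdge(v)
    while (off >= 0) {      // a negative offset terminates the chain
      sources += src(off)
      off = link(off)       // jump to v's next in-edge within this shard
    }
    sources.toList
  }
}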

53

Source File offset

1 0

3 3

7 5

… …

PAL Out-edge Queries

Pointer-array

Edge-array

Problem: can be big, O(V). Binary search on disk is slow.

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

…

256

512

1024

…

Option 1: Sparse index (in-mem)

Option 2: Delta-coding with unary code (Elias-Gamma). Completely in-memory.
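A small sketch of Option 2: delta-coding the pointer data with Elias-Gamma (unary length prefix followed by the binary value). This is illustrative only, under the assumption of a plain gap encoding; the actual encoder in GraphChi-DB may differ in details.

object GammaPointerArray extends App {
  import scala.collection.mutable.ArrayBuffer

  // gamma(n > 0) = floor(log2 n) zeros, then n in binary (MSB first).
  def gammaEncode(values: Seq[Long]): Vector[Boolean] = {
    val bits = ArrayBuffer[Boolean]()
    for (n <- values) {
      require(n > 0)
      val len = 63 - java.lang.Long.numberOfLeadingZeros(n)     // floor(log2 n)
      bits ++= Seq.fill(len)(false)                              // unary length prefix
      for (i <- len to 0 by -1) bits += ((n >> i) & 1L) == 1L    // binary digits of n
    }
    bits.toVector
  }

  def gammaDecode(bits: IndexedSeq[Boolean], count: Int): Seq[Long] = {
    var pos = 0
    (1 to count).map { _ =>
      var len = 0
      while (!bits(pos)) { len += 1; pos += 1 }                  // read unary prefix
      var n = 0L
      for (_ <- 0 to len) { n = (n << 1) | (if (bits(pos)) 1L else 0L); pos += 1 }
      n
    }
  }

  // Store gaps between consecutive entries instead of absolute values:
  // small gaps take only a few bits, so the pointer data stays in memory.
  val sources = Seq(1L, 3L, 7L, 12L)
  val gaps    = sources.head +: sources.sliding(2).map(p => p(1) - p(0)).toSeq
  val bits    = gammaEncode(gaps)
  assert(gammaDecode(bits, gaps.size) == gaps)                   // round-trip check
}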

54

Experiment Indices

Median latency, Twitter graph, 1.5B edges

55

Queries: I/O costs
In-edge query: only one shard
Out-edge query: each shard that has edges

Trade-off: more shards = better locality for in-edge queries, worse for out-edge queries.

Edge Data & Searches

Shard X - adjacency

'weight' [float]

'timestamp' [long], 'belief' (factor)

Edge (i)

No foreign key required to find edge data (fixed size).

Note: vertex values are stored similarly.
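A sketch of the fixed-size column idea (Scala; class and file name are hypothetical): because every value in a column has the same size, the value of edge i is located by direct offset, with no foreign key.

import java.io.RandomAccessFile

// One column file of 4-byte floats, e.g. the 'weight' column of a shard.
class FloatColumn(path: String) {
  private val file = new RandomAccessFile(path, "rw")
  def set(edgeIndex: Long, value: Float): Unit = {
    file.seek(edgeIndex * 4L)      // fixed-size slot: offset = edge index * 4 bytes
    file.writeFloat(value)
  }
  def get(edgeIndex: Long): Float = {
    file.seek(edgeIndex * 4L)
    file.readFloat()
  }
  def close(): Unit = file.close()
}

// Usage (hypothetical file name):
//   val weights = new FloatColumn("shardX.weight.col")
//   weights.set(42L, 0.75f); println(weights.get(42L)); weights.close()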

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading the existing shard from disk. Each edge is rewritten on every merge.

Does not scale: number of rewrites = data size / buffer size = O(E).

60

Log-Structured Merge-tree

(LSM). Ref: O'Neil, Cheng et al. (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1, interval 2, interval 3, interval 4, ..., interval P-1, interval P

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3 (oldest)

In-edge query: one shard on each level

Out-edge query: all shards
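A toy sketch of the LSM idea used for ingest (Scala; heavily simplified: flat sorted runs stand in for the real partitioned shards). New edges go to an in-memory buffer; a full buffer becomes a level-1 run; when a level fills up, its runs are merged downstream into one bigger run on the next level.

object LsmSketch {
  case class Edge(src: Long, dst: Long)

  class LsmIngest(bufferCapacity: Int, fanout: Int = 4) {
    private val buffer = scala.collection.mutable.ArrayBuffer[Edge]()
    // levels(0) is the youngest level; each entry is a list of sorted runs ("shards").
    private val levels = scala.collection.mutable.ArrayBuffer[List[Vector[Edge]]]()

    def addEdge(src: Long, dst: Long): Unit = {
      buffer += Edge(src, dst)
      if (buffer.size >= bufferCapacity) flush()
    }

    private def flush(): Unit = {
      var run = buffer.sortBy(e => (e.dst, e.src)).toVector   // new sorted run
      buffer.clear()
      var level = 0
      while (true) {
        if (levels.size <= level) levels += Nil
        levels(level) = run :: levels(level)
        if (levels(level).size < fanout) return                // this level still has room
        // Downstream merge: combine the level's runs into one run for the next level.
        run = levels(level).flatten.sortBy(e => (e.dst, e.src)).toVector
        levels(level) = Nil
        level += 1
      }
    }

    def runs: Seq[Vector[Edge]] = levels.flatten.toSeq
  }
}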

61

Experiment Ingest

[Chart: ingested edges (0 to 1.5E+09) vs. time in seconds (0 to 16,000), for GraphChi-DB with LSM and GraphChi-No-LSM.]

62

Advantages of PAL

• Only sparse and implicit indices
  – Pointer-array usually fits in RAM with Elias-Gamma
  – Small database size
• Columnar data model
  – Load only the data you need
  – Graph structure is separate from data
  – Property graph model
• Great insertion throughput with LSM
  – Tree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala
• Queries & Computation
• Online database

All experiments shown in this talk were done on a Mac Mini (8 GB, SSD).

Source code and examples: http://github.com/graphchi

65

Comparison Database Size

Baseline: 4 + 4 bytes / edge

[Chart: database file size for the twitter-2010 graph (1.5B edges), scale 0 to 70; bars for MySQL (data + indices), Neo4j, GraphChi-DB, and the baseline.]

66

Comparison Ingest

System | Time to ingest 1.5B edges
GraphChi-DB (ONLINE) | 1 hour 45 mins
Neo4j (batch) | 45 hours
MySQL (batch) | 3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes.

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

[Charts: friends-of-friends query latency percentiles over 100K random queries, in milliseconds.
Small graph (68M edges), 50th percentile: GraphChi-DB 0.379, Neo4j 0.127.
Small graph (68M edges), 99th percentile: GraphChi-DB 6.653, Neo4j 8.078.
Big graph (1.5B edges), 50th percentile: GraphChi-DB vs. Neo4j vs. MySQL.
Big graph (1.5B edges), 99th percentile: GraphChi-DB 1264, Neo4j 1631, MySQL 4776.]

68

LinkBench Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
  – But only single-hop queries ("friends")
  – 8 different operations, mixed workload
  – Best performance with 64 parallel threads
• Each edge and vertex has:
  – Version, timestamp, type, random string payload

 | GraphChi-DB (Mac Mini) | MySQL + FB patch (server: SSD-array, 144 GB RAM)
Edge-update (95p) | 22 ms | 25 ms
Edge-get-neighbors (95p) | 18 ms | 9 ms
Avg throughput | 2,487 req/s | 11,029 req/s
Database size | 350 GB | 1.4 TB

See full results in the thesis.

69

Summary of Experiments

• Efficient for mixed read/write workload
  – See Facebook LinkBench experiments in the thesis
  – LSM-tree trades off read performance (but can be adjusted)
• State-of-the-art performance for graphs that are much larger than RAM
  – Neo4j's linked-list data structure is good for RAM
  – DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater Impact / Hindsight

Future Research Questions

71

GREATER IMPACT

72

Impact: "Big Data" Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)
  – Two major direct descendant papers in top conferences:
    • X-Stream: SOSP 2013
    • TurboGraph: KDD 2013
• Challenging the mainstream:
  – You can do a lot on just a PC, with focus on the right data structures and computational models

73

Impact Users

• GraphChi's implementations (C++, Java, -DB) have gained a lot of users
  – Currently ~50 unique visitors / day
• Enables 'everyone' to tackle big graph problems
  – Especially the recommender toolkit (by Danny Bickson) has been very popular
  – Typical users: students, non-systems researchers, small companies…

74

Impact Users (cont)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research - it is too expensive.

Using GraphChi, I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research." ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for?

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models

1. Changing the value of an edge (both in- and out-edges)
2. Computation processes the whole graph, or most of it, on each iteration
3. Random access to all of a vertex's edges
   – Vertex-centric vs. edge-centric
4. Async / Gauss-Seidel execution

u v

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
  – Or dynamic scheduling (such as Splash BP)
• High-diameter graphs, such as planar graphs
  – Unless the computation itself has short-range interactions
• Very large number of iterations
  – Neural networks
  – LDA with Collapsed Gibbs sampling
• No support for implicit graph structure

+ Single-PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB (Partitioned Adjacency Lists)

Online graph updates

Batch comp.

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle Counting, Item-Item Similarity

PageRank, SALSA, HITS

Graph Contraction, Minimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected Components, Strongly Connected Components

Community Detection, Label Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief Propagation, Co-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

• Distributed setting
  1. Distributed PSW (one shard / node)
     1. PSW is inherently sequential
     2. Low bandwidth in the Cloud
  2. Co-operating GraphChi(-DB)'s connected with a Parameter Server
• New graph programming models and tools
  – Vertex-centric programming is sometimes too local: for example, two-hop interactions and many traversals are cumbersome
  – Abstractions for learning graph structure; implicit graphs
  – Hard to debug, especially async; better tools needed
• Graph-aware optimizations to GraphChi-DB
  – Buffer management
  – Smart caching
  – Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
  – Good: works with very little memory
  – Bad: does not benefit from more memory
• Simple trick: cache some data
• Many graph algorithms have O(V) state
  – Update function accesses neighbor vertex state
    • Standard PSW: 'broadcast' vertex value via edges
    • Semi-external: store vertex values in memory
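As a concrete illustration of the semi-external trick, here is a minimal synchronous PageRank sketch in Scala that keeps only the O(V) rank arrays in RAM and streams the edges from disk once per iteration. This is a simplification, not GraphChi's actual (asynchronous, vertex-centric) implementation, and streamEdges is a stand-in for the shard-by-shard edge pass.

object SemiExternalPageRank {
  // streamEdges(f) must call f(src, dst) once for every edge, e.g. by scanning
  // the shards on disk sequentially; only the two rank arrays live in memory.
  def pagerank(numVertices: Int,
               outDegree: Array[Int],
               streamEdges: ((Int, Int) => Unit) => Unit,
               iterations: Int = 5,
               d: Double = 0.85): Array[Double] = {
    var ranks = Array.fill(numVertices)(1.0 / numVertices)
    for (_ <- 1 to iterations) {
      val next = Array.fill(numVertices)((1.0 - d) / numVertices)
      streamEdges { (src, dst) =>
        next(dst) += d * ranks(src) / outDegree(src)   // scatter along each streamed edge
      }
      ranks = next
    }
    ranks
  }
}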

82

Using RAM efficiently

• Assume that there is enough RAM to store many O(V) algorithm states in memory
  – But not enough to store the whole graph

[Figure: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state) in RAM.]

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
  – Store billions of random walk states in RAM
• Multiple Breadth-First Searches
  – Analyze neighborhood sizes by starting hundreds of random BFSes
• Compute in parallel many different recommender algorithms (or with different parameterizations)
  – See Mayank Mohta and Shu-Hao Yu's Master's project

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin) | UAI 2010
Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors) | VLDB 2012
GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch) | OSDI 2012
DrunkardMob: Billions of Random Walks on Just a PC | ACM RecSys 2013
Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch) | SEA 2014
GraphChi-DB: Simple Design for a Scalable Graph Database - on Just a PC (with C. Guestrin) | (submitted / arXiv)
Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin) | ICML 2011

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation: GraphChi
• Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database: GraphChi-DB
• Proposed DrunkardMob for simulating billions of random walks in parallel
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms: a new approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

 | GraphChi (40 Mac Minis) | PowerGraph (64 EC2 cc1.4xlarge)
Investments | 67,320 $ | -
Operating costs:
Per node / hour | 0.03 $ | 1.30 $
Cluster / hour | 1.19 $ | 52.00 $
Daily | 28.56 $ | 1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions:
- Mac Mini: 85 W (typical servers: 500-1000 W)
- Most expensive US energy: 35c / kWh
- Equal-throughput configurations (based on OSDI '12)

89

PSW for In-memory Computation

• External-memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM
• In-memory
  – Slow = RAM
  – Fast = CPU caches

[Figure: small (fast) memory of size M; large (slow) memory of size 'unlimited'; block transfers of size B between them; CPU next to the fast memory.]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra*, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

* Julian Shun, Guy Blelloch, PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space
• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP: see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)
• Asynchronous is sometimes difficult to reason about and debug
• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in the Aggarwal-Vitter I/O model
  – Worst case is only 2x the best case
• Intuition:

[Figure: vertices 1..|V| split into interval(1), interval(2), ..., interval(P), with shard(1), shard(2), ..., shard(P). An edge (v1, v2) whose endpoints are in the same interval is loaded from disk only once per iteration; an edge spanning two intervals is loaded twice per iteration.]
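Reading the intuition above into a rough bound (a back-of-the-envelope sketch, not the paper's exact theorem): with block size B (in edges) and P shards, every edge is read once or twice and written once or twice per iteration, plus the sliding windows cost on the order of P^2 non-sequential reads.

% Rough per-iteration I/O of PSW in the Aggarwal-Vitter model (sketch only):
\[
  \frac{2\,|E|}{B} \;\le\; \text{block transfers per iteration} \;\le\; \frac{4\,|E|}{B} + \Theta\!\left(P^2\right)
\]
% which is why the worst case is only about 2x the best case.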

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise they would require too many iterations
• Per-iteration cost is not much affected
• Can we optimize partitioning?
  – Could help thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)
• Compression is problematic for evolving graphs and associated data
• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• 1990s and 2000s saw interest in object-oriented and graph databases
  – GOOD, GraphDB, HyperGraphDB, …
  – Focus was on modeling; graph storage on top of a relational DB or key-value store
• RDF databases
  – Most do not use graph storage but store triples as relations + use indexing
• Modern solutions have proposed graph-specific storage
  – Neo4j: doubly linked list
  – TurboGraph: adjacency list chopped into pages
  – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time: 9 hours; FB's: 12 hours
• GraphChi database about 250 GB; FB: > 1.4 terabytes
  – However, about 100 GB is explained by different variable data (payload) size
• Facebook: MySQL via JDBC; GraphChi: embedded
  – But MySQL is native code; GraphChi-DB is Scala (JVM)
• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance / Size of DB

[Chart: requests / sec (0 to 10,000) as a function of the number of nodes (1E+07 to 1E+09).]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory extension [SSDAlloc, NSDI'11]
   – Too many small objects; need millions / sec
2. Compress the graph structure to fit into RAM [WebGraph framework]
   – Associated values do not compress well and are mutated
3. Cluster the graph and handle each cluster separately in RAM
   – Expensive: the number of inter-cluster edges is big
4. Caching of hot nodes
   – Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance.

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks: parallel I/O.

[Chart: seconds / iteration (0 to 3,000) for Pagerank and Connected components with 1, 2, and 3 hard drives.]

Experiment on a 16-core AMD server (from year 2007).

Bottlenecks

[Chart: Connected Components on Mac Mini (SSD); runtime in seconds (0 to 2,500) with 1, 2, and 4 threads, broken into Disk IO, Graph construction, and Exec. updates.]

• The cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM: memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime in seconds vs. number of threads (1, 2, 4), split into Loading and Computation, for Matrix Factorization (ALS) and Connected Components.]

104

In-memory vs Disk

105

Experiment Query latency

See thesis for I/O cost analysis of in/out queries.

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S
• Very fast in GraphChi-DB
  – Sufficient to query for out-edges
  – Parallelizes well: multi-out-edge-query
• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from the graph
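A sketch of the induced-subgraph query (Scala; outEdges is a stand-in for the database's out-edge query, not the real GraphChi-DB API):

object InducedSubgraph {
  // All edges with both endpoints in S: out-edge queries for the vertices of S suffice.
  def induced(s: Set[Long], outEdges: Long => Seq[Long]): Seq[(Long, Long)] =
    s.toSeq.flatMap(u => outEdges(u).filter(s.contains).map(v => (u, v)))

  // Example: induced(Set(1L, 2L, 3L), v => adjacency.getOrElse(v, Nil))
}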

Vertices / Nodes

• Vertices are partitioned similarly to edges
  – Similar "data shards" for columns
• Lookup/update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
  – Sparse structure could be supported

[Figure: vertex id intervals 1..a, a+1..2a, ..., up to Max-id.]

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511
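A sketch of this mapping in Scala (the chunk size of 256 is taken from the figure; the rest of the constants are assumptions): consecutive 256-id chunks of original ids are spread round-robin over intervals of length a, so that densely connected low ids do not all land in the same shard.

object IdMapping {
  val chunk = 256L

  def toInternal(originalId: Long, a: Long, numIntervals: Int): Long = {
    val c        = originalId / chunk            // which 256-id chunk
    val interval = c % numIntervals              // chunks round-robin over intervals
    val slot     = (c / numIntervals) * chunk + originalId % chunk
    interval * a + slot
  }

  def toOriginal(internalId: Long, a: Long, numIntervals: Int): Long = {
    val interval = internalId / a
    val slot     = internalId % a
    val c        = (slot / chunk) * numIntervals + interval
    c * chunk + slot % chunk
  }

  def main(args: Array[String]): Unit =          // round-trip sanity check
    (0L until 4096L).foreach(i => assert(toOriginal(toInternal(i, 1L << 20, 4), 1L << 20, 4) == i))
}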

109

What if we have a Cluster

[Figure: as in the semi-external setting, the graph working memory (PSW) is on disk and the computational state (Computation 1 state, Computation 2 state) is in RAM.]

Trade latency for throughput.

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. … which is caused by the lack of good data available to academics: big graphs with metadata
   – Industry co-operation? But problem with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just the structure)
3. Too much focus on performance. More important to enable "extracting value".

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java, 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems: "paging" from disk is costly.

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
  – Reverse thinking:

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk.
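A compact sketch of the same idea in Scala (illustrative only; the real DrunkardMob uses an optimized walk encoding and GraphChi's shard loading): walks are bucketed by the interval of their current vertex, and one interval's walks are advanced together while that interval's edges are in memory.

object DrunkardMobSketch {
  // walksAt(v):        indices of walks currently at vertex v (for vertices of the loaded interval).
  // randomNeighbor(v): pick a random out-neighbor of v from the loaded sub-graph (None if none).
  // moveWalk(w, u):    record that walk w's new position is u (re-bucketed by u's interval).
  def advanceInterval(verticesInInterval: Seq[Long],
                      walksAt: Long => Seq[Int],
                      randomNeighbor: Long => Option[Long],
                      moveWalk: (Int, Long) => Unit): Unit =
    for (v <- verticesInInterval; w <- walksAt(v); u <- randomNeighbor(v))
      moveWalk(w, u)   // only the current position of each walk is stored
}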

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" / Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW & External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM), Ref: O'Neil, Cheng et al. (1996)
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 36: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

37

GraphChi

bull C++ implementation 8000 lines of codendash Java-implementation also available

bull Several optimizations to PSW (see paper)

Source code and exampleshttpgithubcomgraphchi

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 37: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

38

Experiment Setting

bull Mac Mini (Apple Inc)ndash 8 GB RAMndash 256 GB SSD 1TB hard drivendash Intel Core i5 25 GHz

bull Experiment graphs

Graph Vertices Edges P (shards) Preprocessing

live-journal 48M 69M 3 05 min

netflix 05M 99M 20 1 min

twitter-2010 42M 15B 20 2 min

uk-2007-05 106M 37B 40 31 min

uk-union 133M 54B 50 33 min

yahoo-web 14B 66B 50 37 min

39

Comparison to Existing Systems

Notes comparison results do not include time to transfer the data to cluster preprocessing or the time to load the graph from disk GraphChi computes asynchronously while all but GraphLab synchronously

PageRank

See the paper for more comparisons

WebGraph Belief Propagation (U Kang et al)

Matrix Factorization (Alt Least Sqr) Triangle Counting

0 2 4 6 8 10 12

GraphLab v1 (8

cores)

GraphChi (Mac Mini)

Netflix (99B edges)

Minutes

0 2 4 6 8 10 12 14

Spark (50 machines)

GraphChi (Mac Mini)

Twitter-2010 (15B edges)

Minutes0 5 10 15 20 25 30

Pegasus Hadoop (100 ma-chines)

GraphChi (Mac Mini)

Yahoo-web (67B edges)

Minutes

0 50 100 150 200 250 300 350 400 450

Hadoop (1636 ma-

chines)

GraphChi (Mac Mini)

twitter-2010 (15B edges)

Minutes

On a Mac MiniGraphChi can solve as big problems as

existing large-scale systemsComparable performance

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

• Only sparse and implicit indices
  – Pointer-array usually fits in RAM with Elias-Gamma coding
  – Small database size
• Columnar data model
  – Load only the data you need
  – Graph structure is separate from the data
  – Property graph model
• Great insertion throughput with LSM
  – Tree can be adjusted to match the workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala
• Queries & computation
• Online database

All experiments shown in this talk were done on a Mac Mini (8 GB RAM, SSD).

Source code and examples: http://github.com/graphchi

65

Comparison: Database Size

[Bar chart: database file size for the twitter-2010 graph (1.5B edges), scale 0-70; bars for MySQL (data + indices), Neo4j, GraphChi-DB, and a baseline of 4 + 4 bytes per edge.]

66

Comparison: Ingest

System                 Time to ingest 1.5B edges
GraphChi-DB (online)   1 hour 45 mins
Neo4j (batch)          45 hours
MySQL (batch)          3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes.

Comparison: Friends-of-Friends Query

See thesis for the shortest-path comparison.

[Bar charts: 50th- and 99th-percentile friends-of-friends query latency in milliseconds, for GraphChi-DB and Neo4j on a small graph (68M edges) and for GraphChi-DB, Neo4j and MySQL on a big graph (1.5B edges); latency percentiles over 100K random queries.]

68

LinkBench: Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
  – But only single-hop queries ("friends")
  – 8 different operations, mixed workload
  – Best performance with 64 parallel threads
• Each edge and vertex has:
  – Version, timestamp, type, random string payload

                           GraphChi-DB (Mac Mini)    MySQL + FB patch (server, SSD-array, 144 GB RAM)
Edge-update (95p)          22 ms                     25 ms
Edge-get-neighbors (95p)   18 ms                     9 ms
Avg throughput             2,487 req/s               11,029 req/s
Database size              350 GB                    1.4 TB

See full results in the thesis

69

Summary of Experiments

• Efficient for mixed read/write workload
  – See Facebook LinkBench experiments in the thesis
  – LSM-tree trades off read performance (but can be adjusted)
• State-of-the-art performance for graphs that are much larger than RAM
  – Neo4j's linked-list data structure is good for RAM
  – DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater Impact / Hindsight

Future Research Questions

71

GREATER IMPACT

72

Impact: "Big Data" Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)
  – Two major direct descendant papers in top conferences:
    • X-Stream: SOSP 2013
    • TurboGraph: KDD 2013
• Challenging the mainstream
  – You can do a lot on just a PC: focus on the right data structures and computational models

73

Impact: Users

• GraphChi's implementations (C++, Java, -DB) have gained a lot of users
  – Currently ~50 unique visitors / day
• Enables 'everyone' to tackle big graph problems
  – Especially the recommender toolkit (by Danny Bickson) has been very popular
  – Typical users: students, non-systems researchers, small companies…

74

Impact: Users (cont.)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research - it is too expensive.

Using GraphChi I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research."

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for?

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models

1. Changing the value of an edge (both in- and out-edges)
2. Computation processes the whole graph, or most of it, on each iteration
3. Random access to all of a vertex's edges
   – Vertex-centric vs. edge-centric
4. Async / Gauss-Seidel execution

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
  – Or dynamic scheduling (such as Splash BP)
• High-diameter graphs, such as planar graphs
  – Unless the computation itself has short-range interactions
• Very large number of iterations
  – Neural networks
  – LDA with Collapsed Gibbs sampling
• No support for implicit graph structure

+ Single-PC performance is limited

78

Versatility of PSW and PAL

[Diagram: applications supported by GraphChi (Parallel Sliding Windows) and GraphChi-DB (Partitioned Adjacency Lists): online graph updates, batch computation, evolving graphs, incremental computation, DrunkardMob parallel random walk simulation, GraphChi^2; graph computation such as triangle counting, item-item similarity, PageRank, SALSA, HITS, graph contraction, minimum spanning forest, k-core, matrix factorization, loopy belief propagation, co-EM, weakly and strongly connected components, community detection, label propagation, multi-BFS; and graph queries such as shortest path, friends-of-friends, induced subgraphs, neighborhood queries, edge and vertex properties, link prediction, graph sampling, and graph traversal.]

79

Future Research Directions

• Distributed setting
  1. Distributed PSW (one shard / node)?
     1. PSW is inherently sequential
     2. Low bandwidth in the Cloud
  2. Co-operating GraphChi(-DB)'s connected with a Parameter Server
• New graph programming models and tools
  – Vertex-centric programming is sometimes too local: for example, two-hop interactions and many traversals are cumbersome
  – Abstractions for learning graph structure? Implicit graphs?
  – Hard to debug, especially async; better tools needed
• Graph-aware optimizations to GraphChi-DB
  – Buffer management
  – Smart caching
  – Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
  – Good: works with very little memory
  – Bad: does not benefit from more memory
• Simple trick: cache some data
• Many graph algorithms have O(V) state
  – The update function accesses neighbor vertex state
• Standard PSW: 'broadcast' the vertex value via edges
• Semi-external: store vertex values in memory

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
  – But not enough to store the whole graph

[Figure: the graph and PSW working memory stay on disk; computational state (Computation 1 state, Computation 2 state, ...) is kept in RAM.]

83
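A minimal sketch of the semi-external trick under the assumption that vertex ids are dense 0..numVertices-1 integers: one float of per-vertex state lives in RAM while edges stream from disk with PSW. The class and the toy averaging rule are illustrative, not the GraphChi API.

class SemiExternalState(numVertices: Int) {
  private val values = new Array[Float](numVertices)   // O(V) state, kept in RAM

  def get(v: Int): Float = values(v)
  def set(v: Int, x: Float): Unit = values(v) = x

  // An update function can read its neighbors' values directly from RAM,
  // instead of 'broadcasting' them along the on-disk edges.
  def update(v: Int, neighbors: Seq[Int]): Unit = {
    if (neighbors.nonEmpty) {
      val avg = neighbors.map(get).sum / neighbors.size   // toy rule, shows the access pattern
      set(v, 0.5f * get(v) + 0.5f * avg)
    }
  }
}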

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
  – Store billions of random walk states in RAM
• Multiple Breadth-First Searches
  – Analyze neighborhood sizes by starting hundreds of random BFSes
• Compute many different recommender algorithms in parallel (or the same one with different parameterizations)
  – See Mayank Mohta and Shu-Hao Yu's Master's project

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin). UAI 2010.
Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors). VLDB 2012.
GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch). OSDI 2012.
DrunkardMob: Billions of Random Walks on Just a PC. ACM RecSys 2013.
Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch). SEA 2014.
GraphChi-DB: Simple Design for a Scalable Graph Database - on Just a PC (with C. Guestrin). Submitted (arXiv).
Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin). ICML 2011.

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external memory graph computation: GraphChi
• Extended PSW to design Partitioned Adjacency Lists for building a scalable graph database: GraphChi-DB
• Proposed DrunkardMob for simulating billions of random walks in parallel
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms; a new approach for external-memory graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                   GraphChi (40 Mac Minis)   PowerGraph (64 EC2 cc1.4xlarge)
Investments        67,320 $                  -
Operating costs:
  Per node, hour   0.03 $                    1.30 $
  Cluster, hour    1.19 $                    52.00 $
  Daily            28.56 $                   1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions:
- Mac Mini: 85 W (typical servers: 500-1000 W)
- Most expensive US energy: 35c / kWh
- Equal-throughput configurations (based on OSDI'12)

89
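The 56-day figure follows from the investment divided by the difference in daily operating cost (a quick check of the table's numbers):

\[
\frac{\$67{,}320}{\$1{,}248.00/\text{day} - \$28.56/\text{day}} \approx 55.2\ \text{days} \approx 56\ \text{days}.
\]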

PSW for In-memory Computation

• External memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM
• In-memory
  – Slow = RAM
  – Fast = CPU caches

[Figure: a CPU with a small (fast) memory of size M and a large (slow) memory of 'unlimited' size, connected by block transfers of size B.]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra*, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

* Julian Shun, Guy Blelloch: PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space
• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP: see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)
• Asynchronous is sometimes difficult to reason about and debug
• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for a theoretical analysis in the Aggarwal-Vitter I/O model
  – Worst case only 2x the best case
• Intuition:

[Figure: vertex range 1..|V| split into interval(1), interval(2), ..., interval(P) with shard(1), shard(2), ..., shard(P). An edge whose endpoints lie in the same interval is loaded from disk only once per iteration; an edge spanning two intervals is loaded twice per iteration.]

93
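In symbols, the intuition above bounds the number of edge reads per iteration: with block size B, every edge is read either once or twice, so

\[
\frac{|E|}{B} \;\le\; \text{(blocks of edge data read per iteration)} \;\le\; \frac{2|E|}{B},
\]

which is why the worst case is only a factor of two from the best case.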

Impact of Graph Structure

• Algorithms with long-range information propagation need graphs of relatively small diameter; otherwise too many iterations would be required
• Per-iteration cost is not much affected by structure
• Can we optimize partitioning?
  – Could help thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., the WebGraph framework) can compress edges to 3-4 bits/edge (web graphs), ~10 bits/edge (social graphs)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)
• Compression is problematic for evolving graphs and associated data
• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases
  – GOOD, GraphDB, HyperGraphDB…
  – Focus was on modeling; graph storage was built on top of a relational DB or key-value store
• RDF databases
  – Most do not use graph storage, but store triples as relations + use indexing
• Modern solutions have proposed graph-specific storage
  – Neo4j: doubly linked list
  – TurboGraph: adjacency list chopped into pages
  – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi-DB load time: 9 hours; FB's: 12 hours
• GraphChi-DB database: about 250 GB; FB: > 1.4 terabytes
  – However, about 100 GB is explained by a different variable-data (payload) size
• Facebook: MySQL via JDBC; GraphChi-DB: embedded
  – But MySQL is native code, GraphChi-DB Scala (JVM)
• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance / size of DB

[Chart: requests/sec (0 to 10,000) versus number of nodes (1.0E7 to 1.0E9).]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory-extension [SSDAlloc, NSDI'11]
   - Too many small objects; need millions / sec
2. Compress the graph structure to fit into RAM [WebGraph framework]
   - Associated values do not compress well and are mutated
3. Cluster the graph and handle each cluster separately in RAM
   - Expensive: the number of inter-cluster edges is big
4. Caching of hot nodes
   - Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance.

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks: parallel I/O.

[Chart: seconds per iteration (0 to 3000) for Pagerank and Connected components with 1, 2, and 3 hard drives. Experiment on a 16-core AMD server (from year 2007).]

Bottlenecks

[Chart: Connected Components on a Mac Mini (SSD); runtime (0 to 2500 s) broken into Disk IO, Graph construction, and Exec updates, for 1, 2, and 4 threads.]

• The cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM; memory bandwidth becomes a bottleneck

Bottlenecks / Multicore

Experiment on a MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution.
• GraphChi saturates SSD I/O with 2 threads.

[Charts: runtime in seconds versus number of threads (1, 2, 4), broken into Loading and Computation, for Matrix Factorization (ALS) and Connected Components.]

104

In-memory vs Disk

105

Experiment: Query latency

See thesis for I/O cost analysis of in/out queries.

106

Example: Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S
• Very fast in GraphChi-DB
  – Sufficient to query for out-edges
  – Parallelizes well: multi-out-edge-query
• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from the graph
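A sketch of the induced-subgraph query expressed on top of an out-edge query, as described above. outEdges is a stand-in for the database's (multi-)out-edge query, not an actual GraphChi-DB method name.

// All edges of the graph with both endpoints in S, obtained with out-edge queries only.
def inducedSubgraph(s: Set[Long], outEdges: Long => Seq[Long]): Seq[(Long, Long)] =
  s.toSeq.flatMap(v => outEdges(v).filter(s.contains).map(dst => (v, dst)))

// In GraphChi-DB the per-vertex lookups would be issued as one parallel
// multi-out-edge query; here they are just sequential calls.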

Vertices / Nodes

• Vertices are partitioned similarly to edges
  – Similar "data shards" for columns
• Lookup/update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
  – A sparse structure could be supported

[Figure: vertex id range split into intervals 1..a, a+1..2a, ..., up to Max-id.]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance the shards
  – Interval length is a constant a

[Table: original IDs 0, 1, 2, ..., 255 map to internal IDs 0, a, 2a, ...; original IDs 256, 257, 258, ..., 511 map to 1, a+1, 2a+1, ...]

109
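A sketch of one mapping consistent with the table above: original ids are dealt round-robin over the intervals (interval length a), so consecutive original ids land in different shards and the shards stay balanced. The exact formula GraphChi-DB uses may differ; this is only my reading of the figure.

// numIntervals * a is the capacity of the id space (e.g. numIntervals = 256 above).
def toInternal(origId: Long, numIntervals: Long, a: Long): Long =
  (origId % numIntervals) * a + (origId / numIntervals)

def toOriginal(internalId: Long, numIntervals: Long, a: Long): Long =
  (internalId % a) * numIntervals + (internalId / a)

// With numIntervals = 256: originals 0, 1, 2, ... map to 0, a, 2a, ...,
// and originals 256, 257, 258, ... map to 1, a + 1, 2a + 1, ..., as in the table.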

What if we have a Cluster?

[Figure: the graph and PSW working memory stay on disk; computational state (Computation 1 state, Computation 2 state, ...) is kept in RAM.]

Trade latency for throughput!

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. … which is caused by the lack of good data available to academics: big graphs with metadata
   – Industry co-operation? But there is a problem with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just the structure)
3. Too much focus on performance; more important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())
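A runnable toy version of the baseline above, with the graph as a plain adjacency array. The names follow the pseudocode; since the walks are independent, the outer loop parallelizes trivially (the 'parfor').

import scala.util.Random

def simulateWalks(adj: Array[Array[Int]], starts: Array[Int], numSteps: Int,
                  rnd: Random = new Random(42)): Array[Int] = {
  val position = starts.clone()                       // current vertex of each walk
  for (w <- position.indices; _ <- 0 until numSteps) {
    val nbrs = adj(position(w))
    if (nbrs.nonEmpty)
      position(w) = nbrs(rnd.nextInt(nbrs.length))    // takeStep(randomNeighbor())
  }
  position                                            // final vertex of each walk
}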

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

[Image: Twitter network visualization by Akshay Java, 2009.]

Distributed graph systems: each hop across a partition boundary is costly.
Disk-based "single-machine" graph systems: "paging" from disk is costly.
(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
  – Reverse thinking:

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk.

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi



Page 39: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

40

PowerGraph Comparisonbull PowerGraph GraphLab

2 outperforms previous systems by a wide margin on natural graphs

bull With 64 more machines 512 more CPUsndash Pagerank 40x faster than

GraphChindash Triangle counting 30x

faster than GraphChi

OSDIrsquo12

GraphChi has good performance CPU

2

vs

GraphChi

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
– But not enough to store the whole graph

[Diagram: the graph and the PSW working memory stay on disk, while RAM holds the computational state of Computation 1, Computation 2, …]

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
– Store billions of random-walk states in RAM

• Multiple Breadth-First Searches
– Analyze neighborhood sizes by starting hundreds of random BFSes (see the sketch below)

• Compute in parallel many different recommender algorithms (or the same one with different parameterizations)
– See Mayank Mohta and Shu-Hao Yu's Master's project
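
A small sketch of the multi-BFS idea under the same semi-external assumption (illustrative names, not an actual GraphChi API): each vertex keeps a bit mask in RAM where bit i means "reached by BFS number i"; one pass per BFS level ORs the neighbors' masks into the vertex, and neighborhood sizes are read off by counting, per level, how many vertices gained each bit.

  object MultiBfsSketch {
    // Up to 64 simultaneous BFSes fit in one Long per vertex; hundreds of BFSes
    // simply need a few Longs per vertex.
    def makeUpdater(numVertices: Int, sources: Seq[Int]): (Int, Array[Int]) => Unit = {
      val reached = new Array[Long](numVertices)         // O(V) state in RAM
      sources.zipWithIndex.foreach { case (src, i) =>
        reached(src) = reached(src) | (1L << i)          // seed each BFS at its source
      }
      (vertexId: Int, inNeighbors: Array[Int]) => {
        var mask = reached(vertexId)
        for (nb <- inNeighbors) mask |= reached(nb)      // inherit every BFS that reached a neighbor
        reached(vertexId) = mask                         // iterate until no mask changes
      }
    }
  }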

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin) – UAI 2010

Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors) – VLDB 2012

GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch) – OSDI 2012

DrunkardMob: Billions of Random Walks on Just a PC – ACM RecSys 2013

Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch) – SEA 2014

GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin) – submitted (arXiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin) – ICML 2011

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation: GraphChi

• Extended PSW to design Partitioned Adjacency Lists, used to build a scalable graph database: GraphChi-DB

• Proposed DrunkardMob for simulating billions of random walks in parallel

• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms: a new approach for external-memory graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                        GraphChi (40 Mac Minis)   PowerGraph (64 EC2 cc1.4xlarge)
Investment              $67,320                   -
Operating costs:
  Per node / hour       $0.03                     $1.30
  Cluster / hour        $1.19                     $52.00
  Daily                 $28.56                    $1,248.00

It takes about 56 days to recoup the Mac Mini investment (a quick check of this figure follows below).

Assumptions: Mac Mini 85 W (typical servers 500-1000 W); most expensive US energy, 35 c / kWh.

Equal-throughput configurations (based on OSDI '12).
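
Using the numbers above, the daily operating-cost difference is 1,248.00 − 28.56 = 1,219.44 $/day, so the hardware investment is recovered in roughly

  67,320 / 1,219.44 ≈ 55.2 days ≈ 56 days

of continuous operation.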

89

PSW for In-memory Computation

• External-memory setting:
– Slow memory = hard disk / SSD
– Fast memory = RAM

• In-memory:
– Slow = RAM
– Fast = CPU caches

[Diagram: a small (fast) memory of size M, a large (slow) memory of 'unlimited' size, and the CPU, connected by block transfers of size B]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra*, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

* Ligra: Julian Shun, Guy Blelloch; PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
– But needs twice the amount of space

• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
– Loopy BP: see Gonzalez et al. (2009)
– Also Bertsekas & Tsitsiklis, Parallel and Distributed Computation (1989)

• Asynchronous execution is sometimes difficult to reason about and debug

• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in Aggarwal and Vitter's I/O model
– Worst case is only 2x the best case

• Intuition:

[Diagram: vertex intervals interval(1) … interval(P) over vertex ids 1 … |V|, with the corresponding shards shard(1) … shard(P)]

An edge whose endpoints fall in the same interval is loaded from disk only once per iteration; an edge spanning two intervals is loaded twice per iteration (a rough accounting follows below).
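
A back-of-the-envelope version of that intuition (a sketch only; the formal bounds are in the paper): let e_same be the number of edges whose endpoints are in the same interval and e_span the number of edges spanning two intervals. Then

  edge loads per iteration = e_same + 2·e_span,  with  |E| ≤ e_same + 2·e_span ≤ 2·|E|,

i.e. roughly between |E|/B and 2·|E|/B block transfers for block size B – which is where "worst case only 2x best case" comes from.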

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise they would require too many iterations

• Per-iteration cost is not much affected

• Can we optimize partitioning?
– Could help thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort
– Likely too expensive to do on a single PC

94

Graph Compression Would it help

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
– But they require graph partitioning, which requires a lot of memory
– Compression of large graphs can take days (personal communication)

• Compression is problematic for evolving graphs and associated data

• GraphChi can be used to compress graphs
– Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases
– GOOD, GraphDB, HyperGraphDB…
– Focus was on modeling; graph storage was built on top of a relational DB or key-value store

• RDF databases
– Most do not use graph storage, but store triples as relations + use indexing

• Modern solutions have proposed graph-specific storage
– Neo4j: doubly linked list
– TurboGraph: adjacency list chopped into pages
– DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time 9 hours; FB's 12 hours
• GraphChi database about 250 GB; FB > 1.4 terabytes
– However, about 100 GB is explained by different variable-data (payload) size

• Facebook: MySQL via JDBC; GraphChi-DB embedded
– But MySQL is native code, GraphChi-DB is Scala (JVM)

• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance vs. size of DB

[Figure: LinkBench requests/sec as a function of the number of nodes (1e7, 1e8, 2e8, 1e9); y-axis 0-10,000 requests/sec]

Why is it not even faster when everything is in RAM? The requests are computationally bound.

Possible Solutions

1. Use SSD as a memory extension [SSDAlloc, NSDI '11]
   – Too many small objects; need millions / sec

2. Compress the graph structure to fit into RAM [WebGraph framework]
   – Associated values do not compress well, and are mutated

3. Cluster the graph and handle each cluster separately in RAM
   – Expensive: the number of inter-cluster edges is big

4. Caching of hot nodes
   – Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks: parallel I/O

[Figure: seconds / iteration (0-3000) for Pagerank and Connected Components with 1, 2, and 3 hard drives]

Experiment on a 16-core AMD server (from year 2007).

Bottlenecks

[Figure: Connected Components on Mac Mini with SSD; runtime (0-2500 s) with 1, 2, and 4 threads, broken into Disk IO, Graph construction, and Exec. updates]

• Cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
– Graph construction requires a lot of random access in RAM; memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution

• GraphChi saturates SSD I/O with 2 threads

[Figure: runtime (seconds) vs. number of threads (1, 2, 4), split into Loading and Computation, for Matrix Factorization (ALS, 0-160 s) and Connected Components (0-1000 s)]

104

In-memory vs Disk

105

Experiment Query latency

See thesis for an I/O cost analysis of in/out queries.

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB
– Sufficient to query for out-edges
– Parallelizes well: multi-out-edge-query (see the sketch below)

• Can be used for statistical graph analysis
– Sample induced neighborhoods, induced FoF neighborhoods, from the graph
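
A small sketch of why out-edge queries suffice, with a hypothetical outEdges lookup standing in for the real query API: for every vertex of S, fetch its out-edges and keep those whose destination is also in S; every edge with both endpoints in S is then reported exactly once, at its source.

  object InducedSubgraphSketch {
    // Returns the induced subgraph of S as (src, dst) pairs.
    // outEdges is assumed to return the out-neighbors of a vertex (one query each;
    // these lookups can also be issued as one batched multi-out-edge-query).
    def induced(s: Set[Long], outEdges: Long => Seq[Long]): Seq[(Long, Long)] =
      s.toSeq.flatMap { src =>
        outEdges(src).collect { case dst if s.contains(dst) => (src, dst) }
      }
  }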

Vertices / Nodes
• Vertices are partitioned similarly as edges
– Similar "data shards" for columns
• Lookup / update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
– Sparse structure could be supported

[Diagram: the vertex-id range 1 … max-id split into dense intervals of length a: 1…a, a+1…2a, …]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance the shards
– Interval length is a constant a

[Table: original IDs 0, 1, 2, …, 255 map to internal IDs 0, 1, 2, …; original IDs 256, 257, 258, …, 511 map to a, a+1, a+2, …; the next block maps to 2a, 2a+1, …; a sketch of this mapping follows below]
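
A hypothetical reconstruction of the mapping pictured above (the 256-id block size and the formula are read off the slide and are an assumption, not the exact GraphChi-DB code): consecutive blocks of original ids are dealt round-robin to the P intervals of length a, so that low – and often popular – ids spread over all shards.

  object IdMappingSketch {
    def toInternal(origId: Long, p: Int, a: Long): Long = {
      val block    = origId / 256                 // which 256-id block
      val offset   = origId % 256                 // position inside the block
      val interval = block % p                    // round-robin target interval
      interval * a + (block / p) * 256 + offset   // dense position inside that interval
    }
  }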

109

What if we have a Cluster

[Diagram: on each machine the graph and the PSW working memory stay on disk, while RAM holds the state of Computation 1, Computation 2, …]

Trade latency for throughput!

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. … which is caused by a lack of good data available to academics: big graphs with metadata
– Industry co-operation? But problem with reproducibility
– Also, it is hard to ask good questions about graphs (especially with just the structure)

3. Too much focus on performance – more important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

  parfor walk in walks:
    for i = 1 to numsteps:
      vertex = walk.atVertex()
      walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems: "paging" from disk is costly.

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm – reverse thinking:

  ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
      mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
      ForEach walk in mywalks:
        walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk (see the packing sketch below)!
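
One way billions of walk states can be kept in RAM, as a hedged illustration (this particular packing is an assumption for the sketch, not necessarily DrunkardMob's exact encoding): each walk is a single Long holding its source index and its current vertex, so hundreds of millions of walks fit in a few gigabytes.

  object WalkStateSketch {
    // Pack (sourceIndex, currentVertex) into one Long: high 24 bits = source index,
    // low 40 bits = current vertex id. Purely illustrative field widths.
    def pack(sourceIndex: Int, currentVertex: Long): Long =
      (sourceIndex.toLong << 40) | (currentVertex & ((1L << 40) - 1))

    def sourceOf(walk: Long): Int  = (walk >>> 40).toInt
    def vertexOf(walk: Long): Long = walk & ((1L << 40) - 1)

    // Advance a walk by one hop.
    def hop(walk: Long, nextVertex: Long): Long = pack(sourceOf(walk), nextVertex)
  }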

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" / Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW & External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM), Ref: O'Neil, Cheng et al. (1996)
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact: "Big Data" Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 40: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

41

In-memory Comparison

bull Total runtime comparison to 1-shard GraphChi with initial load + output write taken into account

GraphChi Mac Mini ndash SSD 790 secs

Ligra (J Shun Blelloch) 40-core Intel E7-8870 15 secs

bull Comparison to state-of-the-artbull - 5 iterations of Pageranks Twitter (15B edges)

However sometimes better algorithm available for in-memory than external memory distributed

Ligra (J Shun Blelloch) 8-core Xeon 5550 230 s + preproc 144 s

PSW ndash inmem version 700 shards (see Appendix)

8-core Xeon 5550 100 s + preproc 210 s

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 41: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

42

Scalability Input Size [SSD]

bull Throughput number of edges processed second

Conclusion the throughput remains roughly constant when graph size is increased

GraphChi with hard-drive is ~ 2x slower than SSD (if computational cost low)

Graph size

000E+00 100E+09 200E+09 300E+09 400E+09 500E+09 600E+09 700E+09000E+00

500E+06

100E+07

150E+07

200E+07

250E+07

domain

twitter-2010

uk-2007-05 uk-union

yahoo-web

PageRank -- throughput (Mac Mini SSD)

Perf

orm

an

ce

Paper scalability of other applications

43

GRAPHCHI-DBNew work

44

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, …) in RAM]

Trade latency for throughput

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. … which is caused by lack of good data available to academics: big graphs with metadata
   – Industry co-operation? But problem with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just structure)
3. Too much focus on performance. More important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

  parfor walk in walks:
      for i = 1 to numsteps:
          vertex = walk.atVertex()
          walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems: "paging" from disk is costly.

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm – reverse thinking:

  ForEach interval p:
      walkSnapshot = getWalksForInterval(p)
      ForEach vertex in interval(p):
          mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
          ForEach walk in mywalks:
              walkManager.addHop(walk, vertex.randomNeighbor())

Note: need to store only the current position of each walk.
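A minimal in-memory sketch of the same idea (illustrative, not the actual GraphChi implementation): only the current position of each walk is stored, and walks are advanced interval by interval. In GraphChi the marked step is where the interval's subgraph is loaded from disk with PSW instead of indexing an in-memory array.

  import scala.util.Random

  object DrunkardMobSketch {
    /** Advance numWalksPerVertex walks per vertex for numSteps hops.
      * outEdges is a plain in-memory adjacency here only to keep the sketch
      * self-contained. Returns the current vertex of every walk. */
    def simulate(outEdges: Array[Array[Int]],
                 numWalksPerVertex: Int,
                 numSteps: Int,
                 intervalSize: Int,
                 rng: Random = new Random(42)): Array[Int] = {
      val n = outEdges.length
      // One Int per walk: the vertex the walk currently sits on.
      val walkPos = Array.tabulate(n * numWalksPerVertex)(w => w % n)

      for (_ <- 0 until numSteps) {
        val next = walkPos.clone()
        for (intervalStart <- 0 until n by intervalSize) {
          val intervalEnd = math.min(intervalStart + intervalSize, n)
          // In GraphChi, interval [intervalStart, intervalEnd) is loaded from disk here (PSW).
          // A real implementation buckets walks by interval instead of scanning all of them.
          for (w <- walkPos.indices) {
            val v = walkPos(w)
            if (v >= intervalStart && v < intervalEnd && outEdges(v).nonEmpty)
              next(w) = outEdges(v)(rng.nextInt(outEdges(v).length))
          }
        }
        Array.copy(next, 0, walkPos, 0, next.length)
      }
      walkPos
    }
  }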

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW & External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data & Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref O'Neil, Cheng et al. (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact "Big Data" Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 42: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

43

GRAPHCHI-DB
New work

44

[Diagram: the spectrum from Graph Computation to Graph Queries, covered by GraphChi (Parallel Sliding Windows) and GraphChi-DB (Partitioned Adjacency Lists): online graph updates, batch computation, evolving graphs, incremental computation, DrunkardMob (parallel random walk simulation), GraphChi^2. Applications shown: Triangle Counting, Item-Item Similarity, PageRank, SALSA, HITS, Graph Contraction, Minimum Spanning Forest, k-Core, Shortest Path, Friends-of-Friends, Induced Subgraphs, Neighborhood query, Edge and vertex properties, Link prediction, Weakly Connected Components, Strongly Connected Components, Community Detection, Label Propagation, Multi-BFS, Graph sampling, Matrix Factorization, Loopy Belief Propagation, Co-EM, Graph traversal]

45

Research Questions

• What if there is a lot of metadata associated with edges and vertices?
• How to do graph queries efficiently while retaining computational capabilities?
• How to add edges efficiently to the graph?

Can we design a graph database based on GraphChi?

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problems:
• Poor performance with data >> memory
• No/weak support for analytical computation

2) Relational / key-value databases as graph storage

Problems:
• Large indices
• In-edge / out-edge dilemma
• No/weak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

[Diagram: the vertex-id range 0 … 100, 101 … 700, …, N is split across Shard 1, Shard 2, Shard 3, …, Shard P; each shard is sorted by source_id]

Edge = (src, dst). Partition by dst.

49

Shard Structure (Basic)

Source | Destination
1      | 8
1      | 193
1      | 76420
3      | 12
3      | 872
7      | 193
7      | 212
7      | 89139
…      | …

(edges (src, dst), sorted by src)

50

Shard Structure (Basic)

Edge-array (Destination): 8, 193, 76420, 12, 872, 193, 212, 89139, …

Pointer-array (Source → file offset): 1 → 0, 3 → 3, 7 → 5, …

Compressed Sparse Row (CSR)

Problem 1: How to find the in-edges of a vertex quickly?
Note: we know the shard, but the edges are in random order.
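As a concrete illustration, a minimal CSR-style lookup over the two arrays above (a sketch assuming the pointer-array holds (source id, offset) pairs sorted by source id; this is not the actual GraphChi-DB code).

  object CsrShard {
    /** pointerArray(i) = (sourceId, offset of its first edge in edgeArray), sorted by sourceId.
      * edgeArray holds destination ids, grouped by source. */
    final case class Shard(pointerArray: Array[(Long, Int)], edgeArray: Array[Long]) {
      /** Out-edges of `src` stored in this shard (empty if none). */
      def outEdges(src: Long): Array[Long] = {
        val i = pointerArray.indexWhere(_._1 == src)   // binary search in practice
        if (i < 0) Array.empty[Long]
        else {
          val start = pointerArray(i)._2
          val end   = if (i + 1 < pointerArray.length) pointerArray(i + 1)._2 else edgeArray.length
          edgeArray.slice(start, end)
        }
      }
    }
  }

With the example data above, Shard(Array((1L, 0), (3L, 3), (7L, 5)), Array(8L, 193L, 76420L, 12L, 872L, 193L, 212L, 89139L)).outEdges(3L) returns Array(12, 872).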

51

PAL In-edge Linkage

Edge-array (Destination): 8, 193, 76420, 12, 872, 193, 212, 89139, …
Pointer-array (Source → file offset): 1 → 0, 3 → 3, 7 → 5, …

52

PAL In-edge Linkage

Edge-array (Destination, Link to next in-edge): (8, 3339), (193, 3), (76420, 1092), (12, 289), (872, 40), (193, 2002), (212, 12), (89139, 22), …

Pointer-array (Source → file offset): 1 → 0, 3 → 3, 7 → 5, …

+ Index to the first in-edge for each vertex in the interval

Augmented linked list for in-edges.

Problem 2: How to find out-edges quickly?
Note: sorted inside a shard, but partitioned across all shards.
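A minimal sketch of following such an in-edge chain (the firstInEdge index and the -1 sentinel are illustrative assumptions):

  object InEdgeChain {
    /** edgeSrc and nextLink are parallel arrays with one entry per edge in the shard:
      * nextLink(e) is the position of the next in-edge of the same destination, or -1.
      * firstInEdge maps a vertex to the position of its first in-edge in this shard. */
    def inNeighbors(v: Long,
                    firstInEdge: Map[Long, Int],
                    edgeSrc: Array[Long],
                    nextLink: Array[Int]): List[Long] = {
      var e   = firstInEdge.getOrElse(v, -1)
      var acc = List.empty[Long]
      while (e != -1) {          // follow the per-vertex linked list of in-edges
        acc = edgeSrc(e) :: acc
        e = nextLink(e)
      }
      acc
    }
  }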

53

Pointer-array (Source → file offset): 1 → 0, 3 → 3, 7 → 5, …

PAL Out-edge Queries

Problem: the pointer-array can be big -- O(V). Binary search on disk is slow.

[Diagram: edge-array of (Destination, Next-in-offset) pairs as above; sparse pointer-array samples at offsets 256, 512, 1024, …]

Option 1: Sparse index (in memory)
Option 2: Delta-coding with unary code (Elias-Gamma). Completely in-memory.
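For reference, a small self-contained sketch of Elias-Gamma coding of positive integers (a generic implementation of the classic code; the deltas between successive pointer-array offsets are what would be encoded, but this is not GraphChi-DB's code):

  import scala.collection.mutable.ArrayBuffer

  object EliasGamma {
    /** Encode each n >= 1 as floor(log2 n) zero bits followed by n in binary. */
    def encode(values: Seq[Int]): Vector[Boolean] = {
      val bits = ArrayBuffer.empty[Boolean]
      for (n <- values) {
        require(n >= 1, "Elias-Gamma encodes positive integers")
        val width = 31 - Integer.numberOfLeadingZeros(n)   // floor(log2 n)
        for (_ <- 0 until width) bits += false
        for (i <- width to 0 by -1) bits += (((n >> i) & 1) == 1)
      }
      bits.toVector
    }

    def decode(bits: IndexedSeq[Boolean]): Vector[Int] = {
      val out = ArrayBuffer.empty[Int]
      var i = 0
      while (i < bits.length) {
        var width = 0
        while (!bits(i)) { width += 1; i += 1 }             // count leading zeros
        var n = 0
        for (_ <- 0 to width) { n = (n << 1) | (if (bits(i)) 1 else 0); i += 1 }
        out += n
      }
      out.toVector
    }
  }

For example, encode(Seq(5)) yields the bits 0,0,1,0,1 and decode recovers 5.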

54

Experiment Indices

Median latency, Twitter graph, 1.5B edges

55

Queries: IO costs

In-edge query: only one shard.
Out-edge query: each shard that has edges.

Trade-off: more shards → better locality for in-edge queries, worse for out-edge queries.

Edge Data & Searches

Shard X: adjacency; 'weight' [float]; 'timestamp' [long]; 'belief' (factor) -- one column per edge property, addressed by the edge's position (i).

No foreign key required to find edge data (fixed size).
Note: vertex values are stored similarly.
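Because each column stores fixed-size values in edge order, the byte offset of edge i's value is simply i times the value size. A minimal sketch (illustrative file layout, not the actual GraphChi-DB format):

  import java.io.RandomAccessFile

  object EdgeColumn {
    /** Read the float value of edge i from a fixed-size column file.
      * The edge's position in the shard is the implicit key: no foreign key needed. */
    def readWeight(columnFile: RandomAccessFile, i: Long): Float = {
      columnFile.seek(i * 4L)      // 4 bytes per float value
      columnFile.readFloat()
    }
  }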

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading the existing shard from disk. Each edge is rewritten on every merge.

Does not scale: number of rewrites ≈ data size / buffer size = O(E).

60

Log-Structured Merge-tree

(LSM) Ref: O'Neil, Cheng et al. (1996)

[Diagram: LEVEL 1 (youngest): top shards with in-memory buffers, one per interval (interval 1, interval 2, …, interval P), receiving new edges; LEVEL 2: on-disk shards covering intervals 1-4, …, (P-4)-P; LEVEL 3 (oldest): shards covering intervals 1-16, …; merges flow downstream]

In-edge query: one shard on each level.
Out-edge query: all shards.
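A minimal sketch of the downstream-merge idea (level sizes and the merge trigger are illustrative assumptions, and real shards live on disk rather than in vectors): new edges go to the youngest level; when a level fills up, its edges are merged into the next, older level.

  object LsmSketch {
    type Edge = (Long, Long)                        // (src, dst)

    /** levels(0) is the youngest (the in-memory buffer); older levels are kept
      * sorted by destination, like shards. maxSize grows with the level so most
      * edges are rewritten only a few times. */
    final class Lsm(maxSize: Int => Int, numLevels: Int) {
      private val levels = Array.fill(numLevels)(Vector.empty[Edge])

      def insert(e: Edge): Unit = {
        levels(0) = levels(0) :+ e
        var k = 0
        while (k < numLevels - 1 && levels(k).length > maxSize(k)) {
          // Downstream merge: fold the full level into the next (older) one.
          levels(k + 1) = (levels(k + 1) ++ levels(k)).sortBy(x => (x._2, x._1))
          levels(k) = Vector.empty
          k += 1
        }
      }

      /** An in-edge query touches one "shard" per level; an out-edge query scans all. */
      def inEdges(v: Long): Seq[Edge] = levels.toSeq.flatMap(_.filter(_._2 == v))
    }
  }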

61

Experiment Ingest

[Chart: edges ingested (0 to 1.5E+09) vs. time in seconds (0-16,000), for GraphChi-DB with LSM and GraphChi-No-LSM]

62

Advantages of PAL

• Only sparse and implicit indices
  – Pointer-array usually fits in RAM with Elias-Gamma → small database size
• Columnar data model
  – Load only the data you need
  – Graph structure is separate from data
  – Property graph model
• Great insertion throughput with LSM
  – Tree can be adjusted to match the workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala
• Queries & Computation
• Online database

All experiments shown in this talk done on a Mac Mini (8 GB RAM, SSD).

Source code and examples: http://github.com/graphchi

65

Comparison Database Size

Baseline: 4 + 4 bytes / edge

[Bar chart: database file size (0-70) for MySQL (data + indices), Neo4j, GraphChi-DB, and the BASELINE; twitter-2010 graph, 1.5B edges]

66

Comparison Ingest

System               | Time to ingest 1.5B edges
GraphChi-DB (ONLINE) | 1 hour 45 mins
Neo4j (batch)        | 45 hours
MySQL (batch)        | 3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes.

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

[Bar charts, friends-of-friends query latency in milliseconds:
  Small graph (68M edges), 50th percentile: GraphChi-DB 0.379, Neo4j 0.127
  Small graph (68M edges), 99th percentile: GraphChi-DB 6.653, Neo4j 8.078
  Big graph (1.5B edges), 50th percentile: GraphChi-DB 22.4, Neo4j 75.98, MySQL 59
  Big graph (1.5B edges), 99th percentile: GraphChi-DB 1264, Neo4j 1631, MySQL 4776]

Latency percentiles over 100K random queries.

68

LinkBench Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
  – But only single-hop queries ("friends")
  – 8 different operations, mixed workload
  – Best performance with 64 parallel threads
• Each edge and vertex has
  – Version, timestamp, type, random string payload

                         | GraphChi-DB (Mac Mini) | MySQL + FB patch, server, SSD-array, 144 GB RAM
Edge-update (95p)        | 22 ms                  | 25 ms
Edge-get-neighbors (95p) | 18 ms                  | 9 ms
Avg throughput           | 2,487 req / s          | 11,029 req / s
Database size            | 350 GB                 | 1.4 TB

See full results in the thesis.

69

Summary of Experiments

• Efficient for mixed read/write workload
  – See Facebook LinkBench experiments in thesis
  – LSM-tree trades off read performance (but can adjust)
• State-of-the-art performance for graphs that are much larger than RAM
  – Neo4J's linked-list data structure is good for RAM
  – DEX performs poorly in practice

More experiments in the thesis.

70

Discussion

Greater Impact / Hindsight

Future Research Questions

71

GREATER IMPACT

72

Impact: "Big Data" Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)
  – Two major direct descendant papers in top conferences:
    • X-Stream: SOSP 2013
    • TurboGraph: KDD 2013
• Challenging the mainstream
  – You can do a lot on just a PC: focus on the right data structures, computational models

73

Impact Users

• GraphChi's variants (C++, Java, -DB) have gained a lot of users
  – Currently ~50 unique visitors / day
• Enables 'everyone' to tackle big graph problems
  – Especially the recommender toolkit (by Danny Bickson) has been very popular
  – Typical users: students, non-systems researchers, small companies…

74

Impact Users (cont)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research - it is too expensive.

Using GraphChi I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research. ()"

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models

1. Changing value of an edge (both in- and out-)
2. Computation processes the whole or most of the graph on each iteration
3. Random access to all of a vertex's edges
   – Vertex-centric vs. edge-centric
4. Async / Gauss-Seidel execution

[Diagram: edge between vertices u and v]

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
  – Or dynamic scheduling (such as Splash BP)
• High-diameter graphs, such as planar graphs
  – Unless the computation itself has short-range interactions
• Very large number of iterations
  – Neural networks
  – LDA with Collapsed Gibbs sampling
• No support for implicit graph structure

+ Single PC performance is limited

78

[Diagram, repeated from slide 44: the Graph Computation ↔ Graph Queries spectrum covered by GraphChi (Parallel Sliding Windows) and GraphChi-DB (Partitioned Adjacency Lists), spanning batch, evolving-graph and incremental computation, online graph updates, DrunkardMob and GraphChi^2, with the same applications listed there]

Versatility of PSW and PAL

79

Future Research Directions

• Distributed setting
  1. Distributed PSW (one shard / node)?
     1. PSW is inherently sequential
     2. Low bandwidth in the Cloud
  2. Co-operating GraphChi(-DB)'s connected with a Parameter Server
• New graph programming models and tools
  – Vertex-centric programming sometimes too local: for example, two-hop interactions and many traversals are cumbersome
  – Abstractions for learning graph structure? Implicit graphs?
  – Hard to debug, especially async. Better tools needed
• Graph-aware optimizations to GraphChi-DB
  – Buffer management
  – Smart caching
  – Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The IO performance of PSW is only weakly affected by the amount of RAM
  – Good: works with very little memory
  – Bad: does not benefit from more memory
• Simple trick: cache some data
• Many graph algorithms have O(V) state
  – Update function accesses neighbor vertex state
    • Standard PSW: 'broadcast' vertex value via edges
    • Semi-external: store vertex values in memory
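A minimal sketch of the semi-external idea (illustrative, not the GraphChi API): keep one O(V) array of vertex values in RAM and stream the edges, so neighbor values are read directly from the array instead of being broadcast along edges.

  object SemiExternalPagerank {
    /** One Pagerank iteration with O(V) state in RAM; edges arrive as a stream,
      * e.g. read sequentially from disk shards. Each edge (src, dst) contributes
      * src's rank share to dst. */
    def iterate(numVertices: Int,
                outDegree: Array[Int],
                ranks: Array[Double],
                edgeStream: Iterator[(Int, Int)],
                d: Double = 0.85): Array[Double] = {
      val next = Array.fill(numVertices)((1.0 - d) / numVertices)
      for ((src, dst) <- edgeStream)
        if (outDegree(src) > 0)
          next(dst) += d * ranks(src) / outDegree(src)
      next
    }
  }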

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
  – But not enough to store the whole graph

[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, …) in RAM]

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
  – Store billions of random walk states in RAM
• Multiple Breadth-First-Searches
  – Analyze neighborhood sizes by starting hundreds of random BFSes
• Compute in parallel many different recommender algorithms (or with different parameterizations)
  – See Mayank Mohta, Shu-Hao Yu's Master's project

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin). UAI 2010.
Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors). VLDB 2012.
GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch). OSDI 2012.
DrunkardMob: Billions of Random Walks on Just a PC. ACM RecSys 2013.
Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch). SEA 2014.
GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin). (submitted, arXiv)
Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin). ICML 2011.

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external memory graph computation: GraphChi
• Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database: GraphChi-DB
• Proposed DrunkardMob for simulating billions of random walks in parallel
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms: new approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                  | GraphChi (40 Mac Minis) | PowerGraph (64 EC2 cc1.4xlarge)
Investments       | 67,320 $                | -
Operating costs:
  Per node / hour | 0.03 $                  | 1.30 $
  Cluster / hour  | 1.19 $                  | 52.00 $
  Daily           | 28.56 $                 | 1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions:
- Mac Mini: 85 W (typical servers 500-1000 W)
- Most expensive US energy: 35c / KWh
- Equal-throughput configurations (based on OSDI'12)

89

PSW for In-memory Computation

• External memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM
• In-memory
  – Slow = RAM
  – Fast = CPU caches

[Diagram: CPU, small (fast) memory of size M, large (slow) memory of size 'unlimited', block transfers of size B]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra*, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

* Julian Shun, Guy Blelloch, PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space
• Async / G-S helps with high-diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP: see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)
• Asynchronous sometimes difficult to reason about and debug
• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in the Aggarwal-Vitter IO model
  – Worst-case only 2x best-case
• Intuition:

[Diagram: vertex range 1 … |V| split into interval(1), interval(2), …, interval(P) with shard(1), shard(2), …, shard(P); example vertices v1, v2]

An edge whose endpoints lie in the same interval is loaded from disk only once per iteration; an edge spanning two intervals is loaded twice per iteration.
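Restating the slide's claim in symbols, counting only edge loads and ignoring seek costs (B is the block size):

  \[
    \frac{|E|}{B} \;\le\; \text{(block reads per iteration)} \;\le\; \frac{2|E|}{B}
  \]

since each edge is read either once (endpoints in the same interval) or twice (endpoints in different intervals), which is why the worst case is only 2x the best case.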

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter: otherwise they would require too many iterations
• Per-iteration cost not much affected
• Can we optimize partitioning?
  – Could help thanks to Gauss-Seidel (faster convergence inside "groups"), topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression Would it help

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)
• Compression problematic for evolving graphs and associated data
• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

Page 44: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

45

Research Questions

bull What if there is lot of metadata associated with edges and vertices

bull How to do graph queries efficiently while retaining computational capabilities

bull How to add edges efficiently to the graph

Can we design a graph database based on GraphChi

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison: Friends-of-Friends Query

See thesis for shortest-path comparison.

[Figure: four bar charts of friends-of-friends query latency in milliseconds, 50th and 99th percentile, on a small graph (68M edges; GraphChi-DB vs. Neo4j) and on a big graph (1.5B edges; GraphChi-DB vs. Neo4j vs. MySQL). Latency percentiles are over 100K random queries; median latencies on the small graph are fractions of a millisecond, and 99th-percentile latencies on the big graph reach several seconds.]

68

LinkBench: Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
  – But only single-hop queries ("friends")
  – 8 different operations, mixed workload
  – Best performance with 64 parallel threads
• Each edge and vertex has
  – Version, timestamp, type, random string payload

                             GraphChi-DB (Mac Mini)   MySQL + FB patch, server, SSD-array, 144 GB RAM
  Edge-update (95p)          22 ms                    25 ms
  Edge-get-neighbors (95p)   18 ms                    9 ms
  Avg throughput             2,487 req/s              11,029 req/s
  Database size              350 GB                   1.4 TB

See full results in the thesis.

69

Summary of Experiments

• Efficient for mixed read/write workload
  – See Facebook LinkBench experiments in thesis
  – LSM-tree trade-off: read performance (but can adjust)
• State-of-the-art performance for graphs that are much larger than RAM
  – Neo4J's linked-list data structure is good for RAM
  – DEX performs poorly in practice

More experiments in the thesis.

70

Discussion

Greater Impact

Hindsight

Future Research Questions

71

GREATER IMPACT

72

Impact: "Big Data" Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)
  – Two major direct descendant papers in top conferences:
    • X-Stream: SOSP 2013
    • TurboGraph: KDD 2013
• Challenging the mainstream
  – You can do a lot on just a PC: focus on the right data structures and computational models

73

Impact Users

• GraphChi's (C++, Java, -DB) have gained a lot of users
  – Currently ~50 unique visitors / day
• Enables 'everyone' to tackle big graph problems
  – Especially the recommender toolkit (by Danny Bickson) has been very popular
  – Typical users: students, non-systems researchers, small companies…

74

Impact Users (cont)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research - it is too expensive.

Using GraphChi, I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research."

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for?

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models

1. Changing the value of an edge (both in- and out-edges)
2. Computation: process the whole graph, or most of it, on each iteration
3. Random access to all of a vertex's edges
   – Vertex-centric vs. edge-centric
4. Async / Gauss-Seidel execution

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
  – Or dynamic scheduling (such as Splash BP)
• High diameter graphs, such as planar graphs
  – Unless the computation itself has short-range interactions
• Very large number of iterations
  – Neural networks
  – LDA with Collapsed Gibbs sampling
• No support for implicit graph structure

+ Single PC performance is limited

78

Versatility of PSW and PAL

[Diagram: the two systems and the applications implemented on top of them]

GraphChi (Parallel Sliding Windows)
GraphChi-DB (Partitioned Adjacency Lists)
Online graph updates
Batch comp.
Evolving graph
Incremental comp.
DrunkardMob: Parallel Random walk simulation
GraphChi^2
Triangle Counting / Item-Item Similarity
PageRank / SALSA / HITS
Graph Contraction / Minimum Spanning Forest
k-Core
Graph Computation
Shortest Path
Friends-of-Friends
Induced Subgraphs
Neighborhood query
Edge and vertex properties
Link prediction
Weakly Connected Components / Strongly Connected Components
Community Detection / Label Propagation
Multi-BFS
Graph sampling
Graph Queries
Matrix Factorization
Loopy Belief Propagation / Co-EM
Graph traversal

79

Future Research Directions

• Distributed setting
  1. Distributed PSW (one shard / node)
     1. PSW is inherently sequential
     2. Low bandwidth in the Cloud
  2. Co-operating GraphChi(-DB)'s connected with a Parameter Server
• New graph programming models and tools
  – Vertex-centric programming is sometimes too local: for example, two-hop interactions and many traversals are cumbersome
  – Abstractions for learning graph structure; implicit graphs
  – Hard to debug, especially async; better tools needed
• Graph-aware optimizations to GraphChi-DB
  – Buffer management
  – Smart caching
  – Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
  – Good: works with very little memory
  – Bad: does not benefit from more memory
• Simple trick: cache some data
• Many graph algorithms have O(V) state
  – Update function accesses neighbor vertex state
    • Standard PSW: 'broadcast' vertex value via edges
    • Semi-external: store vertex values in memory

82
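As an illustration of the semi-external trick (assumed names and types, not the GraphChi API), the O(V) state can live in a plain in-memory array while edges are streamed from disk on every pass, so the update never has to 'broadcast' values through edges:

  // Illustrative semi-external sketch: label propagation for connected
  // components, with the O(V) state in RAM and edges streamed per pass.
  object SemiExternalSketch {
    final case class Edge(src: Int, dst: Int)

    // `edgeScan()` returns a fresh streaming pass over the on-disk edges.
    def connectedComponents(numVertices: Int, edgeScan: () => Iterator[Edge]): Array[Int] = {
      val label = Array.tabulate(numVertices)(identity) // in-memory vertex values
      var changed = true
      while (changed) {
        changed = false
        for (e <- edgeScan()) {                          // stream edges; no data travels via edges
          val m = math.min(label(e.src), label(e.dst))
          if (label(e.src) != m) { label(e.src) = m; changed = true }
          if (label(e.dst) != m) { label(e.dst) = m; changed = true }
        }
      }
      label
    }
  }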

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
  – But not enough to store the whole graph

[Figure: the graph working memory (PSW) stays on disk, while the computational state (Computation 1 state, Computation 2 state, ...) is kept in RAM]

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
  – Store billions of random walk states in RAM
• Multiple Breadth-First-Searches
  – Analyze neighborhood sizes by starting hundreds of random BFSes
• Compute in parallel many different recommender algorithms (or with different parameterizations)
  – See Mayank Mohta, Shu-Hao Yu's Master's project

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin) – UAI 2010
Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (-- same --) – VLDB 2012
GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch) – OSDI 2012
DrunkardMob: Billions of Random Walks on Just a PC – ACM RecSys 2013
Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch) – SEA 2014
GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin) – (submitted, arXiv)
Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin) – ICML 2011

First author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external memory graph computation → GraphChi
• Extended PSW to design Partitioned Adjacency Lists, used to build a scalable graph database → GraphChi-DB
• Proposed DrunkardMob for simulating billions of random walks in parallel
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms → a new approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                     GraphChi (40 Mac Minis)   PowerGraph (64 EC2 cc1.4xlarge)
  Investments        67,320 $                  -
  Operating costs:
  Per node / hour    0.03 $                    1.30 $
  Cluster / hour     1.19 $                    52.00 $
  Daily              28.56 $                   1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions:
- Mac Mini: 85 W (typical servers: 500-1000 W)
- Most expensive US energy: 35c / kWh

Equal throughput configurations (based on OSDI'12).

89
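The payback estimate follows directly from the table, since the daily operating-cost difference has to amortize the hardware investment:

  \frac{67{,}320\ \$}{(1{,}248.00 - 28.56)\ \$/\text{day}} \approx \frac{67{,}320}{1{,}219.44} \approx 55.2\ \text{days} \approx 56\ \text{days}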

PSW for In-memory Computation

• External memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM
• In-memory
  – Slow = RAM
  – Fast = CPU caches

[Figure: the classic external-memory model: a small (fast) memory of size M, a large (slow) memory of 'unlimited' size, and block transfers of size B between them, driven by the CPU]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW slightly outperforms Ligra (Julian Shun, Guy Blelloch, PPoPP 2013), the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space
• Async / G-S helps with high diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP: see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)
• Asynchronous is sometimes difficult to reason about and debug
• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in Aggarwal-Vitter's I/O model
  – Worst case is only 2x the best case
• Intuition:

[Figure: the vertex range 1..|V| split into interval(1), interval(2), ..., interval(P), each with its shard(1), shard(2), ..., shard(P); example vertices v1 and v2]

An edge whose endpoints are in the same interval is loaded from disk only once per iteration; an edge spanning two intervals is loaded twice per iteration.

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise too many iterations would be required
• Per-iteration cost is not much affected
• Can we optimize partitioning?
  – Could help, thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)
• Compression is problematic for evolving graphs and associated data
• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• 1990s and 2000s saw interest in object-oriented and graph databases
  – GOOD, GraphDB, HyperGraphDB…
  – Focus was on modeling graph storage on top of a relational DB or key-value store
• RDF databases
  – Most do not use graph storage but store triples as relations + use indexing
• Modern solutions have proposed graph-specific storage
  – Neo4j: doubly linked list
  – TurboGraph: adjacency list chopped into pages
  – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont.)

• GraphChi load time 9 hours; FB's 12 hours
• GraphChi database about 250 GB; FB > 1.4 terabytes
  – However, about 100 GB is explained by different variable data (payload) size
• Facebook: MySQL via JDBC; GraphChi: embedded
  – But MySQL is native code, GraphChi-DB is Scala (JVM)
• Important: CPU bound; bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance / size of DB

[Figure: requests / sec (0 to 10,000) vs. number of nodes (1.0e7, 1.0e8, 2.0e8, 1.0e9)]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory-extension [SSDAlloc, NSDI'11]
   → Too many small objects; need millions / sec
2. Compress the graph structure to fit into RAM [WebGraph framework]
   → Associated values do not compress well, and are mutated
3. Cluster the graph and handle each cluster separately in RAM
   → Expensive; the number of inter-cluster edges is big
4. Caching of hot nodes
   → Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks → parallel I/O

[Figure: seconds / iteration (0 to 3000) for Pagerank and Connected components with 1, 2, and 3 hard drives]

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

[Figure: Connected Components on Mac Mini / SSD: runtime (0 to 2500 seconds) with 1, 2, and 4 threads, broken into Disk IO, Graph construction, and Exec. updates]

• Cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM → memory bandwidth becomes a bottleneck

Bottlenecks: Multicore

Experiment on MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

[Figure: runtime in seconds vs. number of threads (1, 2, 4), split into Loading and Computation, for Matrix Factorization (ALS) (0 to 160 s) and Connected Components (0 to 1000 s)]

104

In-memory vs Disk

105

Experiment: Query latency

See thesis for I/O cost analysis of in/out queries.

106

Example: Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S
• Very fast in GraphChi-DB
  – Sufficient to query for out-edges
  – Parallelizes well: multi-out-edge-query
• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from the graph
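Since out-edge queries suffice, the query routine itself is tiny. In the sketch below, queryOutEdges is an assumed stand-in for the database's (parallelizable) out-edge query:

  // Illustrative induced-subgraph query built only from out-edge queries.
  object InducedSubgraphSketch {
    def inducedSubgraph(s: Set[Long], queryOutEdges: Long => Seq[Long]): Seq[(Long, Long)] =
      s.toSeq.flatMap { v =>
        // Keep only the out-edges whose other endpoint is also in S.
        queryOutEdges(v).collect { case dst if s.contains(dst) => (v, dst) }
      }
  }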

Vertices / Nodes

• Vertices are partitioned similarly to edges
  – Similar "data shards" for columns
• Lookup/update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
  – Sparse structure could be supported

[Figure: internal vertex-id ranges 1..a, a+1..2a, ..., up to Max-id]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance shards
  – Interval length is a constant a

[Figure: example mapping from original vertex IDs (0, 1, 2, ..., 255; 256, 257, 258, ..., 511; ...) to internal IDs (0, 1, 2, ...; a, a+1, a+2, ...; 2a, 2a+1, ...)]

109

What if we have a Cluster

[Figure: each machine keeps the graph working memory (PSW) on disk and the computational state (Computation 1 state, Computation 2 state, ...) in RAM]

Trade latency for throughput.

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. … which is caused by the lack of good data available for academics: big graphs with metadata
   – Industry co-operation? But problems with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just the structure)
3. Too much focus on performance. More important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

  parfor walk in walks:
    for i = 1 to numsteps:
      vertex = walk.atVertex()
      walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

[Twitter network visualization by Akshay Java, 2009]

Distributed graph systems: each hop across a partition boundary is costly.
Disk-based "single-machine" graph systems: "paging" from disk is costly.
(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
  – Reverse thinking:

  ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
      mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
      ForEach walk in mywalks:
        walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk.
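A compact walk representation is what makes "only the current position" cheap to store for billions of walks. The sketch below is an illustrative assumption (not the DrunkardMob implementation): each walk is packed into one Long holding its source and current vertex, and walks are bucketed by the interval of their current vertex so they can be advanced while that interval's subgraph is loaded.

  // Illustrative compact walk encoding, bucketed by interval (assumed design).
  object WalkEncodingSketch {
    // Pack (sourceId, currentVertexId) into one Long: 32 bits each.
    def encode(source: Int, current: Int): Long =
      (source.toLong << 32) | (current.toLong & 0xffffffffL)

    def sourceOf(walk: Long): Int  = (walk >>> 32).toInt
    def currentOf(walk: Long): Int = walk.toInt

    // Bucket walks by the interval of their current vertex, so one interval's
    // walks can be processed while that interval's subgraph is in memory.
    def bucketByInterval(walks: Array[Long], intervalLength: Int): Map[Int, Array[Long]] =
      walks.groupBy(w => currentOf(w) / intervalLength)
  }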

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW & External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM), Ref: O'Neil, Cheng et al. (1996)
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact: "Big Data" Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 45: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

46

Existing Graph Database Solutions

1) Specialized single-machine graph databases

Problemsbull Poor performance with data gtgt memorybull Noweak support for analytical computation

2) Relational key-value databases as graph storage

Problemsbull Large indicesbull In-edge out-edge dilemmabull Noweak support for analytical computation

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 46: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

47

PARTITIONED ADJACENCY LISTS (PAL) DATA STRUCTURE

Our solution

48

Review Edges in Shards

Shard 1

sort

ed b

y so

urce

_id

Shard 2 Shard 3 Shard PShard 1

0 100 101 700 N

Edge = (src dst)Partition by dst

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

[Charts: runtime (seconds) vs. number of threads (1, 2, 4), split into Loading and Computation — Matrix Factorization (ALS), y-axis 0–160, and Connected Components, y-axis 0–1000.]

104

In-memory vs Disk

105

Experiment: Query latency

See thesis for I/O cost analysis of in/out queries

106

Example: Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB
  – Sufficient to query for out-edges
  – Parallelizes well: multi-out-edge-query

• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from the graph
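A minimal sketch of the query idea (assuming a hypothetical out-edge query function outEdges(v); this is not the GraphChi-DB API): issue one out-edge query per vertex in S and keep only edges whose destination is also in S.

  // Hypothetical sketch in Scala: induced subgraph from out-edge queries only.
  def inducedSubgraph(s: Set[Long], outEdges: Long => Seq[Long]): Seq[(Long, Long)] =
    s.toSeq.flatMap { src =>
      outEdges(src)                          // one out-edge query per vertex in S
        .filter(dst => s.contains(dst))      // keep edges with both endpoints in S
        .map(dst => (src, dst))
    }

Because the per-vertex queries are independent, they can be issued in parallel (the multi-out-edge-query above).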

Vertices / Nodes
• Vertices are partitioned similarly as edges
  – Similar "data shards" for columns
• Lookup/update of vertex data is O(1)
• No merge tree here. Vertex files are "dense"
  – Sparse structure could be supported
[Figure: vertex IDs 1 … a, a+1 … 2a, …, Max-id split into equal-length intervals.]
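A minimal sketch of why the lookup is O(1) (fixed-size values in a dense per-column file; the file layout below is an assumption for illustration, not the actual GraphChi-DB format):

  import java.io.RandomAccessFile

  // Hypothetical dense vertex column: the value of internal vertex id v
  // lives at byte offset v * recordSize, so lookup/update needs no index.
  class DenseVertexColumn(path: String, recordSize: Int) {
    private val file = new RandomAccessFile(path, "rw")

    def read(v: Long): Array[Byte] = {
      val buf = new Array[Byte](recordSize)
      file.seek(v * recordSize)
      file.readFully(buf)
      buf
    }

    def write(v: Long, value: Array[Byte]): Unit = {
      require(value.length == recordSize)
      file.seek(v * recordSize)
      file.write(value)
    }
  }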

ID-mapping

• Vertex IDs mapped to internal IDs to balance shards
  – Interval length constant a

[Figure: ID-mapping table — internal IDs 0, 1, 2, …; a, a+1, a+2, …; 2a, 2a+1, …; shown against original IDs 0, 1, 2, …, 255 and 256, 257, 258, …, 511.]
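One way to read the mapping (a sketch assuming consecutive original IDs are striped round-robin over P intervals of length a; P and a below are example values, and the thesis defines the exact bijection):

  // Hypothetical ID-mapping sketch: spread consecutive original IDs evenly over
  // the intervals so that every shard gets an equal share of the vertices.
  object IdMapping {
    val P: Long = 256        // number of intervals (example value)
    val a: Long = 1 << 20    // interval length constant (example value)

    def toInternal(orig: Long): Long   = (orig % P) * a + (orig / P)
    def toOriginal(intern: Long): Long = (intern % a) * P + (intern / a)
  }

With this reading, original IDs 0, 1, 2, … map to internal IDs 0, a, 2a, …, and original IDs 256, 257, 258, … map to 1, a+1, 2a+1, …, matching the table above.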

109

What if we have a Cluster

[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state) in RAM.]

Trade latency for throughput

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. … which is caused by lack of good data available for the academics: big graphs with metadata
  – Industry co-operation? But problem with reproducibility
  – Also, it is hard to ask good questions about graphs (especially with just structure)

3. Too much focus on performance. More important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

  parfor walk in walks:
      for i = 1 to numsteps:
          vertex = walk.atVertex()
          walk.takeStep(vertex.randomNeighbor())
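A runnable sketch of the same loop (assuming an in-memory adjacency-list graph; hypothetical code, not the DrunkardMob implementation):

  import scala.util.Random

  // graph(v) is the neighbor array of vertex v; returns the final vertex of each walk.
  def simulateWalks(graph: Array[Array[Int]], sources: Array[Int], numSteps: Int): Array[Int] =
    sources.map { source =>          // walks are independent, so this map can be parallelized (parfor)
      var vertex = source
      for (_ <- 1 to numSteps) {
        val neighbors = graph(vertex)
        if (neighbors.nonEmpty)
          vertex = neighbors(Random.nextInt(neighbors.length))   // one random hop
      }
      vertex
    }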

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

Distributed graph systems:
- Each hop across a partition boundary is costly

Disk-based "single-machine" graph systems:
- "Paging" from disk is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
  – Reverse thinking

  ForEach interval p:
      walkSnapshot = getWalksForInterval(p)
      ForEach vertex in interval(p):
          mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
          ForEach walk in mywalks:
              walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk
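A sketch of the same loop in code (hypothetical types and method names; the real DrunkardMob uses a much more compact walk encoding):

  import scala.util.Random
  import scala.collection.mutable.ArrayBuffer

  case class Walk(id: Long, atVertex: Int)

  // Walks are bucketed by the interval of their current vertex; a PSW pass over an
  // interval advances the walks currently in that interval while its sub-graph is in memory.
  class WalkManager(numIntervals: Int, intervalOf: Int => Int) {
    private val buckets = Array.fill(numIntervals)(ArrayBuffer.empty[Walk])

    def addHop(walk: Walk, nextVertex: Int): Unit =
      buckets(intervalOf(nextVertex)) += walk.copy(atVertex = nextVertex)

    def walksForInterval(p: Int): Seq[Walk] = {
      val snapshot = buckets(p).toVector   // snapshot and clear; the bucket re-fills with new hops
      buckets(p).clear()
      snapshot
    }
  }

  def advanceAllWalks(manager: WalkManager, numIntervals: Int,
                      neighborsOf: Int => Array[Int]): Unit =
    for (p <- 0 until numIntervals; walk <- manager.walksForInterval(p)) {
      val nbrs = neighborsOf(walk.atVertex)
      if (nbrs.nonEmpty)
        manager.addHop(walk, nbrs(Random.nextInt(nbrs.length)))   // register the random hop; a walk
      // landing in a later interval may advance again within this same pass
    }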

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW & External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data & Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref: O'Neil, Cheng et al. (1996)
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact: "Big Data" Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 48: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

49

Shard Structure (Basic)

Source Destination

1 8

1 193

1 76420

3 12

3 872

7 193

7 212

7 89139

hellip hellip src dst

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 49: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

50

Shard Structure (Basic)

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Compressed Sparse Row (CSR)

Problem 1

How to find in-edges of a vertex quickly

Note We know the shard but edges in random orderPointer-array

Edge-array

51

PAL In-edge Linkage

Destination

8

193

76420

12

872

193

212

89139

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

src dst

Pointer-array

Edge-array

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
  – Or dynamic scheduling (such as Splash BP)

• High-diameter graphs, such as planar graphs
  – Unless the computation itself has short-range interactions

• Very large number of iterations
  – Neural networks
  – LDA with Collapsed Gibbs sampling

• No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB (Partitioned Adjacency Lists)

Online graph updates

Batch comp.

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle Counting, Item-Item Similarity

PageRank, SALSA, HITS

Graph Contraction, Minimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected Components, Strongly Connected Components

Community Detection, Label Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief Propagation, Co-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

• Distributed Setting
  1. Distributed PSW (one shard / node)
     1. PSW is inherently sequential
     2. Low bandwidth in the Cloud
  2. Co-operating GraphChi(-DB)'s connected with a Parameter Server

• New graph programming models and tools
  – Vertex-centric programming sometimes too local: for example, two-hop interactions and many traversals cumbersome
  – Abstractions for learning graph structure; implicit graphs
  – Hard to debug, especially async. Better tools needed.

• Graph-aware optimizations to GraphChi-DB
  – Buffer management
  – Smart caching
  – Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY?

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
  – Good: works with very little memory
  – Bad: does not benefit from more memory

• Simple trick: cache some data

• Many graph algorithms have O(V) state
  – Update function accesses neighbor vertex state
  • Standard PSW: 'broadcast' vertex value via edges
  • Semi-external: store vertex values in memory (see the sketch below)
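As a concrete illustration of the semi-external trick, here is a minimal, self-contained sketch (the layout and names are assumed for illustration, not GraphChi's actual code): the only O(V) state, a label per vertex, lives in an in-memory array, while the edge list, which here stands in for the on-disk edge stream, never has to carry neighbor values.

public class SemiExternalComponents {
    public static void main(String[] args) {
        int numVertices = 5;
        // The O(V) algorithm state kept in RAM: one component label per vertex.
        int[] label = new int[numVertices];
        for (int v = 0; v < numVertices; v++) label[v] = v;

        // Stand-in for the on-disk edge file: (src, dst) pairs streamed sequentially.
        int[][] edges = { {0, 1}, {1, 2}, {3, 4} };

        boolean changed = true;
        while (changed) {
            changed = false;
            // One sequential pass over the edges; neighbor labels are read directly
            // from the in-memory array, so no vertex value has to be 'broadcast' via edges.
            for (int[] e : edges) {
                int m = Math.min(label[e[0]], label[e[1]]);
                if (label[e[0]] != m) { label[e[0]] = m; changed = true; }
                if (label[e[1]] != m) { label[e[1]] = m; changed = true; }
            }
        }
        System.out.println(java.util.Arrays.toString(label)); // prints [0, 0, 0, 3, 3]
    }
}

With |V| in the hundreds of millions this per-vertex array is only a few gigabytes, which fits in RAM even when the edges do not.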

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
  – But not enough to store the whole graph

[Diagram: the graph (PSW working memory) stays on disk, while the computational state (Computation 1 state, Computation 2 state, …) is kept in RAM.]

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
  – Store billions of random walk states in RAM

• Multiple Breadth-First-Searches
  – Analyze neighborhood sizes by starting hundreds of random BFSes (see the sketch below)

• Compute in parallel many different recommender algorithms (or with different parameterizations)
  – See Mayank Mohta and Shu-Hao Yu's Master's project
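For the multiple-BFS case, a common way to keep the per-computation state small is one bit per (vertex, source) pair. The sketch below is illustrative (not GraphChi's implementation): it runs up to 64 traversals at once with a single long per vertex and repeated passes over an edge stream; labels may propagate more than one hop per pass, in the Gauss-Seidel spirit of PSW.

public class MultiBFS {
    public static void main(String[] args) {
        int numVertices = 6;
        int[][] edges = { {0, 1}, {1, 2}, {2, 3}, {4, 5} };   // undirected; stands in for the on-disk graph
        int[] sources = { 0, 4 };                              // up to 64 simultaneous BFS roots

        // O(V) state in RAM: bit i of reached[v] is set when v is reached from sources[i].
        long[] reached = new long[numVertices];
        for (int i = 0; i < sources.length; i++) reached[sources[i]] |= 1L << i;

        boolean changed = true;
        while (changed) {                                      // sequential passes until no mask changes
            changed = false;
            for (int[] e : edges) {
                long merged = reached[e[0]] | reached[e[1]];
                if (merged != reached[e[0]]) { reached[e[0]] = merged; changed = true; }
                if (merged != reached[e[1]]) { reached[e[1]] = merged; changed = true; }
            }
        }

        // Neighborhood (reachable-set) size for each BFS source.
        for (int i = 0; i < sources.length; i++) {
            int count = 0;
            for (long mask : reached) if ((mask & (1L << i)) != 0) count++;
            System.out.println("BFS from vertex " + sources[i] + " reaches " + count + " vertices");
        }
    }
}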

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin). UAI 2010.

Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same co-authors). VLDB 2012.

GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch). OSDI 2012.

DrunkardMob: Billions of Random Walks on Just a PC. ACM RecSys 2013.

Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch). SEA 2014.

GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin). (submitted, arXiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin). ICML 2011.

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external memory graph computation: GraphChi

• Extended PSW to design Partitioned Adjacency Lists, to build a scalable graph database: GraphChi-DB

• Proposed DrunkardMob for simulating billions of random walks in parallel

• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms. New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                   GraphChi (40 Mac Minis)   PowerGraph (64 EC2 cc1.4xlarge)
Investments        67,320 $                  -

Operating costs:
Per node, hour     0.03 $                    1.30 $
Cluster, hour      1.19 $                    52.00 $
Daily              28.56 $                   1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions:
- Mac Mini: 85 W (typical servers: 500-1000 W)
- Most expensive US energy: 35c / kWh

Equal-throughput configurations (based on OSDI'12)
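A quick check of the arithmetic from the table above: the cluster costs roughly 1,248.00 $ - 28.56 $ ≈ 1,219 $ more per day to operate, so the 67,320 $ hardware investment is recovered in about 67,320 / 1,219 ≈ 55 days, consistent with the roughly 56 days stated above.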

89

PSW for In-memory Computation

• External memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM

• In-memory
  – Slow = RAM
  – Fast = CPU caches

[Diagram: small (fast) memory of size M, large (slow) memory of 'unlimited' size, CPU, block transfers of size B]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including preprocessing steps of both systems.

Julian Shun, Guy Blelloch, PPoPP 2013

91

Remarks (sync vs. async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space

• Async / G-S helps with high-diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP: see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)

• Asynchronous sometimes difficult to reason about and debug

• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in the Aggarwal-Vitter I/O model
  – Worst-case only 2x best-case

• Intuition:

[Diagram: vertex intervals interval(1), interval(2), …, interval(P) over vertex ids 1..|V|, with the corresponding shard(1), shard(2), …, shard(P)]

An edge inside one interval is loaded from disk only once per iteration.

An edge spanning two intervals is loaded twice per iteration.
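A rough back-of-the-envelope, assuming about 8 bytes of stored data per edge: a graph with 1.5 billion edges occupies on the order of 12 GB of edge data, so by the argument above one iteration streams roughly 12-24 GB of reads (plus a comparable volume of writes when edge values are modified). At a sustained ~100 MB/s of sequential disk bandwidth that is only a few minutes of I/O per iteration.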

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter: otherwise too many iterations would be required

• Per-iteration cost not much affected

• Can we optimize partitioning?
  – Could help thanks to Gauss-Seidel (faster convergence inside "groups"), topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
  – But require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)

• Compression problematic for evolving graphs, and associated data

• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single-computer) Graph Databases

• 1990s and 2000s saw interest in object-oriented and graph databases
  – GOOD, GraphDB, HyperGraphDB…
  – Focus was on modeling; graph storage on top of relational DB or key-value store

• RDF databases
  – Most do not use graph storage but store triples as relations + use indexing

• Modern solutions have proposed graph-specific storage
  – Neo4j: doubly linked list
  – TurboGraph: adjacency list chopped into pages
  – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont.)

• GraphChi load time: 9 hours; FB's: 12 hours
• GraphChi database about 250 GB; FB > 1.4 terabytes
  – However, about 100 GB explained by different variable data (payload) size

• Facebook: MySQL via JDBC; GraphChi: embedded
  – But MySQL is native code, GraphChi-DB is Scala (JVM)

• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance vs. size of DB

[Chart: LinkBench throughput (requests / sec, up to ~10,000) as a function of the number of nodes in the database (1.0E+07, 1.0E+08, 2.0E+08, 1.0E+09).]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory-extension [SSDAlloc, NSDI'11]
   → Too many small objects: need millions / sec

2. Compress the graph structure to fit into RAM [WebGraph framework]
   → Associated values do not compress well, and are mutated

3. Cluster the graph and handle each cluster separately in RAM
   → Expensive: the number of inter-cluster edges is big

4. Caching of hot nodes
   → Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks: parallel I/O

[Chart: runtime (seconds / iteration) of Pagerank and Connected components with 1, 2 and 3 hard drives. Experiment on a 16-core AMD server (from year 2007).]

Bottlenecks

[Chart: runtime of Connected Components on Mac Mini (SSD) with 1, 2 and 4 threads, broken down into Disk IO, Graph construction and Exec. updates; y-axis 0-2,500 seconds.]

• Cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM: memory bandwidth becomes a bottleneck

Bottlenecks / Multicore

Experiment on MacBook Pro with 4 cores, SSD

• Computationally intensive applications benefit substantially from parallel execution

• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime (seconds), split into Loading and Computation, for Matrix Factorization (ALS) and Connected Components with 1, 2 and 4 threads.]

104

In-memory vs Disk

105

Experiment: Query latency

See thesis for I/O cost analysis of in/out queries

106

Example: Induced Subgraph Queries

• Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB
  – Sufficient to query for out-edges
  – Parallelizes well: multi-out-edge-query (see the sketch below)

• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from graph
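A minimal sketch of an induced-subgraph query built only on out-edge queries. It is illustrative: outEdges stands in for the database's out-edge query and is not the actual GraphChi-DB API.

import java.util.*;
import java.util.function.LongFunction;

public class InducedSubgraph {
    // Returns the edges (src, dst) of the subgraph induced by vertex set S.
    static List<long[]> induced(Set<Long> s, LongFunction<long[]> outEdges) {
        List<long[]> result = new ArrayList<>();
        // One out-edge query per vertex in S; these queries are independent
        // and could be issued in parallel (the "multi-out-edge-query").
        for (long v : s) {
            for (long target : outEdges.apply(v)) {
                if (s.contains(target)) result.add(new long[]{v, target});
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Tiny in-memory adjacency as a stand-in for the graph database.
        Map<Long, long[]> adj = Map.of(
                1L, new long[]{2, 3},
                2L, new long[]{3, 4},
                3L, new long[]{1});
        Set<Long> s = Set.of(1L, 2L, 3L);
        for (long[] e : induced(s, v -> adj.getOrDefault(v, new long[0]))) {
            System.out.println(e[0] + " -> " + e[1]);
        }
        // Prints the induced edges 1->2, 1->3, 2->3, 3->1 (order may vary); 2->4 is dropped since 4 is not in S.
    }
}

Because only out-edge queries are needed, no in-edge index has to be consulted, and the per-vertex queries parallelize trivially.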

Vertices / Nodes

• Vertices are partitioned similarly as edges
  – Similar "data shards" for columns
• Lookup/update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
  – Sparse structure could be supported

[Diagram: vertex-id space 1 .. Max-id divided into intervals of length a: 1..a, a+1..2a, …]

ID-mapping

• Vertex IDs mapped to internal IDs to balance shards
  – Interval length constant a

[Diagram: example of the mapping; original IDs 0, 1, 2, …, 255 and 256, 257, 258, …, 511 are spread over the internal-ID intervals 0, 1, 2, …; a, a+1, a+2, …; 2a, 2a+1, …]

109

What if we have a Cluster?

[Diagram: graph (PSW working memory) on disk; computational state (Computation 1 state, Computation 2 state, …) in RAM.]

Trade latency for throughput

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. … which is caused by lack of good data available for the academics: big graphs with metadata
   – Industry co-operation? But problem with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just structure)

3. Too much focus on performance. More important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java, 2009

Distributed graph systems:
- Each hop across a partition boundary is costly

Disk-based "single-machine" graph systems:
- "Paging" from disk is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
  – Reverse thinking

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk!
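The following toy sketch illustrates the DrunkardMob idea with assumed, simplified data structures (the real implementation uses a far more compact walk encoding and GraphChi's shards; none of the names below are the actual API): walks are bucketed by the interval their current vertex belongs to, so advancing all walks of one bucket only needs that interval's sub-graph, while every walk state stays in RAM.

import java.util.*;

public class DrunkardMobSketch {
    public static void main(String[] args) {
        Random rnd = new Random(42);
        int numVertices = 1000, intervalLength = 250, numIntervals = 4;
        // Stand-in for the graph: a couple of random out-neighbors per vertex.
        int[][] outNeighbors = new int[numVertices][];
        for (int v = 0; v < numVertices; v++) {
            outNeighbors[v] = new int[]{rnd.nextInt(numVertices), rnd.nextInt(numVertices)};
        }

        // Walk state = current vertex only, grouped by the interval it falls into.
        List<List<Integer>> walksByInterval = new ArrayList<>();
        for (int p = 0; p < numIntervals; p++) walksByInterval.add(new ArrayList<>());
        for (int w = 0; w < 100_000; w++) {               // start 100K walks at random vertices
            int v = rnd.nextInt(numVertices);
            walksByInterval.get(v / intervalLength).add(v);
        }

        for (int step = 0; step < 10; step++) {           // advance every walk by one hop
            List<List<Integer>> next = new ArrayList<>();
            for (int p = 0; p < numIntervals; p++) next.add(new ArrayList<>());
            for (int p = 0; p < numIntervals; p++) {      // "ForEach interval p": only interval p's graph is needed here
                for (int vertex : walksByInterval.get(p)) {
                    int[] nbrs = outNeighbors[vertex];
                    int hop = nbrs[rnd.nextInt(nbrs.length)];
                    next.get(hop / intervalLength).add(hop);   // plays the role of walkManager.addHop(...)
                }
            }
            walksByInterval = next;
        }
        System.out.println("walks in interval 0 after 10 steps: " + walksByInterval.get(0).size());
    }
}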

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 51: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

52

PAL In-edge Linkage

Destination Link

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

Source File offset

1 0

3 3

7 5

hellip hellip

+ Index to the first in-edge for each vertex in interval

Pointer-array

Edge-array

Problem 2

How to find out-edges quickly

Note Sorted inside a shard but partitioned across all shards

Augmented linked list for in-edges

53

Source File offset

1 0

3 3

7 5

hellip hellip

PAL Out-edge Queries

Pointer-array

Edge-array

Problem Can be big -- O(V) Binary search on disk slow

Destination

Next-in-offset

8 3339

193 3

76420 1092

12 289

872 40

193 2002

212 12

89139 22

hellip

256

512

1024

hellip

Option 1Sparse index (inmem)

Option 2Delta-coding with unary code (Elias-Gamma) Completely in-memory

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Page 52: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

53

PAL: Out-edge Queries

Pointer-array:

Source | File offset
1      | 0
3      | 3
7      | 5
…      | …

Edge-array (Destination, Next-in-offset):

8     | 3339
193   | 3
76420 | 1092
12    | 289
872   | 40
193   | 2002
212   | 12
89139 | 22
…     | …

Problem: the pointer-array can be big, O(V), and binary search on disk is slow.

Option 1: Sparse index (in-memory); the figure shows sampled entries 256, 512, 1024, …
Option 2: Delta-coding with unary code (Elias-Gamma), completely in-memory (see the sketch below).
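As an illustration of Option 2, here is a small self-contained Java sketch of Elias-Gamma coding, applied to the gaps between successive source IDs of the pointer-array. This is not the GraphChi-DB code: the class and method names are invented, and whether the real system codes the IDs, the offsets, or both this way is not shown on this slide.

import java.util.ArrayList;
import java.util.List;

// Sketch: Elias-Gamma code the gaps between sorted source IDs, so a
// pointer-array-like structure can be held compactly in memory.
public class EliasGammaSketch {

    // Gamma code of n >= 1: (len - 1) zero bits, then the len-bit binary form of n.
    static void encode(int n, StringBuilder bits) {
        int len = 32 - Integer.numberOfLeadingZeros(n);
        for (int i = 0; i < len - 1; i++) bits.append('0');
        for (int i = len - 1; i >= 0; i--) bits.append((char) ('0' + ((n >> i) & 1)));
    }

    // Decode a whole bit string back into the coded integers.
    static List<Integer> decodeAll(String bits) {
        List<Integer> out = new ArrayList<>();
        int i = 0;
        while (i < bits.length()) {
            int zeros = 0;
            while (bits.charAt(i) == '0') { zeros++; i++; }
            int n = 0;
            for (int j = 0; j <= zeros; j++) n = (n << 1) | (bits.charAt(i++) - '0');
            out.add(n);
        }
        return out;
    }

    public static void main(String[] args) {
        int[] sources = {1, 3, 7, 12};   // sorted source IDs (1, 3, 7 from the slide, 12 added for illustration)
        StringBuilder bits = new StringBuilder();
        int prev = 0;
        for (int s : sources) { encode(s - prev, bits); prev = s; }
        // Prefix-summing the decoded gaps recovers the original source IDs.
        System.out.println(bits + "  ->  gaps " + decodeAll(bits.toString()));
    }
}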

54

Experiment Indices

Median latency, Twitter graph (1.5B edges).

55

Queries: I/O costs

In-edge query: touches only one shard.

Out-edge query: touches each shard that has edges for the vertex.

Trade-off: more shards means better locality for in-edge queries, but worse performance for out-edge queries.

Edge Data & Searches

Shard X: adjacency; 'weight' [float]; 'timestamp' [long]; 'belief' (factor).

Edge (i): no foreign key is required to find the edge's data, because the column entries are fixed-size, so the data of edge i is found by offset arithmetic.

Note: vertex values are stored similarly.
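A minimal sketch of that fixed-size column idea (illustrative file name and layout, not the actual GraphChi-DB format): the i-th edge's 'weight' lives at byte offset i * 4 of the weight column file, so no index or foreign-key lookup is needed.

import java.io.IOException;
import java.io.RandomAccessFile;

// Sketch: fixed-size columnar edge data; edge i's value is located by
// plain offset arithmetic instead of a foreign-key lookup.
public class EdgeColumnSketch {
    public static void main(String[] args) throws IOException {
        try (RandomAccessFile weights = new RandomAccessFile("shardX.weight.col", "rw")) {
            // Write weights for edges 0..9 (4 bytes each).
            for (int i = 0; i < 10; i++) {
                weights.seek(4L * i);
                weights.writeFloat(0.5f * i);
            }
            // Read back edge 7's weight directly by offset.
            weights.seek(4L * 7);
            System.out.println("weight(edge 7) = " + weights.readFloat());
        }
    }
}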

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading the existing shard from disk, and every edge is rewritten on each merge.

Does not scale: number of rewrites = data size / buffer size = O(E).

60

Log-Structured Merge-tree (LSM)
Ref: O'Neil, Cheng et al. (1996)

[Figure: new edges go into the top shards with in-memory buffers (LEVEL 1, youngest; one shard per interval 1, 2, 3, 4, …, P-1, P). Full shards are merged downstream into larger on-disk shards: LEVEL 2 shards each cover several intervals (intervals 1--4, …, (P-4)--P), and LEVEL 3 (oldest) shards cover even wider ranges (intervals 1--16, …).]

In-edge query: one shard on each level.
Out-edge query: all shards.
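To show the leveling idea in isolation, here is a small self-contained Java sketch of log-structured merging over sorted long keys (think of a source/destination pair packed into one long). The buffer and level capacities, and the use of in-memory lists instead of shard files, are simplifications made up for this example.

import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Sketch: log-structured merging. New keys land in a small buffer; when a level
// overflows, it is merged downstream into the next (larger) level, so each key
// is rewritten only O(number of levels) times instead of O(data size / buffer size).
public class LsmSketch {
    static final int BUFFER_CAPACITY = 4;                 // tiny, so merges actually happen
    final List<Long> buffer = new ArrayList<>();
    final List<List<Long>> levels = new ArrayList<>();    // level i holds up to BUFFER_CAPACITY * 4^(i+1) keys

    void insert(long key) {
        buffer.add(key);
        if (buffer.size() >= BUFFER_CAPACITY) {
            List<Long> run = new ArrayList<>(buffer);
            Collections.sort(run);
            buffer.clear();
            mergeInto(0, run);                            // flush the sorted run into the first level
        }
    }

    // Merge a sorted run into level i; cascade downstream if the level overflows.
    void mergeInto(int i, List<Long> run) {
        while (levels.size() <= i) levels.add(new ArrayList<>());
        List<Long> merged = mergeSorted(levels.get(i), run);
        long capacity = BUFFER_CAPACITY * Math.round(Math.pow(4, i + 1));
        if (merged.size() > capacity) {
            levels.set(i, new ArrayList<>());
            mergeInto(i + 1, merged);                     // downstream merge
        } else {
            levels.set(i, merged);
        }
    }

    static List<Long> mergeSorted(List<Long> a, List<Long> b) {
        List<Long> out = new ArrayList<>(a.size() + b.size());
        int i = 0, j = 0;
        while (i < a.size() && j < b.size()) out.add(a.get(i) <= b.get(j) ? a.get(i++) : b.get(j++));
        while (i < a.size()) out.add(a.get(i++));
        while (j < b.size()) out.add(b.get(j++));
        return out;
    }

    public static void main(String[] args) {
        LsmSketch lsm = new LsmSketch();
        for (long k = 1000; k > 0; k--) lsm.insert(k);    // inserts arrive in arbitrary (here reverse) order
        for (int i = 0; i < lsm.levels.size(); i++)
            System.out.println("level " + (i + 1) + ": " + lsm.levels.get(i).size() + " keys");
    }
}

A lookup in such a structure has to consult the buffer plus one run per level, which corresponds to the "in-edge query: one shard on each level" cost noted above.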

61

Experiment: Ingest

[Figure: number of edges ingested (0 to 1.5e9) as a function of time in seconds (0 to 16,000), comparing GraphChi-DB with LSM against GraphChi-No-LSM.]

62

Advantages of PAL

• Only sparse and implicit indices
  – The pointer-array usually fits in RAM with Elias-Gamma coding
  – Small database size
• Columnar data model
  – Load only the data you need
  – Graph structure is separate from the data
  – Property graph model
• Great insertion throughput with LSM
  – The tree can be adjusted to match the workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

• Written in Scala
• Queries & Computation
• Online database

All experiments shown in this talk were done on a Mac Mini (8 GB, SSD).

Source code and examples: http://github.com/graphchi

65

Comparison: Database Size

[Figure: database file size for the twitter-2010 graph (1.5B edges), on a 0--70 scale, for GraphChi-DB, Neo4j, MySQL (data + indices), and a baseline of 4 + 4 bytes per edge.]

66

Comparison: Ingest

System                 | Time to ingest 1.5B edges
GraphChi-DB (online)   | 1 hour 45 mins
Neo4j (batch)          | 45 hours
MySQL (batch)          | 3 hours 30 minutes (including index creation)

If running Pagerank simultaneously, GraphChi-DB takes 3 hours 45 minutes.

Comparison: Friends-of-Friends Query

See thesis for the shortest-path comparison.

[Figure: friends-of-friends query latency percentiles (milliseconds) over 100K random queries; panels show 50th- and 99th-percentile latencies for GraphChi-DB and Neo4j on a small graph (68M edges), and for GraphChi-DB, Neo4j and MySQL on a big graph (1.5B edges).]

68

LinkBench: Online Graph DB Benchmark by Facebook

• Concurrent read/write workload
  – But only single-hop queries ("friends")
  – 8 different operations, mixed workload
  – Best performance with 64 parallel threads
• Each edge and vertex has: version, timestamp, type, random string payload

                          | GraphChi-DB (Mac Mini) | MySQL + FB patch (server, SSD-array, 144 GB RAM)
Edge-update (95p)         | 22 ms                  | 25 ms
Edge-get-neighbors (95p)  | 18 ms                  | 9 ms
Avg throughput            | 2,487 req/s            | 11,029 req/s
Database size             | 350 GB                 | 1.4 TB

See full results in the thesis

69

Summary of Experiments

• Efficient for a mixed read/write workload
  – See the Facebook LinkBench experiments in the thesis
  – The LSM-tree trades off read performance (but can be adjusted)
• State-of-the-art performance for graphs that are much larger than RAM
  – Neo4j's linked-list data structure is good for RAM
  – DEX performs poorly in practice

More experiments in the thesis.

70

Discussion

Greater Impact
Hindsight

Future Research Questions

71

GREATER IMPACT

72

Impact: "Big Data" Research

• GraphChi's OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)
  – Two major direct descendant papers in top conferences:
    • X-Stream: SOSP 2013
    • TurboGraph: KDD 2013
• Challenging the mainstream: you can do a lot on just a PC if you focus on the right data structures and computational models

73

Impact Users

• GraphChi's implementations (C++, Java, -DB) have gained a lot of users
  – Currently ~50 unique visitors / day
• Enables 'everyone' to tackle big graph problems
  – Especially the recommender toolkit (by Danny Bickson) has been very popular
  – Typical users: students, non-systems researchers, small companies…

74

Impact Users (cont)

"I work in a [EU country] public university. I can't use a distributed computing cluster for my research: it is too expensive.

Using GraphChi, I was able to perform my experiments on my laptop. I thus have to admit that GraphChi saved my research."

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

• Original target algorithm: Belief Propagation on Probabilistic Graphical Models

1. Changing the value of an edge (both in- and out-edges)
2. Computation processes the whole graph, or most of it, on each iteration
3. Random access to all of a vertex's edges
   – Vertex-centric vs. edge-centric
4. Async / Gauss-Seidel execution
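As an illustration of the vertex-centric model (a generic sketch with invented types, not GraphChi's actual API), a PageRank-style update function reads the values on a vertex's in-edges and writes its new value to its out-edges:

import java.util.ArrayList;
import java.util.List;

// Sketch of a vertex-centric update in the spirit of GraphChi's model:
// the update function sees only one vertex and its in-/out-edges.
public class VertexCentricSketch {
    static class Edge { double value; }

    static class Vertex {
        final List<Edge> in = new ArrayList<>(), out = new ArrayList<>();
        double value = 1.0;

        // PageRank-style update: gather from in-edges, scatter to out-edges.
        void update() {
            double sum = 0;
            for (Edge e : in) sum += e.value;
            value = 0.15 + 0.85 * sum;
            double share = value / Math.max(1, out.size());
            for (Edge e : out) e.value = share;
        }
    }

    public static void main(String[] args) {
        // Two vertices connected by a single directed edge a -> b.
        Vertex a = new Vertex(), b = new Vertex();
        Edge ab = new Edge();
        a.out.add(ab);
        b.in.add(ab);

        for (int iter = 0; iter < 3; iter++) {  // Gauss-Seidel order: b sees a's fresh value in the same sweep
            a.update();
            b.update();
        }
        System.out.printf("a = %.3f, b = %.3f%n", a.value, b.value);
    }
}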

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
  – Or dynamic scheduling (such as Splash BP)
• High-diameter graphs, such as planar graphs
  – Unless the computation itself has only short-range interactions
• Very large numbers of iterations
  – Neural networks
  – LDA with Collapsed Gibbs sampling
• No support for implicit graph structure

+ Single-PC performance is limited.

78

Versatility of PSW and PAL

[Figure: GraphChi (Parallel Sliding Windows) and GraphChi-DB (Partitioned Adjacency Lists) together support online graph updates, batch computation, evolving graphs and incremental computation. Applications shown span Graph Computation and Graph Queries / traversal: DrunkardMob (parallel random walk simulation), GraphChi^2, Triangle Counting, Item-Item Similarity, PageRank, SALSA, HITS, Graph Contraction, Minimum Spanning Forest, k-Core, Shortest Path, Friends-of-Friends, Induced Subgraphs, Neighborhood queries, Edge and vertex properties, Link prediction, Weakly / Strongly Connected Components, Community Detection, Label Propagation, Multi-BFS, Graph sampling, Matrix Factorization, Loopy Belief Propagation, Co-EM.]

79

Future Research Directions

• Distributed setting
  1. Distributed PSW (one shard / node)?
     – PSW is inherently sequential
     – Low bandwidth in the Cloud
  2. Co-operating GraphChi(-DB)'s connected with a Parameter Server
• New graph programming models and tools
  – Vertex-centric programming is sometimes too local: for example, two-hop interactions and many traversals are cumbersome
  – Abstractions for learning graph structure? Implicit graphs?
  – Hard to debug, especially async; better tools are needed
• Graph-aware optimizations to GraphChi-DB
  – Buffer management
  – Smart caching
  – Learning the configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
  – Good: works with very little memory
  – Bad: does not benefit from more memory
• Simple trick: cache some data
• Many graph algorithms have O(V) state
  – The update function accesses neighbor vertex state
    • Standard PSW: 'broadcast' the vertex value via the edges
    • Semi-external: store the vertex values in memory (see the sketch below)
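A minimal sketch of the semi-external trick, under simplifying assumptions (the names are invented and the edge list is simulated by in-memory arrays, standing in for edges streamed from disk with PSW): the O(V) state, one float per vertex, stays in RAM, so the update never has to write values onto the edges.

import java.util.Arrays;
import java.util.Random;

// Sketch: semi-external computation. O(V) state (one float per vertex) is kept
// in RAM; the O(E) edge list is only streamed sequentially (here simulated by arrays).
public class SemiExternalSketch {
    public static void main(String[] args) {
        int numVertices = 1_000_000, numEdges = 10_000_000;
        Random rng = new Random(1);

        int[] src = new int[numEdges], dst = new int[numEdges];
        for (int e = 0; e < numEdges; e++) {
            src[e] = rng.nextInt(numVertices);
            dst[e] = rng.nextInt(numVertices);
        }
        int[] outDegree = new int[numVertices];
        for (int e = 0; e < numEdges; e++) outDegree[src[e]]++;

        float[] rank = new float[numVertices];            // in-RAM vertex state
        Arrays.fill(rank, 1.0f);

        for (int iter = 0; iter < 5; iter++) {
            float[] next = new float[numVertices];
            for (int e = 0; e < numEdges; e++)            // one sequential pass over the edges
                next[dst[e]] += rank[src[e]] / Math.max(1, outDegree[src[e]]);
            for (int v = 0; v < numVertices; v++)
                rank[v] = 0.15f + 0.85f * next[v];
        }
        System.out.println("rank[0] = " + rank[0]);
    }
}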

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
  – But not enough to store the whole graph

[Figure: the graph working memory (PSW) stays on disk; the computational state (Computation 1 state, Computation 2 state, …) lives in RAM.]

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
  – Store billions of random-walk states in RAM
• Multiple Breadth-First Searches
  – Analyze neighborhood sizes by starting hundreds of random BFSes (see the bit-parallel sketch below)
• Compute many different recommender algorithms in parallel (or one with different parameterizations)
  – See Mayank Mohta and Shu-Hao Yu's Master's project
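For the multiple-BFS case, one common trick, shown here as a generic sketch rather than GraphChi code, is to pack up to 64 BFS frontiers into one long per vertex, so all searches advance together in a single pass over the edges:

// Sketch: up to 64 breadth-first searches advanced simultaneously with bit-parallel
// frontiers. visited[v] has bit s set iff search number s has reached vertex v.
public class MultiBfsSketch {
    public static void main(String[] args) {
        int n = 8;
        int[][] adj = { {1, 2}, {3}, {3, 4}, {5}, {5}, {6}, {7}, {} };   // toy directed graph

        int[] sources = {0, 2, 4};                      // three searches; up to 64 fit in one long
        long[] visited = new long[n];
        long[] frontier = new long[n];
        for (int s = 0; s < sources.length; s++) {
            frontier[sources[s]] |= 1L << s;
            visited[sources[s]] |= 1L << s;
        }

        boolean active = true;
        while (active) {
            long[] next = new long[n];
            for (int v = 0; v < n; v++)
                if (frontier[v] != 0)
                    for (int u : adj[v])
                        next[u] |= frontier[v] & ~visited[u];   // only searches that are new at u
            active = false;
            for (int v = 0; v < n; v++) {
                visited[v] |= next[v];
                frontier[v] = next[v];
                if (next[v] != 0) active = true;
            }
        }
        for (int v = 0; v < n; v++)
            System.out.println("vertex " + v + " reached by searches (bit mask): " + Long.toBinaryString(visited[v]));
    }
}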

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin). UAI 2010.

Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors). VLDB 2012.

GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch). OSDI 2012.

DrunkardMob: Billions of Random Walks on Just a PC. ACM RecSys 2013.

Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch). SEA 2014.

GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin). Submitted (arXiv).

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin). ICML 2011.

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation: GraphChi
• Extended PSW to design Partitioned Adjacency Lists, used to build a scalable graph database: GraphChi-DB
• Proposed DrunkardMob for simulating billions of random walks in parallel
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms; a new approach for external-memory graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                   GraphChi (40 Mac Minis) | PowerGraph (64 EC2 cc1.4xlarge)
Investments        67,320 $                | -
Operating costs:
  Per node, hour   0.03 $                  | 1.30 $
  Cluster, hour    1.19 $                  | 52.00 $
  Daily            28.56 $                 | 1,248.00 $

It takes about 56 days to recoup the Mac Mini investment.

Assumptions: Mac Mini 85 W (typical servers: 500-1000 W); most expensive US energy, 35c / kWh.
Equal-throughput configurations (based on OSDI'12).
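As a rough sanity check on the payback claim (using the dollar figures as reconstructed above, so treat the exact numbers as approximate): the daily operating-cost difference is about 1,248.00 $ - 28.56 $ ≈ 1,219 $, and 67,320 $ / 1,219 $ per day ≈ 55 days, which is consistent with the "about 56 days" estimate.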

89

PSW for In-memory Computation

• External-memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM
• In-memory setting
  – Slow = RAM
  – Fast = CPU caches

[Figure: the standard external-memory model: a small (fast) memory of size M, a large (slow) memory of 'unlimited' size, and block transfers of size B between them, driven by the CPU.]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW (slightly) outperforms Ligra, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

Ligra: Julian Shun, Guy Blelloch, PPoPP 2013.

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space
• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP: see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)
• Asynchronous execution is sometimes difficult to reason about and debug
• Asynchronous execution can be used to implement BSP

IO Complexity

• See the paper for a theoretical analysis in Aggarwal and Vitter's I/O model
  – The worst case is only 2x the best case
• Intuition:

[Figure: intervals interval(1)..interval(P) over the vertex range 1..|V|, with shards shard(1)..shard(P). An edge whose endpoints v1, v2 fall in the same interval is loaded from disk only once per iteration; an edge spanning two intervals is loaded twice per iteration.]

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise too many iterations would be required
• The per-iteration cost is not much affected
• Can we optimize the partitioning?
  – It could help, thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., the WebGraph framework) can compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)
• Compression is problematic for evolving graphs and their associated data
• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases
  – GOOD, GraphDB, HyperGraphDB…
  – The focus was on modeling; graph storage was built on top of a relational DB or a key-value store
• RDF databases
  – Most do not use graph storage, but store triples as relations and use indexing
• Modern solutions have proposed graph-specific storage
  – Neo4j: doubly linked lists
  – TurboGraph: adjacency lists chopped into pages
  – DEX: compressed bitmaps (details not clear)

96

LinkBench: Comparison to FB (cont.)

• GraphChi load time: 9 hours; FB's: 12 hours
• GraphChi database: about 250 GB; FB: > 1.4 terabytes
  – However, about 100 GB is explained by a different variable-data (payload) size
• Facebook accesses MySQL via JDBC; GraphChi-DB is embedded
  – But MySQL is native code, GraphChi-DB Scala (JVM)
• Important: a CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance / Size of DB

[Figure: LinkBench throughput (requests / sec, scale 0 to 10,000) as a function of the number of nodes (1.0e7, 1.0e8, 2.0e8, 1.0e9).]

Why not even faster when everything is in RAM? The requests are computationally bound.

Possible Solutions

1. Use SSD as a memory extension? [SSDAlloc, NSDI'11]
   – Too many small objects; millions / sec would be needed
2. Compress the graph structure to fit into RAM? [WebGraph framework]
   – The associated values do not compress well, and are mutated
3. Cluster the graph and handle each cluster separately in RAM?
   – Expensive; the number of inter-cluster edges is big
4. Caching of hot nodes?
   – Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance.

Multiple hard drives (RAID-ish)

• GraphChi supports striping shards to multiple disks: parallel I/O.

[Figure: seconds / iteration for Pagerank and Connected Components with 1, 2 and 3 hard drives; experiment on a 16-core AMD server (from year 2007).]

Bottlenecks

[Figure: Connected Components on a Mac Mini (SSD); runtime with 1, 2 and 4 threads, split into disk I/O, graph construction and executing updates.]

• The cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM; memory bandwidth becomes a bottleneck

Bottlenecks: Multicore

Experiment on a MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

[Figure: runtime (seconds), split into loading and computation, for 1, 2 and 4 threads; left panel Matrix Factorization (ALS), right panel Connected Components.]

104

In-memory vs Disk

105

Experiment Query latency

See thesis for an I/O cost analysis of in- and out-edge queries.

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges of the graph that have both endpoints in S
• Very fast in GraphChi-DB
  – It is sufficient to query for out-edges (see the sketch below)
  – Parallelizes well: multi-out-edge-query
• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods, from the graph
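A small sketch of the induced-subgraph idea in generic Java, with an invented in-memory outEdges map standing in for GraphChi-DB's out-edge query: one out-edge query per vertex of S is enough, and an edge is kept when both endpoints lie in S.

import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Sketch: induced subgraph of a vertex set S, using only out-edge queries.
public class InducedSubgraphSketch {
    public static void main(String[] args) {
        // Stand-in for the database's out-edge query: vertex -> out-neighbors.
        Map<Integer, List<Integer>> outEdges = Map.of(
                1, List.of(2, 3, 9),
                2, List.of(3),
                3, List.of(1, 7),
                7, List.of(2));

        Set<Integer> s = new HashSet<>(List.of(1, 2, 3));
        List<int[]> induced = new ArrayList<>();
        for (int v : s)                                            // one out-edge query per vertex in S
            for (int u : outEdges.getOrDefault(v, List.of()))
                if (s.contains(u)) induced.add(new int[]{v, u});   // keep edges with both endpoints in S

        for (int[] e : induced) System.out.println(e[0] + " -> " + e[1]);
    }
}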

Vertices / Nodes

• Vertices are partitioned similarly to edges
  – Similar "data shards" for the columns
• Lookup / update of vertex data is O(1)
• No merge tree here: the vertex files are "dense"
  – A sparse structure could be supported

[Figure: vertex ID ranges 1..a, a+1..2a, …, up to Max-id.]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance the shards
  – Interval length constant a

[Figure: mapping between original IDs (0, 1, 2, …, 255; 256, 257, 258, …, 511; …) and internal IDs (0, 1, 2, …; a, a+1, a+2, …; 2a, 2a+1, …).]

109

What if we have a Cluster?

[Figure: as in the semi-external setting, each machine keeps the computational state (Computation 1 state, Computation 2 state, …) in RAM and the graph working memory (PSW) on disk.]

Trade latency for throughput.

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. … which is caused by a lack of good data available to academics: big graphs with metadata
   – Industry co-operation? But there are problems with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just the structure)
3. Too much focus on performance. More important to enable "extracting value".

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
  for i = 1 to numsteps:
    vertex = walk.atVertex()
    walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems: "paging" from disk is costly.

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm – reverse thinking:

ForEach interval p:
  walkSnapshot = getWalksForInterval(p)
  ForEach vertex in interval(p):
    mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
    ForEach walk in mywalks:
      walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk.
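To make the bookkeeping concrete, here is a minimal, self-contained Java sketch of the DrunkardMob idea under simplifying assumptions: the graph is a small in-memory adjacency array (in GraphChi the subgraph of interval p would be loaded from disk with PSW), the walk state is just the walk's current vertex, and all names are invented for the example. A real implementation would also bucket the walks by interval instead of scanning them all.

import java.util.Random;

// Sketch: advance many random walks one interval at a time ("reverse thinking").
// Only interval p's vertices (and, in GraphChi, their out-edges) need to be
// resident while its walks take their next hop.
public class DrunkardMobSketch {
    public static void main(String[] args) {
        int numVertices = 1000, numWalks = 100_000, numIntervals = 4, hops = 10;
        int intervalLen = numVertices / numIntervals;
        Random rng = new Random(42);

        // Toy graph: every vertex gets three random out-neighbors.
        int[][] outNeighbors = new int[numVertices][3];
        for (int v = 0; v < numVertices; v++)
            for (int j = 0; j < 3; j++) outNeighbors[v][j] = rng.nextInt(numVertices);

        // Walk state: only the current vertex of each walk.
        int[] walkAt = new int[numWalks];
        for (int w = 0; w < numWalks; w++) walkAt[w] = rng.nextInt(numVertices);

        for (int hop = 0; hop < hops; hop++) {
            int[] next = new int[numWalks];                    // buffered hops (the walkManager's role)
            for (int p = 0; p < numIntervals; p++) {
                int lo = p * intervalLen, hi = lo + intervalLen;
                for (int w = 0; w < numWalks; w++) {
                    int v = walkAt[w];
                    if (v >= lo && v < hi) {                   // walk is currently in interval p
                        int[] nbrs = outNeighbors[v];
                        next[w] = nbrs[rng.nextInt(nbrs.length)];
                    }
                }
            }
            walkAt = next;                                     // apply all hops of this iteration
        }
        System.out.println("First walk ended at vertex " + walkAt[0]);
    }
}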

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" / Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data & Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM), Ref: O'Neil, Cheng et al. (1996)
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact: "Big Data" Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 53: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

54

Experiment Indices

Median latency Twitter-graph 15B edges

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 54: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

55

Queries IO costsIn-edge query only one shard

Out-edge query each shard that has edges

Trade-offMore shards Better locality for in-edge queries worse for out-edge queries

Edge Data amp Searches

Shard X- adjacency

lsquoweightlsquo [float]

lsquotimestamprsquo [long] lsquobeliefrsquo (factor)

Edge (i)

No foreign key required to find edge data(fixed size)

Note vertex values stored similarly

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

• Cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
– Graph construction requires a lot of random access in RAM; memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

• Computationally intensive applications benefit substantially from parallel execution

• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime (seconds) vs. number of threads (1, 2, 4), split into Loading and Computation, for Matrix Factorization (ALS) and for Connected Components]

104

In-memory vs Disk

105

Experiment Query latency

See thesis for I/O cost analysis of in/out queries

106

Example Induced Subgraph Queries

• Induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB
– Sufficient to query for out-edges
– Parallelizes well: multi-out-edge-query

• Can be used for statistical graph analysis
– Sample induced neighborhoods, induced FoF neighborhoods from the graph
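As a concrete illustration of the out-edge-only strategy, a minimal sketch in Scala; Edge and the queryOut function are assumptions standing in for the GraphChi-DB query API, not its actual interface:

object InducedSubgraphSketch {
  // 'queryOut' stands in for GraphChi-DB's out-edge query (assumed signature).
  case class Edge(src: Long, dst: Long)

  def inducedSubgraph(s: Set[Long], queryOut: Long => Seq[Edge]): Seq[Edge] =
    s.toSeq
      .flatMap(queryOut)               // one out-edge query per vertex of S
      .filter(e => s.contains(e.dst))  // keep only edges whose other endpoint is also in S
}

Because only out-edges are needed, the per-vertex queries are independent and can be issued as one parallel multi-out-edge query, which is what the slide refers to.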

Vertices / Nodes

• Vertices are partitioned similarly as edges
– Similar "data shards" for columns
• Lookup/update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
– Sparse structure could be supported

[Figure: dense vertex file split at internal IDs 1, a, a+1, 2a, …, Max-id]
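Since the vertex files are dense, the O(1) lookup is just a seek to a fixed offset. A hedged sketch; the file layout, the 4-byte values and the RandomAccessFile choice are illustrative assumptions:

object VertexValueFileSketch {
  import java.io.RandomAccessFile

  val ValueSize = 4L  // assumed: one 4-byte float per vertex

  // Dense layout: the value of internal vertex id v lives at offset v * ValueSize.
  def readValue(file: RandomAccessFile, internalId: Long): Float = {
    file.seek(internalId * ValueSize)
    file.readFloat()
  }

  def writeValue(file: RandomAccessFile, internalId: Long, value: Float): Unit = {
    file.seek(internalId * ValueSize)
    file.writeFloat(value)
  }
}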

ID-mapping

• Vertex IDs mapped to internal IDs to balance shards
– Interval length: constant a

[Figure: internal IDs 0, 1, 2, …; a, a+1, a+2, …; 2a, 2a+1, …; … — original IDs 0, 1, 2, …, 255 and 256, 257, 258, …, 511 interleaved across the intervals]

109
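One way to read the figure is as a round-robin assignment that spreads consecutive original IDs evenly over the intervals. The sketch below encodes that reading; the formula and the constants P and a are illustrative assumptions, not necessarily the exact scheme used by GraphChi-DB:

object IdMappingSketch {
  val P: Long = 256        // assumed number of intervals
  val a: Long = 1L << 20   // assumed interval length (the constant 'a' from the slide)

  // Consecutive original IDs land in different intervals, keeping shards balanced.
  def toInternal(orig: Long): Long = (orig % P) * a + (orig / P)

  def toOriginal(internal: Long): Long = (internal % a) * P + (internal / a)
}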

What if we have a Cluster

[Figure: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, …) in RAM]

Trade latency for throughput!

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. … which is caused by lack of good data available for academics: big graphs with metadata
– Industry co-operation? But problems with reproducibility
– Also, it is hard to ask good questions about graphs (especially with just structure)

3. Too much focus on performance. More important to enable "extracting value"!

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk a time (multiple in parallel, of course):

parfor walk in walks:
  for i = 1 to numsteps:
    vertex = walk.atVertex()
    walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

Distributed graph systems:
– Each hop across a partition boundary is costly

Disk-based "single-machine" graph systems:
– "Paging" from disk is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
– Reverse thinking!

ForEach interval p:
  walkSnapshot = getWalksForInterval(p)
  ForEach vertex in interval(p):
    mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
    ForEach walk in mywalks:
      walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk!
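A sketch of what "only the current position" can look like in practice, under assumptions of my own; the single-Long encoding and the bucket-per-interval layout are illustrative, not the exact DrunkardMob data structures:

object WalkManagerSketch {
  import scala.collection.mutable.ArrayBuffer

  // A walk is packed into one Long: upper bits = walk id, lower 28 bits = current vertex.
  private val VertexBits = 28
  private val VertexMask = (1L << VertexBits) - 1

  def encode(walkId: Long, vertex: Long): Long = (walkId << VertexBits) | (vertex & VertexMask)
  def walkIdOf(walk: Long): Long = walk >>> VertexBits
  def vertexOf(walk: Long): Long = walk & VertexMask

  class WalkManager(numIntervals: Int, intervalLength: Long) {
    // One bucket of pending walks per vertex interval.
    private val buckets = Array.fill(numIntervals)(new ArrayBuffer[Long]())

    def addHop(walk: Long, nextVertex: Long): Unit = {
      val p = (nextVertex / intervalLength).toInt
      buckets(p) += encode(walkIdOf(walk), nextVertex)
    }

    // Snapshot the walks currently waiting in interval p; each walk is
    // re-added (via addHop) at whatever vertex it moves to next.
    def getWalksForInterval(p: Int): Seq[Long] = {
      val snapshot = buckets(p).toSeq
      buckets(p).clear()
      snapshot
    }
  }
}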

57

Efficient Ingest

Shards on Disk

Buffers in RAM

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 57: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

58

Merging Buffers to Disk

Shards on Disk

Buffers in RAM

59

Merging Buffers to Disk (2)

Shards on Disk

Buffers in RAM

Merge requires loading existing shard from disk Each edge will be rewritten always on a merge

Does not scale number of rewrites data size buffer size = O(E)

60

Log-Structured Merge-tree

(LSM)Ref OrsquoNeil Cheng et al (1996)

Top shards with in-memory buffers

On-disk shards

LEVEL 1 (youngest)

Downstream merge

interval 1 interval 2 interval 3 interval 4 interval PInterval P-1

Intervals 1--4 Intervals (P-4) -- P

Intervals 1--16

New edges

LEVEL 2

LEVEL 3(oldest)

In-edge query One shard on each level

Out-edge queryAll shards

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
– But needs twice the amount of space

• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
– Loopy BP: see Gonzalez et al. (2009)
– Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)

• Asynchronous is sometimes difficult to reason about and debug

• Asynchronous can be used to implement BSP
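A toy PageRank-style update in both styles makes the space point concrete (a sketch, not GraphChi code; inNbrs and outDeg are assumed accessors, and every vertex is assumed to have out-degree >= 1):

  // Jacobi / bulk-synchronous: reads come from the previous iteration's copy,
  // so two O(V) arrays are needed ("twice the amount of space").
  def jacobiStep(prev: Array[Double], inNbrs: Int => Seq[Int], outDeg: Int => Int): Array[Double] =
    Array.tabulate(prev.length) { v =>
      0.15 / prev.length + 0.85 * inNbrs(v).map(u => prev(u) / outDeg(u)).sum
    }

  // Gauss-Seidel / asynchronous: updates in place, so vertices processed later
  // in the same iteration already see the new values of earlier ones,
  // which often converges in fewer iterations.
  def gaussSeidelStep(x: Array[Double], inNbrs: Int => Seq[Int], outDeg: Int => Int): Unit =
    for (v <- x.indices)
      x(v) = 0.15 / x.length + 0.85 * inNbrs(v).map(u => x(u) / outDeg(u)).sum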

IO Complexity

• See the paper for theoretical analysis in the Aggarwal–Vitter I/O model
– Worst case is only 2x the best case

• Intuition:

[Figure: vertex intervals interval(1), interval(2), …, interval(P) over the range 1…|V|, with shards shard(1), shard(2), …, shard(P); example edge (v1, v2)]

– An edge whose endpoints lie in the same interval is loaded from disk only once per iteration
– An edge spanning two intervals is loaded twice per iteration
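As a rough reading of this intuition (a back-of-the-envelope sketch, not the paper's exact theorem statement), the per-iteration cost in the Aggarwal–Vitter model is on the order of

\[
\Theta\!\left(\frac{|E|}{B}\right)\ \text{sequential block transfers}
\;+\;
\Theta(P^2)\ \text{non-sequential reads},
\]

since every edge (and its value) is read once or twice and written back per iteration, and each of the P intervals reads one sliding window from each shard; the worst case (every edge spanning two intervals) is therefore at most about twice the best case.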

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; a large diameter would require too many iterations

• Per-iteration cost is not much affected

• Can we optimize partitioning?
– Could help thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort
– Likely too expensive to do on a single PC

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., the WebGraph framework) can be used to compress edges to 3–4 bits/edge (web), ~10 bits/edge (social)
– But they require graph partitioning, which requires a lot of memory
– Compression of large graphs can take days (personal communication)

• Compression is problematic for evolving graphs and their associated data

• GraphChi can be used to compress graphs
– Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases
– GOOD, GraphDB, HyperGraphDB, …
– Focus was on modeling; graph storage was built on top of a relational DB or a key-value store

• RDF databases
– Most do not use graph storage but store triples as relations + use indexing

• Modern solutions have proposed graph-specific storage
– Neo4j: doubly linked lists
– TurboGraph: adjacency lists chopped into pages
– DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time: 9 hours; FB's: 12 hours
• GraphChi database: about 250 GB; FB's: > 1.4 terabytes
– However, about 100 GB of the difference is explained by different variable-data (payload) sizes

• Facebook accesses MySQL via JDBC; GraphChi-DB is embedded
– But MySQL is native code, GraphChi-DB is Scala (JVM)

• Important: a CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance / size of DB

[Chart: requests/sec (0–10,000) vs. number of nodes (1.00E+07, 1.00E+08, 2.00E+08, 1.00E+09)]

Why not even faster when everything is in RAM? The requests are computationally bound.

Possible Solutions

1. Use SSD as a memory extension [SSDAlloc, NSDI'11]
   → Too many small objects; would need millions of accesses per second

2. Compress the graph structure to fit into RAM [WebGraph framework]
   → Associated values do not compress well, and are mutated

3. Cluster the graph and handle each cluster separately in RAM
   → Expensive; the number of inter-cluster edges is big

4. Caching of hot nodes
   → Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks: parallel I/O

[Chart: seconds per iteration (0–3000) for Pagerank and Connected components with 1, 2, and 3 hard drives]

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

[Chart: Connected Components on Mac Mini (SSD) – runtime (0–2500 s) for 1, 2, and 4 threads, broken down into Disk IO, Graph construction, and Exec. updates]

• The cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
– Graph construction requires a lot of random access in RAM; memory bandwidth becomes a bottleneck

Bottlenecks: Multicore

Experiment on a MacBook Pro with 4 cores and an SSD.

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime in seconds (loading vs. computation) as a function of the number of threads (1, 2, 4), for Matrix Factorization (ALS, 0–160 s) and Connected Components (0–1000 s)]

104

In-memory vs Disk

105

Experiment: Query latency

See thesis for I/O cost analysis of in/out queries

106

Example: Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB
– Sufficient to query for out-edges
– Parallelizes well: multi-out-edge-query

• Can be used for statistical graph analysis
– Sample induced neighborhoods, induced FoF neighborhoods from the graph
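A sketch of why out-edge queries suffice (outEdges is a hypothetical helper, not the actual GraphChi-DB API):

  // Induced subgraph of S: query the out-edges of each v in S and keep only the
  // edges whose other endpoint is also in S. Every qualifying edge (v, w) is
  // found exactly once, at its source v; the per-vertex queries are independent,
  // which is what makes the multi-out-edge-query parallelize well.
  def inducedSubgraph(s: Set[Long], outEdges: Long => Seq[Long]): Seq[(Long, Long)] =
    s.toSeq.flatMap(v => outEdges(v).filter(s.contains).map(w => (v, w)))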

Vertices / Nodes

• Vertices are partitioned similarly to edges
– Similar "data shards" for the columns

• Lookup/update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
– Sparse structure could be supported

[Figure: vertex data partitioned into intervals 1…a, a+1…2a, …, up to Max-id]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance the shards
– Interval length constant a

[Table: original IDs 0, 1, 2, …, 255 map to internal IDs 0, 1, 2, …; original IDs 256, 257, 258, …, 511 map to a, a+1, a+2, …; the next block of 256 maps to 2a, 2a+1, …, and so on]
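One concrete way to realize such a balancing map, consistent with the rows shown above (the exact scheme is an assumption; block size 256, P intervals of length a):

  // Blocks of 256 consecutive original IDs are dealt round-robin to the P intervals,
  // so every interval receives an (almost) equal share of the vertex ID space.
  def toInternalId(orig: Long, a: Long, numIntervals: Int): Long = {
    val block    = orig / 256             // which 256-ID block the original ID is in
    val offset   = orig % 256             // position inside that block
    val interval = block % numIntervals   // round-robin choice of interval
    val slot     = block / numIntervals   // how many blocks this interval already holds
    interval * a + slot * 256 + offset
  }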

109

What if we have a Cluster?

[Figure: as before, the graph and PSW working memory on each machine's disk; computational states (Computation 1 state, Computation 2 state, …) in RAM]

Trade latency for throughput.

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. … which is caused by the lack of good data available to academics: big graphs with metadata
– Industry co-operation? But there is a problem with reproducibility
– Also, it is hard to ask good questions about graphs (especially with just the structure)

3. Too much focus on performance. It is more important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems: "paging" from disk is costly. (This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
– Reverse thinking:

foreach interval p:
    walkSnapshot = getWalksForInterval(p)
    foreach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        foreach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: only the current position of each walk needs to be stored.
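For example (an illustration of this note, not necessarily DrunkardMob's exact encoding), a walk's whole state can be packed into one primitive, so billions of walks fit in a few gigabytes of RAM:

  // Pack (walk/source id, current vertex) into a single Long.
  def encodeWalk(sourceId: Int, currentVertex: Int): Long =
    (sourceId.toLong << 32) | (currentVertex.toLong & 0xffffffffL)

  def walkSource(w: Long): Int = (w >>> 32).toInt
  def walkVertex(w: Long): Int = (w & 0xffffffffL).toInt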

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" / Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW & External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM), Ref: O'Neil, Cheng et al. (1996)
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact: "Big Data" Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 60: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

61

Experiment Ingest

0 2000 4000 6000 8000 10000 12000 14000 1600000E+00

50E+08

10E+09

15E+09

GraphChi-DB with LSM GraphChi-No-LSM

Time (seconds)

Edge

s

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
– Store billions of random-walk states in RAM

• Multiple Breadth-First Searches
– Analyze neighborhood sizes by starting hundreds of random BFSes (see the sketch below)

• Compute many different recommender algorithms in parallel (or the same one with different parameterizations)
– See Mayank Mohta and Shu-Hao Yu's Master's project
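The multi-BFS example could look like the following sketch (illustrative, not thesis code): each vertex keeps a bit set of the searches that have reached it, the whole array stays in RAM, and one sweep over the edges (one PSW iteration over disk) advances all BFSes at once. The 64-search limit is only an artifact of using a single Long per vertex; hundreds of searches would use a wider bit set.

    // Sketch: up to 64 BFSes advanced together; O(V) state in RAM.
    object MultiBFS {
      def run(numVertices: Int, edges: Seq[(Int, Int)], sources: Seq[Int], hops: Int): Array[Long] = {
        require(sources.length <= 64)
        val reached = new Array[Long](numVertices)   // bit i set at v: BFS i has reached v
        sources.zipWithIndex.foreach { case (s, i) => reached(s) |= 1L << i }
        for (_ <- 1 to hops)
          edges.foreach { case (src, dst) => reached(dst) |= reached(src) }  // one edge sweep
        reached
      }
    }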

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin). UAI 2010

Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors). VLDB 2012

GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch). OSDI 2012

DrunkardMob: Billions of Random Walks on Just a PC. ACM RecSys 2013

Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch). SEA 2014

GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin). (submitted, arXiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin). ICML 2011

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation: GraphChi

• Extended PSW to design Partitioned Adjacency Lists for building a scalable graph database: GraphChi-DB

• Proposed DrunkardMob for simulating billions of random walks in parallel

• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms: a new approach for external-memory graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                     GraphChi (40 Mac Minis)    PowerGraph (64 EC2 cc1.4xlarge)
Investments          67,320 $                   -
Operating costs
  Per node / hour    0.03 $                     1.30 $
  Cluster / hour     1.19 $                     52.00 $
  Daily              28.56 $                    1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions:
- Mac Mini: 85 W (typical servers: 500-1000 W)
- Most expensive US energy: 35c / kWh

Equal-throughput configurations (based on OSDI'12).
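As a back-of-the-envelope check using only the numbers above: the cluster costs about 1,248.00 $ - 28.56 $ ≈ 1,219 $ more per day than the Mac Minis, and 67,320 $ / 1,219 $ per day ≈ 55 days, i.e. roughly the quoted 56 days to recoup the hardware investment.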

89

PSW for In-memory Computation

• External-memory setting
– Slow memory = hard disk / SSD
– Fast memory = RAM

• In-memory
– Slow = RAM
– Fast = CPU caches

[Diagram: CPU connected to a small (fast) memory of size M, which exchanges block transfers of size B with a large (slow) memory of 'unlimited' size]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW slightly outperforms Ligra*, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

* Julian Shun, Guy Blelloch, PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
– But needs twice the amount of space

• Async / G-S helps with high-diameter graphs
• Some algorithms converge much better asynchronously
– Loopy BP, see Gonzalez et al. (2009)
– Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)

• Asynchronous is sometimes difficult to reason about and debug

• Asynchronous can be used to implement BSP

I/O Complexity

• See the paper for theoretical analysis in the Aggarwal-Vitter I/O model
– Worst case is only 2x the best case

• Intuition:

[Diagram: vertex intervals interval(1), interval(2), …, interval(P) over vertex IDs 1 … |V|, each with its shard(1), shard(2), …, shard(P)]

An edge inside one interval is loaded from disk only once per iteration; an edge spanning two intervals is loaded twice per iteration.

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise they would require too many iterations
• Per-iteration cost is not much affected
• Can we optimize partitioning?
– Could help thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort
– Likely too expensive to do on a single PC

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., the WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
– But they require graph partitioning, which requires a lot of memory
– Compression of large graphs can take days (personal communication)

• Compression is problematic for evolving graphs and associated data

• GraphChi can be used to compress graphs
– Layered label propagation (Boldi et al., 2011)

95

Previous research on (single-computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases
– GOOD, GraphDB, HyperGraphDB…
– Focus was on modeling graph storage on top of a relational DB or key-value store

• RDF databases
– Most do not use graph storage but store triples as relations + use indexing

• Modern solutions have proposed graph-specific storage
– Neo4j: doubly linked list
– TurboGraph: adjacency list chopped into pages
– DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont.)

• GraphChi load time 9 hours, FB's 12 hours
• GraphChi database about 250 GB, FB > 1.4 terabytes
– However, about 100 GB of the difference is explained by different variable-data (payload) sizes

• Facebook: MySQL via JDBC; GraphChi embedded
– But MySQL is native code, GraphChi-DB Scala (JVM)

• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance / Size of DB

[Chart: requests / sec (0 - 10,000) as a function of the number of nodes (1.00E+07 to 1.00E+09)]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory-extension [SSDAlloc, NSDI'11]
   – Too many small objects; need millions / sec

2. Compress the graph structure to fit into RAM [WebGraph framework]
   – Associated values do not compress well and are mutated

3. Cluster the graph and handle each cluster separately in RAM
   – Expensive; the number of inter-cluster edges is big

4. Caching of hot nodes
   – Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance

Multiple hard drives (RAID-ish)

• GraphChi supports striping shards to multiple disks: parallel I/O

[Chart: seconds / iteration (0 - 3000) for Pagerank and Connected Components with 1, 2 and 3 hard drives]

Experiment on a 16-core AMD server (from year 2007).

Bottlenecks

[Chart: runtime (0 - 2500 s) broken into Disk IO / Graph construction / Exec updates for 1, 2 and 4 threads; Connected Components on Mac Mini, SSD]

• Cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
– Graph construction requires a lot of random access in RAM; memory bandwidth becomes a bottleneck

Bottlenecks: Multicore

Experiment on MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime in seconds, split into Loading and Computation, vs. number of threads (1, 2, 4); Matrix Factorization (ALS), 0 - 160 s, and Connected Components, 0 - 1000 s]

104

In-memory vs Disk

105

Experiment: Query latency

See the thesis for an I/O cost analysis of in/out queries.

106

Example: Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB
– Sufficient to query for out-edges (see the sketch below)
– Parallelizes well: multi-out-edge query

• Can be used for statistical graph analysis
– Sample induced neighborhoods, induced FoF neighborhoods from the graph
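A minimal sketch of why out-edges suffice (illustrative names, not the GraphChi-DB query API): ask each vertex in S for its out-edges and keep only the targets that are also in S; in GraphChi-DB this would be issued as one parallel multi-out-edge query.

    // Sketch: induced subgraph of a vertex set S from out-edge queries only.
    object InducedSubgraph {
      def query(s: Set[Long], outEdges: Long => Seq[Long]): Seq[(Long, Long)] =
        s.toSeq.flatMap(src => outEdges(src).filter(s.contains).map(dst => (src, dst)))
    }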

Vertices / Nodes

• Vertices are partitioned similarly to edges
– Similar "data shards" for columns

• Lookup / update of vertex data is O(1) (see the sketch below)
• No merge tree here; vertex files are "dense"
– Sparse structure could be supported

[Diagram: vertex ID ranges 1 … a, a+1 … 2a, …, up to Max-id]
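The O(1) lookup follows from the dense, fixed-width layout. A sketch (illustrative, not the actual storage code) of looking a record up by pure offset arithmetic:

    import java.io.RandomAccessFile

    // Sketch: dense fixed-width vertex column file => O(1) lookup/update by offset.
    class VertexColumn(path: String, recordBytes: Int) {
      private val file = new RandomAccessFile(path, "rw")
      def read(internalId: Long): Array[Byte] = {
        val buf = new Array[Byte](recordBytes)
        file.seek(internalId * recordBytes)      // no index or merge tree needed
        file.readFully(buf)
        buf
      }
      def write(internalId: Long, record: Array[Byte]): Unit = {
        require(record.length == recordBytes)
        file.seek(internalId * recordBytes)
        file.write(record)
      }
    }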

ID-mapping

• Vertex IDs are mapped to internal IDs to balance the shards
– Interval length constant a (see the sketch below)

[Diagram: original IDs 0, 1, 2, …, 255 map to internal IDs 0, a, 2a, …; original IDs 256, 257, 258, …, 511 map to 1, a+1, 2a+1, …; and so on]
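One plausible way to realize the striping in the figure (an assumption about the exact formula, not taken from the GraphChi-DB code): with P intervals of length a, consecutive original IDs are dealt round-robin to the intervals, so heavy low-ID vertices spread evenly over the shards.

    // Sketch: round-robin ID mapping over P intervals of length a (assumed formula).
    object IdMapping {
      def toInternal(orig: Long, p: Long, a: Long): Long = (orig % p) * a + orig / p
      def toOriginal(internal: Long, p: Long, a: Long): Long = (internal % a) * p + internal / a
    }

For example, with P = 256 this sends original IDs 0, 1, 2, … to internal IDs 0, a, 2a, … and 256, 257, 258, … to 1, a+1, 2a+1, …, matching the figure.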

109

What if we have a Cluster?

[Diagram: each machine keeps the graph working memory (PSW) on disk and the computational state (Computation 1 state, Computation 2 state, …) in RAM]

Trade latency for throughput

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. … which is caused by the lack of good data available to academics: big graphs with metadata
– Industry co-operation? But there is a problem with reproducibility
– Also, it is hard to ask good questions about graphs (especially with just the structure)

3. Too much focus on performance. More important to enable "extracting value"

DrunkardMob - RecSys '13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys '13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java, 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems: "paging" from disk is costly.

(This talk)

DrunkardMob - RecSys '13

Random walks in GraphChi

• DrunkardMob algorithm
– Reverse thinking

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: only the current position of each walk needs to be stored (see the sketch below).
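A compact sketch of the bookkeeping behind the note above (illustrative, not the DrunkardMob source): each walk is kept only as (walk id, current vertex), bucketed by the interval of its current vertex, so the loop over intervals touches exactly the walks that are currently 'at home' there.

    import scala.collection.mutable.ArrayBuffer

    // Sketch: walks bucketed by the interval of their current position.
    class WalkManager(numIntervals: Int, intervalOf: Int => Int) {
      private val buckets = Array.fill(numIntervals)(ArrayBuffer.empty[(Long, Int)])

      def addHop(walkId: Long, toVertex: Int): Unit =
        buckets(intervalOf(toVertex)) += ((walkId, toVertex))

      // Snapshot and clear an interval's walks before processing that interval.
      def snapshot(interval: Int): Seq[(Long, Int)] = {
        val walks = buckets(interval).toList
        buckets(interval).clear()
        walks
      }
    }

The outer loop over intervals would call snapshot(p) and then, for each vertex in the interval, advance the walks positioned at that vertex with addHop, mirroring the pseudocode above (the real system additionally records the visits, e.g. for computing visit counts).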

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" / Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW & External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref: O'Neil, Cheng et al. (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact: "Big Data" Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 61: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

62

Advantages of PAL

bull Only sparse and implicit indicesndash Pointer-array usually fits in RAM with Elias-

Gamma Small database size

bull Columnar data modelndash Load only data you needndash Graph structure is separate from datandash Property graph model

bull Great insertion throughput with LSMTree can be adjusted to match workload

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 62: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

63

EXPERIMENTAL COMPARISONS

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 63: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

64

GraphChi-DB Implementation

bull Written in Scalabull Queries amp Computationbull Online database

All experiments shown in this talk done on Mac Mini (8 GB SSD)

Source code and exampleshttpgithubcomgraphchi

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

(Diagram: the RAM holds the computational state — Computation 1 state, Computation 2 state, … — while the graph working memory (PSW) stays on disk.)

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
  – Store billions of random-walk states in RAM
• Multiple Breadth-First Searches
  – Analyze neighborhood sizes by starting hundreds of random BFSes (sketched below)
• Compute in parallel many different recommender algorithms (or with different parameterizations)
  – See Mayank Mohta, Shu-Hao Yu's Master's project
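The multi-BFS item above can be sketched as follows (an illustrative assumption, not the thesis code): up to 64 searches share one O(V) array by giving each BFS a single bit of a per-vertex mask, so hundreds of BFSes amount to a handful of such arrays in RAM.

    import java.util.Random;

    public class MultiBFS {
        public static void main(String[] args) {
            // Tiny in-memory example graph (adjacency lists).
            int[][] adj = {{1, 2}, {0, 3}, {0, 3}, {1, 2, 4}, {3, 5}, {4}};
            int n = adj.length, numBfs = 3, maxHops = 3;

            long[] reached = new long[n];               // bit b set => BFS b has reached this vertex
            Random rnd = new Random(42);
            for (int b = 0; b < numBfs; b++)
                reached[rnd.nextInt(n)] |= 1L << b;     // random start vertex for each BFS

            for (int hop = 1; hop <= maxHops; hop++) {
                long[] next = reached.clone();
                for (int v = 0; v < n; v++)
                    for (int nb : adj[v]) next[nb] |= reached[v];   // expand all BFSes by one hop
                reached = next;
                for (int b = 0; b < numBfs; b++) {
                    int size = 0;
                    for (long mask : reached) if ((mask >>> b & 1L) == 1L) size++;
                    System.out.println("BFS " + b + ": " + size + " vertices within " + hop + " hops");
                }
            }
        }
    }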

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin) — UAI 2010
Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors) — VLDB 2012
GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch) — OSDI 2012
DrunkardMob: Billions of Random Walks on Just a PC — ACM RecSys 2013
Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch) — SEA 2014
GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin) — (submitted, arXiv)
Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin) — ICML 2011

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation: GraphChi
• Extended PSW to design Partitioned Adjacency Lists, to build a scalable graph database: GraphChi-DB
• Proposed DrunkardMob for simulating billions of random walks in parallel
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms: new approach for external-memory graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) vs. PowerGraph (64 EC2 cc1.4xlarge):
– Investment: $67,320 vs. –
– Operating costs, per node / hour: $0.03 vs. $1.30
– Cluster / hour: $1.19 vs. $52.00
– Daily: $28.56 vs. $1,248.00

It takes about 56 days to recoup the Mac Mini investment.

Assumptions: Mac Mini 85 W (typical servers 500–1000 W); most expensive US energy 35 c / kWh; equal-throughput configurations (based on OSDI '12).
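As a back-of-the-envelope check of the 56-day figure (my arithmetic, not a slide from the talk): the daily operating-cost difference is $1,248.00 − $28.56 ≈ $1,219, so the $67,320 hardware investment is recouped after roughly 67,320 / 1,219 ≈ 55 days of continuous operation.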

89

PSW for In-memory Computation

• External-memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM
• In-memory
  – Slow = RAM
  – Fast = CPU caches

(Diagram: the external-memory model — a small, fast memory of size M, a large, slow memory of 'unlimited' size, and block transfers of size B between them, driven by the CPU.)

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

(Ligra: Julian Shun, Guy Blelloch, PPoPP 2013)

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space (sketched below)
• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP, see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)
• Asynchronous sometimes difficult to reason about and debug
• Asynchronous can be used to implement BSP
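A minimal sketch of the space point above, in plain Java (illustrative, not GraphChi code): the bulk-synchronous (Jacobi) step reads old values and writes new ones, so it needs two arrays, while the asynchronous (Gauss-Seidel) step updates a single array in place and immediately sees fresh values.

    import java.util.Arrays;

    public class SyncVsAsync {
        // Toy graph: propagate the minimum label among neighbors (connected components style).
        static int[][] adj = {{1}, {0, 2}, {1, 3}, {2}, {5}, {4}};

        // Bulk-synchronous (Jacobi): reads old values, writes new ones -> two arrays.
        static void syncStep(int[] oldLabel, int[] newLabel) {
            for (int v = 0; v < adj.length; v++) {
                int m = oldLabel[v];
                for (int nb : adj[v]) m = Math.min(m, oldLabel[nb]);
                newLabel[v] = m;
            }
        }

        // Asynchronous (Gauss-Seidel): updates one array in place, sees fresh values at once.
        static void asyncStep(int[] label) {
            for (int v = 0; v < adj.length; v++) {
                int m = label[v];
                for (int nb : adj[v]) m = Math.min(m, label[nb]);
                label[v] = m;
            }
        }

        public static void main(String[] args) {
            int[] label = {0, 1, 2, 3, 4, 5};
            asyncStep(label);                            // a single in-place sweep
            System.out.println(Arrays.toString(label));  // labels already propagated through the chain
        }
    }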

IO Complexity

• See the paper for a theoretical analysis in Aggarwal–Vitter's I/O model
  – Worst case is only 2x the best case
• Intuition:

(Diagram: the vertex range 1…|V| is split into interval(1)…interval(P) with shard(1)…shard(P).)

An edge whose endpoints fall in the same interval is loaded from disk only once per iteration; an edge spanning two intervals is loaded twice per iteration — so every edge costs at most two loads per iteration, which is where the "worst case only 2x best case" comes from.

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter: otherwise too many iterations would be required
• Per-iteration cost not much affected
• Can we optimize partitioning?
  – Could help thanks to Gauss-Seidel (faster convergence inside 'groups'): topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression Would it help

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3–4 bits / edge (web), ~10 bits / edge (social)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)
• Compression problematic for evolving graphs and associated data
• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• 1990s, 2000s saw interest in object-oriented and graph databases
  – GOOD, GraphDB, HyperGraphDB…
  – Focus was on modeling; graph storage on top of a relational DB or key-value store
• RDF databases
  – Most do not use graph storage, but store triples as relations + use indexing
• Modern solutions have proposed graph-specific storage
  – Neo4j: doubly linked list
  – TurboGraph: adjacency list chopped into pages
  – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time: 9 hours; FB's: 12 hours
• GraphChi database about 250 GB; FB > 1.4 terabytes
  – However, about 100 GB is explained by different variable-data (payload) size
• Facebook: MySQL via JDBC; GraphChi: embedded
  – But MySQL is native code, GraphChi-DB Scala (JVM)
• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance / size of DB

(Chart: requests / sec (0–10,000) as a function of the number of nodes: 1.00E+07, 1.00E+08, 2.00E+08, 1.00E+09.)

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory extension [SSDAlloc, NSDI '11]
   → Too many small objects; need millions / sec
2. Compress the graph structure to fit into RAM [WebGraph framework]
   → Associated values do not compress well, and are mutated
3. Cluster the graph and handle each cluster separately in RAM
   → Expensive: the number of inter-cluster edges is big
4. Caching of hot nodes
   → Unpredictable performance

Number of Shards

• If P is in the 'dozens', there is not much effect on performance

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks: parallel I/O

(Chart: seconds / iteration for Pagerank and Connected components with 1, 2, and 3 hard drives; y-axis 0–3000. Experiment on a 16-core AMD server from year 2007.)

Bottlenecks

(Chart: Connected Components on Mac Mini, SSD; runtime with 1, 2, and 4 threads, split into Disk IO / Graph construction / Exec. updates; y-axis 0–2500 seconds.)

• Cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM; memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores / SSD.

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

(Charts: runtime in seconds, split into Loading and Computation, for 1, 2, and 4 threads; Matrix Factorization (ALS), y-axis 0–160, and Connected Components, y-axis 0–1000.)

104

In-memory vs Disk

105

Experiment: Query latency

See thesis for I/O cost analysis of in/out queries.

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S
• Very fast in GraphChi-DB
  – Sufficient to query for out-edges (see the sketch below)
  – Parallelizes well: multi-out-edge-query
• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from the graph
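A small sketch of why out-edge queries suffice (helper names are hypothetical, not GraphChi-DB's real API): for every vertex u in S, fetch u's out-edges and keep exactly those whose destination is also in S.

    import java.util.ArrayList;
    import java.util.List;
    import java.util.Set;

    public class InducedSubgraph {
        // Stand-in for GraphChi-DB's out-edge query (hypothetical signature).
        static int[] outEdges(int vertexId) {
            return new int[0];   // ... would query the database ...
        }

        // Induced subgraph of vertex set S: out-edge queries are enough, because every
        // edge (u, w) with both endpoints in S is seen when u's out-edges are scanned.
        static List<int[]> inducedEdges(Set<Integer> S) {
            List<int[]> edges = new ArrayList<>();
            for (int u : S)
                for (int w : outEdges(u))        // could be issued as one multi-out-edge query
                    if (S.contains(w)) edges.add(new int[]{u, w});
            return edges;
        }
    }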

Vertices / Nodes

• Vertices are partitioned similarly as edges
  – Similar 'data shards' for columns
• Lookup/update of vertex data is O(1)
• No merge tree here: vertex files are 'dense'
  – Sparse structure could be supported

(Diagram: vertex ID ranges 1…a, a+1…2a, …, up to Max-id.)
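A sketch of the O(1) lookup above (illustrative; the record size and file layout are assumptions): because the vertex file is dense and records have a fixed size, the byte offset of a vertex's data is simply internalId × recordSize.

    import java.io.RandomAccessFile;

    public class VertexFile {
        static final int RECORD_BYTES = 4;   // assumption: one float value per vertex

        static float read(RandomAccessFile f, long internalId) throws Exception {
            f.seek(internalId * RECORD_BYTES);    // dense layout -> direct offset, O(1)
            return f.readFloat();
        }

        static void write(RandomAccessFile f, long internalId, float v) throws Exception {
            f.seek(internalId * RECORD_BYTES);
            f.writeFloat(v);
        }

        public static void main(String[] args) throws Exception {
            try (RandomAccessFile f = new RandomAccessFile("vertexdata.bin", "rw")) {
                write(f, 42, 3.14f);
                System.out.println(read(f, 42));
            }
        }
    }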

ID-mapping

• Vertex IDs mapped to internal IDs to balance shards
  – Interval length constant a

(Diagram: original IDs 0, 1, 2, …, 255; 256, 257, 258, …, 511; … shown against internal IDs 0, 1, 2, …; a, a+1, a+2, …; 2a, 2a+1, …)
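For illustration, one simple mapping with the stated property — original IDs spread evenly over P intervals of constant length a — could look like the following; this is an assumption for exposition, not necessarily the exact scheme GraphChi-DB uses.

    public class IdMapping {
        // interval = origId mod P; offset inside the interval = origId div P
        static long toInternal(long origId, long a, int P) {
            return (origId % P) * a + (origId / P);
        }

        static long toOriginal(long internalId, long a, int P) {
            return (internalId % a) * P + (internalId / a);
        }

        public static void main(String[] args) {
            long a = 1000; int P = 4;                    // 4 intervals of length 1000
            for (long id : new long[]{0, 1, 2, 3, 4, 5})
                System.out.println(id + " -> " + toInternal(id, a, P)
                        + " -> back to " + toOriginal(toInternal(id, a, P), a, P));
        }
    }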

109

What if we have a Cluster

(Diagram: as in the semi-external setting, each machine keeps the computational state — Computation 1 state, Computation 2 state, … — in RAM, and the graph working memory (PSW) on disk.)

Trade latency for throughput

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. … which is caused by lack of good data available for academics: big graphs with metadata
   – Industry co-operation? But problem with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just the structure)
3. Too much focus on performance; more important to enable 'extracting value'

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk a time (multiple in parallel, of course):

    parfor walk in walks:
        for i = 1 to numsteps:
            vertex = walk.atVertex()
            walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

(Image: Twitter network visualization by Akshay Java, 2009.)

• Distributed graph systems: each hop across a partition boundary is costly
• Disk-based 'single-machine' graph systems (this talk): 'paging' from disk is costly

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
  – Reverse thinking:

    ForEach interval p:
        walkSnapshot = getWalksForInterval(p)
        ForEach vertex in interval(p):
            mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
            ForEach walk in mywalks:
                walkManager.addHop(walk, vertex.randomNeighbor())

  Note: need to store only the current position of each walk!
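A compact, runnable sketch of the 'reverse thinking' above, in plain Java with illustrative names (the real DrunkardMob implementation differs): each walk is represented only by its current vertex, and one pass advances, interval by interval, exactly the walks whose snapshot position falls in that interval.

    import java.util.Random;

    public class DrunkardMobSketch {
        public static void main(String[] args) {
            int[][] adj = {{1, 2}, {2}, {0, 3}, {1}};    // toy in-memory graph, standing in for shards
            int numVertices = adj.length, numWalks = 1000, numSteps = 5, intervalLen = 2;
            Random rnd = new Random(1);

            int[] walkAt = new int[numWalks];            // only the current vertex of each walk
            for (int w = 0; w < numWalks; w++) walkAt[w] = rnd.nextInt(numVertices);

            for (int step = 0; step < numSteps; step++) {
                int[] snapshot = walkAt.clone();         // like getWalksForInterval: fix positions for this pass
                for (int start = 0; start < numVertices; start += intervalLen) {
                    int end = Math.min(start + intervalLen, numVertices);
                    for (int w = 0; w < numWalks; w++) {
                        int v = snapshot[w];
                        if (v >= start && v < end) {     // walk currently sits in this interval
                            int[] nbrs = adj[v];
                            walkAt[w] = nbrs[rnd.nextInt(nbrs.length)];   // take one hop
                        }
                    }
                }
            }
            System.out.println("Walk 0 ended at vertex " + walkAt[0]);
        }
    }

In the actual system the walks would be bucketed by interval rather than scanned linearly, and the graph interval processed is the sub-graph PSW currently holds in memory.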

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" / Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW & External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref: O'Neil, Cheng et al. (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact: "Big Data" Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 64: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

65

Comparison Database Size

Baseline 4 + 4 bytes edge

MySQL (data + indices)

Neo4j

GraphChi-DB

BASELINE

0 10 20 30 40 50 60 70

Database file size (twitter-2010 graph 15B edges)

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 65: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

66

Comparison Ingest

System Time to ingest 15B edges

GraphChi-DB (ONLINE) 1 hour 45 mins

Neo4j (batch) 45 hours

MySQL (batch) 3 hour 30 minutes (including index creation)

If running Pagerank simultaneously GraphChi-DB takes 3 hour 45 minutes

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 66: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

Comparison Friends-of-Friends Query

See thesis for shortest-path comparison

GraphChi-DB Neo4j0

00501

01502

02503

03504 0379

0127

Small graph - 50-percentile

mill

iseco

nds

GraphChi-DB Neo4j0123456789

6653

8078

Small graph - 99-percentile

mill

iseco

nds

GraphChi-DB

Neo4j MySQL0

40

80

120

160

200

224

7598

59

Big graph - 50-percentile

mill

iseco

nds 0

2000

4000

6000

1264 1631

4776

Big graph - 99-percentile

mill

isec

onds

Latency percentiles over 100K random queries

68M edges

15B edges

68

LinkBench Online Graph DB Benchmark by Facebook

bull Concurrent readwrite workloadndash But only single-hop queries (ldquofriendsrdquo)ndash 8 different operations mixed workloadndash Best performance with 64 parallel threads

bull Each edge and vertex hasndash Version timestamp type random string payload

GraphChi-DB (Mac Mini) MySQL+FB patch server SSD-array 144 GB RAM

Edge-update (95p) 22 ms 25 ms

Edge-get-neighbors (95p)

18 ms 9 ms

Avg throughput 2487 reqs 11029 reqs

Database size 350 GB 14 TB

See full results in the thesis

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation: GraphChi

• Extended PSW to design Partitioned Adjacency Lists, used to build a scalable graph database: GraphChi-DB

• Proposed DrunkardMob for simulating billions of random walks in parallel

• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms: a new approach for external-memory graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                      GraphChi (40 Mac Minis)    PowerGraph (64 EC2 cc1.4xlarge)
  Investments         67,320 $                   -
  Operating costs
    Per node, hour    0.03 $                     1.30 $
    Cluster, hour     1.19 $                     52.00 $
    Daily             28.56 $                    1,248.00 $

It takes about 56 days to recoup the Mac Mini investment.

Assumptions: Mac Mini 85 W (typical servers 500-1000 W); most expensive US energy, 35 c / kWh.

Equal-throughput configurations (based on OSDI'12).

89
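A back-of-the-envelope check of the payback period (a note added here; it assumes the daily operating costs above are the only recurring difference between the two configurations):

  67,320 / (1,248.00 - 28.56) ≈ 55 days,

which is consistent with the slide's "about 56 days".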

PSW for In-memory Computation

• External memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM

• In-memory
  – Slow = RAM
  – Fast = CPU caches

[Diagram: a CPU attached to a small (fast) memory of size M and a large (slow) memory of 'unlimited' size; data moves between them in block transfers of size B.]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra*, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including the preprocessing steps of both systems.

* Julian Shun, Guy Blelloch, PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space

• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP: see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)

• Asynchronous is sometimes difficult to reason about and debug

• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in Aggarwal and Vitter's I/O model
  – Worst-case only 2x best-case

• Intuition:

[Diagram: the vertex range 1 … |V| split into interval(1), interval(2), …, interval(P), with the corresponding shard(1), shard(2), …, shard(P).]

An edge inside a single interval is loaded from disk only once per iteration.
An edge spanning two intervals is loaded twice per iteration.

93
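One way to read this intuition quantitatively (a rough paraphrase, not the exact theorem; see the thesis for the precise statement and constants): with block size B, a full PSW iteration costs on the order of |E|/B block transfers, roughly between 2|E|/B (each edge read once and written once) and 4|E|/B (each edge read twice and written twice), plus on the order of P^2 non-sequential seeks for the sliding windows. This matches the "worst-case only 2x best-case" remark above.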

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise too many iterations would be required

• Per-iteration cost is not much affected

• Can we optimize partitioning?
  – Could help thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., the WebGraph Framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)

• Compression is problematic for evolving graphs and associated data

• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases
  – GOOD, GraphDB, HyperGraphDB…
  – Focus was on modeling graph storage on top of a relational DB or a key-value store

• RDF databases
  – Most do not use graph storage, but store triples as relations + use indexing

• Modern solutions have proposed graph-specific storage
  – Neo4j: doubly linked list
  – TurboGraph: adjacency list chopped into pages
  – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time 9 hours; FB's 12 hours
• GraphChi database about 250 GB; FB > 1.4 terabytes
  – However, about 100 GB is explained by different variable data (payload) size

• Facebook: MySQL via JDBC; GraphChi: embedded
  – But MySQL is native code, GraphChi-DB is Scala (JVM)

• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance / Size of DB

[Chart: requests / sec (0 to 10,000) as a function of the number of nodes (1.0E+07, 1.0E+08, 2.0E+08, 1.0E+09).]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory extension [SSDAlloc, NSDI'11]
   – Too many small objects; need millions / sec

2. Compress the graph structure to fit into RAM [WebGraph framework]
   – Associated values do not compress well, and are mutated

3. Cluster the graph and handle each cluster separately in RAM
   – Expensive; the number of inter-cluster edges is big

4. Caching of hot nodes
   – Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks: parallel I/O

[Chart: seconds / iteration (0 to 3000) for Pagerank and Connected Components with 1, 2 and 3 hard drives.]

Experiment on a 16-core AMD server (from year 2007).

Bottlenecks

[Chart: runtime (0 to 2500 s) with 1, 2 and 4 threads, broken down into Disk IO, Graph construction and Exec updates.]

Connected Components on Mac Mini / SSD.

• The cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM; memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

• Computationally intensive applications benefit substantially from parallel execution

• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime in seconds vs. number of threads (1, 2, 4), split into Loading and Computation, for Matrix Factorization (ALS, 0 to 160 s) and Connected Components (0 to 1000 s).]

104

In-memory vs Disk

105

Experiment Query latency

See the thesis for the I/O cost analysis of in/out queries.

106

Example: Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB
  – Sufficient to query for out-edges (a minimal sketch follows below)
  – Parallelizes well: multi-out-edge query

• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from the graph
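A minimal sketch of the idea (the query interface and names are illustrative assumptions, not the GraphChi-DB API): query the out-edges of every vertex in S and keep the edges whose target is also in S.

import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class InducedSubgraphSketch {
    static final class Edge { final long src, dst; Edge(long s, long d) { src = s; dst = d; } }

    interface OutEdgeQuery {                 // stand-in for the database's out-edge query
        List<Edge> outEdges(long vertexId);
    }

    static List<Edge> inducedSubgraph(Set<Long> s, OutEdgeQuery db) {
        List<Edge> result = new ArrayList<>();
        // The per-vertex queries are independent, so they parallelize well
        // (the slide's multi-out-edge query).
        for (long v : s)
            for (Edge e : db.outEdges(v))
                if (s.contains(e.dst)) result.add(e);    // keep edges with both endpoints in S
        return result;
    }

    public static void main(String[] args) {
        Map<Long, List<Edge>> adj = Map.of(
                1L, List.of(new Edge(1, 2), new Edge(1, 5)),
                2L, List.of(new Edge(2, 3)),
                3L, List.of());
        OutEdgeQuery db = v -> adj.getOrDefault(v, List.of());
        System.out.println(inducedSubgraph(Set.of(1L, 2L, 3L), db).size());  // prints 2 (1->2, 2->3)
    }
}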

Vertices / Nodes

• Vertices are partitioned similarly to edges
  – Similar "data shards" for columns

• Lookup/update of vertex data is O(1) (a minimal sketch follows below)
• No merge tree here: vertex files are "dense"
  – A sparse structure could be supported

[Diagram: internal vertex IDs 1 … a, a+1 … 2a, …, up to Max-id, one dense file per interval.]
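A sketch of why the lookup is O(1) (the file layout and names are assumptions for illustration, not GraphChi-DB code): a dense vertex column stores one fixed-size record per internal ID of its interval, so reading or writing a value is a single seek at a computed offset.

import java.io.IOException;
import java.io.RandomAccessFile;

public class DenseVertexColumnSketch {
    private static final int RECORD_BYTES = 4;   // one float value per vertex
    private final RandomAccessFile file;
    private final long intervalStart;            // first internal ID stored in this file

    DenseVertexColumnSketch(String path, long intervalStart) throws IOException {
        this.file = new RandomAccessFile(path, "rw");
        this.intervalStart = intervalStart;
    }

    float get(long internalId) throws IOException {
        file.seek((internalId - intervalStart) * RECORD_BYTES);   // O(1): no index, no merge tree
        return file.readFloat();
    }

    void set(long internalId, float value) throws IOException {
        file.seek((internalId - intervalStart) * RECORD_BYTES);
        file.writeFloat(value);
    }
}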

ID-mapping

• Vertex IDs are mapped to internal IDs to balance the shards
  – Interval length is a constant a

[Figure: original IDs 0, 1, 2, …, 255 map to internal IDs 0, a, 2a, …, and original IDs 256, 257, 258, …, 511 map to 1, a+1, 2a+1, …, so consecutive original IDs are spread across intervals.]

109
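The figure suggests a simple stride mapping; the following is a hypothetical sketch of one mapping consistent with it (the exact scheme is not spelled out on the slide, so treat the formula as an assumption): consecutive original IDs are assigned round-robin to the internal-ID intervals of length a.

public class IdMappingSketch {
    // Assumed mapping: original ID v goes to interval (v mod numIntervals),
    // at offset (v / numIntervals) inside that interval.
    static long toInternal(long originalId, long a, long numIntervals) {
        return (originalId % numIntervals) * a + (originalId / numIntervals);
    }

    static long toOriginal(long internalId, long a, long numIntervals) {
        return (internalId % a) * numIntervals + (internalId / a);
    }

    public static void main(String[] args) {
        long a = 1_000, n = 256;                   // interval length and interval count (illustrative)
        System.out.println(toInternal(1, a, n));   // a       (cf. figure: 1 -> a)
        System.out.println(toInternal(257, a, n)); // a + 1   (cf. figure: 257 -> a+1)
        System.out.println(toOriginal(toInternal(257, a, n), a, n)); // 257, round-trips
    }
}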

What if we have a Cluster

[Diagram: on each machine, the graph working memory (PSW) stays on disk, while the computational state (Computation 1 state, Computation 2 state, …) is kept in RAM.]

Trade latency for throughput

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. … which is caused by the lack of good data available to academics: big graphs with metadata
   – Industry co-operation? But problem with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just the structure)

3. Too much focus on performance. More important to enable "extracting value".

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

  parfor walk in walks:
      for i = 1 to numsteps:
          vertex = walk.atVertex()
          walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems (this talk): "paging" from disk is costly.

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm – reverse thinking:

  foreach interval p:
      walkSnapshot = getWalksForInterval(p)
      foreach vertex in interval(p):
          mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
          foreach walk in mywalks:
              walkManager.addHop(walk, vertex.randomNeighbor())

Note: only the current position of each walk needs to be stored (a hedged sketch of a compact walk encoding follows below).
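The note above is what makes billions of walks fit in RAM. The following is a hypothetical illustration (the actual DrunkardMob encoding differs in its details): a walk is packed into a single long holding a walk id and the id of the vertex it currently sits at, and walks are bucketed by the interval that owns that vertex.

public class WalkEncodingSketch {
    // Assumed packing: 24 bits of walk id, 40 bits of current vertex id.
    static long encode(int walkId, long currentVertex) { return ((long) walkId << 40) | currentVertex; }
    static int  walkId(long walk)        { return (int) (walk >>> 40); }
    static long currentVertex(long walk) { return walk & ((1L << 40) - 1); }

    public static void main(String[] args) {
        long w = encode(123, 4_000_000_000L);
        System.out.println(walkId(w));          // 123
        System.out.println(currentVertex(w));   // 4000000000
        // Taking a hop just re-encodes the walk at its new vertex and appends it to the
        // bucket of the interval owning that vertex (cf. walkManager.addHop above).
    }
}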


bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 68: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

69

Summary of Experiments

bull Efficient for mixed readwrite workloadndash See Facebook LinkBench experiments in

thesisndash LSM-tree trade-off read performance (but

can adjust)

bull State-of-the-art performance for graphs that are much larger than RAMndash Neo4Jrsquos linked-list data structure good for

RAMndash DEX performs poorly in practice

More experiments in the thesis

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 69: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

70

Discussion

Greater ImpactHindsight

Future Research Questions

71

GREATER IMPACT

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks: Multicore

Experiment on MacBook Pro with 4 cores / SSD

• Computationally intensive applications benefit substantially from parallel execution

• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime in seconds, split into loading vs. computation, for 1, 2, and 4 threads – Matrix Factorization (ALS) and Connected Components]

104

In-memory vs Disk

105

Experiment: Query latency

See thesis for I/O cost analysis of in/out queries.

106

Example: Induced Subgraph Queries

• Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB
– Sufficient to query for out-edges
– Parallelizes well: multi-out-edge-query

• Can be used for statistical graph analysis
– Sample induced neighborhoods, induced FoF neighborhoods from the graph
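To illustrate why out-edge queries suffice, here is a minimal Java sketch of an induced-subgraph query; the outEdges helper, the Edge record, and the in-memory adjacency map are assumptions for the example, not GraphChi-DB's actual API.

import java.util.*;

class InducedSubgraphSketch {

    record Edge(long src, long dst) {}

    // Hypothetical out-edge lookup; a real implementation would issue an out-edge query to the database.
    static List<Long> outEdges(Map<Long, List<Long>> graph, long v) {
        return graph.getOrDefault(v, List.of());
    }

    // Induced subgraph of S: for every v in S, keep only the out-edges whose target is also in S.
    static List<Edge> inducedSubgraph(Set<Long> s, Map<Long, List<Long>> graph) {
        List<Edge> edges = new ArrayList<>();
        for (long v : s) {                          // per-vertex queries are independent -> parallelizes well
            for (long w : outEdges(graph, v)) {
                if (s.contains(w)) edges.add(new Edge(v, w));
            }
        }
        return edges;
    }

    public static void main(String[] args) {
        Map<Long, List<Long>> graph = Map.of(1L, List.of(2L, 3L), 2L, List.of(3L, 4L));
        System.out.println(inducedSubgraph(Set.of(1L, 2L, 3L), graph));  // keeps 1->2, 1->3, 2->3
    }
}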

Vertices / Nodes

• Vertices are partitioned similarly as edges
– Similar "data shards" for columns

• Lookup/update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
– Sparse structure could be supported

[Diagram: vertex file split at internal IDs 1 … a, a+1 … 2a, …, Max-id]
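The O(1) lookup follows from the dense layout: with fixed-size records, a vertex's byte offset is a direct function of its internal ID. A minimal Java sketch, assuming a fixed record size of one long per vertex (the actual on-disk format may differ):

import java.io.IOException;
import java.io.RandomAccessFile;

class VertexFileSketch {
    static final int RECORD_SIZE = 8;   // assumed fixed-width value per vertex

    // Dense layout: one seek to internalId * RECORD_SIZE, no index or merge tree involved.
    static long readValue(RandomAccessFile file, long internalId) throws IOException {
        file.seek(internalId * RECORD_SIZE);
        return file.readLong();
    }

    static void writeValue(RandomAccessFile file, long internalId, long value) throws IOException {
        file.seek(internalId * RECORD_SIZE);
        file.writeLong(value);
    }
}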

ID-mapping

• Vertex IDs mapped to internal IDs to balance shards
– Interval length constant a

[Table: original IDs 0, 1, 2, …, 255; 256, 257, 258, …, 511; … mapped to internal IDs 0, 1, 2, …; a, a+1, a+2, …; 2a, 2a+1, …]
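One simple scheme consistent with the slide's goal of balancing shards is round-robin interleaving of original IDs across internal intervals. The formula, the stride of 256, and the interval length below are assumptions for illustration only, not necessarily the exact GraphChi-DB mapping.

class IdMappingSketch {
    static final long STRIDE = 256;                  // assumed interleaving factor
    static final long INTERVAL_LENGTH = 1_000_000;   // the constant 'a'; value chosen only for illustration

    // Consecutive original IDs are spread across internal ranges 0.., a.., 2a.., ...,
    // so newly created vertices land in different intervals and shards stay balanced.
    // (Valid while originalId / STRIDE < INTERVAL_LENGTH.)
    static long toInternal(long originalId) {
        return (originalId % STRIDE) * INTERVAL_LENGTH + originalId / STRIDE;
    }

    static long toOriginal(long internalId) {
        return (internalId % INTERVAL_LENGTH) * STRIDE + internalId / INTERVAL_LENGTH;
    }
}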

109

What if we have a Cluster?

[Diagram: computational state (Computation 1 state, Computation 2 state, …) in RAM; graph working memory (PSW) on disk]

Trade latency for throughput

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. … which is caused by lack of good data available to academics: big graphs with metadata
– Industry co-operation? But problems with reproducibility
– Also, it is hard to ask good questions about graphs (especially with just structure)

3. Too much focus on performance. More important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if Graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

Distributed graph systems:
– Each hop across partition boundary is costly

Disk-based "single-machine" graph systems:
– "Paging" from disk is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
– Reverse thinking

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk.
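Because only the current position is needed, a walk's state can be packed into a single primitive and bucketed by the interval of its current vertex. The Java sketch below illustrates that idea under assumed encodings (source vertex in the high 32 bits, current vertex in the low 32 bits); it is not the actual DrunkardMob / WalkManager implementation.

import java.util.ArrayList;
import java.util.List;

class WalkManagerSketch {
    private final long intervalLength;           // assumed constant number of vertices per interval
    private final List<List<Long>> buckets;      // walks grouped by the interval of their current vertex

    WalkManagerSketch(int numIntervals, long intervalLength) {
        this.intervalLength = intervalLength;
        this.buckets = new ArrayList<>();
        for (int i = 0; i < numIntervals; i++) buckets.add(new ArrayList<>());
    }

    // Pack (source vertex, current vertex) into one long: the only state a walk needs.
    static long encode(long source, long current) { return (source << 32) | (current & 0xFFFFFFFFL); }
    static long source(long walk)  { return walk >>> 32; }
    static long current(long walk) { return walk & 0xFFFFFFFFL; }

    // addHop: advance the walk to nextVertex and file it under that vertex's interval,
    // so the next PSW pass over that interval will pick it up.
    void addHop(long walk, long nextVertex) {
        buckets.get((int) (nextVertex / intervalLength)).add(encode(source(walk), nextVertex));
    }

    List<Long> walksForInterval(int p) { return buckets.get(p); }
}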

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 71: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

72

Impact ldquoBig Datardquo Research

bull GraphChirsquos OSDI 2012 paper has received over 85 citations in just 18 months (Google Scholar)

ndash Two major direct descendant papers in top conferencesbull X-Stream SOSP 2013bull TurboGraph KDD 2013

bull Challenging the mainstreamndash You can do a lot on just a PC focus on right

data structures computational models

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 72: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

73

Impact Users

bull GraphChirsquos (C++ Java -DB) have gained a lot of usersndash Currently ~50 unique visitors day

bull Enables lsquoeveryonersquo to tackle big graph problemsndash Especially the recommender toolkit (by Danny

Bickson) has been very popularndash Typical users students non-systems researchers

small companieshellip

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 73: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

74

Impact Users (cont)

I work in a [EU country] public university I cant use a a distributed computing cluster for my research it is too expensive

Using GraphChi I was able to perform my experiments on my laptop I thus have to admit that GraphChi saved my research ()

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

• Very large vertex state
• Traversals and two-hop dependencies
– Or dynamic scheduling (such as Splash BP)

• High-diameter graphs, such as planar graphs
– Unless the computation itself has short-range interactions

• Very large number of iterations
– Neural networks
– LDA with collapsed Gibbs sampling

• No support for implicit graph structure

+ Single PC performance is limited

78

Versatility of PSW and PAL

[Diagram: the two systems and the workloads they support. GraphChi (Parallel Sliding Windows) handles Graph Computation (batch, evolving-graph and incremental); GraphChi-DB (Partitioned Adjacency Lists) adds online graph updates and Graph Queries; DrunkardMob provides parallel random walk simulation; GraphChi^2. Workloads shown: Triangle Counting / Item-Item Similarity, PageRank / SALSA / HITS, Graph Contraction / Minimum Spanning Forest, k-Core, Shortest Path, Friends-of-Friends, Induced Subgraphs, Neighborhood query, Edge and vertex properties, Link prediction, Weakly / Strongly Connected Components, Community Detection / Label Propagation, Multi-BFS, Graph sampling, Matrix Factorization, Loopy Belief Propagation / Co-EM, Graph traversal.]

79

Future Research Directions

• Distributed setting
1. Distributed PSW (one shard / node)
– PSW is inherently sequential
– Low bandwidth in the Cloud
2. Co-operating GraphChi(-DB)'s connected with a Parameter Server

• New graph programming models and tools
– Vertex-centric programming is sometimes too local: for example, two-hop interactions and many traversals are cumbersome
– Abstractions for learning graph structure; implicit graphs
– Hard to debug, especially async; better tools needed

• Graph-aware optimizations to GraphChi-DB
– Buffer management
– Smart caching
– Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
– Good: works with very little memory
– Bad: does not benefit from more memory

• Simple trick: cache some data

• Many graph algorithms have O(V) state
– The update function accesses neighbor vertex state
• Standard PSW: "broadcast" vertex value via edges
• Semi-external: store vertex values in memory (see the sketch below)
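
To make the semi-external idea concrete, here is a minimal sketch (my own illustration with a toy in-memory graph; in GraphChi the adjacency lists would be streamed from disk with PSW, and the API names would differ): the only O(V) state is a float array of vertex values kept in RAM, and the update reads neighbor values directly instead of broadcasting them along edges.

import java.util.Arrays;

public class SemiExternalPageRankSketch {
    public static void main(String[] args) {
        // Toy graph: inNeighbors[v] lists the in-neighbors of v; outDegree[v] is v's out-degree.
        // In GraphChi these adjacency lists would be streamed from disk, interval by interval.
        int[][] inNeighbors = { {1, 2}, {2}, {0, 1} };
        int[] outDegree     = { 1, 2, 2 };
        int n = inNeighbors.length;

        float[] value = new float[n];                   // the O(V) state kept in RAM
        Arrays.fill(value, 1.0f / n);

        for (int iter = 0; iter < 20; iter++) {
            for (int v = 0; v < n; v++) {
                float sum = 0;
                for (int nb : inNeighbors[v]) {
                    sum += value[nb] / outDegree[nb];   // read neighbor state directly from RAM
                }
                value[v] = 0.15f / n + 0.85f * sum;     // no per-edge "broadcast" needed
            }
        }
        System.out.println(Arrays.toString(value));
    }
}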

82

Using RAM efficiently

• Assume there is enough RAM to store many O(V) algorithm states in memory
– But not enough to store the whole graph

[Diagram: the graph and PSW working memory reside on disk; the computational state (Computation 1 state, Computation 2 state, ...) resides in RAM]

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
– Store billions of random walk states in RAM

• Multiple Breadth-First Searches
– Analyze neighborhood sizes by starting hundreds of random BFSes (see the sketch after this list)

• Compute in parallel many different recommender algorithms (or with different parameterizations)
– See Mayank Mohta and Shu-Hao Yu's Master's project
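
As an illustration of the multi-BFS idea (a sketch under my own assumptions, not the thesis's implementation): with one 64-bit word per vertex, up to 64 searches can advance their frontiers simultaneously; the only algorithm state is the O(V) bit-vector array in RAM, while the edges could be streamed from disk with PSW.

import java.util.Arrays;

public class MultiBfsSketch {
    public static void main(String[] args) {
        // Toy directed graph as an edge list; with GraphChi the edges would be streamed from disk.
        int[][] edges = { {0, 1}, {1, 2}, {2, 3}, {3, 0}, {1, 3} };
        int numVertices = 4;
        int[] sources = { 0, 2 };                     // one search per bit, up to 64 per long

        long[] reached = new long[numVertices];       // bit k set: vertex reached by search k
        for (int k = 0; k < sources.length; k++) reached[sources[k]] |= (1L << k);

        boolean changed = true;
        while (changed) {                             // one pass over the edges advances all searches
            changed = false;
            for (int[] e : edges) {
                long propagate = reached[e[0]] & ~reached[e[1]];
                if (propagate != 0) { reached[e[1]] |= propagate; changed = true; }
            }
        }
        // reached[v] now tells, for every start vertex, whether v was reached from it.
        System.out.println(Arrays.toString(reached));
    }
}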

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin). UAI 2010.

Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors). VLDB 2012.

GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch). OSDI 2012.

DrunkardMob: Billions of Random Walks on Just a PC. ACM RecSys 2013.

Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch). SEA 2014.

GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin). Submitted (arXiv).

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin). ICML 2011.

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external-memory graph computation: GraphChi

• Extended PSW to design Partitioned Adjacency Lists, the basis of a scalable graph database: GraphChi-DB

• Proposed DrunkardMob for simulating billions of random walks in parallel

• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms: a new approach for external-memory graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                     GraphChi (40 Mac Minis)    PowerGraph (64 EC2 cc1.4xlarge)
Investment           $67,320                    –
Operating costs:
  Per node / hour    $0.03                      $1.30
  Cluster / hour     $1.19                      $52.00
  Daily              $28.56                     $1,248.00

It takes about 56 days to recoup the Mac Mini investment.

Assumptions: Mac Mini 85 W (typical servers 500-1000 W); most expensive US energy 35c / kWh. Equal-throughput configurations (based on OSDI'12).
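
A quick back-of-the-envelope check of the 56-day figure, reading the table's amounts as dollars and cents:

  daily operating-cost difference ≈ $1,248.00 - $28.56 ≈ $1,219 per day
  $67,320 / ($1,219 per day) ≈ 55 days, i.e. roughly eight weeks

which is consistent with the "about 56 days" stated on the slide.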

89

PSW for In-memory Computation

• External-memory setting
– Slow memory = hard disk / SSD
– Fast memory = RAM

• In-memory
– Slow = RAM
– Fast = CPU caches

[Diagram: the external-memory model: a CPU attached to a small (fast) memory of size M and a large (slow) memory of "unlimited" size, with block transfers of size B between them]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW slightly outperforms Ligra, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, when the preprocessing steps of both systems are included.

(Ligra: Julian Shun, Guy Blelloch, PPoPP 2013)

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
– But needs twice the amount of space

• Async / G-S helps with high-diameter graphs
• Some algorithms converge much better asynchronously
– Loopy BP: see Gonzalez et al. (2009)
– Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)

• Asynchronous is sometimes difficult to reason about and debug

• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in the Aggarwal-Vitter I/O model
– Worst case is only 2x the best case

• Intuition:

[Diagram: vertex range 1 ... |V| divided into interval(1), interval(2), ..., interval(P) with corresponding shard(1), shard(2), ..., shard(P); an example edge (v1, v2) is shown]

An edge whose endpoints lie in the same interval is loaded from disk only once per iteration; an edge spanning two intervals is loaded twice per iteration.
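
Putting that intuition into a rough formula (my paraphrase; see the thesis for the exact statement): each of the |E| edge values is read and written once if its endpoints share an interval, and read and written twice if it spans two intervals, so the block-transfer cost of one iteration lies roughly between

  2|E| / B   and   4|E| / B

plus on the order of P^2 non-sequential seeks for the sliding windows. This is why the worst case is only about 2x the best case.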

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise too many iterations would be required

• Per-iteration cost is not much affected

• Can we optimize partitioning?
– Could help thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort
– Likely too expensive to do on a single PC

94

Graph Compression Would it help

• Graph compression methods (e.g. Blelloch et al., the WebGraph framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
– But they require graph partitioning, which requires a lot of memory
– Compression of large graphs can take days (personal communication)

• Compression is problematic for evolving graphs and associated data

• GraphChi can be used to compress graphs
– Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• The 1990s and 2000s saw interest in object-oriented and graph databases
– GOOD, GraphDB, HyperGraphDB, ...
– Focus was on modeling; graph storage was built on top of a relational DB or key-value store

• RDF databases
– Most do not use graph storage but store triples as relations + use indexing

• Modern solutions have proposed graph-specific storage
– Neo4j: doubly linked list
– TurboGraph: adjacency list chopped into pages
– DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time: 9 hours; FB's: 12 hours
• GraphChi database: about 250 GB; FB: over 1.4 terabytes
– However, about 100 GB is explained by different variable-data (payload) sizes

• Facebook: MySQL via JDBC; GraphChi-DB: embedded
– But MySQL is native code, GraphChi-DB is Scala (JVM)

• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance / Size of DB

[Chart: LinkBench requests / sec as a function of database size (number of nodes: 1.0E+07, 1.0E+08, 2.0E+08, 1.0E+09); y-axis from 0 to 10,000 requests / sec]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory extension [SSDAlloc, NSDI'11]
– Problem: too many small objects; need millions / sec

2. Compress the graph structure to fit into RAM [WebGraph framework]
– Problem: associated values do not compress well and are mutated

3. Cluster the graph and handle each cluster separately in RAM
– Problem: expensive, and the number of inter-cluster edges is big

4. Caching of hot nodes
– Problem: unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks: parallel I/O

[Chart: seconds / iteration for Pagerank and Connected components with 1, 2, and 3 hard drives; y-axis from 0 to 3000 seconds]

Experiment on a 16-core AMD server (from year 2007).

Bottlenecks

[Chart: Connected Components on Mac Mini (SSD); runtime with 1, 2, and 4 threads, split into Disk IO, Graph construction, and Exec. updates; y-axis from 0 to 2500 seconds]

• The cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
– Graph construction requires a lot of random access in RAM; memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime (seconds) vs. number of threads (1, 2, 4), split into Loading and Computation. Matrix Factorization (ALS): y-axis 0-160 seconds. Connected Components: y-axis 0-1000 seconds.]

104

In-memory vs Disk

105

Experiment Query latency

See the thesis for an I/O cost analysis of in- and out-edge queries.

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB
– Sufficient to query for out-edges (see the sketch below)
– Parallelizes well: multi-out-edge query

• Can be used for statistical graph analysis
– Sample induced neighborhoods, induced FoF neighborhoods from the graph
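
A minimal sketch of why out-edge queries suffice (illustrative code with hypothetical names, not GraphChi-DB's actual API): query the out-edges of every vertex in S and keep only the edges whose target is also in S.

import java.util.*;

public class InducedSubgraphSketch {
    // 'outEdges' stands in for a (possibly parallel) multi-out-edge query against the database.
    static List<long[]> inducedSubgraph(Set<Long> s, Map<Long, List<Long>> outEdges) {
        List<long[]> result = new ArrayList<>();
        for (long v : s) {
            for (long target : outEdges.getOrDefault(v, Collections.emptyList())) {
                if (s.contains(target)) {
                    result.add(new long[] { v, target });   // both endpoints are in S
                }
            }
        }
        return result;
    }

    public static void main(String[] args) {
        Map<Long, List<Long>> outEdges = new HashMap<>();
        outEdges.put(1L, Arrays.asList(2L, 5L));
        outEdges.put(2L, Arrays.asList(3L));
        Set<Long> s = new HashSet<>(Arrays.asList(1L, 2L, 3L));
        for (long[] e : inducedSubgraph(s, outEdges)) {
            System.out.println(e[0] + " -> " + e[1]);       // prints 1 -> 2 and 2 -> 3
        }
    }
}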

Vertices / Nodes

• Vertices are partitioned similarly to edges
– Similar "data shards" for columns

• Lookup / update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
– A sparse structure could be supported

[Diagram: vertex ID range 1 ... Max-id divided into intervals of length a: 1 ... a, a+1 ... 2a, ...]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance the shards
– Interval length constant a

[Table: blocks of 256 consecutive original vertex IDs (0-255, 256-511, ...) are distributed across the internal ID ranges starting at 0, 1, 2, ...; a, a+1, a+2, ...; 2a, 2a+1, ...]

109

What if we have a Cluster

[Diagram: the graph and PSW working memory reside on disk; the computational state (Computation 1 state, Computation 2 state, ...) resides in RAM]

Trade latency for throughput

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. ... which is caused by the lack of good data available to academics: big graphs with metadata
– Industry co-operation? But there is a problem with reproducibility
– Also, it is hard to ask good questions about graphs (especially with just the structure)

3. Too much focus on performance; more important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java, 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems (this talk): "paging" from disk is costly.

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm – reverse thinking

foreach interval p:
    walkSnapshot = getWalksForInterval(p)
    foreach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        foreach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: only the current position of each walk needs to be stored.
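
To make the note concrete, here is an illustrative encoding of a walk's state (my own assumption for the sketch; the thesis's exact bit layout may differ): pack the walk's source index and its current vertex into one primitive long, so that billions of walk states fit in a few gigabytes of RAM and can be bucketed by the interval of their current vertex.

public final class WalkStateSketch {
    // Upper 32 bits: index of the walk's source; lower 32 bits: current vertex id.
    static long create(int sourceIndex, int currentVertex) {
        return ((long) sourceIndex << 32) | (currentVertex & 0xffffffffL);
    }
    static int sourceIndex(long walk)   { return (int) (walk >>> 32); }
    static int currentVertex(long walk) { return (int) walk; }

    static long hop(long walk, int nextVertex) {        // advance the walk by one hop
        return create(sourceIndex(walk), nextVertex);
    }

    public static void main(String[] args) {
        long w = create(42, 7);
        w = hop(w, 13);
        System.out.println(sourceIndex(w) + " @ " + currentVertex(w));  // 42 @ 13
        // 10^9 walks * 8 bytes = about 8 GB of RAM for all walk states.
    }
}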

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 74: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

75

EVALUATION HINDSIGHT

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 75: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

76

What is GraphChi Optimized for

bull Original target algorithm Belief Propagation on Probabilistic Graphical Models

1 Changing value of an edge (both in- and out)

2 Computation process whole or most of the graph on each iteration

3 Random access to all vertexrsquos edgesndash Vertex-centric vs edge centric

4 AsyncGauss-Seidel execution

u v

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 76: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

77

GraphChi Not Good For

bull Very large vertex statebull Traversals and two-hop dependencies

ndash Or dynamic scheduling (such as Splash BP)

bull High diameter graphs such as planar graphsndash Unless the computation itself has short-range

interactions

bull Very large number of iterationsndash Neural networksndash LDA with Collapsed Gibbs sampling

bull No support for implicit graph structure

+ Single PC performance is limited

78

GraphChi (Parallel Sliding Windows)

GraphChi-DB(Partitioned Adjacency Lists)

Online graph updates

Batchcomp

Evolving graph

Incremental comp

DrunkardMob Parallel Random walk simulation

GraphChi^2

Triangle CountingItem-Item Similarity

PageRankSALSAHITS

Graph ContractionMinimum Spanning Forest

k-Core

Graph Computation

Shortest Path

Friends-of-Friends

Induced Subgraphs

Neighborhood query

Edge and vertex properties

Link prediction

Weakly Connected ComponentsStrongly Connected Components

Community DetectionLabel Propagation

Multi-BFS

Graph sampling

Graph Queries

Matrix Factorization

Loopy Belief PropagationCo-EM

Graph traversal

Versatility of PSW and PAL

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

• Vertex IDs mapped to internal IDs to balance shards
  – Interval length constant a
[Table: internal IDs 0, 1, 2, …; a, a+1, a+2, …; 2a, 2a+1, …; original IDs 0, 1, 2, …, 255 and 256, 257, 258, …, 511]
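One plausible reading of the mapping figure (an assumption on my part, not something stated explicitly in the transcript): consecutive original IDs are scattered round-robin across the intervals of length a, and a vertex's value then sits at a fixed offset inside a dense per-interval file, which is what makes the lookup O(1). A small illustrative sketch:

    // Sketch only; the mapping and file layout below are assumed, not confirmed.
    object IdMappingSketch {
      val numIntervals = 256    // as drawn in the figure; illustrative
      val a = 1000000L          // interval length; illustrative

      // Original IDs 0,1,2,... are spread across intervals 0,1,2,... so shards stay balanced.
      def toInternal(originalId: Long): Long =
        (originalId % numIntervals) * a + originalId / numIntervals

      // Dense per-interval vertex file: O(1) lookup by direct byte offset.
      def vertexLocation(internalId: Long, bytesPerValue: Int): (Int, Long) =
        ((internalId / a).toInt, (internalId % a) * bytesPerValue)

      def main(args: Array[String]): Unit =
        // Matches the figure: originals 0, 1, 2 map to 0, a, 2a and 256, 257 map to 1, a+1
        println(Seq(0L, 1L, 2L, 256L, 257L).map(toInternal))
    }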

109

What if we have a Cluster

[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, …) in RAM]
Trade latency for throughput.

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. … which is caused by lack of good data available for the academics: big graphs with metadata
   – Industry co-operation? But problem with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just structure)
3. Too much focus on performance. More important to enable "extracting value".

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

  parfor walk in walks:
      for i = 1 to numsteps:
          vertex = walk.atVertex(); walk.takeStep(vertex.randomNeighbor())

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

• Distributed graph systems: each hop across a partition boundary is costly
• Disk-based "single-machine" graph systems (this talk): "paging" from disk is costly

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm: reverse thinking

  foreach interval p:
      walkSnapshot = getWalksForInterval(p)
      foreach vertex in interval(p):
          mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
          foreach walk in mywalks:
              walkManager.addHop(walk, vertex.randomNeighbor())

Note: only the current position of each walk needs to be stored.
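A compact, self-contained illustration of the same idea (plain in-memory Scala with toy data, not the DrunkardMob implementation itself): each walk is represented only by the vertex it currently sits on, walks are grouped by the interval of that vertex, and each interval's walks are advanced together so the graph is touched interval by interval rather than walk by walk.

    import scala.util.Random

    // Illustrative only: DrunkardMob-style bookkeeping with in-memory stand-ins.
    object DrunkardMobSketch {
      def main(args: Array[String]): Unit = {
        val rnd = new Random(42)
        val numVertices = 1000
        val intervalLen = 100                                 // vertices per interval
        // Toy graph: every vertex gets a few random out-neighbors.
        val outNeighbors = Array.tabulate(numVertices)(_ => Array.fill(4)(rnd.nextInt(numVertices)))
        // The only per-walk state: the vertex each walk currently sits on.
        val walkPosition = Array.fill(100000)(rnd.nextInt(numVertices))

        for (_ <- 1 to 5) {
          // "Reverse thinking": snapshot the walks grouped by the interval of their current vertex...
          val walksByInterval = walkPosition.indices.groupBy(w => walkPosition(w) / intervalLen)
          // ...then advance all walks of one interval together, so each interval's sub-graph
          // only needs to be visited once per step instead of once per walk.
          for ((_, walks) <- walksByInterval; w <- walks) {
            val nbrs = outNeighbors(walkPosition(w))
            walkPosition(w) = nbrs(rnd.nextInt(nbrs.length)) // one hop
          }
        }
        println(s"Walk 0 ended at vertex ${walkPosition(0)} after 5 steps")
      }
    }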


78

Versatility of PSW and PAL
[Diagram linking GraphChi (Parallel Sliding Windows) and GraphChi-DB (Partitioned Adjacency Lists) to: online graph updates, batch comp., evolving graphs, incremental comp., DrunkardMob (parallel random walk simulation), GraphChi^2, Triangle Counting / Item-Item Similarity, PageRank / SALSA / HITS, Graph Contraction / Minimum Spanning Forest, k-Core, Shortest Path, Friends-of-Friends, Induced Subgraphs, Neighborhood query, Edge and vertex properties, Link prediction, Weakly / Strongly Connected Components, Community Detection / Label Propagation, Multi-BFS, Graph sampling, Matrix Factorization, Loopy Belief Propagation / Co-EM, Graph traversal; grouped on the slide under Graph Computation and Graph Queries]

79

Future Research Directions

• Distributed setting
  1. Distributed PSW (one shard / node)
     – PSW is inherently sequential
     – Low bandwidth in the Cloud
  2. Co-operating GraphChi(-DB)'s connected with a Parameter Server
• New graph programming models and tools
  – Vertex-centric programming is sometimes too local; for example, two-hop interactions and many traversals are cumbersome
  – Abstractions for learning graph structure; implicit graphs
  – Hard to debug, especially async; better tools needed
• Graph-aware optimizations to GraphChi-DB
  – Buffer management
  – Smart caching
  – Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

• The I/O performance of PSW is only weakly affected by the amount of RAM
  – Good: works with very little memory
  – Bad: does not benefit from more memory
• Simple trick: cache some data
• Many graph algorithms have O(V) state
  – Update function accesses neighbor vertex state
  – Standard PSW: 'broadcast' vertex value via edges
  – Semi-external: store vertex values in memory (sketched below)
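A minimal sketch of the semi-external trick under assumed types (this is not GraphChi's API): the O(V) vertex-value array lives in RAM, edges are still streamed block by block from disk, and the update reads neighbor values directly from RAM instead of broadcasting them over edges.

    // Semi-external PageRank-style iteration: O(V) state in RAM, edges streamed.
    object SemiExternalSketch {
      final case class Edge(src: Int, dst: Int)

      def iteration(numVertices: Int,
                    edgeBlocks: Iterator[Seq[Edge]],   // stand-in for shard-by-shard streaming
                    rank: Array[Float],                // in-RAM vertex values, O(V)
                    outDegree: Array[Int]): Array[Float] = {
        val next = Array.fill(numVertices)(0.15f / numVertices)
        for (block <- edgeBlocks; e <- block) {
          // Neighbor value comes from RAM; nothing is written on the edges themselves.
          next(e.dst) += 0.85f * rank(e.src) / math.max(outDegree(e.src), 1)
        }
        next
      }

      def main(args: Array[String]): Unit = {
        val edges = Seq(Edge(0, 1), Edge(1, 2), Edge(2, 0))
        println(iteration(3, Iterator(edges), Array.fill(3)(1.0f / 3), Array(1, 1, 1)).toSeq)
      }
    }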

82

Using RAM efficiently

• Assume that there is enough RAM to store many O(V) algorithm states in memory
  – But not enough to store the whole graph
[Diagram: graph working memory (PSW) on disk; computational state (Computation 1 state, Computation 2 state, …) in RAM]

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5)
  – Store billions of random walk states in RAM
• Multiple Breadth-First Searches (see the sketch after this list)
  – Analyze neighborhood sizes by starting hundreds of random BFSes
• Compute in parallel many different recommender algorithms (or with different parameterizations)
  – See Mayank Mohta & Shu-Hao Yu's Master's project
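For instance, up to 64 BFS instances can share a single sweep over the edges when each source owns one bit of a per-vertex Long mask; the masks are exactly the kind of O(V) state that fits in RAM while the edges stay on disk. An illustrative in-memory sketch (my own, with assumed data shapes):

    // Illustrative multi-source BFS: bit i of reached(v) says whether v is
    // within `rounds` hops of sources(i). Up to 64 sources advance together.
    object MultiBfsSketch {
      def multiBfs(numVertices: Int, edges: Seq[(Int, Int)],
                   sources: Seq[Int], rounds: Int): Array[Long] = {
        require(sources.size <= 64, "one bit per BFS instance")
        val reached = Array.fill(numVertices)(0L)
        sources.zipWithIndex.foreach { case (s, i) => reached(s) |= (1L << i) }
        for (_ <- 1 to rounds) {
          val next = reached.clone()
          for ((src, dst) <- edges) next(dst) |= reached(src) // one sweep advances all BFSes
          Array.copy(next, 0, reached, 0, numVertices)
        }
        reached
      }

      def main(args: Array[String]): Unit = {
        val r = multiBfs(4, Seq((0, 1), (1, 2), (2, 3)), sources = Seq(0, 2), rounds = 2)
        // Neighborhood sizes per source, as in the slide's use case:
        println((0 until 2).map(i => r.count(m => (m & (1L << i)) != 0)))  // Vector(3, 2)
      }
    }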

84

CONCLUSION

85

Summary of Published Work
• GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin). UAI 2010.
• Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors). VLDB 2012.
• GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch). OSDI 2012.
• DrunkardMob: Billions of Random Walks on Just a PC. ACM RecSys 2013.
• Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch). SEA 2014.
• GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin). Submitted (arXiv).
• Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin). ICML 2011.
First-author papers in bold (on the slide).

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external memory graph computation: GraphChi
• Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database: GraphChi-DB
• Proposed DrunkardMob for simulating billions of random walks in parallel
• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms: a new approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                    GraphChi (40 Mac Minis)   PowerGraph (64 EC2 cc1.4xlarge)
Investments         67,320 $                  –
Operating costs
  Per node / hour   0.03 $                    1.30 $
  Cluster / hour    1.19 $                    52.00 $
  Daily             28.56 $                   1,248.00 $

It takes about 56 days to recoup Mac Mini investments

Assumptions: Mac Mini 85 W (typical servers 500–1000 W); most expensive US energy 35c / kWh.
Equal-throughput configurations (based on OSDI'12).
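As a rough sanity check of the "about 56 days" figure, using only the numbers above: daily savings ≈ 1,248.00 $ − 28.56 $ ≈ 1,219 $, so payback ≈ 67,320 $ / 1,219 $ per day ≈ 55 days.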

89

PSW for In-memory Computation

• External memory setting
  – Slow memory = hard disk / SSD
  – Fast memory = RAM
• In-memory
  – Slow = RAM
  – Fast = CPU caches
[Diagram: CPU; small (fast) memory of size M; block transfers of size B; large (slow) memory of 'unlimited' size]
Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including preprocessing steps of both systems.
(Ligra: Julian Shun, Guy Blelloch, PPoPP 2013.)

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space
• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP: see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)
• Asynchronous is sometimes difficult to reason about and debug
• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in Aggarwal–Vitter's I/O model
  – Worst case only 2x best case
• Intuition:
[Diagram: vertex range 1 … |V| split into interval(1), interval(2), …, interval(P), with shard(1), shard(2), …, shard(P); example vertices v1, v2]
  – An edge whose endpoints are in the same interval is loaded from disk only once per iteration
  – An edge spanning two intervals is loaded twice per iteration
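Spelling the intuition out as a back-of-the-envelope sketch (my own reading; the thesis gives the precise Aggarwal–Vitter analysis): if e_1 edges have both endpoints in one interval and e_2 edges span two intervals (e_1 + e_2 = |E|), an iteration reads about (e_1 + 2·e_2)/B blocks, which ranges from |E|/B when no edge spans intervals up to 2·|E|/B when every edge does; counting the corresponding write-backs roughly doubles both ends, so the worst case stays within about 2x of the best case, as stated above.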

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter: otherwise too many iterations would be required
• Per-iteration cost not much affected
• Can we optimize partitioning?
  – Could help thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression: Would it help?
• Graph compression methods (e.g. Blelloch et al., WebGraph framework) can be used to compress edges to 3–4 bits / edge (web), ~10 bits / edge (social)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)
• Compression is problematic for evolving graphs and associated data
• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 78: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

79

Future Research Directions

bull Distributed Setting1 Distributed PSW (one shard node)

1 PSW is inherently sequential2 Low bandwidth in the Cloud

2 Co-operating GraphChi(-DB)rsquos connected with a Parameter Server

bull New graph programming models and Toolsndash Vertex-centric programming sometimes too local for example

two-hop interactions and many traversals cumbersomendash Abstractions for learning graph structure Implicit graphsndash Hard to debug especially async Better tools needed

bull Graph-aware optimizations to GraphChi-DBndash Buffer managementndash Smart cachingndash Learning configuration

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 79: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

80

WHAT IF WE HAVE PLENTY OF MEMORY

Semi-External Memory Setting

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 80: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

81

Observations

bull The IO performance of PSW is only weakly affected by the amount of RAMndash Good works with very little memoryndash Bad Does not benefit from more memory

bull Simple trick cache some data

bull Many graph algorithms have O(V) statendash Update function accesses neighbor vertex

statebull Standard PSW lsquobroadcastrsquo vertex value via edgesbull Semi-external Store vertex values in memory

82

Using RAM efficiently

bull Assume that enough RAM to store many O(V) algorithm states in memoryndash But not enough to store the whole

graph

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

83

Parallel Computation Examples

bull DrunkardMob algorithm (Chapter 5)ndash Store billions of random walk states in RAM

bull Multiple Breadth-First-Searchesndash Analyze neighborhood sizes by starting

hundreds of random BFSes

bull Compute in parallel many different recommender algorithms (or with different parameterizations)ndash See Mayank Mohta Shu-Hao Yursquos Masterrsquos

project

84

CONCLUSION

85

Summary of Published WorkGraphLab Parallel Framework for Machine Learning (with J Gonzaled YLow D Bickson CGuestrin)

UAI 2010

Distributed GraphLab Framework for Machine Learning and Data Mining in the Cloud (-- same --)

VLDB 2012

GraphChi Large-scale Graph Computation on Just a PC (with CGuestrin G Blelloch)

OSDI 2012

DrunkardMob Billions of Random Walks on Just a PC ACM RecSys 2013

Beyond Synchronous New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun G Blelloch)

SEA 2014

GraphChi-DB Simple Design for a Scalable Graph Database ndash on Just a PC (with C Guestrin)

(submitted arxiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J Bradley D Bickson CGuestrin)

ICML 2011

First author papers in bold

THESIS

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

parfor walk in walks:
    for i = 1 to numsteps:
        vertex = walk.atVertex()
        walk.takeStep(vertex.randomNeighbor())
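A runnable version of the pseudo-code above, with the graph as an in-memory adjacency list; simulate_walks and num_steps are illustrative names, and the "parfor" is done sequentially here.

import random

def simulate_walks(adj, starts, num_steps):
    # adj[v]: list of neighbors of v; starts: starting vertex of each walk
    positions = list(starts)
    for w in range(len(positions)):
        for _ in range(num_steps):
            v = positions[w]
            if not adj[v]:
                break                      # dead end: stop this walk
            positions[w] = random.choice(adj[v])
    return positions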

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems: "paging" from disk is costly.

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm – reverse thinking:

ForEach interval p:
    walkSnapshot = getWalksForInterval(p)
    ForEach vertex in interval(p):
        mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
        ForEach walk in mywalks:
            walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk.
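A sketch of one DrunkardMob-style iteration under the "reverse thinking" above, assuming the graph is processed interval by interval (as PSW loads them) and only the current position of each walk is kept; the names are illustrative, not the GraphChi API.

import random
from collections import defaultdict

def drunkardmob_iteration(adj, walk_positions, interval_of, num_intervals):
    # group walks by the interval that contains their current vertex
    buckets = defaultdict(list)
    for walk_id, v in enumerate(walk_positions):
        buckets[interval_of(v)].append(walk_id)
    # process one interval at a time (the sub-graph PSW has loaded)
    for p in range(num_intervals):
        for walk_id in buckets[p]:
            v = walk_positions[walk_id]
            if adj[v]:
                walk_positions[walk_id] = random.choice(adj[v])   # "addHop"
    return walk_positions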

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • "Gauss-Seidel" Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW & External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref: O'Neil, Cheng et al. (1996)
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact: "Big Data" Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 81: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

82

Using RAM efficiently

• Assume that there is enough RAM to store many O(V) algorithm states in memory – but not enough to store the whole graph

[Diagram: graph working memory (PSW) on disk; Computation 1 state, Computation 2 state, … held as computational state in RAM]

83

Parallel Computation Examples

• DrunkardMob algorithm (Chapter 5) – store billions of random walk states in RAM

• Multiple Breadth-First-Searches – analyze neighborhood sizes by starting hundreds of random BFSes (a sketch follows below)

• Compute in parallel many different recommender algorithms (or with different parameterizations) – see Mayank Mohta & Shu-Hao Yu's Master's project
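A sketch, under my own assumptions, of the "many O(V) states in RAM, graph streamed from disk" pattern for the BFS example: hundreds of distance arrays are advanced one level per sweep over the edges (edges treated as undirected, distances initialized to infinity except 0 at each source).

def multi_bfs_level(edges, dists, level):
    # dists: one distance array per BFS source (the O(V) states kept in RAM);
    # edges: iterable of (u, v) pairs, e.g. streamed from disk once per level
    changed = False
    for (u, v) in edges:
        for d in dists:
            if d[u] == level and d[v] > level + 1:
                d[v] = level + 1
                changed = True
            if d[v] == level and d[u] > level + 1:
                d[u] = level + 1
                changed = True
    return changed   # False once every BFS frontier has stopped growing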

84

CONCLUSION

85

Summary of Published Work

GraphLab: Parallel Framework for Machine Learning (with J. Gonzalez, Y. Low, D. Bickson, C. Guestrin) – UAI 2010

Distributed GraphLab: Framework for Machine Learning and Data Mining in the Cloud (same authors) – VLDB 2012

GraphChi: Large-scale Graph Computation on Just a PC (with C. Guestrin, G. Blelloch) – OSDI 2012

DrunkardMob: Billions of Random Walks on Just a PC – ACM RecSys 2013

Beyond Synchronous: New Techniques for External Memory Graph Connectivity and Minimum Spanning Forest (with Julian Shun, G. Blelloch) – SEA 2014

GraphChi-DB: Simple Design for a Scalable Graph Database – on Just a PC (with C. Guestrin) – (submitted, arXiv)

Parallel Coordinate Descent for L1-regularized Loss Minimization (Shotgun) (with J. Bradley, D. Bickson, C. Guestrin) – ICML 2011

First-author papers in bold.

THESIS

86

Summary of Main Contributions

• Proposed Parallel Sliding Windows, a new algorithm for external memory graph computation: GraphChi

• Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database: GraphChi-DB

• Proposed DrunkardMob for simulating billions of random walks in parallel

• Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms: new approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

                      GraphChi (40 Mac Minis)    PowerGraph (64 EC2 cc1.4xlarge)
Investments           67,320 $                   –
Operating costs:
  Per node / hour     0.03 $                     1.30 $
  Cluster / hour      1.19 $                     52.00 $
  Daily               28.56 $                    1,248.00 $

It takes about 56 days to recoup the Mac Mini investments.

Assumptions: Mac Mini 85 W (typical servers 500–1000 W); most expensive US energy 35 c / kWh.

Equal-throughput configurations (based on OSDI'12).
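A quick sanity check of the payback period implied by the operating costs in the table above:

investment = 67320.00            # Mac Mini cluster ($)
daily_graphchi = 28.56           # $ / day
daily_powergraph = 1248.00       # $ / day
payback_days = investment / (daily_powergraph - daily_graphchi)
print(round(payback_days, 1))    # ~55.2 days, consistent with "about 56 days"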

89

PSW for In-memory Computation

• External memory setting – slow memory = hard disk / SSD; fast memory = RAM

• In-memory – slow = RAM; fast = CPU caches

[Diagram: external-memory model – CPU with small (fast) memory of size M, large (slow) memory of 'unlimited' size, data moved in block transfers of size B]

Does PSW help in the in-memory setting?

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra, the fastest in-memory graph computation system, on Pagerank with the Twitter graph, including preprocessing steps of both systems.

(Ligra: Julian Shun, Guy Blelloch, PPoPP 2013)

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel – but needs twice the amount of space

• Async / Gauss-Seidel helps with high-diameter graphs

• Some algorithms converge much better asynchronously – Loopy BP, see Gonzalez et al. (2009) – also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)

• Asynchronous is sometimes difficult to reason about and debug

• Asynchronous can be used to implement BSP

IO Complexity

• See the paper for theoretical analysis in Aggarwal–Vitter's I/O model – worst-case only 2x best-case

• Intuition:

[Diagram: vertex range 1 … |V| split into interval(1), interval(2), …, interval(P) with corresponding shard(1), shard(2), …, shard(P); example vertices v1, v2]

An edge with both endpoints in the same interval is loaded from disk only once per iteration.

An edge spanning two intervals is loaded twice per iteration.
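A small sketch that just restates the intuition above as a count, for a given assignment of vertices to intervals (interval_of is an assumed helper, not thesis code):

def edge_loads_per_iteration(edges, interval_of):
    # interval_of(v): index of the interval containing vertex v
    total = 0
    for (src, dst) in edges:
        # same interval -> loaded once; spanning two intervals -> loaded twice
        total += 1 if interval_of(src) == interval_of(dst) else 2
    return total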

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter – otherwise they would require too many iterations

• Per-iteration cost not much affected

• Can we optimize partitioning? – could help thanks to Gauss-Seidel (faster convergence inside "groups"), topological sort – likely too expensive to do on a single PC

94

Graph Compression: Would it help?

• Graph compression methods (e.g. Blelloch et al., WebGraph Framework) can be used to compress edges to 3–4 bits / edge (web), ~10 bits / edge (social) – but they require graph partitioning, which requires a lot of memory – compression of large graphs can take days (personal communication)

• Compression problematic for evolving graphs and associated data

• GraphChi can be used to compress graphs – layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• 1990s, 2000s saw interest in object-oriented and graph databases – GOOD, GraphDB, HyperGraphDB… – focus was on modeling; graph storage on top of a relational DB or key-value store

• RDF databases – most do not use graph storage but store triples as relations + use indexing

• Modern solutions have proposed graph-specific storage – Neo4j: doubly linked list – TurboGraph: adjacency list chopped into pages – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont.)

• GraphChi load time 9 hours; FB's 12 hours

• GraphChi database about 250 GB; FB > 1.4 terabytes – however, about 100 GB is explained by different variable data (payload) size

• Facebook: MySQL via JDBC, GraphChi embedded – but MySQL is native code, GraphChi-DB is Scala (JVM)

• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

Page 85: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

86

Summary of Main Contributions

bull Proposed Parallel Sliding Windows a new algorithm for external memory graph computation GraphChi

bull Extended PSW to design Partitioned Adjacency Lists to build a scalable graph database GraphChi-DB

bull Proposed DrunkardMob for simulating billions of random walks in parallel

bull Analyzed PSW and its Gauss-Seidel properties for fundamental graph algorithms New approach for EM graph algorithms research

Thank You

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

• Bulk-Synchronous is embarrassingly parallel
  – But needs twice the amount of space
• Async / Gauss-Seidel helps with high-diameter graphs
• Some algorithms converge much better asynchronously
  – Loopy BP, see Gonzalez et al. (2009)
  – Also Bertsekas & Tsitsiklis, Parallel and Distributed Optimization (1989)
• Asynchronous is sometimes difficult to reason about and debug
• Asynchronous can be used to implement BSP
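To make the space and convergence points concrete, here is a minimal PageRank-style sketch in plain Python (illustrative only, not GraphChi code): the synchronous (Jacobi) version writes into a second array, while the asynchronous (Gauss-Seidel) version updates ranks in place and immediately uses the newest values.

    def pagerank_jacobi(in_nbrs, out_deg, d=0.85, iters=20):
        """Bulk-synchronous: read old ranks, write a second array (2x space)."""
        n = len(in_nbrs)
        rank = [1.0 / n] * n
        for _ in range(iters):
            rank = [(1.0 - d) / n + d * sum(rank[u] / out_deg[u] for u in in_nbrs[v])
                    for v in range(n)]
        return rank

    def pagerank_gauss_seidel(in_nbrs, out_deg, d=0.85, iters=20):
        """Asynchronous / Gauss-Seidel: update in place, newest values used right away."""
        n = len(in_nbrs)
        rank = [1.0 / n] * n
        for _ in range(iters):
            for v in range(n):
                rank[v] = (1.0 - d) / n + d * sum(rank[u] / out_deg[u] for u in in_nbrs[v])
        return rank

    # Tiny example: directed 3-cycle 0 -> 1 -> 2 -> 0 (every vertex has out-degree 1).
    in_nbrs = [[2], [0], [1]]
    out_deg = [1, 1, 1]
    print(pagerank_jacobi(in_nbrs, out_deg))
    print(pagerank_gauss_seidel(in_nbrs, out_deg))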

IO Complexity

• See the paper for theoretical analysis in the Aggarwal-Vitter I/O model
  – Worst case is only 2x the best case
• Intuition:

[Figure: vertex range 1 … |V| split into interval(1), interval(2), …, interval(P), with corresponding shard(1), shard(2), …, shard(P)]

An edge with both endpoints in the same interval is loaded from disk only once per iteration; an edge spanning two intervals is loaded twice per iteration.
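A tiny sketch of this counting argument (plain Python; the interval assignment is a made-up example, not GraphChi's): an edge is read when its source's interval is processed and again when its destination's interval is processed, so it costs one load if the endpoints share an interval and two loads otherwise.

    def loads_per_iteration(edges, interval_of):
        """Total disk loads of edges in one PSW iteration under the once/twice rule."""
        return sum(1 if interval_of(u) == interval_of(v) else 2 for u, v in edges)

    # 4 vertices in 2 intervals of length 2: vertices 0,1 -> interval 0; 2,3 -> interval 1.
    edges = [(0, 1), (0, 2), (2, 3), (1, 3)]
    print(loads_per_iteration(edges, lambda v: v // 2))   # 1 + 2 + 1 + 2 = 6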

93

Impact of Graph Structure

• Algorithms with long-range information propagation need a relatively small diameter; otherwise they would require too many iterations
• Per-iteration cost is not much affected
• Can we optimize partitioning?
  – Could help, thanks to Gauss-Seidel (faster convergence inside "groups"); topological sort
  – Likely too expensive to do on a single PC

94

Graph Compression Would it help

• Graph compression methods (e.g. Blelloch et al., WebGraph framework) can be used to compress edges to 3-4 bits / edge (web), ~10 bits / edge (social)
  – But they require graph partitioning, which requires a lot of memory
  – Compression of large graphs can take days (personal communication)
• Compression is problematic for evolving graphs and associated data
• GraphChi can be used to compress graphs
  – Layered label propagation (Boldi et al. 2011)

95

Previous research on (single computer) Graph Databases

• 1990s and 2000s saw interest in object-oriented and graph databases
  – GOOD, GraphDB, HyperGraphDB…
  – Focus was on modeling, graph storage on top of a relational DB or key-value store
• RDF databases
  – Most do not use graph storage, but store triples as relations + use indexing
• Modern solutions have proposed graph-specific storage
  – Neo4j: doubly linked list
  – TurboGraph: adjacency list chopped into pages
  – DEX: compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

• GraphChi load time: 9 hours; FB's: 12 hours
• GraphChi database: about 250 GB; FB's: > 1.4 terabytes
  – However, about 100 GB is explained by different variable-data (payload) size
• Facebook: MySQL via JDBC; GraphChi: embedded
  – But MySQL is native code, GraphChi-DB is Scala (JVM)
• Important: CPU-bound bottleneck in sorting the results for high-degree vertices

LinkBench: GraphChi-DB performance / Size of DB

[Chart: requests / sec (0 - 10,000) vs. number of nodes (1.00E+07, 1.00E+08, 2.00E+08, 1.00E+09)]

Why not even faster when everything is in RAM? Computationally bound requests.

Possible Solutions

1. Use SSD as a memory extension [SSDAlloc, NSDI'11]
   → Too many small objects; need millions / sec
2. Compress the graph structure to fit into RAM [WebGraph framework]
   → Associated values do not compress well, and are mutated
3. Cluster the graph and handle each cluster separately in RAM
   → Expensive: the number of inter-cluster edges is big
4. Caching of hot nodes
   → Unpredictable performance

Number of Shards

• If P is in the "dozens", there is not much effect on performance

Multiple hard-drives (RAIDish)

• GraphChi supports striping shards to multiple disks: parallel I/O

[Chart: seconds / iteration (0 - 3,000) for Pagerank and Connected components with 1, 2, and 3 hard drives]

Experiment on a 16-core AMD server (from year 2007).

Bottlenecks

[Chart: Connected Components on Mac Mini (SSD); runtime (0 - 2,500 s) split into disk I/O, graph construction, and executing updates, for 1, 2, and 4 threads]

• Cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM; memory bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores, SSD.

• Computationally intensive applications benefit substantially from parallel execution
• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime in seconds, split into loading and computation, vs. number of threads (1, 2, 4), for Matrix Factorization (ALS, 0 - 160 s) and Connected Components (0 - 1,000 s)]

104

In-memory vs Disk

105

Experiment Query latency

See thesis for I/O cost analysis of in/out queries.

106

Example Induced Subgraph Queries

• The induced subgraph for a vertex set S contains all edges in the graph that have both endpoints in S
• Very fast in GraphChi-DB
  – Sufficient to query for out-edges (see the sketch below)
  – Parallelizes well: multi-out-edge-query
• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from the graph
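A minimal sketch of why out-edge queries suffice (plain Python; out_edges is a hypothetical stand-in for the database's out-edge query, not the GraphChi-DB API):

    def induced_subgraph(S, out_edges):
        """All edges with both endpoints in S.

        Every edge of the induced subgraph has its source in S, so querying the
        out-edges of each v in S finds all of them. The per-vertex queries are
        independent, so the loop parallelizes naturally (multi-out-edge-query).
        """
        S = set(S)
        return [(u, w) for u in S for w in out_edges(u) if w in S]

    # Tiny example with an in-memory adjacency dict standing in for the database:
    adj = {1: [2, 3], 2: [3], 3: [4], 4: [1]}
    print(induced_subgraph({1, 2, 3}, lambda v: adj.get(v, [])))
    # e.g. [(1, 2), (1, 3), (2, 3)]  (order depends on set iteration)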

Vertices / Nodes

• Vertices are partitioned similarly as edges
  – Similar "data shards" for columns
• Lookup/update of vertex data is O(1)
• No merge tree here: vertex files are "dense"
  – Sparse structure could be supported

[Figure: vertex ID range 1 … a, a+1 … 2a, …, up to Max-id, split into intervals of length a]

ID-mapping

• Vertex IDs are mapped to internal IDs to balance shards
  – Interval length constant a

[Figure: original IDs 0, 1, 2, …, 255 map to internal IDs 0, a, 2a, …; original IDs 256, 257, 258, …, 511 map to internal IDs 1, a+1, 2a+1, …; and so on, so consecutive original IDs are scattered across the intervals]
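A minimal sketch of such a mapping (the stride of 256 is read off the figure above and is an assumption here; the constant used by GraphChi-DB may differ):

    STRIDE = 256  # number of interleaved groups implied by the figure (assumption)

    def to_internal(original_id, a):
        """Scatter consecutive original IDs round-robin across intervals of length a."""
        return (original_id % STRIDE) * a + (original_id // STRIDE)

    def to_original(internal_id, a):
        """Inverse of to_internal."""
        return (internal_id % a) * STRIDE + (internal_id // a)

    a = 1000  # example interval length; must satisfy a >= ceil(max_id / STRIDE)
    assert to_internal(1, a) == a and to_internal(257, a) == a + 1
    assert all(to_original(to_internal(i, a), a) == i for i in range(STRIDE * a))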

109

What if we have a Cluster

[Figure: the graph and PSW working memory live on disk; RAM holds the computational state of several concurrent computations (Computation 1 state, Computation 2 state, …)]

Trade latency for throughput.

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. … which is caused by lack of good data available to academics: big graphs with metadata
   – Industry co-operation? But problem with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just the structure)
3. Too much focus on performance. More important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

  parfor walk in walks:
      for i = 1 to numsteps:
          vertex = walk.atVertex()
          walk.takeStep(vertex.randomNeighbor())
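A direct Python rendering of that loop (illustrative only; adj and the walk bookkeeping are made-up stand-ins): note that every step reads an arbitrary vertex's neighbor list, which is cheap in RAM but turns into random disk access once the graph no longer fits in memory.

    import random

    def simulate_walks(adj, sources, numsteps):
        """Advance each walk independently, one step at a time."""
        positions = list(sources)
        for w, v in enumerate(positions):
            for _ in range(numsteps):
                neighbors = adj[v]          # random access into the graph
                if not neighbors:
                    break
                v = random.choice(neighbors)
            positions[w] = v
        return positions

    adj = {0: [1, 2], 1: [2], 2: [0, 3], 3: [0]}
    print(simulate_walks(adj, sources=[0, 1, 2], numsteps=5))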

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java, 2009

Distributed graph systems: each hop across a partition boundary is costly.

Disk-based "single-machine" graph systems: "paging" from disk is costly. (This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
  – Reverse thinking:

  ForEach interval p:
      walkSnapshot = getWalksForInterval(p)
      ForEach vertex in interval(p):
          mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
          ForEach walk in mywalks:
              walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk.
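A runnable sketch of the same reversal in plain Python (illustrative only, not the GraphChi/DrunkardMob implementation; the interval assignment and walk bookkeeping are simplified assumptions): walks are grouped by the interval holding their current vertex, so each interval's sub-graph needs to be loaded only once per sweep.

    import random
    from collections import defaultdict

    def drunkardmob_sweeps(adj, start_vertices, numsteps, num_intervals):
        """Advance every walk one hop per sweep, processing walks interval by interval."""
        interval_of = lambda v: v % num_intervals       # simplified interval assignment
        position = {w: v for w, v in enumerate(start_vertices)}   # walk id -> vertex
        for _ in range(numsteps):
            walks_in = defaultdict(list)                # interval -> walks currently there
            for w, v in position.items():
                walks_in[interval_of(v)].append(w)
            for p in range(num_intervals):              # "load" interval p only once
                for w in walks_in[p]:
                    neighbors = adj[position[w]]
                    if neighbors:
                        position[w] = random.choice(neighbors)    # addHop
        return position

    adj = {0: [1, 2], 1: [2], 2: [0, 3], 3: [0]}
    print(drunkardmob_sweeps(adj, start_vertices=[0, 0, 1, 3], numsteps=10, num_intervals=2))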

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 86: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

87

ADDITIONAL SLIDES

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 87: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

88

Economics

GraphChi (40 Mac Minis) PowerGraph (64 EC2 cc14xlarge)

Investments 67320 $ -

Operating costs

Per node hour 003 $ 130 $

Cluster hour 119 $ 5200 $

Daily 2856 $ 124800 $

It takes about 56 days to recoup Mac Mini investments

Assumptions- Mac Mini 85W (typical servers 500-1000W)- Most expensive US energy 35c KwH

Equal throughput configurations (based on OSDIrsquo12)

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 88: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

89

PSW for In-memory Computation

bull External memory settingndash Slow memory =

hard disk SSDndash Fast memory =

RAM

bull In-memoryndash Slow = RAMndash Fast = CPU

caches

Small (fast) Memorysize M

Large (slow) Memorysize lsquounlimitedrsquo

B

CPU

block transfers

Does PSW help in the in-memory setting

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 89: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

90

PSW for in-memory

Appendix

PSW outperforms (slightly) Ligra the fastest in-memory graph computation system on Pagerank with the Twitter graph including preprocessing steps of both systems

Julian Shun Guy Blelloch PPoPP 2013

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 90: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

91

Remarks (sync vs async)

bull Bulk-Synchronous is embarrassingly parallelndash But needs twice the amount of space

bull AsyncG-S helps with high diameter graphsbull Some algorithms converge much better

asynchronouslyndash Loopy BP see Gonzalez et al (2009)ndash Also Bertsekas amp Tsitsiklis Parallel and Distributed

Optimization (1989)

bull Asynchronous sometimes difficult to reason about and debug

bull Asynchronous can be used to implement BSP

IO Complexity

bull See the paper for theoretical analysis in the Aggarwal-Vitterrsquos IO modelndashWorst-case only 2x best-case

bull Intuition

shard(1)

interval(1) interval(2) interval(P)

shard(2) shard(P)

1 |V|v1 v2

Inter-interval edge is loaded from disk only once iteration

Edge spanning intervals is loaded twice iteration

93

Impact of Graph Structure

bull Algos with long range information propagation need relatively small diameter would require too many iterations

bull Per iteration cost not much affected

bull Can we optimize partitioningndash Could help thanks to Gauss-Seidel (faster

convergence inside ldquogroupsrdquo) topological sortndash Likely too expensive to do on single PC

94

Graph Compression Would it help

bull Graph Compression methods (eg Blelloch et al WebGraph Framework) can be used to compress edges to 3-4 bits edge (web) ~ 10 bits edge (social)ndash But require graph partitioning requires a lot of

memoryndash Compression of large graphs can take days (personal

communication)

bull Compression problematic for evolving graphs and associated data

bull GraphChi can be used to compress graphsndash Layered label propagation (Boldi et al 2011)

95

Previous research on (single computer) Graph Databases

bull 1990s 2000s saw interest in object-oriented and graph databasesndash GOOD GraphDB HyperGraphDBhellipndash Focus was on modeling graph storage on top of

relational DB or key-value store

bull RDF databasesndash Most do not use graph storage but store triples as

relations + use indexing

bull Modern solutions have proposed graph-specific storagendash Neo4j doubly linked listndash TurboGraph adjacency list chopped into pagesndash DEX compressed bitmaps (details not clear)

96

LinkBench

Comparison to FB (cont)

bull GraphChi load time 9 hours FBrsquos 12 hoursbull GraphChi database about 250 GB FB gt 14

terabytesndash However about 100 GB explained by different

variable data (payload) size

bull FacebookMySQL via JDBC GraphChi embeddedndash But MySQL native code GraphChi-DB Scala (JVM)

bull Important CPU bound bottleneck in sorting the results for high-degree vertices

LinkBench GraphChi-DB performance Size of DB

100E+07 100E+08 200E+08 100E+090

100020003000400050006000700080009000

10000

Requests sec

Number of nodes

Why not even faster when everything in RAM computationally bound requests

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

[Chart: seconds / iteration (0–3000) for Pagerank and Connected Components with 1, 2, and 3 hard drives]
Experiment on a 16-core AMD server (from year 2007).

Bottlenecks

[Chart: runtime (0–2500 s) split into Disk IO, Graph construction, and Exec. updates for 1, 2, and 4 threads – Connected Components on Mac Mini, SSD]
• Cost of constructing the sub-graph in memory is almost as large as the I/O cost on an SSD
  – Graph construction requires a lot of random access in RAM; memory bandwidth becomes a bottleneck

Bottlenecks: Multicore

Experiment on MacBook Pro with 4 cores / SSD

• Computationally intensive applications benefit substantially from parallel execution

• GraphChi saturates SSD I/O with 2 threads

[Charts: runtime in seconds, split into Loading and Computation, vs. number of threads (1, 2, 4) – Matrix Factorization (ALS), y-axis 0–160 s, and Connected Components, y-axis 0–1000 s]

104

In-memory vs Disk

105

Experiment: Query latency

See thesis for I/O cost analysis of in/out queries.

106

Example: Induced Subgraph Queries

• Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

• Very fast in GraphChi-DB
  – Sufficient to query for out-edges (see the sketch below)
  – Parallelizes well: multi-out-edge query

• Can be used for statistical graph analysis
  – Sample induced neighborhoods, induced FoF neighborhoods from the graph
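
A minimal sketch of why out-edge queries suffice (the GraphDb trait and outEdges call are hypothetical stand-ins, not the actual GraphChi-DB API): every edge (u, v) of the induced subgraph appears in the out-edge list of some u in S, so one out-edge query per vertex of S plus a membership filter is enough.

    trait GraphDb { def outEdges(src: Long): Seq[Long] }

    object InducedSubgraph {
      // One out-edge query per vertex in S; the queries are independent,
      // which is why a multi-out-edge query parallelizes well.
      def query(db: GraphDb, s: Set[Long]): Seq[(Long, Long)] =
        for {
          src <- s.toSeq
          dst <- db.outEdges(src)
          if s.contains(dst) // keep only edges whose other endpoint is also in S
        } yield (src, dst)
    }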

Vertices / Nodes
• Vertices are partitioned similarly as edges
  – Similar "data shards" for columns
• Lookup/update of vertex data is O(1) (see the sketch below)
• No merge tree here: vertex files are "dense"
  – Sparse structure could be supported
[Diagram: vertex ID range split into intervals 1 … a, a+1 … 2a, …, up to Max-id]
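
A rough illustration of why dense, fixed-width vertex files give O(1) lookup and update (a sketch under an assumed record layout, not GraphChi-DB's actual vertex-file code): the record offset is simply the internal ID times the record size.

    import java.io.RandomAccessFile

    class DenseVertexFile(path: String, recordBytes: Int) {
      private val file = new RandomAccessFile(path, "rw")

      def read(internalId: Long): Array[Byte] = {
        val buf = new Array[Byte](recordBytes)
        file.seek(internalId * recordBytes) // O(1): no index or merge tree needed
        file.readFully(buf)
        buf
      }

      def write(internalId: Long, value: Array[Byte]): Unit = {
        file.seek(internalId * recordBytes)
        file.write(value, 0, recordBytes)
      }
    }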

ID-mapping

• Vertex IDs are mapped to internal IDs to balance shards (one possible scheme is sketched below)
  – Interval length constant a
[Diagram: original IDs 0, 1, 2, …, 255, then 256, 257, 258, …, 511, … are spread across internal ID intervals 0, 1, 2, …; a, a+1, a+2, …; 2a, 2a+1, …]
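
The slide does not spell out the exact mapping, so the following is only a hedged sketch of one scheme with the stated property: consecutive original IDs are striped round-robin across intervals of constant length a, so every interval (and hence every shard) receives roughly the same number of vertices. GraphChi-DB's real constants and layout may differ.

    class IdMapper(numIntervals: Int, a: Long) {
      def toInternal(originalId: Long): Long = {
        val interval = originalId % numIntervals // spread consecutive IDs over the intervals
        val slot     = originalId / numIntervals // position inside that interval
        interval * a + slot
      }
      def toOriginal(internalId: Long): Long = {
        val interval = internalId / a
        val slot     = internalId % a
        slot * numIntervals + interval
      }
    }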

109

What if we have a Cluster

[Diagram: the graph and PSW working memory on disk; the computational state of several computations (Computation 1 state, Computation 2 state, …) in RAM]
Trade latency for throughput.

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications
2. … which is caused by a lack of good data available to academics: big graphs with metadata
   – Industry co-operation? But problems with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just structure)
3. Too much focus on performance. More important to enable "extracting value"

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

  parfor walk in walks:
      for i = 1 to numsteps:
          vertex = walk.atVertex()
          walk.takeStep(vertex.randomNeighbor())
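
A runnable version of the loop above on a toy in-memory graph (illustrative sketch only; the parallel parfor is dropped for brevity):

    import scala.util.Random

    object InMemoryWalks {
      def main(args: Array[String]): Unit = {
        // Toy graph as adjacency lists: vertex -> out-neighbors
        val graph = Array(Array(1, 2), Array(2), Array(0, 1))
        val rnd = new Random(42)
        val numSteps = 5
        // Six walks; only the current position of each walk is kept
        val walks = Array.tabulate(6)(w => w % graph.length)

        for (w <- walks.indices; _ <- 1 to numSteps) {
          val vertex = walks(w)                     // walk.atVertex()
          val nbrs = graph(vertex)
          walks(w) = nbrs(rnd.nextInt(nbrs.length)) // walk.takeStep(vertex.randomNeighbor())
        }
        println(walks.mkString("final positions: ", ", ", ""))
      }
    }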

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

Twitter network visualization by Akshay Java 2009

Distributed graph systems:
  – Each hop across a partition boundary is costly
Disk-based "single-machine" graph systems:
  – "Paging" from disk is costly
(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm
  – Reverse thinking:

  ForEach interval p:
      walkSnapshot = getWalksForInterval(p)
      ForEach vertex in interval(p):
          mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
          ForEach walk in mywalks:
              walkManager.addHop(walk, vertex.randomNeighbor())

Note: need to store only the current position of each walk.
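
A self-contained sketch of the same idea with an in-memory stand-in for the graph: walks are bucketed by the interval of their current vertex, and one pass over an interval advances every walk sitting in it by one hop. The names (buckets, addHop, intervalLen) are illustrative, not the actual DrunkardMob implementation.

    import scala.util.Random
    import scala.collection.mutable.ArrayBuffer

    object DrunkardMobSketch {
      def main(args: Array[String]): Unit = {
        val graph = Array(Array(1, 2), Array(2, 3), Array(3), Array(0)) // toy adjacency lists
        val intervalLen = 2                          // intervals: vertices [0,1] and [2,3]
        val numIntervals = graph.length / intervalLen
        val rnd = new Random(7)

        // "WalkManager": walks bucketed by the interval of their current vertex
        val buckets = Array.fill(numIntervals)(ArrayBuffer[Int]())
        def addHop(v: Int): Unit = buckets(v / intervalLen) += v

        (0 until 6).foreach(w => addHop(w % graph.length)) // start 6 walks

        for (_ <- 1 to 3; p <- 0 until numIntervals) {     // sweep the intervals a few times
          val walkSnapshot = buckets(p).toArray            // getWalksForInterval(p)
          buckets(p).clear()
          for (vertex <- walkSnapshot)                     // every walk currently in interval p
            addHop(graph(vertex)(rnd.nextInt(graph(vertex).length))) // one random hop
        }
        println(buckets.map(_.mkString("[", ",", "]")).mkString(" "))
      }
    }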


500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 98: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

Possible Solutions

1 Use SSD as a memory-extension

[SSDAlloc NSDIrsquo11]

2 Compress the graph structure to fit into RAM[ WebGraph framework]

3 Cluster the graph and handle each cluster separately in RAM

4 Caching of hot nodes

Too many small objects need millions sec

Associated values do not compress well and are mutated

Expensive The number of inter-cluster edges is big

Unpredictable performance

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 99: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

Number of Shards

bull If P is in the ldquodozensrdquo there is not much effect on performance

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 100: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

Multiple hard-drives (RAIDish)

bull GraphChi supports striping shards to multiple disks Parallel IO

Pagerank Conn components0

500

1000

1500

2000

2500

3000

1 hard drive 2 hard drives 3 hard drives

Seco

nds

iter

ation

Experiment on a 16-core AMD server (from year 2007)

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 101: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

Bottlenecks

1 thread 2 threads 4 threads0

500

1000

1500

2000

2500

Disk IO Graph constructionExec updates

Connected Components on Mac Mini SSD

bull Cost of constructing the sub-graph in memory is almost as large as the IO cost on an SSDndash Graph construction requires a lot of random access in RAM memory

bandwidth becomes a bottleneck

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 102: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

Bottlenecks Multicore

Experiment on MacBook Pro with 4 cores SSD

bull Computationally intensive applications benefit substantially from parallel execution

bull GraphChi saturates SSD IO with 2 threads

1 2 40

20406080

100120140160

Matrix Factorization (ALS)

Loading Computation

Number of threads

Runti

me

(sec

onds

)1 2 4

0100200300400500600700800900

1000

Connected Components

Loading Computation

Number of threads

Runti

me

(sec

onds

)

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 103: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

104

In-memory vs Disk

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 104: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

105

Experiment Query latency

See thesis for IO cost analysis of inout queries

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster

Graph working memory (PSW) disk

computational state

Computation 1 stateComputation 2 state

RAM

Trade latency for throughput

110

Graph Computation Research Challenges

1 Lack of truly challenging (benchmark) applications

2 hellip which is caused by lack of good data available for the academics big graphs with metadatandash Industry co-operation But problem with

reproducibilityndash Also it is hard to ask good questions about

graphs (especially with just structure)

3 Too much focus on performance More important to enable ldquoextracting valuerdquo

DrunkardMob - RecSys 13

Random walk in an in-memory graph

bull Compute one walk a time (multiple in parallel of course)parfor walk in walks

for i=1 to numsteps vertex = walkatVertex() walktakeStep(vertexrandomNeighbor())

DrunkardMob - RecSys 13

Problem What if Graph does not fit in memory

Twitter network visualization by Akshay Java 2009

Distributed graph systems- Each hop across partition boundary is costly

Disk-based ldquosingle-machinerdquo graph systems- ldquoPagingrdquo from disk

is costly

(This talk)

DrunkardMob - RecSys 13

Random walks in GraphChi

bull DrunkardMob ndashalgorithmndash Reverse thinking

ForEach interval p walkSnapshot = getWalksForInterval(p) ForEach vertex in interval(p) mywalks = walkSnapshotgetWalksAtVertex(vertexid) ForEach walk in mywalks

walkManageraddHop(walk vertexrandomNeighbor())

Note Need to store only current position of each walk

  • Thesis Defense Large-Scale Graph Computation on Just a PC
  • Research Fields
  • Why Graphs
  • BigData with Structure BigGraph
  • Why on a single machine
  • Why use a cluster
  • Why use a cluster (2)
  • Benefits of single machine systems
  • Computing on Big Graphs
  • Big Graphs = Big Data
  • Research Goal
  • Terminology
  • Slide 13
  • Disk-based graph computation
  • Slide 15
  • Computational Model
  • Vertex-centric Programming
  • Computational Setting
  • The Main Challenge of Disk-based Graph Computation
  • Random Access Problem
  • Our Solution
  • Parallel Sliding Windows Phases
  • PSW Shards and Intervals
  • Example Layout
  • PSW Loading Sub-graph
  • PSW Loading Sub-graph (2)
  • Parallel Sliding Windows
  • ldquogauss-Seidelrdquo Asynchronous
  • Synchronous vs Gauss-Seidel
  • PSW runs Gauss-Seidel
  • Synchronous (Jacobi)
  • PSW is Asynchronous (Gauss-Seidel)
  • Label Propagation
  • PSW amp External Memory Algorithms Research
  • GraphChi system evaluation
  • GraphChi
  • Experiment Setting
  • Comparison to Existing Systems
  • PowerGraph Comparison
  • In-memory Comparison
  • Scalability Input Size [SSD]
  • Graphchi-DB
  • Slide 44
  • Research Questions
  • Existing Graph Database Solutions
  • Partitioned adjacency lists (PAL) data structure
  • Review Edges in Shards
  • Shard Structure (Basic)
  • Shard Structure (Basic) (2)
  • PAL In-edge Linkage
  • PAL In-edge Linkage (2)
  • PAL Out-edge Queries
  • Experiment Indices
  • Queries IO costs
  • Edge Data amp Searches
  • Efficient Ingest
  • Merging Buffers to Disk
  • Merging Buffers to Disk (2)
  • Log-Structured Merge-tree (LSM) Ref OrsquoNeil Cheng et al (19
  • Experiment Ingest
  • Advantages of PAL
  • Experimental comparisons
  • GraphChi-DB Implementation
  • Comparison Database Size
  • Comparison Ingest
  • Comparison Friends-of-Friends Query
  • LinkBench Online Graph DB Benchmark by Facebook
  • Summary of Experiments
  • Discussion
  • Greater impact
  • Impact ldquoBig Datardquo Research
  • Impact Users
  • Impact Users (cont)
  • Evaluation hindsight
  • What is GraphChi Optimized for
  • GraphChi Not Good For
  • Versatility of PSW and PAL
  • Future Research Directions
  • What if we have plenty of memory
  • Observations
  • Using RAM efficiently
  • Parallel Computation Examples
  • conclusion
  • Summary of Published Work
  • Summary of Main Contributions
  • Additional slides
  • Economics
  • PSW for In-memory Computation
  • PSW for in-memory
  • Remarks (sync vs async)
  • IO Complexity
  • Impact of Graph Structure
  • Graph Compression Would it help
  • Previous research on (single computer) Graph Databases
  • LinkBench
  • Comparison to FB (cont)
  • LinkBench GraphChi-DB performance Size of DB
  • Possible Solutions
  • Number of Shards
  • Multiple hard-drives (RAIDish)
  • Bottlenecks
  • Bottlenecks Multicore
  • In-memory vs Disk
  • Experiment Query latency
  • Example Induced Subgraph Queries
  • Vertices Nodes
  • ID-mapping
  • What if we have a Cluster
  • Graph Computation Research Challenges
  • Random walk in an in-memory graph
  • Problem What if Graph does not fit in memory
  • Random walks in GraphChi
Page 105: Thesis Defense Large-Scale Graph Computation on Just a PC Aapo Kyrölä akyrola@cs.cmu.edu Carlos Guestrin University of Washington & CMU Guy Blelloch CMU.

106

Example Induced Subgraph Queries

bull Induced subgraph for vertex set S contains all edges in the graph that have both endpoints in S

bull Very fast in GraphChi-DBndash Sufficient to query for out-edgesndash Parallelizes well multi-out-edge-query

bull Can be used for statistical graph analysisndash Sample induced neighborhoods induced FoF

neighborhoods from graph

Vertices Nodesbull Vertices are partitioned similarly as

edgesndash Similar ldquodata shardsrdquo for columns

bull Lookupupdate of vertex data is O(1)bull No merge tree here Vertex files are

ldquodenserdquondash Sparse structure could be supported1

a

a+1

2a Max-id

ID-mapping

bull Vertex IDs mapped to internal IDs to balance shardsndash Interval length constant a

012

aa+1a+2

2a2a+1

hellip

0 1 2 255

Original ID 0 1 2 255

256 257 258 511

109

What if we have a Cluster?

(Figure: graph and working memory (PSW) on disk; computational state – Computation 1 state, Computation 2 state – in RAM)

Trade latency for throughput!

110

Graph Computation Research Challenges

1. Lack of truly challenging (benchmark) applications

2. … which is caused by the lack of good data available to academics: big graphs with metadata
   – Industry co-operation? But problems with reproducibility
   – Also, it is hard to ask good questions about graphs (especially with just structure)

3. Too much focus on performance. More important to enable "extracting value"!

DrunkardMob - RecSys 13

Random walk in an in-memory graph

• Compute one walk at a time (multiple in parallel, of course):

    parfor walk in walks:
        for i = 1 to numsteps:
            vertex = walk.atVertex()
            walk.takeStep(vertex.randomNeighbor())
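
For reference, a small runnable version of this baseline, with an adjacency dict standing in for the in-memory graph (names are illustrative):

    import random

    def run_walks(graph, starts, numsteps):
        """Naive baseline: advance one walk at a time for numsteps hops."""
        finals = []
        for start in starts:                 # the outer loop parallelizes across walks
            v = start
            for _ in range(numsteps):
                neighbors = graph.get(v, [])
                if not neighbors:            # dead end: the walk stays put
                    break
                v = random.choice(neighbors)
            finals.append(v)
        return finals

    graph = {0: [1, 2], 1: [3], 2: [3], 3: [0]}
    print(run_walks(graph, starts=[0, 0, 2], numsteps=5))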

DrunkardMob - RecSys 13

Problem: What if the graph does not fit in memory?

• Distributed graph systems: each hop across a partition boundary is costly.
• Disk-based "single-machine" graph systems (this talk): "paging" from disk is costly.

Twitter network visualization by Akshay Java, 2009

DrunkardMob - RecSys 13

Random walks in GraphChi

• DrunkardMob algorithm – reverse thinking:

    ForEach interval p:
        walkSnapshot = getWalksForInterval(p)
        ForEach vertex in interval(p):
            mywalks = walkSnapshot.getWalksAtVertex(vertex.id)
            ForEach walk in mywalks:
                walkManager.addHop(walk, vertex.randomNeighbor())

Note: Need to store only the current position of each walk!
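
A self-contained sketch of the same reversal — walks are bucketed by the interval of their current vertex, and each bucket is advanced while that interval's sub-graph is loaded — with the adjacency dict and all names standing in for GraphChi's actual on-disk structures:

    import random
    from collections import defaultdict

    def drunkardmob(graph, intervals, start_vertices, num_iterations):
        """Advance many walks one hop per iteration, one vertex interval at a time.
        Only the current position of each walk is stored."""
        positions = list(start_vertices)                 # walk id -> current vertex
        for _ in range(num_iterations):
            buckets = defaultdict(list)                  # interval -> walks currently there
            for walk_id, v in enumerate(positions):
                for p, (lo, hi) in enumerate(intervals):
                    if lo <= v <= hi:
                        buckets[p].append(walk_id)
                        break
            for p, walk_ids in buckets.items():          # process one interval at a time,
                for walk_id in walk_ids:                 # as PSW loads its sub-graph
                    v = positions[walk_id]
                    neighbors = graph.get(v, [])
                    if neighbors:
                        positions[walk_id] = random.choice(neighbors)   # addHop
        return positions

    graph = {0: [1, 2], 1: [3], 2: [3], 3: [0]}
    print(drunkardmob(graph, intervals=[(0, 1), (2, 3)], start_vertices=[0, 0, 2], num_iterations=4))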
