KONECT Cloud – Large Scale Network Mining in the Cloud
-
Upload
jerome-kunegis -
Category
Technology
-
view
104 -
download
1
description
Transcript of KONECT Cloud – Large Scale Network Mining in the Cloud
1
KONECT Cloud
Large Scale Network Mining in the Cloud
Jérôme Kunegis Future SOC Lab Day, 18.04.2012
Networks are Everywhere
Communication
Authorship
Friendship
c
Interaction
Trust
Co-occurrence
Social Networks
friend
Trust Networks
trust
Friend/Enemy Network
enemy
frien
d
Interaction Networklisten
KONECT – Koblenz Network Collection
148 network datasets
26 are undirected 38 are directed 84 are bipartite 59 have unweighted edges 77 allow multiple edges 04 have signed edges 08 have ratings as edges 78 have edge arrival times
konect.uni-koblenz.de
Largest Network
Directed “who follows who” network
0 041 652 230 users
1 468 365 182 edges
konect.uni-koblenz.de/networks/twitter
148 Network Datasets
authorshipcommunicationco-occurrence
featuresfolksonomyinteraction
physicalratings
referencesemantic
socialtrust
What We Computed
Connected componentsNetwork diameterClustering coefficientsDegree distributionsSpectral distributionEigenvector centralityGraph drawingTemporal AnalysisLink prediction
←at Future SOC Lab
Network Diameter
6
90 Percentile Effective Diameter
5
90 Percentile Effective Diameter
3
90 Percentile Effective Diameter
3.75
Computing the Effective Diameter
for each node i { |V| count hops needed to reach 90% |E|
}
Total runtime: |E| × |V|
Graph Sampling
KeepX% of edges
Computation
× 1 000 vertices (sampled)× 120 840 391 edges× 20 sample sizes (5%, 10%, …, 100%)× 50 random samplings
Evaluation on single machine:
1 TiB memory 64 cores Matlab 64 bit
Results
Dr. Jérôme Kunegis
west.uni-koblenz.de
Thank You!
konect.uni-koblenz.de