Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA,...

24
Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France

Transcript of Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA,...

Page 1: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Traffic-driven model of the World-Wide-Web Graph

A. Barrat, LPT, Orsay, FranceM. Barthélemy, CEA, FranceA. Vespignani, LPT, Orsay, France

Page 2: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Outline

The WebGraph Some empirical characteristics Various models Weights and strengths Our model:

Definition Analysis: analytics+numerics

Conclusions

Page 3: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

The Web as a directed graph

i

jl nodes i: web-pages

directed links: hyperlinks

in- and out- degrees:

Page 4: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

•Small world : captured by Erdös-Renyi graphs

Poisson distribution

<k> = p N

With probability p an edge is established among couple of vertices

Empirical facts

Page 5: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

•Small world•Large clustering: different neighbours of a node will likely know each other

1

2

3

n

Higher probability to be connected

=>graph models with large clustering, e.g. Watts-Strogatz 1998

Empirical facts

Page 6: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

•Small world•Large clustering•Dynamical network•Broad connectivity distributions

•also observed in many other contexts (from biological to social networks)•huge activity of modeling

Empirical facts

(Barabasi-Albert 1999; Broder et al. 2000; Kumar et al. 2000; Adamic-Huberman 2001; Laura et al. 2003)

Page 7: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Various growing networks models Barabáási-Albert (1999): preferential attachment Many variations on the BA model: rewiring (Tadic

2001, Krapivsky et al. 2001), addition of edges, directed model (Dorogovtsev-Mendes 2000, Cooper-Frieze 2001), fitness (Bianconi-Barabáási 2001), ...

Kumar et al. (2000): copying mechanism Pandurangan et al. (2002): PageRank+pref.

attachment Laura et al. (2002): Multi-layer model Menczer (2002): textual content of web-pages

Page 8: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

The Web as a directed graph

i

jl nodes i: web-pages

directed links: hyperlinks

Broad P(kin) ; cut-off for P(kout)

(Broder et al. 2000; Kumar et al. 2000; Adamic-Huberman 2001; Laura et al. 2003)

Page 9: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Additional level of complexity: Weights and Strengths

i

jLinks carry weights/traffic:

wij

In- and out- strengths

l

Adamic-Huberman 2001: broad distribution of sin

Page 10: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Model: directed network

n i

j (i) Growth

(ii) Strength driven preferential attachment (n: kout=m outlinks)

AND...

“Busy gets busier”

Page 11: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Weights reinforcement mechanism

i

j

n

The new traffic n-i increases the traffic i-j“Busy gets busier”

Page 12: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Evolution equations

(Continuous approximation)

Coupling term

Page 13: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Resolution

Ansatz

supported by numerics:

Page 14: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Results

Page 15: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Approximation

Total in-weight i sini : approximately proportional to the

total number of in-links i kini , times average weight hwi = 1+

Then: A=1+

sin 2 [2;2+1/m]

Page 16: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Measure of A

prediction of

Numerical simulations

Approx of

Page 17: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Numerical simulations

NB: broad P(sout) even if kout=m

Page 18: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Clustering spectrum

i.e.: fraction of connected couples of neighbours of node i

Page 19: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Clustering spectrum

• increases => clustering increases

• New pages: point to various well-known pages, often connected together => large clustering for small nodes

• Old, popular pages with large k: many in-links from many less popular pages which are not connected together => smaller clustering for large nodes

Page 20: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Clustering and weighted clustering

takes into account the relevance of triangles in the global traffic

Page 21: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Clustering and weighted clustering

Weighted Clustering larger than topological clustering:triangles carry a large part of the traffic

Page 22: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Assortativity

Average connectivity of nearest neighbours of i

Page 23: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Assortativity

•knn: disassortative behaviour, as usual in growing networksmodels, and typical in technological networks

•lack of correlations in popularity as measured by the in-degree

Page 24: Traffic-driven model of the World-Wide-Web Graph A. Barrat, LPT, Orsay, France M. Barthélemy, CEA, France A. Vespignani, LPT, Orsay, France.

Summary Web: heterogeneous topology and traffic Mechanism taking into account interplay between

topology and traffic Simple mechanism=>complex behaviour, scale-free

distributions for connectivity and traffic Analytical study possible Study of correlations: non-trivial hierarchical

behaviour Possibility to add features (fitnesses, rewiring,

addition of edges, etc...), to modify the redistribution rule...

Empirical studies of traffic and correlations?