Complex Networks as a tool to study human...

Post on 28-Jun-2020

0 views 0 download

Transcript of Complex Networks as a tool to study human...

Santander Octubre2006 Marta Sánchez de La Lama

1de 9

Complex Networks as a tool to studyhuman activity

José Javier Ramasco

Bruno GonçalvesSteven A. Morris

Romualdo Pastor-SatorrasSergei Dorogovtsev

Santander Octubre2006 Marta Sánchez de La Lama

2de 9

Social Networks as weighted graphs

Measures and meaning

Social Inertia

Results in Empirical Networks and models

More information, Range

Conclusions

Outline

Santander Octubre2006 Marta Sánchez de La Lama

3de 9

Social Networks

Friendster.com

Santander Octubre2006 Marta Sánchez de La Lama

4de 9

Social Networks

Some characteristics that make social Some characteristics that make social networksnetworkshard to studyhard to study

Arbitrariness of the definition Arbitrariness of the definition

Variability Variability

Size constrains on the social probes Size constrains on the social probes

Santander Octubre2006 Marta Sánchez de La Lama

5de 9

Collaboration Networks

1 2 5

3 4 6

1 2 3 4 5 6

w = 1

2 1

1

1

1

1

Santander Octubre2006 Marta Sánchez de La Lama

6de 9

Collaboration Networks

The projected network contains less informationThe projected network contains less informationthan than the original bipartite graph.the original bipartite graph.

A way to avoid that in part is to use a weightedA way to avoid that in part is to use a weightednetwork in the projection.network in the projection.

The weight of a link may be defined asThe weight of a link may be defined as

Note however that there is still a certain loose of Note however that there is still a certain loose of information.information.

M.E.J. Newman, PRE 64, 016132 (2001).

!

wij = "ik" j

k= num. common collab.

k collab

#

!

wij ="ik" j

k

nk #1k collab

$

Santander Octubre2006 Marta Sánchez de La Lama

7de 9

Measures

We can define a number of distributions forWe can define a number of distributions forthe weighted networkthe weighted network

Degree distribution Degree distribution

Weight distribution of the edges Weight distribution of the edges

Strength distribution Strength distribution!

Pk(k) k C

k(k)

!

Pw(w) w C

w(w)

!

si = wij

j"# ( i)

$

Ps(s) s Cs(s)

Santander Octubre2006 Marta Sánchez de La Lama

8de 9

Measures

Other properties of the projected networkOther properties of the projected networkthat we are interested in that we are interested in

Clustering Clustering

Correlations Correlations

!

ci =2ti

ki(ki "1) C = ci

ciw

=1

si(ki "1)

wij + wim

2j,m#$ (i)

% aijaima jm Cw

= ciw

!

knn,i =1

kik j

j"# ( i)

$ knn (k) = knn,i k

snn (s) = snn,i s

Inn (I) = Inn,i s

A. Barrat, et al., PNAS 101, 3747 (2004).

Santander Octubre2006 Marta Sánchez de La Lama

9de 9

Social Inertia

WIJ

Remember that Remember that

that the strength meansthat the strength means

And the degree of a node meansAnd the degree of a node means

Let us define then the Let us define then the Social InertiaSocial Inertia as as

!

wij = num. of works together

!

si = wij

j"# ( i)

$ = total num of partnerships

!

ki= num. of different partners

!

"i=si

ki

J.J. Ramasco and S.A. Morris, PRE 73, 016122 (2006).

Santander Octubre2006 Marta Sánchez de La Lama

10de 9

Social Inertia

The extreme values of The extreme values of II are are

In general, represents a measure ofIn general, represents a measure ofthe eagerness of an author for collaborationsthe eagerness of an author for collaborationswith new people.with new people.

This concept could be generalized to other weightedThis concept could be generalized to other weightedgraphs, its physical meaning (?). graphs, its physical meaning (?).

!

Ii "1 if all collaborations happened

with different partners.

Ii " qi if all works where done with the

same team.

Santander Octubre2006 Marta Sánchez de La Lama

11de 9

Social Inertia

Warning: the social inertia is averaged overWarning: the social inertia is averaged overtime. It is the same concept as an average time. It is the same concept as an average vvelocity.elocity.

It should be possible to define an instant inertiaIt should be possible to define an instant inertiabut it requires a more detailed knowledge ofbut it requires a more detailed knowledge ofthe empirical databases.the empirical databases.

!

˜ I i="s

i

"ki T

?

Santander Octubre2006 Marta Sánchez de La Lama

12de 9

Empirical results

!

Na

= num. agents

Nc = num. collaborations

Nai = num. agents that

work alone

Santander Octubre2006 Marta Sánchez de La Lama

13de 9

Empirical results

Main and Inset: movies

Slopes main blue = -3 = 1-δ

!

Pw(w) ~ w

"#

Cw

= P(w')dw' ~ w1"#

w

$

%

Santander Octubre2006 Marta Sánchez de La Lama

14de 9

Empirical results

Main: moviesInset: biosensors

Slopes main red = -3

Slopes inset red = -3.8

Santander Octubre2006 Marta Sánchez de La Lama

15de 9

Empirical results

Main: moviesInset: superstrings

Slopes main red = 0.7

Santander Octubre2006 Marta Sánchez de La Lama

16de 9

Empirical results

Main: moviesInset: infoscience

Slopes main red = 0.4

Slopes inset red = 0.6

Santander Octubre2006 Marta Sánchez de La Lama

17de 9

Models

J.J. Ramasco et al., PRE 70, 036106 (2004).R. Guimarà et al., Science 308, 697 (2005).

1 2 3 4 5 6

Each step a new collaboration of size Each step a new collaboration of size nn is added. is added. mm of the agents are new, without experience. of the agents are new, without experience.

The rest The rest m-nm-n are selected from the pool of are selected from the pool of old old agents agents Prob Prob pp →→ one of the previous partners one of the previous partners of an of an ““oldold”” agent is chosen agent is chosen Prob Prob 1-p 1-p →→ an an ““oldold”” agent is chosen with agent is chosen with prob prob proportional to qproportional to q

After After QQcc collaborations, the agents have a collaborations, the agents have a probprob. . 1/1/ττ of becoming inactive.of becoming inactive.

Santander Octubre2006 Marta Sánchez de La Lama

18de 9

Simulation results

Santander Octubre2006 Marta Sánchez de La Lama

19de 9

Simulation results

Santander Octubre2006 Marta Sánchez de La Lama

20de 9

Range, different quality of the connections

Santander Octubre2006 Marta Sánchez de La Lama

21de 9

Motivation

• IMDB Actor network, wim = number of times i and m have worked together.

!

Pw(w) ~ w

"#

Cw

= P(w')dw' ~ w1"#

w

$

%

fit # = "4

Santander Octubre2006 Marta Sánchez de La Lama

22de 9

Motivation

!

si = wij = wi" 1+

j#" ( i)

$ wi" 2+K+ wi" ki

% s(k) ~ k1/ 2 &%<w> ~ k

'1/ 2

• P(w) has finite second moment.• P(w) has finite second moment.

• <w> does not depend on k

• P(w) has finite second moment.

• <w> does not depend on k

• Are the weight correlated?

(J.J. Ramasco, Eur. Phys. J. ST 143, 47 (2007))

Santander Octubre2006 Marta Sánchez de La Lama

23de 9

Models

• We can try to replicate this situation in a toy model

• The simplest case requires P(w,w’), P(w) and P(w’|w)

and also a sets of rules for weight assignment.

• This is not a unique method.

!

P(w) = P(w," w')dw'

P(w' |w) = P(w,w') /P(w)

Santander Octubre2006 Marta Sánchez de La Lama

24de 9

Models• We chose three possible functional forms for P(w,w’)

• The reason was that

!

P+(w,w') =X+

(w + w')2+"

PU(w,w') =

XU

(ww')1+"

P#(w,w') =X#

(ww'+1)1+"

!

< w >+ (w0) = (1+" + w

0) /"

< w >U

=" /(" #1)

< w ># (w0) = (" +1/w

0) /(" #1)

!

P(w) ~ w"1"#

Santander Octubre2006 Marta Sánchez de La Lama

25de 9

Measures• The goal is to find a measure to estimate the type/intensity of weight

correlations and their intensity.

• P(w) is fixed.

• Compare actual pattern with a random

configuration

!

"w

2(i) = (wij# < w > (i))

2

j

$

!

" =<#w >original

<#w >random

!

Y2(i) =

wij

2

j"

( wij )2

j"

, Y2(k) ~

cons.

1/k

# $ %

!

" =<Y

2>original

<Y2

>random

!

r(i) =wmax (i) " wmin (i)

wmax (i) + wmin (i)

!

" =< r >original

< r >random

M. Barthelemy, Physica A 319, 633 (2003).J.J. Ramasco et al., to appear PRE.

Santander Octubre2006 Marta Sánchez de La Lama

26de 9

Measures

• The three magnitudes are able to detect up to some level weightcorrelations

• But there is a resolution problem

Optimal measure?

Santander Octubre2006 Marta Sánchez de La Lama

27de 9

Transport• There are many ways to describe network transport properties

• We focus on the so called Superhighways

!

"sphw =Wsph (orig)

Wsph (rand)

Ssphw =Num. nodes sphw (orig)

Num. nodes sphw (rand)

Z. Wu et al., PRL 96, 148702 (2006).

Santander Octubre2006 Marta Sánchez de La Lama

28de 9

Transport

• Actor network

ρ ≈ 0.268(1)

Ω ≈ 20(7)

S ≈ 3(1)

• US airport traffic

ρ ≈ 0.983(1)

Ω ≈ 2.6(1)

Santander Octubre2006 Marta Sánchez de La Lama

29de 9

ConclusionsWe have studied collaborations networks usingWe have studied collaborations networks usinga weighteda weighted graph representation.graph representation.

This representation allows us to define the This representation allows us to define the Social Inertia in a natural way.Social Inertia in a natural way.

The Inertia (measured in a quantitative way) The Inertia (measured in a quantitative way) grows with the experience and the network is grows with the experience and the network is assortative assortative for the inertia. for the inertia.

The model is able to reproduce some of the The model is able to reproduce some of the quantitative features of the empirical networksquantitative features of the empirical networksbut it is necessary more detail for a better but it is necessary more detail for a better quantitative result.quantitative result.

J.J. Ramasco and S.A. Morris, PRE 73, 016122 (2006).

Santander Octubre2006 Marta Sánchez de La Lama

30de 9

Conclusions

Weight correlations appear in real-world networksWeight correlations appear in real-world networks

We have propose a measures to quantify the levelWe have propose a measures to quantify the levelof correlationsof correlations

These correlations have a strong effect on theThese correlations have a strong effect on thetransport properties of the graphtransport properties of the graph

Open questions:Open questions:

–– Origin of the phenomenon in real nets.Origin of the phenomenon in real nets.

–– Effect on disease spreading.Effect on disease spreading.

J.J. Ramasco & B. Gonçalves, to appear in PRE.

Santander Octubre2006 Marta Sánchez de La Lama

31de 9

Web Surfing• The database is formed by the weblogs of Emory University from Apr. 1st2005 to Jan. 17th 2006 (41 weeks).

• Each click in a web of the university is registered at the time resolution of 1second.

Santander Octubre2006 Marta Sánchez de La Lama

32de 9

Statistics IPs URLs

!

Cx = fx (x ')dx 'x

"

#

Santander Octubre2006 Marta Sánchez de La Lama

33de 9

Statistics

Santander Octubre2006 Marta Sánchez de La Lama

34de 9

Growth

IPs

URLs

Santander Octubre2006 Marta Sánchez de La Lama

35de 9

Growth

Santander Octubre2006 Marta Sánchez de La Lama

36de 9

Other aspects

Santander Octubre2006 Marta Sánchez de La Lama

37de 9

Other aspects

Santander Octubre2006 Marta Sánchez de La Lama

38de 9

Other aspects

Santander Octubre2006 Marta Sánchez de La Lama

39de 9

Other aspects

Santander Octubre2006 Marta Sánchez de La Lama

40de 9

Conclusions

We have studied an empirical databasegenerating an evolving bipartite graph.

Preferential attachment plays an importantrole in the evolution of the weights but sodoes aging of the connections.

These data allow us to consider also theactivity patterns of the community.

Open questions …

B Gonçalves & JJ Ramasco, in preparation