Planetary-Scale Views on a Large Instant-Messaging Network
-
Upload
michael-stokes -
Category
Documents
-
view
17 -
download
0
description
Transcript of Planetary-Scale Views on a Large Instant-Messaging Network
![Page 1: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/1.jpg)
Planetary-Scale Views on a Large Instant-Messaging Network
Jure Leskovec ([email protected])Joint work with Eric Horvitz, Microsoft Research
![Page 2: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/2.jpg)
2
Instant Messaging
Contact (buddy) list Messaging window
![Page 3: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/3.jpg)
3
Instant Messaging as a Network
Buddy Conversation
![Page 4: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/4.jpg)
4
IM – Phenomena at planetary scale
Observe social and communication phenomena at a planetary scale
Largest social network analyzed to date
Research questions: How does communication change with user
demographics (age, sex, language, country)? How does geography affect communication? What is the structure of the communication
network?
![Page 5: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/5.jpg)
5
Data description: Communication
For every conversation (session) a list of participants: User Id Time Joined Time Left Number of Messages Sent Number of Messages Received
There can be multiple participants per conversation
Everything is anonymized. No message text
![Page 6: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/6.jpg)
6
Data description: Demographics User demographic data (self-reported):
Age Gender Location (Country, ZIP) Language
![Page 7: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/7.jpg)
7
Data statistics: Total activity We collected the data for June 2006 Log size:
150Gb/day (compressed) Total: 1 month of communication data:
4.5Tb of compressed data Activity over June 2006 (30 days)
245 million users logged in 180 million users engaged in conversations 17,5 million new accounts activated More than 30 billion conversations More than 255 billion exchanged messages
![Page 8: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/8.jpg)
8
Data statistics: Typical day
Activity on a typical day (June 1 2006): 1 billion conversations 93 million users login 65 million different users talk (exchange
messages) 1.5 million invitations for new accounts sent
![Page 9: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/9.jpg)
Part 3-9
User & Communication characteristics
How does user demographics influence communication?
![Page 10: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/10.jpg)
10
User Age: MSN vs. the world
![Page 11: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/11.jpg)
Part 3-11
Communication: Demographics
People tend to talk to similar people (except gender)
How do people’s attributes (age, gender) influence communication?
Probability that users share an attribute
![Page 12: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/12.jpg)
12
Age: Number of conversations
Use
r se
lf r
eport
ed
ag
eHigh
Low
1) Young people communicate with same age
2) Older people communicate uniformly across ages
![Page 13: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/13.jpg)
13
Age: Total conversation duration
Use
r se
lf r
eport
ed a
ge
High
Low
1) Old people talk long2) Working ages (25-40)
talk short
![Page 14: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/14.jpg)
14
Age: Messages per conversation
Use
r se
lf r
eport
ed a
ge
High
Low
1) Old people talk long2) Working ages (25-40)
talk quick
![Page 15: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/15.jpg)
15
Age: Messages per unit time
Use
r se
lf r
eport
ed a
ge
High
Low1) Old people talk slow
2) Young talk fast
![Page 16: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/16.jpg)
16
Communication: Gender
Is gender communication biased? Homophily: Do female talk more among themselves? Heterophily: Do male-female conversations took longer?
Findings: Num. of. conversations is not biased (follows chance) Cross-gender conversations take longer and are more
intense (more attention invested)
M F49%21%20%
Conversations
M F5 min4.5 min4min
Duration
M F7.66.65.9
Messages/conversation
![Page 17: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/17.jpg)
17
Communication: Geo distance
Longer links are used more
![Page 18: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/18.jpg)
18
Communication: Geography (1)
Each dot represents number of users at geo location
Map of the world appears!Costal regions dominate
![Page 19: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/19.jpg)
19
Communication: Geography (2)
Users per capita
Fraction of country’s population on MSN:•Iceland: 35%•Spain: 28%•Netherlands, Canada, Sweden, Norway: 26%•France, UK: 18%•USA, Brazil: 8%
![Page 20: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/20.jpg)
20
Communication: Geography (3)
Digital darkness, “Digital Divide”
![Page 21: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/21.jpg)
21
World communication axis
For each conversation between geo points (A,B) we increase the intensity on the line between A and B
![Page 22: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/22.jpg)
22
Who talks to whom: Number of conversations
![Page 23: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/23.jpg)
23
Who talks to whom: Conversation duration
![Page 24: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/24.jpg)
24
Number of people per conversation
Max number of people simultaneously talking is 20, but conversation can have more people
![Page 25: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/25.jpg)
25
Conversations: number of messages
Sessions between fewer people run out of steam
![Page 26: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/26.jpg)
Messaging as a Network
26At least 1 message exchanged
![Page 27: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/27.jpg)
27
IM Communication Network Buddy graph
240 million people (people that login in June ’06) 9.1 billion buddy edges (friendship links)
Communication graph (take only 2-user conversations) Edge if the users exchanged at least 1 message 180 million people 1.3 billion edges 30 billion conversations
![Page 28: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/28.jpg)
28
Network: Number of buddies
Number of buddies follows power-law with exponential
cutoff distribution
Limit of 600 buddies
![Page 29: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/29.jpg)
29
Network: Communication degree
There is “no average” degree. But degrees are heavily skewed.“Heavy tailed” or “power law” distributions
![Page 30: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/30.jpg)
30
Network: Connectivity
![Page 31: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/31.jpg)
31
Is the world Small-world?
Milgram’s small world experiment
(i.e., hops + 1)
Small-world experiment [Milgram ‘67] People send letters from Nebraska to Boston
How many steps does it take? Messenger social network of the whole planet Eart
240M people, 1.3B edges
6 degrees of
separation
![Page 32: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/32.jpg)
32
Network: Small world
MSN Messenger network
Number of steps
between pairs of people
Avg. path length 6.690% of the people can be reached in
< 8 hops
Hops Nodes0 1
1 10
2 78
3 3,96
4 8,648
5 3,299,252
6 28,395,849
7 79,059,497
8 52,995,778
9 10,321,008
10 1,955,007
11 518,410
12 149,945
13 44,616
14 13,740
15 4,476
16 1,542
17 536
18 167
19 71
20 29
21 16
22 10
23 3
24 2
25 3
![Page 33: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/33.jpg)
33
Distance of links on a shortest path
0 5 10 15 20 25 300
1000
2000
3000
4000
5000
6000
7000
Hops, h
Geo d
ista
nce [
km
] b
etw
een
th
e n
od
e a
t h
an
d h
-1 h
op
s
on
th
e s
hort
est
path
Closer to the target node
![Page 34: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/34.jpg)
34
Where do shortest paths go?
What are characteristic of nodes on a shortest paths?
t
c
d(c,t)=h
Good nodes:d=h-1
Bad nodes: d≥h
Forwarding messages
![Page 35: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/35.jpg)
35
How hard it is to forward?
Number of nodes that get me closer to target
Number of choices (degree)
t
c
d(c,t)=h
Good nodes:d=h-1
Bad nodes: d≥h
![Page 36: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/36.jpg)
36
Random routing: Success prob.
If I forward the message at random, what is the success probability?
0 2 4 6 8 10 12 14 16 180
0.1
0.2
0.3
0.4
0.5
0.6
Hops to target, h
Success p
robabilit
y,
good/d
egre
e
t
c
d(c,t)=h
Good nodes:d=h-1
Bad nodes: d≥h
![Page 37: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/37.jpg)
37
Using node attributes: age
Age difference between c and t Age difference between c and c’
As we get closer to target more similar the current
node’s age is
Nodes on path have actually larger age difference than nodes off the
path
![Page 38: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/38.jpg)
38
Paths go through heavy users
Total usage time in minutes
Shortest paths get through the heavy users
![Page 39: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/39.jpg)
39
Compact nations
Degrees of separation (avg. shortest path length) inside the country
County Country Avg Path Len [hops]
Turkey Turkey 5.18Brazil Brazil 5.60
Belgium Belgium 5.63United
KingdomUnited
Kingdom 5.63
Spain Spain 5.72Mexico Mexico 5.72France France 6.03China China 6.38United States
United States 6.96
![Page 40: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/40.jpg)
40
USA: Degrees of separation
County CountryAvg Path
Len [hops]
United States Lebanon 6.17
United States Australia 6.22
United States Norway 6.23
United States Albania 6.24
United States Malta 6.24
United StatesUnited Kingdom
6.28
United States Bahamas 6.29
United States Sweden 6.37
United States Bahrain 6.37
United States Canada 6.38
County Country
Avg Path Len
[hops]
United States Bulgaria 7.28
United States Poland 7.39
United States Russia 7.42
United States Romania 7.48
United States Lithuania 7.57
United States Slovakia 7.84
United States Korea, South 8.03
United States Czech Republic 8.05
United States Japan 8.85Top “close” countries Top “far” countries
![Page 41: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/41.jpg)
41
Network: Clustering
How many triangles are closed?
Clustering normally decays as k-1
High clustering Low clustering
Communication network is
highly clustered: k-
0.37
![Page 42: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/42.jpg)
42
Network: k-Cores decomposition
What is the structure of the core of the network?
[Batagelj & Zaveršnik, 2002]
![Page 43: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/43.jpg)
43
Network: Robutesness
People with k<20 are the periphery Core is composed of 79 people, each having 68
edges among them
![Page 44: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/44.jpg)
44
Network: Tie-strength
Remove nodes (in some order) and observe how network falls apart: Number of edges deleted Size of largest connected component
Order nodes by: Number of links Total conversations Total conv. Duration Messages/conversation Avg. sent, avg. duration
![Page 45: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/45.jpg)
45
Strength: Nodes vs. Edges
![Page 46: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/46.jpg)
46
Strength: Connectivity
![Page 47: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/47.jpg)
47
Conclusion
Social network of the whole planet Earth The largest social network analyzed
Strong presence of homophily people that communicate are similar (except gender)
Well connected Small-world in only few hops one can research most of the
network Very robust
many (random) people can be removed and the network is still connected
![Page 48: Planetary-Scale Views on a Large Instant-Messaging Network](https://reader035.fdocuments.us/reader035/viewer/2022062422/568130ab550346895d96b52a/html5/thumbnails/48.jpg)
48
References
J. Leskovec and E. Horvitz: Worldwide Buzz: Planetary-Scale Views on an Instant-Messaging Network, WWW 2008
http://www.cs.cmu.edu/~jure