Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir...

28
Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt

Transcript of Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir...

Page 1: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Power Laws:Rich-Get-Richer Phenomena

Chapter 18: “Networks, Crowds and Markets”

By Amir Shavitt

Page 2: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Popularity

• How do we measure it?• How does it effect the network?• Basic network models with popularity

Page 3: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Popularity Examples

• Books• Movies• Music• Websites

Page 4: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Our Model

• The web as a directed graph– Nodes are webpages– Edges are hyperlinks– #in-links = popularity– 1 out-link per page

www.cs.tau.ac.il

www.tau.ac.il

www.eng.tau.ac.il

course homepage

Page 5: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Our Main Question

• As a function of k, what fraction of pages on the web have k in-links?

Page 6: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Expected Distribution

• Normal distribution– Each link addition is an experiment

Page 7: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Power Laws

• f(k) 1/kc

• Usually c > 2• Tail decreases much slower than the normal

distribution

Page 8: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Examples

• Telephone numbers that receive k calls per day

• Books bought by k people• Scientific papers that receive k citations

Page 9: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Power Laws Graphs Examples

Page 10: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Power Laws vs Normal Distribution

• Normal distribution – many independent experiments

• Power laws – if the data measured can be viewed as a type of popularity

Page 11: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

What causes power laws?

• Correlated decisions across a population• Human tendency to copy decisions

Page 12: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Our Model

• The web as a directed graph– Nodes are webpages– Edges are hyperlinks– #in-links = popularity– 1 out-link per page

www.cs.tau.ac.il

www.tau.ac.il

www.eng.tau.ac.il

course homepage

Page 13: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Building a Simple Model

• Webpages are created in order 1,2,3,…,N– Dynamic network growth

• When page j is created, with probability:– p: Chooses a page uniformly at random among all

earlier pages and links to it– 1-p: Chooses a page uniformly at random among

all earlier pages and link to its link

Page 14: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Examplep1-p

Page 15: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Results

1 10 100 1000 10000 1000001

10

100

1000

10000

100000

1000000

p=0.1p=0.3p=0.5

#in-degree ( k )

coun

t

y = 78380k-2.025

y = 835043x-2.652

Page 16: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Conclusions

• p gets smaller → more copying → the exponent c gets smaller

• More likely to see extremely popular pages

Page 17: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Rich-Get-Richer

• With probability (1-p), chooses a page k with probability proportional to k’s #in-links

• A page that gets a small lead over others will tend to extend this lead

Page 18: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Initial Phase

• Rich-get-richer dynamics amplifies differences• Sensitive to disturbance• Similar to information cascades

Page 19: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

How Sensitive?

• Music download site with 48 songs• 8 “parallel” copies of the site• Download count for each song• Same starting point

Subjects

Social influence condition

Independent condition

World 1

World 8

World

Page 20: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

How many people have chosen to download this song?

Source: Music Lab, http://www.musiclab.columbia.edu/

Page 21: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Results

• Exp. 1: songs shown in random order

• Exp. 2: songs shown in order of download popularity

Gini = A / (A + B)

The Lorenz curve is a graphical representation of the cumulative distribution function

Page 22: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Gini Coefficient

song 1

song 2

song 3

song 4

0123456

rank ≤ 1

rank ≤ 2

rank ≤ 3

rank ≤ 4

05

10152025

song 1

song 2

song 3

song 4

02468

1012

rank ≤ 1

rank ≤ 2

rank ≤ 3

rank ≤ 4

0

5

10

15

20

25

sum of downloads of all songs with rank i

Page 23: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Findings

• Best songs rarely did poorly• Worst songs rarely did well• Anything else was possible• The greater the social influence, the more

unequal and unpredictable the collective outcomes become

Page 24: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Zipf Plot

100

101

102

103

104

100

101

102

103

104

node degree for AS20000102.m

Zipf plot - plotting the data on a log-log graph, with the axes being log (rank order) and log (popularity)

rank order

popularity

Page 25: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

The Long Tail

What number of items have popularity at least k?

1 10 100 1000 10000 1000001

10

100

1000

10000

100000

1000000

rank

popu

larit

y (#

in-d

egre

e)

p=0.1n=1,000,000

Tail contains990,000 nodes

Page 26: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.
Page 27: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Why is it Important?

Page 28: Power Laws: Rich-Get-Richer Phenomena Chapter 18: “Networks, Crowds and Markets” By Amir Shavitt.

Search Tools and Recommendation Systems

• How does it effect rich-get-richer dynamics?• How do we find niche products?