Cooperation, Reputation & Gossiping

25
Introduction Indirect Reciprocity Gossip Results Cooperation, Reputation & Gossiping V.A. Traag 1 , P. Van Dooren 1 , Y.E. Nesterov 2 1 ICTEAM Universit´ e Catholique de Louvain 2 CORE Universit´ e Catholique de Louvain 15 April 2011

description

Presentation at Machine Learning Group, Université Libre de Bruxelles, April 15, 2011.

Transcript of Cooperation, Reputation & Gossiping

Page 1: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Cooperation, Reputation & Gossiping

V.A. Traag1, P. Van Dooren1, Y.E. Nesterov2

1ICTEAMUniversite Catholique de Louvain

2COREUniversite Catholique de Louvain

15 April 2011

Page 2: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Outline

1 Introduction

2 Indirect Reciprocity

3 Gossip

4 Results

Page 3: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Cooperation

Cooperation (and defection)

• Organizations (also Wikipedia, open source software, . . . )◮ Why do people contribute?

• Worker ants in colonies◮ Why do workers help without individual benefit?

• Prudent parasites in hosts◮ Why do parasites not replicate faster?

• Human body◮ Why do cells not replicate faster?

Central question

If defecting (not cooperating) is a real option, why (and how) hascooperation evolved?

Page 4: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Formal cooperation (and defection)

Prisoner’s Dilemma

• The game knows two options, donating or not donating.

• Donate at a cost c > 0 to benefit someone else with benefitb > c .

• Agents are paired, and play a round of donating or not.

• Cooperators C donate, defectors D do not donate.

This can be summarized in the payoff matrix

A =

(C D

C b − c −c

D b 0

)

Defectors dominate

Whatever strategy you encounter (C or D), always better to defect.

Page 5: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Dynamical View

• Natural to model game dynamics in an evolutionary context.

• Survival of the fittest (fitness = payoff).

Definition (Replicator equation)

Population with i = 1, . . . , n different mixed strategies pi

xi Relative abundance (frequency)

p =∑

i pixi Average strategy

fi = p⊺

i Ap Expected payoff

f = p⊺Ap Average payoff

Evolution of the population given by

xi = xi (fi − f ) = xi ((pi − p)⊺Ap).

Page 6: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Overview

What are possibly mechanisms to get cooperation?Payoff matrix

A =

(C D

C b − c −c

D b 0

)

Mechanisms

• Kin selection (r > cb)

Cooperate because offspring benefits of your cooperation. Basisof ‘selfish gene’, or ‘inclusive fitness’.

• Direct reciprocity (w > cb)

Cooperate because of possible future payoffs.

• Indirect reciprocity (q > cb)

Cooperate because someone else may cooperate with you in thefuture.

Nowak. Science (2006) vol. 314 (5805) pp. 1560-1563

Page 7: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Some strategies

Example (Always)

Defect/cooperate on all rounds

Other CDDDDCC

AllD DDDDDDD

AllC CCCCCCCC

Example (Tit-for-tat)

Start cooperating, then repeatopponent.

Other CDDDDCC

TFT CCDDDDC

Example (Win-Stay, Lose-Shift)

Change strategy if losing, keep itotherwise.

Other CDDDDCC

WSLS CCDCDCC

Example (Generous Tit-for-tat)

As TFT, but cooperates afterdefection with probability p.

Other CDDDDCC

GTFT CCDDCDC

Page 8: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Insufficient explanation

Why is kin selection and reciprocity not sufficient?

Insufficient explanation

• Humans cooperate also with non-kin.

• Humans cooperate in non-iterative situations.

Indirect reciprocity

• Cooperate if cooperated with others in the past.

• Brings reputation into play.

• How to respond to reputation?

• How to determine new reputation?

Nowak and Sigmund. Nature (2005) vol. 437 (7063) pp. 1291-1298

Page 9: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Image score

Definition (Image score, reputation)

• Integer status −5 ≤ Si ≤ 5 known to all.

• If cooperate increase (with 1).

• If defect decrease (with 1).

Definition (Discriminator Strategy)

• Cooperative threshold −5 ≤ kj ≤ 6.

• If status Si ≥ kj cooperate, otherwise defect.

• Strategy kj = −5 corresponds to AllC.

• Strategy kj = 6 corresponds to AllD.

Nowak and Sigmund. Nature (1998) vol. 393 (6685) pp. 573-577.

Page 10: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Other reputation dynamics

Morals

• Defecting a defector, good or bad?

• What action should be regarded as good?

• When to cooperate, when to defect?

GG GB BG BB

C ∗ ∗ ∗ ∗

D ∗ ∗ ∗ ∗

∗ ∗ ∗ ∗

Reputation of donor and recipientAction of donor

New reputation can beeither Good or Bad

Action can be eitherCooperate or Defect

Page 11: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Some reputation dynamics

GG GB BG BB

C G G G G

D B B B BImage scoring

C G G G G

D B G B BStanding

C G B G B

D B G B BJudging

C G B G B

D B B B BShunning

Page 12: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Leading eight

Best strategies

• In total 2, 048 different possible strategies.

• There are 8 strategies (leading eight) that perform best (highestpayoff, and ESS).

GG GB BG BB

C G ∗ G ∗

D B G B ∗

C D C ×

Maintainance of cooperation

Mark defectors

Punish defectors

Forgive defectors

Apologize

Ohtsuki and Iwasa. Journal of Theoretical Biology (2004) vol. 231 (1) pp. 107-120

Page 13: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Subjective reputation

Subjective reputation

• Unrealistic that everybody knows the reputation of everybody.

• Introduce a subjective (private) reputation.

• ‘Observe’ only a few interactions.

Observing

• Probability q of observing an interaction.

• Cooperation declines with lower q.

• Diverging reputations cause further errors.

• Good may defect bad, but not all agree on who’s bad.

Page 14: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Synchronize reputations

Synchronizing reputations

• Spread local information to synchronize reputations.

• Players ‘gossip’ about each other to share information.

• Start gossip, spread gossip and how to interpret gossip?

Lying, cheating and defecting

• Possibly ‘false’ gossips spread.

• Spread rumours unconditionally allows liars to invade.

• Liars cannot invade conditional rumour spreaders.

Page 15: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Proposed model

Basics

1 Each agent has a reputation of the other: Rij(m)

2 Everybody plays and cooperates/defects based on localreputation

3 Everybody gossips the result of the interaction

4 New reputation Rij(m + 1) based on:◮ Own observation (∆Iij (m)),◮ Gossip (∆Sij (k ,m)).

Decision to cooperate

The decision to cooperate αij(m) =

{0 if Rij(m) < 01 if Rij(m) ≥ 0

Page 16: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Gossiping

Consider all neighbours k when updating the reputation Rij

i j

k

The link tobe updated.

Does i ‘like’ k?

Will k gossip to i?

What actionhas j takento k?

Page 17: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Social strategy

G B

C G B

D B G

Reputation of k , or αik(m).

Action of j , or αjk(m)

Action is considered aseither Good or Bad

Social strategy

• Cooperation vs. good agent and defection vs. bad agent is good

• Change in reputation due to gossiping with neighbour k

∆Sij(k ,m) = αki (m)(2αik(m)− 1)(2αjk(m)− 1)

Page 18: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Individual strategy

C D

C + −

D − +

Action of j , or αji (m).

Action of j , or αij(m)

Action is considered aseither Good or Bad

Individual strategy

• +1 for ‘good’ actions, −1 for ‘bad’ actions to reputation

• We currently study WSLS-like: Consider CC and DD as good.

∆Iij(m) = (2αij(m)− 1)(2αji (m)− 1)

Page 19: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Reputation dynamics

Combine individual & social strategies

Combine with social influence parameter 0 ≤ λ ≤ 1

∆Rij(m) = (1− λ)

Individual strategy︷ ︸︸ ︷

(2αij(m)− 1)(2αji (m)− 1)+

λ1

n − 2

k 6=i ,j

αki (m)(2αik(m)− 1)(2αjk(m)− 1)

︸ ︷︷ ︸

Social strategy

Reputation dynamics

Rij(m + 1) = Rij(m) + ∆Rij(m)

Page 20: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Cooperative fixed points

Fixed point

• For which networks do we have αij(m + 1) = αij(m)?

• Good reputation remains good, bad reputation remains bad

Undirected case

• If αij(m) = αji (m), fixed points are groups

• Cooperate within groups, defect between groups

• Implies it is (weakly) social balanced

• Can have q groups if

λ >q

q + 1

More social influence may lead to more fragmented cooperation.

Page 21: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Evolutionary dynamics

Four different regimes (Cooperate with prob p on first round)

p < 1/2 p > 1/2

λ < 1/2 Individualistic prejudiced

• Defect vs. cooperators

• Cycles of cooperation vs.defectors

Individualistic trusting

• Cooperate vs.cooperators

• Cycles of cooperationvs. defectors

λ > 1/2 Social prejudiced

• Cooperate vs. cooperators(except second round)

• Defect vs. defectors(except second round)

Social trusting

• Cooperate vs.cooperators

• Defect vs. defectors

Page 22: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Phase portraits Individual

C D

Gossipers

Individual PrejudicedC D

Gossipers

Individual Trusting

In ‘friendly’ environment, being individually prejudiced pays off.

Page 23: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Phase portraits Social

C D

Gossipers

Social PrejudicedC D

Gossipers

Social Trusting

In ‘hostile’ environment, being socially trusting pays off.

Page 24: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Conclusions

Proposed model

• Proposed model for gossiping and reputation dynamics

• Interesting possible cooperative network structure

• Evolutionary stable for some parameter range

• More socially oriented strategy could have developed fromindividual strategy

Shortcomings

• Actual convergence to fixed point not investigated

• Characterize directed fixed points

• Evolutionary dynamics investigated in limit of large n

• Interact all-to-all unrealistic, e.g. restrict to graph

• Gossip perhaps passed on further (cascades of gossip)

Page 25: Cooperation, Reputation & Gossiping

Introduction Indirect Reciprocity Gossip Results

Thank you for your attention.

Questions?