Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social...

103
Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences Mitchell Centre for Network Analysis

Transcript of Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social...

Page 1: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social Network Analysis- Part I basics

Johan Koskinen

Workshop: Monday,  29 August 2011

The Social Statistics Discipline Area, School of Social Sciences

Mitchell Centre for Network Analysis

Page 2: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Statistics and networks?

Why statistics?

- Is the network a unique narrative?

- Numbers in lieu of ethnography?

Possible answers

- Detecting systematic tendencies

- Social mechanisms

- Why not?

Page 3: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Outline

Statistics for

networks

Types of

analysis

Networks

ERGM

Statistics & S.N.

SAOM

Non-

parameteric

Functional

dependencies Statistical

dependencies

Page 4: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Part 1

Social network data?

Page 5: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

marypaul

We conceive of a network as a Relation defined on a collection of individuals

relates to

“… go to for advice…”

Page 6: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

marypaul

We conceive of a network as a Relation defined on a collection of individuals

relates to

“… consider a friend…”

Page 7: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

marypaul

We conceive of a network as a Relation defined on a collection of individuals

relates to

on

off

Gen

eral

ly b

inar

y Tie present

Tie absent

Page 8: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

marypaul

We conceive of a network as a Graph: G(V,E), on

relates to

on

off

Gen

eral

ly b

inar

y Tie present

Tie absent

Individuals: V={1,2,…,n}

Relation: E {(i,j) : i,j V}

Page 9: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

We conceive of a network as a Graph: G(V,E), on

on

off

Gen

eral

ly b

inar

y Tie present

Tie absent

Individuals: V={1,2,…,n}

Relation: E {(i,j) : i,j V} i(i , j)

j

Page 10: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

We conceive of a network as a Graph: G(V,E), on

Individuals: V={i,j,k,l}

Relation: E ={(i,j),(i,k),(k,j),(l,j)} john pete

mary

paul

l

i

j

k

Page 11: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

We conceive of the Graph as a collection of

Tie variables: {Xij: i,j V}

i(i , j)

j

Page 12: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

We conceive of the Graph as a collection of

on

off

Gen

eral

ly b

inar

y xij = 1

Tie variables: {Xij: i,j V}

i(i , j)

j

xij = 0

Page 13: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

We conceive of the Graph as a collection of

Tie variables: {Xij: i,j V}

john pete

mary

paul

i - xij xik xil

j xji - xjk xjl

k xki xkj - xkl

l xli xlj xlk -

x =

i - 1 1 0

j 0 - 0 0

k 0 1 - 0

l 0 1 0 -

=

Page 14: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

We conceive of the Graph as a collection of

Tie variables: {Xij: i,j V}

i - xij xik xil

j xji - xjk xjl

k xki xkj - xkl

l xli xlj xlk -

x =

i - 1 1 0

j 0 - 0 0

k 0 1 - 0

l 0 1 0 -

=

l

i

j

k

Page 15: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

The Adjacency matrix:

The matrix of the collection Tie var. {Xij: i,j V}

i - xij xik xil

j xji - xjk xjl

k xki xkj - xkl

l xli xlj xlk -

x =

Page 16: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

The Adjacency matrix:

The matrix of the collection Tie var. {Xij: i,j V}

i - xij xik xil

j xji - xjk xjl

k xki xkj - xkl

l xli xlj xlk -

x =

out-degree

Page 17: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

The Adjacency matrix:

The matrix of the collection Tie var. {Xij: i,j V}

i - xij xik xil

j xji - xjk xjl

k xki xkj - xkl

l xli xlj xlk -

x =

In-degree

Page 18: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

The Adjacency matrix:

The matrix of the collection Tie var. {Xij: i,j V}

i - xij xik xil

j xji - xjk xjl

k xki xkj - xkl

l xli xlj xlk -

x =

In-degree

Page 19: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks – example in R

Let’s create an Adjacency matrix:

- xij xik xil

xji - xjk xjl

xki xkj - xkl

xli xlj xlk -

x =

x <- matrix(rbinom(100,1,.4),10,10)

number of nodes

density (#arcs/#possible arcs)

number of cells

Page 20: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks – example in R

Let’s create an Adjacency matrix:

- xij xik xil

xji - xjk xjl

xki xkj - xkl

xli xlj xlk -

x =

x <- matrix(rbinom(100,1,.4),10,10)diag(x) <- 0

number of nodes

density (#arcs/#possible arcs)

number of cells

No diagonal (self-nominations)

Page 21: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks – example in R

Let’s create an Adjacency matrix:

x <- matrix(rbinom(100,1,.4),10,10)diag(x) <- 0x Print matrix to screen

Page 22: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks – example in R

Let’s create an Adjacency matrix:

x <- matrix(rbinom(100,1,.4),10,10)diag(x) <- 0xsum(x[3,])

Print matrix to screenTo sum third row

Page 23: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks – example in R

Let’s create an Adjacency matrix:

x <- matrix(rbinom(100,1,.4),10,10)diag(x) <- 0xsum(x[3,])

Print matrix to screenTo sum third row

Page 24: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks – example in R

Let’s create an Adjacency matrix:

x <- matrix(rbinom(100,1,.4),10,10)diag(x) <- 0xsum(x[3,])rowSums(x)

To sum all rows

Page 25: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks – example in R

Let’s create an Adjacency matrix:

x <- matrix(rbinom(100,1,.4),10,10)diag(x) <- 0xsum(x[3,])rowSums(x)

To sum all rows

Page 26: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks – example in R

Let’s create an Adjacency matrix:

x <- matrix(rbinom(100,1,.4),10,10)diag(x) <- 0xsum(x[3,])rowSums(x)

To sum all rows

Out-degree distribution

Page 27: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks – example in R

Let’s create an Adjacency matrix:

x <- matrix(rbinom(100,1,.4),10,10)diag(x) <- 0xsum(x[3,])rowSums(x)colSums(x) To sum all columns

in-degree distribution

Page 28: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

To draw the Graph

Tie variables: {Xij: i,j V}

i(i , j)

j

Page 29: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

To draw the Graph

Tie variables: {Xij: i,j V}

i(i , j)

j

?

Page 30: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

To draw the Graph

load package “network”

x <- matrix(rbinom(100,1,.4),10,10)diag(x) <- 0xsum(x[3,])rowSums(x)colSums(x)library('network')

Page 31: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

To draw the Graph

load package “network”

x <- matrix(rbinom(100,1,.4),10,10)diag(x) <- 0xsum(x[3,])rowSums(x)colSums(x)library('network')myGraph <- as.network(x)

Transform the adjacency matrix

to a “network object”

Page 32: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Social networks

To draw the Graph

load package “network”

x <- matrix(rbinom(100,1,.4),10,10)diag(x) <- 0xsum(x[3,])rowSums(x)colSums(x)library('network')myGraph <- as.network(x)plot(myGraph)

plot the new “network object”

Page 33: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Part 2

Modes of analysis of Social network data?

Page 34: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Modes of Analysis SNA

Graphical Descriptive Statistical

Page 35: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Modes of Analysis SNA

Graphical

john

pete

mary

paul

A social network of tertiary students – Kalish (2003)

Page 36: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Modes of Analysis SNA

Graphical

john

pete

mary

paul

Yellow: Jewish Blue: Arab

A social network of tertiary students – Kalish (2003)

Page 37: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Modes of Analysis SNA

Descriptive

john

pete

mary

paul

Centrality index Density

arab jew

arab medium low

jew high

Page 38: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Modes of Analysis SNA

Statistical

“nonparametric”

john

pete

mary

paul

Centrality index Density

arab jew

arab medium low

jew high

Differences in centrality may be explained by chance

Differences in densities unlikely if classes assumed “equal”

Page 39: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Modes of Analysis SNA

Statistical

model based

john

pete

mary

paul

Bernoulli

arab jew

arab medium low

jew high

Ties are distributed independently with parameter

The network may be described by an

- a priori BBM

- social selection ERGM with separate effects for clustering and homophily on race

64ˆ p

Page 40: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Part 2

Background: statistical analysis

Page 41: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Statistical analysis – why statistics?

Why statistics?

Statistics assessing whether observed

measured quantities are ”big”

reject chance or not

six in 50 out of 51: balanced dice?

Networks not as easy

- Good model for chance in SNA?

- Model to capture systematic patterns (the typical)

Page 42: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

“I give advice…”Consider testing:

Page 43: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

“… to people I consider my friends”Consider testing:

Page 44: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Consider testing:

“advice”

“friendship”

Association?

Page 45: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Consider testing:

“advice”

“friendship”

Correlate advice x with friendship y?

ij

ij

Page 46: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Consider testing:

“advice”

“friendship”

Correlate advice x with friendship y?

ij

ij

Page 47: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Consider testing:

“advice”

“friendship”

Correlate advice x with friendship y?

ij

ij

Page 48: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Consider testing:

“advice”“friendship”

Correlate advice x with friendship y?

ij ij

Page 49: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Consider testing:

“advice”“friendship”

Correlate advice x with friendship y?

ij ij

Page 50: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Consider testing:

“advice”“friendship”

Correlate advice x with friendship y?

ij ij

- xij xik xil

xji - xjk xjl

xki xkj - xkl

xli xlj xlk -

- yij yik yil

yji - yjk yjl

yki ykj - ykl

yli ylj ylk -

Page 51: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Consider testing:

“advice”“friendship”

Correlate advice x with friendship y?

ij ij

- xij xik xil

xji - xjk xjl

xki xkj - xkl

xli xlj xlk -

- yij yik yil

yji - yjk yjl

yki ykj - ykl

yli ylj ylk -

Page 52: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Consider testing:

“advice”“friendship”

Correlate advice x with friendship y?

ij ij

- xij xik xil

xji - xjk xjl

xki xkj - xkl

xli xlj xlk -

- yij yik yil

yji - yjk yjl

yki ykj - ykl

yli ylj ylk -

Page 53: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Consider testing:

“advice”“friendship”

Correlate advice x with friendship y?

ij ij

- xij xik xil

xji - xjk xjl

xki xkj - xkl

xli xlj xlk -

- yij yik yil

yji - yjk yjl

yki ykj - ykl

yli ylj ylk -

Page 54: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Consider testing:

“advice”

“friendship”

Correlate advice x with friendship y?

ij

ij

Here we get r = 0.21 Large?

Page 55: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Using standard statistical techniquesIs r = 0.21 big?

21 2

nr

rt

Standard* statistical approach:

Reject H0 (no correlation) if

*though careless

is greater than 2

Here t = 6.44 (df 868)

2-sided p-value: 2x10-10

Page 56: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Does this – p-value of 2x10-10 – mean that

“advice”“friendship”

advice x and friendship y? are truly associated?

ij ij

“I give advice…… to my friends”

Page 57: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Does this – p-value of 2x10-10 – mean that

“advice”“friendship”

advice x and friendship y? are truly associated?

ij ij

“I give advice…… to my friends”N O !

Page 58: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Here I generated

“advice”“friendship”

friendship y independently of advice x

ij ij

Page 59: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Friendship and advice ties are independent but

There may be dependence on actors

Some people:I give advice to everyone

Page 60: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Friendship and advice ties are independent but

There may be dependence on actors

Some people:

…and everyone is my friend

Page 61: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Friendship and advice ties are independent but

There may be dependence on actors

other people:I don’t really give advice

…and no one is my

friend

Page 62: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Sends ties to 0% others

- xij xik xil

xji - xjl

xki xkj - xkl

xli xlj xlk -

- yij yik yil

yji - yjl

yki ykj - ykl

yli ylj ylk -

Page 63: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Sends ties to 70% others

- xij xik xil

xji - xjl

xki xkj - xkl

xli xlj xlk -

- yij yik yil

yji - yjl

yki ykj - ykl

yli ylj ylk -

Page 64: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

- xij xik xil

xji - xjl

xki xkj - xkl

xli xlj xlk -

- yij yik yil

yji - yjl

yki ykj - ykl

yli ylj ylk -

inbetween

70%

0%

Page 65: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

Mostly ones

- xij xik xil

xji - xjl

xki xkj - xkl

xli xlj xlk -

- yij yik yil

yji - yjl

yki ykj - ykl

yli ylj ylk -

inbetween

70%

0%Mostly zeros

Page 66: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

xij

xik

xil

xji

xkl

xli

xlj

xlk

inbetween

70%

0%yij

yik

yil

yji

ykl

yli

ylj

ylk

Page 67: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

0

0

0

xji

xkl

1

1

1

inbetween

70%

0%0

0

0

yji

ykl

1

0

1

Page 68: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Can’t we simply do t-tests?

0

0

0

xji

xkl

1

1

1

inbetween

70%

0%0

0

0

yji

ykl

1

0

1correlations assuming no association

0.21

Page 69: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

History: non-parametric approaches

From late 1930s

“the first generation of research dealt with the distribution of various network statistics, under a variety of null models”

(Wasserman and Pattison, 1996)

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Observed network

Summary measure(e.g. centralization)

Distribution of measure under null distribution

Page 70: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

friendship

Non-parametric: 2 relations

Conformity of 2 sociometric measures (Katz and Powell, 1953)

If no association between A and B, for each pair:

B: advice networkA: friendship network

marypaul

heads

tails

paul

friendship

maryadvice

friendship

marypaul

concordant

discordant

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Distribution of #concordant under null distribution

obs #concordant

# pairs: or

Page 71: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Non-parametric: conditional uniform null distributions

Different null distributions for directed graphsX XX X

XX X

X XX X

XX X X

X++

X1+

Xi+

Xn+

X+1 X+i X+n

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Compare?

Page 72: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Non-parametric: conditional uniform null distributions

Different null distributions for directed graphs

Condition on expected density U |E(X++) : Bernoulli

X XX X

XX X

X XX X

XX X X

E(X++)

X1+

Xi+

Xn+

X+1 X+i X+n

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Permute|cond.

Page 73: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Non-parametric: conditional uniform null distributions

Different null distributions for directed graphs

Condition on density: U |X++

Condition on expected density U |E(X++) : Bernoulli

X XX X

XX X

X XX X

XX X X

X++

X1+

Xi+

Xn+

X+1 X+i X+n

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Permute|cond.

Page 74: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Non-parametric: conditional uniform null distributions

Different null distributions for directed graphs

Condition on density: U |X++

Condition on expected density U |E(X++) : Bernoulli

Condition on activity/out-degrees: U |X•+

X XX X

XX X

X XX X

XX X X

X++

X1+

Xi+

Xn+

X+1 X+i X+n

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Permute|cond.

Page 75: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Non-parametric: conditional uniform null distributions

Different null distributions for directed graphs

Condition on density: U |X++

Condition on expected density U |E(X++) : Bernoulli

Condition on activity/out-degrees: U |X•+

X XX X

XX X

X XX X

XX X X

X++

X1+

Xi+

Xn+

X+1 X+i X+n

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Permute|cond.

Page 76: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Non-parametric: conditional uniform null distributions

Different null distributions for directed graphs

Condition on density: U |X++

Condition on expected density U |E(X++) : Bernoulli

Condition on activity/out-degrees: U |X•+

X XX X

XX X

X XX X

XX X X

X++

X1+

Xi+

Xn+

X+1 X+i X+n

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Permute|cond.

Page 77: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Non-parametric: conditional uniform null distributions

Different null distributions for directed graphs

Condition on density: U |X++

Condition on expected density U |E(X++) : Bernoulli

Condition on activity/out-degrees: U |X•+

X XX X

XX X

X XX X

XX X X

X++

X1+

Xi+

Xn+

X+1 X+i X+n

Condition on popularity/in-degrees: U |X+•

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Permute|cond.

Page 78: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Non-parametric: conditional uniform null distributions

Different null distributions for directed graphs

Condition on density: U |X++

Condition on expected density U |E(X++) : Bernoulli

Condition on activity/out-degrees: U |X•+

X XX X

XX X

X XX X

XX X X

X++

X1+

Xi+

Xn+

X+1 X+i X+n

Condition on popularity/in-degrees: U |X+•

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Permute|cond.

Page 79: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Non-parametric: conditional uniform null distributions

Different null distributions for directed graphs

Condition on density: U |X++

Condition on expected density U |E(X++) : Bernoulli

Condition on activity/out-degrees: U |X•+

XX X

XX X X

XX X

X XX X X

X++

X1+

Xi+

Xn+

X+1 X+i X+n

Condition on popularity/in-degrees: U |X+•

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Permute|cond.

Page 80: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Non-parametric: conditional uniform null distributions

Different null distributions for directed graphs

Condition on density: U |X++

Condition on expected density U |E(X++) : Bernoulli

Condition on activity/out-degrees: U |X•+

X XX X

XX X

X XX X

XX X X

X++

X1+

Xi+

Xn+

X+1 X+i X+n

Condition on popularity/in-degrees: U |X+•

Condition on both in-degrees and out-degrees : U |X+•,X•+

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Permute|cond.

Page 81: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Non-parametric: conditional uniform null distributions

Different null distributions for directed graphs

Condition on density: U |X++

Condition on expected density U |E(X++) : Bernoulli

Condition on activity/out-degrees: U |X•+

X XX X

XX X

X XX X

XX X X

X++

X1+

Xi+

Xn+

X+1 X+i X+n

Condition on popularity/in-degrees: U |X+•

Condition on both in-degrees and out-degrees : U |X+•,X•+

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Permute|cond.

Page 82: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Non-parametric: conditional uniform null distributions

Different null distributions for directed graphs

Condition on density: U |X++

Condition on expected density U |E(X++) : Bernoulli

Condition on activity/out-degrees: U |X•+

X XX X

XX X

X XX X

XX X X

X++

X1+

Xi+

Xn+

X+1 X+i X+n

Condition on popularity/in-degrees: U |X+•

Condition on both in-degrees and out-degrees : U |X+•,X•+

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Permute|cond.

Page 83: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Non-parametric: conditional uniform null distributions

Different null distributions for directed graphs

Condition on density: U |X++

Condition on expected density U |E(X++) : Bernoulli

Condition on activity/out-degrees: U |X•+

X XX XX X

X XX X X

X XX

X X X

X++

X1+

Xi+

Xn+

X+1 X+i X+n

Condition on popularity/in-degrees: U |X+•

Condition on both in-degrees and out-degrees : U |X+•,X•+

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Permute|cond.

Page 84: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Non-parametric: conditional uniform null distributions

Different null distributions for directed graphs

Condition on density: U |X++

Condition on expected density U |E(X++) : Bernoulli

Condition on activity/out-degrees: U |X•+

X XX XX X

X XX X X

X XX

X X X

X++

X1+

Xi+

Xn+

X+1 X+i X+n

Condition on popularity/in-degrees: U |X+•

Condition on both in-degrees and out-degrees : U |X+•,X•+

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Permute|cond.

Page 85: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Non-parametric: conditional uniform null distributions

Different null distributions for directed graphs

Condition on density: U |X++

Condition on expected density U |E(X++) : Bernoulli

Condition on activity/out-degrees: U |X•+

X XX X

XX X

X XX X

XX X X

X++

X1+

Xi+

Xn+

X+1 X+i X+n

Condition on popularity/in-degrees: U |X+•

Condition on both in-degrees and out-degrees : U |X+•,X•+

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Permute|cond.

For a systematic statistical approach to successive conditioning see Pattison et al., 2000

Page 86: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Non-parametric: conditional uniform null distributions

Condition on density: U |X++

Condition on expected density U |E(X++) : Bernoulli

Condition on activity/out-degrees: U |X•+

Condition on popularity/in-degrees: U |X+•

Condition on both in-degrees and out-degrees : U |X+•,X•+

library(help=sna)# e.g.: rgnm

Try and identify these distributions in ‘sna’:

Page 87: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Investigating the triad census conditional on the dyad census (Holland & Leinhardt 1970)

Different directed triangles

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Distribution of #030T given observed MAN

Types of dyads:

M (mutual):

A (asymetric):

N (null):

Observed network

U |MAN : uniform graphs with same MAN as observed

obs #030T

Page 88: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Investigating the triad census conditional on the dyad census

Interpretation

-5 0 5 10 15 20 250

0.05

0.1

0.15

0.2

Distribution of #030T given observed MAN

Given that we’ve accounted for different types of reciprocation

M A N

obs #030T

What triads occur more (less) freq. than chance?

Alt.: What triads occur more (less) freq. than what is explained by density and reciprocation alone?

Page 89: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Triad census in R

Load the data set coleman

that comes with the package “sna”

?colemandata(coleman) # loads data setcolenet <- as.network(coleman[1,,]) # create network objcolenet # check propertiesplot(colenet) # plotdyad.census(colenet)ObsTriad <- triad.census(colenet)

Page 90: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Triad census in R

Generate a null-distribution of NumReplics graphs

with the same MAN as colenet

NumReplics <- 500g<-rguman(NumReplics,73,mut=62,asym=119,null=2447,method = "exact")

TriadRes <- matrix(c(0),NumReplics,16)for (i in 1:NumReplics){

TriadRes[i,] <- triad.census(g[i,,])} Calculate the triad census for each

simulated graph

Page 91: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Triad census in R

Generate a null-distribution of NumReplics graphs

with the same MAN as colenet

plot the simulated triad census against the observed par( mfrow = c( 4, 4 ) ) for (k in 1:16) { hist(TriadRes[,k],xlim = c(min(ObsTriad[k],TriadRes[,k]),max(ObsTriad[k],TriadRes[,k] ) ) ,xlab=dimnames(ObsTriad)[[2]][k],main="") lines(ObsTriad[k],0,type="o", col="red") }

Page 92: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Quadratic Assignment Procedure (QAP) (Krackhardt, 1987)

B: advice networkA: friendship network

X XX X

XX X

X XX X

XX X X

X XX X

X X XX

XX

X X

Page 93: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Quadratic Assignment Procedure (QAP) (Krackhardt, 1987)

B: advice networkA: friendship network

X XX X

XX X

X XX X

XX X X

X XX X

X X XX

XX

X X

Page 94: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Quadratic Assignment Procedure (QAP) (Krackhardt, 1987)

B: advice networkA: friendship network

X XX X

X X XX

XX

X X

X

X

X X

X XX X

XX X

X XX X

XX X X

Page 95: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Quadratic Assignment Procedure (QAP) (Krackhardt, 1987)

B: advice networkA: friendship network

X XX X

X X XX

XX

X X

X

X

X X

XX X

X X

X XX

X X X

Page 96: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Quadratic Assignment Procedure (QAP) (Krackhardt, 1987)

B: advice networkA: friendship network

X XX X

X X XX

XX

X X

X

X

X X

XX X

X XX X

XX X X

Page 97: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Quadratic Assignment Procedure (QAP) (Krackhardt, 1987)

B: advice networkA: friendship network

X XX X

X X XX

XX

X X

X XX X

XX XX X

XX X X

X X

Page 98: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Quadratic Assignment Procedure (QAP) (Krackhardt, 1987)

B: advice networkA: friendship network

X XX X

X X XX

XX

X X

X XX X

XX XX X

XX X X

X X

How “unusual” is the observed number of concordant pairs compared to the permutation distribution?

Page 99: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

QAP in R

Load the data set coleman

that comes with the package “sna”

?colemandata(coleman) # loads data setq.12<-qaptest(coleman,gcor,g1=1,g2=2)# qap test summary(q.12)# summary of testplot(q.12)# plot of null distribution

Page 100: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Drawbacks of non model based statistical analysis

Weak (uninteresting) null hypotheses – what is it we are rejecting?

Test: Testing centralization using conditioning on density: U |X++

Interpretation: network more centralised than expected by chance, but also, network not generated by randomly distributing edges

Test: Testing association between relations using QAP

Interpretation: relations are not unrelated, but also, ties are more concordant than if identities of vertices did not matter (sic) john

pete

mary

john

pete mary

Page 101: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Drawbacks of non model based statistical analysis

We have no model for what we are interested in – are “significant” effects artifacts of other effects ?

Test: Testing structural effects using U |MAN

Limit in interpretation: what if we are interested in both reciprocity and triangulation?

Page 102: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Models for networks

• Models allow us to model features of the data that we are interested in

• If we are able to fit a model we (may) have adequately described the data (c.p. only holds true for non-parametric analysis when null hypothesis not rejected)

• Common critique: (a) only one observation(b) not inferring to population(c) where does “chance” come from?

• “chance” = “uncertainty”; possible process rather than sample (c.p. time series analysis)

Page 103: Social Network Analysis - Part I basics Johan Koskinen Workshop: Monday, 29 August 2011 The Social Statistics Discipline Area, School of Social Sciences.

Models for networks

• Stochastic block models (e.g. Nowicki and Snijders, 2001)

• Latent class/ clustering models (e.g. Schweinberger, and Snijders, 2003; Handcock et al., 2007)

• Regressing variables on networks and covariates- the influence model (Robins et al., 2001)- the network effects and network autocorrelation models (Marsden and Friedkin, 1994)

• Models for longitudinal social network data (e.g. Snijders et al., 2007)