Methods and Applications for Detecting Structure in ...€¦ · structure, community structure, in...

Methods and Applications for Detecting Structure in Complex Networks

Elizabeth A. Leicht

Introduction

http://www-personal.umich.edu/~mejn/networks/attweb.gif

Yeast proteins: Maslov & Sneppen, Science 296, 910-913 (2002).

Web site: M. E. J. Newman & M. Girvan, Physical Review E 69, 026113 (2004).

7th Grade Students: adapted from Moreno, Who Shall Survive, 1934.

Visualizing a Network

Large Networks

Talk Outline

• Introduction to networks and their structure.

• Network structure detection method 1: linear-algebra and optimization techniques.

• Network structure detection method 2: probabilistic mixture models and the expectation-maximization algorithm.

• Conclusions.

Detecting Communities in Directed Networks

- In many networks, the vertices divide naturally into communities, also known as groups or modules.

- Previous methods for detecting communities in networks have mostly focused just on undirected networks.

- One established method for partitioning a network relies on identifying statistically surprising configurations of edges by measuring modularity (Q).

(fraction of edges within communities)- (expected fraction of such edges)

Calculating Modularity

Rewrite modularity using the adjacency matrix.

[Aij ! Pij ] !ci,cj

How do we determine the “expected” number of edges between two vertices?

1, if there is an edge from j to i

0, otherwise .

ci = the community to which i belongs.Pij = the expected number of edges from j to i.

kini koutj

Division of a Network into Two Communitiessj = !1si = +1

Aij !kini koutj

(sisj + 1)

!ij =1

2(sisj + 1)

Let Bij = Aij !

kini koutj

mbe an element of the modularity matrix.

Let v(1) be the eigenvectorcorresponding to the mostpositive eigenvalue of thesymmetric matrix B + BT

+1, if v(1)i

!1, if v(1)i

2msTBs =

B + BT"

Two Communities and More

- A network can be divided into more than two communities by repeated bisection.

- We can calculate the additional contribution to the modularity, ∆ Q, from the each additional division and halt the divisions

when no there is no additional increase in the modularity.

Communities with Edge Direction Bias

Hubs and Authorities Example

Edge Direction Bias in Real Networks

Minnesota

Indiana

Michigan

Iowa Wisconsin

Michigan State

Northwestern Ohio State

Pennsylvania State

Purdue

Illinois

Edge Direction Bias in Real Networks

Purdue IndianaIowa

Michigan

Wisconsin

Michigan State

Minnesota

Northwestern Ohio State

Pennsylvania State

Illinois

Exploratory Analysis of Structure in Networks

- Previously we identified communities in networks because we specifically sought a method to detect modules in networks.

- Reliance on specific measures of network structure where we are required to know the type of structure, for which we are looking, in advance can be limiting.

- We turn to probabilistic techniques and the Expectation Maximization (EM) Algorithm to identify general patterns of connection between vertices.

The Method

There are three types of quantities in this method of approach.

1. Observed data: the actual edges falling between pairs of vertices in a network ( ).

2. Missing data: we assume that the vertices divide into c groups. We denote the group to which vertex belongs as and set of all missing data as

3. Model parameters: they describe the patterns of vertices in different groups (θ, π).

the probability that there exists an edgefrom a vertex in group r to a vertex i

!r = the probability of a vertex belonging to group r

!r = 1

!ri = 1

The likelihood of the data given the model is

Frequently, one works not with the likelihood itself, but with the log-likelihood.

A Likelihood Problem

Pr(A|g,!, ") =!

gj,iand Pr(g|!, ") =

Pr(A,g|!, ") = Pr(A|g,!, ")Pr(g|!, "),

L = ln Pr(A,g|!, ") =!

ln!gj+

We cannot directly observe g.

It is, however, possible to calculate an expected value for the log-likelihood over all possible values of g.

Dealing with the “Missing” Data

ln!r +

Aij ln "ri

qjr = Pr(gj = r|A,!, ") =Pr(A,gj = r|!, ")

Pr(A|!, ")=

∏i "

· · ·

Pr(g|A, !,")

ln"gj+

Aij ln!gj,i

The EM Algorithm

- Initialize model parameters (θ, π) with random values.

- Find the probability a given vertex is a member of group (E-step).

- Maximize the model parameters (M-step)

- Iterate until convergence.

qir !ri =

!j Aijqjr

!j kjqjr

The AddHealth Network

Example: AddHealth

Example: A “Keystone” Network

“Big Ten” Results with the EM Algorithm

0 0.5 1

Vertex shading based on probability of being assigned to group 1.

Vertex size based on probability of being beaten by teams assigned to group 1.

Example: The “Big Ten” Conference

Illinois

Purdue

Indiana

WisconsinIowa

Pennsylvania StateMinnesota

Michigan StateNorthwestern

Ohio StateMichigan

Conclusions

- There are many existing methods for detecting structure in complex networks, but this work takes two new and interesting approaches.

- In this dissertation a method for detecting one specific type of structure, community structure, in directed networks is presented.

- Additionally, a method for detecting more general types of structure by turning to probabilistic mixture models and the Expectation-Maximization Algorithm is given.

- These two topics are not the only ones covered in my dissertation.

- The dissertation also introduces method for tackling the specific issue of identifying similar vertices.

- A method for probing the structure of time evolving citation networks was also developed.

Methods and Applications for Detecting Structure in ...€¦ · structure, community structure, in...

Documents

Transcript of Methods and Applications for Detecting Structure in ...€¦ · structure, community structure, in...

Detecting Kernel-level Rootkits using Data Structure Invariants Vinod Ganapathy Rutgers University.

ANNUAL FEE STRUCTURE (2020-21) NATIONAL CURRICULUM · ANNUAL FEE STRUCTURE (2020-21) NATIONAL CURRICULUM Notes for Parents 2. Rs. 40,000 Security Deposit will be charged additionally.

Detecting Changes in 3D Structure of a Scene from …...Detecting Changes in 3D Structure of a Scene from Multi-view Images Captured by a Vehicle-mounted Camera Ken Sakurada Takayuki

Detecting screen Slit Atomic Structure and the Periodic Table

DESIGNING A BASE PAY STRUCTURE - University of …wagon/WS_11.ppt · PPT file · Web view · 2000-04-16Designing A Base Pay Structure The ... Compensation Policy Guidelines Additionally

Machine learning methods for detecting structure in ... · This work introduces and demonstrates a number of statistical and machine learning approaches to network structure detection.

Detecting regime switches in the dependence structure of ... · Detecting regime switches in the dependence structure of high dimensional ﬁnancial data Jakob STÖBER and Claudia

Detecting Spoofed Packets-DISCEX · include both active and passive host-based methods as well as the more commonly discussed routing-based methods. Additionally, we present the results

DYNAMIC ANALYSIS OF SOIL STRUCTURE INTERACTION … · moment opposing building frames resisting on Soil Pile Structure (SPS) is additionally anticipated. In SSI modeling, for diminishing

Modeling Additive Structure and Detecting Interactions with Additive Groves of Regression Trees

Detecting type of Persuasion : Is there structure in ...ceur-ws.org/Vol-2048/paper10.pdf · Detecting type of Persuasion : Is there structure in persuasion tactics? Rahul R Iyer Carnegie

Understanding Principle Component Approach of Detecting Population Structure Jianzhong Ma PI: Chris…

Revit Structure 2010 Fundamentals - SDC Publications ... · Additionally, in a Revit/BIM environment when beams are attached to columns, and columns to grids, ... Revit Structure,

Detecting the Knowledge Structure of Bioinformatics with Text Mining and Citation Analysis

Detecting the correct graph structure in pose graph SLAM

Detecting Signs of Intrusion - SEI Digital Library · nizations improve the security of their networked computer systems. Module structure ... in network security. ... Detecting Signs

Detecting Diagonal Activity to Quantify Harmonic Structure Preservation With Cochlear Implant Mappings

Detecting deviations from metronomic timing in …Perception & Psychophysics 1999, 61 (3), 529-548 Detecting deviations from metronomic timing in music: Effects ofperceptual structure

Detecting and Locating GPS Jamming - orfe.princeton.edu · whole.4 Additionally, GPS technology has created both positive direct and indirect spillover: 3.3 million jobs that rely

ANALYZING SONG STRUCTURE WITH SPECTRAL CLUSTERING … · ANALYZING SONG STRUCTURE WITH SPECTRAL CLUSTERING ... form the building ... ward detecting and representing repetition structure,