20131216 Stat Journal
-
Upload
medku -
Category
Technology
-
view
261 -
download
3
description
Transcript of 20131216 Stat Journal
![Page 1: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/1.jpg)
Topological network alignment
20131216Statistics journal
![Page 2: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/2.jpg)
![Page 3: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/3.jpg)
Result
G H
G(V, E) H(U, F)
EC = 0.089
![Page 4: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/4.jpg)
Motivation
HumanYeast
Are two networks the same or similar?
large-scale networks such as interactome
![Page 5: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/5.jpg)
Theoretical background
Network or GraphCollection of nodes (vertex) and connections between them (edges).Biology, social communication, and web pages
![Page 6: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/6.jpg)
Theoretical background
Graph G and HNode set V and U (V U)Edge set E V*V and F U*UPossible graphs: for G
G H
G(V, E) H(U, F)
![Page 7: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/7.jpg)
Theoretical background
Graph comparisonSubgraph isomorphismIs G an exact subgraph of H?NP-completeEfficient algorithms are not known.
Graph alignmentFitting G into HEdge correctness (EC): the % of E aligned to FNP-hard
G H
G(V, E) H(U, F)
![Page 8: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/8.jpg)
Previous approaches
Local alignment : ambiguous, different pairingMapping are chosen independently for local regions of similarity.PathBLAST : homology informationNetworkBLAST : conserved protein clusters with likelihood methodMaWISh : evolution (sequence alignment)GRAEMLIN : dense conserved subgraph with phylogeny
Global alignmentProvide unique alignment from each node in smaller graph to exactly one node in larger graphISORANK : maximize overall matchGRAEMLIN : training from known graph alignments and phylogeny
![Page 9: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/9.jpg)
New approaches
Never use a priori informationSequence, Homology, Clusters, Phylogeny ,and Known alignments
Topological similarityOrbit, graphlet, and signature similarity
Of course, a priori information can be used.
そう、 GRAAL ならね
![Page 10: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/10.jpg)
n-node graphlet and automorphism orbits
![Page 11: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/11.jpg)
n-node graphlet and automorphism orbits
graphlet
orbit
Topologically relevant
Topologically relevant
Topologically relevant
![Page 12: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/12.jpg)
Graphlet Degree Vector
![Page 13: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/13.jpg)
Graphlet Degree Vector
![Page 14: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/14.jpg)
Graphlet Degree Vector
![Page 15: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/15.jpg)
Graphlet Degree Vector
![Page 16: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/16.jpg)
n-node graphlet and automorphism orbits
Orbit 15 in touches orbit 0, 1, 4, and 15 once.
![Page 17: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/17.jpg)
Signature similarityWeight vector
[0, 1] 1 means is not affected by any other orbit.
𝑜15=4 𝑜44=5
![Page 18: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/18.jpg)
Signature similarity
Node , denotes the i-th coordinates of its signature vector. The distance is the i-th orbits of nodes and is
The total distance between and is
The signature similarity is
S = 1 is that and are identical (D = 0).
![Page 19: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/19.jpg)
GRAph ALigner algorithm (GRAAL)
Compute costs of aligning each node with each node .
This matrix is row V and col U (all pairs of nodes).Align the densest parts (the minimal cost nodes, seed).Greedily alignment in the sphere.Repeat * while all nodes in the smaller graph will be aligned.
GRAAL uses only topological information.Biological information can be used by the equation
G H
G(V, E) H(U, F)
density topology
: degree of node
*
![Page 20: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/20.jpg)
GRAALSearch the densest part and align.
Search the minimal cost nodes pair (seed).If multi-minimal cost pairs, chosen randomly.
G(V, E) H(U, F)
![Page 21: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/21.jpg)
GRAALSearch the densest part and align.
Search the minimal cost nodes pair (seed).If multi-minimal cost pairs, chosen randomly.
Seed nodes pair
G(V, E) H(U, F)
![Page 22: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/22.jpg)
GRAALMake spheres and align.
Make sphere .Greedily align and with the minimal cost.
𝑢𝑣
G(V, E) H(U, F)
: length of the shortest path
![Page 23: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/23.jpg)
GRAALMake spheres and align.
Make sphere .Greedily align and with the minimal cost.
𝑢𝑣
G(V, E) H(U, F)
𝑆𝑮 (𝑣 ,𝑟 )
𝑆𝑯 (𝑢 ,𝑟 )
: length of the shortest path
![Page 24: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/24.jpg)
GRAALMake spheres and align.
Make sphere .Greedily align and with the minimal cost.
𝑢𝑣
G(V, E) H(U, F)
𝑆𝑮 (𝑣 ,𝑟 )
𝑆𝑯 (𝑢 ,𝑟 )
Align
: length of the shortest path
![Page 25: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/25.jpg)
GRAALExpand radii of spheres and align.
𝑢𝑣
: length of the shortest path
G(V, E) H(U, F)
𝑆𝑮 (𝑣 ,𝑟 )
𝑆𝑯 (𝑢 ,𝑟 )
Make sphere .Greedily align and with the minimal cost.
Aligned node
![Page 26: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/26.jpg)
GRAALExpand radii of spheres and align.
𝑢𝑣
: length of the shortest path
G(V, E) H(U, F)
𝑆𝑮 (𝑣 ,𝑟 )
𝑆𝑯 (𝑢 ,𝑟 )
Make sphere .Greedily align and with the minimal cost.
Aligned node
radii :
![Page 27: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/27.jpg)
GRAALExpand radii of spheres up to 3.
𝑢𝑣
: length of the shortest path
G(V, E) H(U, F)
𝑆𝑮 (𝑣 ,𝑟 )
𝑆𝑯 (𝑢 ,𝑟 )
Make sphere .Greedily align and with the minimal cost.
Aligned node
![Page 28: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/28.jpg)
GRAALExpand radii of spheres up to 3.
𝑢𝑣
: length of the shortest path
G(V, E) H(U, F)
𝑆𝑮 (𝑣 ,𝑟 )
𝑆𝑯 (𝑢 ,𝑟 )
Make sphere .Greedily align and with the minimal cost.
Aligned node
radii :
![Page 29: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/29.jpg)
GRAALExpand radii of spheres up to 3.
𝑢𝑣
: length of the shortest path
G(V, E) H(U, F)
𝑆𝑮 (𝑣 ,𝑟 )
𝑆𝑯 (𝑢 ,𝑟 )
Some nodes are not aligned.
Make sphere .Greedily align and with the minimal cost.
Aligned node
radii :
![Page 30: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/30.jpg)
GRAALRepeat with new edge networks .
𝑮𝑝 (𝑽 ,𝑬𝑝 )
The distance between and , Aligned node
𝑝 ≤2𝑯 𝑝 (𝑼 ,𝑭𝑝 )
𝑆𝑯 𝑝 (𝑢 ,𝑟 )
: length of the shortest path
𝑆𝑮𝑝 (𝑣 ,𝑟 )
𝑝 ≤2
𝑟=1
𝑟=1
![Page 31: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/31.jpg)
GRAALRepeat with new edge networks .
𝑮𝑝 (𝑽 ,𝑬𝑝 )
The distance between and , Aligned node
𝑝 ≤2
𝑆𝑮𝑝 (𝑣 ,𝑟 )𝑟=1
: length of the shortest path
edge()
edge
Path: 6 12 25 can be replaced by , which is analogous for insertion or deletion.
![Page 32: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/32.jpg)
GRAALRepeat with new edge networks .
𝑮𝑝 (𝑽 ,𝑬𝑝 )
The distance between and , Aligned node
𝑝 ≤2
New seed
𝑯 𝑝 (𝑼 ,𝑭𝑝 )
𝑆𝑯 𝑝 (𝑢 ,𝑟 )
: length of the shortest path
New seed
𝑆𝑮𝑝 (𝑣 ,𝑟 )
𝑝 ≤2
𝑟=1
𝑟=1
![Page 33: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/33.jpg)
GRAALRepeat with new edge networks .
𝑮𝑝 (𝑽 ,𝑬𝑝 )
The distance between and , Aligned node
𝑝 ≤2
New seed
𝑯 𝑝 (𝑼 ,𝑭𝑝 )
𝑆𝑯 𝑝 (𝑢 ,𝑟 )
: length of the shortest path
New seed
𝑆𝑮𝑝 (𝑣 ,𝑟 )
𝑝 ≤2
𝑟=1
𝑟=1
![Page 34: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/34.jpg)
GRAALNodes in G are aligned to exactly one node in H.
The distance between and , Aligned node
: length of the shortest path
G(V, E) H(U, F)
![Page 35: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/35.jpg)
Alignment scoreEdge correctness: the % of edges in G are aligned to edges in H.
Node correctness: the % of nodes in G are aligned to nodes in H.Correct mapping is needed.
Interaction correctness: the % of interactions that aligned correctly.Correct interaction is needed.
G H
G(V, E) H(U, F)
GRAAL function
The correct node mapping G to H𝑔 :𝑽→𝑼𝑓 :𝑽→𝑼
![Page 36: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/36.jpg)
Statistical significance
: a random mapping between nodes in G(V, E) and H(U, F).The probability P of successfully aligning k or more edges by chance is the tail of the hypergeometric distribution:
G H
G(V, E) H(U, F)
𝑛1=|𝑉|𝑃=∑𝑖=𝑘
𝑚 2 (𝑚2
𝑖 )(𝑝−𝑚2
𝑚1−𝑖 )( 𝑝𝑚1
)
The number of edges from G that are aligned to edges in H.
The number of node pairs in H.
Edge correctness
![Page 37: 20131216 Stat Journal](https://reader033.fdocuments.us/reader033/viewer/2022052310/554fbabeb4c9053d018b4714/html5/thumbnails/37.jpg)
Result
G H
G(V, E) H(U, F)
EC = 0.089