UNCONVENTIONAL COMPUTING - Babeș-Bolyai University
Transcript of UNCONVENTIONAL COMPUTING - Babeș-Bolyai University
UNCONVENTIONAL
COMPUTING
Laura Dioşan Lecture 4
MIR
PR
Complex networks
Content
Basic concepts
Typology
Applications
MIR
PR
What is a complex network? A set of connected elements
A network (a graph) G=(N,M) is composed by a set of nodes N={n1,n2,…,nN} and
a set of links (edges) L= {l1,l2,…,lM}
A graph = a mathematical (abstract) representation of a network
Each element is represented by
location (physics) Node (computer science) Actor/agent (sociology) Vertex (graph theory)
Interaction of two elements is represented by:
Contact (physics) Link (computer science) Relation (sociology) Edge (graph theory)
MIR
PR
What is a complex network?
Nodes and links can be provided by various contexts
MIR
PR
What is a complex network?
Nodes and links can be provided by various contexts
MIR
PR
What is a complex network?
Nodes and links can be provided by various contexts
MIR
PR
What is a complex network?
A complex network is a network with
Topological attributes (features) non-trivial
Link patterns that are neither purely regular nor purely random
MIR
PR
Basic concepts
Description of a network (with N nodes and M kinks) by using matrix:
The matrix of link weights
Adjacency matrix
Laplaciane matrix L = A – K
K – diagonal matrix with
ijii ak
MIR
PR
Basic concepts Shortest path between 2 nodes (dij)
Shortest path (= number of edges) Shortest path( = sum of edge’s weights)
Average path l
(arithmetic or harmonic) mean of all the shortest paths between any 2 nodes of network
Diameter (D)
The longest (number of edges) path from all shortest path (as value)
Component
Set of all nodes that can be reached starting from a given node
MIR
PR
Basic concepts
Degree of a node (ki)
Number of links of a node
Strength of a node (si)
Sum of link weights of a node si = ∑wij
Betweenness (bi) of a node or of a link
Number of shortest paths that use that node/link
The node of largest degree is the strongest one?
Which is the node of largest betweenness?
MIR
PR
Basic concepts
Network motifs
Components (sub-graphs) that appear more frequently that we expect (randomly)
Each motif can code specific information
MIR
PR
Basic concepts
Clustering coefficient C are my friends,
friends of my friends?
Based on the number of triangles from the network
Coefficient of a node = proportion of
# links that connect the neighbours of the node
#possible links among these neighbours
C – mean over the coefficients of all the network’s nodes
MIR
PR
Basic concepts
Global efficiency
Harmonic mean of optimal paths between all the nodes from the network
Local efficiency (of a node) clustering
coefficient
Value of the shortest path among the neighbours of the node (without node)
MIR
PR
Basic concepts
Graph spectrum
Set of proper values of adjacency/Laplace matrix
A graph with N nodes has N proper values and N proper vectors
Important for topological features
Diameter
# cycles
Propagation of information
Important for connectivity features
Spectral density
MIR
PR
Basic concepts
Community structure
Community in a graph
A sub-graph with high connected nodes (more connected than random)
if the nodes of the network can be easily grouped into (potentially overlapping) sets of nodes such that each set of nodes is densely connected internally
Evaluation
Modularity M
MIR
PR
Basic concepts
Degree distribution
(cumulative) distribution of degrees
Proportion of network’s nodes of degree (greater or ) equal to a threshold
Can be
exponential distribution (random nets)
Power-law (scale-free nets)
MIR
PR
Basic concepts
Clustering distribution
MIR
PR
Basic concepts
Nearest neighbour degree knn(k) and assortativity
knn(k) – measures the degree of neighbours
quality of assortativity
MIR
PR
Typology Link orientation
Without orientation Oriented
Type of links
Free (without weights/coefficients) Weighted
Node type
Simple Bipartite
Topology Static Evolutionary
Node dynamic
Without dynamic With dynamic
MIR
PR
Typology
Link orientation dynamic processes in the network information propagation,
synchronisation,
robustness
Without orientation (symmetric links)
Oriented (asymmetric links)
MIR
PR
Typology
Link type dynamic processes in the
network
information propagation,
synchronisation,
robustness
Without weights (homogenous)
Weighted (eterogenous)
MIR
PR
Typology
Node type
Simple
Bipartite networks with nodes of 2 or more
types and links between nodes of the same type
MIR
PR
Typology
Topology network does not appear
instantly
Static (as structure)
Evolutionary (as structure)
Possible questions:
Which are the rules that govern/guide the evolution?
Which are the effects (consequences) of applying these rules over the net’s topology?
MIR
PR
Typology
Node dynamic Can be influenced by matrix of connections
Influence of net’s topology over dynamic process Sincronisations
Stocastic processes
And vice-versa (influence of process over net’s topology)
Without dynamic
With dynamic -> nodes are (coupled) dynamic systems: Regular oscillators – ex. Foucault oscillator,
Excitable systems
Chaotic oscillators – ex. Cryptology
Bi-stable systems – ex. semiconductor memory
MIR
PR
Applications
Social networks
Properties
Small-world
power-low distribution of nodes very high connected nodes (hubs)
Large clustering coefficient (relative to random networks)
Mix of assortativity
Ex: most connected nodes like to be together
MIR
PR
Applications
Social networks
Vilfredo Pareto (1848 – 1923)
Describe the allocation of wealth among individuals of different countries
All of them have a power-law distribution p(X≥ x) ~ x-β
Pareto principle (80-20 rule):
80% of effects are generated by 20% of causes
Consequence of cumulative distribution of power-law
Applied in various domains (Economy, ... Sociology )
MIR
PR
Applications
Social networks George K. Zipf (1902 – 1950)
Frequency of words in English language follows a rank distribution of type power-law N(r) ~rγ
Contexts (different scales) Words
Syllable
Phonemes
Distribution changes when semantic looses
Frequency and degree are similar
MIR
PR
Applications
Social networks
George K. Zipf (1902 – 1950)
Other domains
Linguistics
Population distribution
Income rankings
Relative to Pareto’s law
The rth largest city has n inhabitants (Zipf)
r cities have n or more than n inhabitants (Pareto)
MIR
PR
Applications
Social networks
Rumor spreading
Ignorant-Spreader-Stifler model (Daley and Kendal model)
Every time moment a random spreader i is selected; I contacts one of his neighbours j
If j is an ignorant, j becomes spreader
If j is a spreader or a stifler, i becomes stifler
MIR
PR
Applications
Social networks
Rumor spreading / epidemics in a small-world
Ignorant-Spreader-Stifler model (Daley and Kendal model)
MIR
PR
Applications
Social networks
Rumor spreading / epidemics in a small-world
How to fight?
MIR
PR
Applications
Social networks
Communities and their aim
Zachary (karate) club
MIR
PR
Applications
Social networks Communities and their
aim
Modular structure of complex networks
Existence of communities is not reflected by degree distribution, clustering coefficient or assortativity
Communities are related to the node functionalities
Community detection = a problem of multiple solutions
Algorithms
Link elimination
Agglomerative methods
Modularity-based methods
Spectral methods
MIR
PR
Applications
Social networks Communities and their aim Community detection
Link elimination
Eliminating weak links until the net brooks
MIR
PR
Applications
Social networks Communities and their aim Community detection
Agglomerative methods (Bottom-up)
Initially, every node belong to a community
Similar nodes
MIR
PR
Applications
Social networks Communities and their aim Community detection
Modularity-based methods
Modularity = # links among groups – estimated # links among groups in a random network
MIR
PR
Applications
Social networks Communities and their aim Community detection
Spectral methods
Split the network into
2 (or more) components
Spectral analysis of Laplacian matrix (contains all topological information about network)
MIR
PR
Applications
Biological networks
Properties
Small-world
Dissortative mixing
Many connected node are not preferentially connected each other
Organised in sub-modules
Large modularity
Community structures
Pioneers
Watts & Strogatz
MIR
PR
Applications
Biological networks
Metabolic nets
Protein nets
Yeast
6000 proteins
3 links/protein
20 000 links
Human body
~100 000 links
Genetically nets
MIR
PR
Applications
Biological networks
Ribonucleic acid (RNA) nets
Transformation sequence – structure is degenerated
The same structure can be obtained by various (multiple) chains
MIR
PR
Applications Biological networks
Brain functional nets RMN funcțional
Measures how different parts of brain answer to external stimulus
Electroencephalogram Recording the electrical activity waves
Magnetoencephalogram a functional neuro-imaging technique for mapping brain activity
by recording magnetic fields produced by electrical currents occurring naturally in the brain, using very sensitive magnetometers.
MIR
PR
Applications
Biological nets
Functional nets of brain
Alzheimer
Mild Cognitive Impairment
Schizophrenia
Epilepsy
MIR
PR
Applications
Musical nets
Note’s nets
Melody’s nets
Artist’s nets
User’s nets
MIR
PR
Applications
Rețele muzicale
Nets of musical notes
Note = node
Link = proximity (similarity) of notes
Nets of note duration
Nets of notes
MIR
PR
Applications
Nets of musical notes
Zipf law
Context = a hierarchy of models (patterns) that appear in different tempos (harmonic progression, melody, tone, rithm, ...)
Note + duration
Note + note
Note’s nets assortative
Specific clustering distribution
MIR
PR
Applications
Nets of musical notes
Why?
Create music starting from net’s properties and using guided random processes
MIR
PR
Applications
Musical nets
Melody nets
Node = melody
Links = different relations
Ex. Belong to a playlist
MIR
PR
Applications
Rețele muzicale ale melodiilor
De ce?
Analiza structurii
Drumuri, module
Detectarea celei mai influente melodii
Clasificare (etichetare)
Proiectarea unor sisteme automate de recomandare eficiente
MIR
PR
Applications
Rețele muzicale de melodii
Evoluează în timp
MIR
PR
Applications
Rețele muzicale de melodii
Gusturi muzicale
Apartenența melodiilor la playlist-urile utilizatorilor
MIR
PR
Applications
Rețele muzicale
Rețele ale artiștilor
Nod = artist
Legătură = diferite relații
Similaritate
Colaborare
Afinitate
MIR
PR
Applications
Rețele muzicale
Rețele ale artiștilor
Nod = artist
Legătură = diferite relații
Similaritate
Colaborare
Afinitate
Au același număr de noduri
Au structuri diferite
MIR
PR
Applications
Rețele muzicale ale artiștilor
Detecția comunităților oferă informații despre rețea
Separarea rețelei pe baza modularității
În comunități
În sub-comunități
MIR
PR
Applications
Rețele muzicale ale artiștilor
Detecția comunităților oferă informații despre rețea
Separarea rețelei pe baza modularității
Se pot identifica hub-uri
MIR
PR
Applications
Rețele muzicale ale artiștilor
Detecția comunităților oferă informații despre rețea
Separarea rețelei pe baza modularității
Se pot identifica hub-uri
Se pot face cartografieri ale hub-urilor
MIR
PR
Applications
Rețele muzicale
Rețele ale utilizatorilor
Nod = utilizator (consumator)
Legătură = diferite relații
De ce?
Folosite în sistemele de recomandare
MIR
PR
Applications
Rețele muzicale ale utilizatorilor
De ce e importantă topologia rețelei?
MIR
PR
Applications
Rețele muzicale
O diversitate de a proiecta muzica într-o rețea
Care model de rețea este cel mai potrivit?