Statistical physics on random graphs and its applications...n 1 = 7/12 n 2 = 5/12 p 11 = p 22 = 0.39...

Statistical physics on random graphs and its applications

Lenka Zdeborová

Currently: CR2 IPhT, Saclay

2008-2010: director’s postdoctoral fellow Los Alamos National Laboratory

2005-2008: PhD LPTMS, University Paris-Sud Orsay, supervisor Marc Mezard

1999-2004: master degree in theoretical physics Charles University in Prague

Lenka’s four lines cv

Statistical Physics of Complex Systems

Systems: composed of a large number of interacting elements often disordered, living on random or real networks, out of equilibrium (e.g. matter, optimization problems, power grid, internet, market, neurons ...)

Goal: Understand and predict global properties based on local interactions (or vice versa).

Methods: Analytic, rigorous, toy models, + simulations & algorithms

Lenka ZdeborováFlorent Krzakala (ESPCI)Aurelien Decelle (LPTMS)

Detection of functional modules from network topology

Popular example62 bottlenose dolphins living in Doubtful Sound in New Zealand observed by Lusseau (2003).

Edge if the pair is seen together more often than expected by chance.

The group separated in two groups after one dolphin left the place.

The structure of the two groups can be predicted from the network topology (Arenas, Fernandez, Gomez’2008)

State of artHundreds of papers on the topic (Newman, Girvan’02, ...........)

Focus on communities: Nodes of the same kind tend to be together. Not useful in many cases, e.g. food-web, adjacency of words in text.Current methods are unable to tell that a random graph does not have any communities. E.g.: Ising model on random graphs of degree 3, in the best bisection only about 11.4% of edges between the two groups.

Missing measures of significance, estimate of probability of error.

Need for more fundamental and formal approach!

Block modelq groups, N nodes

proportion of nodes in group

probability that an edge present between node from group a and another from group b

na a = 1, . . . , q

Generate a random network as follows:

pab =cab

n1 = 7/12 n2 = 5/12

p11 = p22 = 0.39

p12 = p21 = 0.14

Block modelq groups, N nodes

proportion of nodes in group

probability that an edge present between node from group a and another from group b

na a = 1, . . . , q

Generate a random network as follows:

Question 1: Given what is the best possible guess for the original group assignment?

q, {na}, {pab}

Question 2: Given only the graph, what is the best guess for q, {na}, {pab}

pab =cab

Question 2: Given only the graph, what is the best guess for {na, pab}

P ({na, pab}|G) =P ({na, pab})

P (G)P (G|{na, pab})

=P ({na, pab})

P (G, {qi}|{na, pab})

=P ({na, pab})

P (G, {qi}|{na, pab}) =N!

pAijqiqj

(1! pqiqj )1!Aij

=P ({na, pab})

P (G, {qi}|{na, pab}) =N!

pAijqiqj

(1! pqiqj )1!Aij

Z({na, pab}) !!

P (G, {qi}|{na, pab})Maximize to learn{na, pab}

Equilibrium statistical physics of!H({qi}) =

log nqi +!

"Aij log pqiqj + (1!Aij) log (1! pqiqj )

log nqi +!

(ij)∈E

logpqiqj

1! pqiqj

NaNb log (1! pab)

Equilibrium statistical physics of

Partition function maximized if and only if:

(ij)∈E

!a,qi!b,qi

#= pabnanb

quenched energy = annealed energy

Nishimori condition

!H({qi}) =N!

log nqi +!

"Aij log pqiqj + (1!Aij) log (1! pqiqj )

log nqi +!

(ij)∈E

logpqiqj

1! pqiqj

NaNb log (1! pab)

Learning of parameters(1) Compute the averages:➡ With Monte Carlo (detailed balance)➡ With belief propagation (= Bethe-Peierls =

TAP equations = cavity method) faster

!i!jqi

Zi!jnqie

k#!i\j

cqkqi!k!iqk

hqi =1N

cqkqiψkqk pab =

TAP equations = cavity method) faster(2) Update parameters as

(3) Repeat till convergence.

(ij)∈E

!a,qi!b,qi

#= pabnanb

{na}, {pab}

Bayes optimal inference (in error correcting codes by Nishimori’93, Sourlas’94): (1) Compute marginals (local magnetizations)(2) For each node take the most probable value.

{na}, {pab}

Bayes optimal inference (in error correcting codes by Nishimori’93, Sourlas’94): (1) Compute marginals (local magnetizations)(2) For each node take the most probable value.

This overlap is maximized at the

right value of {na}, {pab}

0.1 0.15 0.2 0.25 0.3 0.35 0.4

Example I na =

1q, caa = cin, ca!=b = cout, cq = cin + (q ! 1)cout

q = 4, c = 16 ferromagnet = communitiesOve

0 0.2 0.4 0.6 0.8 1

ferro para

Example I na =

0 0.2 0.4 0.6 0.8 1

ferro para

Paramagnetic phase: Random graph was created (Achlioptas,

Coja-Oghlan’08). Zero overlap between an equilibrium configuration and the original one. Learning impossible.

Example I na =

0 0.2 0.4 0.6 0.8 1

ferro para

Ferromagnetic phase: Network contains information about modules, equilibration easy

(Nishimori line no RSB, no glass).

Example I na =

q = 4, c = 16 ferromagnet = communities

0 0.2 0.4 0.6 0.8 1

detectioneasy

detectionimpossible

|cin ! cout| " q#

cde Almeida-Thouless

condition

Example II anti-ferromagnet = coloring

q = 5, na =1q, caa = 0, ca!=b =

q ! 1,

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

12 13 14 15 16 17

easyhardimpo

leImpossible - random

graph created, paramagnetic phase

Easy - planted configuration attractive

beyond AT conditioncAT = 16

cK = 13.23

Values of phase transitions the same as in random graph coloring (Zdeborova, Krzakala’07)

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

12 13 14 15 16 17

easyhardimpo

Hard - equilibrium solution correlated with the planted configuration, but hidden in an 1RSB

Example II anti-ferromagnet = coloring

Values of phase transitions the same as in random graph coloring (Zdeborova, Krzakala’07)

ConclusionUsing basic properties of (planted) Potts and spin glass models gives us fundamental approach and new algorithms for module detection in networks.

Currently (with Mark Newman and Cris Moore): Using a little more realistic model (correcting for the observed degree distribution) for analysis of real networks.

Statistical physics on random graphs and its applications...n 1 = 7/12 n 2 = 5/12 p 11 = p 22 = 0.39...

Documents

Transcript of Statistical physics on random graphs and its applications...n 1 = 7/12 n 2 = 5/12 p 11 = p 22 = 0.39...

2.2 Optimal cost spanning trees€¦ · A complete graph with n nodes (n 1) has nn-2 spanning trees. Recall: A tree with n nodes has n – 1 edges. K 5 (n=5, m=10) has 125 spanning

C OT AMUNDR - NSW Electoral Commission · cocoparra n p livingstone nb p weddin mountains n rp minjary n p kosciuszko n p bendick murrelld n p nangar nl p brindabella n p yanununbeyan

M N O P Q R S N T OP Q N P P O U P R V W O P P Q X O P N P ...

Process and Thread Afﬁnity with OpenMP - NERSC · 2013. 10. 10. · Recommended process/thread afﬁnity 10 • Running(on(unpacked(nodes( #PBS"–l"mppwidth=48""""#2"nodes" aprun"–n"24"–N"12"–S"6"./a.out"

The Tumor (T), the Lymph Nodes (N), and the …...Regional lymph nodes (N) NX: Cancer in nearby lymph nodes cannot be measured. N0: There is no cancer in nearby lymph nodes. N1, N2,

nt e n . . p l. t ssic n p l ssic p n l. p p

Robinson-Schensted Algorithms for Skew Tableauxrstan/pubs/pubfiles/76.pdf · A Young tableau P of shape J/p is a labeling of the nodes of n/p with an ordered alphabet so that the

New P V V J N N N P N P N 0 P O N F G Í P P Q F G > O Q ¢ O R · 2020. 10. 2. · p n p n 0 p o n f g Í p p q f g > o q ¢ o r ... w n o 72(,& ,3 \ê ! 72()/ l%7 ! 72()/ ,73 !

Algorithms for graph visualization · Straight-line Drawing of SP-Graphs ss Q-Nodes (Induction base): s t ss S-Nodes (series composition) s t ( G2) ( G1) ss P-Nodes (parallel composition)

V P V U R gq ^ ý u;Vóÿ d u;S:Wßÿ ^ WS S:Wß0]0nÿ ) N …...N N N N N N N N N N N N N N N N N N N N N N N N N N N N N N N N N P N1 N1 N1 N1 N1 N1 N1 N1 N1 N1 N1 N1 P P P N1 N1

LISMORE - Australian Broadcasting Corporation · kings plains n p butterleaf n p capoompeta n ap bald rock n p maryland n fp boonoo boonoo n p basket swamp wn p barool n dp timbarra

CS P/N A042C868 P/N A042H290 SS P/N A042F943 TO PUMP ...

OSM Multi-data Source Config Using N-RAC Nodes - Updated

Selection Trees Winner trees. Loser Trees.. Winner Trees Complete binary tree with n external nodes and n - 1 internal nodes. External nodes represent.

arXiv:2005.00057v2 [cs.CV] 17 May 2020 · B-1 N-1 N0 N 2 N3 N 4 t N 1 Figure 3: The cell architecture for CP-NAS. One cell includes 2 input nodes, 4 intermediate nodes and 14 edges

More Codes Never Enough. 2 EVENODD Code Basics of EVENODD code each storage node as a single column # of data nodes k = p (prime) # of total nodes n.

ICN’s The n-D hypercube (n-cube) contains 2^n nodes (processors). The n-D hypercube (n-cube) contains 2^n nodes (processors). The nodes are neighbors in.

1a B.Pelc-Oważany Wtorek N P S N P S N P S N P S N P S Sp11 BP e wc 310 BP e wc … · 2020-01-22 · N P S N P S N P S N P S N P S 0 0:00- 0:00 1 7:30- 8:15 LK e_wc 305 LK e_wc

Graphs (networks) with golden spectral ratio · graph, the complete multipartite graph with n nodes and c colours, and the cocktail-party graph, respectively [1]. The path P n is

Cannon Combo D · Shell Layout P/N without P/N with P/N without P/N with P/N without P/N with P/N with P/N with size Boardlock Boardlock Boardlock Boardlock Boardlock Boardlock Cadmium