Convergent Dense Graph Sequences Jennifer Tour Chayes joint work with C. Borgs, L. Lovasz, V. Sos,...

Convergent Dense

Graph Sequences

Jennifer Tour Chayesjoint work with

C. Borgs, L. Lovasz, V. Sos, K. Vesztergombi

Convergent Dense

Graph Sequences

I: Metrics, Sampling & TestingII: Multi-way Cuts & Statistical Physics

Outline of I: Metrics, Sampling & Testing

Introduction: Motivation, Convergence and Testing

Subgraph Densities and Left Convergence Graph Metrics Convergence in Metric Szemeredi Lemma and Sampling Parameter Testing The Limit Object, Metric Convergence &

Testability

Introduction: Motivation Numerous examples of growing graph sequences

– e.g., Internet, WWW, social networks

Want a succinct but faithful representation for

Testing properties – e.g., clustering

Testing algorithms – e.g., for routing, search

Here we deal only with dense graphs

(also have results for bounded-degree graphs)

Introduction: Convergence

Given: Sequence Gn of graphs with |V(Gn)| ! 1

Questions: What is the “right” notion of convergence? Is there a useful metric s.t. Gn convergent

, Gn is Cauchy in the metric? What is the limit object?

Introduction: Testing

Given: a simple graph parameter f, i.e. a real-valued function on simple graphs, invariant under isomorphism

Question: Under what conditions is f testable, i.e. 8 > 0, 9 k < 1 such that 8 G with |V(G)| > k,

|f(G) – f(G[S])| <

with probability at least 1 – , where S ½ V(G) is a uniformly random sample of size k?

Preview

There is a reasonable notion of convergent graph sequences, which turns out to be equivalent to convergence in an appropriate metric, and is closely related to testability.

(Some of the) Main Theorems of Part I:f(Gn) converges 8 convergent graph sequence Gn

, f is continuous in the metric, f is testable, f can be extended to a continuous function on the limit object

Outline


Subgraph Densities and Left Convergence Graph Metrics Convergence in Metric Szemeredi Lemma and Sampling The Limit Object and Metric Convergence

Subgraph Densities

F, G simple graphs (For Part I, think of F as small and G as large)

Homomorphisms: adjacency preserving maps

Hom(F,G) = {: V(F) ! V(G) s.t. (E(F)) ½ E(G)}

Subgraph densities (1979 Erdos, Lovasz, Spencer):

t(F,G) = |V(G)||V(F)|

|Hom(F,G)|

E.g., t(K3,G) is the triangle density of G

Left Convergence

Our Definition: Gn is said to be (left) convergent if t(F, Gn) converges for all simple F.

Example: Let Gn = Gn,p. Then

t(F, Gn) ! p|E(F)| .

Outline



Outline

Graph Metrics Cut norm on matrices Distances between graphs with same

number of vertices Splitting vertices Cut metric between arbitrary graphs

Cut Norm on Matrices Definition (1999 Frieze, Kannan):

Let M be an n £ n matrix

The cut norm of M is given by

kMk = max | Mij |i 2 Sj 2 T

S,T

Distances Between Graphs on Same Number of Vertices

G, G0 weighted graphs on [n] with common vertex weights 1, …, n s.t. ii = different edge weights ij and ij

0

adjacency matrices AG with (AG)ij = i ijj

and AG0 with (AG0)ij = i ij0j

Define the distance between G and G0:

d(G, G0) = -2 k AG AG0k

=-2 max | i j (ij – ij0)|

S,T ½ [n] i 2 Sj 2 T

Distances between Graphs on Same Number of Vertices

Notice that the distance d(G, G0) is not invariant under isomorphisms of G and G0

So do an “integer overlay” and define

where the minimum goes over all relabelings G and G0

(G, G0) = min d(GI, GI0)

GI,GI0

E.g., G

=

G0 =

(G, G0) = 0

)

Different Numbers of Vertices

Question: What do we do if G and G0 have different numbers of vertices n n0?

Idea: Split each vertex of G into n0 new vertices vertex of G0 into n new vertices

Splitting Vertices

Given: Graph G on [n],

with weights i, ij

Split i into n0 pieces(i,1), … , (i,n0)

Split i into n0 pieces

i = u2n0 Xiu

ii1i2i3

21

3

Splitting Vertices (continued)

Given: Graph G on [n],

with weights i, ij

Replace edge ij bycomplete bipartite graphwith iu,jv = ij

) new graph G[X] on [nn0] Fact: t(F,G[X]) = t(F,G)

i

j

i1 i2 i3

j1 j2 j3

Cut Metric between Arbitrary Graphs

Given: G graph on [n], weights i, ij

G0 graph on [n0], weights 0u,

0uv

with ii = u 0

u

Define our cut metric

(G, G0) = min d(G[X], G0[X])X

where the minimum goes over all fractional overlays, i.e. all couplings (or joinings) X of

i and 0u, with Xiu ¸ 0, uXiu = i and iXiu =

0u.

Outline



Convergence in Metric

Definition: Let (Gn) be a sequence of simple graphs. We say that (Gn) is convergent in the

metric if (Gn) is a Cauchy sequence in .

Theorem 1 (Convergence in Metric): A sequence of simple graphs (Gn) is left convergent if and only if it is convergent in the metric .

I.e., subgraph densities of a sequence of graphs

converge , the sequence is Cauchy in the cut metric.

Outline


Subgraph Densities and Left Convergence Graph Metrics Convergence in Metric Szemeredi Lemma and Sampling Parameter Testing The Limit Object and Metric Convergence

Szemeredi Lemma Given:

simple (unweighted) graph G disjoint partition P = (V1, ... , Vq) of V(G)

Define the edge density between classes Vi and Vj:

ij = |Vi|1||Vj|

1 eG(Vi ,Vj)

Define the (weighted) “average graph” GP on V(G) with nodeweights x(G)= 1 edgeweights xy (G) = ij if x 2 Vi and y 2 Vj

Szemeredi Lemma (continued)

It turns out that Szemerdi’s Regularity Lemma, in the weak form proved by Frieze and Kannan, describes precisely the -distance between a simple graph G and its average over a sufficiently large partition.

Weak Regularity Lemma (1999 Frieze and Kannan): For all > 0 and all simple graphs G, there exists a partition P = (V1, ... , Vq) of V(G) into q · 41/2 classes such that

(G, GP) < .

Sampling

Main Technical Lemma: Let k be a positive integer, and let G, G0 be weighted graphs on at least k nodes with nodeweights one and edgeweights in [0,1]. If S is chosen uniformly from all subsets S ½ V of size k, then

|(G[S],G0[S]) (G,G0)| · 10 k1/4

with probability at least 1 – exp(k1/2/8).

related to 2003 Alon, Fernandez de la Vega, Kannan and Karpinski, but with different k dependence and different proofs

Sampling (continued)

Theorem 2 (Closeness of Sample): Let k be a positive integer, and let G be a simple graph on at least k nodes. If S is chosen uniformly from all subsets S ½ V of size k, then

(G,G[S]) · 10 (log2k)1/2

with probability at least 1 – exp(k2/2 log2k).

Sampling (continued)

Key Elements of the Proof of Theorem 2 Use an easy sampling argument to prove the

theorem for the special case in which V(G) can be decomposed into only a few large sets V1, …, Vq, with constant weights for an edge between any given pair Vi and Vj

Use the weak regularity lemma to approximate an arbitrary graph by the simple special case above

Use the previous (main technical) lemma to show that this approximation induces only a small error

“Proof” of Theorem 1 Recall Theorem 1: Convergence from the left

(i.e., subgraph convergence) , convergence in cut metric

Idea of Proof of Theorem 1 By the triangle inequality, Theorem 2 implies that two

graphs G and G0 (possibly with n n0) are close in metric only if their samples are close

But knowledge of all subgraph frequencies is more or less equivalent to knowledge of all sampling probabilities

t(F,G) ¼ Prob(G[S] = F)where S is uniform among all sets of size |V(F)|

“thus” (lots of work), convergence of subgraph frequencies , convergence in metric

Outline


Subgraph Densities and Left Convergence Graph Metrics Convergence in Metric Szemeredi Lemma and Sampling The Limit Object, Metric Convergence &

Testability

The Limit Object

Recall Theorem 1 (Convergence in Metric): A sequence of simple graphs (Gn) is left convergent if and only if it is

convergent in the metric .

) 9 limit object G1

Question: Is there a useful representation of this limit object?

Answer: Yes – the graphon.

Graphons

Definition: A function W: [0,1]2 ! R is called a graphon if W is measurable W(x,y) = W(y,x) kWk1 < 1

Example: Step functions G graph on n vertices WG(x,y) = Idxnedyne 2 E(G)

Graphons as Limit Objects

For |V(F)| = k, define

[0,1]k ij 2 E(F)t(F,W) = s dx1

… dxk W(xi, xj)

Theorem (2005 Lovasz and B. Szegedy): (Gn) is left convergent , 9 graphon W s.t. t(F, Gn) ! t(F,W).

Graphons and Metric Convergence Definition (Frieze and Kannan): The cut

norm of a graphon W is given by

kWk = sup | s W(x,y) |S,T½[0,1] S£T

Theorem 10:

(Gn) is left convergent , 9 graphon W and a relabeling of (Gn) s.t.

kW WGnk ! 0.

Summary of Part I

There is a reasonable notion of convergent graph sequences – convergence of subgraph densities, which turns out to be equivalent to convergence in an appropriate (and useful) metric – the cut metric, and is closely related to sampling and testability.

Summary of Part I

(Some of the) Main Theorems:f(Gn) converges 8 convergent graph sequence Gn

, f is continuous in the cut metric

, f is a testable graph parameter

, f can be extended to a continuous function on the limit object

Part II: Multi-way Cuts & Statistical Physics In Part I, we learned that a graph

sequence Gn can be probed “from the left” by studying the densities of subgraphs occurring in it

In Part II, we will learn that Gn can be probed “from the right” by studying generalized colorings (or multi-way cuts or statistical mechanical models) on it, giving us a dual notion of convergence

Outline of II: Multi-way Cuts & Statistical Physics

Homomorphisms into (Small) Weighted Graphs

Naïve Right Convergence and Ground State Energies

Incomplete Equivalences Microcanonical Ensemble and Right

Convergence Complete Equivalences


Recall that in Part I, we consideredHom(F,G) = {: V(F) ! V(G) s.t. (E(F)) ½ E(G)}with F small and simple, and G large

Now instead consider Hom(G,H) with G large andH small and weighted, with vertex weights i = i(H) and edge weights ij = ij(H), so that

:V(G)!V(H) x2V(G) xy2E(G)|Hom(G,H)| = (x)

(x)(y) This is a weighted count of the number of colorings

Example: Ising Magnet V(H) = {-1,+1} = eh, ’ = eJ’

|Hom(G,H)| = eEnergy()

:V(G)! {-1,+1}

with

Energy() = J xy h x xy2E(G) x2V(G)

Note that this is not the conventional normalization of the energy for a dense graph, so. |Hom(G,H)| exp[|V(G)|2 ].

Naïve Right Convergence Let r(G,H) = |V(G)|2 log |Hom(G,H)| Note that r(G,H) is not the free energy Our Definition:

Gn is said to be naïvely right convergent if r(Gn,H) converges for all soft-core weighted H, i.e. all H with

vertex weights i = i(H) > 0 and

edge weights ij = ij(H) > 0.

J = ( )0 1

01

1 1eH =

Ground State Energy

Let ij(H) = eJij Then the ground state energy of model

H on graph G isE(G,J) = |V(G)|-2 min J(x)(y)

:V(G)!V(H) xy2E(G)

Example: Maxcut Density

E(G,J) = |V(G)|-2 Maxcut (G)


Lemma: If ij(H) = eJij , then

r(G,H) = E(G,J) + O(|V(G)|-1 ) Again note that r(G,H) is not the free energy

– to leading order, it is just the ground state energy. The entropy has been wiped out by the V2 normalization.

So naïve right convergence of Gn is the same as convergence of the ground state energy for all soft-core models H on Gn.

Incomplete Equivalences

Metric Convergenc

e

f(Gn) is

Cauchy 8 testable f

Naïve Right Convergenc

e

Quotients Cauchy in

Hausdorff Metric

E(G,J) is Cauchy 8 J

Left Convergenc

ePart I Part I

Part II(previous Lemma)

Cut densities are testable

parameters

E(G,J) is continuous in

cut metricOR

Microcanonical Ensemble

Given q color classes with fraction aiincolor class i:

a = (a1, … , aq) with ai≥ 0 and ai= 1 Define the microcanonical homomorphism number:

:V(G)!V(H) x2V(G) xy2E(G)||1(i)|ai|V(G)||≤1

|Homa(G,H)| = (x)

(x)(y)and microcanonical ground state energy:

Ea (G,J) = |V(G)|-2 min J(x)(y) :V(G)!V(H) xy2E(G)||1(i)|-ai|V(G)||≤1

Examples Max/Min Bisection

J = ( )0 ±10±1

Densest Subgraph

J = ( )1 00 0

Right Convergence

Let ra(G,H) = |V(G)|2 log |Homa (G,H)|

Definition: Gn is said to be right convergent if ra(Gn,H) converges for all a and all soft-core weighted H, i.e. all H with

vertex weights i = i(H) > 0 and

edge weights ij = ij(H) > 0.

This is equivalent to convergence of all microcanonical ground state energies.

Complete Equivalences= Summary

Metric Convergenc

e

f(Gn) is

Cauchy 8 testable f

Right Convergenc

e

Quotients Cauchy in

Hausdorff Metric

Ea(G,J) is

Cauchy 8 J

Left Convergenc

ePart I Part I

Part II(previous Lemma)

Cut densities are testable

parameters

Ea (G,J) is continuous in

cut metricOR

Lots of Work

Lots of Work

Summary There is a reasonable notion of convergent graph

sequences – convergence of subgraph densities – which turns out to be equivalent to convergence in an appropriate (and useful) metric – the cut metric, and is closely related to sampling and testability

Convergence of subgraph densities is also equivalent to the dual notion of convergence of all microcanonical ground state energies of all soft-core models (and also to convergence of quotients, moding out by Szemeredi partitions, in the natural Hausdorff metric)

THE ENDTHE END

1234567890-=Qwertyuiop[]\Asdfghjkl;’Zxcvbnm,./ `!@#$%^&*()_+

QWERTYUIOP{}|ASDFGHJKL:”ZXCVBNM<>? ~

’

”

Convergent Dense Graph Sequences Jennifer Tour Chayes joint work with C. Borgs, L. Lovasz, V. Sos,...

Documents

Transcript of Convergent Dense Graph Sequences Jennifer Tour Chayes joint work with C. Borgs, L. Lovasz, V. Sos,...