Transcript of Topics in Algorithms and Data Science — Random Graphs (2nd part)
Topics in Algorithms and Data Science
Random Graphs (2nd part)
Omid Etesami
Phase transitions for CNF-SAT
Phase transitions for other random structures
• We already saw phase transitions for random graphs
• Other random structures, such as Boolean formulas in conjunctive normal form (CNF), also have phase transitions
Random k-CNF formula
• n variables
• m clauses
• k literals per clause (k constant)
• literal = variable or negation
• each clause chosen independently and uniformly at random from all possible clauses.
• Unsatisfiability is an increasing property, so it has a phase transition.
Satisfiability conjecture
• Conjecture. There is a constant r_k such that m = r_k n is a sharp threshold for satisfiability.
The conjecture was recently proved for large k by Ding, Sly, Sun!
Upper bound on r_k
• Let m = cn.
• A fixed truth assignment satisfies a random clause with probability 1 − 2^(−k), so it satisfies the whole CNF with probability (1 − 2^(−k))^(cn).
• By the union bound, the probability that the CNF is satisfiable is at most 2^n (1 − 2^(−k))^(cn).
• This tends to 0 when c > 2^k ln 2. Thus r_k ≤ 2^k ln 2.
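As a sanity check on this first-moment bound, a small brute-force experiment (a sketch; the parameters and helper names here are my own, and n is kept tiny so exhaustive checking is feasible) shows satisfiability collapsing as the clause density c grows past the threshold:

```python
import random
from itertools import product

def random_kcnf(n, m, k=3, rng=random):
    """m random k-clauses over n variables; a literal is (var, sign)."""
    clauses = []
    for _ in range(m):
        vars_ = rng.sample(range(n), k)          # k distinct variables
        clauses.append([(v, rng.choice([True, False])) for v in vars_])
    return clauses

def satisfiable(n, clauses):
    """Brute force over all 2^n assignments (only feasible for small n)."""
    for bits in product([True, False], repeat=n):
        if all(any(bits[v] == sign for v, sign in clause) for clause in clauses):
            return True
    return False

def sat_fraction(n, c, trials, rng):
    hits = sum(satisfiable(n, random_kcnf(n, int(c * n), rng=rng))
               for _ in range(trials))
    return hits / trials

rng = random.Random(0)
n = 10
low = sat_fraction(n, 2.0, 20, rng)   # density well below 2^3 ln 2 ≈ 5.55
high = sat_fraction(n, 8.0, 20, rng)  # density above the upper bound
print(low, high)
```

At density 2 almost every formula is satisfiable; at density 8 the expected number of satisfying assignments, 2^n (7/8)^(8n), is already far below 1.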
[Figure: 3-SAT solution space; height represents # of unsatisfied constraints]
Lower bound on rk
• Lower bound more difficult. 2nd moment method doesn’t work.
• We focus on k = 3.
• The Smallest Clause (SC) heuristic finds a satisfying assignment with high probability when m = cn for constant c < 2/3. Thus r_3 ≥ 2/3.
Smallest Clause (SC) heuristic
While not all clauses are satisfied:
    assign true to a random literal in a random smallest-length clause
    delete the clauses that become satisfied; delete the falsified literals from the remaining clauses
If a 0-length (empty) clause ever appears, the heuristic has failed.
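A direct implementation of the heuristic is short (a sketch; the integer-literal representation and function names are my own):

```python
import random

def sc_heuristic(clauses, rng=random):
    """Smallest Clause heuristic. Literals are nonzero ints: +v / -v.
    Returns a satisfying assignment dict, or None on failure (empty clause)."""
    clauses = [set(c) for c in clauses]
    assignment = {}
    while clauses:
        smallest = min(len(c) for c in clauses)
        if smallest == 0:
            return None                       # 0-length clause: failure
        clause = rng.choice([c for c in clauses if len(c) == smallest])
        lit = rng.choice(sorted(clause))      # random literal of a smallest clause
        assignment[abs(lit)] = lit > 0        # set it true
        # delete satisfied clauses; delete the falsified literal elsewhere
        clauses = [c - {-lit} for c in clauses if lit not in c]
    return assignment

def random_3cnf(n, m, rng=random):
    return [[v * rng.choice([1, -1]) for v in rng.sample(range(1, n + 1), 3)]
            for _ in range(m)]

rng = random.Random(1)
n, c = 200, 0.5                               # c < 2/3: SC succeeds w.h.p.
successes = 0
for _ in range(20):
    f = random_3cnf(n, int(c * n), rng)
    a = sc_heuristic(f, rng)
    if a is not None:
        # every original clause must contain a literal set to true
        assert all(any(a.get(abs(l)) == (l > 0) for l in cl) for cl in f)
        successes += 1
print(successes, "of 20 runs found a satisfying assignment")
```

Variables never touched by the heuristic can be set arbitrarily; every clause was deleted because one of its literals was already made true.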
Queue of 1-literal and 2-literal clauses
• While the queue is non-empty, the next clause to be satisfied comes from the queue (it has the smallest length).
• Setting a literal to true may add other clauses to the queue (clauses that shrink to length 2 or 1).
• We will show that while the queue is non-empty, the arrival rate is less than the departure rate.
Principle of deferred decisions
• We pretend that we do not know the literals appearing in each clause.
• During the algorithm, we only know the size of each clause.
Queue arrival rate
• When the t-th variable is assigned a value, each remaining 3-literal clause shrinks (and is added to the queue) with probability 3/(2(n − t + 1)).
• (With the same probability, the clause is instead satisfied and deleted.)
• Since each step satisfies at least one clause, at most cn − t + 1 clauses remain at step t. Therefore, the expected # of clauses added to the queue at each step is at most 3(cn − t + 1)/(2(n − t + 1)) ≤ 3c/2, which is 1 − Ω(1) for c < 2/3.
The waiting time is O(lg n)
Thm. The probability that any clause remains in the queue for Ω(lg n) steps is at most 1/n^3.
The probability that the queue becomes non-empty at step t and remains non-empty during steps t, t + 1, …, t + s − 1 is at most exp(−Ω(s)) by the multiplicative Chernoff bound: the # of arrivals would have to be at least s, while the mean # of arrivals is s(1 − Ω(1)).
(We upper-bound the # of arrivals by a sum of independent Bernoullis.)
There are only n choices for t. Therefore, for a suitable choice of s_0 = Θ(lg n), every non-empty episode has length at most s_0 with probability 1 − 1/n^3.
The probability that setting a literal in the i-th clause makes the j-th clause false is o(1/n^2)
If this trouble happens, then:
• either the i-th or the j-th clause is added to the queue at some step t,
• the j-th clause consists of 1 literal when the trouble happens,
• by the SC rule, the i-th clause also consists of 1 literal when its literal is assigned,
• with probability 1 − 1/n^3, the waiting time for both clauses is O(lg n).
If a_1, a_2, … is the sequence of literals that would be set to true (if clauses i and j didn't exist), then 4 of the literals in these two clauses are negations of literals among a_t, a_{t+1}, …, a_{t'} for t' = t + O(lg n).
This happens with probability O((ln^4 n)/n^4) times the # of choices for t.
Since there are O(n2) pairs of clauses, the algorithm fails with probability o(1) by union bound.
Nonuniform models of random graphs
Nonuniform models
• Fix a degree distribution: there are f(d) vertices of degree d
• Choose a random graph among all graphs with this degree distribution
• Edges are no longer independent
Degree distribution: vertex perspective vs. edge perspective
• Consider a graph where half of the vertices have degree 1 and half have degree 2
• A random vertex is equally likely to have degree 1 or 2
• A random endpoint of a random edge is twice as likely to have degree 2
• In many algorithms, we traverse a random edge to reach an endpoint: the probability of reaching a vertex of degree i is then proportional to i·λ_i, where λ_i is the fraction of vertices of degree i
Giant component in random graphs with given degree distribution
[Molloy, Reed] There is a giant component iff Σ_i i(i − 2)·λ_i > 0.
• Intuition: Consider BFS (a branching process) from a fixed vertex.
• After the first level, a vertex of degree i contributes exactly i − 1 children.
• The branching process has extinction probability < 1 iff the expected # of children satisfies E[i − 1] > 1, or in other words E[i − 2] > 0.
• In calculating this expectation, the probability of degree i is taken from the edge perspective (not the vertex perspective), so it is proportional to i·λ_i. This yields the condition Σ_i i(i − 2)·λ_i > 0.
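For the half-degree-1, half-degree-2 example above, the edge-perspective distribution and the Molloy–Reed sum can be computed directly (a small sketch; the function names are mine):

```python
def edge_perspective(lam):
    """lam: dict degree -> fraction of vertices. Size-biased distribution ~ i*lam_i."""
    z = sum(i * f for i, f in lam.items())
    return {i: i * f / z for i, f in lam.items()}

def molloy_reed_sum(lam):
    """A giant component exists iff this sum is > 0."""
    return sum(i * (i - 2) * f for i, f in lam.items())

lam = {1: 0.5, 2: 0.5}
ep = edge_perspective(lam)
print(ep)                    # degree 2 is twice as likely as degree 1
print(molloy_reed_sum(lam))  # negative: no giant component
```

Here the sum is 1·(−1)·0.5 + 2·0·0.5 = −0.5 < 0, so this degree distribution has no giant component.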
Example: G(n, p=1/n)
Poisson degree distribution
If the vertices have a Poisson degree distribution with mean d, then a random endpoint of a random edge has degree distribution 1 + Poisson(d).
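A quick simulation checks the size bias (a sketch; the parameters are arbitrary, and for simplicity the edge set is sampled as a fixed number of random pairs rather than true G(n, p)): in a sparse random graph the degrees are approximately Poisson(d), and the degree of a random endpoint of a random edge should average about 1 + d.

```python
import random
from collections import Counter

rng = random.Random(2)
n, d = 20000, 3.0
m = int(d * n / 2)                  # expected # of edges in G(n, d/n)
# duplicate pairs are rare at this density, so we ignore them
edges = [tuple(rng.sample(range(n), 2)) for _ in range(m)]
deg = Counter()
for u, v in edges:
    deg[u] += 1
    deg[v] += 1

mean_vertex_degree = sum(deg.values()) / n
endpoints = [deg[u] for e in edges for u in e]   # degrees from the edge perspective
mean_endpoint_degree = sum(endpoints) / len(endpoints)
print(mean_vertex_degree)    # ≈ d
print(mean_endpoint_degree)  # ≈ 1 + d
```

The edge-perspective mean is E[D^2]/E[D]; for Poisson(d) this is (d + d^2)/d = 1 + d.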
Growth model without preferential attachment
Growing graphs
• Vertices and edges are added over time.
• Preferential attachment = selecting endpoints for a new edge with probability proportional to degrees
• Without preferential attachment = selecting endpoints for a new edge uniformly at random from the set of existing vertices
Basic growth model without preferential attachment
• Start with zero vertices and zero edges
• At each time t, add a new vertex
• With probability δ, join two random vertices by an edge
The resulting graph may become a multigraph.
But since there are ~t^2/2 pairs of vertices and only O(t) existing edges at time t, a multi-edge or self-loop appears at each step with small probability, and we ignore these cases.
# vertices of each degree
Let d_k(t) be the expected # of vertices of degree k at time t.
Each endpoint of a new edge (which appears with probability δ) is uniform over the ~t existing vertices, so
d_0(t + 1) = d_0(t) + 1 − 2δ·d_0(t)/t and d_k(t + 1) = d_k(t) + 2δ·(d_{k−1}(t) − d_k(t))/t for k ≥ 1.
degree distribution
Let d_k(t) = p_k·t in the limit as t → ∞.
Solving the resulting equations gives p_k = (1/(1 + 2δ))·(2δ/(1 + 2δ))^k:
a geometric distribution which, like the Poisson distribution of Erdős–Rényi graphs,
falls off exponentially fast, unlike the power law of preferential attachment.
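A short calculation from the model (treat the exact constants here as my own working) gives the limiting fraction of degree-0 vertices as p_0 = 1/(1 + 2δ), and p_1 = p_0 · 2δ/(1 + 2δ). A simulation of the growth process agrees:

```python
import random
from collections import Counter

rng = random.Random(3)
delta, T = 0.4, 200000
deg = []                           # deg[i] = degree of vertex i
for t in range(T):
    deg.append(0)                  # new vertex arrives
    if t >= 1 and rng.random() < delta:
        u, v = rng.sample(range(len(deg)), 2)   # join two random vertices
        deg[u] += 1
        deg[v] += 1

counts = Counter(deg)
p0_theory = 1 / (1 + 2 * delta)
p1_theory = p0_theory * 2 * delta / (1 + 2 * delta)
p0_sim, p1_sim = counts[0] / T, counts[1] / T
print(p0_sim, p0_theory)
print(p1_sim, p1_theory)
```

With δ = 0.4 the predicted p_0 is 1/1.8 ≈ 0.556; the simulated fraction should land close to it.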
# components of each finite size
Let n_k(t) be the expected # of components of size k at time t.
• A randomly picked component has size k with probability proportional to n_k(t)
• A randomly picked vertex lies in a component of size k with probability k·n_k(t)/t
[Figure: components of size 4 and 2]
Recurrence relation for nk(t)
• We work with expectations rather than the actual # of components of each size (a simplification that needs justification).
• We ignore edges falling inside components since we are interested in small component sizes.
[Figure: an edge merging a component of j vertices with one of k − j vertices]
Recurrence relation for a_k = n_k(t)/t
Phase transition for non-finite components
Size of non-finite components below critical threshold
Summary of phase transition
Comparison with static random graph having degree distribution
• Could you explain why giant components appear for smaller δ in the grown model?
Why is δ = 1/4 the threshold for static model?
Growth model with preferential attachment
Description of the model
• Begin with empty graph
• At each time, add a new vertex
and with probability δ, attach the new vertex
to a vertex selected at random
with probability proportional to its degree
Obviously the graph has no cycles.
Degree of vertex i at time t
Let d_i(t) be the degree of vertex i at time t.
Vertex i gains a degree at time t with probability δ·d_i(t)/(2δt) = d_i(t)/(2t), since the total degree at time t is about 2δt.
Solving d/dt d_i(t) = d_i(t)/(2t) gives d_i(t) = a·t^(1/2).
Since d_i(i) = δ, we have d_i(t) = δ·(t/i)^(1/2).
Power-law degree distribution
Vertex number i = tδ^2/d^2 has degree d.
Therefore, the # of vertices of degree at least d is tδ^2/d^2, and the # of vertices of degree exactly d is about |d/dd (tδ^2/d^2)| = 2tδ^2/d^3.
In other words, the probability of degree d is 2δ^2/d^3: a power law.
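A simulation of the model shows the heavy tail (a sketch; implementation details such as the endpoint list, which realizes degree-proportional sampling, are mine): vertices born early accumulate far higher degree than vertices born late, consistent with d_i(t) = δ·(t/i)^(1/2).

```python
import random

rng = random.Random(4)
delta, T = 0.75, 50000
deg = []
endpoints = []                     # each vertex appears once per unit of degree
for t in range(T):
    deg.append(0)                  # new vertex arrives
    if t >= 1 and rng.random() < delta:
        # choose an old vertex with probability proportional to its degree
        # (uniformly among old vertices if no edges exist yet)
        old = rng.choice(endpoints) if endpoints else rng.randrange(len(deg) - 1)
        new = len(deg) - 1
        deg[new] += 1
        deg[old] += 1
        endpoints.extend([new, old])

early = sum(deg[100:200]) / 100      # cohort born early
late = sum(deg[40000:40100]) / 100   # cohort born late
print(early, late, max(deg))
```

The formula predicts an average degree of roughly δ·(T/150)^(1/2) ≈ 14 for the early cohort versus about δ for the late one, and a maximum degree of order δ·T^(1/2).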
Small world graphs
Milgram’s experiment
• Ask a person in Nebraska to send a letter to a person in Massachusetts, given only the target's address and occupation
• At each step, send the letter to someone you know on a "first name" basis who is closer to the target
• In successful experiments, it took 5 or 6 steps
• This is the origin of the phrase "six degrees of separation"
The Kleinberg model for random graphs
• n × n grid with local (grid) edges and global (long-distance) edges
• From each vertex u, there is a long-distance edge to a random vertex v
• Vertex v is chosen with probability proportional to d(u,v)^(−r), where d is the Manhattan distance
Normalization factor
• Let c_r(u) = Σ_{v ≠ u} d(u,v)^(−r) be the normalization factor.
• The # of nodes at distance k from u is at most 4k.
• The # of nodes at distance k from u is at least k for k ≤ n/2.
• We have c_r(u) = Θ(Σ_k k·k^(−r)), and therefore:
• c_r(u) = Θ(1) when r > 2.
• c_r(u) = Θ(lg n) when r = 2.
• c_r(u) = Ω(n^(2−r)) when r < 2.
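These growth rates are easy to check numerically (a sketch; the grid sizes are arbitrary, and we sum from the center vertex):

```python
def c_r(n, r):
    """Normalization sum of d(u,v)^(-r) from the center of an n x n grid."""
    cx = cy = n // 2
    total = 0.0
    for x in range(n):
        for y in range(n):
            dist = abs(x - cx) + abs(y - cy)   # Manhattan distance
            if dist > 0:
                total += dist ** (-r)
    return total

for n in (21, 81):
    print(n, c_r(n, 3), c_r(n, 2), c_r(n, 1))
# r = 3: roughly constant; r = 2: grows like lg n; r = 1: grows like n
```

Quadrupling the grid side barely changes c_3, adds a constant to c_2 (logarithmic growth), and multiplies c_1 by roughly 4.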
No short (polylogarithmic) paths exist when r > 2
• The expected # of edges connecting vertices at distance ≥ d* is small: for r > 2, the probability that a given long-distance edge has length ≥ d* is O((d*)^(2−r)).
• Thus, with high probability there is no edge connecting vertices at distance at least d*, for some d* = n^(1−Ω(1)).
• Since many pairs of vertices are at distance Ω(n) from each other, the shortest path between such pairs has length at least n^(Ω(1)).
[Figure: a pair of vertices at distance Ω(n)]
Local algorithm when r = 2
The algorithm is local and greedy: at each step, follow the edge that takes us closest to the target.
Analysis of the algorithm
Claim: with high probability, for every pair of vertices u, t, within O(ln^2 n) steps the distance from u to t decreases by half.
Proof: If the distance between u and t is k, there are Θ(k^2) vertices at distance ≤ k/2 from t.
All these vertices are at distance Θ(k) from u.
Thus, with probability Θ(k^2·k^(−2)/c_2(u)) = Θ(1/ln n), the current vertex has a long-distance edge into the half-distance ball around t.
Repeating this for Θ(ln^2 n) steps, by the independence of the long-distance edges, we fail with probability o(1/n^4).
Since there are at most n^4 pairs, by the union bound, we succeed for every pair (u, t).
Local algorithm when r = 2 takes polylogarithmic steps
Since the distance is at most 2n at the beginning and halves every O(ln^2 n) steps, we reach the target within O(ln^3 n) steps.
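The greedy routing above is easy to simulate (a sketch; for simplicity each vertex's long-distance edge is resampled on each visit rather than fixed in advance, which is fine for illustration):

```python
import random

def long_edge(u, n, r, rng):
    """Sample u's long-distance edge target with prob. proportional to d(u,v)^(-r)."""
    ux, uy = u
    verts, weights = [], []
    for x in range(n):
        for y in range(n):
            d = abs(x - ux) + abs(y - uy)
            if d > 0:
                verts.append((x, y))
                weights.append(d ** (-r))
    return rng.choices(verts, weights=weights)[0]

def greedy_route(n, r, s, t, rng):
    """Repeatedly move to the neighbor (grid or long-distance) closest to t."""
    dist = lambda a, b: abs(a[0] - b[0]) + abs(a[1] - b[1])
    u, steps = s, 0
    while u != t:
        x, y = u
        nbrs = [(x + dx, y + dy)
                for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1))
                if 0 <= x + dx < n and 0 <= y + dy < n]
        nbrs.append(long_edge(u, n, r, rng))
        u = min(nbrs, key=lambda v: dist(v, t))   # greedy step
        steps += 1
    return steps

rng = random.Random(5)
n = 30
steps = greedy_route(n, 2, (0, 0), (n - 1, n - 1), rng)
print(steps, "steps; grid distance is", 2 * (n - 1))
```

Each step decreases the distance to the target by at least 1 (a grid neighbor always does), so the route never exceeds the Manhattan distance; the long-distance edges at r = 2 are what bring the count down toward polylogarithmic.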
No local algorithm finds polylogarithmic paths when r < 2
Take u and t at distance ≥ n^δ for a small constant δ > 0.
We show that any local algorithm, with high probability, takes ≥ n^δ steps to go from u to t.
Otherwise, the algorithm must use a long-distance edge that lands at a point within distance n^δ of t.
At each step, this happens with probability O(n^(2δ)/c_r) = O(n^(−2+r+2δ)), since there are O(n^(2δ)) vertices within distance n^δ of t, and a local algorithm cannot inspect the outgoing edges of vertices it has not yet visited.
Since such an edge would have to be found within the first n^δ steps, it is found with probability only O(n^(−2+r+3δ)), which is o(1) for small δ.
A local algorithm for finding short paths does not exist when r < 2, despite the existence of short paths.
Proof that logarithmic paths exist when r = 0
• We show the diameter is O(lg n), in a way similar to the proof for Erdős–Rényi graphs.
• Partition the grid into 3×3 squares: now there are 9 long-distance edges going out of each square.
• There are Ω(lg n) squares within distance Θ(lg^(1/2) n) of any square.
• W.h.p. the long-distance neighbors of these squares number at least twice these squares, since 9 > 2, by the Chernoff bound.
Proof that logarithmic paths exist when r = 0 (continued)
• Similarly, one can show that while half of the squares have not been visited, the # of newly visited squares at each level is at least twice the number at the previous level (since # of outgoing edges × fraction of remaining squares ≥ 9 × 1/2 > 2).
• Therefore, more than half the squares can be reached within O(lg n) edges from any vertex.
• Any two sets consisting of more than half the squares have nonempty intersection. Q.E.D.