Dynamics of 2D Potts and random-cluster modelsgheissari/thesis.pdf · Bakhtin, Paul Bourgade,...
Transcript of Dynamics of 2D Potts and random-cluster modelsgheissari/thesis.pdf · Bakhtin, Paul Bourgade,...
Dynamics of 2D Potts and random-cluster models
by
Reza Gheissari
A dissertation submitted in partial fulfillment
of the requirements for the degree of
Doctor of Philosophy
Department of Mathematics
New York University
May 2019
Eyal Lubetzky
Charles Newman
Acknowledgements
Firstly, I am indebted to Eyal Lubetzky for his tremendous generosity with his
time, experience, and knowledge. I am grateful he first introduced me to the field
of Markov chain mixing times and its links with equilibrium statistical mechanics.
More importantly, he has always been someone I could look up to when needing
motivation, talk to when seeking advice (academically or otherwise), bounce ideas
off of when feeling stuck, and run to when excited about any incremental progress.
I truly could not have had a better mentor, advocate, and role-model, and I will
always count myself fortunate that we joined Courant at the same time.
I would like to thank Chuck Newman for serving as a constant source of
inspiration and mentorship throughout my five years at Courant. He has taught me
much of what I know of two-dimensional statistical physics and percolation theory,
and his door has always been open when I have needed someone to look to for help.
I am grateful for my interactions and collaboration with Gerard Ben Arous. He
is a source of unbounded energy and ideas, and I am lucky to have learned about
mean-field spin glasses from him. I am also thankful for his wisdom and availability
the many times I have turned to him for guidance and direction.
This would not have been possible without the good fortune of having Clement
Hongler as an REU advisor at Columbia. He introduced me to mathematical re-
search through the beautiful field of two-dimensional statistical physics at criticality,
put up patiently with my mathematical immaturity, and strongly encouraged me
to pursue a PhD in mathematics at Courant. I also thank Julien Dubedat for his
time and tremendous patience while serving as my undergraduate thesis advisor.
Indeed being at Courant has been one of the best journeys of my life, and the
Courant community has been a most warm, welcoming, and inspirational group. I
iii
owe everything I know to the faculty members from whom I have learned so much
at Courant and at Columbia. I thank Dan Stein for his availability through our
many conversations and collaborations. And I have been lucky to attend insightful
courses from all of the aforementioned, as well as Professors Ofer Zeitouni, Yuri
Bakhtin, Paul Bourgade, Afonso Bandeira, Lai Sang Young, Georg Stadler, Joel
Spencer, Dorian Goldfeld, Daniela De Silva among others.
I especially thank Yuri Bakhtin, Paul Bourgade, Eyal Lubetzky, Chuck Newman,
and Professor Varadhan for agreeing to be on my thesis committee.
Besides the faculty, my time at Courant has also been shaped by my peers,
including Aukosh, Moumanti, Lisa, Krishnan.
I am grateful to my close friends: Danny, Ty, Emily, Nathalie, Sindhu, Erin,
Sarah, Trey, Sabrina, and many others. They have supported me, kept me grounded,
and made sure I take the time to step away from mathematics and enjoy life.
Most importantly, I thank my family for their unconditional love and support.
To my sister, Roya, I have been lucky to have you nearby; you have always been
there, both to talk through difficult decisions with, and as a source of distraction
from mathematics. To my parents, I cannot describe the immense role you have
played in making me the person I am. Thank you for always pushing me, giving
me honest advice, and believing in me; and thank you for instilling in me the
importance of a deep and unending yearning for knowledge.
iv
Abstract
This thesis considers the dynamical phase portrait of the Potts and random-
cluster (FK) models in two-dimensions, as the system size diverges.
The Potts model was introduced in the 1950’s as a model of ferromagnetism
where interacting particles take q ≥ 2 possible states (q = 2 being the Ising model).
The random-cluster (FK) model is a dependent percolation model indexed by a
real-valued q ∈ (0,∞) (q = 1 being independent percolation); it is central to the
analysis of the Potts models, as at integer q it encodes the correlations of the
corresponding Potts model. In 2D, the models have a sharp transition between
a high-temperature phase and a low-temperature phase, via rich behavior at a
critical point: at criticality, when q ≤ 4, they are conjectured to have a conformally
invariant scaling limit with random fractal interfaces (SLE’s), while when q > 4
they are characterized by phase coexistence and Brownian interfaces.
We are interested in understanding how long canonical Markov chains like the
Glauber dynamics for the Potts and FK models, as well as cluster dynamics such
as Swendsen–Wang, take to approximate the equilibrium measure on, say, an n× n
box. These dynamics are well-known to equilibrate rapidly at all high temperatures,
but are believed to undergo critical slowdowns, depending on the order of the phase
transition: this was only known for q = 2 where they mix in polynomial time (in n),
and for sufficiently large q, where they take exponentially long to equilibrate.
We begin by showing that the following holds for the mixing times (as well as
inverse spectral gaps) of Potts Glauber dynamics on the n×n torus, (Z/nZ)2. The
Potts Glauber dynamics at criticality has a mixing time that is polynomial in n
when q = 3, and nO(logn) when q = 4; on the other hand, for every q > 4 the mixing
time of the critical Potts dynamics grows exponentially in n. We also fill in the
v
rest of this phase diagram by proving that for every q, as soon as the temperature
is sub-critical, the Potts Glauber dynamics becomes exponentially slow.
The FK Glauber dynamics on (Z/nZ)2 (as well as the related Swendsen–Wang
dynamics) are known to be fast mixing O(log n) at all off-critical temperatures. We
prove that at the critical point, their mixing time on (Z/nZ)2 is at most nO(logn)
for all q ∈ (1, 4] whereas it becomes exponentially slow in n as soon as q > 4.
The analysis of the FK Glauber dynamics and Swendsen–Wang dynamics is
substantially complicated by their non-locality; long-range interactions can be
encoded into the FK boundary conditions on an n× n box. We first show that in
all off-critical regimes, the mixing time is stable—up to polynomial factors—with
respect to choice of realizable FK boundary conditions. We then turn to the critical
point, where we show that when q is large, the dynamics are sensitive to the choice
of boundary condition. For free or monochromatic boundary conditions and large q,
these dynamics at criticality are much faster than on the torus and mix in exp(no(1))
time. We conclude by investigating this sensitivity at criticality, under boundary
conditions that interpolate between free, wired, and periodic.
vi
Contents
Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii
Abstract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . v
List of Figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix
1 Introduction 1
1.1 Glauber dynamics for the Potts model . . . . . . . . . . . . . . . . 8
1.2 Glauber dynamics for the FK model . . . . . . . . . . . . . . . . . 12
1.3 Dependence on FK boundary conditions off-criticality . . . . . . . . 18
1.4 Sensitivity to boundary conditions in coexistence regimes . . . . . . 23
1.5 Some open problems . . . . . . . . . . . . . . . . . . . . . . . . . . 29
2 Preliminaries 32
2.1 The Potts and FK models . . . . . . . . . . . . . . . . . . . . . . . 32
2.2 Markov chain mixing times . . . . . . . . . . . . . . . . . . . . . . . 42
3 Equilibrium estimates at q ∈ [1, 4] and RSW Theory 59
3.1 Crossing probabilities and RSW Theory . . . . . . . . . . . . . . . 60
3.2 Boundary bridges . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
4 Mixing on (Z/nZ)2 at continuous phase transitions 95
vii
4.1 Potts Glauber dynamics at continuous phase transition points . . . 96
4.2 FK Glauber dynamics at continuous phase transition points . . . . 103
5 Slow mixing in phase coexistence regions 134
5.1 Potts Glauber dynamics on (Z/nZ)2 in phase coexistence regions . 136
5.2 FK dynamics on (Z/nZ)2 at a discontinuous phase transition . . . . 141
5.3 Potts Glauber dynamics with free boundary at β > βc(q) . . . . . . 145
6 FK boundary conditions in uniqueness regions 148
6.1 Fast mixing on thin rectangles . . . . . . . . . . . . . . . . . . . . . 155
6.2 Polynomial mixing time for realizable boundary conditions . . . . . 184
6.3 Near-optimal mixing time for typical boundary conditions . . . . . 193
6.4 A canonical paths bound with realizable FK boundary conditions . 202
6.5 Slow mixing under worst-case boundary conditions . . . . . . . . . 211
7 Boundary conditions in coexistence regions I: free and monochro-
matic boundary 224
7.1 Cluster expansion and equilibrium estimates . . . . . . . . . . . . . 227
7.2 Upper bounds under free boundary conditions . . . . . . . . . . . . 236
8 Boundary conditions in coexistence regions II: other boundary
conditions 262
8.1 Subexponential mixing time with Dobrushin boundary conditions . 274
8.2 Sub-exponential mixing on cylinders . . . . . . . . . . . . . . . . . 281
8.3 Slow mixing with phase-symmetric boundary conditions . . . . . . . 292
Bibliography 297
viii
List of Figures
1.1 The Potts phase transition on Z2 . . . . . . . . . . . . . . . . . . . 4
1.2 Critical Potts configurations at q = 3, 4, 5 . . . . . . . . . . . . . . . 6
1.3 Dynamical phase diagram for the Potts Glauber dynamics . . . . . 11
1.4 A critical FK configuration with nested long-range connections . . . 13
1.5 Long-range FK boundary connections . . . . . . . . . . . . . . . . . 14
1.6 Dynamical phase diagram for the FK Glauber dynamics . . . . . . 15
1.7 Swendsen–Wang with free and periodic boundary . . . . . . . . . . 25
1.8 Dependence of Swendsen–Wang dynamics on boundary conditions
in phase coexistence regimes . . . . . . . . . . . . . . . . . . . . . . 27
3.1 Bounding q = 4 crossing probabilities . . . . . . . . . . . . . . . . . 64
3.2 A pair of boundary connections, called bridges, separated by a
dual-connection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
3.3 A pair of boundary bridges, γi, γi+1, over e ∈ ∂nR induced by a
configuration on Λ−R, and separated by a dual-bridge over e. . . . 74
3.4 Construction for the upper bound on |Be1| . . . . . . . . . . . . . . 83
3.5 Construction for the upper bound on |Be2| . . . . . . . . . . . . . . 88
ix
4.1 Coupling beyond a dual-crossing, with the appropriate bridges dis-
connected in the FK model . . . . . . . . . . . . . . . . . . . . . . . 116
4.2 Block choices for the block dynamics when proving mixing time
upper bounds on the torus . . . . . . . . . . . . . . . . . . . . . . . 127
5.1 Geometric bottleneck for phase coexistence on the torus: Potts . . . 137
5.2 Geometric bottleneck for phase coexistence on the torus: FK . . . . 143
6.1 Long-range FK boundary connections necessitate recursion on groups
of rectangles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 156
6.2 Some examples of disconnecting intervals . . . . . . . . . . . . . . . 160
6.3 The subsets used to recurse on groups of rectangles . . . . . . . . . 171
6.4 The presence of certain dual-connections allows us to couple FK
configurations inside disconnect intervals . . . . . . . . . . . . . . . 175
6.5 Choice of subsets to bound the FK mixing time under realizable
boundary conditions . . . . . . . . . . . . . . . . . . . . . . . . . . 184
6.6 A spatial mixing estimate near “typical” FK boundary conditions . 196
7.1 The choice of the censored FK Glauber dynamics allows us to couple
configurations started from all-wired and all-free . . . . . . . . . . . 241
7.2 Enlargements En(Q) and E′n(Q) of a rectangular subset Q . . . . . 245
7.3 Boundary modification to restrict the influence of the open cluster
of the southern boundary . . . . . . . . . . . . . . . . . . . . . . . . 247
7.4 Boundary modification to couple the two FK configurations on the
central column C . . . . . . . . . . . . . . . . . . . . . . . . . . . . 255
x
8.1 Some of the sets and interfaces considered in the study of tilted FK
interfaces . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 264
8.2 Bounding the probability that a tilted FK interface undergoes a
deviation of at least ±h . . . . . . . . . . . . . . . . . . . . . . . . 266
8.3 The diagonal strips used to construct a censored FK dynamics . . . 277
8.4 The process by which the censored dynamics pushes the FK interface
down to its equilibrium location . . . . . . . . . . . . . . . . . . . . 279
8.5 The constituent events that push the FK interface from one side to
the other on a cylinder . . . . . . . . . . . . . . . . . . . . . . . . . 288
xi
Chapter 1
Introduction
Background
The Ising model was introduced in the 1920’s by Lenz and Ising as a model of
the physical phenomenon of ferromagnetism, where a slab of iron spontaneously
magnetizes after coming in brief contact with an external magnetic field. The
mathematical model consists of a random assignment of ±1-spins to the vertices of a
graph describing the underlying atomic structure of the ferromagnet; the distribution
is governed by a competition between the system’s internal energy, which favors
local alignment, and its entropy, moderated by an inverse temperature β > 0.
In 1951, Domb and Potts generalized the Ising model to the q-state Potts model,
in which particles may take on q ≥ 2 possible states (with q = 2 corresponding to
the Ising model). The Ising and Potts models have seen tremendous attention (cf. [4,
71, 95]), being exactly solvable in the 1D and mean-field settings, and exhibiting
rich phase transitions on the physically relevant square lattices Zd. As such, since
their introductions, the models have become prototypes for understanding sharp
phase transitions and critical phenomena in statistical physics. They have also
1
found applications to many other areas, ranging from ecology and computational
biology, to statistical inference, image processing, and the theory of neural networks.
Let us now define the Potts model more precisely. The Potts model on a graph
G = (V,E) at inverse-temperature β > 0 is the distribution µG,β,q over colorings
of the vertices of G with q colors (states) amongst [q] = 1, ..., q, in which the
probability of a configuration σ ∈ [q]V is given by
µG,β,q(σ) ∝ exp[−βH(σ)] , where H(σ) :=∑
(x,y)∈E
1σx 6= σy , (1.0.1)
i.e., the Hamiltonian H(σ) counts the number of pairs of adjacent vertices that have
distinct colors (see §2.1.1). As mentioned, the Ising/Potts models are of particular
interest in the cases where their underlying geometry has a physically relevant
lattice structure, e.g., on the square lattice Zd (d ≥ 2). Here, they are believed
to undergo a rich phase transition at some critical temperature βc(q), separating
a high-temperature disordered phase, where correlations decay rapidly, from a
low-temperature phase where a macroscopic long-range order emerges.
Fortuin and Kasteleyn introduced the random-cluster (FK) model in 1972 [36];
the FK model on a graph G = (V,E) with parameters 0 < p < 1 and q > 0 is the
distribution πG,p,q over edge subsets of G, where the probability of an edge-subset ω
with o(ω) edges and k(ω) connected components (in the subgraph (V, ω)) satisfies
πG,p,q(ω) ∝ [p/(1− p)]o(ω)qk(ω) , (1.0.2)
(see also §2.1.2). The model generalizes Bernoulli bond percolation (q = 1) and has
far-reaching connections to electrical networks and random spanning trees (q ↓ 0):
see [47]. The FK model is also closely related to the Ising/Potts models at integer
2
q ≥ 2, with connectedness in ω encoding the correlations of the corresponding
Potts model via the Edwards–Sokal coupling [35]; e.g., one may produce σ ∼ µG,β,q
by first sampling ω ∼ πG,p,q for p = 1 − e−β, then assigning (all vertices of)
each connected component in ω a uniformly chosen color amongst [q] = 1, .., q
independently. As such, extensively studied in its own right, the random-cluster
representation has been an important tool in the analysis of Ising and Potts models
(see e.g., [5, 32, 34] for a sampling of breakthrough results on the Potts model that
rely on its random-cluster representation).
The canonical Markov chains for the Potts and FK models are the local-update
Glauber dynamics (e.g., heat-bath, Metropolis) and the non-local Swendsen–Wang
dynamics; these are important both as Markov Chain Monte Carlo samplers from
the Potts and FK measures, and because of the insights they give regarding
the physical off-equilibrium evolution of the corresponding statistical mechanics
systems [45, 94]. Over the years, there has been much effort in physics, theoretical
computer science, and mathematics communities to connect the fascinating static
phase transition of such spin systems to their off-equilibrium dynamics, measured
by e.g., autocorrelation length, spectral gap, or mixing time.
This thesis is part of this mathematical program, analyzing the off-equilibrium
dynamics of the Potts and FK models asymptotically as the system size diverges.
Specifically, we focus on understanding how the equilibrium phase transitions
of Potts and FK models present in their dynamical phase portrait as various
parameters (q, the temperature p or β, and the boundary conditions) are varied.
We study this in the particularly rich and physically relevant setting where the
underlying geometry is planar, e.g., a subset of the integer lattice Z2.
The results in this thesis are primarily a compilation of joint works with
3
Figure 1.1: Potts configurations for q = 3 at β < βc (left), β = βc (middle), andβ > βc (right) on the torus (Z/nZ)2.
Lubetzky [41, 42, 43] and a joint work with Blanca and Vigoda [8].
Potts and FK models on Z2
When the underlying graph is Z2, or some finite subset of Z2, and q ≥ 1,
significant progress has been made in the study of these models’ equilibrium
behavior, characterized by a sharp phase transition at a critical pc(q) (respectively,
βc(q), again related via the relation p = 1− e−β).
Using a duality relation between random-cluster models on planar graphs G
and on their planar dual G∗, and the self-duality of the integer lattice Z2, it
has long been predicted that the critical point of the FK model on Z2 is its self-
dual point psd(q) =√q
1+√q, (and therefore the critical point of the Potts model
is βsd(q) = log(1 +√q)). Indeed this has been known to be the critical point
of the Ising model (q = 2) dating back to the work of Kramers and Wannier in
1941 [56]. In the setting of two-dimensional Bernoulli bond percolation (q = 1),
Kesten (1980) proved [54] that the critical point is given by pc(1) = 12
which
matches the self-dual point psd(1). Later, using very different tools, namely the
4
celebrated techniques of reflection positivity and cluster expansion, the critical point
was identified with the self-dual point for sufficiently large q [55] (this region where
cluster expansion converges has since been brought down to q > 24.78 [57, 58]).
More recently, the seminal work of Beffara and Duminil-Copin [5] identified the
equality pc(q) = psd(q) =√q
1+√q
(and therefore βc(q) = log(1 +√q)) for all q ≥ 1.
The phase transition occurring at pc(q) and βc(q) in Z2 can be characterized
in various different ways. One canonical such choice is described in terms of the
infinite-volume measures on Z2, which can be obtained by taking weak limits of
the measures on Gn ↑ Z2 (see §2.1 for the consistency relations required on an
infinite-volume measure). In the case of the FK model, at high temperatures
p < pc(q), under πZ2,p,q, almost surely every connected component of ω is finite,
whereas at low temperatures p > pc(q), under πZ2,p,q there almost surely exists an
infinite connected component in ω. In the case of the Potts model, this translates
to the following: at high temperatures β < βc(q), there is exactly one unique
infinite-volume Gibbs measure on Z2, whereas at low-temperatures when β > βc(q)
there are multiple distinct infinite-volume Gibbs measures (consisting of mixtures of
q extremal measures corresponding to the situations in which each of the q possible
colors is dominant). In particular, when p < pc(q) (resp., β < βc(q)), probabilites
of FK connections (resp., correlations between spins) decay exponentially fast in
their Euclidean distance, whereas when p > pc(q) (β > βc(q)) they stay uniformly
bounded away from zero. (See Fig. 1.1 for a depiction in the case q = 3.)
Perhaps the most important question in the equilibrium analysis of these two-
dimensional statistical physics models is then to identify the behavior at the critical
pc(q) or βc(q). In particular, the phase transitions can be classified as either
first-order (discontinuous) or second-order (continuous). If the phase transition
5
Figure 1.2: Critical Potts configurations at q = 3, 4, 5 on the torus (Z/nZ)2 atn = 64 with FK boundaries between different clusters.
is second-order the FK representation almost surely does not have any infinite
clusters at its critical point, and the spin-system has a unique infinite-volume
Gibbs measure at β = βc; if the phase transition is first-order, however, the Potts
model has multiple Gibbs measures and the FK model almost surely has an infinite
cluster at criticality (at least under some choice of an infinite-volume FK Gibbs
measure). The famous solution of Onsager (1944) [75] showed that the Ising model
exhibits a continuous phase transition at βc(2); the result of Kesten [54] showed
the same for Bernoulli percolation on Z2; on the other hand, when q > 24.78, the
phase transition is first-order where a “disordered” high-temperature-like phase
coexists with a low-temperature-like “ordered” phase so that the FK model has
two extremal infinite-volume measures, giving rise to q+ 1 extremal infinite-volume
Potts measures.
Recently, the breakthrough of [34] combined sophisticated tools from the theory
of discrete holomorphic observables with percolation and Russo–Seymour–Welsh
(RSW) techniques to prove the continuity of the phase transition at q ∈ [1, 4].
This was followed by [32] which established that as soon as q > 4, the FK and
Potts models have a first-order phase transition as described above. At continuous
phase transition points, the behavior is believed to be governed by conformal field
6
theory (CFT) [6, 27], exhibiting remarkable features like power-law correlation
decay, conformal invariance, and convergence of interfaces to random fractals known
as Schramm–Loewner evolution (SLE) (this has been established in the special
case of q = 2, as well as q = 1 on the triangular or hexagonal lattices). On the
other hand, at discontinuous phase transition points, it is characterized by phase
coexistence and surface tension that is reminiscent of its low-temperature behavior,
but with the addition of the metastable “disordered” phase. See also Fig. 1.2.
In what follows, we build on this recent work to fill in a corresponding dynamical
phase diagram for the planar Potts and FK models, considering canonical Markov
chains like the Glauber and Swendsen–Wang dynamics. The central objects of
study for the dynamics are the spectral gap and mixing time (i.e., measuring
the time it takes for the Markov chain to converge to its invariant distribution).
In two-dimensions, these dynamics are of added interest at the critical point
itself, where the mixing time depends heavily on the order of the phase transition,
and is predicted to exhibit dynamical critical phenomena, e.g., dynamic critical
exponents, universality, and non-trivial scaling limits (see, e.g., the well-known
physics review [49]).
In §1.1 we consider the Glauber dynamics for the Potts model on an n×n torus
(Z/nZ)2 at its critical point βc(q) in the three regimes: 1 < q < 4, the extremal
q = 4, and q > 4. In §1.2, we consider the Glauber dynamics for the FK model on
(Z/nZ)2 in those same regimes, as well as the intimately related Swendsen–Wang
dynamics of the Potts model. The analysis of the FK dynamics is considerably more
intricate due to the presence of long-range dependencies in the Glauber dynamics’
transition rates. These long-range dependencies can be encoded into the boundary
conditions, and we study how they can affect the mixing times first in the setting of
7
p 6= pc(q) for the FK dynamics in §1.3. In §1.4, we turn to analyzing the sensitivity
of mixing times for FK and Swendsen–Wang dynamics to boundary conditions, in
the phase coexistence regime of p = pc(q) and q large: we classify the mixing times
under boundary conditions including (and interpolating between) free, wired and
toroidal. We end the introduction in §1.5 with a selection of related open problems
in the understanding of Potts and FK dynamics on Z2.
1.1 Glauber dynamics for the Potts model
Glauber dynamics is a local Markov chain, introduced by Glauber (1963) in [45],
modeling the physical evolution of a spin system following a deep quench, i.e.,
initialized from some off-equilibrium configuration. Various flavors of Glauber
dynamics (heat-bath, Metropolis) are also some of the most frequently used Markov
chain Monte Carlo (MCMC) samplers for such spin systems. For the Potts model,
the heat-bath Glauber dynamics updates each vertex via an i.i.d. rate-1 Poisson
process, where its new value is sampled according to µG,β,q conditioned on the
values of all other vertices: that is, it is the continuous-time Markov process that,
when the clock at a vertex i rings updates the spin σi according to the following
transition rates:
µG,β,q(σi = ` | σG\i
)=
eβ∑
(i,j)∈E 1σj=`∑1≤`≤q e
β∑
(i,j)∈E 1σj=`, for ` = 1, ..., q .
Its discrete-time version at each integer time-step picks a vertex uniformly at
random, then updates its state according to the same transition rate. The Potts
Glauber dynamics on G is, by construction, ergodic and reversible with respect to
the Potts distribution µG,β,q, and therefore one is interested in the time it takes for
8
the dynamics to approximate its invariant measure.
The spectral gap of a discrete-time Markov chain, denoted gap, is 1−λ where λ
is the largest nontrivial eigenvalue of the transition kernel, and for a continuous-time
chain it is the gap in the spectrum of its generator. The inverse of the spectral
gap, denoted gap−1 serves as an important gauge for the rate of convergence of the
chain to equilibrium in L2. The mixing time, tmix, of a Markov chain is a closely
related quantity (see (2.2.3)) which is defined by the time it takes to be within
L1-distance of at most ε of stationarity, from a worst-case initialization.
The static phase transition of the Ising, Potts and FK models are believed to
manifest in their Glauber dynamics on n× n boxes or torii. At this point a rich
dynamical phase transition has been established in the case of the Ising model: at
high temperature (β < βc(2)) the inverse-gap gap−1 is O(1) and the system rapidly
mixes in L1 at time Cβ log n+O(1) [1, 62, 65, 68, 69], whereas at low temperature
(β > βc(2)), these slow down to exp(cβn+ o(n)) [19, 20]. At the critical point, the
mixing time is believed to be nz for some dynamic critical exponent, which has
seen extensive numerical and exact study in physics communities and is believed to
be universal [25, 59, 74, 93]. Lubetzky and Sly [64] showed that the mixing time
indeed has a polynomial critical slowdown, though the rigorous upper and lower
bounds are far apart.
In the case of the Potts model, the absence of monotonicity and a more limited
understanding of the equilibrium measure made the corresponding picture less
understood. With the identification of the critical point [5], the exponential decay
of correlations at high temperatures implied an order one gap and optimal O(log n)
mixing [2, 3, 69] for all β < βc(q). Besides the natural expectation that the mixing
is slow for all β > βc(q), physicists predicted that the mixing time at criticality is
9
polynomial (with some dynamic critical exponent) when q = 3, 4 in accordance with
the continuous phase transition [25, 26, 31], whereas it should slow to exponential
in n as soon as q > 4. However, this was only known rigorously for the perturbative
regimes of (1) sufficiently large β βc(q) and (2) all β ≥ βc(q) for sufficiently large
q 1 [12, 13].
The first results of this thesis are to fill in much of the remainder of this
dynamical phase diagram on (Z/nZ)2 (see Figure 1.3 for a visualization).
Theorem 1. There exist c1, c′1 > 0 and c2, c
′2 > 0 so that continuous-time Glauber
dynamics on (Z/nZ)2 satisfy the following: for the 3-state critical Potts model,
nc1 . gap−1p . nc
′1 , (1.1.1)
and for the 4-state critical Potts model,
nc2 . gap−1p . nc
′2 logn . (1.1.2)
In fact, Eqs. (1.1.1)–(1.1.2) hold for arbitrary boundary conditions on an n× n box.
Theorem 2. Let β and q be such that the Potts model has two distinct infinite-
volume measures on Z2, i.e., µ1Z2,β,q 6= µ2
Z2,β,q (corresponding to two different
dominant colors). There exists c(β, q), c′(β, q) > 0 such that Glauber dynamics for
the q-state Potts model at β on (Z/nZ)2 satisfy
c ≤ 1
nlog gap−1
p ≤ c′ . (1.1.3)
In particular, this implies slow mixing on (Z/nZ)2 in the following two regimes:
10
β
q
β
q
Figure 1.3: The dynamical phase portrait of the Potts Glauber dynamics on (Z/nZ)2
as β, q vary (the grey curve indicating β = βc(q)). On the left, a visualizationof what was known, and on the right, our contribution. Red indicates the high-temperature regime with an order one inverse spectral gap, orange indicates apolynomial one, and shades of blue indicate an inverse gap that is exponential in n.
1. Low temperature: For every q, if β > βc(q), then 1n
log gap−1p 1.
2. Discontinuous phase transition: If q > 4 and β = βc(q), then 1n
log gap−1p 1.
Crucial to the analysis of [64] of the critical Ising dynamics (q = 2) were Russo–
Seymour–Welsh (RSW) estimates for the corresponding FK model—stating that
on n×m rectangles with uniformly bounded aspect ratios and arbitrary boundary
conditions, crossing probabilities are uniformly bounded away from zero—obtained
by [33] using the discrete holomorphic observable framework of Smirnov [88]. The
framework of [88] is further applicable to the critical Potts model for q = 3, and
the above RSW-type estimates for the FK-Ising model were extended [34] to this
case; this allows us to similarly extend the dynamical analysis of [64] to q = 3 and
obtain (1.1.1). However, at q = 4, these RSW estimates only hold in the bulk of
the n×m rectangle, and crossing probabilities near the boundary are believed to
be sensitive to boundary conditions; this difficulty results in our quasi-polynomial
(rather than a polynomial) upper bound on mixing (1.1.2).
When β > βc(q), or β = βc(q) and q > 4, the phase coexistence implies that
11
crossing probabilities and related quantities depend on which phase is “picked out”
to be dominant; in Section 5.1, we introduce a novel geometric bottleneck based on
multiple crossings of non-trivial homology on the torus to derive the implication
that phase coexistence implies slow mixing of Glauber dynamics on (Z/nZ)2. We
are also able to show slow mixing for the Potts Glauber dynamics as soon as
β > βc(q) on an n× n box with no (free) boundary conditions in Section 5.3.
1.2 Glauber dynamics for the FK model
We now turn to the heat-bath Glauber dynamics for the critical 2D FK model
in the same regimes of interest of q ∈ (1, 4] and q > 4. The FK Glauber dynamics
is defined in analogy to the Potts Glauber dynamics, but for edge updates. Each
edge receives an i.i.d. rate-1 Poisson clock, and when its clock rings, the inclusion
of e in the subgraph ω (also identified with an assignment ω ∈ 0, 1E) is chosen
according to πG,p,q conditioned on ω \ e: if the edge e = (x, y), then
πG,p,q(e ∈ ω | ωE\e) =
p
q(1−p)+p if x←→6 y in (V, ω \ e);
p if x←→ y in (V, ω \ e),
where x ←→ y in a graph G = (V,E) indicates x, y are in the same connected
component of G. Notice, importantly, that unlike the Glauber dynamics of the
Potts model, the transition rates for an edge e are not simply a function of the
states of its neighboring edges (or even those in an O(1) neighborhood).
All the same, the FK dynamics is believed to undergo a similar critical slowdown
phenomenon to the Potts Glauber dynamics: on an n × n torus, for all p 6= pc
(where the FK model has uniqueness) the mixing time of the dynamics has an order
12
Figure 1.4: A critical FK configuration that induces three nested, distinct boundarycomponents (red, blue, purple) called bridges.
one spectral gap and order log n mixing time, while at p = pc it should behave as
nz for some universal z > 0 in the presence of a continuous phase transition, and
as exp(cn) in the presence of a discontinuous phase transition.
For p 6= pc, the facts that tmix log n and gap−1rc 1 on (Z/nZ)2 were
established in [9] using the exponential decay of cluster diameters in the high-
temperature regime p < pc: on finite boxes with select nice boundary conditions,
this translates to a property known as strong spatial mixing, implying that the
number of disagreements between the states of two chains started at different initial
states decreases exponentially fast, thus tmix log n; this result readily extends to
p > pc by the duality of the two-dimensional FK model.
At q ∈ (1, 4] and p = pc(q), where correlations decay polynomially and not
exponentially fast, the upper bounds on gap−1p for the Ising and q = 3, 4-state Potts
models from [64] and Theorem 1 used a multi-scale approach that reduced the side
length of the box by a constant factor in each step via a coupling argument. On
(Z/nZ)2, these carry over to the FK dynamics for q = 2, 3, 4 by the comparison
estimates of [91], which we will discuss later. However, for non-integer q, FK
configurations may form macroscopic connections along the boundary of smaller-
scale boxes, destroying the coupling, because of the long-range dependencies in the
FK dynamics’ transition rates (see Fig. 1.5)—this is prevented for integer q thanks
13
Figure 1.5: Long-range connections in the boundary conditions stand in the way ofcoupling FK configurations beyond a boundary interface.
to the special relation between FK and Potts models.
To control this effect, in Section 3.2, we rely on percolation techniques and
RSW theory to prove upper and lower bounds on the total number of nested
disjoint macroscopic connections along the boundaries of smaller-scale boxes at
p = pc—these estimates may be of independent interest. We then need to devise a
new dynamical scheme relying on a coupling between a censored Markov chain with
a systematic block dynamics chain to push through the coupling while modifying
the boundary conditions at each step. Refer to §4.2.1 for a detailed proof overview.
It was recently shown [48] that for q = 2 the FK Glauber dynamics on any
graph G = (V,E) has gap−1rc ≤ |E|O(1); the technique there, however, is highly
specific to the case of q = 2. Indeed, this bound is violated on Zd, for any d ≥ 2,
at p = pc and q large, as follows from the exponential lower bounds of [12, 13], as
well as in the mean-field setting at any q > 2 [10, 38, 44, 46]. The best prior upper
bound on non-integer 1 < q < 4 was tmix ≤ exp(O(n)); we prove that for periodic
boundary conditions (as well as a wide class of others, including wired and free; see
Remark 1.2.1), the following holds:
Theorem 3. Let q ∈ (1, 4] and consider the Glauber dynamics for the critical FK
14
p
q
p
q
Figure 1.6: The phase portrait of the FK Glauber dynamics on (Z/nZ)2 as p, qvary (the grey curve indicating p = pc(q)). On the left, a visualization of what wasknown, and on the right, our contribution. Red indicates the uniqueness regimewith an order one inverse gap, brown indicates an nO(logn) one, which is expectedto be polynomial, and blue indicates an inverse gap that is exponential in n.
model on (Z/nZ)2. There exists c = c(q) > 0 such that gap−1rc . nc logn.
In the case of a discontinuous phase transition q > 4, these long-range depen-
dencies are avoided, as we no longer rely on a multi-scale argument to bound the
mixing time, using instead the FK analogue of the geometric bottleneck used to
prove Theorem 2.
Theorem 4. Let q > 4 and consider the Glauber dynamics for the critical FK
model on (Z/nZ)2. There exists c(q), c′(q) > 0 such that
c ≤ 1
nlog gap−1
rc ≤ c′ .
Refer to Figure 1.6 for a visualization of the contributions Theorems 3–4.
Remark 1.2.1. Theorem 3 holds for rectangles with uniformly bounded aspect
ratio, under any set of FK boundary conditions (formally defined in §2.1.2) with
the following property: for every edge e on the boundary of the box, there are
O(log n) distinct boundary components connecting vertices on either side of e (see
15
Definitions 4.2.11–4.2.12 and Theorem 4.2.14). This includes, in particular, the
wired and free boundary conditions, as well as, with high probability, “typical”
boundary conditions: those that are sampled from πZ2,pc,q (see Lemma 4.2.17).
1.2.1 Consequences for the Swendsen–Wang dynamics
Swendsen–Wang dynamics is a (non-local) Markov chain on Potts configura-
tions, introduced in 1987 by Swendsen and Wang [90], that exploits the coupling
between the FK and Potts models, to overcome low-temperature bottlenecks in the
energy landscape (thus providing a potentially faster sampler compared to Glauber
dynamics) via global cluster flips. Because of this, it is widely used as the standard
MCMC sampler for the Ising/Potts models.
More precisely, the Swendsen–Wang dynamics is the discrete-time Markov chain
that from a Potts configuration σ(t) generates configuration σ(t+ 1) by
1. generating an intermediate FK configuration ω ∈ 0, 1E as follows: for
e = (x, y), set ωe = 0 if σx(t) 6= σy(t), and ωe = 1 with probability p and
ωe = 0 with probability 1− p if σx(t) = σy(t).
2. to each connected component C ⊂ V of (V, e : ωe = 1), assign an i.i.d. color
ΞC ∼ Unif1, ..., q, and assign σx(t+ 1) = ΞC for every x ∈ C.
Chayes–Machta dynamics [21] is a closely related Markov chain on FK configurations,
analogous to Swendsen–Wang for integer q, yet defined for any real q ≥ 1 (see §2.2.1).
The second (re-coloring) step of the Swendsen–Wang dynamics allows it to
quickly jump between configurations in which one color is dominant, to those in
which another color is dominant; it therefore seamlessly swaps between the different
metastable phases at low-temperatures. However, as with FK Glauber dynamics,
16
the Swendsen–Wang dynamics may still be slowed down by the bottleneck between
the “ordered” and “disordered” phases, which can coexist at p = pc(q).
Ullrich formalized the close connection between the Swendsen–Wang dynamics
and the FK Glauber dynamics via the following comparison inequality.
Theorem 1.2.2 ([91, 92]). Let q ≥ 2 be integer. Let gapp and gaprc be the spectral
gaps of discrete-time Glauber dynamics for the Potts and FK models on a graph
on m edges and maximum degree ∆, resp., and let gapsw be the spectral gap of
Swendsen–Wang dynamics. Then
gapp ≤ 2q2(qe2β)4∆gapsw , (1.2.1)
gaprc ≤ gapsw ≤ 16gaprcm logm. (1.2.2)
Theorem 1.2.2 implies, in particular, that the mixing time of the FK Glauber
dynamics and Swendsen–Wang dynamics on a graph G are comparable up to
polynomial factors in the number of vertices. However, the Potts Glauber dynamics
can be much slower than both, e.g., on the n× n torus when β > βc(q).
Via these comparison estimates, [91, 92] concluded polynomial upper bounds on
the mixing times of the Swendsen–Wang dynamics at all off-critical temperatures
β 6= βc(q), as well as for the Ising Swendsen–Wang dynamics at β = βc(2). Later, [7]
proved that in fact the Swendsen–Wang dynamics has an order one spectral gap and
mixes in optimal O(log n) time when β < βc(q). It was at first unclear whether the
Swendsen–Wang dynamics undergoes a critical slowdown analogous to that of the
Ising/Potts models; Gore and Jerrum [46] showed that indeed, at the critical point
on the complete graph, for every q > 2 (where the phase transition is discontinuous)
the Swendsen–Wang dynamics has tmix ≥ exp(Ω(√n)); the full dynamical picture
17
for the dynamics on the complete graph has since been established in [10, 38, 44].
On the torus (Z/nZ)d with d ≥ 2, [12, 13] proved that for q sufficiently large, the
Swendsen–Wang dynamics becomes exponentially slow at βc(q).
These comparison inequalities allow us to translate our mixing time estimates
for the FK dynamics on (Z/nZ)2 into sharper bounds on the Swendsen–Wang
dynamics on (Z/nZ)2.
Corollary 1.2.3. Consider the Swendsen–Wang dynamics for the q-state Potts
model at its critical point β = βc(q) on (Z/nZ)2. At q = 3, it has gap−1sw = nO(1),
and at q = 4 it has gap−1sw = nO(logn). For every q > 4, it has 1
nlog gap−1
sw 1.
Remark 1.2.4. For q ∈ 2, 3, 4, the bounds of Theorem 1 for the Potts Glauber
dynamics held uniformly over all possible Potts boundary conditions. The com-
parison estimates of [91] carry the upper bounds on the mixing time of the Potts
Glauber dynamics to the FK Glauber dynamics and Swendsen–Wang dynamics,
yet only for a limited class of boundary conditions (e.g., there can be only one
boundary component of size larger than nε, in contrast to “typical” ones). More
generally, the exponential dependence in the maximum degree in (1.2.1) causes
boundary conditions with large components to be very problematic when applying
these comparison inequalities. We explore this further in Section 1.4.
1.3 Dependence on FK boundary conditions off-
criticality
While the FK representation is intimately related to the Ising/Potts models,
unlike the latter spin systems, the weight of a configuration ω is not a function of
18
local interactions between edges, but instead of global connectivity properties. This
non-local structure is a crucial feature of the model but significantly complicates its
analysis. As hinted at in the previous section, FK boundary conditions can enforce
such long-range interactions that destroy spatial mixing properties and can make
the dynamical analysis of the model much more difficult, both at criticality and
off-criticality. In this section, we carefully analyze the effect of boundary conditions
on the dynamics, in the uniqueness regime of p 6= pc(q) where the FK dynamics
exhibits optimal O(log n) mixing in the absence of boundary conditions [10].
For an n×n box Λn ⊂ Z2 with nearest-neighbor edges, we let ∂Λn be its (inner)
boundary (i.e., those vertices in Λn that are adjacent to vertices in Z2 \Λn). Recall
that an Ising/Potts boundary condition τ is a fixed assignment of spins to ∂Λn,
and µτΛn is the Gibbs distribution on Λn conditional on the assignment τ to ∂Λn.
(Since the interactions are nearest-neighbor, this is the same as conditioning on
a configuration on all of Z2 \ (Λn \ ∂Λn).) For the FK model on Λn, a boundary
condition ξ on ∂Λn is a partition ξ1, ξ2, ... of the vertices in ∂Λn such that all
vertices in ξi are forced to be in the same connected component of a configuration ω
via “ghost” (or external) wirings; these connections are considered in the counting
of k(ω) in (1.0.2) and can impose highly non-local interactions. When running the
FK Glauber dynamics in the presence of boundary conditions, the connectivity of
vertices x, y is computed with the connectivities imposed by the boundary taken
into consideration.
Of particular interest are boundary conditions for the FK model corresponding
to configurations on Z2 \ Λn: i.e., where the boundary partition is induced by
the connections of a random-cluster configuration on E(Z2) \ E(Λn). We call
such boundary conditions realizable. (In fact, many works, including the standard
19
text [47], often restrict attention to realizable boundary conditions.)
For the Ising/Potts model Glauber dynamics, the Θ(log n) mixing at high
temperatures β < βc(q) is well understood to hold independently of the boundary
conditions [2, 3, 5, 18, 69]. These bounds follow as a consequence of the exponential
decay of correlations of the model in the high-temperature regime, which holds
even near the boundary uniformly over choice of Ising/Potts boundary conditions;
this property is known as strong spatial mixing (SSM).
The corresponding bound for the FK Glauber dynamics, of Θ(log n) mixing
on Λn when p 6= pc(q), relied on a strong spatial mixing property for the random-
cluster model; however unlike the Ising/Potts models, strong spatial mixing for
the FK model only holds in the presence of boundary conditions that do not carry
information about random-cluster connectivities in non-local ways [3]: namely,
configurations in different regions of Λn do not interact through these boundaries.
As such, the Θ(log n) upper bound of [9] only holds under boundary conditions
that are free (no boundary condition), wired (all boundary vertices are connected
to one another) or periodic (the torus). The behavior of the FK Glauber dynamics
under other random-cluster boundary conditions remained unclear.
We prove that the FK dynamics at p 6= pc(q) mixes in polynomial time, uniformly
over all realizable boundary conditions.
Theorem 5. For every q > 1, p 6= pc(q), there exists a constant c(p, q) > 0 such
that the FK Glauber dynamics on the n×n box Λn ⊂ Z2 with any realizable boundary
condition ξ has gap−1rc = O(nc).
We pause to comment on the proof of Theorem 5. As mentioned above, it is
easy to construct examples of realizable boundary conditions where the correlation
between edges near the boundary does not decay in their distance, even if p pc(q),
20
as the boundary can enforce long-range interactions. Since the exponential decay of
correlations does hold for edges in the “bulk” of Λn (i.e, at distance Θ(log n) away
from its boundary), we are able to reduce the proof of Theorem 5 to proving a
polynomial upper bound for the mixing time of the FK-dynamics on thin rectangles
of dimension n×Θ(log n) with realizable boundary conditions. This will be the
key technical difficulty for us and is established in Theorem 6.1.1.
In the setting of spin systems, or boundary conditions that do not encode
long-range interactions, a polynomial upper bound on n×Θ(log n) rectangles would
follow from standard canonical paths arguments [53, 66, 67, 86]. However, even
realizable boundary conditions can heavily distort the graph with external wirings,
preventing this approach from succeeding (c.f., Section 6.4). Instead, to prove
Theorem 6.1.1 we devise a novel application of the recursive (block dynamics)
scheme that was used in the proofs of Theorems 1 and 3. We point the reader to
Section 6.1 for a more detailed proof overview.
Using Theorem 5, we prove near-optimal O((log n)C) mixing time for “typical”
boundaries as we detail now. The notion of typicality should be understood as
with high probability under some probability distribution over realizable boundary
conditions, with a natural choice being the marginal distribution of the infinite
random-cluster measure πZ2,p,q on E(Z2) \ E(Λn).
Theorem 6. For every q > 1, p < pc(q) (resp., p > pc(q)), there exists a constant
C > 0 such that if a boundary condition ξ on Λn is randomly drawn according
to πZ2,p′,q′(·E(Z2)\E(Λn)) for any p′ < pc(q′) (resp., p′ > pc(q
′)), with probability
1− o(1), the continuous-time FK Glauber dynamics on Λn with boundary condition
ξ has gap−1rc = O((log n)C).
The proof of Theorem 6 uses Theorem 5 in a crucial way. Typical boundary
21
conditions do not exhibit the strong spatial mixing property from [3, 9]; however,
for boundary conditions having boundary components of size at most α log n,
correlations between edges near the boundary decay exponentially in their graph
distance divided by α log n. Using this, we reduce bounding the mixing time on Λn
with typical boundaries to bounding the mixing time on Θ((log n)2)×Θ((log n)2)
rectangles with arbitrary realizable boundary conditions. Theorem 5 implies that
the mixing time in these smaller rectangles is at most poly-logarithmic in n.
Given that our rapid mixing result for realizable boundaries relies heavily on the
planarity of the boundary connections in Z2 \ Λn, one may wonder whether rapid
mixing holds for all possible FK boundary conditions (including those not realizable
as configurations on Z2 \ Λn). We answer this in the negative, showing that there
exist (non-realizable) boundaries for which the FK-dynamics is slow mixing even
while p 6= pc(q). In fact, this torpid mixing holds at p pc(q), which may sound
especially surprising as correlations in πΛn,p,q die off faster as p decreases.
Theorem 7. Let q > 2. For every α ∈ (0, 12] and λ > 0 there exists a boundary
condition ξ, such that when p = λn−α the FK Glauber dynamics on the n× n box
Λn with boundary condition ξ has gap−1rc = exp(Ω(nα)).
Our proof of this theorem is constructive: we take any graph G on m edges for
which slow mixing of the FK-dynamics is known at some value of p(m) < pc(q), and
show how to embed G into the boundary of Λn. We then develop a procedure to
transfer mixing time bounds from G (speficially the case where G is the complete
graph [10, 38, 44, 46]) to Λn at p(m) sufficiently small so that the effect of the
configuration away from the boundary is negligible.
We remark about the condition q > 2 in Theorem 7. In [48] it was shown
that the mixing time of FK-dynamics when q = 2 is at most polynomial in the
22
number of vertices on any graph and at every p ∈ (0, 1). It is believed that this
rapid mixing holds for all q ≤ 2; hence the requirement q > 2 should be sharp for
Theorem 7. We believe that the above slow mixing result may also extend to small,
but Ω(1) values of p < pc(q), though our current proof does not allow for this. In
principle, one would want to embed a bounded degree graph into ∂Λn, so that its
critical point at which it exhibits slow mixing is Ω(1). There are several examples
of bounded degree graphs where torpid mixing is known [12, 13, 24, 41, 42].
1.4 Sensitivity to boundary conditions in coexis-
tence regimes
In the previous section, we investigated the sensitivity of mixing times in the
uniqueness regime of the FK dynamics p 6= pc(q). Theorems 5–6 hinted at stability
of mixing times with respect to changes of boundary conditions, as long as they
are compatible with the planarity of the model.
In this section, we stick to combinations of free, wired, and periodic boundary
conditions and demonstrate how different boundary conditions can dramatically
alter the mixing time at criticality in phase coexistence regimes, namely at q > 4
and p = pc(q). None of these boundary conditions carry long-range information.
Instead, the dependence of mixing times on boundary conditions here is because
the boundary conditions can pick out one of the two dominant phases (free and
wired) and destroy the metastability of the other. Since we are now discussing FK
boundary conditions which correspond to some Potts boundary condition at integer
q (free to free, and wired to monochromatic, say, red) the results will hold for both
the FK dynamics and the Swendsen–Wang dynamics of the Potts model.
23
1.4.1 Free and monochromatic boundary conditions
A similar sensitivity to boundary conditions in phase coexistence regimes has
been extensively studied in the low temperature Ising model. There, it is a long-
standing open problem to show that the mixing time under all-plus boundary
conditions is order n2, with the largest minus component following a droplet
shrinkage picture governed by mean-curvature flow [11, 52]. Martinelli [66] showed
that under all-plus boundary conditions, the mixing time is eO(√n) at β βc as
compared with exponentially slow on the torus; this was subsequently improved to
exp(no(1)) and then nO(logn) and all β > βc in [63, 70].
If we naively followed the intuition from the low temperature Ising and Potts
model, free boundary conditions—where the q different colors’ phases are metastable
so gap−1p & exp(cn)—might be expected to induce the same (slow) critical mixing
behavior as in the torus. However, this is not case, as the following theorem
demonstrates; see also Fig. 1.7.
Theorem 8. Fix any q large enough and let p = pc(q). The following holds for the
FK Glauber dynamics on an n× n box with free boundary conditions:
gap−1rc . exp
(no(1)
). (1.4.1)
At integer q and β = βc(q), the same bound holds for the Swendsen–Wang dynamics.
The estimate (1.4.1) holds also for wired FK boundary conditions (which at
integer q correspond to monochromatic Potts boundary conditions), since the free
boundary conditions at pc(q) are self-dual to wired boundary conditions (see §2.1.2).
In fact, we establish (1.4.1) for FK boundary conditions sampled from the free or
wired infinite-volume Gibbs measures (see Proposition 7.0.2), as well as ones that
24
500 1000 1500 2000 2500 3000
0.2
0.4
0.6
0.8
1.0
periodic
free
Figure 1.7: Swendsen–Wang realizations for the critical 5-state Potts model, froma monochromatic initial state, under free (left) vs. periodic (right) boundaryconditions; plot shows largest component size in the corresponding FK configuration.
are free on three sides and wired on the fourth (Corollary 7.2.7).
Theorems 2 and 8 show the similarities between the dynamical behavior of
the FK model at its critical point pc(q) in the presence of a discontinuous phase
transition, and the 2D Ising model in the low temperature regime β > βc(2). In fact,
the proof of Theorem 8 follows the approach used in [70] to establish sub-exponential
upper bounds on tmix for Ising Glauber dynamics in the presence of all-plus boundary
conditions. The absence of monotonicity in the Potts model frequently leads us
to work directly with the FK representation. However, this entails adapting the
cluster-expansion techniques and the Wulff construction framework of [30] to the
25
FK model. Moreover, unlike the Ising model—where central to the upper bounds on
mixing in many related works is the coupling of configurations beyond an interface
between clusters (e.g., the interface between the plus and minus phases, used to
establish the inductive step in the multi-scale argument of [66])—recall that the
boundary conditions of the FK model may feature long-range connections between
vertices. Using these as a “bridge” over the interface (see Figure 1.5), different
FK configurations below the interface may induce different distributions above it,
thus preventing the coupling. Working around obstacles of this type comprises a
significant part of the proof of Theorem 8, as it did the proofs of Theorems 3 and 5.
1.4.2 Combinations of free, monochromatic, and periodic
boundary conditions
We next investigate the relationship between mixing times and boundary con-
ditions that interpolate between periodic and free/monochromatic, at the critical
point of a discontinuous phase transition (see Figure 1.8). As with Theorem 8, our
results hold for q large enough, and are expected to hold whenever the order-disorder
surface tension (see Def. 7.1.1) is positive, namely at p = pc for all q > 4 (see [32]).
Recall that ∂Λn is the set of vertices of Λn adjacent to Z2 \ Λn. Also, let ∂nΛn
be its northern boundary, and define the eastern, western, and southern boundaries
∂eΛn, ∂wΛn, ∂sΛn similarly.
Theorem 9 (Mixed b.c.). Let q be large, ε > 0, and let (an, bn, cn, dn) be marked
vertices on ∂Λn such that they are not all within εn of any one side of ∂Λn and are
all distance greater than εn from each other. There exists c(ε, q) > 0 (independent
of n and an, bn, cn, dn) such that the FK Glauber dynamics on Λn at p = pc(q) with
26
Boundary Swendsen–Wang
Periodic/Mixed
||
||
tmix ≥ exp(cn)
Dobrushin tmix ≤ exp(O(√n log n))
Cylindrical
||
||
||
||
||
||
tmix ≤ exp(n1/2+o(1))
Figure 1.8: Mixing time bounds for Swendsen–Wang dynamics on n × n boxeswith different sets of boundary conditions. Dashed lines indicate free boundaryconditions, the bold red lines denote red boundary conditions, and hash markings|, || indicate periodic boundary conditions on their respective sides. The differentboundary conditions depicted here are special cases of Theorems 4 and 8–11.
boundary conditions that are wired on the boundary segments (an, bn) and (cn, dn)
and free elsewhere, has
gap−1rc & exp(cn) .
In particular, this holds with wired boundary on ∂e,wΛn and free elsewhere.
When q is integer and β = βc(q), the same holds for the Swendsen–Wang
dynamics, where wired boundary conditions are replaced with monochromatic, i.e.,
all sites on the boundary segments (an, bn) and (cn, dn) take the same state/color.
Theorem 10 (Dobrushin b.c.). Let q be large and let (an, bn) be marked vertices
on ∂Λn. There exists c(q) > 0 (independent of n and an, bn) so that FK Glauber
dynamics at p = pc(q) with boundary conditions that are wired on the boundary
27
segment (an, bn) and free elsewhere, has
gap−1rc . exp(c
√n log n) .
In particular, this holds with wired boundary on ∂s,wΛn and free elsewhere. When q
is integer and β = βc(q), the same bound holds for Swendsen–Wang dynamics.
Theorem 11 (Cylinders). Let q be large and p = pc(q). The FK Glauber dynamics
with periodic boundary conditions on ∂n,sΛn and either wired or free boundary
conditions on each of ∂eΛn and ∂wΛn satisfies
gap−1rc . exp(n1/2+o(1)) .
When q is integer and β = βc(q), the same holds for the Swendsen–Wang dynamics.
Remark 1.4.1. Theorems 9–11 showed the dependence of mixing times on bound-
ary conditions for the critical FK model in the phase coexistence regime. If, instead,
one were interested in the simpler setting of Glauber dynamics for the 2D Ising
model at large β (where the plus and minus phases would assume the role of wired
and free phases in our theorems), the proofs would carry over (and even simplify
due to the absence of long-range interactions), via the tools of [30].
Remark 1.4.2. By the well-known relation between gap and the total variation
mixing time tmix (see (2.2.3) in §2.2), the bounds in Theorems 1–11 all hold
with gap−1 replaced by tmix. Similarly, standard comparison estimates (see [60,
Lemma 13.22]) extend our bounds for heat-bath Glauber dynamics to Metropolis
(and other flavors of local-update dynamics), and the bounds on FK Glauber
dynamics to the non-local Chayes–Machta dynamics (and its variants).
28
1.5 Some open problems
There are many interesting open questions that remain unresolved in the under-
standing of the dynamics of Ising, Potts, and FK models on the two-dimensional
lattice Z2. In the context of the Ising model, many of the biggest open problems are
very well-known (e.g., determining the precise polynomial for the mixing time at
β = βc(q), and identifying the mixing time of the low-temperature model with plus
boundary conditions as Θ(n2)). In what follows we collect some less well-established
open problems that arise specifically in the context of the results of this thesis,
regarding the dynamics of the planar Potts and FK dynamics.
Critical behavior at continuous phase transition point
Beginning with the Potts Glauber dynamics on the torus (Z/nZ)2, the most
clear open problem remaining from our results is the following:
Problem 1.5.1. Show that there exists c > 0 such that the Potts Glauber dynamics
at q = 4 and β = βc(q) on (Z/nZ)2 has gap−1p . nc.
Addressing this question requires a sharper coupling technique than the coupling
beyond a given dual-FK crossing as done in Chapter 4; this is because at q = 4
under the wired boundary conditions, the probability of such a top-to-bottom
crossing is not expected to satisfy an RSW estimate. Rather one would want
to rely on the fact that typical boundary conditions support such dual-crossings
with some positive probability, and sub-blocks of Λn will have typical boundary
conditions with high probability; the ideas in proving Theorem 3 may prove useful
for this. Indeed, pushing this approach through seems closely related to resolving
the following gap left open by Theorem 3.
29
Problem 1.5.2. Show that there exists a c(q) > 0 such that the FK Glauber
dynamics at q ∈ (1, 4] and p = pc(q) on (Z/nZ)2 has gap−1rc . nc.
In the other direction, we draw attention to the fact that there is no polynomial
lower bound on the mixing time or inverse gap of the continuous-time FK Glauber
dynamics (or Swendsen–Wang dynamics) at criticality for any q ∈ (1, 4] (including
even the case of FK-Ising model q = 2). The rigorous lower bound of Li and
Sokal [61] bounds an auto-correlation time by the static critical exponents of the
model—namely the specific heat—but in the only case where these exponents are
known rigorously, q = 2, the specific heat actually vanishes.
Question 1.5.3. Does the Swendsen–Wang dynamics for the critical Ising model
on (Z/nZ)2 exhibit polynomial slowdown? More generally, show there exist c(q) > 0
such that the continuous-time FK Glauber dynamics at q ∈ (1, 4] and p = pc(q)
have gap−1rc & nc (and similarly for gap−1
sw at q = 2, 3, 4).
Behavior in phase coexistence regimes
The latter half of this thesis turns to the behavior of the Glauber and Swendsen–
Wang dynamics in the phase coexistence regime of q > 4 and p = pc(q). In this
entire regime, in Chapter 5 we identify the exponentially slow mixing for all these
dynamics on (Z/nZ)2; we then turn to the interplay between metastability of phases,
and the ability of boundary conditions to break this metastability—however, those
results are all perturbative and only apply for q sufficiently large.
Problem 1.5.4. Show that the FK Glauber dynamics and Swendsen–Wang dynam-
ics at q > 4 and p = pc(q) on Λn with free or monochromatic boundary conditions
have a sub-exponential mixing time.
30
The above problem is primarily a question of an equilibrium nature, as it requires
various sharp estimates on order-disorder interfaces (see Section 7.1), which rely
on the perturbative cluster expansion approach, to be extended to all q > 4. An
open problem of a more dynamical nature is to translate the sub-exponential upper
bounds under free and monochromatic boundary conditions to the Potts Glauber
dynamics: at present all the dynamical tools used in Chapters 7–8 rely crucially on
the monotonicity of the FK dynamics.
Problem 1.5.5. Show that the Potts Glauber dynamics on Λn with free or
monochromatic boundary condition has gap−1p ≤ exp(no(1)) when q > 4 and
β = βc(q) (or even q sufficiently large and β = βc(q)).
In fact, as far as we know, it is even open to show for q 6= 2 a sub-exponential
upper bound on the mixing time of the Potts Glauber dynamics at low-temperatures
with monochromatic boundary conditions. One would expect a droplet shrinkage
governed by motion by mean-curvature (with a continuous-time mixing time of
Θ(n2) and order n inverse gap [11, 52]) to apply here as predicted for the low-
temperature Ising model. Of course it is a central open problem to establish this
Θ(n2) scaling, and the mean-curvature flow picture even for the Ising model at any
low but non-zero temperature (c.f., important progress at zero-temperature [17]).
Since this proposed picture drives our results in Chapters 7–8 for critical FK
dynamics, we close with the following broader, open-ended question.
Question 1.5.6. How closely does the behavior of the dynamical low-temperature
Ising model with plus boundary conditions translate to the critical FK model with
q > 4 and, say, free boundary conditions? What about to the Swendsen–Wang
dynamics of the critical Potts model with q > 4 and free boundary conditions?
31
Chapter 2
Preliminaries
In this chapter, we review the definitions of the Potts and random-cluster prop-
erties, discuss important properties of their phase transition in Z2, and introduce
some of the tools that will be central to our analysis throughout this thesis. More
specialized tools and techniques (e.g., cluster expansion) which will only appear
in specific chapters are defined locally therein. In Section 2.1, we describe the
Potts and random-cluster models on subsets of Z2, define the relevant notation,
and describe the rich phase transition they undergo at the critical βc(q) and pc(q)
respectively. In Section 2.2, we overview key definitions from the theory of Markov
chain mixing times, define the natural Markov chains for the Potts and random-
cluster models, and describe the main dynamical tools we will appeal to, in order
to obtain the desired bounds on their mixing time and spectral gap.
2.1 The Potts and FK models
We begin by redefining the Potts and random-cluster (FK) models and describing
their phase transition in Z2. For a more detailed survey of the random-cluster
32
model, and many of the key ideas described in this section, refer to [47].
2.1.1 Potts model
The (ferromagnetic) q-state Potts model on a graph G = (V,E) is the probability
distribution over configurations σ ∈ Ωp = [q]V (viewed as assignments of colors
out of [q] = 1, ..., q to the vertices of G) in which the probability of σ w.r.t. the
inverse-temperature β > 0 and the boundary conditions ζ (an assignment of colors
in [q] to the vertices of some boundary subset H ⊂ V , i.e. ζ ∈ [q]H) is given by
µζG,β,q(σ) =1
Zp
1σH = ζ exp(− β
∑u∼v
1σ(u) 6= σ(v)),
where the sum is over unordered pairs of adjacent vertices (u, v) ∈ E, σH indicates
the restriction of σ to H, and the normalizing constant Zp is the partition function.
In this thesis, we focus on graphs that are rectangular subsets of Z2 with nearest
neighbor edges ((u, v) ∈ E if |u− v| = 1) and vertex set
Λn,m := J0, nK× J0,mK = k ∈ Z : 0 ≤ k ≤ n × k ∈ Z : 0 ≤ k ≤ m ,
so that the notation Ja, bK stands for k ∈ Z : a ≤ k ≤ b. We use the abbreviated
form Λn = Λn,n and just Λ when both n and m are made clear from the context.
For general subsets S ⊂ Z2, the edge-set E(S) is the set of all edges of Z2 having
both ends in S; the boundary vertex set ∂S will be the set of vertices in S with a
neighbor in Z2 − S; we set the interior So = S − ∂S with edge set E(S)− E(∂S).
When considering rectangles Λ, denote the northern (top) boundary of Λ by
∂sΛ := J0, nK× n, define ∂s, ∂w and ∂e analogously, and let multiple subscripts
33
denote their union, i.e., ∂e,wΛ = ∂eΛ ∪ ∂wΛ.
2.1.2 Random-cluster (FK) models
For a graph G = (V,E), a random-cluster (FK) configuration ω ∈ Ωrc = 0, 1E
assigns binary values to the edges of G, either open (1) or closed (0). (In the
context of boundary conditions, these are often referred to instead as wired and free,
respectively). A cluster is a maximal connected subset of vertices that are connected
by open bonds, where singletons count as individual clusters. Every configuration
ω is naturally identified with a subset of E given by e ∈ E : ω(e) = 1.
For a boundary subset H ⊂ V , an FK boundary condition ξ is a partition of
the vertices in H. For a boundary subset H ⊂ V and an edge subset F ⊂ E, we
say that a configuration ωF on F induces a boundary condition ξω if for every
v, w ∈ H, they are in the same element of the partition ξω if and only if they
are connected by a set of open edges in ωF (i.e., v and w are wired in ξω). We
sometimes use ωH to indicate the boundary partition induced by ω on the vertices
of H, and if v, w are wired in a boundary condition ξ, we write vξ←→ w.
In the specific case where S ⊂ Z2, a boundary condition ξ on ∂S is said to
be realizable if there exists an FK configuration ω on E(Z2)− E(S) such that ω
induces ξ: i.e., u, v ∈ ∂S are in the same connected component of ω if and only if
they are wired in ξ.
Remark 2.1.1. Realizable boundary conditions are the most natural class of
boundary conditions (see [47]) since they are compatible with the geometry (i.e.
planarity) of Z2. However, non-realizable boundary conditions are still relevant
in some cases; for example, when considering the random-cluster model on non-
lattice graphs such as trees. In Section 6.5, we consider the mixing time of the FK
34
dynamics under non-realizable boundary conditions.
For p ∈ (0, 1) and q > 0, the FK model on G with a boundary condition ξ on
H ⊂ V is the probability measure over configurations ω ∈ 0, 1E given by
πξG,p,q(ω) =1
Zrc
po(ω)(1− p)|E|−o(ω)qk(ω;ξ),
where o(ω) is the number of open edges in ω and k(ω; ξ) counts the number of
connected components in the augmented graph (V, ωξ) and ωξ adds auxiliary edges
between all pairs of vertices in H that are in the same element of ξ. The partition
function Zrc is again the proper normalizing constant.
We sometimes interchange vertex sets with the subgraph they induce; e.g., the
random-cluster configuration on a set R ⊂ Z2 corresponds to the random-cluster
configuration in the subgraph induced by (R,E(R)). We omit the subscripts p, q
when understood from context.
Edwards–Sokal Coupling
The Edwards–Sokal coupling [35] provides a way to move back and forth
between the Potts model and the random-cluster model on a given graph G for
q ∈ 2, 3, . . .. The joint probability assigned by this coupling to (σ, ω), where
σ ∈ Ωp is a q-state Potts configuration at inverse-temperature β > 0 and ω ∈ Ωrc
is an FK configuration with parameters (p = 1− e−β, q), is proportional to
∏(xy)∈E(G)
[(1− p)1ω(xy) = 0+ p1ω(xy) = 1, σ(x) = σ(y)
].
35
It follows that, starting from a Potts configuration σ ∼ µG,β,q, one can sample an FK
configuration ω ∼ πG,p,q by letting ω(e) = 1 (e ∈ ω) with probability p = 1− e−β if
the endpoints x, y of the edge e have σ(x) = σ(y), and ω(e) = 0 (e /∈ ω) otherwise.
Conversely, from ω ∼ πG,p,q, one obtains σ ∼ µG,β,q by assigning an i.i.d. color in [q]
to each cluster of ω (i.e., σ(x) assumes that color for every vertex x of that cluster).
In the presence of boundary conditions ζ ∈ [q]∂G for the Potts model, it is
possible to sample σ ∼ µζG,β,q using the random-cluster model as follows. Associate
to ζ the FK boundary conditions ξ that wire x, y ∈ ∂G to each other if and only
if ζx = ζy, i.e., the partition corresponding to ξ is given by the color-classes of ζ.
Further denote by Fζ the random-cluster event that no boundary sites of different
colors under ζ are connected via ω: no two boundary sites x, y with ζ(x) 6= ζ(y)
are connected via ω in G:
Fζ =ω ∈ 0, 1E(G) : x
ω←→ y =⇒ ζx = ζy, for all x, y ∈ ∂G.
Then one can sample a configuration of µζΛ,β,q by first sampling ω ∼ πξΛ,p,q(· | Fζ)
for p = 1− e−β, then coloring the boundary clusters as specified by ζ, and coloring
every other cluster by an i.i.d. color uniformly over [q]. For further details, see [64],
where sampling under Fζ was described in the context of the FK-Ising model.
Monotonicity and the FKG inequality
While the Potts model for q 6= 2 does not obey any monotonicity property, the
FK model indeed does as long as q ≥ 1; this is central to the analysis of the Potts
model via its FK representation, and plays a crucial role in most of the proofs
throughout this thesis. The absence of this monotonicity is also the central obstacle
36
to a better understanding of the FK model when q ∈ (0, 1).
Let ω ≤ ω′ in the natural partial order on configurations, i.e., if ω(e) ≤ ω′(e)
for every e ∈ E. A function is called increasing if it is non-decreasing in this partial
order, and an event is called increasing if its indicator function is increasing (so
that the event is closed under addition of (open) edges). For every q ≥ 1 and every
p ∈ [0, 1], the model enjoys the FKG inequality [37]:
EπξG,p,q[f(ω)g(ω)
]≥ EπξG,p,q [f(ω)]EπξG,p,q [g(ω)] for every increasing f, g ,
and in particular, taking f and g to be indicator functions,
πξG,p,q(A ∩B) ≥ πξG,p,q(A)πξG,p,q(B) for every increasing events A,B .
This FKG inequality yields a monotonicity in boundary conditions for the FK
model. Define a partial order over boundary conditions by ξ ≤ η if the partition
corresponding to ξ is finer than that of η. The extremal boundary conditions
then, are the free boundary where ξ = v : v ∈ ∂G, which we denote by ξ = 0,
and the wired boundary where ξ = ∂G, denoted by ξ = 1. When q ≥ 1, the
FK model satisfies the following monotonicity in boundary conditions: if ξ, η are
two boundary conditions on ∂G with ξ ≤ η, then πξG,p,q πηG,p,q, where denotes
stochastic domination.
37
The Domain Markov property
The domain Markov property of the Potts model states that on a graph G =
(V,E), for every subset S ⊂ V and every configuration η
µ(σS−∂S ∈ · | σ(V−S)∪∂S = η) = µη∂SS
where, throughout the thesis, σS = σS = σ(S) denote the restriction of a configu-
ration σ to S. The Domain Markov property of the FK model states that, on any
graph G = (V,E), for every subgraph (S,E(S)) and every FK configuration ζ,
π(ωE(S) ∈ · | ωE−E(S) = ζ
)= πζS ,
where the boundary conditions ζ is the partition induced by the configuration ζ on
the vertices of ∂S.
Planar duality
Let Λ∗n = (Λ∗n, E(Λ∗n)) denote the planar dual of Λn. That is, Λ∗n corresponds
to the set of faces of Λn, and for each e ∈ E(Λn), there is a dual edge e∗ ∈ E(Λ∗n)
connecting the two faces bordering e. The FK distribution satisfies πΛn,p,q(ω) =
πΛ∗n,p∗,q(ω
∗), where ω∗ is the dual configuration to ω given as follows: for every
primal edge e and its dual-edge e∗ (intersecting at their midpoints) ω∗(e∗) = 1 if
and only if ω(e) = 0, and where
p∗ =q(1− p)
q(1− p) + p.
38
Under a realizable boundary condition ξ, this distributional equality becomes
πξΛn,p,qd= πξ
∗
Λ∗n,p∗,q∗ , where ξ∗ is the boundary condition induced by taking the dual
configuration of the configuration on E(Z2)− E(Λn) identified with ξ. Notice that
Z2 is isomorphic to its dual. The unique value of p satisfying p = p∗, denoted psd(q),
is called the self-dual point.
Infinite-volume Gibbs measures
Extra care is needed to define the Potts and FK models on infinite graphs,
as their partition functions become infinite. Infinite-volume Gibbs measures are
therefore defined via what is known as the DLR conditions : for an infinite graph
G = (V,E), a Potts measure µG,β,q on [q]V , defined in terms of its finite-dimensional
distributions, satisfies the DLR conditions if for every finite subset S ⊂ V ,
EµG,β,q(σG\S∈·)[µG,β,q
(σS ∈ · | σG\S
)]= µG,β,q(σS ∈ ·) .
An infinite-volume FK measure is defined analogously.
On infinite lattices, e.g., Z2, infinite-volume Gibbs measures can arise as weak
limits of finite-volume measures, say n → ∞ limits of the Potts/FK models on
Λn with certain prescribed boundary conditions. In particular, exploiting the
monotonicity in boundary conditions, all possible infinite-volume FK measures
lie in between (stochastically) the extremal ones obtained by taking wired and
free boundary conditions on Λn: we denote these limiting distributions by π1Z2,p,q
and π0Z2,p,q, and note that they may or may not be distinct. The uniqueness/non-
uniqueness of infinite-volume Gibbs measures is a canonical way by which the Potts
phase transition is characterized.
39
Potts and FK phase transitions
The FK model on Z2 exhibits a phase transition characterized by the emergence
of an infinite connected component. That is, there exists a critical value p = pc(q)
such that if p < pc(q), then, almost surely, all connected components of ω ∼ π1Z2,p,q
are finite, and if p > pc(q) then ω ∼ π0Z2,p,q almost surely has an infinite component.
For q ≥ 1, the exact value of pc(q) for Z2 was recently settled in [5], proving
pc(q) = psd(q) =
√q
√q + 1
,
(previously known only at q = 1 and q = 2 due to [54] and [56, 75]).
In particular, the work [5] implied that for q ≥ 1, for every p 6= pc(q), there is
a unique infinite-volume Gibbs measure so that πZ2,p,q := π1Z2,p,q = π0
Z2,p,q. By the
Edwards–Sokal coupling, the implications of this picture for the infinite-volume
measures of the Potts model are the following: when β < βc(q), the Potts model
has a unique infinite-volume measure, whereas when β > βc(q), it has multiple,
with infinite-volume measures consisting of mixtures of q extremal measures corre-
sponding to situations in which the FK infinite cluster gets each of the q possible
colors.
For two vertices x, y ∈ V , denote by x ←→ y the event that x and y belong
to the same connected component of ω, and for subsets S,R ⊂ V , denote by
S ←→ R the event that there exist x ∈ S, y ∈ R such that x ←→ y. With this
notation in hand, the FK phase transition is sharp in the sense that there is a
strong exponential decay of connectivities (which we sometimes refer to as the EDC
property). Namely, a consequence of the results in [2, 5] is that for every q > 1 and
p < pc(q), there is a c = c(p, q) > 0 such that for every boundary condition ξ and
40
all u, v ∈ Λn,
πξΛn,p,q(uΛn←→ v) ≤ e−cd(u,v) , (2.1.1)
where d(u, v) is the graph distance between u, v in Z2 and uΛn←→ v denotes that
there is an open path between u and v in the FK configuration on E(Λn) (not
using the connections of ξ). By the planar duality on Z2, when p > pc(q), there is
an analogous exponential decay of dual-connections.
At its critical point p = pc(q), we say the FK phase transition is continuous
or second-order if there exists a unique infinite-volume Gibbs measure (having no
infinite cluster), and discontinuous or first-order if there exist multiple infinite-
volume Gibbs measures (and under π1Z2,p,q there is almost surely an infinite cluster).
Our results at criticality hinge on the results of [34] characterizing the behavior
of the FK model at p = pc(q), summarized as follows, and sometimes referred to as
a dichotomy theorem.
Theorem 2.1.2 ([34, Theorem 3]). Let q ≥ 1; the following statements for the
critical FK model on Z2 are equivalent:
1. Discontinuous phase transition: π0Z2,pc,q
6= π1Z2,pc,q
.
2. Exponential decay of correlations under π0Z2,p,q: there exists c(p, q) > 0 so that
π0Z2,pc,q
((0, 0)←→ ∂J−n, nK2
)≤ e−cn . (2.1.2)
As mentioned, prior to the breakthrough work of [34], continuity of the phase
transition was known at q = 1 due to the seminal work of [54] and at q = 2
by Onsager, who also established that at p = pc(q) and q = 2, correlations
41
decay polynomially with critical exponent 14. Then along with the above theorem,
Duminil-Copin, Sidoravicius and Tassion [34] proved that the phase transition
is continuous at every q ∈ (1, 4]; this is believed to be the case for all q ∈ (0, 4]
but the absence of FKG inequalities leaves this as a major open problem in
the window q ∈ (0, 1) [4, 47]. Their proof combined the machinery of discrete
holomorphic observables introduced by Smirnov [88] with Russo–Seymour–Welsh
(RSW) estimates on crossing probabilities; this latter tool will be used extensively
in our proofs of Theorems 1 and 3 and is described in detail in Chapter 3.
Discontinuity of the phase transition, conjectured for all q > 4, was first
proved by Kotecky and Shlosman [55] for sufficiently large q using a perturbative
approach known as cluster expansion; the proof in [58] applies whenever q1/4 >
(κ+√κ2 − 4)/2, where κ is the connective constant of Z2. Plugging in the rigorous
bound κ < 2.6792 due to [79] affirms the phase coexistence for all q > 24.78. More
recently, by using relations between the FK model and the six-vertex model, [32]
verified that indeed the phase transition is discontinuous for all q > 4. As a
consequence, there are two extremal infinite-volume FK measures π1Z2,pc,q
6= π0Z2,pc,q
corresponding to the wired and free phases: this induces q + 1 distinct extremal
Potts measures on Z2, with q corresponding to the low-temperature phases where
a single color is dominant, along with a q + 1’th disordered phase.
2.2 Markov chain mixing times
In this section, we describe some of the central ideas from the theory of Markov
chain mixing times. We then define the (heat-bath) Glauber dynamics for the Potts
and random-cluster models, as well as the Swendsen–Wang dynamics, and present
42
the central techniques of block dynamics, censoring, and canonical paths to the
analysis of dynamics for spin-systems. For more details on Markov chain mixing
times and Glauber dynamics see [60] and [67], respectively.
Consider a discrete-time Markov chain (Xt)t≥0 with finite state space Ω, transi-
tion kernel P and stationary distribution π. In the continuous-time setting, instead
of P t consider the heat kernel given by
Ht(x, y) = Px(Xt = y) = etL(x, y) ,
where Px(Xt = y) = P(Xt = y | X0 = x) and L = limt↓01t(Ht − I) is the generator.
(We will define many of the relevant quantities (e.g., coupling distance, mixing
time) in discrete time, with the continuous-time versions obtained by replacing P t
by Ht and the transition probabilities P (x, y) by the rates c(x, y).)
Spectral gap
The mixing time of the Markov chain is closely related to the gap in its spectrum:
in discrete-time, gap := 1 − λ2 where λ2 is the second largest eigenvalue of P ;
in continuous-time it is the smallest non-zero eigenvalue in the spectrum of the
generator L. An important variational characterization of the gap is given by
gap = inff∈L2(π)
Varπ(f)>0
E(f, f)
Varπf, (2.2.1)
where the Dirichlet form E(f, g) is defined by
E(f, g) =1
2
∑x,y∈Ω
π(x)P (x, y)(f(y)− g(x))2 , (2.2.2)
43
for a discrete-time Markov chain, and by 〈−Lf, g〉L2(π) for a continuous-time process.
Mixing times
Denote the (worst-case) total variation distance between Xt and π by
dtv(t) = maxx∈Ω‖P t(x, ·)− π‖tv ,
where the total variation distance between two probability measures ν, π on Ω is
‖π − ν‖tv = supA⊂Ω
[π(A)− ν(A)] = 12‖π − ν‖L1 .
Further define the coupling distance
dtv(t) = maxx,y∈Ω
‖P t(x, ·)− P t(y, ·)‖tv ,
noting that dtv is submultiplicative and dtv(t) ≤ dtv(t) ≤ 2dtv(t). The total
variation mixing time of the Markov chain with respect to the precision parameter
0 < δ < 1 is
tmix(δ) = inftt : max
x∈Ω‖P t(x, ·)− π‖tv < δ .
For any choice of δ < 12, the quantity tmix(δ) enjoys submultiplicativity thanks
to the aforementioned connection with dtv; we write tmix, omitting the precision
parameter δ, to refer to the standard choice of δ = 1/(2e).
Another useful characterization of the total-variation distance is as
‖π − ν‖tv = inf(X,Y )∼(π,ν)
P(X 6= Y )
44
where the infimum is over all couplings Pr of the two measures π and ν. A (one step)
coupling of the Markov chain M specifies, for every pair of states (Xt, Yt) ∈ Ω× Ω,
a probability distribution over (Xt+1, Yt+1) such that the processes Xt and Yt,
viewed in isolation, are faithful copies ofM, and if Xt = Yt then Xt+1 = Yt+1. The
coupling time, denoted Tcoup, is the minimum T such that P(XT 6= YT ) ≤ 1/(2e),
starting from the worst possible pair of configurations X0, Y0. The following
inequality is standard : tmix ≤ Tcoup.
Relation between inverse spectral gap and mixing time
The total variation mixing time is bounded from below and from above via
the spectral gap, up to polynomial factors in the number of vertices. Let gap? be
the absolute spectral gap, which, for a discrete-time chain, is defined as gap? =
mini(1− |λi|) where the λi’s are the nontrivial eigenvalues of the transition kernel.
In our applications, as the spectrum of P or L is positive, gap? = gap. Then, for
any aperiodic, reversible Markov chain P with stationary measure π,
gap−1 − 1 ≤ tmix ≤ gap−1? log(2e · π−1
min) , (2.2.3)
where πmin := minxπ(x) (see, e.g., [60, §12.2]). For the FK and Potts models on a
box with O(n2) vertices and fixed 0 < p < 1 and q ≥ 1, there exists c(p, q) > 0 such
that πmin & e−cn2, from which it follows that tmix are gap−1 are equivalent up to
nO(1)-factors. As our theorems are primarily at polynomial in n levels of precision,
this comprarison (2.2.3) justifies our moving back and forth between spectral gap
and mixing time.
45
2.2.1 Dynamics for Potts and FK models
Heat-bath Potts Glauber dynamics
The discrete-time heat-bath Glauber dynamics for the Potts model onG = (V,E)
with boundary conditions η is the following reversible Markov chain w.r.t. µG,β,q:
given a configuration σ(t), one generates the configuration σ(t+ 1) by selecting a
vertex v in V uniformly at random, and (1) letting σw(t+ 1) = σw(t) for all w 6= v
and (2) sampling σv(t+ 1) according to µηG,β,q(σv ∈ · | (σw(t))w 6=v) . In particular,
the probability that the new color assigned to v will be k ∈ [q] is proportional to
exp(β∑
w∼v 1σw(t) = k).
The continuous-time heat-bath Glauber dynamics for the Potts model assigns
i.i.d. rate-1 Poisson clocks to all vertices in V ; when a clock at a site v rings, the
chain resamples σv according to the same rates described above.
Heat-bath FK Glauber dynamics
The discrete-time heat-bath Glauber dynamics for the FK model on G = (V,E)
with boundary conditions ξ is the following reversible Markov chain w.r.t. πG,p,q.
Given a configuration ω(t), one generates ω(t + 1) by selecting an edge e ∈ E
uniformly at random, and (1) letting ωf(t + 1) = ωf(t) for all f 6= e and (2)
sampling ωe(t+ 1) according to Bernoulli(p) if x←→ y in the configuration ω(t)
restricted to E − e (counting the connections induced by ξ), and according
to Bernoulli( pp+q(1−p)) otherwise. Its continuous-time version is defined by again
assigning each edge an i.i.d. rate-1 Poisson clock, and when the clock at an edge e
rings, resampling it as in item (2).
46
FK Glauber dynamics and planar duality
Each run of the FK-dynamics on Λn, with realizable boundary conditions ξ
and parameters p, q, determines a valid run of the FK-dynamics on the dual graph
Λ∗n with boundary conditions ξ∗ and parameters p∗, q. (Simply identify the FK
configuration in each step with its dual configuration; it can be straightforwardly
verified that the transitions of the FK-dynamics on the dual graph occur with the
correct probabilities.) Hence, the two dynamics have the same mixing times.
Remark 2.2.1. The edge-set of the dual graph Λ∗n is not exactly in correspondence
with the edge-set of a rectangle Λ∗ = −12, ..., n+ 1
2 × −1
2, ..., n+ 1
2 as it does
not include any edges that are between boundary vertices of Λ∗. All the proofs in
the paper carry through, only with the natural minor geometric modifications, to
the case of rectangles Λn with modified edge-set that only contains edges edges
with at least one endpoint in Λn \ ∂Λn. The dual of this modified graph is then a
(n−1)×(n−1) rectangle with all nearest-neighbor edges. With these considerations,
it often suffices for us to prove our theorems for p < pc(q). For example, the dual of
a realizable boundary condition is also realizable (induced by the dual configuration
on Z2 \ Λn), so it is sufficient to prove Theorem 5 for p < pc(q).
Notice that the definition of realizability of the boundary condition is slightly
different depending on the above choice of edge-set for a rectangle (while ξ is
encoded in a configuration on E(Z2) \ E(Λn), its dual ξ∗ would be encoded in a
configuration E((Z2)∗) \ (E(Λ∗) \ E(∂Λ∗))). However, it is easy to see that these
two notions of realizability are equivalent: a partition of ∂Λn can be encoded in a
configuration on E(Z2) \ E(Λn) if and only if it can be encoded in a configuration
on E(Z2) \ (E(Λn) \ E(∂Λn))
47
Cluster dynamics
Swendsen–Wang dynamics for the q-state Potts model on G = (V,E) at inverse-
temperature β is the following discrete-time Markov chain reversible w.r.t. µG,β,q.
From a spin configuration σ ∈ Ωp on G, generate a new state σ′ ∈ Ωp as follows.
1. Introduce auxiliary FK edge variables (ωe)e∈E as follows: for e = xy, set
ωe = 0 if σx 6= σy and ωe ∼ Bernoulli(p) if σx = σy.
2. For every connected component C of the graph (V, e : ωe = 1), assign an
i.i.d. color ΞC uniform among [q], and set σ′x = ΞC for every x ∈ C.
In the presence of boundary conditions, step (2) of the Swendsen–Wang dynamics
does not reassign the color of any cluster that contains a boundary vertex; instead
for those clusters, the color assignment is dictated by the boundary condition.
Remark 2.2.2. The definition of the Swendsen–Wang dynamics indicates why
it has fast mixing at low-temperature whereas the Potts Glauber dynamics slows
down on (Z/nZ)2. When β > βc, the metastable states are the q ordered phases,
corresponding to the q colors; but step (2) above recolors all FK clusters, and in
particular, reassigns the large macroscopic cluster of the ordered phase an i.i.d. color,
allowing the dynamics to easily jump between the q metastable states. However, as
it relies heavily on the FK representation of the model, the order-disorder energy
barrier may still be hard to overcome when both ordered and disordered phases are
metastable, e.g., when π1Z2 6= π0
Z2 .
Chayes–Machta dynamics for the FK model on G = (V,E) with parameters
(p, q), for q ≥ 1 and 0 < p < 1, is the following analogous discrete-time reversible
Markov chain: From an FK configuration ω ∈ Ωrc on G, generate a new state
ω′ ∈ Ωrc as follows.
48
1. Assign each cluster C of ω an auxiliary i.i.d. variable XC ∼ Bernoulli(1/q).
2. Resample every e = xy such that x and y belong to clusters with XC = 1 via
i.i.d. random variables Xe ∼ Bernoulli(p), to obtain the new configuration ω′.
In the presence of boundary conditions, step (1) counts clusters of ω by including
the auxiliary connections dictated by the boundary conditions.
Variants of Chayes–Machta dynamics with 1 ≤ k ≤ bqc “active colors” have
also been studied, with numerical evidence for k = bqc being the most efficient
choice; see [39].
Spectral gap comparisons
The following comparison inequalities between the above Markov chains are
due to Ullrich (see [91, Thm. 1], [92, Thm 4.8 and Lem. 2.7]).
Theorem 2.2.3 ([91, 92]). Let q ≥ 2 be integer. Let gapp and gaprc be the spectral
gaps of discrete-time Glauber dynamics for the Potts and FK models, respectively,
on a graph G = (V,E) with maximum degree ∆ and no boundary conditions, and
let gapsw be the spectral gap of Swendsen–Wang. Then we have
gapp ≤ 2q2(qe2β)4∆gapsw , (2.2.4)
(1− p+ p/q)gaprc ≤ gapsw ≤ 8gaprc |E| log |E| . (2.2.5)
The proof of (1.2.2) further extends to all real q > 1, whence
gaprc . gapcm . gaprc |E| log |E| , (2.2.6)
as was observed (and further generalized) by Blanca and Sinclair [10, §5], where
49
gapcm is the spectral gap of Chayes–Machta dynamics, and the constants in the
comparisons . are independent of ∆ and depend only on p and q. In particular,
because the estimates of Theorem 1.2.2 hold for general graphs G and are formulated
in the absence of boundary conditions, they hold immediately for Λ with free
boundary conditions, and with periodic boundary conditions (where it is identified
with the model on (Z/nZ)2 with no boundary conditions).
Potts boundary conditions and spectral gap comparisons
The comparison estimates in Theorem 1.2.2 are only valid in the absence of
boundary conditions, whereas much of this thesis considers boundary conditions
other than periodic and free. For other boundary conditions, we can deform Λn as
in the following remark; this could, however, distort the maximum degree ∆ by an
order n factor, leading to an exponential in n cost in (1.2.1).
Remark 2.2.4. For any FK boundary condition ξ on G, we can define a (not
necessarily planar) graph G by identifying all vertices of each boundary component
of ξ with a single vertex in G, and keeping the same edge structure. Then the FK
Glauber dynamics on G with no boundary conditions is the same as that on G
with boundary conditions ξ. In such a case, let ∂G denote the set of vertices in G
that arise from the boundary components of G.
Remark 2.2.5. The exponential dependence on ∆ in Eq. (1.2.1) can be improved
to exponential in the maximum degree of all but one vertex (see [91, Theorem 1’]),
whence ∆ in Eq. (1.2.1) can be replaced with the second largest degree of G. This
implies that Theorem 9 in fact also holds for the Potts Glauber dynamics.
The following is a consequence of the spin-flip symmetry of the Swendsen–Wang
dynamics.
50
Fact 2.2.6. Consider Swendsen–Wang dynamics on G with Potts boundary con-
ditions η, by considering the graph G, where boundary vertices of each color are
identified as single vertices. Let gapsw(Gη) be the spectral gap of Swendsen–Wang
dynamics on G with b.c., η, and let gapsw(Gη) be the spectral gap of Swendsen–
Wang dynamics on G with ∂G assigned the colors given by η. If ∂G consists of at
most one vertex, then
gapsw(Gη) = gapsw(Gη) = gapsw(G) .
Remark 2.2.5 and Fact 2.2.6 imply that for Λn with FK boundary conditions ξ
with at most one nontrivial boundary component, corresponding to Potts b.c., η
(an assignment of a color, e.g., red, to the nontrivial component of ξ and no other
color assignments),
gaprc . gapsw . n2 log n · gaprc .
Notice that all of the boundary conditions considered in Theorems 9–11 are of this
form. Thus, it suffices to prove Theorems 9–11 for the FK Glauber dynamics and
the implication for Swendsen–Wang dynamics follows immediately.
2.2.2 Dynamical tools
Monotonicity and the grand coupling
A discrete-time Markov chain with state space Ω and transition kernel P is
monotone if µP νP for every two probability distributions µ, ν on Ω such that
µ ν. In continuous-time, this translates to Ht(µ, ·) Ht(ν, ·) for every µ ν.
The random mapping representation of the discrete-time FK Glauber dynamics
51
on a graph G = (V,E) views the updates as a sequence (Ji, Ui)i≥1, in which Ji’s are
i.i.d. uniform edges (the updated locations), and the Ui’s are i.i.d. uniform on [0, 1]:
starting from an initial configuration ω0, at time i, writing Ji = (x, y), the dynamics
replaces the value of ω(Ji) by 1Ui ≤ p if x←→ y in the configuration on E−Ji
(in the presence of boundary conditions, including boundary connections) and by
1Ui ≤ pp+q(1−p) otherwise. The grand coupling for FK Glauber dynamics is a
coupling of the chains from all initial configurations on G which, via the above
random mapping representation, uses the same update sequence (Ji, Ui)i≥1 for each
one of these chains. (In continuous time, the random mapping representation would
also include the sequence of update times (Ti)i≥1.) Using this representation and
the FKG inequality, one sees that heat-bath Glauber dynamics for the FK model
at q ≥ 1 is monotone.
Thus, for q ≥ 1, this coupling preserves the partial ordering of the Markov chains
started from all possible initial configurations, at all times t ≥ 0. In particular,
under the grand coupling, the value of an edge e in Glauber dynamics at time t
from an arbitrary initial state ω0, is sandwiched between the corresponding values
from the free and wired initial states; thus, by a union bound over all edges,
maxx∈Ω‖P t(x, ·)− π‖tv ≤ max
x,y∈Ω‖P t(x, ·)− P t(y, ·)‖tv
≤ |E(Λ)| ‖P t(1, ·)− P t(0, ·)‖tv . (2.2.7)
Censoring inequality
Key to our proof will be the Peres–Winkler [76] censoring inequality for monotone
systems. While the theorem of [76] and its subsequent applications in e.g., [63, 70]
are stated for spin systems whose sites are the vertices of the underlying graph,
52
one can view the edges as the sites by considering the appropriate line graph; it is
then easy to verify that the FK Glauber dynamics satisfies the conditions of [76,
Theorem 1.1]. Further, while Theorem 1.1 of [76] is stated for the discrete-time
dynamics, its formulation in continuous-time follows from the same proof: see
also [70, Theorem 2.5].
Theorem 2.2.7 ([76]). Let µT be the law of continuous-time Glauber dynamics
at time T of a monotone system on Λ with invariant measure π, whose initial
distribution µ0 is such that µ0/π is increasing. Set 0 = t0 < t1 < . . . < tk = T for
some k, let (Bi)ki=1 be subsets of Λ, and let µT be the law at time T of the censored
dynamics, started at µ0, where only updates within Bi are kept in the time interval
[ti−1, ti). Then ‖µT − π‖tv ≤ ‖µT − π‖tv and µT µT ; moreover, µT/π and µT/π
are both increasing.
Block dynamics
A key ingredient in the proof of [64], as well as our proof of Theorem 1, is the
block dynamics technique due to Martinelli (see [67, §3]) for bounding the spectral
gap of the Glauber dynamics. SupposeB1, ..., Bk are such thatB1−∂B1, ..., Bk−∂Bk
covers Λ− ∂Λ. Then the block dynamics is the corresponding Glauber dynamics
that updates one block (instead of one site) at a time: each block is assigned a rate-1
Poisson clock; when the clock at Bi rings, resample the configuration on Bi − ∂Bi
according to µσBi where the boundary conditions σ are given by the configuration
restricted to Λ− (Bi − ∂Bi).
Theorem 2.2.8 ([67, Proposition 3.4]). Consider the continuous-time Glauber
dynamics for the Potts model on Λ with boundary condition ζ, which is reversible
w.r.t. the Gibbs distribution µζΛ. Let gapp(Λζ) and gapB(Λζ) respectively be the
53
spectral gaps of the single-site dynamics on Λ and block dynamics corresponding to
B1, . . . , Bk such that Bo1, ..., B
ok cover Λo. Then letting χ = supx∈Λ #i : Bi 3 x,
we obtain
gapp(Λζ) ≥ χ−1gapB(Λζ) min
i=1,..,kminϕ
gapp(Bϕi ) .
An analogue in the context of FK Glauber dynamics is described in §6.1.6.
Canonical paths
The following well-known geometric approach (see [28, 29, 53, 86] as well as [60,
Corollary 13.24]) serves as an effective method for obtaining an upper bound on
the inverse gap of a Markov chain, and will be used in our proof of Theorem 8.
Theorem 2.2.9. Let P be the transition kernel of a discrete-time Markov chain on
Ω, reversible w.r.t. π, and write Q(x, y) = π(x)P (x, y) for every x, y ∈ Ω. For each
(a, b) ∈ Ω2, assign a path ζ(a, b) = (x0 = a, . . . , xn = b) such that P (xi, xi+1) > 0 for
all i, and write |ζ(a, b)| = nab, identifying ζ(a, b) with (xi−1, xi) : i = 1, . . . , nab.
Then,
gap−1 ≤ maxx,y∈Ω
Q(x,y)>0
1
Q(x, y)
∑a,b∈Ω
(x,y)∈ζ(a,b)
nabπ(a)π(b). (2.2.8)
A very standard application of Theorem 2.2.9 (see e.g., [66] in the setting of
the Ising model) proves upper bounds on mixing times of spin systems in terms of
the cut-width of the underlying graph.
Lemma 2.2.10. Consider the Glauber dynamics for the q-state Potts model at
inverse temperature β on a rectangle Λn,l = J0, nK × J0, `K for 0 ≤ ` ≤ n, with
54
arbitrary boundary conditions η. There exists a constant c(β, q) > 0 such that
gapp(Ληn,l)−1 . ec` .
Remark 2.2.11. In the absence of boundary conditions, the same bound holds
for the FK models by making the natural modifications and observing that in
the FK setting, the probability of any single edge-flip is at least some c(p, q) > 0.
However, the introduction of boundary conditions, which can encode long-range
correlations, destroys the standard bound on the cut-width of the n× ` rectangle,
and prevents this from going through. In fact, we will see later that under general
FK boundary conditions, the bound of Lemma 2.2.10 does not hold. We are able to
prove a somewhat satisfactory exp[c` log n] analogue under arbitrary realizable FK
boundary conditions, by using a tree-like structure of planar boundary conditions
to recursively bound the cutwidth on n× ` graphs in Section 6.4. That bound may
be of independent interest.
In particular, Lemma 2.2.10 and Remark 2.2.11 directly imply upper bounds
that are exponential in n for the inverse gaps in Theorems 2, 4 and 5.3.1.
Boundary modifications
Due to the difficulties of dealing with problematic FK boundary conditions, it
will be helpful to modify boundary conditions, say to remove long-range boundary
connections, and perform the dynamical analysis with the modified boundary
conditions. Let ξ, ξ′ be a pair of FK boundary conditions on Λ with corresponding
55
FK Glauber dynamics mixing times tmix, t′mix; define
Mξ,ξ′ = ‖πξΛ/πξ′
Λ‖∞ ∨ ‖πξ′
Λ /πξΛ‖∞ .
It is well-known (see, e.g., [70, Lemma 2.8]) that for some c independent of n, ξ, ξ′,
gaprc(Λξ) ≤M3ξ,ξ′gaprc(Λξ′) and tmix ≤ cM3
ξ,ξ′|E(Λ)|t′mix (2.2.9)
(this follows from first bounding tmix via its spectral gap, then using the variational
characterization of the spectral gap: the Dirichlet form, expressed in terms of local
variances, gives a factor of M2ξ,ξ′ , and the variance produces another factor of Mξ,ξ′).
The Radon–Nikodym factor Mξ,ξ′ can easily be bounded by a distance between
ξ and ξ′.
Definition 2.2.12. For any pair of boundary conditions, ξ, ξ′ define the symmetric
distance function d(ξ, ξ′) as follows: if ξ′′ is the unique smallest (in the previously
defined partial ordering) boundary condition with ξ′′ ≥ ξ and ξ′′ ≥ ξ′, define
d(ξ, ξ′) = k(ξ)− k(ξ′′) + k(ξ′)− k(ξ′′), where k(ξ) is the number of elements in the
partition induced by ξ.
In particular, if ξ is a boundary condition on Λ and ξ′ is a refinement of ξ, then
ξ′ ≤ ξ and d(ξ, ξ′) = k(ξ′)− k(ξ).
For an initial configuration ω0, and boundary condition ξ, let
dξω0(t) = ‖Pξω0
(Xt ∈ ·)− πξΛ‖tv .
where here and throughout, for any Markov chain (Xt)t≥0, we abbreviate P(Xt ∈
· | X0 = ω0) with boundary conditions ξ, to Pξω0(Xt ∈ ·).
56
Lemma 2.2.13. Let ξ, ξ′ be a pair of boundary conditions on ∂Λ. Then,
Mξ,ξ′ ≤ q2d(ξ′,ξ) , (2.2.10)
Let P be a distribution over coupled pairs of boundary conditions (ξ, ξ′) such that
P-a.s., Mξ,ξ′ is at most some M∆. For some universal c > 0, if t′ = ct|E|−2M−4∆ ,
E[dξ1(t) ∨ dξ0(t)] ≤ e−M∆ + 8E[dξ′
1 (t′) ∨ dξ′
0 (t′)] . (2.2.11)
In particular, taking P to be a point mass on (ξ, ξ′), for every t > 0,
maxω0∈0,1
dξω0(t) ≤ 8 max
ω0∈0,1dξ′
ω0
(c|E(Λ)|−2q−8d(ξ′,ξ) t
)+ exp
(−q2d(ξ′,ξ)
). (2.2.12)
Proof. By definition of d(ξ, ξ′), for any configuration ω, the difference in the number
of clusters under boundary conditions ξ vs. ξ′, |k(ω; ξ)− k(ω; ξ′)| is at most d(ξ, ξ′).
By definition of the FK model, every additional cluster receives a weight of q.
In order to prove (2.2.11), begin with the observation that by (2.2.9),
E[dξ1(t) ∨ dξ0(t)] ≤ e−M∆ + P(tmix ≥ t/M∆) ≤ e−M∆ + P(t′mix ≥ |E|t′) .
For t′mix ≥ s, there must exist an ω0 such that dξ′ω0≥ 1/(2e). But by Eq. (2.2.7),
dξ′ω0
(s) ≤ 2|E(Λ)|(dξ′
1 (s) ∨ dξ′
0 (s)), which implies that
P(t′mix ≥ |E(Λ)|t′) ≤ P
(dξ′
1 (|E(Λ)|t′) ∨ dξ′
0 (|E(Λ)|t′) ≥ (4e|E(Λ)|)−1
).
57
As a consequence of Theorem 2.2.7, it is well-known (see [70, Corollary 2.7]) that
dξ′
1 (t) ∨ dξ′
0 (t) ≤(
4(dξ′
1 (t0) ∨ dξ′
0 (t0)))bt/t0c
,
and thus
P(dξ′
1 (|E(Λ)|t′) ∨ dξ′
0 (|E(Λ)|t′) ≥ (4e|E(Λ)|)−1)≤ 8E[dξ
′
1 (t′) ∨ dξ′
0 (t′)] .
Conductances and Cheeger’s inequality
A powerful geometric technique for proving lower bounds on mixing times is
constructing a set S ⊂ Ω that is a bottleneck for the Markov chain dynamics. For
a chain with transition kernel P (x, y) and stationary distribution π, let the edge
measure between A,B ⊂ Ω be
Q(A,B) =∑ω∈A
π(ω)∑ω′∈B
P (ω, ω′) , (2.2.13)
(see, e.g., [60, Chapter 7]). Then the conductance/Cheeger constant of Ω is
Φ? = minS⊂Ω
Φ(S) where Φ(S) =Q(S, Sc)
π(S)π(Sc). (2.2.14)
and the following relation between Φ and the spectral gap of the chain [60] holds:
2Φ ≥ gap ≥ Φ2/2 . (2.2.15)
58
Chapter 3
Equilibrium estimates at q ∈ [1, 4]
and RSW Theory
At a continuous phase transition point i.e., q ≤ 4 and p = pc(q), the equilibrium
behavior of the Ising, Potts and FK models are expected to have a beautiful,
conformally invariant structure with a scaling limit described by conformal field
theory (CFT). In particular, the models are expected to be characterized by their
algebraic critical exponents, convergence of interfaces to SLE’s, and convergence of
fields of local observables to Euclidean fields in CFT. Since the seminal works of
Schramm introducing SLE ([83] as well as e.g., [80, 84]) and Smirnov introducing
the framework of discrete holomorphic observables [87, 88], such a scaling limit has
been established in the case of the Ising model q = 2 e.g., [14, 22, 23, 40, 51, 88]
(as well as Bernoulli percolation q = 1 on the hexagonal lattice e.g., [15, 16, 87]).
One particular feature of this stunning picture is that the probability of, say,
an open FK cluster crossing an n × αn rectangle with bounded aspect ratio α
remains bounded away from zero and one as n → ∞ (indicative of criticality
59
from a renormalization perspective): this is known as a Russo–Seymour–Welsh
(RSW) bound. In this chapter, we use percolation techniques and RSW theory,
to prove necessary equilibrium estimates on crossing probabilities, arm-exponents,
and numbers of disjoint long-range crossings for the FK model at a continuous
phase transition point. In addition to being central to the dynamical analysis of
the Potts and FK models at p = pc(q) and q ∈ (1, 4] (namely Theorems 1 and 3),
many of these estimates are independently of interest; we point in particular, to
the bounds on the number of nested boundary components (bridges) in a typical
realizable boundary, derived in Section 3.2.
Throughout the chapter, take q ∈ (1, 4], let p = pc(q) and drop p, q from the
notation.
3.1 Crossing probabilities and RSW Theory
In this section we present estimates on crossing probabilities that will be used
to prove the desired mixing time bounds. Russo–Seymour–Welsh (RSW) bounds
were first introduced [81, 85] in the context of Bernoulli percolation, and are central
to the understanding of Bernoulli percolation at its critical point p = 12.
In [33], RSW estimates were proven for the FK representation of the Ising
model, q = 2, using the framework of discrete-holomorphic observables from [88];
in this setting, where the boundary conditions play a role, the bounds on crossing
probabilities held uniformly over the boundary conditions on the rectangle Λn,αn.
For 1 < q ≤ 4, the proof of continuity of the phase transition [34] relied crucially
on analogous RSW estimates, which we develop in detail in this section.
60
Crossing probability notation
Recall that for two vertices x, y ∈ V , we denote by x ←→ y the event that
x and y belong to the same cluster of ω. In the context of a subgraph S ⊂ G,
write xS←→ y to denote that x and y belong to the same cluster of ωE(S). For a
rectangular subset R ⊂ Z2, refer to the event
Cv(R) :=⋃
x∈∂sR , y∈∂nR
ω : x
R←→ y
as a vertical crossing of a rectangle R, and denote a horizontal crossing of the
rectangle R by
Ch(R) :=⋃
x∈∂eR , y∈∂wR
ω : x
R←→ y.
Consider a subset of Z2 of the form A = R2 −R1 where R1, R2 are rectangular
subsets of Z2 with R1 ( R2. Call such domains annuli, and define open circuits
as paths of nontrivial homology in A connecting a vertex x to itself. Denote the
existence of an open circuit in the annulus A by Co(A).
Finally, we add the ∗-symbol to the above crossing events to refer to the
analogous dual-crossings (configurations ω such that an open-crossing occurs in its
corresponding dual-configuration ω∗ on the appropriate dual subgraph).
Notice that crossing events e.g., x ←→ y, Ch(R) are increasing events, and
dual-crossing events are decreasing events.
61
RSW estimates
The following RSW estimates hold for the q ∈ [1, 4] FK model at p = pc(q)1
(note the difference between 1 < q < 4 and the extremal case q = 4, where full
RSW-type bounds up to the boundary are believed to fail [34]).
Theorem 3.1.1 ([34, Theorem 7]). Consider the critical FK model for 1 ≤ q < 4
on Λ = Λn,n′ with n′ = bαnc for fixed 0 < α ≤ 1. There exists p0 = p0(q, α) > 0
such that for every boundary condition ξ and all n,
πξΛ(Cv(Λ)) > p0 .
Theorem 3.1.2 ([34, Theorem 3]). Let q = 4 and consider the critical FK model
on Λ = Λn,n′ with n′ = bαnc for fixed 0 < α ≤ 1. Then for every ε, ε′ > 0 there
exists p0 = p0(α, ε, ε′) > 0 such that, for every boundary condition ξ and every n,
πξΛ(Cv(Jεn, (1− ε)nK× Jε′n′, (1− ε′)n′K) > p0 .
Theorem 3.1.3 ([34, Proposition 2]). Fix ε, ε′ > 0 and 0 < α ≤ 1, and consider
the critical FK model at 1 ≤ q ≤ 4 on the annulus A = Λn,n′ − Jεn, (1 − ε)nK ×
Jε′n′, (1 − ε′)n′K for n′ = bαnc. There exists p0 = p0(q, α, ε, ε′) so that, for every
boundary condition ξ and every n,
πξA (Co(A)) > p0 .
A consequence of the above RSW-type bounds is polynomial decay of correlations
1In [34], Theorems 3.1.2 and 3.1.3 were formulated for the special case of ε = ε′, but readilyextend to the more general setting presented here.
62
for the critical FK model at 1 ≤ q ≤ 4 (see, e.g., the proof of [34, Lemma 1]).
Theorem 3.1.4 (decay of correlations, [34]). For 1 ≤ q ≤ 4 and p = pc(q), there
exists a unique infinite-volume Gibbs measure πZ2, and there exist c1, c2 > 0 such
that
n−c1 . πZ2
((0, 0)←→ ∂J−n, nK2
). n−c2 .
It is believed that for every q ∈ [1, 4] the above decay of correlations behaves
as n−zq for some critical exponent zq > 0: this is known in the settings of q = 1
corresponding to critical percolation, and q = 2 corresponding to the Ising model.
In particular, this power-law decay at criticality suggests the conjectured conformal
invariance and fractal structure of the critical FK model on Z2 when q ≤ 4.
3.1.1 Crossing probabilities at q = 4
For 1 ≤ q < 4, the probability of a horizontal crossing of a rectangle with
arbitrary boundary conditions is uniformly bounded away from 0 (Theorem 3.1.1),
whereas at q = 4, under free boundary conditions, it is expected that the probability
of such a crossing of Λ in fact decays to 0 as n → ∞ [34]. We lower bound this
crossing probability under general boundary conditions.
Theorem 3.1.5. Let q = 4 and consider the critical FK model on Λ = Λn,n′, where
n′ = bαnc for a fixed aspect ratio 0 < α ≤ 1. There exist c(α), γ(α) > 0 such that
for every boundary condition ξ and every n,
πξΛ (Ch(Λ)) > cn−γ .
Proof. By monotonicity in boundary conditions, it suffices to prove the above
63
12n′
12n′ + 4α logn
12n′ + 8α logn
logn 2 logn 3 logn 5 logn 11 logn
R0
R1
R2
R3
R4
Figure 3.1: The first four steps of the stitching technique of Theorem 3.1.5. Re-peating log εn times creates a macroscopic horizontal open crossing.
for free boundary conditions (the case ξ = 0). Fix δ > 0 and let R = J0, nK ×
J(12− δ)n′, (1
2+ δ)n′K. We will show the stronger result that there exist some
γ(α), γ′(α) > 0 such that,
n−γ . π0Λ
((0, n′/2)
R←→ (n, n′/2)). n−γ
′. (3.1.1)
The upper bound in (3.1.1) is a consequence of the polynomial decay of correla-
tions in Theorem 3.1.4, and it remains to establish the lower bound. Observe that
for every e,
infω′∈0,1E(Λ)−e
πω′
Λ (e ∈ ω) =pc
pc + q(1− pc)=
1
1 +√q
;
thus, we can force all the edges of R0 = J0, 2 log nK×n′2 to be open with probability
at least (1 +√q)−2 logn = n−2βc , where we recall that βc = log(1 +
√q).
We boost this to a horizontal crossing of length δn/2 from the boundary by
stitching together horizontal and vertical crossings and applying the FKG inequality
(see Fig. 3.1). Fix ε > 0 sufficiently small (e.g., a choice of ε = δ/10 would suffice),
64
and consider
R2k−1 = J(2k − 1) log n, (3 · 2k−1 − 1) log nK× Jn′2, n′
2+ 2k+1α log nK ,
R2k = J(2k − 1) log n, (3 · 2k − 1) log nK× Jn′2, n′
2+ 2k+1α log nK
for k = 1, . . . , K, where K = blog2
(εn
logn
)c.
Moreover, take R2k−1 and R2k to be the concentric 32-dilations of R2k−1 and
R2k, respectively. By construction, each R2K−1 and R2K has width at most 2εn
and height at most 2εn′, hence their respective dilations R2K−1 and R2K are both
contained in R.
As a consequence of Ri ⊂ Λ the free boundary conditions on Ri are dominated
by the distribution over boundary conditions induced by π0Λ. Thus, there exists
some p1(α), p2(α) > 0 given by Theorem 3.1.2 such that
π0Λ(Cv(R2k−1)) ≥ π0
R1(Cv(R2k−1)) ≥ p1 , and likewise π0
Λ(Ch(R2k)) ≥ p2 .
(Notice the aspect ratios of R2k−1 are the same for all k, and similarly for R2k.)
Further, for every k, these events are increasing; thus by the FKG inequality,
π0Λ (Cv(R2k−1) ∩ Ch(R2k)) ≥ p1p2 .
At the final scale K, the width of R2K is (2− o(1))εn and its height is (2− o(1))εn′,
so
(ωR0
= 1 ∩K⋂k=1
(Cv(R2k−1) ∩ Ch(R2k))
)⊂ (0, n
′
2)
R←→ 2εn × J12n′, (1
2+ 2ε)n′K
65
for any sufficiently large n. By repeated application of the FKG inequality,
π0Λ
((0, n
′
2)
R←→ 2εn × J12n′, (1
2+ 2ε)n′K
)≥ n−2βc(p1p2)K = n−γ (3.1.2)
for some γ > 0. By symmetry, the exact same argument shows that
π0Λ
((n, n
′
2)
R←→ (1− 2ε)n × J12n′, (1
2+ 2ε)n′K
)≥ n−γ . (3.1.3)
In order to complete the desired horizontal crossing, we require an open path
connecting the left and right crossings, via an open circuit in the annulus A1 given
by
A1 = J0, nK× J(12− δ)n′, (1
2+ δ)n′K− Jεn, (1− ε)nK× J(1
2− 3ε)n′, (1
2+ 3ε)n′K .
By Theorem 3.1.3, there is an absolute constant p3(α) > 0 such that π0A1
(Co(A1)) >
p3. Since the induced boundary conditions on ∂A1 by π0Λ stochastically dominate
free boundary conditions on ∂A1, it follows that π0Λ (Co(A1)) > p3. Finally, the event
Co(A1) is increasing, and its intersection with the two horizontal crossing events
from (3.1.2) and (3.1.3) is a subset of the event (0, n′2
)R←→ (n, n
′
2). Thus, by
FKG, the latter has probability at least p3n−2γ , establishing (3.1.1), as desired.
The following is a slight extension of Theorem 3.1.5.
Proposition 3.1.6. Let q ∈ (1, 4] and fix α ∈ (0, 1]. Consider the critical FK
model on Λ = Λn,n′ with bαnc ≤ n′ ≤ dα−1ne. For every ε > 0, there exists
c?(α, ε, q) > 0 such that for every x ∈ Jεn, 1− εnK, and every boundary condition ξ
on ∂Λ, one has
πξΛ((x, 0)←→ (x, bn′c)
)& n−c? .
66
Proof. The proposition was proved in the case n′ = bαnc and x = n2. Since
the crossing probabilities of Theorem 3.1.2 are monotone in the aspect ratio,
each is bounded away from zero for aspect ratios in [α, α−1], yielding the desired
extension.
The next two results are for q = 4 (Theorem 3.1.1 implies both for 1 < q < 4).
Lemma 3.1.7. Let q = 4 and fix α ∈ (0, 1]. Consider the critical FK model on
Λ = Λn,n′ with bαnc ≤ n′ ≤ dα−1ne and (1, 0) boundary conditions denoting wired
on ∂sΛ and free elsewhere. For every ε > 0, there exists p = p(α, ε) > 0 such that
π1,0Λ (Cv(J0, nK× J0, (1− ε)n′K)) ≥ p .
Proof. Consider the rectangle Λ′ = J0, nK × J−εn′, n′K which contains Λ. By
monotonicity in boundary conditions and the domain Markov property, we see that
π1,0Λ (ω ∈ ·) sup
ξπ0
Λ′
(ωE(Λ) ∈ · | ωE(Λ′)−E(Λ) = ξ
) π0
Λ′(ωE(Λ) ∈ ·) .
By Theorem 3.1.2, there exists a p(ε, α) > 0 such that π0Λ′
(Cv(Λ)
)≥ p. Since Cv(Λ)
is an increasing event, these together imply the desired.
Corollary 3.1.8. Let q = 4 and fix α ∈ (0, 1]. Consider the critical FK model on
Λ = Λn,n′ with bαnc ≤ n′ ≤ dα−1ne and boundary conditions, denoted by (1, 0, 1, 0),
that are wired on ∂n,sΛ and free on ∂e,wΛ. Then there exists p(α) > 0 such that
π1,0,1,0Λ (Cv(Λ)) ≥ p .
Proof. For all n′ ≤ n this follows immediately from self-duality and monotonicity
67
in boundary conditions. For n ≤ n′ ≤ dα−1ne, by monotonicity in boundary
conditions and Lemma 3.1.7, for any ε ∈ (0, 1), there is a p(1, ε) > 0 such that,
π1,0,1,0Λ (Cv(J0, nK× J0, εnK)) ≥ p ,
and by reflection symmetry, π1,0,1,0Λ (Cv(J0, nK× Jn′ − εn, n′K)) ≥ p. Let
Aε = Λ− Jεn, (1− ε)nK× Jεn, n′ − εnK .
Since Co(Aε) can be lower bounded by four crossings of rectangles, each of whose
probabilities is monotone in the aspect ratio and thus bounded away from 0
uniformly over n ≤ n′ ≤ dα−1ne, we have that π1,0,1,0Λ (Cv(Aε)) ≥ p′ uniformly over
n ≤ n′ ≤ dα−1ne for some p′(α, ε). Now observe that
(Cv(J0, nK× J0, εnK) ∩ Cv(J0, nK× Jn′ − εn, n′K) ∩ Co(Aε)
)⊂ Cv(Λ) .
After fixing any small ε > 0, by the FKG inequality, there exists some p(α) > 0
such that for every n ≤ n′ ≤ dα−1ne, one has π1,0,1,0Λ (Cv(Λ)) ≥ p, as required.
3.1.2 Lower bounds on arm exponents
As mentioned, the polynomial decay is expected to have a critical exponent
predicted by the algebraic structure of conformal field theory; while this critical
exponent is known rigorously at q = 2 [75], this is not the case for other q ∈ (1, 4].
Here, we adapt a standard argument for obtaining the Bernoulli percolation two-arm
exponent, and use it to lower bound the FK one-arm exponent.
Lemma 3.1.9. Fix an ε > 0 and consider the critical FK model for q ∈ (1, 4] on
68
R = J−n, nK2; there exists c(q) > 0 such that
π0R
(0←→ ∂(J−n
2, n
2K2))≥ cn−
12 , (3.1.4)
and thus, there exists c′(ε, q) > 0 such that for every x, y ∈ J−(1− ε)n, (1− ε)nK2,
π0R
(x←→ y
)≥ c′|x− y|−1 . (3.1.5)
Proof. Let L = J−n8, n
8K× 0, for every x ∈ L, set Rx = x+ J−n
2, n
2K2, and let R+
x
be the top half of Rx. Also let B = J−3n4, 3n
4K2 and define the event
Sx = x←→ ∂nRx in R+x ∩ x+ (1
2, 0)
∗←→ ∂nRx in R+x .
Consider the event S that there exists a site x ∈ L such that Sx holds. We
begin by proving that π1B(S) ≥ c for some c > 0 independent of n. By using
Theorem 3.1.1–3.1.2 and stochastic domination twice, we see that there exists
c(q) > 0 such that
π1B(Cv(J−n
8,− n
10K× J0, n
2K) ∩ C∗v(J n10
, n8K× J0, n
2K)) ≥ c .
But one can observe that the above event implies that the right-most point on
L that is part of the cluster of the vertical open crossing in R+0 satisfies Sx, so
π1B(S) ≥ c. At the same time, we have by a union bound that
π1B(S) ≤
∑x∈L
π1B(Sx) ≤ (n/4) max
x∈Lπ1B(Sx) , so that
maxx∈L
π1B(Sx) ≥ 4cn−1 .
69
The maximum on the left-hand side is attained by some deterministic x ∈ L which
we set to be j, for which we have, by the FKG inequality and self-duality, that
π1B(Sj) ≤ π1
B
(j ←→ ∂nRj in R+
j
)π1B
(j + (1
2, 0)
∗←→ ∂nRj in R+j
)= π1
B
(j
R+j←→ ∂nRj
)π0B(j
R+j←→ ∂nRj) . (3.1.6)
By the RSW estimate, Theorem 3.1.3, we see that π0B(Co(B −
⋃j∈LRj)) ≥ ε for
some ε(q) > 0, and therefore by monotonicity in boundary conditions,
π0B
(j
R+j←→ ∂nRj
)≥ επ1
B
(j
R+j←→ ∂nRj
).
Plugging this in to (3.1.6) implies that π0B(j ←→ ∂Rj) ≥ 2
√cεn
. In order to com-
plete the proof of (3.1.4), we translate by −j to see that π0B−j(0←→ ∂J−n
2, n
2K2) ≥
c′n−12 for some c′(q) > 0. Since j ∈ L, B − j ⊂ R and by monotonicity, we
deduce (3.1.4).
Going from (3.1.4) to (3.1.5) is a standard exercise in using RSW estimates
(Theorems 3.1.1–3.1.2) and the stitching arguments used in the proof of Theo-
rem 3.1.5; since both x, y are macroscopically far from ∂R, we can use (3.1.4) to
connect each of them to some distance O(|x− y|) away, and stitch open crossings
to connect these two together via the FKG inequality and Theorems 3.1.1–3.1.2,
yielding the desired.
3.2 Boundary bridges
As discussed earlier, dual-crossings, whose probabilities have been bounded
uniformly in the boundary condition by Russo–Seymour–Welsh type estimates at
70
Figure 3.2: A pair of boundary connections (green, purple) constituting two distinctboundary bridges, separated by a (blue) dual connection.
q = 2 by [33] and at all q ∈ (1, 4] by [34], and considered in the preceding sections,
are central to coupling proofs of upper bounds on mixing times of Ising models
in [64] and Potts models in Chapter 4. When we move to the Glauber dynamics
for the FK model, there is a major new obstacle with which we have to deal. As
pointed out in the introduction (e.g., Figure 1.5), disjoint long-range FK clusters
along the boundary of rectangles, called bridges (see Fig. 3.2), prevent the coupling
described above and are an impediment to polynomial mixing time upper bounds
for the FK Glauber dynamics at q ∈ (1, 4] and p = pc(q) (see also Section 4.2.1).
Definition 3.2.1. Let Λn,m = J0, nK × J0,mK. Given an FK configuration ω on
E(Z2)− E(Λn,m) and a boundary edge e ∈ ∂Λn,m, say without loss of generality
e ∈ ∂nΛn,m, a bridge over e is an open FK cluster in ω that contains at least one
vertex in ∂nΛn,m to the left of e and one to the right. Let Be(ω) denote the set
of all bridges of e.
(For a more detailed definition, we also refer the reader to Section 3.2.1.) In
critical bond percolation (q = 1 and p = 1/2), the Berg–Kesten (BK) inequality
would suggest that π(|Be| > K log n + a) ≤ exp(−ca) for universal K, c > 0 and
all a. In our setting of FK percolation for 1 < q < 4 at p = pc, the classical BK
71
inequality does not hold; the goal of this section, namely Proposition 3.2.8 and
Lemmas 3.2.10–3.2.11, is to use an iterative revealing procedure of nested bridges
to nonehthess obtain such a bound for |Be|.
It then follows (see Corollary 3.2.9) that if we sample from the FK measure on
a 2n× 2n box Λ with arbitrary boundary conditions, the boundary conditions this
induces on the concentric inner n × n box will have order log n distinct bridges
over a given edge with probability 1−O(n−c). A simpler formulation of that is as
follows.
Theorem 3.2.2. For every q ∈ [1, 4], there exist K ′ > K > 0 and c(q) > 0 such
that, the critical FK model has, for every e ∈ ∂Λn,n a, say, n10
distance from a
corner of Λn,n,
πZ2,pc,q
(K log n ≤ |Be(ω)| ≤ K ′ log n
)≥ 1−O(n−cK
′) .
The lower bound on |Be| demonstrates that the behavior of the number of
bridges at p = pc is truly different than at p 6= pc; there, by the exponential decay
of correlations at p < pc (dual connections at p > pc), the typical number of bridges
over an edge is O(1).
Relation to mixing estimates. To be more precise about the obstacle posed by
having multiple bridges over an edge, recall the following. In [64] and then [42], the
upper bounds on the mixing time of the Potts models at β = βc for q ∈ 2, 3, 4
relied on RSW bounds [33, 34] to expose dual-interfaces in the FK representation,
beyond which block dynamics chains could be coupled. However, the fact that
chains, started from any two initial configurations, could be coupled past a dual-
interface, relied on a certain conditional event, implicit in the relation between the
72
FK and Potts models at integer q (that no distinct boundary components were
connected in the interior configuration). Without this conditioning, connections
between two components on one side of a rectangle alter the boundary conditions
elsewhere via bridges over the dual-interface, preventing coupling (see Fig. 1.5).
Similar difficulties were pointed out in [9] at p < pc; we also deal with them
in Chapters 7–8 at criticality when q > 4. In both of these cases, the exponential
decay of correlations under π0Z2 ensures that all such bridges would be, with
high probability, microscopic: in [9] these bridges are negligible after restricting
attention to side-homogenous (wired or free on sides) boundary conditions, while in
Chapters 7–8, relevant boundary segments will be disconnected from one another
by brute-force modifications. In contrast, in the present setting at the critical point
of a continuous phase transition, the power-law decay of correlations precludes such
techniques; thus, at p = pc(q) obtaining sharp bounds on the number of bridges
becomes not only necessary, but also substantially more delicate.
3.2.1 Boundary bridges
In this subsection we define boundary bridges of the FK model and related
notation. As explained above, the presence of boundary bridges will be the key
obstacle to coupling and, in turn, to mixing time bounds.
Definition 3.2.3. Consider a rectangle Λ = Λn,n′ with boundary conditions ξ, and
a connected segment L = Ja, bK × n′ ⊂ ∂nΛ. A component γ ⊂ ∂nΛ of ξ is a
bridge over L if there exist v = (v1, v2), w = (w1, w2) ∈ γ such that vξ←→ w and
v1 < a and w1 > b .
73
hulle(γi)hullw(γi)
γi
γi+1
e
Figure 3.3: A pair of boundary bridges, γi, γi+1, over e ∈ ∂nR induced by aconfiguration on Λ−R, and separated by a dual-bridge over e.
Note that every two distinct bridges γ1 6= γ2 over L are disjoint in ξ. Denote by
BL = BL(ξ) the set of all bridges over the segment L. Define bridges on subsets of
∂Λs, ∂Λe, ∂Λw analogously.
Definition 3.2.4 (hull and length of a bridge). The west and east hulls of a bridge
γ over L = Ja, bK× n′ are defined as
hullw(γ) = Jmaxx ≤ a : (x, n′) ∈ γ, aK× n′ ,
hulle(γ) = Jb,minx ≥ b : (x, n′) ∈ γK× n′ ,
so that the hulls of a bridge γ are connected subsets of ∂nΛ (see Fig. 3.3. The west
and east lengths of γ are defined to be
`w(γ) = | hullw(γ)| , `e(γ) = | hulle(γ)| .
Given the above convention, for any L and ξ we can define a east-ordering of
74
BL(ξ) as (γ1, γ2, ..., γ|BL|) where, for all i < j,
`e(γi) < `e(γj) .
Note that, in this ordering of the bridges, hulle(γi) ( hulle(γj) for all i < j. Define
a west-ordering of BL analogously.
Definition 3.2.5. For a subset R ⊂ Λ, an induced boundary condition on ∂R is
one that can be identified with the component structure of an edge configuration
ωΛ−Ro along with the boundary condition on Λ.
Using the above definitions, and planarity, one can check the following useful
facts (depicted in Fig. 3.3). For concreteness we use the east-ordering of BL =
γ1, ..., γ|BL|.
Fact 3.2.6. Let Λ ⊃ R with boundary conditions ξ, and let L ⊂ ∂nR. If γi, for
i < |BL|, is the i-th bridge in the east-ordering of BL, then either the two connected
components of ∂nR− (hullw(γi) ∪ L ∪ hulle(γi)) are connected in Λ−R, or each of
these components is connected to ∂Λ in Λ−R.
Fact 3.2.7. Let Λ ⊃ R with boundary conditions ξ, and let L ⊂ ∂nR. For every
two induced bridges γ1 6= γ2 over a segment L such that hulle(γ1) ⊂ hulle(γ2), either
the two sets (hullw(γ2)4 hullw(γ1)) and (hulle(γ2)4 hulle(γ1)) are dual-connected
in Λ−R, or each of these sets is dual-connected to ∂Λ in Λ−R.
3.2.2 Estimating the number of boundary bridges
In this section, we bound the number of distinct induced boundary bridges over
a segment of ∂R.
75
When sampling boundary conditions on R ⊂ Λ under πξΛ, the induced bridges
over e and all properties of them, are measurable w.r.t. ωΛ−Ro . For any configura-
tion ω, we denote by Be = Be(ωΛ−Ro , ξ) the set of all bridges over e corresponding
to that configuration on Λ, with the above defined west and east orderings.
The main estimate on |Be|, that will be key to the proof of Theorem 9, is the
following.
Proposition 3.2.8. Let q ∈ (1, 4] and fix α ∈ (0, 1]. Consider the critical FK
model on Λ = Λn,n′ with n′ ≥ bαnc, along with the subset R = Λn,n′/2. There exists
c(α, q) > 0 such that for every e ∈ ∂nR, every boundary condition ξ, and every
K > 0,
πξΛ(ω : |Be| ≥ K log n) . n−cK . (3.2.1)
Moreover, there exists c′(α, q) > 0, and for every ε > 0 there is some K0(ε), such
that for every e ∈ Jnε, n − nεK × bn′2c, every boundary condition ξ, and every
K < K0,
πξΛ(ω : |Be| ≥ K log n) & n−c′K .
In addition to the tail behavior of |Be|, we can also classify its typical behavior,
showing that a fixed edge indeed has order log n bridges over it with high probability
(cf. the case of p 6= pc(q) where this quantity is typically O(1)).
Corollary 3.2.9. Let q ∈ (1, 4] and α ∈ (0, 1]. Consider a rectangle Λ = Λn,n′
with n′ ≥ bαnc, along with R = Λn,n′/2. There exists c(α, q) > 0, and for every
ε > 0, there exist K ′ > K(ε) > 0, such that for every e ∈ Jnε, n− nεK×bn′2c and
every ξ,
πξΛ (|Be| /∈ JK log n,K ′ log nK) ≤ n−c .
76
The model at p = pc(q), q ∈ (1, 4] is believed to be scale-invariant; in line with
this, having nested bridges whose length grow exponentially induces c log n clusters
ranging in scale between O(1) and O(nε). Indeed, this is how the lower bound of
Corollary 3.2.9 is obtained. It will, therefore, be important for us to split the set
Be into those bridges according to their proximity to their interior bridges, as well
as the boundary.
For the rest of this subsection, since e is fixed, if e is in the left half of ∂nR then
we will use the east-ordering of Be and otherwise we will use the west-ordering
of Be. If e is in the left half of ∂nR define the subsets Be1 = Be
1(ωΛ−Ro , ξ) and
Be2 = Be
2(ωΛ−Ro , ξ) of Be as follows:
Be1 =
γi ∈ Be : `e(γi−1) ≤ n
6, `e(γi) ≤ 2`e(γi−1)
,
Be2 =
γi ∈ Be : `e(γi−1) ≥ n
6, n− x− `e(γi) ≥ 1
2(n− x− `e(γi−1))
.
For e in the right half of ∂nR, define Be1 and Be
2 analogously, by replacing `e with
`w and n − x with x. For convenience, let γ0 be the possibly nonexistent bridge
given by the two vertices incident to the edge e, which will allow us to treat γ1 as
we would treat the other γi’s.
Before proving Proposition 3.2.8, we present the two lemmas central to the
upper bound (3.2.1) of Proposition 3.2.8, proving exponential tails on each of |Be1|
and |Be2| beyond O(log n). Together, these will imply the O(log n) upper bound
on |Be|, so we defer the proofs of the two lemmas until after completing the proof
of Proposition 3.2.8 using the lemmas. We conclude this section with a proof of
Corollary 3.2.9.
Lemma 3.2.10. There exists c1(α, q) > 0 such that for every e ∈ ∂nR, ξ and
77
K > 0,
πξΛ (ω : |Be1| ≥ K log n) . n−c1K .
Lemma 3.2.11. There exists c2(α, q) > 0 such that for every e ∈ ∂nR, ξ and
K > 0,
πξΛ (ω : |Be2| ≥ K log n) . n−c2K .
With these two lemmas in hand the proof of Proposition 3.2.8 is greatly simpli-
fied.
Proof of Proposition 3.2.8. We begin with the upper bound. Fix an edge
e ∈ ∂nR and a boundary condition ξ on ∂Λ. Without loss of generality suppose
that e is in the left half of ∂nR and use the east-ordering of Be = γ1, γ2, ..., γ|Be|.
Observe that violating the second condition in Be1 means that `e(γi) has at least
doubled the length of its predecessor, whereas violating the second condition in
Be2 means that n − x − `e(γi) is at most half the corresponding quantity of its
predecessor. Noting that violating the length condition of `e(γi−1) (compared to
n/6) is disjoint between Be1 and Be
2, and since `e(γi) ≤ n and n− x− `e(γi) ≥ 1
for all i, we deterministically have
|Be − (Be1 ∪Be
2)| ≤ 2 log2 n ≤ 3 log n .
Using a union bound,
πξΛ (|Be1 ∪Be
2| ≥ (K − 3) log n) ≤ πξΛ(|Be
1| ≥ K−32
log n)
+ πξΛ(|Be
2| ≥ K−32
log n).
The bounds on the two terms on the right-hand side are given by Lemmas 3.2.10–
78
3.2.11, respectively. Taking the minimum of c1, c2 in those lemmas then implies
that there exists c(α, q) > 0 such that
πξΛ(|Be| ≥ K log n) . n−c(K−3)/2 .
In order to prove the lower bound, for any ε > 0, fix any edge e = (x, bn′2c) with
x ∈ Jnε, n− nεK. For i ≥ 1, suppressing the dependence on e, define the sets
Rni = Jx− 2i+1, x+ 2i+1K× Jbn′
2c+ 2i, bn′
2c+ 2i+1K ,
Rei = Jx+ 2i, x+ 2i+1K× Jbn′
2c − 2i, bn′
2c+ 2i+1K , (3.2.2)
Rwi = Jx− 2i+1, x− 2iK× Jbn′
2c − 2i, bn′
2c+ 2i+1K ,
and their respective subsets,
Rni = Jx− 2i+1 + 2i−1, x+ 2i+1 − 2i−1K× Jbn′
2c+ 2i, bn′
2c+ 2i + 2i−1K ,
Rei = Jx+ 2i, x+ 2i + 2i−1K× Jbn′
2c, bn′
2c+ 2i + 2i−1K , (3.2.3)
Rwi = Jx− 2i − 2i−1, x− 2iK× Jbn′
2c, bn′
2c+ 2i + 2i−1K .
When K < K0 := ε log 44
, for every i ≤ 2K log n, we have Rwi , R
ni , R
ei ⊂ Λ. Also
define the following crossing events.
Ai = Cv(Rwi ) ∩ Ch(Rn
i ) ∩ Cv(Rei ) ,
A∗i = C∗v(Rwi ) ∩ C∗h(Rn
i ) ∩ C∗v(Rei ) .
79
Then by definition of distinct bridges in Λ−Ro, we observe that for each k,
|Be| ≥ K log n ⊃K logn⋂i=1
A2i−1 ∩ A∗2i . (3.2.4)
By monotonicity in boundary conditions, the FKG inequality, and Theorem 3.1.2,
there exists p(α, q) > 0 such that for every i ≤ 2K log n,
πξΛ(Ai) ≥ π0Rwi(Cv(Rw
i ))π0Rni(Ch(Rn
i ))π0Rei(Cv(Re
i )) ≥ p ,
πξΛ(A∗i ) ≥ π1Rwi(C∗v(Rw
i ))π1Rni(C∗h(Rn
i ))π1Rei(C∗v(Re
i )) ≥ p .
Thus, if K < K0, we have πξΛ(|Be| ≥ K log n) ≥ p2K logn, as required.
We now prove Lemmas 3.2.10–3.2.11, whose proofs constitute the majority of
the work in obtaining Proposition 3.2.8.
Proof of Lemma 3.2.10. Assume without loss of generality that e = Jx−1, xK×
bn′2c is such that x ≤ n
2and use the east-ordering of Be = γ1, γ2, ..., γ|Be|. In
order to obtain an upper tail on |Be1|, let us describe a revealing procedure for the
FK configuration ω on Λ−Ro under πξΛ.
Let F = J0, nK × Jbn′2c, n′K = Λ − Ro (so that ωF , ξ is the set of connections
with respect to which the existence/properties of bridges are measurable). We can
sequentially reveal γ1, . . . , γ|Be| by exposing the open clusters (in F ) containing
vertices of ∂nΛ one at a time, starting from those adjacent to the right-vertex of e.
Such procedures for exposing the clusters have been used in related settings (see,
e.g., [9, 64]); we formally describe the procedure here since in our case it involves
long-range interactions imposed by the FK boundary conditions. One can reveal
the open cluster Cv containing a vertex v in the set F by
80
1. initializing the set C = v;
2. exposing the values of ω on all edges in E(F ) that contain vertices in C;
3. adding to C any vertices that are now connected by a path of open edges to v
(including possibly the connections imposed by the boundary condition ξ);
4. repeating the process from step (1) with the new set C.
Any open cluster containing vertices in ∂nR on both the right and left sides of e
is a bridge over e. In order to reveal the first m bridges over the edge e, we can
iteratively reveal the open clusters of ∂nR in F , starting initially with the cluster of
(x, bn′2c), and continuing to the right along ∂nR, until m distinct bridges have been
exposed. Using this revealing procedure, the edges which are revealed in order to
expose the first m bridges over e are either enclosed by γm and ∂nR, or belong to
the outer boundary of γm and are closed, thus forming a bounding dual-path.
Let (Fm) be the filtration associated with the above revealing process for the
bridges (Fm reveals γ1, . . . , γm) over the edge e. Our aim is to prove that for every
m ≥ 1,
πξΛ(γm ∈ Be1 | Fm−1) ≤ p , (3.2.5)
(if γm doesn’t exist, we vacuously say γm /∈ Be1) for the choice of
p = 1− p1p2p3 < 1 , (3.2.6)
where p1(α, q), p2(α, q), p3(α, q) > 0 are defined as follows:
• p1 is given by Theorem 3.1.1 with aspect ratio 1 for 1 < q < 4 and by
81
Lemma 3.1.7 with the choice ε = 1/2 and aspect ratio 1/2 for q = 4,
• p2 is the probability given by Theorem 3.1.2 for ε = 1/4 and aspect ratio
6 ∨ α−1,
• p3 is the probability given by Theorem 3.1.2 for ε = 1/3 and aspect ratio 1.
Let us first conclude the proof of Lemma 3.2.10 given (3.2.5). By iteratively
conditioning on (Fi)i≥1 we see that the sequence of indicators (1γi ∈ Be1)i≥1 is
stochastically dominated by the i.i.d. sequence (Zi)i≥1 where Zi ∼ Bernoulli(p). At
the same time, by definition of the set Be1, through this revealing process, as soon
as dlog2 ne many of the indicators (1γi ∈ Be1) are zero, all subsequent ones are
deterministically zero (note that once `e(γi) > n/6, every subsequent bridge will
also have this property). Therefore, we can bound
πξΛ(|Be1| ≥ r) ≤ P
(Bin(r + dlog2 ne − 1, p) ≥ r
)which, upon taking r = K log n and using, say, Hoeffding’s inequality once K ≥
2p−1, yields the desired estimate.
We now turn to proving the conditional estimate of (3.2.5). First observe that
by Fact 3.2.7 and the definition of hulle(γm−1), if Lw, Le are the two connected
subsets of ∂nR− hulle(γm−1), the event
Em =Lw
F ∗←→ Le or LeF ∗←→ ∂Λ
satisfies Em ⊃ |Be| ≥ m ⊃ γm ∈ Be1. In fact, the revealing process of Fm−1
reveals precisely the dual-path that bounds the open cluster of γm−1, and that
dual-path is either a dual connection from Lw to Le in F ∗, or it is the west-most
82
n′2
n′2
+ k
n′
x x + k x + 2k x + 3kz
Λ−R
ζ
R3
R2
γm−1
R1
Figure 3.4: After conditioning on ζ (via the configuration in the blue shadedregion), the probability of the purple and green dual-crossings is greater thanp1p2p3, bounding the probability of γm ∈ Be
1.
dual crossing from Le to ∂Λ that is to the right of γm−1. Either way, denote by ζ
the dual-bridge/crossing revealed as such by Fm−1 (see Fig. 3.4), and let (z, bn′2c)
be the west-most point of ζ ∩ ∂nR.
Let k = `e(γm−1); in order for γm ∈ Be1, necessarily `e(γm) ≤ 2k and
(z, bn′2c) ∈ Jx+ k, x+ 2kK× bn′
2c =: I .
We will establish the desired upper bound of (3.2.5) uniformly over Fm−1, ζ and
k. It suffices to only consider k ≤ n6
because otherwise, `e(γm) > n6
and therefore
γm /∈ Be1 deterministically.
Note that conditional on Fm−1 (which contains the σ-algebras generated by ζ
and k), by Fact 3.2.6, the event γm ∈ Be1 implies the event S, stating that either
ζ is a dual-bridge and I is primal-connected in F ∪ ξ to the left component of
∂nR−hulle(ζ), or alternatively ζ is a dual-crossing to ∂Λ and I is primal-connected
83
to ∂Λ in F . Thus, in this conditional space,
γm /∈ Be1 ⊃
ζ
F ∗←→ Jx+ 2k, x+ 3kK× bn′2c, (3.2.7)
since the right-hand side of Eq. (3.2.7) implies Sc which implies the left-hand side.
In order to lower bound the probability of the last event in Eq. (3.2.7), let D∗
be the outer (if ζ is a dual-crossing in F , then eastern) connected component of
F ∗ − ζ, and let D be its dual. Define also the following subsets of Λ:
R1 =Jx, x+ kK× Jbn′2c, bn′
2c+ mink, αn
4K ,
R2 =Jx, x+ 3kK× Jbn′2c+ mink
2, αn
8, bn′
2c+ mink, αn
4K ,
R3 =Jx+ 2k, x+ 3kK× Jbn′2c, bn′
2c+ mink, αn
4K ,
whereby, the event in the right-hand side of Eq. (3.2.7) can be written as ζ F ∗←→
∂sR3. For any i = 1, 2, 3, define the following crossing events (see Fig. 3.4):
C∗v(Ri ∩D) =∂eRi
Ri∩D←→6 ∂wRi
,
C∗h(Ri ∩D) =∂nRi
Ri∩D←→6 ∂sRi
.
(3.2.8)
(observe that implicit in (C∗v(Ri∩D))c is the event ∂eRi∩D 6= ∅∩∂wRi∩D 6= ∅,
and similarly, implicit in (C∗h(Ri∩D))c is the event ∂nRi∩D 6= ∅∩∂sRi∩D 6= ∅).
Claim 3.2.12. Conditional on Fm−1 (and in particular also ζ and k),
γm /∈ Be1 ⊃
(C∗v(R1 ∩D) ∩ C∗h(R2 ∩D) ∩ C∗v(R3 ∩D)
).
Proof. Suppose that ω satisfies the events on the right-hand. Recall that ζ is
84
such that ∂eR3 ∩ D 6= ∅ and ∂wR3 ∩ D 6= ∅, and ∂eR3←→6 ∂wR3 in R3 ∩ D since
ω ∈ C∗v(R3 ∩D). Consider R3 ∩D with boundary conditions wired on ∂e,wR3 ∩D
and free on ζ and ∂n,sR3 ∩D; then the boundary conditions on R3 ∩D alternate
between free and wired on boundary curves ordered clockwise as Lw1 , Lf1 , L
w2 , L
f2 ...;
by planarity and the choice of generalized Dobrushin boundary conditions, for
any two wired boundary curves Lwi , Lwi+1, either Lwi ←→ Lwi+1, or Lfi
∗←→ Lfj for
some j 6= i. Picking the two wired segments of ∂e,wR3 ∩ D closest to ∂sR3, the
aforementioned fact that ∂eR3←→6 ∂wR3 in R3 ∩D implies that either ∂sR3∗←→ ζ
or ∂sR3∗←→ ∂nR3. In the former, γm /∈ Be
1 holds by Eq. (3.2.7), so suppose only
the latter holds and call the dual-crossing ζ3.
Since ∂sR3∗←→ ∂nR3, both ∂sR2 ∩D and ∂nR2 ∩D are nonempty. Clearly, ζ3
splits R2 ∩D into the subset to its east, Ue, and that to its west, Uw. Consider
the set to its east, Ue, with boundary conditions that are wired on ∂s,nR2 ∩D and
free on ζ and on ∂e,wR2 ∩D. Since ζ3 and ζ are vertex-disjoint (by our assumption
that ∂sR3∗←→6 ζ in R3 ∩D), and the wired boundary segments adjacent to ζ3 are
disconnected in Ue, it must be that either ζ3∗←→ ζ or ζ3
∗←→ ∂eR2 in Ue. Using
the same reasoning on Uw, either ζ3∗←→ ζ or ζ3
∗←→ ∂wR2 in Uw. Combining these,
either ζ∗←→ ζ3, in which case ζ
∗←→ ∂sR3, or alternatively ∂eR2∗←→ ζ3
∗←→ ∂wR2
in R2 ∩D. In the former case, by Eq. (3.2.7), γm /∈ Be1; assume therefore that
only the latter case holds, and let ζ2 be a dual-crossing between ∂eR2 to ∂wR2 that
intersects ζ3.
Finally, we can deduce that ∂eR1 ∩D and ∂wR1 ∩D are nonempty as ζ2 and ζ
are vertex-disjoint (by our assumptions ζ∗←→6 ζ3 and ζ2
∗←→ ζ3). Considering now
Us, the subset of R1 ∩D south of ζ2 with wired boundary conditions on ∂e,wR1 ∩D
and free elsewhere, as before we deduce that either ζ2∗←→ ζ or ζ2
∗←→ ∂sR1 in Us.
85
Since, by definition of ζ, deterministically ∂sR1 ∩D = ∅, the former must hold, and
ζ∗←→ ∂sR3 through ζ2 and ζ3, and Eq. (3.2.7) concludes the proof.
We will next bound the probability of each of the events C∗v(R1∩D), C∗h(R2∩D)
and C∗v(R3 ∩ D), which, using the above claim, will translate to a bound on
γm /∈ Be1.
To see this, first note that by planarity, for all i = 1, 2, 3 and every subset D,
C∗v(Ri ∩D) ⊃ (Ch(Ri))c = C∗v(Ri) , (3.2.9)
and likewise for horizontal crossing events. Define the rectangle R1 ⊃ R1 by
R1 = Jx, x+ kK× Jbn′2c, n′K ⊂ Λ .
Let the boundary conditions (1, 0) on R1 be free on ∂sR1 and wired on ∂n,e,wR1.
Combining Eq. (3.2.9), monotonicity in boundary conditions, and the domain
Markov property, we get for p1(α, q) > 0 given by Eq. (3.2.6),
πξΛ(C∗v(R1 ∩D) | `e(γm−1) = k,Fm−1, ζ
)≥ π1
R1(C∗v(R1 ∩D) | `e(γm−1) = k,Fm−1, ζ)
≥ π1,0
R1(C∗v(R1)) ≥ p1 ,
where the last inequality follows from Theorem 3.1.1, Lemma 3.1.7 and self-duality.
We stress that wiring of ∂n,e,wR1 allowed us to ignore the information revealed on
R1 −D as far as the configuration in R1 ∩D is concerned, and the fact that ωζ is
closed allowed us to place a free boundary on ∂sR1, supporting Lemma 3.1.7.
86
Next, consider the rectangle R2 ⊃ R2 defined by
R2 = Jx− k, x+ 4kK× Jbn′2c, n′K ,
so that R2 ⊂ Λ since k = `e(γm−1) ≤ n/6. By monotonicity in boundary conditions
and Eq. (3.2.9), we get that for the choice of p2(α, q) > 0 given by Eq. (3.2.6),
πξΛ(C∗h(R2 ∩D) | `e(γm−1) = k,Fm−1, ζ)
≥ π1R2
(C∗h(R2 ∩D) | `e(γm−1) = k,Fm−1, ζ)
≥ π1R2
(C∗h(R2)) ≥ p2 .
Similarly, applying the exact same treatment of R2 to
R3 = Jx+ k, x+ 4kK× Jn′4, n′K ⊂ Λ ,
(it is possible to encapsulate R3 by a rectangle with wired boundary conditions
since ζ does not intersect ∂sR3 in our conditional space) shows that
πξΛ(C∗v(R3 ∩D) | `e(γm−1) = k,Fm−1, ζ)
≥ π1R3
(C∗v(R3 ∩D) | `e(γm−1) = k,Fm−1, ζ)
≥ π1R3
(C∗v(R3)) ≥ p3 ,
for p3(α, q) > 0 as defined in Eq. (3.2.6).
Putting these all together, by the FKG inequality and Claim 3.2.12,
πξΛ(γm /∈ Be
1
∣∣ Fm−1
)≥ p1p2p3 ,
87
n′2
n′2
+ l4
n′
n− 7l6
n− l n− l3
n− l6
ζ
R3
R2
γm−1
R1
Figure 3.5: After revealing ζ (via the blue shaded region), the existence of the threedual-crossings depicted precludes |Be
2| ≥ m.
implying the desired (3.2.5), and concluding the proof.
Proof of Lemma 3.2.11. Without loss of generality suppose e is in the left half
of ∂nR and use the east-ordering of bridges so that Be = γ1, ..., γ|Be|.
The proof follows the same argument used to prove Lemma 3.2.10. In what
follows we describe the necessary modifications that are needed here. Recall the
prescribed revealing process for the configuration on F = J0, nK× bn′2c described in
the proof of Lemma 3.2.10; recall also that (Fm) is the filtration corresponding to
the process of sequentially revealing the distinct bridges over the edge e. Our goal
is to prove the following analogue of (3.2.5), that for every m ≥ 1,
πξΛ(γm ∈ Be2 | Fm−1) ≤ p , (3.2.10)
for the choice of p = 1− p1p2p3 < 1 where,
• p1 is given by Theorem 3.1.1 with aspect ratio α for 1 < q < 4 and by
Lemma 3.1.7 with the choice ε = 1/2 and aspect ratio α/2 for q = 4 ,
88
• p2 is the probability given by Theorem 3.1.2 for ε = 1/8 and aspect ratio 6/α ,
• p3 is the probability given by Theorem 3.1.2 for ε = 1/3 and aspect ratio α .
Indeed, by iteratively conditioning on (Fi)i≥1, the bound (3.2.10) allows us to
stochastically dominate the sequence of indicators (1γi ∈ Be1)i≥1 by the i.i.d.
sequence (Zi)i≥1 where Zi ∼ Bernoulli(p), and moreover by definition of Be2, as soon
as dlog2 ne of the indicators are zero, all subsequent ones are deterministically zero.
The desired inequality then follows by comparison to P(Bin(r+ dlog2 ne− 1, p) ≥ r)
for r = K log n.
As before, we consider a fixed m, and let Lw, Le be the left and right connected
components of ∂nR− hulle(γm−1). As in the proof of Lemma 3.2.10, reveal γm−1,
in which case we reveal the enclosing dual-path ζ attaining
Em =Le
F ∗←→ Lw or LeF ∗←→ ∂Λ
,
whose west-most vertex of intersection with ∂nR is marked by (z, bn′2c). Condition-
ally on Fm−1, which contains the σ-algebras of γm−1, ζ and `e(γm−1) = k,
Also for any instance of the configuration revealed by Fm−1, we can set k =
`e(γm−1) as before, and let
l := n− (x+ `e(γm−1)) .
If n−z < l/2, deterministically γm /∈ Be2 (as argued in the proof of Proposition 3.2.8),
hence we may assume that n − z ≥ l/2; moreover, since k ≥ n6, it must be that
89
l6≤ k. Define the following subsets of Λ:
R1 =Jn− l − l6, n− lK× Jbn′
2c, bn′
2c+ αl
6K ,
R2 =Jn− l − l6, n− l
6K× Jbn′
2c, bn′
2c+ αl
6K ,
R3 =Jn− l3, n− l
6K× Jbn′
2c, bn′
2c+ αl
6K .
Define C∗v(R1 ∩D), C∗h(R2 ∩D), C∗v(R3 ∩D) as in Eq. (3.2.8). As in Claim 3.2.12,
γm /∈ Be2 ⊃
(C∗v(R1 ∩D) ∩ C∗h(R2 ∩D) ∩ C∗v(R3 ∩D)
).
Finally, for Ri, i = 1, 2, 3 given by
R1 =Jn− l − l6, n− lK× Jbn′
2c, bn′
2c+ αl
3K ,
R2 =Jn− l − l3, nK× Jbn′
2c − αl
6, bn′
2c+ αl
3K ,
R3 =Jn− l2, n, l
2K× Jbn′
2c − αl
6, bn′
2c+ αl
3K ,
(note that all three are subsets of Λ, by the fact that l ≤ n and n′ ≥ bαnc), the
same monotonicity argument used in the proof of Lemma 3.2.10 now implies (see
Fig. 3.5) that
πξΛ(γm−1 /∈ Be
2
∣∣ Fm−1
)≥ p1p2p3 ,
implying (3.2.10) and concluding the proof.
By matching the tail estimate of Proposition 3.2.8 with a lower bound, we can
straightforwardly see that an order log n bridges over a fixed edge is indeed typical.
Proof of Corollary 3.2.9. Fix an abitrary ε > 0, any boundary condition ξ, and
90
any e ∈ Jnε, n − nεK × bn′2c. For this e, recall the definitions of the rectangles
Rni , R
ei , R
wi as well as their subsets Rn
i , Rei , R
wi from (3.2.2)–(3.2.3). As before, when
M < M0 := ε log 44
, for every i ≤ 2M log n, all these are subsets of Λ and we can
define the crossing events
Ai = Cv(Rwi ) ∩ Ch(Rn
i ) ∩ Cv(Rei ) , and A∗i = C∗v(Rw
i ) ∩ C∗h(Rni ) ∩ C∗v(Re
i ) .
Now for each i ≤ M log n, we can define the event χi := A2i−1 ∩ A∗2i and notice
that
|Be| ≥M logn∑i=1
1χi .
Observe that for each i, the event Ai is measurable with respect to the configuration
ω on the half-annulus Rwi ∪Rn
i ∪Rei . By a similar reasoning as before, there exists
some p = p(α, q) > 0 such that for every i = 1, ..., 2M log n, and every configuration
η,
πξΛ(Ai | ωΛ−Rn,e,wi
= η) ≥ π0Rwi(Cv(Rw
i ))π0Rei(Cv(Re
i ))π0Rni(Ch(Rn
i )) ≥ p ,
πξΛ(A∗i | ωΛ−Rn,e,wi
= η) ≥ π1Rwi(C∗v(Rw
i ))π1Rei(C∗v(Re
i ))π1Rni(C∗h(Rn
i )) ≥ p .
Observe that for i 6= j the interiors of Rn,e,wi and Rn,e,w
j are disjoint. As a
consequence, we can also deduce by the domain Markov property and monotonicity,
that for every configuration η, for every i = 1, ...,M log n,
πξΛ(χi | ωΛ−Rn,e,w
2i−1−Rn,e,w2i
= η)≥ p2 .
91
In particular, the sequence of indicators (1χi)i=1,...,M logn stochastically dominates
a sequence of i.i.d. Bernoulli(p2) random variables. We therefore deduce that
|Be| ≥M logn∑i=1
1χi Bin(M log n, p2)
Choosing K < p2
2M , and using Hoeffding’s inequality to bound the probability that
the binomial random variable on the right-hand side is at most K log n, we see that
πξΛ(|Be| ≤ K log n
)≤ exp
[− 1
4Mp4 log n
].
Combining this via a union bound with the upper bound from (3.2.1) in Proposi-
tion 3.2.8 implies the desired.
3.2.3 Disjoint crossings
To extend our FK mixing time bound from favorable boundary conditions
(see §4.2.4) to periodic boundary conditions (which are not in that class) in §4.2.6,
we will need an analogous bound on the number of disjoint crossings of a rectangle.
For a rectangle R = (V (R), E(R)) and a configuration ωE(R), let ΨR =
ΨR(ωE(R)) be the set containing every component A ⊂ V (R) (connected via the
edges of ωE(R)) that intersects both ∂sR and ∂nR. We will need the following
equilibrium estimate similar to Proposition 3.2.8.
Proposition 3.2.13. Let q ∈ (1, 4] and α ∈ (0, 1]. Consider the critical FK model
on Λ = Λn,n′ with n′ ≥ bαnc, and the subset R = J0, nK × Jn′3, 2n′
3K. There exists
92
c(α, q) > 0 such that for every boundary condition ξ and every m ≥ 3,
πξΛ (|ΨR| ≥ m) ≤ e−cm .
Proof. We will prove by induction that, for all m ≥ 1,
πξΛ(|ΨR| ≥ m) ≤ (1− p)m−2 , (3.2.11)
where p > 0 is as given by Theorem 3.1.1 with aspect ratio 3/α when 1 < q < 4,
and is as given by Corollary 3.1.8 with aspect ratio α/3 when q = 4.
The cases m = 1, 2 are trivially satisfied for any 0 < p < 1. Now let m ≥ 3,
and suppose that Eq. (3.2.11) holds for m− 1; the proof will be concluded once we
show that
πξΛ(|ΨR| ≥ m
∣∣ |ΨR| ≥ m− 1)≤ 1− p .
Conditioned on the existence of at least m− 1 distinct components in ΨR, we can
condition on the west-most component in ΨR (by revealing all dual-components
of ωR incident to ∂wR, then revealing the primal-component of the adjacent
primal-crossing). We can also condition on the m − 2 east-most components in
ΨR (by successively repeating the aforementioned procedure from east to west,
i.e., replacing ∂wR above by ∂eR to reveal some component C ∈ ΨR, then by its
western boundary ∂wC, etc.).
Through this process, we can find two disjoint vertical dual-crossings ζ1, ζ2 of
R, each one a simple dual-path; the set (R∗ − ζ1 − ζ2)∗ consists of three connected
subsets of R; let D denote the middle one. There are exactly m− 1 elements of
ΨR in R−D, thus its m-th element, if one exists, must belong to D. Since every
93
edge in ζ1 ∪ ζ2 is dual-open, for any such choice of ζ1, ζ2, we then have
πξΛ(|ΨR| ≥ m
∣∣ |ΨR| ≥ m− 1, ζ1, ζ2
)= πξΛ
(Cv(D)
∣∣ ζ1, ζ2
),
Using the domain Markov property and monotonicity of boundary conditions,
πξΛ(Cv(D)
∣∣ ζ1, ζ2
)≤ π1,0,1,0
D (Cv(D)) ,
where (1, 0, 1, 0) boundary conditions on D denote those that are free on ζ1, ζ2 and
wired on ∂R ∩D. Again by monotonicity (in boundary conditions and crossing
events),
π1,0,1,0D (Cv(D)) ≤ π1,0,1,0
R
(Cv(D)
∣∣ ωζ1 = 0, ωζ2 = 0)≤ 1− π1,0,1,0
R (C∗h(R)) ,
where, following the notation of Corollary 3.1.8, (1, 0, 1, 0) boundary conditions on
a rectangle R are wired on ∂n,sR and free on ∂e,wR. By monotonicity in boundary
conditions and the definition of p, the right-hand side is bounded above by
1− π(1,0,1,0)R
(C∗h(J0, nK× Jn′
3, n′
3+ αn
3K))≤ 1− p .
94
Chapter 4
Mixing on (Z/nZ)2 at continuous
phase transitions
In this chapter, we analyze the Glauber dynamics of the Potts and FK models
at their critical point when q ∈ (1, 4], so that their phase transition is continuous.
The rich equilibrium picture described at the beginning of Chapter 3 is believed
to present itself in the local-update dynamics of the system by e.g., inducing an
auto-correlation length and spectral gap that are polynomial in the system size,
with universal, dynamical critical exponents (see e.g., [39, 61, 82, 89]).
In the case of q = 2, Holley [50] showed that the continuous-time Ising Glauber
dynamics has an auto-correlation that decays as t−1/4 and deduced that the spec-
tral gap decays at least polynomially in n. Lubetzky and Sly [64] proved that
the inverse spectral gap of the critical Ising Glauber dynamics on Λn,n is truly
polynomial in n (independently of its boundary conditions). They relied on a
recursive block dynamics approach, along with the RSW estimate of [33] to couple
Ising configurations beyond dual-crossings with a positive probability. This implied
95
that the inverse gap on an n× n box is at most a constant multiple of the inverse
gap on an n2× n
2box; recursing O(log n) times yields a polynomial upper bound on
the inverse gap.
In Section 4.1, we use a similar approach to analyze the q = 3, 4 Potts Glauber
dynamics. With the RSW estimates described in Section 3.1 in hand, the argument
of [64] extends to q = 3; at q = 4 we use the lower-bounds on crossing probabilities
from Section 3.1.1 in place of the RSW estimates up to the boundary. Also, though
at q = 3, 4 we do not have access to precise arm exponents as was necessary for the
mixing time lower bounds of [50, 64], we use the lower bounds on the arm exponent
from Section 3.1.2 to give a polynomial lower bound for the inverse spectral gap.
In Section 4.2, we prove corresponding upper bounds on the mixing times of
the FK Glauber dynamics. We would like to adapt a similar strategy, following a
recursive approach and using the RSW estimates at q ∈ (1, 4] to couple beyond a
dual-crossing, whose existence has a positive probability. However, as indicated
in Section 3.2 significant complications arise due to the non-locality of the FK
boundary conditions induced on sub-blocks. These difficulties are discussed in
detail, together with a road map of how we deal with them, in Section 4.2.1.
4.1 Potts Glauber dynamics at continuous phase
transition points
This section contains the proof of Theorem 1 (as well as its analogs for boxes
with non-periodic boundary conditions); recall from Theorem 1.2.2 that it suffices
to prove the desired bounds for Glauber dynamics for the Potts model in order
to obtain them for FK Glauber as well as Swendsen–Wang and Chayes–Machta
96
dynamics at integer q. Consider Λ = Λn,n′ = J0, nK× J0, n′K for n′ = bαnc, where
α ∈ [α, 1] for some fixed 0 < α ≤ 12.
4.1.1 Mixing under arbitrary boundary conditions
We first establish analogues of Eqs. (1.1.1)–(1.1.2) for Glauber dynamics for the
Potts model with arbitrary boundary conditions, modulo an equilibrium estimate
on crossing probabilities at q = 4 which we established in §3.1.1. Whenever we refer
to arbitrary or fixed boundary conditions we mean ones that include an assignment
of a color, or free to each of the vertices of ∂Λ (in contrast to periodic). The
following is a general form of the approach of [64] to proving upper bounds on
mixing times in the presence of RSW bounds; we stress that, while this proof
does extend from the Ising model to the Potts model, in fact it fails to produce a
polynomial upper bound for the critical FK model at noninteger 1 < q < 4, despite
the availability of the necessary (uniform) RSW estimates (cf. Section 4.2).
Theorem 4.1.1. Suppose q ≥ 1 and that there exists a nonincreasing sequence
(an) such that
π0Λn/3,n
(Cv(Λn/3,n)
)≥ an . (4.1.1)
Then there exists some absolute constant c > 0 such that Glauber dynamics for the
Potts model on Λ = Λn,n′ with arbitrary boundary conditions, ζ, satisfies
gap−1p ≤ (c an)−2 log3/2 n .
Combining the RSW bound of Theorem 3.1.1 with Theorem 4.1.1 establishes
97
the analog of Eq. (1.1.1) for a rectangle with arbitrary (non-periodic) boundary con-
ditions. At q = 4, we use the polynomially decaying bound on crossing probabilities
uniform in boundary conditions (see Theorem 3.1.5), through which Theorem 4.1.1
will yield the matching quasi-polynomial upper bound on mixing.
Corollary 4.1.2. There exist absolute constants c1, c2 > 0 such that Glauber
dynamics for the critical 3-state Potts model on Λ with arbitrary fixed boundary
satisfies
gap−1p . nc1 ,
whereas for the 4-state critical Potts model on Λ with arbitrary boundary conditions,
gap−1p . nc2 logn .
From this corollary, Eqs. (1.1.1)–(1.1.2) of Theorem 1 follow by moving from
the box to the torus exactly as done in [64, Theorem 4.4] (see also §4.1.2). For
q = 2, 3, 4, by Theorem 1.2.2, these imply the analogous upper bounds on the
inverse gap of the Swendsen–Wang dynamics, as well as Glauber dynamics for the
FK model.
Proof of Theorem 4.1.1. We use the block dynamics technique of Theorem 2.2.8
used in [64]. Define two sub-blocks of Λ, as follows:
Bw := J0, 2n3
K× J0, n′K , Be := Jn3, nK× J0, n′K .
Then let B denote the block dynamics on Λ with sub-blocks Bw, Be as defined
in §2.2.2. We bound gapB(Λζ) and gapp(Bϕi ) of Theorem 2.2.8 uniformly in ζ, ϕ.
Lemma 4.1.3. For any two initial configurations σ, σ′ on Λ with corresponding
98
block dynamics chains Xt and Yt, there exists an absolute constant c > 0 such that,
if (an) is a sequence satisfying (4.1.1), there is a grand coupling, such that P(X1 6=
Y1) ≤ 1 − c an. Moreover, there exists some c′ > 0 such that gapB(Λζ) ≥ c′ an
uniformly in ζ.
Proof. We construct explicitly a grand coupling that allows us to couple the two
configurations with the above probability. First recall that the Potts boundary
condition ζ on Λ corresponds to an FK boundary condition ξ where two boundary
vertices are in the same cluster if and only if they have the same color, along with
the decreasing event Fζ = ω : ∀x, y ∈ ∂Λ, ζ(x) 6= ζ(y) =⇒ xΛ←→6 y. Via the
Edwards–Sokal coupling, we move from the Potts model with boundary ζ to the
corresponding FK model with boundary ξ conditional on the event Fζ .
Suppose the clock at block Bw rings first. The two initial configurations σ, σ′
induce two Potts boundaries η, η′ corresponding to FK boundaries ψ, ψ′ on ∂eBw
along with the events Fη,ζ and Fη′,ζ ; (η, ζ) is the boundary condition on B1 with η
on ∂eB1 and ζ∂Bwon the rest of ∂Bw. Here and throughout the rest of the paper,
when discussing boundary conditions, we use the restriction to a line to denote the
boundary condition induced on that line by the configuration we have revealed.
We seek to couple the two initial configurations on all of Λ by first coupling
them on Λ − Boe. For each initial configuration, the block dynamics samples a
Potts configuration on Bw by sampling an FK configuration from πψ,ξBw(· | Fη,ζ),
πψ′,ξ
Bw(· | Fη′,ζ).
Via the grand coupling defined in §2.2.1 of all boundary conditions on ∂eBw,
we reveal the open component of ∂eBw in order to condition on the right-most dual
vertical crossing of Λ−Boe. Note that all FK measures we consider are stochastically
dominated by π1,ξBw
(by monotonicity in boundary conditions and since Fη,ζ is a
99
decreasing event). Then if a sample from π1,ξBw
has a dual vertical crossing in Bw∩Be,
under the grand coupling, so will all the samples of πψ,ξBw(· | Fη,ζ).
Under the event Fη,ζ , by construction it is impossible to add boundary connec-
tions by modifying the interior of Bw (either such connections would be between
monochromatic sites in which case they are already in the same cluster, or otherwise
such connections are impossible under Fη,ζ). Thus, if there is such a dual vertical
crossing under π1,ξBw
, the event Fη,ζ ensures that to the west of that crossing, all
realizations of πψ,ξBwsee the same boundary conditions. By the domain Markov
property, the grand coupling then couples all such realizations west of the right-
most dual-crossing of π1,ξBw
and therefore on all of Λ−Boe (for the explicit revealing
procedure, see [64, §3.2]). We then use the same randomness to color coupled
clusters the same way, and couple all corresponding Potts configurations on Λ−Boe.
The colorings of the boundary clusters are predetermined, but because the ∂eBw
boundary clusters cannot extend past the dual vertical crossing, the two Potts
configurations can be coupled west of the dual vertical crossing.
Suppose the clock at block Be rings next. If we have successfully coupled Xt
and Yt in Λ−Boe, then the identity coupling couples the configurations on all of Λ.
But note that by the assumption of Theorem 4.1.1, and the fact that n′ ≤ n,
π1Bw
(C∗v(Jn3 ,
2n3
K× J0, n′K))≥ an .
Moreover, by time t = 1, there is a probability c > 0 that the dynamics rang
the clock of Bw and then the clock of Be in which case we have coupled the two
configurations with probability an at time t = 1. By the submultiplicativity of d(t),
100
for all t > 0,
P(Xt 6= Yt) ≤ (1− c an)t ,
which implies that there exists a new constant c > 0 such that tmix ≤ c/an and
(log 12ε
)(gapB(Λζ))−1 ≤ tmix(ε) ≤ c/an .
In particular there exists c′ > 0 such that (gapB(Λζ))−1 ≤ c′/an.
By Theorem 2.2.8 there exists c > 0 such that we get the following relation
between the gap of Glauber dynamics on Λ and Glauber dynamics on the blocks
Bi, i ∈ e,w:
(gapp(Λζ))−1 ≤(c an)−1 max
imaxσ
(gapp(Bσi )−1.
However, each Bi is a rectangle Λ2n/3,n′ with some fixed boundary conditions and
one can check that for α ∈ [α, 1], it also has, up to rotation, aspect ratio αi ∈ [α, 1].
It follows that maxi maxσ(gapp(Biσ))−1 satisfies the same relation as (gapp(Λ
ζ))−1.
Recursing 2 log3/2 n times yields the desired bound on (gapp(Λζ))−1 for any ζ.
4.1.2 Periodic boundary conditions
We now complete the proof of Theorem 1 by transferring the bounds on mixing
times under fixed boundary conditions to the torus. The translation of the mixing
time bounds is identical to the proof of [64], and we therefore do not lay out the
details. A similar (but more complicated) translation of mixing time bounds from
fixed boundary conditions to the torus, is used in Section 4.2.6, whose full details
are included.
101
Proof of Theorem 1. The proof to go from arbitrary boundary conditions to the
torus is the same as the proof of Theorem 4.4 of [64] which used block dynamics
twice to first reduce mixing on the torus (Z/nZ)2 to a cylinder (Z/nZ)× J0, n′K,
and then that cylinder to a rectangle with fixed boundary conditions, on which
Corollary 4.1.2 gives the desired polynomial (quasi-polynomial) mixing time bound.
We only observe that the proof goes through after replacing the RSW bounds there
by the estimate in Theorem 3.1.5, and conditioning on the event Fζ as before.
4.1.3 Polynomial lower bounds
In order to provide as complete a picture as possible, we also extend the
polynomial lower bound of [64] to the Glauber dynamics for the q = 3, 4 Potts
models, showing that indeed they undergo a critical slowdown. We do not have
access to precise arm exponents as exist for q = 2, so instead we appeal to the
arm-exponent lower bound derived in Lemma 3.1.9.
Theorem 4.1.4. Let q = 3, 4 and consider the critical Potts model on Λ = J0, nK2
with boundary conditions η. The continuous-time Glauber dynamics has gap−1 & cn
for some c(q) > 0, and the same holds on (Z/nZ)2.
Proof. Now that we have a bound on connection probabilities macroscopically
away from boundaries, we modify the lower bound of [64] to our setting. Fix any
boundary condition η (if we are considering the torus, reveal σ∂Λ and fix that to
be your boundary condition η). Let Λ1 = Jn4, 3n
4K2 and Λ2 = Jn
3, 2n
3K2. Recall the
variational form of the spectral gap, Eq. (2.2.2), and consider the test function
f(σ) =∑
x∈Λ21σ(x) = q. Since f is 1-Lipschitz, E(f, f) is easily seen to be
O(n2). We now lower bound Varµ(f). To do so, we move to the FK representation
102
of the Potts model on Λ via the Edwards–Sokal coupling using the event Fη.
For the remainder of this proof only, let E and Var be with respect to the joint
distribution over FK and Potts configurations given by the Edwards–Sokal coupling.
By Theorem 3.1.3, we see by the FKG inequality,
πξΛ(C∗o(Λ− Λ1) | Fη) ≥ π1Λ(C∗o(Λ− Λ1)) ≥ c1
for some c1(q) > 0. By the law of total variance and the above, we see that
Varµ(f) ≥ E[Var(f | ωΛ−Λ1
)1C∗o(Λ− Λ1)]≥ c1Var(f | ωΛ−Λ1
, ∂Λ1←→6 ∂Λ) .
But given that ∂Λ1←→6 ∂Λ, the probability of σ(x) = q for x ∈ Λ2 is 1/q; in
particular, by the Edwards–Sokal coupling and FKG, we can expand the above as
Var(f | ωΛ−Λ1, ∂Λ1←→6 ∂Λ) ≥ q−1
∑x,y∈Λ2
πξΛ(x←→ y | ωΛ−Λ1, ∂Λ1←→6 ∂Λ)
≥ q−1∑x,y∈Λ2
π0Λ1
(x←→ y) ≥ c2n3
for some c2(q) > 0, where the last inequality follows from (3.1.5) of Lemma 3.1.9.
4.2 FK Glauber dynamics at continuous phase
transition points
In this section, we prove Theorem 3. While we would still like to use a recursive
approach based on coupling beyond RSW-type dual-crossings. However, we run
into significant difficulties due to non-localities inherent to the FK model, as can
103
be encoded into what we referred to as boundary bridges in Setion 3.2. To be more
precise about the obstacle posed by having multiple bridges over an edge, recall
the following. In Section 4.1, the upper bounds on the mixing time of the Potts
models at β = βc for q ∈ 2, 3, 4 relied on RSW bounds [33, 34] to expose dual-
interfaces in the FK representation, beyond which block dynamics chains could be
coupled. However, the fact that chains, started from any two initial configurations,
could be coupled past a dual-interface, relied crucially on the conditional event Fζ ,
which was inherent to the Edwards–Sokal coupling of the models with boundary
conditions (that no distinct boundary components were connected in the interior
configuration). Without this conditioning, connections between two components
on one side of a rectangle alter the boundary conditions elsewhere via bridges over
the dual-interface, preventing coupling (see Fig. 1.5).
Moreover, as we saw in Section 3.2, such boundary bridges cannot be avoided—
indeed, as Corollary 3.2.9 indicated, the typical boundary condition that will be
induced on a sub-block in the block dynamics will have many long-range boundary
bridges over any fixed edge.
In Section 4.2.1, we sketch out the proof approach and the necessary modi-
fications to the block-dynamics scheme of the preceding section to allow us to
disregard unfavorable boundary conditions with atypically many bridges. We then
introduce these dynamical tools and their applications, and prove the main theorem
in Sections 4.2.4–4.2.6.
4.2.1 Main techniques
To convert our upper bounds on bridges from Section 3.2 to an upper bound
on the mixing time, our dynamical analysis restricts its attention to boundary
104
conditions with |Be| = O(log n) for every e ∈ ∂Λ; we call such boundary conditions
“typical” (see Definitions 4.2.11–4.2.12) and observe that wired and free boundary
conditions are both typical.
To maintain “typical” (as opposed to worst-case) boundary conditions through-
out the multi-scale analysis, we turn to the Peres–Winkler censoring inequalities [76]
for monotone spin systems, that were used in [70] (then later in [63]) for the Ising
model under “plus” boundary, a class of boundary conditions that dominate the
plus phase (observe that, in contrast, “typicality” is not monotone).
A major issue when attempting to carry out this approach—adapting the
analysis of the low temperature Ising model to the critical FK model—is the stark
difference between the nature of the corresponding equilibrium estimates needed
to drive the multi-scale analysis. In the former, crucial to maintaining “plus”
boundary conditions throughout the induction of [70] was that in the presence of
favorable boundary conditions, the multiscale analysis could be controlled except
with super-polynomially small probability. This yielded a bound on coupling the
dynamics started at the extremal (plus and minus) initial configurations, which a
standard union bound over the O(n2) sites of the box (see (2.2.7)) then transformed
to a bound on tmix.
We wish to couple the dynamics from the extremal (wired and free) initial
configurations, since arbitrary starting states may induce boundary conditions on
the smaller scales that are not “typical.” Even in the ideal scenario where the
induced boundary conditions have no bridges, though, the probability that we fail
to couple the dynamics from wired and free initial states is at least 1− ε (as per
the RSW estimates). In particular, even in this ideal setting, we could not afford
the O(n2) factor of translating this to a bound on tmix—the actual setting is far
105
worse, replacing the failure probability by 1−n−c (Proposition 3.1.6). An approach
based on the classical block dynamics recursion on the spectral gap (see [67] and
the proofs in [42, 64]) would force one to analyze the dynamics under worst-case
boundary, whereas we would like to restrict attention to the typical boundary
conditions encountered throughout the dynamical process.
Therefore, in Definition 4.2.7 we construct a censored dynamics that mimics
a block dynamics chain, and bound the total variation distance between their
distributions in terms of the probability that we encounter unfavorable boundary
conditions on the sub-blocks before mixing (Proposition 4.2.8). By doing so, we
compare the censored dynamics to the block dynamics with boundary conditions
modified to eliminate all O(log n) bridges over certain edges, paying a cost of nc in
the mixing time. We then let the block dynamics run some nc rounds (paying a
union bound for the probability of ever encountering atypical boundary conditions,
bounded in Corollary 4.2.18). The block dynamics would be making nc many
independent attempts at coupling (possible due to the absence of bridges over
a particular edge) beyond a dual-crossing whose existence has probability n−c/2:
see Lemma 4.2.16. This polynomial bound on the block dynamics coupling time
translates to a quasi-polynomial bound for the censored dynamics via O(log n)
recursions onto smaller scale blocks, yielding the bound on tmix.
Finally, since periodic boundary conditions do not fall in our class of “typical”
boundaries, in Section 4.2.6 we extend this bound first to cylinders, and then to
the torus.
106
Dynamical tools
In this section, we introduce the main techniques we use to control the total
variation distance from stationarity for the FK Glauber dynamics. For the rest of
this chapter, tmix will always be with respect to the FK Glauber dynamics.
4.2.2 Modifications of boundary conditions
Crucial to the proof of Theorem 3 is the modification of boundary bridges so
that we can couple beyond FK interfaces as done in [64]; in this subsection we
define boundary condition modifications and control the effect such modifications
can have on the mixing time.
Definition 4.2.1 (segment modification). Let ξ be an FK boundary condition on
a rectangle Λ which corresponds to a partition P1, ...,Pk of ∂Λ, and let ∆ ⊂ ∂Λ.
The segment modification on ∆, denoted by ξ∆, is the boundary condition that
corresponds to the partition P1 − V (∆), ...,Pk − V (∆) ∪⋃v∈V (∆)v of ∂Λ.
Definition 4.2.2 (bridge modification). Let ξ be an FK boundary condition on
∂Λ, corresponding to a partition P1, . . . ,Pk of ∂Λ. Let Be be the set of disjoint
bridges in ξ∂nΛ over the edge e = (x, y) ∈ ∂nΛ, corresponding to the components
Pij`j=1, as per Definition 3.2.3. The bridge modification of ξ over e, denoted ξe,
is the boundary condition associated to the partition where every Pij is split into
two components,
Pwij
= (v1, v2) ∈ Pij : v1 − x < 0 and Peij
= (v1, v2) ∈ Pij : v1 − x > 0 .
(Observe that, in particular, ξe has no bridges over e.) Define the bridge modification
107
w.r.t. the other sides of ∂Λ analogously.
Definition 4.2.3 (side modification). Let ξ be a boundary condition on ∂Λ,
corresponding to a partition P1, . . . ,Pk of ∂Λ. The side modification ξs is defined
as follows. Split every Pj into its four sides, that is, for i = n, s,e,w, let
P ij = v ∈ Pj : v ∈ ∂iΛ ,
where for the corner vertices the choice is arbitrary (for concreteness, associate the
corner with the side that follows it clockwise). Then for every ξ, the modified ξs
has no components that contain vertices in more than one side of ∂Λ.
Recalling the definition of distances between boundary conditions, we have the
following.
Fact 4.2.4. For a segment ∆, we have d(ξ, ξ∆) ≤ |V (∆)|; for an edge e, we have
d(ξ, ξe) = |Be|; for the side modification ξs, we have that d(ξ, ξs) is bounded above
by three times the number of components in ξ with vertices in multiple sides of ∂Λ.
4.2.3 Censored block dynamics
We next define the censored and systematic block dynamics whose coupling is
the core of the dynamical analysis used to prove Theorem 3. This coupling may
be of general interest in the study of mixing times of monotone Markov chains,
where one only has control on mixing times in the presence of favorable boundary
conditions. We therefore present it in more generality than necessary for the proof
of Theorem 3: consider the heat-bath dynamics for a monotone spin or edge system
on a graph G with boundary ∂G that satisfies the domain Markov property and
has extremal configurations 0, 1 and invariant measure πξG.
108
Definition 4.2.5 (systematic block dynamics). Let B0, . . . , Bs−1 denote a finite
cover of G and for k ≥ 1 let ik := (k − 1) mod s.
The systematic block dynamics (Yk)k≥0 for the FK model is a discrete-time flavor
of the block dynamics w.r.t. Bi, with blocks that are updated in a sequential
deterministic order: at time k, the chain updates E(Bik) by resampling ωE(Bik ) ∼
πξG(· | ωE(G)−E(Bik )).
Remark 4.2.6. The systematic block dynamics as defined has unique invariant
measure πξG, but it is neither time-homogenous nor reversible. If one wanted
a time-homogenous and reversible analogue, one could, e.g., in each time step
update all s blocks sequentially in forward and then reverse order, i.e., in the order
(B0, ..., Bs−1, ..., B0).
Definition 4.2.7 (censored block dynamics). Let B0, ..., Bs−1 be as before and
consider a set Γi of permissible boundary conditions for Bi, and fix ε > 0. The
censored block dynamics (Xt)t≥0 is the continuous-time single-bond (single-site)
heat-bath dynamics that simulates Yk as follows. For a given ε > 0, define
T = T (ε) = maxi
maxξ∈Γi
tξ,Bimix (ε) , (4.2.1)
where tξ,Bimix is the mixing time of standard FK Glauber dynamics on the block Bi
with boundary conditions ξ. Let ik := (k − 1) mod s and let the chain Xt be
obtained from the standard heat-bath dynamics by censoring, as in Theorem 2.2.7,
for every integer k ≥ 1, along the interval ((k − 1)T, kT ], all updates except those
in Bik .
Recall the notation dξω0(t) and the abbreviation Pξω0
(Xt ∈ ·) from §2.2.1.
109
Proposition 4.2.8 (comparison of censored / systematic block dynamics). Let
(Xt)t≥0 and (Yk)k≥0 be the censored and systematic block dynamics, respectively,
w.r.t. some blocks B0, . . . Bs−1 and permissible boundary conditions Γi on G with
boundary conditions ξ and initial state ω0, as per Definitions 4.2.5–4.2.7. Let
ρ := maxk≥1
maxi∈J0,s−1K
Pω0
(Yk∂Bi /∈ Γi
), (4.2.2)
where Yk∂Bi is the boundary condition induced on ∂Bi by the configuration Yk on
G \Bi. Then for every ε > 0, every integer k ≥ 0, and T as in (4.2.1),
∥∥Pω0
(XkT ∈ ·
)− Pω0 (Yk ∈ ·)
∥∥tv≤ k(ρ+ ε) . (4.2.3)
Remark 4.2.9. Although we defined the systematic and censored block dynamics
for deterministic block updates, one could easily formulate the same bound for
the usual block dynamics with random updates, where the s sub-blocks are each
assigned i.i.d. Poisson clocks (cf. [67]), by also randomizing the order in which the
censored block dynamics updates sub-blocks, using the identity coupling on the
corresponding clocks.
Proof of Proposition 4.2.8. We now prove Eq. (4.2.3) by induction on k. Fix
any ω0 and let δk =∥∥Pω0
(XkT ∈ ·
)− Pω0 (Yk ∈ ·)
∥∥tv
denote its left-hand side;
observe that δ0 = 0 by definition, and suppose that δk ≤ k(ρ + ε) for some k.
Denote by i = ik+1 the block that is updated at time k + 1 by the systematic block
dynamics, and let X(i)t and Y
(i)k be the censored and systematic chains corresponding
to the block sequence (B(i+`) mod s)`≥0 (where the block Bi is the first to be updated).
110
By the Markov property and the triangle inequality,
δk+1 ≤1
2
∑ω,ω′
(∣∣∣Pω0(XkT = ω′)− Pω0(Yk = ω′)∣∣∣Pω′(X(i)
T = ω)
+∣∣∣Pω′(X(i)
T = ω)− Pω′(Y (i)1 = ω)
∣∣∣Pω0(Yk = ω′)
)= δk +
∑ω′
Pω0(Yk = ω′)∥∥∥Pω′(X(i)
T ∈ ·)− Pω′(Y (i)1 ∈ ·)
∥∥∥tv. (4.2.4)
The last summand in (4.2.4) satisfies
∑ω
Pω0(Yk = ω)∥∥∥Pω(X
(i)T ∈ ·)− Pω(Y
(i)1 ∈ ·)
∥∥∥tv
≤ Pω0(Yk∂Bi ∈ Γi) maxω:ω∂Bi∈Γi
∥∥∥Pω(X(i)T ∈ ·)− Pω(Y
(i)1 ∈ ·)
∥∥∥tv
+ Pω0(Yk∂Bi /∈ Γi)
≤ (1− ρ)ε+ ρ ,
by the definition of T = T1(ε) and ρ; here we identified the configuration on G−Bi
with the boundary it induces on ∂Bi. Combined with Eq. (4.2.4), this completes
the proof of Eq. (4.2.3).
Remark 4.2.10. In the setting of Proposition 4.2.8, when the initial state is
ω0 ∈ 0, 1 (either minimal or maximal), one can obtain the following improved
bound. Set
T = maxi
maxξ∈Γi
tξ,Bimix (ω0Bi , ε) , (4.2.5)
where tξ,Bimix (ω0, ε) = inft : dξω0(t) ≤ ε, relaxing the previous definition (4.2.1) of T
to only consider the initial state ω0. Let (Xt) be the censored block dynamics w.r.t.
this new value of T , and denote by (X ′t) the modification of (Xt) where, for every
111
k ≥ 1, the configuration of the block Bik (i.e., the block that is to be updated in
the interval ((k − 1)T, kT ]) is reset at time (k − 1)T to the original value of ω0
on that block. We claim that (4.2.3) holds1 for the relaxed value of T in (4.2.5) if
we replace Xt by X ′t. Indeed, all the steps in the above proof of Proposition 4.2.8
remain valid up to the final inequality, at which point the fact that we consider X ′t
(as opposed to Xt) implies that
maxω:ω∂Bi∈Γi
∥∥∥Pω(X(i)T ∈ ·)− Pω(Y
(i)1 ∈ ·)
∥∥∥tv
= maxξ∈Γi
∥∥∥Pω0Bi(X
(i)T ∈ ·)− π
ξBi
∥∥∥tv,
which is at most ε when T is as defined in (4.2.5).
Proof of main result
In what follows, we prove Theorem 3 by combining the equilibrium estimates
of §3.2 with the dynamical tools provided in §4.2.1. We first establish an analog of
Theorem 3 (Theorem 4.2.14) for “typical” boundary conditions (defined in §4.2.4
below), and then, using Proposition 3.2.13, derive from it the case of periodic
boundary conditions in §4.2.6. The effect of boundary bridges (which may foil the
multiscale coupling approach, as described in §4.2.1) is controlled by restricting
the analysis to those boundary conditions that have O(log n) bridges, and applying
Proposition 4.2.8 to bound the mixing time under such boundary conditions. We
now define the favorable boundary conditions for which we prove a mixing time
upper bound of nO(logn).
1In fact, (4.2.3) is valid for X ′t with the relaxed T in (4.2.5) for every ω0, not just for the maximal
and minimal configurations; however, it is when ω0 ∈ 0, 1 that the modified dynamics X ′t can
easily be compared to Xt, and thereafter to Xt, via the censoring inequality of Theorem 2.2.7.
112
4.2.4 Typical boundary conditions
We define the class of “typical” boundary conditions on a segment (e.g., ∂nΛ).
Definition 4.2.11 (typical boundary conditions on a segment). For K > 0, N ≥ 1,
and a segment L, let ΞK,N be the set of boundary conditions ξ on L such that
|Be(ξ)| ≤ K logN for every e ∈ L .
We will later see (as a consequence of Lemma 4.2.17 below) that the boundary
conditions on each of the sides of a box Λ induced by the infinite-volume FK measure
πZ2 belong to the class of “typical” boundary conditions with high probability.
Next, we define the global property we require of typical boundary conditions.
Definition 4.2.12 (typical boundary conditions on ∂Λ). Let ΥK1,K2,N = ΥΛK1,K2,N
be the set of boundary conditions ξ on ∂Λ such that ξ∂iΛ ∈ ΞK1,N for every
i = n, s,e,w, and ξ has at most K2 logN distinct components with vertices on
different sides of ∂Λ.
Remark 4.2.13. The wired and free boundary conditions on a side ∂iΛ are always
in ΥK1,K2,N whenever K1 logN ≥ 1 and K2 logN ≥ 1 (in the former all vertices are
in just one component and in the latter no two vertices are in the same component).
4.2.5 Mixing under typical boundary conditions
Since periodic boundary conditions are not in ΥK1,K2,N for any K2 > 0, we first
bound the mixing time on rectangles ΛN,N ′ where N ′ = bαNc for α ∈ (0, 1], with
boundary conditions ξ ∈ ΥK1,K2,N .
113
Theorem 4.2.14. Let q ∈ (1, 4] and fix α ∈ (0, 1] and K1, K2 > 0. Consider the
Glauber dynamics for the critical FK model on ΛN,N ′ with αN ≤ N ′ ≤ N and
boundary conditions ξ ∈ ΥK1,K2,N . Then there exists c = c(α, q,K1, K2) > 0 such
that
tmix . N c logN .
Observe that if we define
ΥK,N := ΥK,2K,N , (4.2.6)
clearly ΥK1,K2,N ⊂ ΥmaxK1,K2,N , so it suffices to consider ΥK,N for general K > 0.
The proof of Theorem 4.2.14 proceeds by analyzing the censored and systematic
block dynamics on Λ, obtaining good control on the systematic block dynamics
using the RSW estimates of [34], then comparing it to the censored block dynamics.
The choice of parameters for which we will apply Proposition 4.2.8 is the following.
Definition 4.2.15 (block choice for censored / systematic block dynamics). Let
q ∈ (1, 4] and for any n′ ≤ n ≤ N , consider the critical FK Glauber dynamics on
Λn,n′ . Let
Be = Jn4, nK× J0, n′K ,
Bw = J0, 3n4
K× J0, n′K ,
ordered as B0 = Be, B1 = Bw as in the setup of Proposition 4.2.8. For K =
maxK1, K2 given by Theorem 4.2.14, let Γi = ΥK,N be the set of permissible
boundary conditions for the block Bi in Λn,n′ .
Before proving Theorem 4.2.14 we will prove two lemmas that will be necessary
114
for the application of Proposition 4.2.8. We first introduce some preliminary
notation.
For any n ≤ N , label the following edges in ∂Λn,n′ :
e?s = (bn2c+ 1
2, 0) , and e?n = (bn
2c+ 1
2, n′) .
Recall the definitions of the bridge modification ξe and the side modification ξs
from Definitions 4.2.2–4.2.3. We will, throughout the proof of Theorem 4.2.14, for
any boundary condition ξ on ∂Λn,n′ , let the modification ξ′ ≤ ξ be given by
ξ′ := ξe?s ∧ ξe?n ∧ ξs , (4.2.7)
i.e., the bridge modification of ξ on e?s and e?n, combined with the side modification ξs.
If ΞK,N ,ΥK,N are the sets of boundary conditions defined in Definition 4.2.15,
we let Ξ′K,N ,Υ′K,N be the sets corresponding to the modification ξ 7→ ξ′ of every
element in the original sets. Observe that Υ′K,N ⊂ ΥK,N and likewise, Ξ′K,N ⊂ ΞK,N .
Lemma 4.2.16. Let α ∈ (0, 1] and consider the systematic block dynamics Ykk∈N
on Λn,n′ with bαnc ≤ n′ ≤ n and blocks given by Definition 4.2.15. There exist
cY , c?(α, q) > 0 such that for every two initial configurations ω1, ω2, and every
boundary condition ξ on ∂Λn,n′, modified to ξ′ by Eq. (4.2.7), for all k ≥ 2,
‖Pξ′ω1(Yk ∈ ·)− Pξ′ω2
(Yk ∈ ·)‖tv ≤ exp(−cY kn−c?) .
In particular, for all k ≥ 2,
maxω0
‖Pξ′ω0(Yk ∈ ·)− πξ
′
Λn,n′‖tv ≤ exp(−cY kn−c?) .
115
n4
n2
3n4
ξ′
1/0 Be
Figure 4.1: If the depicted dual-crossing exists under any (ξ′, ωi), and the bridgesover e∗s , e
∗n are disconnected, one can couple the two chains on the green shaded
region, and in particular on Be −Bw.
Proof. We construct a coupling between the two systematic block dynamics chains,
starting from two arbitrary initial configurations ω1, ω2, as follows. The systematic
block dynamics first samples a configuration on Be (the interior of Be) according to
πξ′,ωiBe
for i = 1, 2, where (ξ′, ωi) is the boundary condition induced by ωiBw−Be∪ ξ′
on ∂Be. By Proposition 3.1.6, applied to the box
R∗ = B∗w ∩B∗e ,
and monotonicity in boundary conditions,
π1Be
(e?sR∗←→ e?n) & n−c? ,
where c?(min12, α, ε = 1
4, q) > 0 is given by that proposition.
116
We can condition on the west-most vertical dual-crossing between e?s and
e?n (if such a dual-crossing exists) as follows: reveal the open components of
∂Be ∩ J0, bn2cK× J0, n′K, so that no edges in other components are revealed (as in
e.g., steps (1)–(4)). If the open components do not connect to the eastern half of
∂R∗, i.e., to ∂R∗ ∩ Jbn2c+ 1, nK× J0, n′K, then it must be the case that the desired
open dual-crossing exists and can be exposed without revealing any information
about edges east of it.
By monotonicity in boundary conditions, if under π1Be
a vertical dual-connection
from e?s to e?n exists, the grand coupling (see §2.2.1) ensures that the same under
πξ′,ωiBe
for any ωiΛn,n′−Be. By definition of the modification ξ′, there are no bridges
over e?s , no bridges over e?n, and no components of ξ′ with vertices in multiple sides
of ∂Λn,n′ ; thus, conditional on this vertical dual-crossing, the following event holds:
⋂v∈ ∂R∩J0, n
2K×J0,n′K
w∈ ∂R∩Jn2,nK×J0,n′K
ω : v←→6 w through ω and connections in ξ′ .
By the domain Markov property (see Fig. 4.1), for any pair ω1Bw−Beand ω2Bw−Be
,
πξ′,ω1
Be
(ωJ 3n
4,nK×J0,n′K
∣∣ e?s R∗←→ e?n
)d= πξ
′,ω2
Be
(ωJ 3n
4,nK×J0,n′K
∣∣ e?s R∗←→ e?n
),
using that the boundary conditions to the east of the vertical dual-crossing are the
same under both measures. (In the presence of bridges over e?s or e?n the above
distributional equality does not hold; different configurations west of such a dual-
crossing could still induce different boundary conditions east of the dual-crossing,
preventing coupling (as illustrated in Fig. 1.5)—cf. the case of integer q where this
problem does not arise thanks to the event Eζ .)
117
This implies that, on the event e?sR∗←→ e?n, the grand coupling couples the two
systematic block dynamics chains so that they agree on E(Λn,n′) − E(Bw) with
probability 1. In this case, let η be the resulting configuration on E(Be)− E(Bw),
so that
η = Y1E(Be)−E(Bw) .
If the two chains were coupled on E(Be)− E(Bw), the boundary conditions (ξ′, η)
induced on ∂Bw would be the same for any pair of systematic block dynamics
chains with initial configurations ω1, ω2; in particular the identity coupling would
couple them on all of Λn,n′ in the next step when Bw is resampled from πξ′,ηBw
. Thus,
for some c > 0,
‖Pξ′ω1(Y2 ∈ ·)− Pξ′ω2
(Y2 ∈ ·)‖tv ≤ 1− cn−c? .
Since the systematic block dynamics is Markovian and all of the above estimates
were uniform in ω1 and ω2, the probability of not having coupled in time k under
the grand coupling is bounded above by
(1− cn−c?)bk/2c ≤ exp(−cbk/2cn−c?) .
The next lemma will be key to obtaining the desired upper bound on ρ as
defined in (4.2.2); it shows that with high probability, the boundary conditions
induced by the FK measure on a segment will be in ΞK,N , hence the term “typical”
boundary conditions.
Lemma 4.2.17. Fix q ∈ (1, 4]. There exists cΥ(q) > 0 so that, for every ΞK,N
given by Definition 4.2.11 on Λn,n′ with n′ ≤ n ≤ N and K > 0, and every boundary
118
condition ξ,
πξBe(ω∂eBw
/∈ ΞK,N) . N−cΥK ,
where ω∂eBwdenotes the boundary conditions induced on ∂eBw by ωE(Be)−E(Bw)∪ξ.
The same statement holds when exchanging e and w.
Proof. By symmetry, it suffices to prove the bound for the boundary conditions on
∂eBw. Consider the rectangle
R = Jn2, nK× J0, n′K .
By Proposition 3.2.8 with aspect ratio 12, there exists c(q) = c(α = 1
2, q) > 0 such
that, for every edge e ∈ ∂eBw and every boundary condition η on ∂R,
πηR(|Be| ≥ K logN) . N−cK ,
where, for a configuration ωR on R, we recall that |Be| is the number of disjoint
bridges in ωRR−Bw∪ ξR over e. A union bound over all n′ edges on ∂eBw implies
that
maxηπηR(ω∂eBw
/∈ ΞK,N) . n′N−cK . N−cK+1 ,
using n ≤ N . Consequently,
πξBe(ω∂eBw
/∈ ΞK,N) = EπξBe
[πξRR (ω∂eBw
/∈ ΞK,N)]. N−cK+1 ,
where the expectation is w.r.t. πξBeover the boundary conditions ξR induced on R
by ξ and the configuration on Be −R. This concludes the proof of the lemma.
Corollary 4.2.18. Fix q ∈ (1, 4], and consider the systematic block dynamics on
119
Λn,n′ for n′ ≤ n ≤ N with block choices as given in Definition 4.2.15. There exists
cΥ(q) > 0 so that, for every fixed K > 0 and every boundary condition ξ′ ∈ Υ′K,N
on ∂Λn,n′,
ρ . N−cΥK ,
where ρ is as defined as in (4.2.2) w.r.t. the initial configuration ω0 ∈ 0, 1 and
the permissible boundary conditions ΥK,N .
Proof. Let Yk be the systematic block dynamics on Λn,n′ where n ≤ N . Recall the
definition of ρ in Eq. (4.2.2), so that in the present setting,
ρ = maxω0∈0,1
maxk≥1
maxi∈e,w
Pξ′ω0(Yk∂Bi /∈ ΥK,N) .
In the first time step, ω0Beinduces wired or free boundary conditions on ∂wBe
and so, by Remark 4.2.13, the boundary condition on ∂wBe is trivially in ΞK,N .
Furthermore, the boundary conditions on ∂n,e,sBe also belong to ΞK,N by the
hypothesis ξ′ ∈ ΥK,N . Finally, there cannot be more than 2K logN components
in the boundary condition on ∂Be consisting of vertices on multiple sides for the
following reason: as a result of the side modification on ξ′, such components can
only arise from connections between ∂wBe and the bridges in B(n/4,0) and B(n/4,n′);
however, there are at most K logN bridges in each set under any configuration
on Λ− Be (summing to at most 2K logN components, as claimed). Altogether,
Y1∂Be∈ ΥK,N deterministically.
To address all subsequent time steps, by reflection symmetry and the definition
of the systematic block dynamics, is suffices to consider Y2∂Bw. By Lemma 4.2.17,
the probability that a boundary condition on ∂eBw induced by the systematic
dynamics will not be in ΞK,N is O(N−cΥK), with cΥ > 0 from that lemma. The fact
120
that, deterministically, the boundary conditions on ∂n,s,wBw are in ΞK,N , and there
are at most 2K logN components of the boundary condition on ∂Bw containing
vertices of multiple sides of ∂Bw, follows by the same reasoning argued for the first
time step.
We are now in a position to prove Theorem 4.2.14.
Proof of Theorem 4.2.14. Consider Λ = ΛN,N ′ with aspect ratio α ∈ (0, 1] and
boundary conditions ξ ∈ ΥK,N for a fixed
K ≥ K0 := 6(c? + 1) maxc−1Υ , 1 , (4.2.8)
where c? = c?(minα, 12, 1
4, q) is the constant given by Proposition 3.1.6, and
cΥ = cΥ(q) is given by Corollary 4.2.18. It suffices to prove the proposition for all
K sufficiently large, as ΥK,N ⊂ ΥK′,N for every K ≤ K ′.
We prove the following inductively in n ∈ J1, NK: for every K > K0 as above,
every (α ∧ 12)n ≤ n′ ≤ n, and every ξ ∈ ΥK,N , if
tn = N2(c?+λ+1) log4/3 n where λ := 32K log q + 5 ,
then Glauber dynamics for the critical FK model on Λn,n′ has
‖Pξ1(Xtn ∈ ·)− Pξ0(Xtn ∈ ·)‖tv ≤ N−3 . (4.2.9)
To see that Eq. (4.2.9) implies Theorem 4.2.14, note that (2.2.7), with the choice
n = N , implies that dtv(N c(α,q) logN) = O(1/N) = o(1) for some c(α, q) > 0.
For the base case, fix a large constant M , where clearly tmix = O(1) for all n ≤M .
121
Next, let m ∈ JM,NK, and assume (4.2.9) holds for all n ∈ J1,m− 1K. Consider
the censored and systematic block dynamics, (Xt)t≥0 and Ykk≥0, respectively, on
the blocks defined in Definition 4.2.15 on Λm = Λm,m′ for some (α∧ 12)m ≤ m′ ≤ m
and boundary conditions ξ ∈ ΥK,N .
Recall that ξ ∈ ΥK,N has at most K logN bridges over any edge and at most
2K logN components spanning multiple sides of ∂Λm; thus, by Fact 4.2.4, the
boundary modification ξ′ defined in (4.2.7) satisfies d(ξ′, ξ) ≤ 8K logN . By the
definition of λ, we have |E|2q4d(ξ′,ξ) = o(Nλ). Hence, by Lemma 2.2.13 (Eq. (2.2.11),
where we increased the time on the right-hand to Nλ, for large enough N , by the
monotonicity of dtv) and the above bound on d(ξ′, ξ), we have that for all k, T ≥ 0,
‖Pξ1(XNλkT ∈ ·)− Pξ0(XNλkT ∈ ·)‖tv
≤ 2 maxω0∈0,1
‖Pξω0(XNλkT ∈ ·)− πξΛm‖tv
≤ 16 maxω0∈0,1
‖Pξ′ω0(XkT ∈ ·)− πξ
′
Λm‖tv + 2e−N
λ/4
,
and subsequently, by Theorem 2.2.7,
‖Pξ1(XNλkT ∈ ·)−Pξ0(XNλkT ∈ ·)‖tv ≤ 16 maxω0∈0,1
‖Pξ′ω0(XkT ∈ ·)−πξ
′
Λm‖tv+2e−N
λ/4
.
(4.2.10)
We will next show that the first term in the right-hand above satisfies
maxω0∈0,1
‖Pξ′ω0(XkT ∈ ·)− πξ
′
Λm‖tv = o(N−3) , (4.2.11)
which will imply (4.2.9) if we choose k, T such that NλkT ≤ tm. By triangle
122
inequality,
maxω0∈0,1
‖Pξ′ω0(XkT ∈ ·)− πξ
′
Λm‖tv
≤ maxω0∈0,1
‖Pξ′ω0(XkT ∈ ·)− Pξ′ω0
(Yk ∈ ·)‖tv + maxω0∈0,1
‖Pξ′ω0(Yk ∈ ·)− πξ
′
Λm‖tv
≤ maxω0∈0,1
‖Pξ′ω0(XkT ∈ ·)− Pξ′ω0
(Yk ∈ ·)‖tv + e−cY km−c?
,
where the last inequality is valid for every k ≥ 2 by Lemma 4.2.16. Using Υ′K,N ⊂
ΥK,N and Proposition 4.2.8,
maxω0∈0,1
‖Pξ′ω0(XkT ∈ ·)− πξ
′
Λm‖tv ≤ k(ρ+ ε(T )) + e−cY km
−c?,
and so, combined with (4.2.10),
‖Pξ1(XNλkT ∈ ·)− Pξ0(XNλkT ∈ ·)‖tv ≤ 16k(ρ+ ε(T )) + 16e−cY km−c?
+ 2e−Nλ/4
,
(4.2.12)
where ρ and ε were given in (4.2.1)–(4.2.2), that is, in our context,
ε(T ) = maxω′∈Ω
maxi∈e,w
maxζ∈Υ
BiK,N
‖Pζ,Biω′ (XT ∈ ·)− πζBi‖tv ,
ρ = maxk≥1
maxi∈e,w
Pω0
(Yk∂Bi /∈ ΥBi
K,N
).
We will bound ε(T ) by the inductive assumption for the choice of
T := ktb3m/4cNλK logN , where k := c−1
Y (c? + 6)N c? logN . (4.2.13)
In order to apply the induction hypothesis for a box whose side lengths are smaller
123
by a constant factor vs. the original dimensions of m ×m′, we repeat the above
analysis for the sub-block Bi (whose dimensions are b34mc × m′), and get from
Eq. (2.2.7) and the above arguments that
ε(T ) . N2 maxi∈e,w
maxζ∈Υ
BiK,N
‖Pζ,Bi0 (XT ∈ ·)− Pζ,Bi1 (XT ∈ ·)‖tv ,
which by reapplying (4.2.12) at the lower scale of the Bi’s implies that
ε(T ) . N2k(ρ′ + ε′( T
kNλ ))
+N2e−cY km−c?
+N2e−Nλ/4
,
where ε′(T ) and ρ′ are the counterparts of ε(T ) and ρ w.r.t. the sub-blocks (as per
Definition 4.2.15) of Bi rotated by π/2. (N.b. this rotation is crucial to ensuring
that the aspect ratios of the rectangles we consider remain uniformly bounded as we
recurse down in scale, and consequently the coupling probabilities satisfy the same
lower bound; this rotation is also what forces us to maintain “typical” boundary
conditions on all four sides of the rectangles we are considering as opposed to, say,
just on ∂e,wΛ.)
This yields the following new bound on (4.2.11):
maxω0∈0,1
‖Pξ′ω0(XkT ∈ ·)− πξ
′
Λm‖tv
. N2k2(ρ+ ρ′ + ε′( T
kNλ ))
+ kN2e−cY km−c?
+ o(N−3).
Note that the dimensions of the sub-blocks of Bi (those under consideration in
ε′(T )) are b34mc × b3
4m′c. Hence, by the inductive assumption at scale b3
4mc and
Eq. (2.2.7),
ε′(tb3m/4c
)= O(1/N) ,
124
which, along with (2.2.7) and the sub-multiplicativity of dtv(t), yields that for T
from (4.2.13),
ε′( TkNλ ) = ε′
(tb3m/4cK logN
). N−K ≤ N−6(c?+1) .
By Corollary 4.2.18, we have ρ . N−cΥK ≤ N−6(c?+1) by our choice of K0,
and similarly for ρ′. So, for k = N c?+o(1) as in (4.2.13), k2ρ . N−4c?−6+o(1) =
o(N−5), and similarly, k2ρ′ = o(N−5). Finally, this choice of k guarantees that
kN2 exp(−cY km−c?) is at most kN−c?−4 = o(N−3). Combining the last three
displays with these bounds yields (4.2.11). The proof is concluded by noting that
indeed NλkT ≤ N2c?+2λ+o(1)tb3m/4c ≤ tm.
4.2.6 Mixing on the torus
Here we extend Theorem 4.2.14 to the n×n torus, proving Theorem 3. Observe
that the periodic FK boundary conditions identified with (Z/nZ)2 in fact have
order n components with vertices on multiple sides of ∂Λ. We thus have to extend
the bound of Theorem 4.2.14 to periodic boundary conditions using the topological
structure of (Z/nZ)2. The proof draws from the extension of mixing time bounds
in [64] from fixed boundary conditions to (Z/nZ)2. In the present setting, having
to deal with a specific class of boundary conditions forces us to reapply the bridge
modification and the censored and systematic block dynamics techniques.
We first bound the mixing time on a cylinder with typical boundary conditions
125
on its non-periodic sides. In what follows, for any Λn,n′ , label the following edges:
e?sw = (0, bn′2c+ 1
2) , e?se = (n, bn′
2c+ 1
2) ,
e?nw = (0, b9n′
10c+ 1
2) , e?ne = (n, b9n′
10c+ 1
2) .
Then define the modification ξ′ of boundary conditions ξ by
ξ′ = ξe?sw ∧ ξe?se ∧ ξe?nw ∧ ξe?ne ∧ ξs (4.2.14)
and define Ξ′K,N ,Υ′K,N as before, for the new modification. We say that a boundary
condition on ∂n,sΛ is in ΥK,N if its restriction to each side is in ΞK,N and there
are fewer than 2K logN distinct components with vertices in ∂nΛ and ∂sΛ, and
analogously for boundary conditions on ∂e,wΛ.
Theorem 4.2.19 (Mixing time on a cylinder). Fix q ∈ (1, 4], α ∈ (0, 1] and K > 0.
There exists some c(α, q,K) > 0 such that the critical FK model on Λ = ΛN,N ′ with
αN ≤ N ′ ≤ α−1N and boundary conditions, denoted by (p, ξ), that are periodic on
∂n,sΛ and ξ ∈ ΥK,N on ∂e,wΛ, satisfies tmix . N c logN .
Proof. We will use a similar approach as in the proof of Theorem 4.2.14 to reduce
the cylinder to rectangles with “typical” boundary conditions. It suffices to prove
the theorem for large K, since ΥK,N ⊂ ΥK′,N for K ≤ K ′. We establish it for every
fixed
K ≥ K0 +K ′0 where K0 = 4(c? + 1)(c−1Υ ∨ 1) and K ′0 = K0(c−1
Ψ ∨ 1) ,
in which c? = c?(α5, 1
4, q) > 0 is given by Proposition 3.1.6, the constant cΥ is
c(2α5, q) > 0 from Proposition 3.2.8, and cΨ = cΨ(3α
5, q) > 0 is given by Proposi-
126
B0
B1
R
1/0
ξ′ ξ′
Figure 4.2: Left and center: block choices B0, B1 for the censored and systematicblock dynamics block dynamics chain on ΛN,N ′ with periodic boundary on ∂n,sΛ,and ξ′ on ∂e,wΛ. Right: the dual crossings in R∗s and R∗N which allow coupling onthe set R.
tion 3.2.13.
Define, as in Definition 4.2.7, the censored and systematic block dynamics on
B0 := J0, NK× J0, N ′5
K ∪ J0, NK× J2N ′
5, N ′K ,
B1 := J0, NK× J0, 3N ′
5K ∪ J0, NK× J4N ′
5, N ′K .
The choice of boundary class on Bi for i = 0, 1 is Γi = Υ3K,N . Observe that
by translating vertically on the universal cover, the blocks B0 and B1 are, by
construction, N × 45N ′ rectangles with non-periodic boundary conditions. These
blocks and the coupling scheme are depicted in Figure 4.2.
It again suffices, by Eq. (2.2.7), to show that there exists some c(α, q,K) > 0
such that
‖Pp,ξ1 (XNc logN ∈ ·)− Pp,ξ0 (XNc logN ∈ ·)‖tv ≤ N−3 . (4.2.15)
127
In the setting of the cylinder, the side modification (p, ξs) of (p, ξ) only disconnects
∂eΛ from ∂wΛ, and so, if ξ′ is as in (4.2.14), then d(ξ′, ξ) ≤ 6K logN . Thus,
by (2.2.11), the triangle inequality and Theorem 2.2.7 (as explained in the derivation
of (4.2.10)), if
tN = NλkT for λ := 24K + 5
(so that |E|2q4d(ξ′,ξ) = o(Nλ)), then for every k, T ≥ 0,
‖Pp,ξ1 (XtN ∈ ·)− Pp,ξ0 (XtN ∈ ·)‖tv
≤ 16 maxω0∈0,1
‖Pp,ξ′ω0(XkT ∈ ·)− πp,ξ
′
Λ ‖tv + 2e−Nλ/4
,
which, by Proposition 4.2.8, is at most
16 maxω0∈0,1
‖Pp,ξ′ω0(Yk ∈ ·)− πp,ξ
′
Λ ‖tv + 16k(ρ+ ε(T )) + 2e−Nλ/4
, (4.2.16)
where ε(T ) and ρ are given by (4.2.1) and (4.2.2), respectively, w.r.t. the blocks
B0, B1, the permissible boundary conditions Υ3K,N , and the initial configuration
ω0 ∈ 0, 1:
ε(T ) = maxω′∈Ω
maxi∈0,1
maxζ∈Υ
Bi3K,N
‖Pζ,Biω′ (XT ∈ ·)− πζBi‖tv ,
ρ = maxk≥1
maxi∈0,1
Pω0
(Yk∂Bi /∈ ΥBi
3K,N
).
We next bound the first term in the right-hand side of Eq. (4.2.16) by the
probability of not coupling the systematic block dynamics chains started from two
arbitrary initial configurations under the grand coupling (cf. Lemma 4.2.16). In
128
the first time step, we try to couple the chains started from ω1, ω2 on
R := J0, NK× J3N ′
5, 4N ′
5K ,
so that in the second step the identity coupling couples them on all of Λ. It suffices
to couple the systematic chains started from ω1 = 0 and ω2 = 1 under the grand
coupling. In order to couple samples from the (0, ξ′) and (1, ξ′) boundary conditions
on R (induced by ω1 = 0 and ω2 = 1 resp.), define the following two sub-blocks of
B0:
Rs := J0, NK× J2N ′
5, 3N ′
5K , Rn := J0, NK× J4N ′
5, N ′K .
By Proposition 3.1.6, monotonicity in boundary conditions, and the FKG inequality,
minηπη,ξ
′
B0
(e?sw
R∗s←→ e?se , e?nw
R∗n←→ e?ne
)& N−2c? .
By the Domain Markov property, and the definition of the boundary modification
ξ′,
π1,ξ′
B0
(ωR
∣∣ e?sw R∗s←→ e?se , e?nw
R∗n←→ e?ne
)d= π0,ξ′
B0
(ωR
∣∣ e?sw R∗s←→ e?se , e?nw
R∗n←→ e?ne
).
As before, using the grand coupling and revealing edges from ∂sRs and ∂nRn until
we reveal a pair of such horizontal dual-crossings, by monotonicity, we can couple
πω1,ξ′
B0and πω2,ξ′
B0on R with probability at least cN−2c? . On that event, the two
chains are coupled in the next step (and thereafter) on all of Λ with probability
1. By the definition of the systematic block dynamics, we conclude that, for some
129
cY > 0 and every k ≥ 2,
maxω0∈0,1
‖Pp,ξ′ω0(Yk ∈ ·)− πp,ξ
′
Λ ‖tv ≤ exp(−cY kN−2c?) .
To bound ρ, first note that, for ω0 ∈ 0, 1, the block B0 has boundary conditions
(0, ξ′) or (1, ξ′), both of which are in Υ3K,N by Remark 4.2.13. Thereafter, the
uniformity of Proposition 3.2.8 in boundary conditions implies that for every η,
πη,ξ′
B0(ω∂B1
/∈ ΞK,N) . N−cΥK ,
and likewise when exchanging B0 and B1. We need to also bound the number of
boundary components intersecting distinct sides of B0 or B1. We can bound the
connections between ∂n,sBi and ∂e,wBi (for i = 0, 1) deterministically by 4K logN as
in the proof of Corollary 4.2.18. In the present setting there could also be (multiple)
open components connecting ∂sBi to ∂nBi in Λ− Bi. By Proposition 3.2.13 and
monotonicity in boundary conditions, for every η,
πη,ξB0(|ΨΛ−B1| ≥ K logN) . N−cΨK ,
where, as in that proposition, |ΨΛ−B1| is the number of distinct vertical crossings
of Λ−B1. By the choices of K0 and K ′0, a union bound yields
ρ . maxηπη,ξ
′
B0(ω∂B1
/∈ Υ3K,N) . N−4c?−4 .
Observe that on their respective translates, B0 and B1 are N × 45N ′ rectangles,
so we can bound maxi maxξ∈Υ3K,Ntξ,Bimix using Theorem 4.2.14; by that theorem,
130
rotational symmetry, and the sub-multiplicativity of dtv, we have that for some
cB(α, q,K) > 0,
ε(T ) . exp(−c−1B TN−cB logN) ,
uniformly over αN ≤ N ′ ≤ α−1N . Altogether, combining the bounds on ρ, ε, and
the systematic block dynamics distance from stationarity, in Eq. (4.2.16), we see
that for
k = N2c?+1 and T = N (cB+1) logN
one has
‖Pp,ξ1 (XtN ∈ ·)− Pp,ξ0 (XtN ∈ ·)‖tv = o(N−3) ,
implying Eq. (4.2.15) and concluding the proof.
Proof of Theorem 3. The theorem is obtained by reducing the mixing time on
the torus to that on a cylinder and then applying Theorem 4.2.19. Fix α ∈ (0, 1] and
consider Λ = Λn,n′ with αn ≤ n′ ≤ α−1n and periodic boundary conditions, denoted
by (p), identified with (Z/nZ)× (Z/n′Z) (the special case n′ = n is formulated in
Theorem 3).
Let c? = c?(α5, 1
4, q) > 0 be given by Proposition 3.1.6 and let cΥ,cΨ, K0 and K ′0
be given as in the proof of Theorem 4.2.19. Define K = K0 +K ′0. We consider the
censored and systematic block dynamics with the block choices,
B0 := J0, nK× J0, n′5K ∪ J0, nK× J2n′
5, n′K and
B1 := J0, nK× J0, 3n′
5K ∪ J0, nK× J4n′
5, n′K ,
131
and boundary class
Υp3K,n :=
ξ : ξ∂n,sΛ = p, ξ∂e,wΛ ∈ Υ3K,n
.
By Theorem 2.2.7, the triangle inequality and Proposition 4.2.8, for every k, T ≥ 0,
‖Pp1(XkT ∈ ·)− Pp0(XkT ∈ ·)‖tv ≤ 2 maxω0∈0,1
‖Ppω0(Yk ∈ ·)− πpΛ‖tv + 2k(ρ+ ε(T )) ,
where ρ and ε(T ) are w.r.t. the class Υp3K,n of permissible boundary conditions. It
suffices, as before, to prove that the right-hand side is o(n−3) and then use (2.2.7)
and the sub-multiplicativity of dtv(t) to obtain the desired result.
Recall the edges e?sw, e?nw, e
?se and e?ne on Λn,n′ . As in the proof of Theorem 4.2.19,
if
Rs := J0, nK× J2n′
5, 3n′
5K , Rn := J0, nK× J4n′
5, n′K ,
then by Proposition 3.1.6 and the FKG inequality, we have
π1,pB0
(e?sw
R∗s←→ e?se , e?nw
R∗n←→ e?ne
)& n−2c? .
Crucially, while no boundary modification was done in this case, the periodic
sides of B0 have no bridges over the four designated edges, and the two horizontal
dual-crossings, from the event above, disconnect its non-periodic sides (∂nB0 and
∂sB0) from ∂nB1 and ∂sB1. Therefore, if that event occurs for the systematic
block dynamics chain started from ω0 = 1, the grand coupling carries it to the
chains started from all other initial states, and yields a coupling of all these chains
132
on J3n5, 4n
5K × J0, n′K ⊃ ∂B1. By definition of the systematic block dynamics and
sub-multiplicativity of dtv(t), for k ≥ 2,
maxω0∈0,1
‖Ppω0(Yk ∈ ·)− πpΛ‖tv ≤ exp(−cY kn−2c?) . (4.2.17)
Observe that at every time step of the systematic block dynamics, the block Bi
(i = 0, 1) is an n× 45n′ rectangle with periodic boundary conditions on ∂e,wBi and
boundary conditions η induced by the chain on ∂n,sBi. By Theorem 4.2.19, for
some c(α, q,K) > 0,
maxi
max(p,η)∈Υp3K,n
t(p,η),Bimix . nc logn ,
and by sub-multiplicativity of dtv(t), we have ε(T ) . exp(−c−1Tnc logn). As in
the proof of Theorem 4.2.19, since the estimate on ρ was uniform in the boundary
conditions, we again have ρ . n−4c?−4 (using Propositions 3.2.8 and 3.2.13). Com-
bining the bounds on ρ and ε with (4.2.17), there exists some c(α, q,K) > 0 such
that
‖Pp1(Xnc logn ∈ ·)− Pp0(Xnc logn ∈ ·)‖tv = o(n−3) ,
as required.
133
Chapter 5
Slow mixing in phase coexistence
regions
In this chapter, we prove Theorems 2 and 4. In particular, we show that as soon
as there are multiple distinct infinite volume Potts or FK measures, their respective
Glauber dynamics on (Z/nZ)2 are exponentially slow. The previous works [12, 13]
had used complicated geometric bottlenecks and the perturbative Pirogov–Sinai
theory [77, 78] to establish exponentially slow mixing for these dynamics in regimes
where one of q or β is sufficiently large on (Z/nZ)d for general d ≥ 2. Our proofs in
this section hold only for d = 2, but use a simple geometric bottleneck consisting of
three vertical and three horizontal loops around the torus, for which we can prove
exponentially slow mixing as soon as there is an exponential decay of connectivities
under π1Z2 (without relying on perturbative techniques).
Let us first sketch the key idea behind the proofs, then prove slow mixing in
phase coexistence regimes for the Potts Glauber dynamics in Section 5.1, and for
Swendsen–Wang and FK Glauber dynamics more generally, in Section 5.2.
134
For completeness, in Section 5.3, we prove that for every q ≥ 2 and β > βc(q),
the Potts Glauber dynamics on Λn with free boundary conditions is slow.
5.0.1 Proof idea
A standard approach to mixing time lower bounds is to construct a bottleneck
set of configurations which has a very small conductance (2.2.14), as this upper
bounds the spectral gap per Cheeger’s inequality (2.2.15). Suppose there are
multiple infinite-volume Gibbs measures, say corresponding to two distinct colors
in the Potts case, and wired and free in the FK case. We establish a bottleneck
in the state space consisting of configurations containing vertical and horizontal
crossings (either of monochromatic vertices, or of open edges) each forming a loop
around the torus. The basic idea is that a combination of a horizontal loop and a
vertical loop in the torus can be translated on the universal cover of the torus to
form a macroscopic wired FK circuit. Escaping such configurations via a pivotal
edge would require a macroscopic dual-crossing inside the circuit, an event which,
for any given circuit, has an exponentially small probability (see Theorem 2.1.2).
Unfortunately, conditioning on the locations of these two loops includes negative
information about the interior of the circuit and prevents us from appealing to the
decay of correlations estimates. If we instead considered two pairs of horizontal and
vertical loops: one could expose the required wired circuit with no information on
its interior. However, a subtler problem then arises, where after exposing one pair
of loops (say the vertical ones), revealing the second (horizontal) pair might leave
the potentially pivotal edge outside of the formed wired circuit, preventing us from
estimating the probability of it being pivotal. It turns out that using three pairs
of horizontal and vertical loops supports a suitable protocol for exposing a wired
135
circuit such that the potential edge is pivotal only if it supports a macroscopic
dual-crossing within that circuit, thus leading to the desired lower bound.
5.1 Potts Glauber dynamics on (Z/nZ)2 in phase
coexistence regions
In this section, we will prove that as soon as there are multiple distinct infinite-
volume Potts measures on Z2, the corresponding Potts Glauber dynamics is expo-
nentially slow on (Z/nZ)2. We use in particular, the fact that for a given (p, q),
there are multiple Z2-measures if and only if π1Z2,p,q has an infinite cluster; in turn,
the planarity of Z2 and results of [5, 34] ensure that whenever π1Z2,p,q has an infinite
cluster, its dual-clusters are almost surely finite and have exponential tails. Namely,
by the dual version of (2.1.1) when q ≥ 1 and p > pc(q), and by Theorem 2.1.2
when q > 4 and p = pc(q), there exists some c(q) > 0 such that
π1Z2
((0, 1
2)∗←→ ∂J−n, nK2
)≤ e−cn . (5.1.1)
The geometric bottleneck described above then uses this property to bound the
conductance of the space.
The argument easily extends to (Z/nZ)× (Z/n′Z) where n′ = bαnc for some
aspect ratio α; moreover, it suffices to prove the theorem for the discrete-time
setting, since any O(n2) cost of moving to continuous time can be absorbed in the
exponential lower bound.
Proof of Theorem 2. In the context of this proof, denote by u! v the existence
of a sequence of sites uiki=1 such that ui is adjacent to ui+1 and with u1 = u and
136
D q
C q
Figure 5.1: Bottleneck for the Potts Glauber dynamics: the box Λ′ is equipartitionedinto three vertical and three horizontal strips. The green region is exposed to revealan outermost open circuit C q of color q (in this case, red). A boundary configurationσ ∈ ∂Siv,q contains a macroscopic dual-crossing in Dq like the dotted black pathextending from the pivotal vertex w to the inner boundary of its vertical strip.
uk = v, with σ(xi) = σ(xi+1) for all i. Define the bottleneck set
S =⋂
i=1,2,3
Siv,q ∩ Sih,q ,
where the constituent events are defined as follows for i = 1, 2, 3:
Siv,q = σ : ∃x such that (x, 0)! (x, n) in J (i−1)n3
, in3K× J0, nK, σ((x, 0)) = q ,
Sih,q = σ : ∃y such that (0, y)! (n, y) in J0, nK× J (i−1)n3
, in3K, σ((0, y)) = q .
We call each of these Potts connections of nontrivial homology on the torus Potts
loops, as the color clusters satisfying Siv,q (resp., Sih,q) will contain a path of homology
(0, 1) (resp., (1, 0)). Define the boundary subset ∂S = σ ∈ S : P (σ, Sc) > 0
where P (σ, ·) is the transition kernel for discrete-time Potts Glauber dynamics.
137
Notice that a configuration σ ∈ S is in ∂S if and only if there exists a vertex
v such that if the value of σv were changed to form σ′, then σ′ would not be in
S. The bound P (σ, σ′) ≤ 1 for all σ, σ′ implies that Q(S, Sc)/µpΛ(S) ≤ µpΛ(∂S | S),
where the superscript p indicate periodic boundary conditions on Λ = Λn. By
spin flip symmetry of the torus, and our definition of Potts loops (namely, the fact
that geometrically, Sih,q−1 ⊂ (Siv,q)c ⊂ Sc), we have µpΛ(Sc) ≥ 1
2. Therefore, the
conductance of the Markov chain satisfies
Φ? ≤Q(S, Sc)
µpΛ(S)µpΛ(Sc)≤ 1
2µpΛ(∂S | S) , (5.1.2)
and it suffices to show that the right-hand side is exponentially decaying in n.
We obtain an exponentially decaying upper bound on µpΛ(∂S | S) by examining
the pivotality of vertices to the event S. For a set S and a configuration σ ∈ S, we
call a vertex v pivotal to S if and only if there is a configuration σ′ /∈ S satisfying
σv 6= σ′v and σw = σ′w for every w 6= v, It is clear that the set
∂S =⋃w∈∂S
⋃i=1,2,3
⋃j∈v,h
σ ∈ S : w is pivotal to Sij,q .
Let us union bound over all vertices in Λ and the six different crossing events:
fix a vertex w ∈ Λ whose pivotality to—without loss of generality—Siv,q we examine.
Clearly, w must be in J (i−1)n3
, in3K× J0, nK. Choose a translate of Λ on the universal
cover by some (mn3, `n
3) to obtain Λ′, as follows:
1. vertically translate such that the horizontal third of Λ that contained w is
now the middle horizontal third of Λ′
2. If w is closer to the left than the right, of the vertical third of Λ containing it,
138
choose a horizontal translate of Λ so that vertical third becomes the left-most
vertical third of Λ′. Otherwise, horizontally translate so that vertical third
becomes the right-most vertical third of Λ′.
When considering instead pivotality to Sih,q, the roles of the two steps are reversed,
so that w will be translated to be in the middle vertical third of Λ′ and either
the top or bottom horizontal thirds. Notice that by periodicity of the boundary
conditions, and the invariance of the events Sij,q under translations by (mn3, `n
3)m,`∈Z,
that
µpΛ(w is pivotal to Sij,q | S
)= µpΛ′
(w is pivotal to Sij,q | S
).
In order to bound the probability of w being pivotal to S1v,q given S, let us reveal
the outermost Potts loops of color q in Λ′ as follows. First expose (sample from the
measure µpΛ′(σ∂Λ′ | S)) all the spin values on ∂Λ′ to reduce the torus to a rectangle
with fixed boundary conditions. Then given those exposed spins, iteratively expose,
starting from ∂Λ′, all spins ?-adjacent to (i.e., Euclidean distance at most√
2 away
from) exposed vertices whose spin value is not q. By construction, this revealing
process will end when we will have revealed the outermost q-colored paths, and
therefore a Potts circuit, C q, of spin value q, and nothing interior to it (such a
circuit exists since we are working conditionally on S).
Call Dq the set of all vertices in C q or “interior to it” (i.e., in the connected
subgraph of Λ′ \ C q containing the center of Λ′). By the domain Markov property,
the distribution on the remaining vertices in Dq is given by µσCq=qDq (· | S). Let us
now move to the FK representation of the model on Dq; the FK representation of
the has fully wired boundary conditions on C q, and therefore the event FσCq is
139
trivially satisfied. Refer to Figure 5.1 for a visualization.
In order for w to be pivotal to Siv,q it must be the case that w ∈ C q, so we
suppose that w ∈ C q. Also, we claim that, there must be a dual-crossing from one
of the interior dual-neighbors of w (an edge whose center is distance√
52
from v) to
one of n3× J0, nK or 2n
3× J0, nK in Λ′ (the choice depends on which translate Λ′
is chosen), and the dual-crossing must be contained completely within Dq. Indeed,
suppose there were no such dual-crossing. Then, by planarity, there would be a
primal FK connection in the same vertical third connecting the two neighbors of
w in C q. By the Edwards–Sokal coupling and the definition of Potts connections,
such an FK connection would translate to a new Potts connection of color q that
does not use the vertex w. The new vertical Potts crossing is still a vertical loop
because of the exposed horizontal crossings. in the circuit C q.
At the same time, by construction, the inner boundary of the vertical strip
containing w is at a graph distance at least n/6 from w. Since the event FσCq is
trivially satisfied, by monotonicity in boundary conditions, and the FKG inequality
(the FK representation of S is an increasing event and dual-crossings are decreasing),
we have uniformly over C q,
µpΛ′(w is pivotal to Siv,q | C q, S) ≤ πpΛ′( ⋃e:d(e,w)=
√5/2
e∗←→ n
3, 2n
3 × J0, nK | C q, S
)≤ 4π1
Z2
((0, 1
2)∗←→ ∂J−n
6, n
6K2).
In turn, by (5.1.1), this right-hand side is bounded above by e−cn for some c =
c(β, q) > 0 as soon as either q ≥ 1 and p > pc(q) or q > 4 and p = pc(q). Combining
140
this with (2.2.15) with (5.1.2), we see that
gapp ≤ µpΛ(∂S | S) ≤ n2e−cn ,
for every β, q such that the Potts model has distinct Z2-measures.
5.2 FK dynamics on (Z/nZ)2 at a discontinuous
phase transition
We now translate the proof in Section 5.1 to the FK Glauber dynamics; the idea
of the proof is essentially the same—we work directly with FK crossings rather than
loops of a fixed Potts color. It again suffices to prove the desired for the discrete-
time dynamics, as any polynomial in n cost of translating to continuous-time is
easily absorbed into the exponential slowdown.
Proof of Theorem 2. Using (5.1.1), we will establish a bottleneck set S for the
random cluster model on Λ with periodic boundary conditions. Define the bottleneck
event
S =⋂
i=1,2,3
Siv ∩ Sih
where the constituent events are defined as follows for i = 1, 2, 3:
Siv :=ω : ∃x such that (x, 0)←→ (x, n′) in J (i−1)n
3, in
3K× J0, n′K
,
Sih :=ω : ∃y such that (0, y)←→ (n, y) in J0, nK× J (i−1)n′
3, in′
3K.
The crossings in the above events will contain paths on the torus of homology
141
class (1, 0) and (0, 1), which we call horizontal and vertical loops, respectively. We
aim to get an exponentially decaying upper bound on πpΛ(∂S | S) (the superscript
p denotes periodic boundary conditions on Λ), where the boundary subset ∂S =
ω ∈ S : P (ω, Sc) > 0 is the event that there exists an edge e that is pivotal to S.
Specifically, we have
∂S =⋃
e∈E(Λ)
⋃i=1,2
⋃j∈v,h
ω ∈ S : ω − e /∈ Sij ,
in which configurations ω are identified with their edge-sets.
Via the symmetry of periodic FK boundary conditions, RSW-type estimates on
the torus (Z/nZ) × (Z/bαncZ) were proved in [5] for all q ≥ 1 at p = pc(q). By
[5, Theorem 5], therefore, there exists some ρ(α, q) > 0 such that πpΛ(C∗v(Jn3 ,2n3
K×
Jn′3, 2n′
3K) ≥ ρ, and since C∗v(Jn3 ,
2n3
K× Jn′3, 2n′
3K) ⊂ Sc, we get
Q(S, Sc)
πpΛ(S)πpΛ(Sc)≤ Q(S, Sc)
πpΛ(S)ρ≤ ρ−1πpΛ(∂S | S) .
Combining this with Eq. (2.2.15), it suffices to prove that there exist constants
c1 = c1(α, q) > 0 and c2 = c2(α, q) > 0 such that,
πpΛ(∂S | S) ≤ c1e−c2n . (5.2.1)
to obtain the desired bound on the inverse spectral gap. A union bound implies
πpΛ(∂S | S) ≤∑
e∈E(Λ)
∑i=1,2
∑j∈v,h
πpΛ(ω − e /∈ S | ω ∈ S) .
Without loss of generality, examine the probability that e is pivotal to Siv. The
142
D
C
Figure 5.2: The lower bound construction for the FK Glauber dynamics: the boxΛ′ is equipartitioned into three vertical and three horizontal strips. The purplecrossings form an outermost open circuit C . A boundary configuration ω ∈ ∂Scontains a macroscopic dual-crossing like the red path extending from the pivotaledge e to the boundary of the vertical strip.
other cases can be treated analogously. Fix an edge e in J0, n3K× J0, n′K and consider
its horizontal coordinate (in Fig. 5.2, e is the edge of intersection of the purple and
red paths).
Now move to a translate by (mn3, `n
′
3)m,`∈Z of Λ on the universal cover of the
torus, which we call Λ′ = Λ′n,n′ . Choose a translate such that:
1. The horizontal third of Λ that contained e, is now the middle horizontal third
of Λ′.
2. If e was closer to the left of its vertical third than the right, that strip is the
left vertical third of Λ′. Otherwise, that strip is the right vertical third of Λ′.
As before, by periodicity of boundary conditions and invariance of Sij under
translations by (mn3, `n
′
3)m,`∈Z, it suffices to bound the probability that e is pivotal
143
to the event S in Λ′. Begin by exposing all the edges E(∂Λ′) so as to fix a boundary
condition and move from a torus to a rectangle with some fixed boundary condition.
Then expose the outermost circuit C in Λ′ as follows (similarly to steps (1)–(4)).
First expose the left-most vertical crossing of Λ′ by revealing the dual component
of ∂wΛ′: by construction, its adjacent primal edges will form the left-most vertical
crossing of Λ′ and we will not have revealed any edges to its right. Repeating this
procedure on the n,e, s sides reveals an outermost circuit C without exposing any
of its interior edges (see Fig. 5.2 where the shaded region consists of the edges we
reveal through this process).
If e is not an open edge in one of the vertical crossings we have exposed, then
certainly e is not pivotal to the event S. So suppose e is an open edge in a vertical
loop and—by construction—also an open edge in C . Denote by D the subgraph of
Λ′ whose edge values have not yet been exposed, i.e., all edges interior to C .
It is a necessary condition for e to be pivotal to S that there exists a dual path
from the dual-vertex in D incident to the dual-edge e∗ to the vertical boundary of
the vertical third containing e in Λ′, i.e., to n3, 2n
3 × J0, n′K. If there is no such
dual-path, then there is a different primal vertical crossing of that strip which does
not contain e, and that crossing—because of the exposed horizontal loops—is itself
a loop of homology class (0, 1) and satisfies S1v .
But observe that n3, 2n
3× J0, n′K is a graph distance at least n/6 from e. Thus,
by the Domain Markov property and FKG inequality, for every possible circuit C
revealed,
πpΛ′(ωD ∈ · | S) = π1C∪D(· | S) π1
C∪D π1Z2(ωD ∈ ·) .
By (5.1.1), for every q > 4, if p = pc(q), there exists an absolute c(q) > 0 such
144
that, for any fixed e,
πpΛ(ω − e /∈ Siv | ω ∈ S) ≤ e−cn .
If e were pivotal to Sih the analogous claim would hold with probability less than
e−cn′/6. Summing over all six crossing events and summing over all edges e we
conclude that πpΛ(∂S | S) ≤ 12αn2e−cαn, implying Eq. (5.2.1) as desired.
5.3 Potts Glauber dynamics with free boundary
at β > βc(q)
We conclude the section with a short proof that the mixing time of the Potts
Glauber dynamics on Λn with free boundary conditions is exponentially slow at
β > βc(q). (N.b. unlike the periodic boundary conditions, this slow mixing should
not extend to the critical point β = βc(q), even for q > 4, where the mixing time
should be polynomial: see Problem 1.5.5.) The proof here uses a similar bottleneck
set and revealing procedure to that in Section 5.1 based on vertical and horizontal
crossings of Λn but is substantially simplified by the uniqueness of the FK measure
when p > pc(q).
Theorem 5.3.1. Let q ≥ 2 and β > βc(q). There exist c(β, q), c′(β, q) > 0 such
that Potts Glauber dynamics on Λn with free boundary conditions satisfies
c ≤ 1
nlog gap−1
p ≤ c′ .
Proof. Recall the notation u! v indicating a sequence of monochromatic edges
145
(the endpoints of the edges have the same state) connecting vertices u to v, and
define the bottleneck set S = Sv,q ∩ Sh,q where
Sv,q = σ : ∃u ∈ ∂sΛn, v ∈ ∂nΛn such that u! v, σ(u) = q ,
Sh,q = σ : ∃u ∈ ∂eΛn, v ∈ ∂wΛn such that u! v, σ(u) = q .
Recall that the boundary subset ∂S is the set σ ∈ S : P (σ, Sc) > 0 (where
P (σ, ·) is the transition kernel for discrete-time Potts Glauber dynamics. Then
a configuration σ ∈ S is in ∂S if and only if there exists a pivotal vertex w, i.e.,
if σw were changed, the resulting configuration would not be in S. By the fact
that Sh,q−1 ⊂ Scv,q ⊂ Sc, we have µΛ(Sc) ≥ 12. As in (5.1.2), it therefore suffices to
obtain an exponentially decaying upper bound on µΛ(∂S | S).
As before, let us union bound over the n2 possible vertices w ∈ Λn and whether
they are pivotal to Sv,q or Sh,q. Let us fix any w ∈ Λn and consider the event w
is pivotal to Sv,q; the bound on pivotality of w ∈ Λn to Sh,q follows analogously
(consider the rotation by π2
of Λn). Suppose w is in the right-half of Λn, i.e.,
w ∈ Jdn2e, nK × J0, nK (resp., left-half w ∈ J0, bn
2cK × J0, nK) reveal the right-most
(resp., left-most) vertical Potts crossing of color q in Λ as in the proof of Theorem 2.
Namely, in the case that we are revealing the right-most vertical crossing, iteratively
expose, starting from ∂eΛ, the states of vertices ?-adjacent to exposed vertices
whose state is not q. This revealing process will end when we have revealed the
right-most q-colored vertical crossing (such a crossing exists since we are doing
this revealing under the conditional measure µΛ(· | S)). Call this vertical q-colored
crossing C qv and call Dq
v the set of all vertices in C qv along with all those whose
value have not yet been exposed (those to the left of C qv ). By the domain Markov
146
property, the distribution on the remaining vertices in Dq is given by µσCq
v=q
Dqv
whose
FK representation is wired on C qv and free on ∂Λn ∩ ∂Dq
v . Since this FK boundary
condition has one wired boundary component (corresponding to C qv ), the event
FσCqv
is trivially satisfied.
In order for w to have been pivotal to Sv,q it must be the case that w ∈ C qv and
moreover, that one of the two dual-vertices adjacent to w and interior to Dqv are
dual-connected to ∂wΛn. Otherwise, by planar duality, ∂nΛn would be connected to
∂sΛn by an open path that intersects C qv and does not use w. By monotonicity
in boundary conditions and the FKG property, we therefore have
µΛn(w is pivotal to Sv,q | S) ≤ π0Λn
( ⋃e:d(e,w)=
√5/2
e∗←→ ∂wΛn
)
which is at most 4ne−cd(∂wΛ,w) ≤ 4ne−cn/2 for some c(p, q) > 0 by the exponential
decay of dual-connectivities at p > pc(q) (the dual of (2.1.1)).
Summing over all w ∈ Λn and the vertical and horizontal crossing events, we
deduce that
µΛn(∂S | S) ≤ µΛn(∃w : w is pivotal to S | S) ≤ e−cn
for some c(p, q) > 0, concluding the proof.
147
Chapter 6
FK boundary conditions in
uniqueness regions
In Chapters 4–5, we analyzed the mixing times of Potts and FK Glauber
dynamics on the torus (Z/nZ)2. As we saw, the bounds at a continuous phase
transition point—where there is a unique infinite-volume measure—in fact applied
much more broadly, to boxes with arbitrary boundary conditions in the Potts case,
and “typical” boundary conditions in the FK case. When there is no uniqueness
of infinite-volume measure, mixing times are very sensitive to choice of boundary
condition, as is well-studied in the low-temperature Ising model, and explored
further for the Potts and FK models in Chapters 7–8. Before that, let us first
investigate the effect that FK boundary conditions can have on mixing times in
the off-critical uniqueness regimes of p 6= pc(q).
For the Ising and Potts models, at high-temperatures, where correlations die
out exponentially fast even adjacent to the boundary conditions—a property known
as strong spatial mixing (SSM)—it is well known that the mixing time of Ising and
148
Potts Glauber dynamics are fast, independently of the boundary conditions [2, 69].
In the FK model, the analogous strong spatial mixing property does not hold near
the boundary even for “typical” FK boundary conditions. As such, the work of [10]
proving fast mixing for the FK dynamics when p 6= pc(q) only held on the select
FK boundary conditions for which SSM does hold, namely periodic, wired, and
free. In this chapter, we analyze the stability of those mixing time bounds with
respect to changes of FK boundary condition, proving Theorems 5–6.
This chapter is organized as follows. In Section 6.0.1, we introduce our general
framework to deduce mixing time estimates on Λn from spatial and local mixing
properties. The key ingredient for us is a mixing time bound for thin rectangles
(Theorem 6.1.1) with arbitrary realizable boundary conditions, presented in Sec-
tion 6.1. This proof uses a novel block-dynamics approach in the presence of FK
boundary conditions, recursing not over rectangles but rather on collections of
rectangles dictated by the boundary conditions at each scale (see Section 6.1.1 for
a proof sketch). We are then in position to complete the proof of Theorem 5 in
Section 6.2, showing that when p 6= pc(q), uniformly over all realizable boundary
conditions ξn, the mixing time on Λξnn is at most polynomial in n. This is boosted
to nearly-optimal mixing time for typical boundaries (Theorem 6) in Section 6.3.
We also complement these with two results of a different flavor. As mentioned in
Remark 2.2.11, the traditional canonical paths bound on thin rectangles (exponential
in the shorter side length) does not apply in the presence of FK boundary conditions
as they play the role of distorting the underlying graph; we are nonetheless able to
prove a bound of a similar flavor under realizable boundary conditions in Section 6.4.
Finally, we prove the slow mixing result for worst-case (non-realizable) boundary
conditions in (Theorem 7) in Section 6.5.
149
For ease of exposition, we will prove everything in this chapter for the discrete-
time FK Glauber dynamics—the inverse spectral gap and mixing time of the
continuous-time dynamics are seen to be an order n2 faster by standard comparison
inequalities.
6.0.1 Mixing time upper bounds: a general framework
In this section we introduce a general framework for bounding the mixing time
of the FK-dynamics on Λn = (Λn, E(Λn)) by its mixing times on certain subsets.
In [10] it was shown that a strong form of spatial mixing (encoding exponential
decay of correlations uniformly over subsets of Λn) implies optimal mixing of the
FK-dynamics. However, this notion, known as strong spatial mixing (SSM) and
described in Remark 6.0.2, does not hold for most boundary conditions for which
fast mixing of the FK-dynamics is still expected. To circumvent this, we introduce
a weaker notion, which we call moderate spatial mixing (MSM).
Notation. We introduce some notation first. For a set R ⊆ Λn, let E(R) ⊆ En be
the set of edges of E(Λn) with both endpoints in R. We will denote by Rc the vertex
set Λn \R and by Ec(R) the edge-complement of R; i.e., Ec(R) := E(Λn) \ E(R).
For a configuration ω : E(Λn)→ 0, 1, we will use ω(R), or alternatively ω(E(R)),
for the configuration of ω on E(R). With a slight abuse of notation, for an edge
set F ⊆ E(Λn), we use F = ω for the event that the configuration on F is given
by ω; when ω is the all free or the all wired configuration, we simply use F = 0
and F = 1, respectively.
Definition 6.0.1. Let ξ be a boundary condition for Λn = (Λn, E(Λn)) and let
B = B1, B2, . . . , Bk be a collection of subsets of Λn. We say that moderate spatial
150
mixing (MSM) holds on Λn for ξ, B and δ > 0 if for all e ∈ E(Λn), there exists
Bj ∈ B such that
∣∣∣πξΛn,p,q( e = 1 | Ec(Bj) = 1 )− πξΛn,p,q( e = 1 | Ec(Bj) = 0 )∣∣∣ ≤ δ . (6.0.1)
In words, MSM holds for B if for every edge e ∈ E(Λn) we can find Bj such that
e ∈ E(Bj) and the “influence” of the configuration on Ec(Bj) on the state of e is
bounded by δ.
Remark 6.0.2. SSM as defined in [10] holds when MSM holds for a specific
sequence of collections of subsets: if Br is the set of subsets containing all the
square boxes of side length 2r centered at each e ∈ E(Λn) (intersected with E(Λn)),
then SSM holds if MSM holds for Br for every r ≥ 1 with δ = exp(−Ω(r)).
MSM does not capture the fast mixing of the FK-dynamics the way SSM
does. Namely, it is easy to find collections of subsets for which MSM holds for all
boundary conditions, including those boundary conditions for which we later prove
slow mixing; see Theorem 7. However, if, for a collection B = B1, B2, . . . , Bk,
we also bound the mixing time of the FK-dynamics on every Bj, we can deduce a
mixing time bound for the FK-dynamics on Λn. Let tmix(Bτ ) denote the mixing
time of the FK-dynamics on the subset B ⊆ Λn with boundary condition τ . (Recall
that τ corresponds to a partition of ∂B and that ∂B consists of those vertices in B
that are adjacent to vertices in Z2 \B.)
Definition 6.0.3. Let ξ be a boundary condition for Λn = (Λn, E(Λn)) and let
B = B1, B2, . . . , Bk with Bj ⊂ Λn. We say that local mixing (LM) holds for B
151
and T > 0, if
tmix(B
(1,ξ)j
)≤ T and tmix
(B
(0,ξ)j
)≤ T for all j = 1, ..., k
where (1, ξ) (resp., (0, ξ)) denotes the boundary condition on Bj induced by the
event Ec(Bj) = 1 (resp. Ec(Bj) = 0) and the boundary condition ξ.
Remark 6.0.4. Observe that when Bj ∩ ∂Λn = ∅, (1, ξ) and (0, ξ) are simply the
wired and free boundary condition on Bj, respectively. When Bj ∩ ∂Λn 6= ∅, the
connectivities from ξ could also induce some connections in (1, ξ) and (0, ξ).
Our next theorem, roughly speaking, establishes the following implication:
MSM + LM =⇒ upper bound for mixing time of FK-dynamics,
with the quality of the bound depending on the T for which LM holds. A similar
(and inspiring) implication for the Glauber dynamics of the Ising model in graphs
of bounded degree was established by Mossel and Sly in [73]; there, the notion of
MSM is replaced by a form of spatial mixing which is stronger than SSM.
Theorem 6.0.5. Let ξ be a boundary condition on Λn = (Λn, E(Λn)) and let
B = B1, B2, . . . , Bk with Bj ⊂ Λn for all j = 1, . . . , k. If for ξ and B, moderate
spatial mixing holds for some δ ≤ 1/(12|E(Λn)|) and local mixing holds for some
T > 0, then tmix(Λξn) = O(Tn2 log n).
Proof. Consider two copies Xt, Yt of the FK-dynamics. We couple the evo-
lution of Xt and Yt by using the same random e ∈ E(Λn) and the same
uniform random number r ∈ [0, 1] to decide whether to add or remove e from the
configurations. This is a standard coupling of the steps of the FK-dynamics (see,
152
e.g., [10, 47]); we call it the identity coupling. It is straightforward to verify that,
when q ≥ 1, the identity coupling is a monotone coupling, in the sense that if
Xt ⊆ Yt then Xt+1 ⊆ Yt+1 with probability 1.
We bound the coupling time Tcoup of the identity coupling for the FK-dynamics.
The result then follows from the fact that tmix ≤ Tcoup. Since the identity coupling
can be extended to a simultaneous coupling of all configurations that preserves
the partial order ⊆, the coupling time starting from any pair of configurations is
bounded by the coupling time for initial configurations Y0 = ∅ and X0 = E(Λn).
We prove that there exists T = O(Tn2 log n) such that the identity coupling P
satisfies
P[XT (e) 6= YT (e)] ≤ 1
4|E(Λn)|
for every e ∈ E(Λn). A union bound over e ∈ E(Λn) then implies that Tcoup ≤ T .
Fix any edge e ∈ E(Λn) and let Bj ∈ B be a subset for which (6.0.1) is
satisfied (such a subset exists since moderate spatial mixing holds). To bound
P[XT (e) 6= YT (e) ], we introduce two additional instances Z+t , Z−t of the FK-
dynamics. These two copies update only edges with both endpoints in Bj and
reject all other updates. We set Z+0 = E(Λn) and Z−0 = ∅. The four Markov chains
Xt, Yt, Z+t and Z−t are coupled with the identity coupling, with updates
outside Bj ignored by both Z+t and Z−t . The monotonicity of this coupling,
along with a triangle inequality, implies that for all t ≥ 0,
P[Xt(e) 6= Yt(e) ] ≤∣∣∣P[Z+
t (e) = 1 ]− πξΛn( e = 1 |Ec(Bj) = 1 )∣∣∣ (6.0.2)
+∣∣∣πξΛn( e = 1 |Ec(Bj) = 1 )− πξΛn( e = 1 |Ec(Bj) = 0 )
∣∣∣+∣∣∣πξΛn( e = 1 |Ec(Bj) = 0 )− P[Z−t (e) = 1 ]
∣∣∣ . (6.0.3)
153
Observe that the restrictions of the chains Z+t and Z−t to Bj are lazy ver-
sions of FK-dynamics on Bj with stationary measures πξΛn( · |Ec(Bj) = 1 ) and
πξΛn( · |Ec(Bj) = 0) , respectively. The laziness comes from the fact that they only
accept updates that are in Bj , and by the local mixing assumption, once T updates
have been done in Bj, the chains Z+t and Z−t will be mixed.
Now, after T = CT |E(Λn)| log2d24|E(Λn)|e steps, the expected number of
updates in Bj is
CT |E(Λn)| log2d24|E(Λn)|e |E(Bj)||E(Λn)|
≥ CT log2d24|E(Λn)|e .
Let AT be the event that the number of updates in Bj after T steps is at least
T log2d24|E(Λn)|e. A Chernoff bound then implies that, for a large enough constant
C > 0,
Pr[AcT
] ≤ 1
24|E(Λn)|.
Therefore,
∣∣∣Pr[Z+
T(e) = 1 | AT ]− Pr[Z+
T(e) = 1]
∣∣∣ ≤ Pr[AcT
] ≤ 1
24|E(Λn)|. (6.0.4)
By the local mixing property, and the fact that tmix(ε) ≤ dlog2 ε−1e · tmix for
any positive ε < 1/2, we have
∣∣∣Pr[Z+
T(e) = 1 | AT ]− πξΛn( e = 1 |Ec(Bj) = 1 )
∣∣∣ ≤ 1
24|E(Λn)|. (6.0.5)
It then follows from (6.0.4), (6.0.5) and the triangle inequality that when t = T
the right-hand side in (6.0.2) is at most 112|E(Λn)| . The same bound can be deduced
for (6.0.3) in a similar manner.
154
Finally, since moderate spatial mixing holds for some δ < 112|E(Λn)| , we have
∣∣∣πξΛn( e = 1 |Ec(Bj) = 1 )− πξΛn( e = 1 |Ec(Bj) = 0 )∣∣∣ ≤ 1
12|E(Λn)|;
see Definition 6.0.1. Putting these together we see that P[XT (e) 6= YT (e) ] ≤ 14|E(Λn)|
as desired.
6.1 Fast mixing on thin rectangles
The main difficulty in proving Theorem 5 using the general framework from
Section 6.0.1 is obtaining mixing time estimates on thin rectangles of dimension
Θ(n)×Θ(log n) with realizable boundary conditions. To motivate this, we notice
that since p < pc(q), the influence of the boundary is lost, with high probability,
at a distance Θ(log n). Thus the main difficulty will be to bound the mixing time
of the FK-dynamics in the annulus of width Θ(log n) with realizable boundary
conditions on the outside. As such, the key ingredient in the proof of Theorem 5
will be the following mixing time bound on thin rectangles.
As before, for an n × l rectangle Λn,l = J0, nK × J0, lK, we use ∂nΛn,l, ∂eΛn,l,
∂sΛn,l and ∂wΛn,l for its north, east, south and west boundaries respectively.
Theorem 6.1.1. Consider Λn,l = (Λn,l, E(Λn,l)) for l ≤ n with an arbitrary
realizable boundary condition ξ that is either free or wired on ∂eΛn,l∪∂wΛn,l∪∂sΛn,l.
Then, for every q > 1 and p 6= pc(q), the mixing time of the FK-dynamics on Λn,l
is at most exp(O(l + log n)).
Observe that when l = O(log n), this implies the mixing time is nO(1), which will
be the setting of interest in our proofs. Moreover, we note that it suffices for us to
155
Bw Be
(a)
R1R2 R2
(b)
Figure 6.1: (a) A boundary condition for which no configuration in Bw∩Be isolatesBw \Be from Be \Bw. (b) A boundary condition ξ where every pair of overlappingrectangles (as in (6.1.1)) must interact through ξ; however, the two groups ofrectangles R1, R2 do not interact through ξ.
prove Theorem 6.1.1 for the set of realizable boundary conditions ξ that are free on
∂eΛn,l ∪ ∂wΛn,l ∪ ∂sΛn,l and all p 6= pc(q), as the set of boundary conditions dual
to these are exactly the set of realizable boundary conditions that are wired on
∂eΛn,l ∪ ∂wΛn,l ∪ ∂sΛn,l; see Remark 2.2.1.
In Section 6.1.1, we give a overview of the main ideas in the proof of this theorem.
In Sections 6.1.2–6.1.4, we introduce some crucial notions regarding groups of
rectangular subsets of Λn,l and their relations with the boundary conditions ξ.
Sections 6.1.5–6.1.6 bound the mixing time of the block dynamics with respect to
a suitably-chosen set of subsets of Λn,l. The recursive proof of Theorem 6.1.1 is
then completed in Section 6.1.7.
6.1.1 Outline of proof of Theorem 6.1.1
We first mention some obstructions that boundary conditions present to proving
Theorem 6.1.1 using approaches that are common in analogous problems for spin
systems. A traditional approach to proving mixing time bounds for thin rectangles
is the canonical paths method ([53, 66, 67, 86]), which gives an upper bound that
is exponential in the shorter side length (see Lemma 2.2.10); however, boundary
conditions can significantly distort the augmented graph with external wirings,
156
preventing this approach from succeeding. (Though this is not the approach we take,
we are able to obtain a weaker canonical paths bound using a tree-like structure of
realizable boundary conditions that holds for all p, q in Section 6.4).
A sharper approach would be to use an inductive scheme (e.g., as in [18, 64, 67]
as well as our Chapter 4), whereby, we bound the mixing time of the FK Glauber
dynamics on Λn,l by the mixing times in smaller rectangular blocks, e.g.,
Bw = J0, 23nK× J0, lK and Be = J1
3n, nK× J0, lK . (6.1.1)
This method requires bounding by the mixing time of the so-called block dynamics
as defined for the Potts Glauber dynamics in Section 2.2.2 and for the FK Glauber
dynamics in Section 6.1.6.
The mixing time of the FK Glauber dynamics on Λn,l is bounded by the mixing
time of the block dynamics times the worst mixing time of the FK dynamics in any
Bi with worst-case configuration on Ec(Bi); see Theorem 6.1.18. With the choice
of blocks in (6.1.1), applying this recursively, one would bound the mixing time
of the FK dynamics on Λn,l by the mixing time of the block dynamics raised to a
Θ(log n) power. Hence, establishing Theorem 6.1.1 would require an O(1) upper
bound on the mixing time of the block dynamics.
As in Chapter 4, the mixing time and spectral gap of the block dynamics is
typically bounded by showing that after the first block update in either Bw or Be,
the configuration in Bw ∩Be will be such that it disconnects the influence of the
configuration on Bw \Be from Be \Bw with probability Ω(1); this then allows a
standard coupling argument to be used to bound the mixing time. In the presence
of long-range boundary connections, however, it could be that no configuration on
157
Bw ∩Be would disconnect the two sides from one another and facilitate coupling;
see Figure 6.1(a) for such an example. As such, our choices of blocks will depend
on the boundary conditions, and be chosen to allow for the block dynamics to
couple in O(1) time, while ensuring that the blocks are still at most a fraction of
the size of the original rectangle, so that after O(log n) recursive steps we arrive at
a sufficiently small base scale.
In particular, we will show that for every realizable boundary condition ξ on
Λn,l, there exists a choice of two blocks, whose widths are at most 45n, such that
they are sufficiently isolated from one another in ξ. As Figure 6.1(b) demonstrates,
there are realizable boundary conditions that would force these blocks to not be
single rectangles, as in (6.1.1), but rather collections of rectangular subsets of Λn,l.
Thus, our recursive argument will proceed instead on groups of rectangles,
R =⋃
Ri, where Ri = Jai, biK× J0, lK ⊂ Λn,l
are disjoint, with boundary conditions induced by ξ and the configuration of the
chain on E(Λn,l) \ E(R).
We formally define groups of rectangles (Definition 6.1.6), and their boundary
conditions in Section 6.1.3. We consider next the notion of compatibility of a
group of rectangles R ⊂ Λn,l with the boundary condition ξ. Roughly speaking,
we say a group of rectangles R is compatible with ξ if ξ limits the boundary
interactions between the rectangles of R for every possible configuration on Λn,l \R;
see Definition 6.1.8. In Section 6.1.5, we provide an algorithm (see Lemma 6.1.10)
that, for a group of rectangles R compatible with ξ, finds two suitable subsets
for the block dynamics: these subsets will each be group of rectangles compatible
158
with ξ, of width between 1/5 and 4/5 of the width of R. This will allow us to
induct on groups of rectangles compatible with ξ, while ensuring that number of
recursive steps is O(log n). Specifically, our splitting algorithm will find interior
and exterior subsets Aint and Aext with no boundary connections between the two;
see Figure 6.3(a). These will be the cores of the two blocks for R, but in order to
bound the coupling time of the block dynamics, we want the two blocks to overlap,
and therefore we enlarge Aint and Aext by m = Θ(log l) to form the blocks Rint
and Rext for the block dynamics; see Figure 6.3(b).
Finally, in Section 6.1.6, we bound the coupling time of this block dynamics by
some sufficiently large constant to conclude the proof. This follows by leveraging
the fact that p < pc (p > pc) to condition on the existence of certain disconnecting
dual (primal) paths in Rint ∩Rext that have Ω(1) probability.
We emphasize that in order to push through this recursion, is will be crucial
that our notion of compatibility with ξ is strong enough to yield a uniform bound
on this block dynamics coupling time, while being broad enough that the splitting
algorithm always succeeds in finding sub-blocks that are themselves groups of
rectangles compatible with ξ.
6.1.2 Disconnecting intervals
In this section, we introduce the notion of disconnecting intervals, one of the
building blocks of our recursive proof of polynomial mixing. Recall that we use
∂nΛn,l, ∂eΛn,l, ∂sΛn,l and ∂wΛn,l for the north, east, south and west boundaries of
the rectangle Λn,l, respectively.
Definition 6.1.2. For a realizable boundary condition ξ on Λn,l that is free on
∂eΛn,l ∪ ∂sΛn,l ∪ ∂wΛn,l, an interval Ja, bK ⊂ J0, nK is called disconnecting of
159
a0 a1 a2 a3 a4 a5 a6 a7 a8 a9 a10
Figure 6.2: The rectangle Λn,l with a boundary condition ξ inducing disconnectingintervals. For example, Ja1, a4K, Ja3, a4K and Ja7, a10K are disconnecting intervalsof free-type; Ja1, a2K, Ja0, a6K, Ja7, a8K and Ja9, a10K are of free-wired-type; Ja0, a5Kand Ja5, a6K are of wired-type.
1. free-type: if there are no boundary connections in ξ between ∂Λn,l∩(Ja, bK×l)
and ∂Λn,l ∩ (Ja, bKc × J0, lK)
2. wired-type: if there is a boundary component in ξ that contains the vertices
(a, l) and (b, l), as well as possibly other vertices in ∂Λn,l.
Observe that an interval can be both of free-type and of wired-type if (a, l) and
(b, l) are connected through ξ but are not connected to any boundary vertex in
Ja, bKc × J0, lK; in this case, we may refer to the interval as being of free-wired-type;
see Figure 6.2 for several examples.
The following properties concerning the union and intersection of disconnecting
intervals will be crucial to our proofs.
Lemma 6.1.3. Let ξ be a realizable boundary condition on Λn,l that is free on
∂sΛn,l ∪ ∂eΛn,l ∪ ∂wΛn,l and let a < b < c. If both Ja, bK and Jb, cK are disconnecting
intervals of wired-type, then so is Ja, cK. If both Ja, bK and Jb+1, cK are disconnecting
intervals of free-type, then so is Ja, cK.
Proof. If Ja, bK and Jb, cK are disconnecting intervals of wired-type, then by definition
the vertices (a, l), (b, l) and (c, l) are all in the same component of ξ; hence Ja, cK
is a disconnecting interval of wired-type. If Ja, bK and Jb+ 1, cK are disconnecting
intervals of free-type, then by definition there are no connections in ξ between Ja, bK
160
and Ja, bKc ⊃ Ja, cKc or between Jb + 1, cK and Jb + 1, cKc ⊃ Ja, cKc. Consequently,
there are not connections in ξ between Ja, cK = Ja, bK ∪ Jb + 1, cK and Ja, cKc, and
Ja, cK is thus a disconnecting interval of free-type.
Lemma 6.1.4. Let ξ be a realizable boundary condition on Λn,l that is free on
∂sΛn,l ∪ ∂eΛn,l ∪ ∂wΛn,l. Suppose there exist a < b ≤ c < d such that Ja, cK and
Jb, dK are disconnecting intervals. Then either both Ja, cK and Jb, dK are of free-type
or both are of wired-type.
Proof. Suppose by way of contradiction and without loss of generality that Ja, cK is
only of free-type (i.e., free-type but not free-wired-type) and Jb, dK is of wired-type.
By definition, there exists a component of ξ that contains both (b, l) and (d, l).
Since a < b ≤ c < d, there is therefore a connection in ξ between Ja, cK× l and
Ja, cKc × l. Hence, Ja, cK cannot be a disconnecting interval of free-type yielding
the desired contradiction.
Lemma 6.1.5. Let ξ be a realizable boundary condition on Λn,l that is free on
∂sΛn,l ∪ ∂eΛn,l ∪ ∂wΛn,l. Suppose there exist a < b < c < d such that Ja, cK and
Jb, dK are disconnecting intervals.
1. If Ja, cK and Jb, dK are both of wired-type, then Ja, bK, Jb, cK, Jc, dK and Ja, dK
are all disconnecting intervals of wired-type.
2. If Ja, cK and Jb, dK are both of free-type, then Ja, b− 1K, Jb, cK, Jc+ 1, dK and
Ja, dK are all disconnecting intervals of free-type.
Proof. For the first part, suppose that both Ja, cK and Jb, dK are disconnecting
intervals of wired-type. By definition, the vertices (a, l) and (c, l) are in the same
component of ξ, as are (b, l) and (d, l). By the planarity of realizable boundary
161
conditions, it must be the case that these two components of ξ are indeed the same,
and therefore (a, l), (b, l), (c, l), (d, l) are all in the same boundary component of ξ.
As such, Ja, bK, Jb, cK, Jc, dK and Ja, dK are all disconnecting intervals of wired-type.
For the second part, suppose first by way of contradiction that Ja, b− 1K is not a
disconnecting interval of free-type. Then there must be a boundary component in ξ
with vertices in Ja, b− 1K×l and Ja, b− 1Kc×l. If it is a component containing
a vertex in Ja, cKc × l, it would violate the fact that Ja, cK is disconnecting of
free-type, while if it is a component containing a vertex in Jb, cK × l, it would
violate that Jb, dK is disconnecting of free-type as Ja, b− 1K ⊂ Jb, dKc. By analogous
reasoning Jc+ 1, dK, Jb, cK and Ja, dK are all disconnecting intervals of free-type.
6.1.3 Groups of rectangles
In this section, we define groups of rectangles and their boundary conditions,
which constitute the other building blocks of our recursive proof of polynomial
mixing for the FK-dynamics on thin rectangles. As in the previous section, we
consider an n× l rectangle Λn,l = J0, nK×J0, lK with a realizable boundary condition
ξ that is free on ∂eΛn,l ∪ ∂sΛn,l ∪ ∂wΛn,l.
Henceforth, we take
m = m(l) = C? log l ,
where C? is a large constant such that C? > c−1 which we choose later, with c being
the constant from (2.1.1).
A rectangular subset R ⊂ Λn,l is a rectangle of the form Ja, bK× J0, lK for some
0 ≤ a < b ≤ n. For such a rectangular subset, we denote by W (R) its width;
i.e., W (R) = b − a. For the union of distinct and disjoint rectangular subsets
162
R =⋃N(R)i=1 Ri, where Ri = Jai, biK× J0, lK and a1 < b1 < a2 < · · · < aN(R) < bN(R),
its width is defined by W (R) =∑N(R)
i=1 W (Ri) and its boundary is given by
∂R =⋃N(R)i=1 ∂Ri. Moreover, we let ∂nR =
⋃N(R)i=1 ∂nRi and similarly define ∂sR;
on the other hand, ∂eR will be the eastern boundary of the right-most rectangle
Ri and ∂wR, the western boundary of the left-most.
Definition 6.1.6. A group of rectangles R =⋃N(R)i=1 Ri is the union ofN(R) disjoint
rectangular subsets Ri of Λn,l such that W (Ri) ≥ 2m for every i = 1, ..., N(R).
Remark 6.1.7. The requirement that W (Ri) ≥ 2m for every i, which may seem
arbitrary at the moment, is because in our recursive argument, we want our groups
of rectangles R to have interiors that are not influenced by the configuration on
E(Λn,l) \ E(R). When a group of rectangles has a thin constituent rectangle Ri,
the influence of the outside configuration can permeate through all of Ri.
We will be considering blocks dynamics with blocks consisting of groups of
rectangles. If we want to update the configuration on a group of rectangles R ⊂ Λn,l,
the boundary condition induced on ∂R will consist of the boundary condition on
Λn,l which will be fixed to be ξ, together with a fixed random-cluster configuration
ωRc on Ec(R) = E(Λn,l) \ E(R). Hence, our boundary conditions on R will be of
this form, and we denote them by the pair ζ = (ξ, ωRc).
6.1.4 Compatible boundary conditions
We now define the notion of compatibility of groups of rectangles with boundary
conditions ξ. This is crucial in our inductive argument; our algorithm for finding
suitable blocks for the block dynamics will guarantee that if the starting group
163
rectangles is compatible w.r.t. to a given boundary condition, so will each of the
blocks, enabling an inductive procedure.
Let ξ be a realizable boundary condition on ∂Λn,l that is free on ∂sΛn,l∪∂eΛn,l∪
∂wΛn,l, and free in all vertices in ∂nΛn,l at distance at most m from ∂eΛn,l ∪ ∂wΛn,l
(i.e., they appear as singletons in the corresponding boundary partition). This latter
requirement for vertices near the corners of Λn,l allows us to always choose blocks
in our recursion whose boundaries are at least distance m from ∂eΛn,l ∪ ∂wΛn,l; this
simplifies our analysis.
The following will be the distinguishing property of our choice of blocks for the
block dynamics.
Definition 6.1.8. Let R =⋃N(R)i=1 Ri be a group of rectangles with Ri = Jai, biK×
J0, lK and a1 < b1 < a2 < . . . < aN(R) < bN(R). We say R is compatible with ξ, if
1. Between every two consecutive rectangles Ri = Jai, biK × J0, lK and Ri+1 =
Jai+1, bi+1K× J0, lK the interval Jbi −m, ai+1 +mK is a disconnecting interval;
2. The interval Ja1 +m, bN(R) −mK is also a disconnecting interval.
Remark 6.1.9. It is clear from the definition that Λn,l is compatible with ξ: the
first condition is vacuous, while the second is satisfied since all vertices a distance
at most m from ∂eΛn,l ∪ ∂wΛn,l are free. Observe also that bi − ai ≥ 2m for every i
and so bN(R) −m ≥ a1 +m; see Definition 6.1.6.
6.1.5 Defining the blocks for the block dynamics
Now that we have introduced disconnecting intervals, group of rectangles and
the notion of compatibility, we describe our algorithm for picking two blocks
164
Rint,Rext for the block dynamics, based on the boundary condition ξ. Recall
the definitions of width W (·) of a rectangular subset and width of a collection of
rectangles W (R) =∑N(R)
i=1 W (Ri). It will also be convenient to have the following
notation ∂‖R =⋃N(R)i=1 ∂wRi ∪ ∂eRi for the vertical sides of the group of rectangles
R =⋃N(R)i=1 Ri. The following lemma provides the basis for our splitting algorithm.
Lemma 6.1.10. Let ξ be a realizable boundary condition on ∂Λn,l that is free
on ∂sΛn,l ∪ ∂eΛn,l ∪ ∂wΛn,l and free in all vertices in ∂nΛn,l at distance at most
m from ∂eΛn,l ∪ ∂wΛn,l. For every group of rectangles R compatible with ξ, with
W (R) ≥ 100m, there exists a disconnecting interval Jc?, d?K such that both (c?, l)
and (d?, l) are in ∂nR, are distance at least m from ∂‖R, and
1
4W (R) ≤ W (R∩ (Jc?, d?K× J0, lK)) ≤ 3
4W (R) .
We pause to comment on why a disconnecting interval with such properties
provides the desired blocks for the block dynamics. The interval Jc?, d?K from the
lemma will be used to define Aint = R ∩ (Jc?, d?K × J0, lK) and Aext = R \ Aint;
their enlargements by m will form the blocks Rint and Rext (see Figure 6.3). The
requirement that W (Aint) be a fraction of W (R) bounded away from 0 and 1 is
so that we only recurse O(log n) times before reaching small enough widths. The
requirement that the corners of Jc?, d?K× J0, lK are a distance at least m from ∂‖R
is so that when we enlarge the sets Aint,Aext by m, we do not overflow beyond the
rectangles containing (c?, l) and (d?, l). Crucially, our ability to pick disconnecting
segments that satisfy this latter requirement will be guaranteed by the compatibility
of R with ξ.
Proof of Lemma 6.1.10. We begin by finding a candidate disconnecting interval
165
Jc, dK with (c, l), (d, l) ∈ ∂nR satisfying:
1
3W (R) ≤ W (R∩ (Jc, dK× J0, lK)) ≤ 2
3W (R) . (6.1.2)
In the second part of the proof we show how to modify the interval Jc, dK to obtain
a disconnecting interval Jc?, d?K with the added property that both (c?, l) and (d?, l)
are distance at least m from ∂‖R.
If there exist a pair of vertices (x, l), (y, l) ∈ ∂nR such that 13W (R) ≤ W (R∩
(Jx, yK× J0, lK)) ≤ 23W (R) with (x, l) connected to (y, l) through ξ, then we take
c = x, d = y; that is, we use Jc, dK = Jx, yK as our candidate disconnecting interval.
Suppose otherwise that there does not exist any such boundary connection: then
every pair (x, l), (y, l) ∈ ∂nR connected through ξ is such that
W (R∩ (Jx, yK× J0, lK)) <1
3W (R) , or W (R∩ (Jx, yK× J0, lK)) >
2
3W (R) .
(6.1.3)
If the latter holds, then there is a pair, say (x0, l), (y0, l) ∈ ∂nR, for which the latter
holds with a minimal width. The interval Jx0, y0K would be a disconnecting interval
of wired-type and there is no other vertex (z, l) ∈ ∂nR with z ∈ Jx0 + 1, y0 − 1K
connected to (x0, l) and (y0, l) in ξ since if there were such a z it would violate the
assumption that Jx0, y0K is of minimal width with W (R∩(Jx0, y0K×J0, lK)) > 23W (R)
or that every pair of vertices (x, l), (y, l) ∈ ∂nR connected in ξ satisfy (6.1.3).
Consequently, all other connections through ξ between vertices (x1, l), (y1, l) ∈
∂nR∩ (Jx0 + 1, y0 − 1K× J0, lK) will be such that
W (R∩ (Jx1, y1K× J0, lK)) <1
3W (R) .
166
We can then partition the vertices of ∂nR ∩ (Jx0 + 1, y0 − 1K × l) into disjoint
disconnecting intervals of free-wired-type using the following procedure:
1. Let ρ = C1, . . . , Ck be the partition of the vertices of ∂nR∩ (Jx0 + 1, y0 −
1K × l) induced by the boundary condition ξ; every Ci corresponds to a
distinct connected component of ξ;
2. For each Ci, consider the disconnecting interval Li of free-wired-type deter-
mined by the left-most and right-most vertices of Ci in ∂nR∩ (Jx0 + 1, y0 −
1K× l). Notice that some of the Ci’s may be singletons, which we view as
disconnecting intervals of the free-wired-type;
3. Let Li1 , Li2 , . . . , Li` be those disconnecting intervals which are maximal, in
the sense that there does not exist j and k such that Lij ⊂ Lk.
The set of disconnecting intervals Li1 , Li2 , . . . , Li` partitions Jx0 + 1, y0 − 1K into
disjoint disconnecting intervals of free-wired-type with the property that W (R∩
(Lij × J0, lK)) ≤ 13W (R) for every j ∈ 1, . . . , `. We can then use Lemma 6.1.3 to
merge adjacent disconnecting intervals until we obtain a candidate disconnecting
interval Jc, dK ⊂ Jx0, y0K (of free-type), having width W (R ∩ (Jc, dK × J0, lK) ∈
[13W (R), 2
3W (R)].
Now that we have found a candidate disconnecting interval Jc, dK satisfy-
ing (6.1.2), we modify it to obtain a disconnecting interval Jc?, d?K with the property
that both (c?, l) and (d?, l) are distance at least m from ∂‖R.
If (c, l) is at distance at least m from ∂‖R, set c? = c, and similarly if (d, l) is at
distance at leastm from ∂‖R, then set d? = d. Otherwise, suppose (c, l) is at distance
less than m from ∂wRi for some constituent rectangular subset Ri = Jai, biK× J0, lK
of R. Since R is compatible with ξ, the interval Ic = Jbi−1 − m, ai + mK is a
167
disconnecting interval, and we set
c? =
ai +m, if Ic is of wired-type, or i = 1, or W (Ri) = 2m;
ai +m+ 1 , if Ic is only of free-type, and W (Ri) > 2m;
note that the first case includes when Ic is of free-wired-type, whereas the second
case only applies when the interval is of free-type and not of wired-type. When
(c, l) is instead at distance less than m from ∂eRi for some i, then we simply set
c? = bi −m.
Symmetrically, if (d, l) is at distance less than m from ∂eRi for some Ri =
Jai, biK× J0, lK, let Id = Jbi −m, ai+1 +mK
d? =
bi −m, if Id is of wired-type, or i = N(R), or W (Ri) = 2m;
bi −m− 1 , if Id is only of free-type, and W (Ri) > 2m.
When (d, l) is at distance less than m from ∂wRi, let d? = ai + m. To see that
this process is well-defined, notice that since W (Ri) ≥ 2m for every i, the points
(c, l), (d, l) cannot be both less than m away from ∂eRi and less than m away from
∂wRi.
We claim that in all of these cases the interval Jc?, d?K is a disconnecting interval.
The fact that (c?, l), (d?, l) ∈ ∂nR are a distance at least m away from ∂‖R follows
directly from the construction.
First, observe that when (c, l), (d, l) are both a distance at least m from ∂‖R,
then we set c? = c and d? = d; in this case Jc?, d?K is disconnecting since Jc, dK was
chosen to be disconnecting.
Next suppose that d was at least a distance m from ∂‖R while c was at distance
less than m from ∂wRi for some i. In this setting, we establish that Jc?, d?K is a
168
disconnecting interval by considering the following four cases:
Case 1: i = 1. Note that Ja1 + m, bN(R) −mK and Jc, dK are disconnecting
intervals by compatibility and construction, respectively. Also, W (R∩ (Jc, dK)) ≥100m
3since W (R) ≥ 100m. Hence, c < a1 +m < d < bN(R) −m and Lemma 6.1.5
implies that Jc?, d?K = Ja1 +m, dK is disconnecting.
Case 2: i > 1 and Jbi−1−m, ai +mK is of wired-type. Since W (R∩ (Jc, dK)) ≥100m
3, we have bi−1 − m < c < ai + m < d. Then, by Lemma 6.1.4, Jc, dK is
a disconnecting interval of wired-type, and Lemma 6.1.5 implies that Jc?, d?K =
Jai +m, dK is a disconnecting interval of wired-type.
Case 3: i > 1, Jbi−1 −m, ai +mK is of free-type and W (Ri) > 2m. We again
have bi−1 −m < c < ai + m < d, and by Lemma 6.1.4 Jc, dK is a disconnecting
interval of free-type. Therefore, by Lemma 6.1.5, Jc?, d?K = Jai + m + 1, dK is a
disconnecting interval of free-type.
Case 4: i > 1, Jbi−1 −m, ai + mK is only of free-type and W (Ri) = 2m. In
this case, it must be that i < N(R). Therefore, by the compatibility of R and
ξ, Jbi − m, ai+1 + mK = Jai + m, ai+1 + mK is also a disconnecting interval. In
fact, by Lemma 6.1.4, Jbi −m, ai+1 + mK is a disconnecting interval of free-type.
Applying Lemma 6.1.5 with respect to Jai+m+1, dK and Jai+m, ai+1 +mK implies
Jc?, d?K = Jai +m, dK is a disconnecting interval.
Suppose otherwise that c was at distance less than m from ∂eRi for some i, and d
was still at least a distance at least m from ∂‖R. In this case, W (R∩(Jc, dK)) ≥ 100m3
implies N(R) > 1 as well as i < N(R). Moreover, Jbi − m, ai+1 + mK is a
disconnecting interval by the compatibility of R and ξ. Since bi − m < c <
ai+1 +m < d, Lemma 6.1.4 then implies that when Jbi −m, ai+1 +mK is of wired-
type (resp., of free-type) then Jc, dK is also of wired-type (resp., of free-type), and
169
therefore by Lemma 6.1.5, Jc?, d?K = Jbi − m, dK is a disconnecting interval of
wired-type (resp., of free-type).
The symmetric cases where (c, l) is a distance at least m from ∂‖R, and (d, l)
is a distance less than m from ∂‖R can be checked analogously. The remaining
case in which both (c, l) and (d, l) are a distance less than m from ∂‖R can also be
checked similarly, by first modifying the candidate interval on one side in order to
obtain a disconnecting interval Jc?, dK having (c?, l) a distance at least m from ∂‖R,
and then performing the modification on d to obtain the desired disconnecting
interval Jc?, d?K.
Finally, we claim that in all such situations, Jc?, d?K satisfies
1
4W (R) ≤ W (R∩ (Jc?, d?K× J0, lK)) ≤ 3
4W (R) .
This follows from the facts that W (R) ≥ 100m, |c− c?| ≤ m and |d− d?| ≤ m.
We will now use the disconnecting interval given by Lemma 6.1.10 to define two
subsets Aint and Aext of R, and set Rint and Rext to be their enlargments by m.
Definition 6.1.11. For a group of rectangles R compatible with ξ, let Jc?, d?K be
the disconnecting interval given by Lemma 6.1.10. Then define the interior and
exterior cores as
Aint = R∩ (Jc?, d?K× J0, lK) and Aext = R∩ (Jc?, d?Kc × J0, lK) ;
see Figure 6.3(a). Using the disconnecting interval Jc?, d?K define the interior and
exterior blocks as follows:
1. Let c− = c? −m and c+ = c? +m. Let d− = d? −m and d+ = d? +m.
170
Aint AintAext Aext
(a)
Rint Rint
Rext Rext
(b)
Figure 6.3: (a) The cores Aint and Aext and (b) the blocks Rint and Rext. Theblocks Rint and Rext are the enlargements of Aint and Aext by exactly m, and arethus, themselves, groups of rectangles.
2. DefineRint = R∩(Jc−, d+K×J0, lK) andRext = R∩((J0, c+K∪Jd−, nK)×J0, lK);
see Figure 6.3(b).
We will demonstrate that this choice of Rint and Rext has certain fundamental
properties that will facilitate the recursive argument in the proof of Theorem 6.1.1.
Proposition 6.1.12. If R is a group of rectangles compatible with ξ, and moreover,
W (R) ≥ 100m, then the sets Rint and Rext are groups of rectangles satisfying the
following properties:
1. 15W (R) ≤ W (Rint) ≤ 4
5W (R) and likewise 1
5W (R) ≤ W (Rext) ≤ 4
5W (R);
2. Both Rint and Rext are compatible with ξ.
Proof. We show first that Rint and Rext are groups of rectangles (see Defini-
tion 6.1.6). By construction Rint and Rext are unions of rectangular subsets of
Λn,l. Since R is a group of rectangles, every constituent rectangular subset Ri has
W (Ri) ≥ 2m. By Lemma 6.1.10, (c?, l), (d?, l) were such that they are a distance at
least m from ∂‖R; as a consequence Jc−, c+K× J0, lK and Jd−, d+K× J0, lK are subsets
of R. Moreover, every constituent rectangular subset of Rint and Rext is either a
constituent rectangle of R, or contains one of Jc−, c+K× J0, lK and Jd−, d+K× J0, lK,
implying that every rectangular subset of Rint and Rext has width at least 2m.
Together, these verify that Rint and Rext are indeed groups of rectangles.
171
The first property follows from the facts that, by Lemma 6.1.10, 14W (R) ≤
W (Aint) ≤ 34W (R) and 1
4W (R) ≤ W (Aext) ≤ 3
4W (R), while W (R) ≥ 100m,
W (Rint)−W (Aint) = 2m and W (Rext)−W (Aext) = 2m.
To verify the second property, consider Rext first and label its rectangular
subsets Rext1 , ..., Rext
N(Rext) (from left to right) with Rextj = Jaextj , bextj K. Let i, i + 1
be the indices of the two distinct rectangular subsets containing Rext \ Aext. As
before, R1, ..., RN(R) are the constituent rectangular subsets of R. Then, for every
j ∈ 1, ..., N(Rext)− 1 \ i, there is a k ∈ 1, .., N(R)− 1 such that bextj = bk
and aextj+1 = ak+1. Hence, the compatibility of R with ξ guarantees that the interval
Jbextj −m, aextj+1 +mK is disconnecting for every j 6= i.
To see that Jbexti −m, aexti+1 +mK is disconnecting, notice that by construction of
Rext, Jbexti −m, aexti+1 +mK = Jc?, d?K. Finally, since (c?, l), (d?, l) were at distance
at least m from ∂‖R, the interval Jaext1 + m, bextN(Rext) − mK matches the interval
Ja1 +m, bN(R) −mK, and thus by compatibility of R with ξ implies the former is a
disconnecting interval. Altogether these imply the compatibility of Rext with ξ.
Similarly, label the constituent rectangles of Rint as Rint1 , ..., Rint
N(Rint) and notice
that 1 and N(Rint) are the indices of the two rectangles containing Rint \ Aint.
Every interval of the form Jbinti −m, ainti+1 +mK corresponds (up to change of index)
to such an interval for R, so that by compatibility of R with respect to ξ, these
are all disconnecting intervals. The interval Jaint1 +m, bintN(Rint) −mK is exactly the
interval Jc?, d?K given by Lemma 6.1.10, and therefore this is disconnecting by
construction. Together, these imply the compatibility of Rint with ξ.
172
6.1.6 Block dynamics coupling time
Here we consider the block dynamics on R with blocks Rint and Rext as defined
in Section 6.1.5; see also Figure 6.3(b). We begin by defining the block dynamics
for a group of rectangles.
Definition 6.1.13. Let R be a group of rectangles with boundary condition ζ.
The block dynamics Xt with blocks B = B1, . . . ,Bk such that Bi ⊂ R and⋃ki=1 E(Bi) = E(R) is the discrete-time Markov chain that at each t, selects i
uniformly at random from 1, ..., k and updates the configuration on E(Bi) from
the stationary distribution conditional on the configuration Xt(Ec(Bi)).
Fix B = Rint,Rext and let gapB(Rζ) be the spectral gap of this block dynamics
on R with boundary condition ζ = (ξ, ωRc), where ξ is a realizable boundary
condition on Λn,l and ωRc is a configuration on Ec(R).
Lemma 6.1.14. Let ξ be a realizable boundary condition on ∂Λn,l that is free on
∂sΛn,l ∪ ∂eΛn,l ∪ ∂wΛn,l and free on vertices in ∂nΛn,l at distance at most m from
∂eΛn,l ∪ ∂wΛn,l. For every q > 1 and p 6= pc(q), there exists K = K(p, q) ≥ 1 such
that for every group of rectangles R compatible with ξ, and every configuration ωRc,
gapB(R(ξ,ω(Rc))) ≥ K−1 .
Proof. We consider the p < pc(q) case first. Let Xt, Yt be two instances of the
block dynamics on R with boundary condition ζ = (ξ, ωRc) started from initial
configurations X0, Y0. We design a coupling P for the steps of Xt and Yt and
bound its coupling time. This yields upper bounds for both the mixing time and
the inverse spectral gap of the block dynamics; see Section 2.2 for a brief overview of
173
the coupling method. For this, we will show that for any two initial configurations
X0, Y0
P(X2 = Y2) = Ω(1) . (6.1.4)
Since this bound will be uniform over X0, Y0, we can make independent attempts
at coupling the two chains every two steps. Hence, there would exist T = O(1)
such that
maxX0,Y0
P(XT 6= YT ) ≤ 1
4,
bounding the coupling time by T = O(1) concluding the proof.
First observe that with probability 1/4 the first block to be updated is Rint
and the second is Rext. Suppose this is the case and let us consider the update on
block Rint. Observe that X1(Rint) is distributed according to the random-cluster
measure πθX on Rint, where θX is the boundary condition induced on ∂Rint by
the boundary condition ζ on R and the configuration of X0 in E(R) \ E(Rint).
Likewise, Y1(Rint) has law πθY , with the boundary condition θY defined analogously
but considering the configuration of Y0 in E(R) \ E(Rint) instead. Therefore, any
coupling for the random-cluster measures πθX , πθY yields a coupling for the first
steps of Xt and Yt.
Let θ1 be the boundary condition on ∂Rint induced by ζ and the configuration
that is all wired on E(R) \ E(Rint); let πθ1 be the corresponding random-cluster
measure on Rint. Let Qw, Qe ⊂ Rint be the two rectangles of width m that contain
all the vertices in Rint \ Aint; i.e., Qw ∪ Aint ∪ Qe = Rint, Qw ∩ Aint = ∅ and
Qe ∩ Aint = ∅ (see Figure 6.4(a)). Let ∂E(Qw) be the set edges with one endpoint
in Qw and the other in Aint, and similarly define ∂E(Qe).
174
Aint
Qw Qe
(a)
Aint
(b)
Aint
(c)
Figure 6.4: (a) The block Rint with its subsets Aint, Qw and Qe. (b) The blockRint with the dual-paths (dotted) of a configuration in Γ. (c) The block Rint withthe dual-paths (dotted) of a configuration in Γns
w ∩ Γwew ∩ Γns
e ∩ Γwee ⊂ Γ.
Let Γw be the set of configurations in Rint that have a dual-path in E(Qw) ∪
∂E(Qw) connecting the top-most edge in ∂E(Qw) to an edge in ∂sQw, and similarly
define Γe as the set of configurations inRint that have a dual-path in E(Qe)∪∂E(Qe)
from the top-most edge in ∂E(Qe) to an edge in ∂sQe. (A dual-path is an open
path in the dual configuration.) Let Γ = Γe ∩ Γw; see Figure 6.4(b). The following
lemma supplies the desired coupling.
Lemma 6.1.15. Let q > 1 and p < pc(q). There exists a coupling P1 of the
distributions πθX , πθY , πθ1 such that if (ωθX , ωθY , ωθ1) is sampled from P1, then all
of the following hold:
1. P1(ωθX , ωθY , ωθ1) > 0 only if ωθX ≤ ωθ1 and ωθY ≤ ωθ1;
2. P1(ωθX (Aint) = ωθY (Aint) | ωθ1 ∈ Γ) = 1;
3. There exists a constant ρ = ρ(p, q) > 0 such that P1(ωθ1 ∈ Γ) ≥ ρ.
Hence, if we use the coupling P1 from Lemma 6.1.15 to couple the first step of
the chains, then X1 and Y1 will agree on E(Aint) with probability at least ρ > 0.
175
If this occurs, then we can easily couple the update on Rext in the second step
so that X2 = Y2, since X1(E(Aint)) = Y1(E(Aint)) implies X1(E(R) \ E(Rext)) =
Y1(E(R)\E(Rext)), and thus the boundary conditions induced by the two instances
of the chain on Rext are identical. As a consequence, we obtain that for any X0, Y0,
P(X2 = Y2) ≥ρ4.
which gives (6.1.4) and thus concludes the proof for p < pc(q).
The case when p > pc(q) follows by an analogous dual argument. In this case, the
set Γ has o(1) probability and we therefore replace it by the set Γ∗ = Γ∗e∩Γ∗w, where
Γ∗w is the set of configurations in Rint that have an open path in E(Qw) ∪ ∂E(Qw)
connecting the top-left corner of Aint (c?, l) to ∂sQw; similarly, Γ∗e is the set of
configurations in Rint that have an open path in E(Qe) ∪ ∂E(Qe) connecting the
top-right corner (d?, l) of Aint to ∂sQe. Let θ0 be the boundary condition on ∂Rint
induced by ζ and the all-free configuration on E(R) \ E(Rint); let πθ0 be the
resulting random-cluster distribution on Rint. The constant bound on the coupling
time of the block dynamics would then follow as above from the following dual
analogue of Lemma 6.1.15.
Lemma 6.1.16. Let q > 1 and p > pc(q). There exists a coupling P0 of the
distributions πθX , πθY , πθ0 such that if (ωθX , ωθY , ωθ0) is sampled from P0, then all
of the following hold:
1. P0(ωθX , ωθY , ωθ0) > 0 only if ωθX ≥ ωθ0 and ωθY ≥ ωθ0;
2. P0(ωθX (Aint) = ωθY (Aint) | ωθ0 ∈ Γ∗) = 1;
3. There exists a constant ρ = ρ(p, q) > 0 such that P0(ωθ0 ∈ Γ∗) ≥ ρ.
176
We proceed with the proof of the key Lemma 6.1.15. Parts 1 and 2 of the lemma
follow from a fairly standard coupling technique; see, e.g., [2, 10]. We shall also use
this approach later in the proof of Lemma 6.2.5. Part 3 will be a consequence of
the EDC property (2.1.1); see proof of Claim 6.1.17.
Proof of Lemma 6.1.15. Let
L = ∂wQw ∪ ∂nQw ∪ ∂eQe ∪ ∂nQe;
note that L ∩ Aint = ∅. For an FK configuration ω on Rint let
F (ω) := Rint \⋃
v∈LC(v, ω) ,
where C(v, ω) is the vertex set of the connected component of v in ω, ignoring the
boundary connections.
Clearly, πθ1 πθX and πθ1 πθY and thus there exist monotone couplings PX
(resp., PY ) for πθX and πθ1 (resp., πθY and πθ1). We use PX and PY to construct
the coupling P1 as follows:
1. sample (ωθX , ωθ1) from PX and sample ωθY from PY ( · | ωθ1 );
2. if Aint ⊆ F (ωθ1), sample ω∆ from πη1
∆ —where ∆ = ∆(ωθ1) is the subgraph
induced by F (ωθ1) and η1 is the boundary condition on ∂F (ωθ1) induced by θ1
and the configuration of ωθ1 on E(Rint)\E(F (ωθ1))—and make ωθ1(F (ωθ1)) =
ωθX (F (ωθ1)) = ωθY (F (ωθ1)) = ω∆.
Let P1 be the resulting distribution of (ωθX , ωθY , ωθ1). After step (i), P1 has the
desired marginals. Moreover, we claim that replacing the configuration in F (ωθ1)
177
with ω∆ in step (ii) has no effect on the distribution, provided Aint ⊆ F (ωθ1). For
this, we show that the three boundary conditions η1, ηX , ηY induced on ∂F (ωθ1)
by the configurations of ωθX , ωθY , ωθ1 on E(Rint) \ E(F (ωθ1)), respectively, and
the corresponding boundary conditions θX , θY , θ1 are all the same.
First, observe that the boundary condition on ∂sAint is, in all three cases, free
by assumption. Also, from the definition of F (ωθ1) it follows that every edge of
E(Rint) \ E(F (ωθ1)) incident to ∂F (ωθ1) is closed in ωθ1 ; hence the same holds
for ωθX and ωθY . The remaining portion of ∂F (ωθ1) is precisely the set of vertices
(∂Aint ∩ ∂R) \ ∂sR. To show that η1, ηX , ηY also agree on (∂Aint ∩ ∂R) \ ∂sR we
use the fact that top-left and top-right corners of Aint correspond to the endpoints
of the disconnecting interval Jc?, d?K. Indeed, for the boundary conditions η1, ηX , ηY
to disagree on (∂Aint ∩ ∂R) \ ∂sR it must be the case that there are at least two
distinct connected components of ζ = (ξ, ωRc) connecting (∂Aint ∩ ∂R) \ ∂sR and
∂R \ ∂Aint. Since (c?, l), (d?, l) /∈ ∂‖R and ξ is free on ∂sR, this would require at
least two distinct connected components of ξ connecting vertices in Jc?, d?K× l
to vertices in Jc?, d?Kc × l. However, when Jc?, d?K is a disconnecting interval of
free-type, there are no such connected components, and when it is disconnecting of
wired-type, the planarity of realizable boundary conditions implies that there can
be at most one such connected component, which is exactly the component of ξ
containing both (c?, l) and (d?, l).
Altogether, these together imply that when Aint ⊆ F (ωθ1), the three boundary
conditions η1, ηX , ηY induced on ∂F (ωθ1) are the same. The domain Markov
property of random-cluster measures (see, e.g., [47]) then guarantees that replacing
the configuration in F (ωθ1) with ω∆ had no effect on the distributions.
Finally, note that if ωθ1 ∈ Γ, then the vertices in the boundary components of
178
L, i.e.,⋃v∈LC(v, ωθ1), will be confined to Qw ∪Qe, in which case Aint ⊆ F (ωθ1).
This establishes parts 1 and 2, and part 3 follows directly from the following claim,
which will conclude the proof.
Claim 6.1.17. Let q > 1 and p < pc(q). There exists ρ = ρ(p, q) > 0 such that
πθ1(Γ) ≥ ρ.
Proof. Let Γnsw (resp., Γns
e ) be the set of configurations on Rint such that there is
dual-path from ∂nQw to ∂sQw (resp., from ∂nQe to ∂sQe) in E(Qw)∪∂E(Qw) (resp.,
E(Qe)∪∂E(Qe)). Also, let Γwew (resp., Γwe
e ) be the set of configurations onRint such
that there is dual-path in E(Qw)∪∂E(Qw) (resp., in E(Qe)∪∂E(Qe)) between the
left-most edge in ∂nQw (resp., the right-most edge in ∂nQe) and the top-most edge
of ∂E(Qw) (resp., ∂E(Qe)); see Figure 6.4(c). Note that Γnsw ∩Γwe
w ∩Γnse ∩Γwe
e ⊂ Γ.
The width of Qw is m = C? log l. Hence, the EDC property (2.1.1) implies
that when C? is large enough there exists a constant ρ0 = ρ0(p, q) > 0 such that
πθ1(Γnsw ) ≥ ρ0 and similarly for Γns
e . The EDC property (2.1.1) also implies that
πθ1(Γwew ) ≥ ρ1 and πθ1(Γwe
e ) ≥ ρ1, for a suitable ρ1 = ρ1(p, q) > 0. We justify this
as follows. By the EDC property (2.1.1), there exists some constant D such that
with probability Ω(1), no pair of vertices whose distance is at least D, one of which
is in ∂nQw \ ∂wQw and the other in ∂eQw ∪ ∂sQw ∪ ∂eQw, will be connected in Qw.
At the same time, with probability Ω(1), we can force O(D) edges bordering those
pairs of vertices that are distance less than D to be closed so that no pair of vertices
that are closer than D are connected in Qw either. The analogous reasoning holds
for Γwee and Qe. Since each of the events Γns
w ,Γwew ,Γns
e ,Γwee are decreasing events,
by the FKG inequality (see, e.g., [47]) we get
πθ1(Γ) ≥ πθ1(Γnsw ∩ Γwe
w ∩ Γnse ∩ Γwe
e ) ≥ πθ1(Γnsw )πθ1(Γwe
w )πθ1(Γnse )πθ1(Γwe
e ) ≥ ρ20ρ
21,
179
and thus we can take ρ = ρ20ρ
21 to have the desired estimate.
Proof of Lemma 6.1.16. The proof of Lemma 6.1.15 carries to Lemma 6.1.16
with certain natural modifications we describe next. As before, we let L = ∂wQw ∪
∂nQw∪∂eQe∪∂nQe and let (L∗, E(L∗)) be the dual-graph induced by the set of dual-
edges intersecting E(L) in (Z2)∗; its vertex set consists of exactly 2E(L)−1 vertices,
and we refer to those outside of Rint as ∂extL∗. Similarly, let (R∗int, E(R∗int)) and
(A∗int, E(A∗int)) be the dual-graphs induced by the dual-edges intersecting E(Rint)
and E(Aint) in (Z2)∗ respectively.
For an FK configuration ω on E(Rint) we define a dual version of F (ω) as
F ∗(ω) := R∗int \⋃
v∗∈∂extL∗C∗(v∗, ω) ,
where C∗(v∗, ω) is the dual-vertex set of the connected component of v∗ in the
dual-configuration ω∗ (ignoring the boundary connections).
Using monotone couplings for πθ0 and πθX , and for πθ0 and πθY , we can define P0
analogously to P1 with the configuration on E(F ∗(ωθ0)) being resampled whenever
A∗int ⊆ F ∗(ωθ0). Observe that updating the dual configuration on E(F ∗(ωθ0)) is
equivalent to updating the primal edges intersecting E(F ∗(ωθ0)). The fact that
resampling the configuration in E(F ∗(ωθ0)) has no effect on the distribution when
A∗int ⊆ F ∗(ωθ0) follows in similar fashion to the proof of Lemma 6.1.15. Indeed,
from the definition of F ∗(ωθ0), when A∗int ⊆ F ∗(ωθ0) there is a primal connection
between (c?, l) and ∂sQw in E(Qw) ∪ ∂E(Qw) together with a primal connection
between (d?, l) and ∂sQe in E(Qe) ∪ ∂E(Qe), so that A∗int ⊆ F ∗(ωθ0) implies the
event Γ∗. Together with the fact that Jc?, d?K is a disconnecting interval and the
assumptions that ξ is free on ∂sAint and (c?, l), (d?, l) /∈ ∂‖R, this ensures that the
180
three induced boundary conditions on E(F ∗(ωθ0)) coincide.
Finally, part 3 of the lemma follows by an analogous argument to that in
Claim 6.1.17, only replacing the EDC property by the matching exponential decay
of dual-connectivities when p > pc(q).
6.1.7 Proof of Theorem 6.1.1
In this section, we put together the results from Sections 6.1.3–6.1.6 to prove
Theorem 6.1.1. We first remind the reader of the following spectral gap estimate for
the FK Glauber dynamics. This proposition is written in the spin system setting
in [67] as presented in Theorem 2.2.8, but the proof follows mutatis mutandis for
the FK model, and the proof is therefore omitted. Also, we note that this theorem
holds in more generality for arbitrary graphs with arbitrary boundary conditions,
but for clarity we choose to state it here for groups of rectangles.
Theorem 6.1.18 ([67, Proposition 3.4]). Consider the FK Glauber dynamics on
a group of rectangles R with boundary condition ζ. Let gaprc(Rζ) and gapB(Rζ),
respectively, be the spectral gaps of the FK Glauber dynamics on R and of the block
dynamics with blocks B = R1, . . . ,Rk such that Ri ⊂ R and⋃ki=1E(Ri) = E(R).
For every p, q there exists γ = γ(p, q) ∈ (0, 1) such that
gaprc(Rζ) ≥ γ · ( maxe∈E(R)
#i : E(Ri) 3 e)−1 · gapB(Rζ) · mini=1,...,kη∈Ω(Rci )
gaprc(R(ζ,η)i ) ,
where Ω(Rci) denotes the set of FK configurations on E(R) \ E(Ri).
The final ingredient is the following mixing time estimate for the base case in
our recursive proof.
181
Lemma 6.1.19. Consider a group of rectangles R0 ⊂ Λn,l with W (R0) ≤ 100m.
For every q > 1 and p 6= pc(q), there exists κ = κ(p, q) > 0 such that for every
boundary condition ζ on R0,
gaprc(Rζ0) ≥ 1
l(log l)2 · qκl.
Proof. Note that |∂R0| = O(m + l). Hence, we can first modify the boundary
conditions to be all free on all of ∂R0, incurring a cost of a qO(l) factor in the
spectral gap by Eq. (2.2.9) and Lemma 2.2.13; recall that m = O(log l). Then we
can use the fast mixing result of [10], for instance, to bound the mixing time on R0
with free boundary condition by O(l(log l)2). This translates into a lower bound
for the spectral gap and the result follows.
Proof of Theorem 6.1.1. Fix q > 1, p 6= pc(q) and Λn,l with a realizable bound-
ary condition ξ′ that is free on ∂eΛn,l ∪∂sΛn,l ∪∂wΛn,l. By Eqs. (2.2.9)–(2.2.10), we
may modify ξ′ to a boundary condition ξ that is also free on all vertices a distance
at most m = C? log l from ∂eΛn,l ∪ ∂wΛn,l at a cost of an exponential in m factor
in the mixing time of the FK-dynamics. Let ξ be the resulting realizable boundary
condition.
We wish to prove, by induction, that for every 100m ≤ s ≤ n, every group of
rectangles Rs ⊂ Λn,l that is compatible with ξ and has W (Rs) = s satisfies
gap(R(ξ,ωRcs )s
)≥ 1
l(log l)2qκl · blog s(6.1.5)
for some b = b(p, q) > 0 to be chosen, uniformly over all configurations ωRcs on
Ec(Rs). Eq. (6.1.5) concludes the proof since Λn,l is a group of rectangles with
182
W (Λn,l) = n and is compatible with ξ.
The base case of this induction was shown in Lemma 6.1.19. Now suppose
inductively that this holds for all 1 ≤ k ≤ s − 1 for some s ≤ n; we show that
it also holds for s. Fix any Rs that is compatible with ξ, and any configuration
ωRcs . Then, if we let Rint = Rint(Rs) and Rext = Rext(Rs) be the blocks given
by Definition 6.1.11 and Bs the block-dynamics with respect to these blocks, by
Theorem 6.1.18
gaprc
(R(ξ,ωRcs )s
)≥ γ
2· gapBs
(R(ξ,ωRcs )s
)· mini∈int,ext
minωRc
i
gaprc
(R
(ξ,ωRci)
i
)≥ γ
2K· mini∈int,ext
minωRc
i
gaprc
(R
(ξ,ωRci)
i
),
where the second inequality follows from Lemma 6.1.14. By Proposition 6.1.12,
maxW (Rint),W (Rext) ≤ 45s, and we can apply the inductive hypothesis to bound
the second term on the right-hand side above. Combined with Lemma 6.1.19, we
see that the choice of b = (2γ−1K)1
log(5/4) ensures that (6.1.5) holds also for Rs.
(Note that 2γ−1K ≥ 1.)
This establishes the result for the case when the boundary condition is free on
∂eΛn,l ∪ ∂sΛn,l ∪ ∂wΛn,l. As noted earlier (see Remark 2.2.1), this implies by duality
the same bound for the class of realizable boundary conditions ξ that are wired on
∂eΛn,l ∪ ∂sΛn,l ∪ ∂wΛn,l for all p 6= pc(q).
183
Csw Cse
Cnw Cne
5r
5r
(a)
Rs
Rn
Rw Ren−6r
3r
(b)
e
e
2r+1
(c)
Figure 6.5: (a) The subsets Cne, Cnw, Cse, and Csw. (b) The subsets Rn, Re, Rw
and Rs. (c) B(e, r) for two edges e of Λn.
6.2 Polynomial mixing time for realizable bound-
ary conditions
In this section we prove Theorem 5. This theorem is proved for p < pc(q) using
the technology introduced in Section 6.0.1; namely, we construct a collection of
subsets B for which we can establish LM and MSM; see Definitions 6.0.1–6.0.3.
To establish LM we crucially use Theorem 6.1.1. The results for p > pc(q) follow
from the planar duality of the model and the self-duality of the class of realizable
boundary conditions, as explained in Section 2.2.1, Remark 2.2.1.
For general realizable boundary conditions, proving LM for a collection of
subsets B for which MSM holds is the main challenge. This is because, for MSM
to hold for a collection B for all realizable boundary conditions, a subset in B
needs to contain Ω(n) edges. In particular, some element of B must include most
(or all) edges near ∂Λn, as otherwise it is straightforward to construct examples
of realizable boundary conditions for which MSM does not hold. Thus, a trivial
(exponential in the perimeter) upper bound for the mixing time on those subsets
with Ω(n) edges would be unhelpful and we use Thoerem 6.1.1.
184
We now define the collection of blocks for which we can establish both LM
and MSM. Let r ∈ N and let Cne, Cnw, Cse, Csw ⊂ Λn be the four square boxes
of side length 5r with a corner that coincides with a corner of Λn; see Figure
6.5(a). Let Rn ⊂ Λn be the (n− 6r)× 2r rectangle at distance 3r from both ∂wΛn
and ∂eΛn whose top boundary is contained in ∂nΛn and let Re, Rw, Rs be defined
analogously; see Figure 6.5(b). Let R = Rn ∪Re ∪Rw ∪Rs. Now, for e ∈ E(Λn),
let B(e, r) ⊂ Λn be the set of vertices in the minimal square box around e such
that d(e,Λn \ B(e, r)) ≥ r. Note that if d(e, ∂Λn) > r, then B(e, r) is just a
square box of side length 2r + 1 centered at e; otherwise B(e, r) intersects ∂Λn; see
Figure 6.5(c). Finally, let
Br = Cne, Cnw, Cse, Csw, R ∪ B(e, r) : e ∈ E(Λn), d(e, ∂Λn) > r. (6.2.1)
We claim that LM holds for Br with r = Θ(log n) and T = O(nC) for some
constant C > 0.
Theorem 6.2.1. Let q ≥ 1, p < pc(q) and r = c0 log n with c0 > 0 independent of
n. There exists a constant C > 0 such that LM holds for every realizable boundary
condition ξ and Br with T = O(nC).
The subsets B(e, r) in Br and the corner boxes Cne, Cnw, Cse and Csw are small
enough that crude bounds for their mixing times are sufficient. As mentioned
earlier, the main challenge for proving local mixing for Br is to derive a mixing time
bound for R = Rn ∪Re ∪Rw ∪Rs as it intersects the boundary of Λn and contains
Ω(n) vertices. To establish such a bound we rely on Theorem 6.1.1. In particular,
we relate the mixing time of the FK-dynamics on R to that of the FK-dynamics
on a single thin rectangle by concatenating the four rectangles constituting R, one
185
after another, such that the union of their outer boundaries make up the northern
boundary of the new rectangle.
The final ingredient of the proof is establishing MSM for the collection Br.
We show that MSM holds for Br with r = Θ(log n) for all realizable boundary
conditions ξ where the vertices in ∂Λn at distance 5r from the corners of Λn are
free in ξ. This is sufficient since any realizable boundary condition can be turned
into a realizable boundary condition with this property by simply removing all
connections in ξ involving vertices near the corners of Λn; this modification can
change the mixing time of the FK-dynamics by a factor of at most exp(O(r)); see
Eqs. (2.2.9)–(2.2.10). Theorem 5 then follows from Theorems 6.2.1, 6.2.2 and 6.0.5.
Theorem 6.2.2. Let q ≥ 1, p < pc(q) and r = c0 log n with c0 > 0 independent
of n. Let ξ be a realizable boundary condition with the property that every vertex
v ∈ ∂Λn at distance at most 5r from a corner of Λn is free in ξ. Then, for all
sufficiently large c0 > 0, MSM holds for ξ and Br with δ < 1/(12|E(Λn)|).
We are now ready to prove Theorem 5 using the above.
Proof of Theorem 5. As mentioned earlier and in Remark 2.2.1, by duality of
the dynamics and self-duality of the class of realizable boundary conditions, it
suffices to prove the theorem for p < pc(q). Let P be the set of realizable boundary
conditions of Λn = (Λn, E(Λn)). For η ∈ P , let (η1, η2, . . . , ηk) denote the partition
of ∂Λn corresponding to η, and let η(`) be the boundary condition obtained as
follows: for each v ∈ ∂Λn, if v ∈ ηi and v is at distance at most ` from a corner of
Λn, remove v from ηi and add it as a singleton to the partition. Let P` be the set
of all boundary conditions obtained in this manner.
Consider Λn with arbitrary realizable boundary conditions ξ ∈ P. By (2.2.9)–
186
(2.2.10), we see that there exists C > 0 such that for every ξ ∈ P , we have
tmix(Λξn) ≤ Cq8C` · n2 · tmix(Λξ(`)
n ) .
It therefore suffices to prove the mixing time estimate uniformly over all modified
boundary conditions η ∈ P` for ` = 5r and r = c0(log n) with c0 taken to be
sufficiently large, as q8C` would only be polynomial in n. By Theorem 6.2.2, for c0
large enough, uniformly over all such boundary conditions we have moderate spatial
mixing with respect to Br and η ∈ P` with δ < 1/(12|E(Λn)|). Theorem 6.2.1
implies that LM holds for Br and η ∈ P` with T = O(nc) where c > 0 constant.
The result then follows from Theorem 6.0.5.
Remark 6.2.3. We note that Theorem 5 also holds for the FK-dynamics on
rectangles Λn,l ⊂ Z2 with ` ≤ n, provided these rectangles are not too thin. For
example, when l = Ω((log n)2), our proofs would yield that the mixing time of the
FK-dynamics is polynomial in n.
6.2.1 Local mixing for realizable boundary conditions
In this subsection, we prove Theorem 6.2.1. As mentioned earlier, this theorem
may be viewed as a corollary of Theorem 6.1.1, which bounds the mixing time of
the FK Glauber dynamics on thin rectangles.
Proof of Theorem 6.2.1. Let r = c0 log n. We wish to show that each of the
subsets in Br has mixing time O(nc) under the boundary conditions (1, ξ) and (0, ξ).
We begin by bounding the mixing time on the square boxes Cne, Cnw, Cse, Csw
and B(e, r) of Br. Since these have side length O(log n), a crude estimate on the
mixing time is sufficient. For instance, by (2.2.9), at a cost of exp(O(r)) = nO(1)
187
factor, we can compare the mixing time in these boxes with boundary condition
either (1, ξ) or (0, ξ) to the mixing time on equally sized boxes with free boundary
conditions. In this setting, an upper bound of O((log n)2 log log n) is known [9],
and thus we obtain an nO(1) bound for their mixing times.
It remains to bound the mixing time of the FK dynamics on the set R =
Rn ∪ Re ∪ Rw ∪ Rs. For this, we use Theorem 6.1.1. We argue that the mixing
time of the FK dynamics on R is roughly equal to that of the FK dynamics on a
[4(n− 6r)− 3]× 2r rectangle Q with a suitably chosen boundary condition. We
proceed to construct the rectangle Q and a boundary condition ξ′ whose vertices,
edges and wirings are identified with those of R and (1, ξ). The case of R and (0, ξ)
is handled later in similar fashion.
We introduce some notation first. For a rectangle S, let Sα denote the rectangle
that results from a clockwise rotation of S by an angle of amplitude α. Also, if
S1, . . . , Sk are rectangles of the same height, let [S1, . . . , Sk] denote the rectangle
obtained by identifying the vertices in ∂eSi with those in ∂wSi+1 for all i =
1, . . . , k− 1. When identifying the vertices, the double edges are removed. We take
Q =[Rπ/2w , Rn, R
−π/2e , R−πs
].
Observe that every vertex of Q, except those where the boundary overlaps occur,
correspond to exactly one vertex in R; vertices in the overlaps correspond to exactly
two vertices in R. Conversely, every vertex in R corresponds to exactly one vertex
of Q. The edges of Q and R are identified using this correspondence between the
vertices. Observe also that ∂nQ = ∂R∩ ∂Λn. We construct the boundary condition
ξ′ of Q as follows. If u, v ∈ ∂R ∩ ∂Λn are wired in ξ, the corresponding vertices are
188
also wired in ξ′. The boundary condition ξ′ is also wired along ∂wQ ∪ ∂nQ ∪ ∂eQ.
Claim 6.2.4. The boundary condition ξ′ of Q is realizable. In particular, ξ′ can
be realized by an FK configuration in the half plane of Z2 containing only vertices
north of ∂nQ, and a wiring of ∂eQ ∪ ∂sQ ∪ ∂wQ.
Finally, to completely capture the effect of the boundary condition (1, ξ) on R,
each of the three columns in Q that corresponds to overlaps of columns from R,
are externally wired.
Now, by Theorem 6.1.1 and (2.2.9), we have
gaprc(Qξ′) = n−O(1) .
We claim next that the FK Glauber dynamics on R with boundary condition
(1, ξ) has roughly the same gap as the FK Glauber dynamics on Q with boundary
condition ξ′. To see this, we add a double edge to each edge of Q that corresponds
to two edges in R. With this modification, there is now a one-to-one correspondence
between the FK configurations in R and Q. Also, adding these edges has almost no
effect on the mixing time of the FK dynamics in Q, as their endpoints are wired,
and so they only need to be updated once to mix. Moreover, by construction, the
boundary condition ξ′ for Q together with the wiring of the overlapping columns
in Q encode exactly the same connectivities as the boundary condition (1, ξ) for
R. Hence, for every pair of FK configurations on Q, the FK dynamics has the
same transition probability as FK dynamics on R between the corresponding
configurations. Consequently, we can conclude that
gaprc(R(1,ξ)) = n−O(1) .
189
Finally, for the case of the FK dynamics on R with boundary condition (0, ξ)
we can simply wire ∂wRn to ∂nRw, ∂sRw to ∂wRs, ∂eRs to ∂sRe and ∂nRe to ∂eRn,
which only incur a penalty of nO(1) by (2.2.9) and proceed as in the previous
case.
Proof of Claim 6.2.4. First note that ∂nQ corresponds to ∂R ∩ ∂Λn. Let ω be
an FK configuration on Z2 \ Λn that realizes ξ. A path from u ∈ ∂R ∩ ∂Λn to
v ∈ ∂R∩∂Λn in ω splits ∂R∩∂Λn into two parts R1 and R2, one containing all the
vertices from u to v in ∂R∩∂Λn clockwise and the other all the vertices from u to v
in ∂R∩∂Λn counterclockwise. The planarity of Z2 implies that any other boundary
component will be either completely contained in R1 or R2. From this property, it
follows that if v1, v2 ∈ ∂nQ are wired in ξ′, then Jv1, v2K is a disconnecting interval.
This implies that the connectivities of ξ′ in ∂nQ can be realized by a configuration
on the half plane of Z2 that contains all the vertices north of ∂nQ. For example,
every component C = c0, . . . , ck of ξ, with ci to the left of ci+1, can be realized
by the gadget consisting k paths of length hC starting at c0, . . . , ck and going north,
together with one path parallel to ∂nQ that joins the endpoints of all of these path.
Since Jci, ci+1K is a disconnecting interval for all i and C, we can choose hC for each
C so that the resulting configuration is a valid configuration in the half plane.
6.2.2 Moderate spatial mixing for realizable boundary con-
ditions
In this section we prove Theorem 6.2.2. We reduce the moderate spatial mixing
condition (6.0.1) to bounding the probability of certain connectivities in an FK
configuration. Specifically, if e ∈ S ⊂ Λn, the configuration on Ec(S) affects the
190
state of e when there are paths from e to the boundary of S; the probability of such
paths is maximized when we assume an all wired configuration on Ec(S). Recall
that for S ⊂ Λn, we let Sc = Λn \ S, and we use Ec(S) = E(Λn) \ E(S).
Lemma 6.2.5. Consider the FK model on Λn with arbitrary boundary condition
ξ on ∂Λn. For any e ∈ E(Λn), any S ⊂ Λn such that e ∈ E(S), and any pair of
configurations ω1, ω2 on Ec(S):
∣∣∣πξΛn( e = 1 | Ec(S) = ω1 )− πξΛn( e = 1 | Ec(S) = ω2 )∣∣∣
≤ πξΛn
(e ξ←→ ∂S \ ∂Λn | Ec(S) = 1
),
where e ξ←→ ∂S denotes the event that there is a path from e to ∂S taking into
account the connections induced by ξ.
In the proof of Theorem 6.2.2 we use this lemma; its proof via machinery from [2]
will be straightforward.
Proof of Theorem 6.2.2. We need to show that as long as c0 is large enough,
for every e ∈ E(Λn) there exists Be ∈ Br such that (6.0.1) holds for some δ <
1/(12|E(Λn)|). For each e ∈ E(Λn) the subset Be is chosen as follows:
1. If d(e, ∂Λn) > r, then Be = B(e, r);
2. Otherwise, if e ∈ R and d(e, ∂R \ ∂Λ) ≥ r, then Be = R;
3. Otherwise, e ∈ Ci for some i ∈ ne,nw, sw, se, and we take Be = Ci.
191
By Lemma 6.2.5, for every e ∈ E(Λn),
∣∣∣πξΛn( e = 1 | Ec(Be) = 1 )− πξΛn( e = 1 | Ec(Be) = 1 )∣∣∣
≤ πξΛn
(e
ξ←→ ∂Be \ ∂Λn | Ec(Be) = 1).
In all three cases above, by construction d(e, ∂Be \ ∂Λn) ≥ r. This together
with the fact that all vertices of ∂Λn within distance 5r from the corners have no
connections in ξ, implies that for e to be connected to ∂Be \ ∂Λn a path of open
edges reaching a distance at least r is required in the configuration on Be. The
EDC property (see (2.1.1)) implies that for c0 large enough
πξΛn
(e η←→ ∂Be | Ec(Be) = 1
)≤ 1
12|E(Λn)|,
and the result follows.
We conclude this section with the proof of Lemma 6.2.5.
Proof of Lemma 6.2.5. Let (1, ξ) be the boundary condition induced on S by
ξ and the event Ec(S) = 1. Similarly, let θ1 (resp., θ2) be the boundary
condition induced on S by configurations ω1 (resp., ω2) on Ec(S) and ξ. For
ease of notation set πθ1 = πξΛn(· | Ec(S) = ω1), πθ2 = πξΛn(· | Ec(S) = ω2) and
π(1,ξ) = πξΛn(· | Ec(S) = 1). For an FK configuration ω on S let
Γ(1,ξ)(ω) := S \⋃
v∈∂S\∂ΛnC(v, ω) ,
where C(v, ω) is the set of vertices in the connected component of v in ω, taking into
account the connectivities induced by (1, ξ). In words, Γ(1,ξ)(ω) is the set of vertices
192
of S not connected to ∂S \ ∂Λn in ω using possibly the boundary connections.
We claim that there exists a coupling P of the distributions πθ1 , πθ2 and π(1,ξ)
such that P(ω1, ω2, ω) > 0 only if ω1 ≤ ω and ω2 ≤ ω on E(S) and ω1, ω2 agree on
all edges with both endpoints in Γ(1,ξ)(ω). Given this coupling P, we have
∣∣πξΛn( e = 1 | Ec(S) = ω1 )− πξΛn( e = 1 | Ec(S) = ω2 )∣∣
≤ P(e /∈ E(Γ(1,ξ)(ω))
)= πξΛn
(e
ξ←→ ∂S \ ∂Λn | Ec(S) = 1),
as claimed. The construction of the coupling P is standard and is thus ommitted;
see, e.g., [2, 9] and the proof Lemma 6.1.15 for similar constructions.
6.3 Near-optimal mixing time for typical bound-
ary conditions
In this section we provide the proof of Theorem 6, where we establish a sharper
O(n2) mixing time upper bound for the FK Glauber dynamics on Λn = (Λn, E(Λn))
for the class of boundary conditions we call typical.
Definition 6.3.1. Let ω be a random-cluster configuration on Z2, and let ξω be the
boundary condition on ∂Λn induced by the edges of ω in E(Z2) \ E(Λn). Suppose
ω is sampled from πZ2,p,q. A set C of realizable boundary conditions for Λn is called
typical (with respect to (p, q)) if ξω ∈ C with probability 1− o(1).
We define classes of boundary conditions Cα and C?α, consisting of realizable
boundary conditions whose distinct boundary components consist only of vertices
193
at most distance α log n apart in ∂Λn. For any realizable boundary condition ξ
corresponding to a partition ξ1, ξ2, ... of ∂Λn, let L(ξi) be the smallest connected
subgraph of ∂Λn containing all vertices in ξi.
Definition 6.3.2. We say a boundary condition ξ on ∂Λn with corresponding
partition ξ1, ξ2, ... is in Cα if maxi |L(ξi)| ≤ α log n. We say that a realizable
boundary condition ξ is in C?α if its dual boundary condition ξ? is in Cα; see Section
2.1.2 for the definition of dual configuration. We refer to the classes Cα and C?α as
α-localized boundaries.
It is straightforward to see that if one samples a “random” boundary condition
from the infinite-volume measure πZ2,p,q, then with high probability, the induced
boundary condition on ∂Λn is α-localized for some α > 0. Since there is a unique
random-cluster measure on Z2 when p 6= pc(q), this is well-defined.
Lemma 6.3.3. For every q ≥ 1 and p < pc(q), the class of boundary conditions
Cα is typical with respect to (p, q) for sufficiently large α > 0. Similarly, for every
q ≥ 1 and p > pc(q), the class C?α is typical with respect to (p, q) for sufficiently
large α > 0.
Proof. By planar duality (namely the duality of the sets of boundary conditions
Cα and C?α, it suffices to prove the case p < pc(q)). For any u, v ∈ Z2, by the EDC
property (2.1.1), we have that for q ≥ 1 and p < pc(q) there exists c = c(p, q) > 0
such that πZ2(u ↔ v) ≤ e−cd(u,v) . Let u, v ∈ ∂Λn and suppose d(u, v) ≥ α log n.
Then, there exists some C(p, q) > 0 such that for sufficiently large α > 0,
πZ2\Λn(uZ2\Λn←→ v) = πZ2(u
Z2\Λn←→ v) ≤ πZ2(u↔ v) ≤ Ce−cα logn ≤ 1
n3,
194
where recall that uZ2\Λn←→ v denotes the event that there exists a path from u to
v in Z2 \ Λn. A union bound over all pairs of vertices in ∂Λn implies that if ω is
sampled from πZ2 and ξω is the resulting boundary condition on ∂Λn, then ξω ∈ Cα
with probability 1− o(1) and thus Cα is typical.
Remark 6.3.4. One may also be interested in the following notion of typicality,
which sometimes comes up in recursive mixing time upper bounds. Let q ≥ 1 and
p < pc(q) (resp., p > pc(q)) and consider a random-cluster sample from πζR2n,p,q,
where R2n is the concentric box of side length 2n containing Λn with arbitrary
boundary condition ζ. One could easily show that the boundary condition induced
by the configuration on R2n \Λn is in Cα (resp., C?α) with probability 1− o(1). This
would follow by coupling this measure to the infinite-volume measure using the fact
that Cα is a decreasing event, and finding a dual circuit in the annulus R2n \ Λn
(which exists with probability 1−O(e−Ω(n))).
Theorem 6 follows immediately from the following theorem proving near-optimal
mixing for α-localized boundary conditions, together with Lemma 6.3.3.
Theorem 6.3.5. For every q > 1, p < pc(q) (resp., p > pc(q)), and every α > 0,
there exists a constant C > 0 such that for every realizable boundary condition
ξ ∈ Cα (resp., ξ ∈ C?α) on ∂Λn, the mixing time of the discrete-time FK Glauber
dynamics on the n× n box Λn with boundary condition ξ is O(n2(log n)C).
In particular, we prove Theorem 6.3.5 in the regime p < pc(q) and ξ ∈ Cα and
this translates to a matching bound at p > pc(q) and ξ ∈ C?α by duality. To prove
this theorem we again use the general framework from Theorem 6.0.5. Namely,
we construct a collection of subsets of Λn for which we can establish MSM and
LM; see Definitions 6.0.1 and 6.0.3. The fact that ξ ∈ Cα will allow us to prove
195
e
∂Λ
B(e, r′)
B(e, r)
· · · · · ·
Figure 6.6: If r = Θ((log n)2) and r′ = Θ(log n), influence from outside of B(e, r′)may be easily propagated to e through long boundary connections in B(e, r′); butto propagate influence from the exterior of B(e, r), Ω(log n) of them would have tobe connected in Λn.
MSM with respect to Θ((log n)2)×Θ((log n)2) rectangles along the boundary, and
Theorem 5 will provide the LM estimate on these rectangles.
Consider the collection Br = B(e, r) : e ∈ E(Λn). Recall that for r ≥ 0 and
e ∈ E(Λn), we set B(e, r) ⊂ Λn to be the set of vertices in the minimal square box
around e such that d(e,Λn \B(e, r)) ≥ r; see Figure 6.5(c). We first show that
MSM holds for Br and ξ ∈ Cα when r = Θ((log n)2) and δ < n−3.
Lemma 6.3.6. Let q ≥ 1, p < pc(q), α > 0, η ∈ Cα, r = c0(log n)2 and B =
B(e, r) : e ∈ E(Λn). For large enough c0 > 0, MSM holds for η, B for some
δ < n−3.
For this lemma, it is crucial that r = Θ((log n)2), as MSM does not hold for typical
boundary conditions for Br when, for example, r = Θ(log n). This is because in a
typical configuration ω on Z2 \Λn it is likely that there exist pairs of vertices of ∂Λn
at distance γ log n, for a suitably small constant γ > 0, that are connected in ω.
Thus, for some e ∈ E(Λn) close to ∂Λn, it is possible for the configuration outside of
B(e, r) to exert a strong influence on the state of e when r = γ′ log n with constant
γ′ > 0, even if γ′ γ; the presence of a constant number of open edges on (or near)
∂Λn would propagate the influence from Λn \B(e, r) to e. Taking r = Ω((log n)2)
196
avoids this issue, since, roughly speaking, Ω(log n) open edges at specific points in
∂Λn would now be required to propagate the influence from Λn \B(e, r) to e; see
Figure 6.6(b). The proof of Lemma 6.3.6 is provided in Section 6.3.1.
The final ingredient in the proof of Theorem 6.3.5 is a LM estimate for Br with
r = Θ((log n)2). Such an estimate is readily provided by Theorem 5, with mixing
time that is poly-logarithmic in n.
Proof of Theorem 6.3.5. Let α > 0 be sufficiently large and let η ∈ Cα. By
Lemma 6.3.6, MSM holds for η and Br with r = Θ((log n)2) for some δ < n−3.
Observe also that every B(e, r) ∈ Br with boundary condition (1, η) or (0, η) is
a rectangle of side–length at most r = O((log n)2) with a realizable boundary
condition. Then by Theorem 5 (see also Remark 6.2.3), for every e ∈ E(Λn) we
have
maxtmix(B(e, r)0,η), tmix(B(e, r)1,η) ≤ (log n)C ,
for a suitable C > 0, yielding the desired LM estimate. The result then follows
from Theorem 6.0.5.
6.3.1 Moderate spatial mixing for Cα
We now prove Lemma 6.3.6. The proof involves showing that if ξ ∈ Cα, when
p < pc(q), the correlation between edges e, e′ ∈ E(Λn) near the boundary decays
exponentially in d(e, e′)/(α log n)—whereas SSM would entail a decay rate that is
exponential in just d(e, e′).
Proof of Lemma 6.3.6. Fix an edge e ∈ E(Λn) and for ease of notation let
197
B = B(e, r) ⊂ Λn and πη = πηΛn,p,q. Let (1, η) be the boundary condition induced
on B by η and the event Ec(B) = 1.
Lemma 6.2.5 implies that for every pair of configurations ω1, ω2 on Ec(B),
|πη( e = 1 | Ec(B) = ω1 )− πη( e = 1 | Ec(B) = ω2 )| ≤ π(1,η)(e η←→ ∂B \ ∂Λn
)(6.3.1)
where e η←→ ∂B \ ∂Λn denotes the event that there is a path from e to ∂B \ ∂Λn
taking into account the connections induced by η. Thus, it is sufficient to bound
the right-hand-side of (6.3.1).
There are three cases corresponding to the location of e in Λn. First, if
d(e, ∂Λn) > r, then B ∩ ∂Λn = ∅ and (1, η) is just the wired boundary con-
dition on B. In this case the right-hand-side of (6.3.1) is at most n−3 by the EDC
property; see (2.1.1).
The second and third cases correspond to whether B intersects one or two sides
of ∂Λn. For the second case, assume without loss of generality that B intersects
∂nΛn but not ∂wΛn or ∂eΛn. That is, d(e, ∂nΛn) ≤ r, but e is at distance
at least r from ∂wΛn and ∂eΛn. Let ∂wB, ∂sB, ∂eB be the west, south and east
boundaries of B, all of which are wired in ω. By a union bound
π(1,η)(e η←→ ∂B \ ∂Λn
)≤ π(1,η)
(e η←→ ∂wB
)+ π(1,η)
(e η←→ ∂eB
)+ π(1,η)
(e η←→ ∂sB
),
where e η←→ ∂wB denotes the event that there is a path from e to ∂wB in
B, taking into account those connections inherited from η (and ignoring the
connections induced by the wired configuration on Ec(B)). Define e η←→ ∂eB
198
and e η←→ ∂sB similarly.
The event e η←→ ∂sB implies that there exists a path of length at least r,
either from e or from ∂Λn to ∂sB. Therefore, the EDC property ((2.1.1)) implies
that for large enough n,
π(1,η)(e η←→ ∂sB
)≤ 1
3n3.
We bound next π(1,η)(e η←→ ∂wB). Let η0, . . . , ηd be the boundary com-
ponents of η. Since η is realizable, the planarity of Z2 implies that for every
i, j ∈ 0, . . . , d there are only three possibilities: L(ηi) ∩ L(ηj) = ∅, L(ηi) ⊂ L(ηj)
or L(ηj) ⊂ L(ηi). (Recall that L(ηi) ⊂ ∂Λn is the path of minimum length
that contains all the vertices in ηi.) Call ηi a maximal boundary component if
@j ∈ 0, . . . , d such that L(ηi) ⊂ L(ηj). The set of all maximal boundary compo-
nents defines a partition for ∂Λn. Since also η ∈ Cα, we deduce that there exists
a sequence of edges e0 = u0, v0, e1, . . . , ek = uk, vk in B ∩ ∂Λn such that 1)
(γ+α) log n ≥ d(ei, ei+1) ≥ γ log n for all i = 0, . . . , k−1, where γ is a large con-
stant we choose later and k ≥ c02(γ+α)
log n, and 2) the set Si = Jvi, ui+1K ⊂ B ∩ ∂Λn
is a disconnecting interval.
Let Ei be the event that Si is connected to Si+1 by a path of open edges in B.
Let et be the closest edge in the sequence e0, e1, . . . , ek to e and let Et = ∩ti=0Ei.
Since d(e, ∂wB) ≥ r, we also have t ≥ c08(γ+α)
log n. Then,
π(1,η)(e η←→ ∂wB
)≤ π(1,η)
(e η←→ ∂wB | Ect
)+ π(1,η)
(Et). (6.3.2)
If the event Ect occurs, then there exists i < t such that Si is not connected to Si+1
in B. This implies that there is a dual-path of length at least γ log n separating
199
Si from Si+1. Consequently, a path from e to ∂wB would require two vertices at
distance at least γ log n to be connected by a path of open edges in B. By the EDC
property and a union bound, this has probability at most 1/(9n3) for large enough
γ. Thus,
π(1,η)(e η←→ ∂wB | Ect
)≤ 1
9n3. (6.3.3)
We bound next π(1,η)(Et). Let ui, vi denote the endpoints of the edge ei, where
ui is to the left of vi for all i. For 1 ≤ i ≤ t, consider the rectangle Qi ⊂ B with
corners at vi−1, ui+3 and the other two corners on ∂sB. Then,
π(1,η)(Et) = π(1,η)(E0, . . . , Et) ≤ π(1,η)(E1, E5, E9 . . . , E`), (6.3.4)
where t − 4 < ` ≤ t. Now, let E ′i be the event that Si is connected to Si+1 by a
path completely contained in Qi. We have
π(1,η)(E1, E5, E9 . . . , E`) = π(1,η)(E ′1, E5, E9 . . . , E`) + π(1,η)(E1 ∩ (E ′1)c, E5, E9 . . . , E`)
≤ π(1,η)(E ′1, E5, E9 . . . , E`) +1
n4,
where the last inequality follows from the fact that for the event E1 ∩ (E ′1)c to occur
there have to be two vertices at distance at least γ log n connected by a path in
B; by the EDC property this only occurs with probability at most n−4 for large γ.
Iterating this procedure for E5, E9, . . . we get
π(1,η)(E1, E5, E9 . . . , E`) ≤ π(1,η)(E ′1, E ′5, E ′9 . . . , E ′`) +`
n4. (6.3.5)
200
Let Q =⋃(`−1)/4i=0 Q4i+1. Monotonicity implies that
π(1,η)(E ′1, E ′5, E ′9 . . . , E ′`) ≤ π1(E ′1, E ′5, E ′9 . . . , E ′` | Ec(Q) = 1)
=
`−14∏i=0
π1(E ′4i+1 | Ec(Q) = 1),
where for the last equality we use that the events E ′1, E ′5, . . . , E ′` are independent
under the wired boundary condition when also conditioning on Ec(Q) = 1 . We
claim that there exists a constant ρ ∈ (0, 1) (independent of n) such that for all
i = 0, . . . , `−14
π1(E ′4i+1 | Ec(Q) = 1) ≤ 1− ρ . (6.3.6)
To see this, let j = 4i+ 1 and note that for for large enough D > 0, by the EDC
property and a union bound imply that there is no connection between any pair of
vertices (u, v) with u ∈ Sj and v ∈ Sj+1 and d(u, v) ≥ D with probability Ω(1). At
the same time, by forcing an adjacent O(D) edges to be closed at a cost of e−O(D),
we see that with Ω(1) probability, in fact no other pairs (u, v) with d(u, v) ≤ D are
connected either. (6.3.6) holds. Thus,
π(1,η)(E ′1, E ′5, E ′9 . . . , E ′`) ≤ (1− ρ)`+3
4 ≤ (1− ρ)t−1
4 ≤ 1
9n3, (6.3.7)
where the last inequality holds for sufficiently large c0 since t ≥ c08(γ+α)
log n.
Putting (6.3.7), (6.3.5), (6.3.3), (6.3.4) and (6.3.2) together we get
π(1,η)(e η←→ ∂wB
)≤ 2
9n3+
`
n4≤ 1
3n3,
201
since ` = O(log n). Analogously, we get π(1,η)(e η←→ ∂eB) ≤ 13n3 , and thus
π(1,η)(e η←→ ∂B
)≤ 1
n3.
Finally for the third case, suppose without loss of generality that B intersects
∂nΛn and ∂wΛn, but not ∂eΛn or ∂sΛn. A union bound implies that
π(1,η)(e η←→ ∂B \ ∂Λ
)≤ π(1,η)
(e η←→ ∂sB
)+ π(1,η)
(e η←→ ∂eB
),
(6.3.8)
and each term in the right-hand side of (6.3.8) can bounded in the same way as
π(1,ξ)( e η←→ ∂wB ) in the second case; thus, the result follows.
6.4 A canonical paths bound with realizable FK
boundary conditions
As explained in §2.2.2, a classical and very fruitful bound using the canonical
paths method demonstrates that the Ising/Potts models on Λn,l have a mixing time
that is at most exponential in minn, l (for every β, q and boundary conditions):
see [28, 29, 53, 86]. However, FK boundary conditions on Λn,l can significantly
distort the cutwidth of the underlying graph encoding the interactions. Indeed
the lower bound in Theorem 7 shows that one could not possibly hope for such a
canonical paths bound in the presence of arbitrary FK boundary. Nonetheless, using
the nesting structure of disconnecting intervals of realizable boundary conditions,
we are able to prove a (weaker) bound that is exponential in (n ∨ l) log(n ∧ l) but
holds at every value of parameters p, q (including q < 1 as well as the critical p).
202
Unfortunately, this bound requires that the boundary conditions on one of the long
sides be either wired or free. The results in this section were not presented [8] but
are based on joint work with Blanca and Vigoda.
Theorem 6.4.1. Consider Λn,l = J0, nK × J0, lK for l ≤ n with realizable FK
boundary conditions (0, ξ) that are free on ∂sΛn,l and ξ on⋃i∈n,e,w ∂iΛn,l. Then for
every p ∈ (0, 1) and every q > 0, there exists c(p, q) > 0 such that
gap−1rc (Λ
(1,ξ)n,l ) . exp(c l log n) .
The same holds under (1, ξ) boundary conditions, denoting wired on ∂sΛn,l.
By duality, through Remark 2.2.1, it suffices to show the case of (0, ξ).
6.4.1 Tree structure of realizable boundary conditions
Recall the nesting properties of boundary bridges from Section 3.2 and of
disconnecting intervals from Section 6.1.2. In this section, we construct a tree Tξ of
rectangular subsets of Λn,l, corresponding to some realizable boundary condition ξ
on ∂nΛn,l, in order to encode the nested structure of the disconnecting intervals
of ξ. This will allow us to inductively bound the cutwidth of the graph Λ with
boundary conditions ξ.
Constructing the tree Tξ.
Recall that a rectangular subset of Λ = Λn,l is a subset of the form Ja1, a2K×J0, nK
for 0 ≤ a1 < a2 ≤ n; moreover, each rectangular subset is naturally identified
with the interval Ja1, a2K so that we call Ja1, a2K× J0, lK a disconnecting interval if
Ja1, a2K is one. Begin by identifying the root of the tree Tξ with Λ = J0, nK× J0, lK.
203
We define Tξ inductively from here. For any rectangular subset corresponding to a
vertex of Tξ, we describe how to assign its children rectangular subsets.
Consider a vertex w ∈ Tξ corresponding to rectangular subset Ja1, a2K× J0, lK.
If a2 = a1 + 1, then w will be a leaf of Tξ (and will have no children in Tξ). If
a2 > a1 + 1, then either w is the root of Tξ, or Ja1, a2K is a disconnecting interval
of wired-type, so that a1, a2 are in some boundary component Ci, and moreover
there will not exist any x ∈ (a1, a2) with (x, l) ∈ Ci. In this latter case, we define
the children of w in Tξ as follows:
1. Label the boundary components that are directly nested in Ci (i.e., not
counting those that are nested in a component Cj that is nested in Ci)
Ci1 , ..., Cik for some k. (Notice that singleton vertices with no bridges over
them before Ci count.)
2. Label the x–values of the vertices in⋃j Cij in increasing order by x1, ..., xK ,
with x1 = a1 + 1 and xK = a2 − 1, and set x0 = a1, xK+1 = a2.
3. The node w ∈ Tξ will have K + 1 children identified with the rectangular
subsets
Jxi, xi+1K× J0, lK for 0 ≤ i ≤ K .
Remark 6.4.2 (Well-definedness of Tξ). It should be clear from the above con-
struction that each of the children of w is either of the form xi+1 = xi + 1, or it is
a disconnecting interval of wired-type, with the property that xi and xi+1 are in
the same boundary component and there do not exist any x ∈ (xi, xi+1) with (x, l)
in that boundary component. As such, the above construction is well-defined.
204
Remark 6.4.3 (Properties of Tξ). With the well-definedness in hand, one can see
the following properties of Tξ. The tree Tξ has depth at most n, and there are
exactly n leaves corresponding to Ji, i+ 1K× J0, lK for all 0 ≤ i ≤ n− 1. Moreover,
for any node w ∈ Tξ, the disconnecting intervals of its children are strict subsets
of its disconnecting interval, and the union of the intervals corresponding to its
children will be exactly the interval corresponding to w.
Labeling the tree Tξ. For a given realizable boundary condition ξ on ∂nΛ, let Tξ
be the corresponding tree defined above. For a node w ∈ Tξ, we will let Rw denote
the corresponding rectangular subset. Moreover, if Rw = Ja, bK× J0, lK we recall its
width W (w) = W (Rw) = b− a, so that w is a leaf of Tξ if and only if W (w) = 1,
and the width of a child is strictly less than that of its parent.
It will also be important for us to have a particular ordering on the children of
a node w ∈ Tξ. We order the children so that the first child has the biggest width,
the next children are the neighboring rectangular regions to this first child, and
then continuing outwards.
Namely, for every w ∈ Tξ, label its children (w(i))i in any way that W (w(1)) ≥
W (w(j)) for every j, and such that⋃j≤k Rw(j) is a connected rectangular subset of
Λ for every k.
Remark 6.4.4. Notice crucially that any any such rectangular subset of the
form⋃j≤k Rw(j) can have at most two northern boundary components connect to
∂nΛ \⋃j≤k Rw(j) in ξ, and they must be through its two northern corner vertices.
205
6.4.2 Bounding the Cutwidth of Λn,l
Definition 6.4.5. For a fixed realizable boundary condition ξ on ∂Λn,l and any
enumeration of the edges in Λn,l as e1, ..., e|E(Λ)| identified with the bijection γ :
1, ..., |E(Λ)| → E(Λ), we can define
CWΛ,γ = maxk≤|E(Λ)|
|∂outγ(1), ..., γ(k)| ,
where for a set E(A) ⊂ E(Λ), |∂outA| is the number of vertices in ∂A that are
incident an edge in A and incident an edge in E(Λ) \ E(A) (where a boundary
vertex is incident to all the edges of E(Λ) adjacent to any vertex in its boundary
component in ξ). The cutwidth of Λ is given by
CWΛ = minγCWΛ,γ ,
where the minimum is over all bijections γ : 1, ..., |E(Λ)| → E(Λ).
For a rectangular subset R ⊂ Λ we define its cutwidth CWR with respect to
Λ by taking the minimum over bijections γ : 1, ..., |E(R)| → E(R) but defining
∂out as before, w.r.t. Λ.
Lemma 6.4.6. Consider Λ = Λn,l with boundary conditions (0, ξ) given by arbitrary
realizable boundary conditions ξ on ∂nΛ that is free on⋃i∈s,e,w ∂iΛ. For every
w ∈ Tξ, the rectangular subset Rw satisfies
CWRw ≤ CWRw(1)∨ (max
j≥2CWR
w(j)+ 2l + 2) ∨ (2l + 2) .
Corollary 6.4.7. Consider Λ with (0, ξ) boundary conditions given by realizable
boundary conditions ξ on ∂nΛ and free elsewhere. There exists C > 0 (independent
206
of ξ) such that
CWΛ ≤ Cl log n .
Proof. We will repeatedly apply Lemma 6.4.6 to bound CWRw along the tree Tξ.
We begin with the leaves of Tξ, each of which are single–column rectangular subsets.
For any leaf w ∈ Tξ, it has CWRw ≤ 2l + 2 (simply enumerate the edges in E(Rw)
in lexicographic order of their midpoints).
Then we can bound CWΛ by the maximum over all root–to–leaf paths of Tξ of
repeated application of Lemma 6.4.6 along those paths. More precisely, let a path
w = (w0, w1, ..., wf ) be a sequence of nodes in Tξ such that w0 is the root of Tξ, wf
is a leaf of Tξ and for every i, wi is a child of wi−1. Then applying Lemma 6.4.6 to
wf−1, ..., w0 and using that CWRwf≤ 2l + 2 and Rw0 = Λ, we see that
CWΛ ≤ (2l + 2) + maxw=(w0,...,wf )
∑1≤i≤f
(2l + 2) · 1wi 6= w(1)i−1 .
This is to say that when going down the path w there is no additive gain in the
cutwidth when the child of a node w is the child of largest width, and an additive
gain of at most 2l + 2 otherwise.
Now notice that for every node w ∈ Tξ and j ≥ 2, deterministically, W (w(j)) ≤12W (w). Combining this with the trivial fact that W (w) ≥ 1 for every w ∈ Tξ, we
see that along every root–to–leaf path w, at most log2 n of summands above are
nonzero. As a result we have
CWΛ ≤ (2l + 2)(1 + log2 n) ,
207
yielding the desired.
Proof of Lemma 6.4.6. Fix a w ∈ Tξ. We begin by defining an inductive algo-
rithm to choose
γ : 1, ..., |E(Rw)| → E(Rw)
given the enumerations γj : 1, ..., |E(Rw(j))| → E(Rw(j)) that attained the
cutwidth, i.e., CWRw(j)
= CWRw(j) ,γj . (Of course for any choice of γ, CWRw ≤
CWRw,γ.) We define the enumeration γ of the edges in E(Rw) as follows.
1. Recall the labeling of the children of w ∈ Tξ such that W (w(1)) ≥ W (w(j))
for every j and⋃j≤k Rw(j) is a connected rectangular subset.
2. For m ∈ 1, ..., |E(Rw(1))|, set γ(m) = γ1(m).
3. For every k ≥ 2, for every m ∈ |E(⋃j≤k−1Rw(j))| + 1, ..., |E(
⋃j≤k Rw(j))|
follow the ordering on edges in E(Rw(k)) given by γk, skipping over all edges
that are in E(⋃j≤k−1Rw(j)) and have thus already been enumerated.
We claim that this enumeration gives the desired inequality on CWRw,γ.
The idea here is that at any step k in the enumeration γ, the quantity
|∂out(⋃j≤k γ(j))| is at most the cutwidth of the child to which the most recently
enumerated edge belongs, plus the contribution from the children that have already
been fully enumerated. For the children that have already been enumerated, by
Remark 6.4.4, their contribution is at most 2l + 2. While enumerating the first
child (w(1)), this additional contribution is not present. We now make this intuition
precise.
First of all, if w is a leaf of Tξ, as noted earlier CWw ≤ 2l + 2 because Rw must
be single-column rectangular subset. Now assume that w is not a leaf of Tξ and
208
that it has children labeled w(1), w(2), ....
We consider the maximal cutwidth through the indices 1, ..., |E(Rw(1))| first,
then consider the maximal cutwidth in the rest of the enumeration separately.
Throughout the enumeration by indices m ∈ 1, ..., |E(Rw(1))|, by definition
of CWRw(1)
the maximal size of |∂out(⋃j≤m γ(j))| will be exactly the cutwidth
CWRw(1)
. Now, let’s consider the largest |∂out(⋃j≤m γ(j))| gets through indices
m ∈ |E(⋃j≤k−1Rw(j))|+ 1, ..., |E(
⋃j≤k Rw(j))| for any k ≥ 2. For all such m, we
have
|∂out(⋃j≤m
γ(j))| ≤ CWRw(k)
+ |∂out(⋃
j≤k−1
Rw(j))| ,
and therefore,
CWRw,γ ≤ CWRw(1)∨(
maxk≥2
CTWRw(k)
+∣∣∣∂out( ⋃
j≤k−1
Rw(k)
)∣∣∣) .By Remark 6.4.4, the contribution of
⋃j≤k−1Rw(j) is at most 2l+ 2 for every k ≥ 2,
concluding the proof.
6.4.3 From cutwidth to spectral gap
We now use the classical canonical paths approach used for short range spin-
systems to translate Corollary 6.4.7 into a proof of Theorem 6.4.1. Recall the
canonical paths bound on the inverse gap given by Theorem 2.2.9.
Proof of Theorem 6.4.1. We first move from boundary conditions that are ar-
bitrary realizable boundary conditions on⋃i∈n,e,w ∂iΛ to those that are arbitrary
realizable on ∂nΛ and free on ∂eΛ∪ ∂wΛ, by (2.2.9), absorbing a multiplicative cost
209
of nl · ecl in the spectral gap.
Now fix any realizable boundary condition ξ on ∂nΛ, let γ be an enumeration
of E(Λ) satisfying CWΛ,γ ≤ Cl log n (whose existence is guaranteed by Corollary
6.4.7) and label the edges E(Λ) according to γ as e1, ..., e|E(Λ)|. For any two FK
configurations ω, ω′ ∈ Ωrc on Λ, we define a path ζ(ω, ω′) in Ωrc as follows: let
el1 , ..., elk ∈ E(Λ) be the sequence of edges on which ω(eli) 6= ω′(eli), labeled in
the order dictated by γ. Then for every i, the i’th edge of ζ will be between the
configuration
ηi(ej) =
ω′(ej) for j ≤ li − 1
ω(ej) for j ≥ li
,
and its neighbor in Ω, η′i which instead has η′(eli) = ω′(eli). At the same time, we
can define
ηi(ej) =
ω(ej) for j ≤ li − 1
ω′(ej) for j ≥ li
.
Then, using the definition of π, it is clear that for every i,
πξ(ω)πξ(ω′) ≤ πξ(ηi)πξ(ηi)q
2·CWΛ,γ
as (recalling that k(ω; ξ) is the number of connected components using boundary
conditions ξ in configuration ω)
|k(ηi; ξ) + k(ηi; ξ)− k(ω; ξ)− k(ω′; ξ)| ≤|∂outel1 , ..., eli−1|+ |∂outeli , ..., e|E(Λ)|| ,
210
which is in turn at most 2|∂oute1, ..., eli−1|. By construction, for every i, the map
(ω, ω′) 7→ (ηi, ηi) is injective. Moreover, the probability of FK dynamics making
any transition in Ωrc is bounded below by some ρ(p, q) > 0. Plugging these in to
Theorem 8.0.6, we obtain for some C > 0,
gap−1rc ≤ max
η,η′∈Ω
1
ρπ(η)
∑ω,ω′:(η,η′)∈ζω,ω′
nlπ(η)π(η)q2·CWΛ,γ
≤ ρ−1 · nl · qCl logn
which for all l, n sufficiently large satisfies the desired for a different C.
6.5 Slow mixing under worst-case boundary con-
ditions
In this section we show that there are (non-realizable) boundary conditions for
the graph (Λn, E(Λn)) for which the FK-dynamics requires exponentially many
steps to converge to stationarity. In particular, we prove Theorem 7 from the
introduction.
Theorem 7 is a corollary of a more general theorem we establish. This general
theorem enables the transferring of mixing time lower bounds for the FK-dynamics
on arbitrary graphs to mixing time lower bounds for the FK-dynamics on Λn, for
suitably chosen boundary conditions. The high level idea is that any graph G with
fewer than n4
edges can be “embedded” into a subset L of the boundary ∂nΛn of
Λn as a boundary condition we shall denote ξ(G). When p is sufficiently small, the
effect of the configuration on Λn \ L becomes negligible, and so the mixing time of
211
the FK-dynamics on Λn with boundary condition ξ(G) is primarily dictated by its
restriction to the embedded graph G.
We show first how to embed any graph G = (VG, EG) into a subset L ⊂ ∂nΛn.
For m ≤ bn/4c, let
L = L(m) = J4i, 4i+ 1K : i = 0, ...,m− 1 × n ⊂ ∂nΛn
with edge set E(L) consisting of all edges in E(Λn) connecting vertices in L.
Definition 6.5.1. Let G = (VG, EG) be a graph with |EG| = m for m ≤ bn/4c
and let L be as above. We say a function φ : L→ VG is an embedding of G into
(L,E(L)) if for every u, v ∈ EG there exists a unique pair x ∈ φ−1(u) ⊆ L and
y ∈ φ−1(v) ⊆ L, where φ−1(u) and φ−1(v) denotes the pre-image sets for u and v
respectively, such that x, y ∈ E(Λn).
Notice that every graph G on m ≤ bn/4c edges can be embedded into L by
identifying each edge in EG with an edge in E(L).
Fact 6.5.2. For every graph G = (VG, EG) with m ≤ bn4c edges, there exists an
embedding of G into (L,E(L)).
Now let ξ(G) be the boundary condition on ∂Λn defined by the partition:
v : v ∈ ∂Λn \ L ∪ φ−1(v) : v ∈ VG .
In words, ξ(G) is the boundary condition that is free everywhere except in the
vertices of L and where all the vertices in L that are mapped by φ to the same
vertex of G are wired in ξ(G). We are now ready to state our main comparison
result from which Theorem 7 follows straightforwardly.
212
Theorem 6.5.3. Let G = (VG, EG) be a graph and suppose there exist q > 2 and
p = λ|EG|−α with λ > 0, α > 1/3 such that gap(G) ≤ exp(−Ω(|VG|)). Then, as
long as n ≥ 4|EG| ≥ εn for some ε > 0, with the same choice of p and q, we have
gap(Λξ(G)n ) ≤ e−Ω(|VG|) .
Proof of Theorem 7. It was established in [44] that for every q > 2 and every `
sufficiently large, there exists an interval (λs(q), λS(q)) such that if p′ = λ′/` with
λ′ ∈ (λs(q), λS(q)), then the spectral gap at parameters (p′, q) satisfies
gap(K`) ≤ exp(−Ω(`)) ,
where K` denotes the complete graph on ` vertices. Therefore, for a fixed p = λn−α
there exists a choice of ` = Θ(nα) such that at parameters (p, q), gap(K`) ≤
exp(−Ω(`)). Since the number of edges in K` is Θ(n2α), the result follows from
Theorem 6.5.3 and (2.2.3).
6.5.1 Main comparison inequality: proof of Theorem 6.5.3
We now turn to the proof of Theorem 6.5.3. A standard tool for bounding
spectral gaps is construction of bottleneck sets with small conductance; see Sec-
tion 2.2.1. It will be easier to do so for the following modified heat-bath (MHB)
dynamics, allowing us to isolate moves on E(L), where we have embedded G, from
those in Ec(L) = E(Λn) \ E(L).
Definition 6.5.4. Given an FK configuration Xt, one step of the MHB chain is
given by:
213
1. Pick e ∈ E(Λn) uniformly at random;
2. If both endpoints of e lie in L, then perform a heat-bath update on e. That is,
replace the configuration in e with a sample from πξΛn,p,q(· | Xt(E(Λn) \ e));
3. Otherwise, replace the configuration in E(Λn) \ E(L) with a sample from
πξΛn,p,q(· | Xt(E(L))).
The MHB chain is clearly reversible w.r.t. πξΛn,p,q.
Let gapmhb(Λξn) denote the spectral gap of the MHB dynamics on Λn with
boundary condition ξ and parameters p and q. The following comparison inequality
allows us to focus on finding upper bounds for the spectral gap of the MHB
dynamics; its proof is deferred to Section 6.5.2.
Lemma 6.5.5. For all p ∈ (0, 1), q > 0, n ∈ N and boundary condition ξ for Λn,
we have
gap(Λξn) ≤ gapmhb(Λξ
n) .
With this in hand, we are now ready to prove Theorem 6.5.3.
Proof of Theorem 6.5.3. Recall from (2.2.15) that since, by assumption, the
FK-dynamics on G has gaprc(G) ≤ exp(−Ω(|VG|)), there must exist S? ⊂ ΩG (the
set of FK configurations on G) with πG(S?) ≤ 12
such that
Φ(S?) =QG(S?, S
c?)
πG(S?)≤ e−Ω(|VG|) . (6.5.1)
Here QG is the edge measure (2.2.14) of the FK-dynamics on G and πG = πG,p,q
denotes the random-cluster measure on G. We will construct from this set S?, a
214
set A? ⊂ Ω, such that :
Φ(A?) =Qmhb(A?, A
c?)
πξ(G)(A?)≤ e−Ω(|VG|) , and
Φ(Ac?) =Qmhb(Ac?, A?)
πξ(G)(Ac?)≤ e−Ω(|VG|) , (6.5.2)
where Qmhb denotes the edge measure (2.2.13) of the MHB dynamics on Λξ(G)n
and πξ(G) = πξ(G)Λn,p,q
. This implies Theorem 6.5.3 by combining it with (2.2.3)
and (2.2.15).
Let ξ1, . . . , ξk be the partition of L induced by ξ(G). For an FK configuration
ω on Ec(L), we say that ξiω←→ ξj if there is an open path in ω from a vertex in ξi
to a vertex in ξj. Let Sξ(G)(ω) be the set
Sξ(G)(ω) = ξi ∈ ξ(G) : ξiω←→ ξj for some j 6= i, j ∈ 1, . . . , k ,
i.e., those ξi that are connected to some ξj in ω. For M ≥ 0, let
Rξ(G)(M) = ω ∈ 0, 1Ec(L) : |Sξ(G)(ω)| ≤M .
In words, Rξ(G)(M) is the set of FK configurations on E(Λn) \ E(L) that connect
at most M elements of the partition ξ1, . . . , ξk of the vertex set L.
Observe that any configuration θ on E(L) corresponds to a configuration on EG.
Namely, if θ(u, v) = 1, then the edge φ(u), φ(v) is open in the configuration on
G, where φ is the embedding of G into L. With a slight abuse of notation we may
use θ also for the corresponding configuration on ΩG. With this convention, let
AM = ω ∈ Ω : ω(E(L)) ∈ S? , ω(Ec(L)) ∈ Rξ(G)(M) . (6.5.3)
215
We show that if M = δ|VG| for some δ > 0 sufficiently small, and n is taken to
be large enough, we can take A? = AM . This will follow from the following two
claims.
Claim 6.5.6. The following are true of S? and AM defined above:
(i) πξ(G)(AM) ≥ q−M(1− e−Ω(M))πG(S?) ;
(ii) πξ(G)(AcM) ≥ e−O(M) .
Claim 6.5.7. The modified heat-bath dynamics satisfies
Qmhb(AM , AcM) ≤ πξ(G)(AM)e−Ω(M logn) +
q2M+1
pQG(S?, S
c?) .
Dividing the bound from Claim 6.5.7 by πξ(G)(AM) and using the bounds from
Claim 6.5.6, we see that
Qmhb(AM , AcM)
πξ(G)(AM)≤ e−Ω(M logn) +
2q3M+1
p
QG(S?, Sc?)
πG(S?)≤ e−Ω(M logn) + eO(M)e−Ω(|VG|) .
for sufficiently large M , where the last inequality follows from (6.5.1) and the facts
that M = δ|VG| and p ≥ λ|VG|−2α. Similarly, we get
Qmhb(AM , AcM)
πξ(G)(AcM)≤ e−Ω(M logn) + eO(M)e−Ω(|VG|) .
Then, since M = δ|VG|, for some δ > 0 sufficiently small we obtain (6.5.2).
6.5.2 Proof of auxiliary facts
In this section we provide the proofs of Lemma 6.5.5, and Claims 6.5.6 and 6.5.7.
216
Proof of Lemma 6.5.5. For any B ⊂ E(Λn), let PB be the transition matrix
corresponding to a heat-bath update on the entire set B. For e ∈ E(Λn), we use
Pe for Pe. Let Phb and Pmhb be the transition matrices for the FK-dynamics and
the MHB dynamics on Λξn, respectively. Let A = E(Λn) \ E(L). Then,
Pmhb =1
|E(Λn)|
( ∑e∈E(L)
Pe +∑e∈A
PA
).
For ease of notation, set π = πξΛn,p,q. Then, for any f, g ∈ R|Ω|, where Ω denotes
the set of FK configurations on Λn, let
〈f, g〉π =∑ω∈Ω
f(ω)g(ω)π(ω) .
If we endow R|Ω| with the inner product 〈·, ·〉π, we obtain a Hilbert space denoted
L2(π) = (R|Ω|, 〈·, ·〉π). Recall, that if a matrix P is reversible w.r.t. π, it defines
a self-adjoint operator from L2(π) to L2(π) via matrix vector multiplication, and
thus 〈f, Pg〉π = 〈P ∗f, g〉π = 〈Pf, g〉π.
Now, the matrix Pmhb is positive semidefinite, since it is an average of pos-
itive semidefinite matrices. Hence, it is a standard fact (see, e.g., [60]) that if
〈f, Pmhbf〉π ≤ 〈f, Phbf〉π for all f ∈ R|Ω|, then gapmhb(Λξn) ≥ gaprc(Λξ
n). To show
this, note that for e ∈ A, we have PA = PePAPe. Thus, for all f ∈ R|Ω|,
〈f, PAf〉π = 〈f, PePAPef〉π = 〈P ∗e f, PAPef〉π = 〈Pef, PAPef〉π
≤ 〈Pef, Pef〉π = 〈f, Pef〉π ,
where we used that Pe = P ∗e , Pe = P 2e and that 〈f, Pf〉π ≤ 〈f, f〉π for every
217
f ∈ R|Ω| and every matrix P reversible w.r.t. π. Then,
〈f, Pmhbf〉π ≤1
|E(Λn)|∑
e∈E(Λn)
〈f, Pef〉π = 〈f, Phbf〉π,
and the result follows.
Recall the notation of Lemma 6.5.3. In order to compare the marginal distribu-
tion in L to πG, we need to bound the number of connections in Ec(L) between
different boundary components of ξ(G) restricted to L: this will show that typical
FK configurations on Ec(L) do not have much influence on the connectivities
amongst L. This bound follows from the fact that p = O(n−α) for some α > 1/3
and the (approximate) independence of connections between ξi and ξj. For the
reminder of this section we set ξ = ξ(G) for ease of notation. Claims 6.5.6–6.5.7
will then be seen as consequences of the following lemma.
Lemma 6.5.8. Let p = λn−α with λ > 0 and α > 1/3. Let ξ be any boundary
condition on ∂nΛn and let η be any FK configuration on E(L). Then, when q ≥ 1,
for every M ≥ 1,
πξΛn(Rξ(M) | η
)≥ 1− exp
[−Ω
(M log[Mn3α−1]
)].
Proof. Let Y be the random variable for the number of vertices of L connected to
at least one other vertex of L in an FK configuration on Ec(L) sampled from the
distribution πξ,η(·) = πξΛn,p,q(· | η). It is sufficient to show that
πξ,η(Y ≥M) ≤ exp(−Ω
(M log[Mn3α−1]
)).
218
By classical comparison inequalities (see e.g., [47]), when q ≥ 1, no matter the
boundary conditions ξ, η, the random-cluster measure on Ec(L) is stochastically
dominated by the independent bond percolation distribution on Ec(L) with the
same parameter p, which we denote by ν = νΛn,p. (Recall that ν is the distribution
on Ω that results from adding every edge in E(Λn) independently with probability
p.) Hence, if X is defined as Y but for ν on Ec(L), we get
πξ,η(Y ≥M) ≤ ν(X ≥M).
Consider the subgraph Λ = (Λn, Ec(L)). Let Z be the total number of vertices
in L that are connected in a configuration sampled from ν to another vertex at
distance 3 in Λ. Then, since the distance between any two vertices in L in Λ is at
least 3, we see that X ≤ Z and
ν(X ≥M) ≤ ν(Z ≥M).
Thus, it suffices to establish a tail bound for Z. Enumerate the vertices of L
as v1, . . . , v2m and let Zj be the indicator random variable for the event that vj
is connected to another vertex at distance 3 in Λ. For r = 0, 1, 2, 3, split up
Z =∑
r Zr, where
Zr =∑i≥0
Z4i+r+1 .
We claim that there is some suitable c > 0 such that under ν, each Zr is stochastically
dominated by the binomial random variable S ∼ Bin(dn/4e, cn−3α). This is because
the random variables Z4i+r+1 are jointly dominated by independent Bernoulli
random variables, Ber(cn−3α), for a suitable c > 0, since for every k the events
219
Zk = 1 and Zk+4 = 1 depend on disjoint sets of edges. The fact that the
success probability of each one is at most cn−3α follows from the fact that there are
at most 16 choices of three adjacent edges from a vertex in ∂nΛn, and p = λn−α.
Hence, by Chernoff–Hoeffding inequality, for every δ > 0,
ν(Zr ≥ Eν [S] + δdn/4e) ≤ exp[−n
4D(cn−3α + δ ‖ cn−3α
)].
where D(a‖b) is the relative entropy between the Bernoulli random variables Ber(a)
and Ber(b):
D(a‖b) = a log(a/b) + (1− a) log((1− a)/(1− b)) .
Since Eν [S] = O(n1−3α), Var(S) = O(n1−3α), α > 13
and M ≥ 1, it follows that for
every M ≥ 1,
ν(Zr ≥M/4) ≤ e−Ω(M log[Mn3α−1]) .
We have Z = Z1 + Z2 + Z3 + Z4, and so a union bound implies the matching bound
for ν(Z ≥M).
Now recall the definitions of the set S? and AM from (6.5.3).
Proof of Claim 6.5.6. For part (i), observe that if ω is sampled from πξ, then
πξ(AM) = πξ(ω(L) ∈ S? | ω(Ec(L)) ∈ Rξ(M)
)πξ(ω(Ec(L)) ∈ Rξ(M)
).
By Lemma 6.5.8,
πξ(ω(Ec(L)) ∈ Rξ(M)) ≥ 1− e−Ω(M) .
220
Moreover, since
πξ(ω(L) ∈ S? | ω(Ec(L) = 0)) = πG(S?) ,
it follows from Eq. (2.2.10) that
πξ(ω(L) ∈ S? | ω(Ec(L)) ∈ Rξ(M)) ≥ q−MπG(S?) ,
and thus,
πξ(AM) ≥ q−M(1− e−Ω(M))πG(S?).
Similarly for part (ii), we have
πξ(AcM) ≥ πξ(ω(L) 6∈ S? | ω(Ec(L)) ∈ Rξ(M)
)πξ(ω(Ec(L)) ∈ Rξ(M))
≥ q−M(1− e−Ω(M))πG(Sc?)
which is at least e−O(M) since πG(S?) ≤ 12.
Proof of Claim 6.5.7. Let Pmhb be the transition matrix for the MHB dynamics
and for ease of notation set B = Ec(L). We have
Qmhb(AM , AcM) ≤
∑ω∈AM
∑ω′∈Ω:
ω′(B)/∈Rξ(M)
πξ(ω)Pmhb(ω, ω′)
+∑ω∈AM
∑ω′∈Ω:
ω′(L)/∈S?
πξ(ω)Pmhb(ω, ω′) . (6.5.4)
For the first term in (6.5.4), observe by definition of MHB dynamics, for every
221
ω ∈ AM
∑ω′∈Ω:
ω′(B)/∈Rξ(M)
Pmhb(ω, ω′) ≤∑ω′∈Ω:
ω′(B)/∈Rξ(M)
πξ(ω′(B) | ω(L)
)≤ e−Ω(M logn),
where the last inequality follows from Lemma 6.5.8. Hence,
∑ω∈AM
∑ω′∈Ω:
ω′(B)/∈Rξ(M)
πξ(ω)Pmhb(ω, ω′) ≤ πξ(AM)e−Ω(M logn).
For the second term in (6.5.4), observe that ω 6= ω′ and that ω and ω′ can differ
in at most one edge e; otherwise Pmhb(ω, ω′) = 0. Thus, setting
p+(q) = max
p,
q(1− p)q(1− p) + p
, and p−(q) = min
1− p, p
q(1− p) + p
,
we obtain
Pmhb(ω, ω′) =1
|E(Λn)|π(ω′(e) | ω(E(Λn) \ e)
)≤ p+(q)|EG|p−(q)|E(Λn)|
PG(ω(L), ω′(L)) .
Then, since |EG| ≤ |E(Λn)| and p+(q)/p−(q) ≤ q/p,
∑ω∈AM
∑ω′∈Ω:
ω′(L)/∈S?
πξ(ω)Pmhb(ω, ω′)
≤ q
p
∑θ∈Rξ(M)
πξ(θ)∑ω1∈S?
∑ω2 6∈S?
πξ(ω1 | θ)PG(ω1, ω2)
≤ q
pπξ(Rξ(M))
∑ω1∈S?
∑ω2 6∈S?
maxθ∈Rξ(M)
πξ(ω1 | θ)PG(ω1, ω2),
222
where ω1, ω2 are FK configurations on E(L) and θ is an FK configuration on B.
Eq. (2.2.10) implies
maxθ∈Rξ(M)
πξ(ω1 | θ) ≤ q2MπG(ω1),
and so
∑ω∈AM
∑ω′∈Ω:
ω′(L)/∈S?
πξ(ω)Pmhb(ω, ω′) ≤ q2M+1
p
∑ω1∈S?
∑ω2 6∈S?
πG(ω1)PG(ω1, ω2)
=q2M+1
pQG(S?, S
c?).
Combining these two bounds, we get
Qmhb(AM , AcM) ≤ πξ(AM)e−Ω(M logn) +
q2M+1
pQG(S?, S
c?) .
223
Chapter 7
Boundary conditions in
coexistence regions I: free and
monochromatic boundary
In this chapter and Chapter 8, we analyze the sensitivity to boundary conditions
in coexistence regions of the FK model, namely at q > 4 and p = pc(q). Whereas
at continuous phase transition points, our mixing upper bounds held for both the
torus and typical FK boundary conditions, and are believed to be independent of
(realizable) FK boundary conditions, in the presence of phase coexistence, natural
boundary conditions like combinations of free, wired, and periodic can lead to
drastically different temporal mixing. This phenomenon can be seen in analogy to
the Ising Glauber dynamics at low temperatures, where the sensitivity to boundary
conditions is very well-studied [11, 52, 63, 66, 70]. We begin with the simplest case
of this, considering the free and wired boundary conditions (corresponding to free
and monochromatic Potts boundary conditions) at large q and p = pc(q).
224
We prove the sub-exponential mixing for the FK Glauber dynamics using
censoring inequalities (Theorem 2.2.7). A bound on the FK dynamics then implies
the analogous bounds for Chayes–Machta and Swendsen–Wang via Theorem 1.2.2.
The monotonicity requirement of the censoring prevents us from carrying this out
in the setting of the Potts Glauber dynamics, though we expect similar pictures
to hold there. As in [70], we will work directly with distributions over boundary
conditions induced by infinite-volume measures; this is more delicate in the setting
of the FK model where boundary interactions are no longer nearest-neighbor. We
now formally define these distributions. Throughout the chapter, let q > 4 and
p = pc(q) so that π1Z2,p,q 6= π0
Z2,p,q and drop the p, q from the notation.
Definition 7.0.1 (wired/free-at-infinity b.c.). In order to sample a boundary
condition on ∆ ⊂ ∂Λ from π0Z2 or π1
Z2 we sample the infinite-volume configuration
on E(Z2) − E(Λ) and then identify the induced boundary condition with the
partition of ∆ that is induced by that configuration. Call the distribution over
boundary conditions induced by π1Z2(ωE(Z2)−E(Λ) ∈ ·) wired-at-infinity and that
induced by π0Z2(ωE(Z2)−E(Λ) ∈ ·) free-at-infinity.
We say that a distribution P on boundary conditions on ∆ dominates P′
(denoted P P′) if ξ′ ∼ P′ is a stochastically finer partition than ξ ∼ P. Moreover,
if boundary conditions are given by different distributions on different subsets of
∂Λn,m, then define the overall distribution on boundary conditions by sampling
the boundary partitions independently on each of the boundary subsets (with no
connections between the subsets). We say such a boundary condition piecewise
dominates wired-at-infinity and is dominated by free-at-infinity.
A particular example of random boundary conditions that will be useful later
on is the following: if R ⊂ D are two concentric rectangles, then the boundary
225
conditions induced by the connections from ωE(D)−E(R), where ω is sampled from
π1D, dominate the wired-at-infinity boundary conditions.
We prove the following proposition, from which Theorem 8 follows easily.
Proposition 7.0.2. Let q be sufficiently large and consider Glauber dynamics for
the critical FK model on Λ = Λn,n. Let P be a distribution over boundary conditions
on ∂Λ such that P π0Z2 or P π1
Z2 and let E be its corresponding expectation.
Then for every ε > 0, there exists c(ε, q) > 0 such that for t? = exp(cn3ε),
maxω0∈0,1
E[‖P t?(ω0, ·)− πξΛ‖tv
]. e−cn
2ε
,
where ξ ∼ P. In particular, P(tmix & t?) . exp(−cn2ε).
Proof ideas
In order to prove Proposition 7.0.2, we appeal to the Peres–Winkler censoring
inequalities [76] for monotone spin systems, a crucial part of the analysis in [70]
(then later in [63]) of the low-temperature Ising model under “plus” boundary, a
class of boundary conditions that dominate the plus phase. A major issue when
attempting to adapt this approach to the critical FK model at sufficiently large q
with free-at-infinity boundary conditions is that the typical free-at-infinity boundary
conditions still have many boundary connections, inducing problematic long-range
interactions along the boundary (see Fig. 1.5), which, as in other previous chapters,
prevent coupling beyond interfaces.
To remedy this, at every step of the analysis we modify the boundary conditions
to all-free on appropriate segments of length no(1) (this modification can only affect
the mixing time by an affordable factor of exp(no(1))). With high probability, by the
226
exponential decay of correlations under π0Z2 , no boundary connections circumvent
the interface past the modified boundary. Refined large deviation estimates on
fluctuations of FK interfaces then allow us to control the influence of other long-
range boundary interactions (see Figure 7.3) and to couple different configurations
beyond distance n1/2+o(1) (the length scale that captures the normal fluctuations of
the interface), yielding mixing estimates on n× n1/2+o(1) boxes, the basic building
block of the proof. These precise large-deviation estimates are only attainable using
cluster expansion, which forces us to only work with sufficiently large q (as opposed
to all q > 4 where the phase transition is discontinuous); we develop these necessary
equilibrium estimates using cluster expansion in the following Section 7.1, before
proving the main result in Section 7.2.
7.1 Cluster expansion and equilibrium estimates
7.1.1 Surface tension
The order-disorder surface tension will play a large role in both upper and
lower bounds studying the effect of boundary conditions on mixing in the phase
coexistence regime. In the low-temperature regime, the metastable phases are the
q ordered ones and there is a positive order-order surface tension (see, e.g., the
q = 2 case); this leads to sensitivity of mixing times to boundary conditions in
Ising/Potts Glauber dynamics (cf., e.g., [66, 70]), but not in FK/Swendsen–Wang
dynamics (as these are symmetric w.r.t. the q ordered phases). At the critical point,
when q > 4, the disordered phase is also metastable, and should induce similar
sensitivity to boundary conditions in FK Glauber and Swendsen–Wang dynamics.
Let Sn = J0, nK × J−∞,∞K and let (1, 0, φ) FK boundary conditions on ∂Sn
227
denote those that are wired on ∂Sn ∩ (x, y) ∈ Z2 : y ≥ x tanφ and free elsewhere
on ∂Sn. We will always be taking φ ∈ (−π/2, π/2).
Definition 7.1.1. The order-disorder surface tension on Sn in direction φ is given
by
τ1,0(φ) = limn→∞
cosφ
βcnlog
[ Z1,0,φSn
(Z1SnZ
0Sn)1/2
]
(whenever this limit exists) where ZξSn denotes the FK partition function on Sn
with boundary conditions ξ on ∂Sn.
It was proved first in [57] that for q sufficiently large, at p = pc(q) and φ 6= ±π2,
the surface tension τ1,0(φ) exists and satisfies τ1,0(φ) > 0.
7.1.2 Cluster expansion
We first introduce the cluster expansion framework used to prove that the FK
order-disorder surface tension τ1,0(φ) as defined in Definition 7.1.1 is positive for all
large q at p = pc(q). We skip many of the details here as understanding them is not
necessary to the equilibrium estimates we require. This approach was extensively
developed in [30] for the low-temperature Ising model when β βc, and then
extended to the critical Potts/FK model for large q (at the discontinuous phase
transition point) in [72] where one can find more details on the below. Though these
cluster expansion techniques only go through at p = pc(q) when q is sufficiently
large, all of the interface estimates are expected to hold at pc whenever τ1,0(φ) > 0,
so in particular, for every q > 4.
Notation. In what follows, two (primal) edges e, f are adjacent if they share a
vertex. Two primal edges e, f are co-adjacent if e? is adjacent to f ?. Connectedness
228
and co-connectedness are defined naturally with respect to adjacency and co-
adjacency. For an edge subset A ⊂ E(Z2), the (edge)-boundary of A is the set of
all edges in A that are co-adjacent to E(Z2) \ A. The co-boundary of A is the set
of edges in E(Z2) \ A that are adjacent to A.
Definition 7.1.2. Let D be a connected subgraph of Z2 and let a, b be two marked
boundary points on ∂D. Consider Dobrushin boundary conditions that are wired
on the clockwise segment (a, b) and free on (b, a) (where if D is infinite, simply
connected, we define these boundary arcs in the natural way). Then for an FK
realization ω on D, the primal-FK interface (or simply the FK interface), I = I(ω),
is defined as follows:
1. Consider the dual-component of the boundary arc (b, a) in ω, and consider
its co-boundary (a set of closed dual-edges in ω).
2. This co-boundary has a unique co-connected component, call it C?(ω), that
is incident to the boundary arc (a, b). The primal-FK interface I is the set of
all primal edges that are dual to edges in C?(ω).
(Notice that this interface is not necessarily a simple path, but it is connected,
and will have no cycles.) Define the dual-FK interface analogously as follows:
consider the connected component of open edges touching the boundary arc (a, b)
and consider its co-boundary (a set of closed primal edges). This co-boundary
has a unique co-connected component that is incident the boundary arc (b, a);
the set of dual-edges that are dual to edges in this co-connected component will
be the dual-FK interface. For more general boundary conditions, there will be a
compatible collection of interfaces between all boundary segments that are wired.
Recall that we defined the infinite strip Sn = J0, nK× J−∞,∞K.
229
Definition 7.1.3. For any angle φ ∈ (−π2, π
2), an edge-cluster weight function
Φ(C, I) is any real-valued function with first argument that is a connected set of
edges in Sn and second argument that is a possible realization of an FK interface
with Dobrushin boundary conditions that are wired on the clockwise arc between
(0, 0) to (n, bn tanφc), such that for some λ > 0 and every C, I,
1. Φ(C, I) = 0 when C ∩ I = ∅ ,
2. Φ(C, I1) = Φ(C, I2) when ΠC ∩ I1 = ΠC ∩ I2 ,
3. Φ(C, I) = Φ(C + (0, s), I1) when I1 ∩ ΠC = (0, s) + I ∩ ΠC ,
4. |Φ(C, I)| ≤ exp(−λ`(C)) ,
where
ΠC = (x, y) ∈ R2 : ∃y′ s.t. (x, y′) ∈ C ,
and `(C) is the minimum number of edges in a connected subset of E(Z2) that
contains all the boundary edges of C. More generally, for an edge-cluster weight
function Φ and a domain Vn with Dobrushin boundary conditions between (0, 0)
and (n, nbtanφc), its partition function is given by
ZΦ = ZΦ(Vn, φ) =∑I
λ|I|+∑C:C∩I=∅ Φ(C,I) , (7.1.1)
where the sum runs over all possible FK order-disorder interfaces in Vn.
There exists a unique edge-cluster weight function Φo/d—called the FK order-
disorder weight function—such that the probability that the FK interface in Sn
under the (1, 0, φ) boundary condition is I is given by
π(1,0,φ)Sn (I) = Z−1
Φo/dλ|I|+
∑C:C∩I6=∅ Φo/d(C,I) .
230
The FK order-disorder weight function Φo/d is made explicit in [72, Proposition 5].
By adapting the methods of [30] to the FK cluster expansion, the following large
deviation estimate on order-disorder interface fluctuations was obtained in [72].
Proposition 7.1.4 ([72, Proposition 5]). Consider the critical FK model on Sn and
fix a δ > 0. There exist some q0 and c(δ) > 0 such that for all φ ∈ [−π2
+ δ, π2− δ]
and every q ≥ q0, every h ≥ 1 and n ≥ 1,
π1,0,φSn
(I 6⊂ (x, y) : y ∈ [x tanφ− h, x tanφ+ h]
). n2 exp(−ch2/n) .
The following is a finer result, that reformulates results of [30] in the FK setting.
Define the cigar-shaped region Uκ,d,φ for every d > 0 and κ > 0 by
Uκ,d,φ = Sn ∩
(x, y) ∈ Z2 : |y − x tanφ| ≤ d∣∣∣x(n−x)
n
∣∣∣ 12
+κ, (7.1.2)
Proposition 7.1.5. Consider the critical FK model on a domain Vn ⊃ Uκ,d,φ, let
Φo/d be the FK order-disorder weight function, and let Φ be any function satisfying
Φ(C, I) = Φo/d(C, I) when C ⊂ Uκ,d,φ , |Φ(C, I)| ≤ exp(−λ`(C)) ∀C, I ,
(e.g., Φ = Φo/d1C ⊂ Uκ,d,φ). There exist q0 > 0 and f(κ) = O(κ−1) such that for
all q ≥ q0, all φ ∈ [−π2
+ δ, π2− δ], there exists C(q, δ) > 0, such that
| logZΦ(Vn, φ)− logZΦo/d(Sn, φ)| ≤ C(log n)f(κ) . (7.1.3)
231
Moreover, the order-disorder surface tension τ1,0(φ) (given in Def. 7.1.1) satisfies
| logZΦ(Vn, φ)− n log(1 +√q)(cosφ)−1τ1,0(φ)| ≤ C(log n)f(κ) . (7.1.4)
7.1.3 Necessary equilibrium estimates
We now include some equilibrium estimates from cluster expansions at suf-
ficiently large q, which are adaptations of the necessary low-temperature Ising
equilibrium estimates of [30] to the setting of the critical FK model with large q.
For any φ ∈ (−π/2, π/2), the strip Sn = J0, nK×J−∞,∞K has (1, 0, φ) boundary
conditions denoting wired on ∂Sn ∩ J0, nK×y ≤ x tanφ and free elsewhere. Then
I is the set of all order-disorder interfaces (bottom-most dual crossings from
(0, 0) ←→ (0, n tanφ)). Recall the definition of the cigar-shaped region Uκ,d,φ
from (7.1.2) and call Irκ,d,φ ⊂ I the subset of interfaces that are not contained in
the cigar-shaped region.
Proposition 7.1.6. Consider the critical FK model on Sn and fix a δ > 0. There
exists some q0 such that for all φ ∈ [−π2
+ δ, π2− δ], there exists c(d, φ) > 0 such
that for every q ≥ q0 and every κ > 0,
π1,0,φSn (Irκ,d,φ) . n2 exp(−cn2κ) .
Proof. We use an extension of [30, §4] to the framework of the FK/Potts models
in the phase coexistence regime by [72]. The specific case of [72, §5] states the
following: consider the critical FK model on the strip Sn = J0, nK × J−∞,∞K
with (1, 0, φ) boundary conditions. Then for every q ≥ q0 and every d, κ > 0, and
232
φ ∈ (−π2, π
2),
limn→∞
1
nlog
∑I∈Ird,κ,φ
π1,0,φSn (I)∑
I∈I π1,0,φSn (I)
= 0 .
A straightforward adaptation of the proof of [72, Proposition 5] to the form of [30,
Proposition 4.15] in fact yields the very large deviation bound we desire. A specific
case is when φ = 0: there exist some q0, c > 0 such that for all q ≥ q0 and every
a ≥ 0,
π1,0,φ=0Sn (|H| ≥ a) . n2 exp(−ca2/n) , (7.1.5)
where |H| denotes the maximum vertical distance of an edge e in the interface to
the x-axis. Though the result of [72] is written with the Potts model in mind and
thus with integer q, the cluster expansion and all the results hold with noninteger
q as well.
We now prove the following estimate on FK interfaces near a repulsive boundary.
Proposition 7.1.7. Fix c > 0 and consider the critical FK model on R = J0, nK×
J0, `K for c√n log n ≤ ` ≤ n with boundary conditions (1, 0) denoting 1 on ∂n,e,wR
and 0 on ∂sR. Let H be the maximum vertical height of the order-disorder interface.
There exist constants q0, c′ > 0 such that, for all q ≥ q0 and every 0 ≤ a ≤ `,
π1,0R (H ≥ a) . n2 exp
(−c′ a2/n
).
Proof. Fix the a ≤ ` from the statement of Proposition 7.1.7. Denote by (1, 0, ∗)
boundary conditions that are still wired (resp. free) on the intersection of ∂Sn and the
upper (resp. lower) half plane, but now also free on all of Sn ∩ (x, y) : y ≤ −a/2
233
(this induces a free bottom boundary on the semi-infinite strip starting from
y = −a/2).
Clearly π1,0Sn π1,0,∗
Sn . Then by Eq. (7.1.5), there exists A(q) > 0 such that with
probability bigger than 1 − An2 exp(−ca2/4n), the interface under π1,0Sn does not
touch the line y = −a/2, and therefore, with that probability, there is a horizontal
dual-crossing of Sn contained entirely above y = −a/2. Via the grand coupling,
since this is a decreasing event, the same horizontal dual-crossing would be present
under π1,0,∗Sn and we expose the bottom most horizontal dual-crossing above the line
y = −a/2 and couple the configurations above it.
At the same time, using (7.1.5), we have that under π1,0Sn , with the same
probability, the maximum y-coordinate of the interface does not exceed a/2. If we
have coupled the two configurations above a bottommost dual-crossing above the
line y = −a/2, the same would be true of the interface under π1,0,∗Sn . Thus, taking a
union bound,
π1,0,∗Sn (H ≥ a) ≤ 2An2 exp(−ca2/4n) .
Using the monotonicity of the FK model, and denoting by R′ the vertical
translate of R by −a/2, we obtain π1,0R′ π1,0,∗
Sn (ωR′); together with the fact that
H ≥ a is a decreasing event, for c′ = c/4,
π1,0R (H ≥ a) ≤ 2An2 exp(−c′a2/n) ,
where now, the interface is again between (0, 0) and (0, n).
Now for any fixed ε > 0, consider the rectangle V` = J0, nK×J0, `K with ` ≥ 4n12
+ε,
and (1, 0,∆) denoting free boundary conditions on ∂V ∩ (x, y) : y ≥ 2n12
+ε and
∆ = 0 × Jn2− n3ε, n
2+ n3εK and wired elsewhere. Denote the four points at which
234
the boundary conditions change by (w1, w2) ∈ ∂wV × ∂eV , z1, z2 ∈ ∂sV with z1 to
the left of z2. Let C1 and C2 be the blocks J0, n2K×J0, `K and Jn
2K×J0, `K respectively.
The main equilibrium estimate we use in the sequel reads as follows.
Proposition 7.1.8. Let Γ = ω : wiC∗i←→ zi, i = 1, 2. There exists q0 > 0 so
that the following holds. For every q ≥ q0 there exists c = c(q) > 0 such that the
corresponding critical FK model on V` satisfies
π1,0,∆V`
(Γc) . e−cn3ε
.
Proof. This corresponds to Claim 3.10 proven in the appendix of [70] for ` = n12
+ε
in the setting of the low-temperature Ising model using the cluster expansion
of [30] and the analogues of Propositions 7.1.4 and 7.1.5 (Propositions 4.15 and
Theorem 4.16 of [30] respectively), but the extension to larger ` is immediate. We
sketch the proof of [70] before justifying its extension to the current setting.
(a) Via a surface tension estimate analogous to (7.1.4), with probability 1 −
exp(−cn3ε), the interfaces connect wi ←→ zi instead of w1 ←→ w2. In order
to then claim that the two interfaces are confined to the right and left halves
of V , it is shown in the appendix of [70] that with high probability the two
interfaces do not interact, via the exponential decay of the Ising cluster weights
in cluster lengths.
(b) The next step in [70] was to show that the interface does not deviate farther
than nε from the east of z1. The complication in the Ising setup was that the
plus boundary on ∂sV produced a repulsive force on the interface so neither
Proposition 4.15 nor Theorem 4.16 of [30] were directly applicable.
235
(c) To circumvent the problem that ∂sV is not sufficiently far from z1 to contain
the cigar-shaped region, the region V is extended in [70] to V ∪ J0, n/2−n3εK×
J−n, 0K with appropriate boundary conditions. Thereafter, the proof concludes
by lower bounding the weight of all interfaces between w1 and z1 that do not
interact with the extension of V , repeatedly using Theorem 4.16 of [30]: the
key to this lower bound consists of stitching cigar shaped regions of increasing
length, all sufficiently far from the extension of V and lying above the straight
line connecting w1 to z1.
(d) If the extension is accounted for, an estimate of the form of Theorem 4.16 of [30]
with the appropriate angle φ implies that with high probability, the interface
does not deviate far to the east of z1, thus stays bounded away from ∂eC1.
For more details on these arguments, see [70, Appendix A]. With Propositions
7.1.4–7.1.6, steps (a)–(d) carry through in the setting of the critical FK model with
large q, proving Proposition 7.1.8. Notice that the long-range interactions of the
FK model are irrelevant to this situation where all boundary conditions used are
completely free or completely wired and cannot be perturbed.
7.2 Upper bounds under free boundary condi-
tions
In this section, we use the sharp bounds on interface fluctuations, obtained in
Section 7.1, along with the dynamical scheme of [63, 70] to prove sub-exponential
mixing for the FK Glauber dynamics under free and wired boundary conditions
at large q and p = pc(q). This in turn implies sub-exponential mixing for the
236
Swendsen–Wang dynamics with free or monochromatic boundary conditions when
q is large enough and β = βc(q). Because of the long-range interactions of induced
FK boundary conditions, we need to again modify FK boundary conditions to
disconnect these bridges at each step of the recursion. For that, let us define the
following boundary modification, for distributions over boundary conditions.
Definition 7.2.1 (boundary modification). If P is a distribution over boundary
conditions on ∂Λ, ∆ ⊂ ∂Λ, we let P∆ be the distribution over boundary conditions
ξ′ obtained by first sampling a boundary condition ξ ∼ P then setting ξ′ to be
the segment modification ξ∆ as defined in Definition 4.2.1; this induces a coupling
(ξ, ξ′) ∼ (P,P∆). E.g., if ∆ = ∂Λ then ξ′ = 0, and if ∆ consists of a single vertex v
and ξ is induced by a configuration where every boundary vertex is connected to v
and to no other boundary vertex, then ξ′ would be wired on ∂Λ− v.
7.2.1 The recursive scheme
Throughout this subsection, let P be a distribution over FK boundary conditions
on Λn,m and E the corresponding expectation. For ξ ∼ P, we say that
APn,m(t, δ) holds if max
ω0∈0,1E[∥∥∥P t(ω0, ·)− πξΛn,m
∥∥∥tv
]≤ δ .
Using this notation, the following corollary is a consequence of Lemma 2.2.13.
Corollary 7.2.2. Consider Λn,m with boundary conditions ξ ∼ P and ∆ ⊂ ∂Λn,m
such that |V (∆)| n3ε, with P∆ defined as in Definition 7.2.1. If for some t, δ,
maxω0∈0,1
E∆[‖P t(ω0, ·)− πξΛn,m‖tv
]≤ δ ,
237
then APn,m(t′, δ′) holds with δ′ = 8δ + exp(−ecn3ε
) and t′ = exp(cn3ε)t for some
c(q) > 0 independent of ∆ and ξ. Similarly, APn,m(t, δ) implies,
maxω0∈0,1
E∆[‖P t′(ω0, ·)− πξΛn,m‖tv
]≤ δ′ .
Before proving the main theorem, we fix an ε > 0 and prove a recursive scheme
that yields a mixing time bound on rectangles with side lengths n× n 12
+ε for n of
the form n ∈ 2kk∈N. We remark that as in [70], this is a technical assumption
that is not requisite to the upper bound, (see Remark 3.12 of [70]).
For the base scale of the recursion, we use a the following: it is a consequence
of a trivial mixing time upper bound that is exponential in the longer side-length
of a rectangle, that holds independently of the boundary conditions (e.g., pay an
exponential in the perimeter cost to make the boundary conditions free, then use
the canonical paths estimate of Theorem 2.2.9).
Proposition 7.2.3. There exists c = c(q) > 0 such that for every n, for every q,
for the FK Glauber dynamics, APn,m(t, exp[−te−c(n∨m)]) holds independent of P.
An intermediate step to proving Proposition 7.0.2 is proving analogous bounds
for rectangles with free-at-infinity boundary conditions on three sides and wired-at-
infinity on the fourth.
Definition 7.2.4. A distribution P over boundary conditions on Λn,m is in D(Λn,m)
if it is dominated by π0Z2 on ∂n,e,wΛn,m and dominates π1
Z2 on ∂sΛn,m.
We say that An,m(t, δ) holds if APn,m(t, δ) holds for every P ∈ D(Λn,m).
The main estimate for our recursion on increasing rectangles is the following.
238
Proposition 7.2.5. For the critical FK Glauber dynamics with q large enough
on Λn,m, the following holds: for any m ∈ Jn12
+ε, nK and α ∈ (1, 2), there exist
c1, c2 > 0 such that for every t, δ,
An,m(t, δ) =⇒ An,bαmc(
2ec2n3ε
t , c1(δ + e−c2n2ε
+ n2t−c2)), (7.2.1)
and for every m n12
+ε there exist c1, c2 > 0 such that for every t, δ,
An,m(t, δ) =⇒ A2n,m
(3ec2n
3ε
t , c1(δ + e−c2n3ε
)). (7.2.2)
Before proving the implications in Proposition 7.2.5, we first prove two easy but
important consequences.
Corollary 7.2.6. There exists q0 such that, for every q ≥ q0, there exist c, c′ > 0
such the following holds. If n ∈ 2kk∈N is sufficiently large, then the statement
An,n1/2+ε(exp[cn3ε], exp[−c′n2ε]) holds for critical FK Glauber dynamics on Λn,n1/2+ε.
Proof. Choose n0 nε and let t0 = exp(c′nε) for some constant c′ > 0 large
enough that An0,n
1/2+ε0
(t0, δ0) holds with δ0 = exp(−c′nε), noting that such a
choice of c′ exists by Proposition 7.2.3. Applying (7.2.1) followed by (7.2.2) with
m = n12
+ε and α = 212
+ε allows one to, for any n, express the mixing time of a
2n × (2n)12
+ε box in terms of that of a n × n 12
+ε box. Starting with the scale n0
and repeating this step log2 n times implies that there exists c > 0 fixed such that
An,n1/2+ε(exp(cn3ε), c exp[−n−2ε/c]) holds.
We now use the bound on n× n 12
+ε rectangles to obtain mixing time bounds on
the n×m rectangle with boundary conditions that are disordered on three sides
and ordered on the fourth.
239
Corollary 7.2.7. Consider the critical FK Glauber dynamics on Λn,m for m ∈
Jn12
+ε, nK and boundary conditions ξ ∼ P. Then there exists q0 > 0 such that for
all q ≥ q0 there exists a constant c = c(m, q) > 0 such that for large enough n, for
every P ∈ D(Λn,m),
maxω0∈0,1
E[‖P t?(ω0, ·)− πξΛn,m‖tv
]. e−cn
2ε
,
for t? = exp(cn3ε), and in particular if tmmix is the corresponding mixing time,
P(tmmix ≥ t?) . e−cn2ε
.
Proof. Choose an α ∈ (1, 2) such that αkn12
+ε = m for some integer k logm.
Then let hj = bαjn 12
+εc and let Λj = Λjn = J0, nK× J0, hjK so that Λk = Λn,m.
We prove the above by induction on j ∈ J0, kK for n× hj rectangles, showing
that
maxω0∈0,1
E[‖P thj (ω0, ·)− πξΛn,hj ‖tv
]≤ (cj1 + 2jc1)e−c2n
2ε
, (7.2.3)
where c1, c2 are the constants of (7.2.1) for m = hj, and
thj = 2jhjc2(1+n3ε) .
The base case j = 0 is given by Corollary 7.2.6, and if (7.2.3) holds for some
fixed j ∈ J0, k − 1K, then an application of (7.2.1) immediately implies it for j + 1.
The observations that j ≤ log n and hj ≤ n allow us to choose slightly different
constants to obtain the first inequality of Corollary 7.2.7. The triangle inequality
and Eq. (2.2.7) can then be used to boost the bound on d1(t) ∨ d0(t) to a bound
on d(t), so that Markov’s inequality implies the second inequality.
240
B
A
Q
n
bαmcm
1
1
1
t
1
ν1
1
1
η
t
νη2
η
Figure 7.1: Setup for the proof of Eq. (7.2.1) starting from wired initial conditions.
We now prove Proposition 7.2.5 from which the above corollaries follow. The
proof of Proposition 7.0.2 then follows from Corollary 7.2.7 using similar techniques
(see §7.2.2)
Proof of Eq. (7.2.1). Fix any P ∈ D(Λn,bαmc) and observe that the proof is
independent of this choice of P. Consider the quantity, E[‖P t(ω0, ·)− πξΛn,bαmc‖tv]
for ω0 = 0, 1.
(i) Wired initial conditions. Begin with the case when ω0 = 1. Let A,B be two
copies of Λn,m with A translated upwards by b(α− 1)mc such that Q := A ∪B =
Λn,bαmc and A ∩B is the middle rectangle in Qn of thickness m.
In order to compensate for the long-range interactions of the FK model, that are
not present in the setting of [70], we force a set of boundary edges to be free (in a
manner similar to part 2 of the proof of Theorem 3.2 of [70]) to “disconnect” B from
A. Consider the boundary condition ξ′, a modification of ξ ∼ P on ∆ = ∆s ∪∆n
241
for,
∆s =(x, y) ∈ ∂e,wA : y ≤ b(α− 1)mc+ n3ε ,
∆n =(x, y) ∈ ∂e,wB : y ≥ bm− n3εc ,
according to Definition 7.2.1. By Corollary 7.2.2 it suffices, up to new choice
of constants c1, c2 to show that the FK Glauber dynamics under P∆ on A ∪ B
satisfies (7.2.1).
Denote by P the transition kernel of the censored dynamics (Xs)s≥0 started
from the all wired configuration, only accepting updates in block A up to time t,
resetting all edge values in B to 1 at time t then only accepting updates in block
B from time t to time 2t (observe that as in Lemma 3.4 of [70], by Theorem 2.2.7,
resetting all edge values to 1 only slows mixing). Let ν1 denote the distribution
after time t on A and let νη2 denote the distribution after time 2t of configurations
on B given that at time t the configuration on B was set to 1 and the boundary
condition on Bc was η (see Fig. 7.1).
The monotonicity of the FK model along with Theorem 2.2.7 yields,
dξ′
1 (2t) ≤ ‖P 2t(1, ·)− πξ′
Q‖tv . (7.2.4)
Now we aim to show that E∆[‖P 2t(1, ·)−πξ
′
Q‖tv]≤ δ′ where we let δ′ be the second
argument in the right hand side of (7.2.1). For R = A,B we denote by πξ′,ηR the
modified stationary distribution with η boundary conditions on Q−R.
To simplify the notation, throughout the rest of this section, we let ‖µ− ν‖R
denote ‖µR − νR‖tv. Also, for any R, ξ and any random variable X, let πξR(X)
denote the expectation of X under πξR. By the Markov property and the triangle
242
inequality,
E∆[‖P 2t(1, ·)− πξ
′
Q‖tv]≤ E∆
[‖ν1 − πξ
′,1A ‖Bc
]+ E∆
[‖πξ
′,1A − πξ
′
Q‖Bc]
+E∆[‖πξ
′
Q − πξ′,0Q ‖Bc
]+ E∆
[πξ′,0Q (‖νη2 − π
ξ′,ηB ‖tv)
].
(7.2.5)
We begin by bounding the first and fourth terms which are easier, then use
the equilibrium estimates of the cluster expansion to bound the third term in
Lemma 7.2.8, analogous to which the second term can be bounded. First observe
that with probability 1− exp(−cn3ε) for some c > 0, the boundary conditions on
A are sampled from a distribution in D(A). The concern is that the wired initial
configuration may add connections to ∂n,e,wA via the long-range FK interactions.
Such an effect on the boundary conditions on A is impossible if there are no
boundary connections from ∂e,wAc to ∂n,e,wA. Because of the modification on
∆s such a connection would require a connection of length at least n3ε under P
and therefore also in the free phase, which has probability less than exp(−cn3ε)
(see Eq. (2.1.2)). If no such connection exists along the boundary, the boundary
conditions on ∂n,e,wA are sampled from a measure dominated by π0Z2 because the
modification of Definition 7.2.1 only removes connections. From now on, paying a
cost of exp(−cn3ε), we assume the decreasing event that this is the case.
Then by the assumption that An,m(t, δ) holds, the first term in (7.2.5) is smaller
than δ. The observation that P∆ P on ∂e,wQ, implies that for any decreasing f
only depending on ∂n,e,wB,
E∆[πξ′,0Q (f)
]≥ π0
Z2(πξQ(f)) = π0Z2(f) , (7.2.6)
243
so that the πξ′,0Q -averaged distribution on boundary conditions on B is in D(B),
the statement An,m(t, δ) applies, and the fourth term in (7.2.5) is also bounded
above by δ.
We now turn to the second and third terms of (7.2.5), which can be bounded
similarly, and thus we only go through the details of the third term:
Lemma 7.2.8. There exists c(q) > 0 such that
E∆[‖πξ
′
Q − πξ′,0Q ‖Bc
]. e−cn
2ε
.
Proof. We bound the total variation distance by proving that under the grand
coupling of the two distributions on Bc, they agree with probability 1 = e−cn2ε
.
Let ∂±(Q) denote the two connected components of ∂Q−∆n above and below ∆n
respectively. We break up the expectation into an average over Γ1, the set of ξ′ in
which there does not exist a pair (x, y) ∈ ∂+Q× ∂−Q such that xξ′←→ y (i.e. they
are in the same boundary component), and Γc1. By Eq. (2.1.2) of Theorem 2.1.2
and a union bound over pairs of boundary vertices, there exists a constant c′(q) > 0
such that
P∆(Γc1) ≤ 16n2e−cn3ε
. exp(−c′n3ε) , (7.2.7)
For all such ξ′, we use the worst bound of 1 on the total variation distance.
Suppose now that Γ1 holds and observe that this is a decreasing event so P∆(· |
Γ1) P∆. Let Γ2 denote the decreasing event that the interface (bottom-most
horizontal dual crossing) of Q is contained entirely below ∂s(Bc). Let Γ3 be the
decreasing event that there does not exist any vertex x ∈ ∂−e Q such that x←→ ∂nB,
244
n n
n
Q
En(Q)
E′n(Q)
Figure 7.2: Bounding Γ2: if a dual-open circuit (purple) exists under π0Z2 in
E′n(Q)−Q then the boundary conditions on ∂n,e,wQ are dominated by those underfree on E ′n(Q). The wired boundary conditions on ∂s(En(Q)) then also dominate thewired-at-infinity boundary conditions on ∂sQ allowing us to dominate the interface(blue) in Q by that in En(Q).
and there does not exist any y ∈ ∂−wQ such that y ←→ ∂nB.
By monotonicity and the domain Markov property, for ξ′ ∈ Γ1, if Γ2 ∩ Γ3 holds,
it is the case that the boundary conditions on ∂n,e,wBc will not have been affected
by the updates on A ∩B as the interface and all its long-range interactions with
∂Q would be confined to A ∩B. Then one could reveal all boundary components
of ∂e,s,wB so that they are all confined to B and by monotonicity under the grand
coupling the two distributions would be coupled on Bc. As a result,
E∆[‖πξ
′
Q − πξ′,0Q ‖Bc
∣∣ Γ1
]≤ E∆
[πξ′,1Q (Γc2 ∪ Γc3)
∣∣ Γ1
]. (7.2.8)
We bound the two probabilities separately and take a union bound. To bound the
245
probability of Γc2, consider the enlarged rectangle,
En(Q) = J−n, 2nK× J0, n+ αmK ⊃ Q , (7.2.9)
with (0, 1) boundary conditions denoting wired on ∂sEn(Q) and free elsewhere.
By Definition 7.0.1 we sample ∂n,e,wQ separately and then ∂sQ. First observe
that by Eq. (2.1.2), with π0Z2-probability 1− e−cn, there is a dual circuit between
Q and its enlargement by n in all four sides E′n(Q). In that case, the boundary
conditions on Q are dominated by those with free on ∂E′n(Q) (see Figure 7.2). We
can subsequently dominate the boundary conditions on ∂sQ by making them all
wired and extending them all the way across En(Q) to obtain that there exists
c > 0 such that
E∆[πξ′,1Q (Γc2)
∣∣ Γ1
]≤ π0,1
En(Q)(Γc2) + e−cn .
By Proposition 7.1.7, we deduce that π0,1En(Q)(Γ
c2) . exp(−cn2ε) for some c > 0.
We now bound the probability of Γc3.
Claim 7.2.9. There exists c = c(q) > 0 such that for every ξ′ ∈ Γ1,
πξ′,1Q (Γc3 | Γ2) . e−cn
3ε
+ e−cn12 +ε
+ e−cn .
Proof. Under Γ2, by monotonicity, we can only worsen our bound on the probability
of Γc3 by replacing the boundary conditions on Q by wired on ∂−Q− ∂sQ, free on
∂sQ, and ξ′ elsewhere.
Let Q = J0, nK× J0, 2mK ⊃ Q with boundary conditions free on ∂s,nQ∪∆n, and
wired elsewhere. By the exponential decay of correlations in the free phase, with
πQ-probability 1− e−cm, the measure this induces on Q dominates the boundary
246
“1”
“0”
0“0”
∆n
B
Figure 7.3: The event Γ2 is the event that the blue cluster does not climb toohigh. Under Γ2, there is a horizontal dual-crossing of B immediately adjacent theblue interface, but the boundary of the red component may have been perturbedby connections in the blue shaded region. The event Γ3 is the event that the redcomponent then does not then climb above ∆n. Boundary conditions “1” (“0”)indicate those that dominate (are dominated by) wired-at infinity (free-at-infinity).
conditions under Γ2 on Q.
Controlling the probability of Γc3 can now be expressed in a manner similar to
the equilibrium bound, Proposition 7.1.8. As is standard in such problems, (see,
e.g., the appendix of [70]), we can up to an error of e−cn separate the left and right
interfaces (see Proposition 7.1.5 whence the probability that they interact is a large
deviation of order n), and just consider Q with free boundary conditions now on
all of ∂eQ also.
Then extend the northern boundary of Q to make Q symmetric about ∆n and
call the new domain Q′. We can, using monotonicity, let its boundary conditions
(1, 0,∆n) be free on ∂Q′ ∩ (x ≥ m12
+ε ∪∆n) and wired elsewhere.
At this point, we apply Proposition 7.1.8 with ` = n (up to a π2-rotation and
247
a rescaling of m to n/2) to obtain the desired bound: if in the new domain, the
boundary points are denoted by w1, w2 ∈ ∂nQ′ × ∂sQ
′ and z1, z2 ∈ ∂eQ′, and
Cii∈n,s are the north or south halves of Q′ respectively, Proposition 7.1.8 implies
that there exists c > 0 such that for large enough n,
π1,0,∆n
Q′
( ⋂i=n,s
wiCi←→ zi
)≤ e−cn
3ε
.
Putting everything together, we conclude that if Γ′ is the event that under
π1,0,∆n
Q′ , the two interfaces are contained in bottom and top halves of Q′ respectively,
then there exists c = c(q) > 0 such that, for large enough n,
π1,0,∆n
Q(Γ′c) ≤ 2e−cn
3ε
+ 2e−cm + 2e−cn ,
Monotonicity and n12
+ε ≤ m ≤ n imply the bound on E∆[πξ′,1Q (Γc3 | Γ2) | Γ1].
By union bounding over the errors that arise from each of the Γi not occurring,
and otherwise conditioning on their occurrence, we obtain
E∆[‖πξ
′
Q − πξ′,0Q ‖Bc
]. e−cn
2ε
,
as desired.
The corresponding bound on the second term of (7.2.4) is the same up to
changes of scale corresponding to working with distributions on configurations on
A, not Q. Because of the modification on ∆s, as remarked earlier, with probability
1 − exp(−cn3ε), the boundary conditions on A are in D(A). This event is a
decreasing event so it only increases the probability of Γi for i = 1, 2, 3. Because
248
α > 1, the middle rectangle is at least order n12
+ε so the bound on the interface
touching ∂s(Bc) also still holds. Combined with (7.2.4) and the bounds on the
other terms in (7.2.5), we conclude that for some c1, c2 > 0 and all large enough n,
E∆[dξ′
1 (2t)]≤ 2δ + 2c1e
−c2n2ε
.
(ii) Free initial configuration. Consider the dynamics started from ω0 = 0. Let
P denote the transition kernel of the censored dynamics that only updates edges
in B until time t at which point all edges in A are reset to 0 and the dynamics
subsequently only updates edges in A until time 2t. Let νη2 denote the distribution
obtained between times t and 2t given boundary conditions η on Ac and initial
configuration 0 on A. Let
∆ = (x, y) ∈ ∂e,w(Ac) : y ≥ b(α− 1)m− n3εc ,
so that again by Theorem 2.2.7 and Corollary 7.2.2 it suffices to prove the desired
implication under E∆ for the P dynamics.
The Markov property and the triangle inequality together imply,
E∆[‖P 2t(0, ·)− πξ
′
Q‖tv]≤ E∆
[‖P t(0, ·)− πξ
′,0B ‖Ac
]+ E∆
[‖πξ
′,0B − πξ
′
Q‖Ac]
+E∆[πξ′
Q(‖νη2 − πξ′,ηA ‖tv)
]. (7.2.10)
We can bound the first term by δ by assumption and the observation that the free
initial configuration does not change the boundary conditions on B. The second
term can be bounded using the same approach as the proof of Lemma 7.2.8, where
now Ac is shorter than the rectangle in Lemma 7.2.8 but still & n12
+ε. Via an
249
application of Lemma 7.2.8, up to an error of exp(−cn2ε) for c > 0 fixed, the open
component of ∂sB and its perturbations to the boundary of A ∪ B do not reach
∂nAc so the grand coupling couples the two distributions on Ac.
In that case, the measure on ∂A is dominated by π0Z2 and therefore by (2.1.2),
up to an error of e−cm the entirety of Bc is disconnected from Ac. By monotonicity
and the fact that m ≥ n12
+ε, we obtain that there exists c > 0 such that
E∆[‖πξ
′,0B − πξ
′
Q‖Ac]. e−cn
12 +ε
+ e−cn2ε
.
It remains to bound the third term in (7.2.10) following the approach of [70].
Lemma 7.2.10. There exist constants c, c′ > 0 such that
πξ′(‖νη2 − π
ξ′,ηA ‖tv) . e−cn
2ε
+ e−c′` .
Proof. Using the bound on total variation by the probability of disagreement under
a maximal coupling, together with monotonicity and a union bound, write
πξ′(‖νη2 − π
ξ′,ηA ‖tv) ≤
∑e∈E(A)
πξ′
Q
(νη2 (e /∈ ω)− πξ
′
Q(e /∈ ω)).
For any e ∈ E(A), consider K` = e+ J−`, `K2∩A. Denote by νη2,` the distribution
obtained by the dynamics in K` with boundary conditions given by (ξ′, η) on
∂K` ∩ ∂A and 0 elsewhere. Then via a very rough mixing time estimate akin to
Proposition 7.2.3, there exists c > 0 such that
νη2 (e /∈ ω)− πξ′
Q(e /∈ ω) ≤ e−te−c`
+(πξ′,ηK`
(e /∈ ω)− πξ′
Q(e /∈ ω)).
250
Absorbing a 2n2 for the maximum number of edges in A, it suffices to prove there
exist constants c, c′ > 0 such that for every e ∈ E(A),
E∆[πξ′
Q(πξ′,ηK`
(e /∈ ω)− πξ′,ηA (e /∈ ω))
]. e−c
′` + e−cn2ε
.
For any fixed e ∈ E(A) let Γc := e K`←→ ∂K` ∩ Ao. By the FKG inequality,
πξ′,ηA (e /∈ ω | Γ) ≥ πξ
′,ηK`
(e /∈ ω) ,
so it suffices to check that
E∆[πξ′
Q(πξ′,ηA (Γc))
]= E∆
[πξ′
Q(Γc)]. e−c` + e−cn
2ε
.
Proving this is very similar to proving the bound on the probability of Γc2 in
the proof of Lemma 7.2.8 as shown in Figure 7.2. For some c > 0, up to an error of
e−cn, we replace E∆[πξ′
Q(Γc)] with π0,1En(Q)(Γ
c) where En(Q) is the enlarged rectangle
defined in (7.2.9). Then by Proposition 7.1.7, for some other c > 0 with probability
1−exp(−cn2ε), the bottom-most horizontal crossing stays below ∂sA at which point
it suffices to consider π0En(Q)(Γ
c) because in that case, ∂sEn(Q) would be completely
disconnected from K`. But Eq. (2.1.2) and monotonicity imply that there exists
c > 0 such that
π0En(Q)(Γ
c) . e−c` ,
at which point a union bound over the two errors concludes the proof.
Choosing ` = dc−1 log te in Lemma 7.2.10 and union bounding over all e ∈ E(A)
251
yields that there exists a new c > 0 such that for sufficiently large n,
E∆[πξ′(‖νη2 − π
ξ′,ηA ‖tv)
]. t−c .
Combined with the bounds on the first and second terms of (7.2.10) and Theo-
rem 2.2.7 we see that there exist c1, c2 > 0 such that for large enough n,
E∆[‖P 2t(0, ·)− πξ
′
Q‖tv]≤ 2δ + c1e
−c2n2ε
+ 2n2t−c2 ,
which combined with part (i) of the proof, allows us to conclude the proof of (7.2.1).
We now prove the second implication to complete the proof of Proposition 7.2.5.
Proof of Eq. (7.2.2). Fix any P ∈ D(Λ2n,m) and observe that the proof is inde-
pendent of this choice of P. Consider the quantity, E[‖P t(ω0, ·) − πξΛ2n,m‖tv] for
ω0 = 0, 1.
(i) Wired initial configuration. Divide Q := Λ2n,m into,
A =Λn,m + (bn/2c, 0) , B = Λn,m ∪ Λn,m + (n, 0) , C = n × J0,mK .
Let Be, Bw be the two connected components of B so that the dynamics on B does
not update any of the edges of C ⊂ ∂B. Let ∆ = ∆s ∪∆n ∪∆e ∪∆w where
∆s = (x, y) ∈ ∂sA : |x− n| ≤ n3ε , ∆n = (x, y) ∈ ∂nA : |x− n| ≤ n3ε ,
∆e = (x, y) ∈ ∂nA : x ≤ n/2 + n3ε , ∆w = (x, y) ∈ ∂nA : x ≥ 3n/2− n3ε .
We first observe that, as before, by Corollary 7.2.2 and the size of |V (∆)|, it
252
suffices up to new choice of constants in (7.2.2), to prove the implication under P∆
as given by Definition 7.2.1. By monotonicity, Theorem 2.2.7, and Corollary 7.2.2,
up to another change of constants c2, c3, it suffices to prove,
maxω0∈0,1
E∆[‖P 2t′(ω0, ·)− πξ
′
Q‖tv]≤ cδ′
with t′ = exp(cn3ε)t and δ′ = 8δ+ exp(−ec′n3ε) for c, c′ > 0 given by Corollary 7.2.2
and P a censored dynamics. We begin with the situation in which ω0 = 1 and let
P be the transition kernel of the following censored dynamics: for the first time
interval [0, t′), only accept updates from A then at time t′ change all edges interior
to B to 1 and only updates edges interior to B until time 2t′. As before, let ν1
denote the distribution after time t′ on A and let νη2 denote the distribution after
time 2t′ of configurations on B given that at time t′ all edges in B are reset to 1
and the configuration on C was η.
The triangle inequality and Markov property together imply that,
E∆[‖P 2t′(1, ·)− πξ
′
Q‖tv]≤ E∆
[‖ν1 − πξ
′,1A ‖C
]+ E∆
[‖πξ
′,1A − πξ
′
Q‖C]
+E∆[‖πξ
′
Q − πξ′,0Q ‖C
]+ E∆
[πξ′,0Q (‖νη2 − π
ξ′,ηB ‖tv)
].
(7.2.11)
Here, boundary conditions of the form (ξ′, η) denote a boundary condition that
agrees with ξ′ on the intersection of the boundary of the domain and ∂Q, and takes
on boundary conditions η elsewhere. As in part (i) of the proof of (7.2.1), we observe
that because of the modification on ∆e ∪∆w with P-probability 1− exp(−cn3ε),
the boundary conditions on ∂nA are dominated by π0Z2 in spite of the wired
initial configurations on Q − A. In what follows, we assume—paying an error
253
of exp(−cn3ε)—that this decreasing event holds. Observe also that the wired
initial configuration can only make ∂sA more wired, and thus those boundary
conditions will continue to dominate the marginal of π1Z2 . Along with self-duality
and Corollary 7.2.2, this implies that the first term in (7.2.11) is bounded above by
δ′.
The fourth term can be bounded as follows: observe that the distribution
over boundary conditions (ξ′, η) on B under P∆(ξ′)πξ′,0Q (η) coincides with the ∆s
modification of P∆n,e,w(ξ′)πξ′,0Q (η). Because the boundary conditions on ∂nA are
dominated by π0Z2 , an argument like (7.2.6) implies P∆n,e,w(ξ′)πξ
′,0Q (η) π0
Z2(η) on
C. Thus, by Corollary 7.2.2, the third term is bounded above by 2δ′, where the
factor of 2 comes from the fact that B consists of two independent copies of Rn.
We used the fact that the configuration on E(C) is dominated by the partition of
C induced by the FK configuration under πξ′,0Q .
It remains to bound the second and third terms of (7.2.11), which can be treated
similarly so we only go through the bound of the former.
Lemma 7.2.11. There exists c = c(q) > 0 such that
E∆[‖πξ
′,1A − πξ
′
Q‖C]. e−cn
3ε
.
Proof. We bound the total variation distance on C by bounding the probability
that samples from the two distributions agree under the grand coupling.
Following the proof of Lemma 7.2.8, we first define the event Γ1 as the set of
ξ′ in which there exists no pair (x, y) ∈ (∂nBe − ∆n) × (∂nBw − ∆n) such that
xξ′←→ y where
ξ′←→ denotes that x, y are in the same boundary component. We
split up E∆[‖πξ′,1A − πξ
′
Q‖C ] into an average over those ξ′ ∈ Γ1 and those in Γc1.
254
“1” “1”
“1” “1”
∆s
∆n
CA A
“0” “0”
Figure 7.4: The modification analogous to Figure 7.3 for the second step of therecursion (Eq. (7.2.2)).
As in Lemma 7.2.8, we obtain the bound E∆[Γc1] ≤ exp(−cn3ε) for some c > 0
using the fact that the boundary conditions on ∂nA are obtained from a measure
that is dominated by π0Z2 and Eq. (2.1.2) implies an exponential decay of connections.
For all such ξ′, we bound the distance between the two measures by 1.
Now consider, for any e ∈ E(C), E∆[‖πξ′,1A − πξ
′
Q‖tv | Γ1]. Define in analogy
with the proof of Lemma 7.2.8, the decreasing events Γ2 and Γ3:
Γ2 :=⋂
i∈e,w
C∗v(A ∩Bi) ,
and Γ3 is the event that there does not exist any x ∈ ∂nA ∩ Be −∆n such that
x ←→ ∂wBe and likewise, there does not exist any x ∈ ∂nA ∩ Bw − ∆n such
that x ←→ ∂eBw. Under Γ2 ∩ Γ3, we could expose all the wired components of
∂A−∆n −∆s to reveal an outermost dual circuit around C. By monotonicity and
domain Markov, the two distributions would be coupled under the grand coupling
past that dual circuit, so that for all ξ′ ∈ Γ1,
‖πξ′,1A − πξ
′
Q‖C ≤ πξ′,1A (Γ2 ∩ Γ3) .
255
Let A = J0, nK× J2mK, viewed as two copies of A stacked above one another.
Let (0, 1,∆s) boundary conditions on A denote those that are free on ∂nA and
∆s and wired on the rest of ∂A. By monotonicity and the exponential decay of
correlations in the free phase given by (2.1.2), we see that there exists c > 0 such
that
E∆[πξ′,1A (Γc2)
∣∣ Γ1
]≤ E∆s
[πξ′,1A (Γc2)
∣∣ Γ1
]≤ e−cm + π0,1,∆s
A(Γc2) .
Consider π0,1,∆s
A(Γc2) and let V = J0, nK× J0, 4mK, as in Proposition 7.1.8 with
` = 4m. Let (0, 1,∆s) boundary conditions on V denote those that are free above
y = 2m and on ∆s and wired elsewhere. Then it is clear by monotonicity and the
fact that Γc2 is an increasing event, that
π0,1,∆s
A(Γc2) ≤ π0,1,∆s
V (Γc2) .
Because m n12
+ε, we can apply Proposition 7.1.8 to see that there exists a new
c(q) > 0 such that π0,1,∆s
A(Γc2) . exp(−cn3ε).
We now turn to bounding πξ′,1A (Γc3 | Γ2) using the same approach as in the proof
of Claim 7.2.9. Under Γ2, there is a pair of vertical dual crossings in A which allow
us to replace, by monotonicity, the boundary conditions (ξ′, 1) by ones that we
denote (0, 1,∆n) which are free on ∂e,w,sA and also free on ∆n and wired elsewhere.
To make the comparison to the setting of Proposition 7.1.8, perturb the boundary
conditions more by extending the wired segments down along ∂e,wA a length n1/2+ε.
Up to a π-rotation, Proposition 7.1.8 with the choice of ` = m implies that the
two interfaces are confined to the left and right halves of A with probability
1− exp(−cn3ε) for some new c > 0, and sufficiently large n. Monotonicity implies
256
that for any ξ′ ∈ Γ1,
πξ′,1A (Γc3 | Γ2) . e−cn
3ε
,
and together with a union bound, there exists c′ > 0 such that for large enough n,
E∆[‖πξ
′,1A − πξ
′
Q‖C]≤ e−c
′n12 +ε
+ 2e−cn3ε
.
Combined with the prior bounds on the first and third terms in the right-hand
side of (7.2.11) and Theorem 2.2.7, for some c > 0,
E∆[‖P 2t′(1, ·)− πξ
′
Q‖tv]. δ′ + e−cn
3ε
.
(ii) Free initial configuration. The bound for the free initial configuration,
E∆[‖P 2t′(0, ·)− πξ
′
Q‖tv]. δ′ + e−cn
3ε
,
is nearly identical to the above bound with the following change: the censored
dynamics P only allows updates in block A until time t′ then resets all edge values
in B to 0 then only allows updates in B between time t′ and 2t′. In this case, we
can again write, using the same notation as before,
E∆[‖P 2t′(0, ·)− πξ
′
Q‖tv]≤ E∆
[‖ν1 − πξ
′,0A ‖C
]+ E∆
[‖πξ
′
Q − πξ′,0Q ‖C
]+E∆
[‖πξ
′,0A − πξ
′
Q‖C]
+ E∆[πξ′,0Q (‖νη2 − π
ξ′,ηB ‖tv)
].
(7.2.12)
The free initial configuration precludes the long-range interactions of the FK model
modifying the boundary conditions on A and thus they are still in D(A). The
257
bound on the first term then follows from the assumption, without appealing to
self-duality, and the other three bounds hold as for ω0 = 1. Combining the above
with part (i) holds, we see that (7.2.2), which with (7.2.1), concludes the proof of
Proposition 7.2.5.
7.2.2 Proof of Proposition 7.0.2
It now remains to extend Corollary 7.2.7 to boundary conditions that are
dominated by the free phase or dominate the wired phase on all four sides, to obtain
the desired bounds for free and monochromatic boundary conditions. Suppose
without loss of generality that P π0Z2 on Λ; the case P π1
Z2 follows from
self-duality. First consider the dynamics started from ω0 = 1. We wish to prove
that there exists c > 0 such that for t? = exp(cn3ε),
E[‖P t?(1, ·)− πξ
′
Λ‖tv]. e−cn
2ε
.
By an application of Corollary 7.2.2, up to errors that can be absorbed by adjusting
the constant c appropriately, we can modify, as in Definition 7.2.1, the boundary
conditions on ∆ = ∆n ∪∆s, where ∆n = ∆1n ∪∆2
n,
∆1n =0, n × J3n
4− n3ε, 3n
4K ,
∆2n =0, n × Jn
2, n
2+ n3εK ,
and ∆s is the reflection of ∆n across the line y = n/2, and consider dynamics on
Λ with the new measure P∆ on boundary conditions. Let Λ± denote the top and
258
bottom halves of Λ, respectively. Then,
E∆[‖P t?(1, ·)− πξ
′
Λ‖tv]≤ E∆
[‖P t?(1, ·)− πξ
′
Λ‖Λ−
]+ E∆
[‖P t?(1, ·)− πξ
′
Λ‖Λ+
].
We deal only with the first term since the second can be bounded analogously.
Let P be the dynamics that censors all updates not in Λn,3n/4. Then by Markov
property and triangle inequality, the first term can be bounded above by
E∆[‖P t?(1, ·)− πξ′,1‖Λ−
]+ E∆
[‖πξ
′
Λ − πξ′,1‖Λ−
],
where πξ′,1 denotes the stationary distribution on Λn,3n/4 with boundary conditions
that are wired on ∂nΛn,3n/4 and ξ′ elsewhere. First observe that because of ∆1n, with
probability 1− exp(−cn3ε), the wired initial configuration on Λ− Λn,3n/4 does not
affect the boundary conditions on Λn,3n/4 and therefore the boundary conditions
on it are, up to a π-rotation, in D(Λn,3n/4). At this cost, we assume this decreasing
event holds.
The second term can then be bounded as in Lemma 7.2.8 by exp(−cn2ε) for
some c > 0. We use ∆2n to disconnect the wired boundary condition on ∂nΛn,3n/4
from Λ−; the new choice of ∆ and modifications in the sizes of the boxes do not
affect the proofs.
The first term can be bounded via Corollary 7.2.7 for m = 3n/4 since, with
probability 1−exp(−cn3ε), the boundary condition on ∂e,s,wΛn,3n/4 is dominated by
the marginal of π0Z2 while on ∂nΛn,3n/4, it is all wired. Combining the two bounds
and doing the same for Λ+, implies there is a c > 0 such that for t? = exp(cn3ε),
E∆[‖P t?(1, ·)− πξ
′
Λ‖tv]. e−cn
2ε
.
259
For the dynamics started from the free initial configuration, for every e ∈ E(Λ),
` ∈ N, define K` = Λ ∩ e+ J−`, `K2. Let PK` denote the transition kernel of the
dynamics restricted to K` with (ξ, 0) boundary conditions denoting ξ on ∂K` ∩ ∂Λ
and free elsewhere. We claim that, for some c > 0,
E[‖P t?(0, ·)− πξΛ‖tv
]≤∑
e∈E(Λ)
E[P t?(1, e /∈ ω)− πξΛ(e /∈ ω)
]≤∑
e∈E(Λ)
E[‖P t?
K`(0, ·)− πξ,0K`‖tv
]+ e−c` + e−cn ,
for some c > 0. The first inequality is an immediate consequence of monotonicity.
The second follows from an argument similar to that in the proof of Lemma 7.2.10
where now we take a new enlargement, E′n(Λ) that enlarges Λ by n in the southern
direction also. We can replace, by Eq. (2.1.2), πξΛ by π0E′n(Λ) up to an error of e−cn as
before. Again using (2.1.2) in the free phase, up to an error of e−c`, we can replace
π0E′n(Λ) with πξ,0K` , noting that the distributions at e match if e is disconnected from
K` by a dual circuit.
We can then bound the sum in the right-hand side by Proposition 7.2.3: uni-
formly in ξ,e, tmix for K` is bounded above by ec` for some c > 0. Choosing
` = dc−1 log t?e n3ε and union bounding over all the errors yields, for some c > 0,
E[‖P t?(0, ·)− πξΛ‖tv
]. e−cn
2ε
,
as desired. An application of Markov’s inequality, the triangle inequality, and (2.2.7)
to
maxω0∈0,1
E[‖P t?(ω0, ·)− πξΛ‖tv
]. e−cn
2ε
,
260
implies that there exists some c > 0 such that for t? = ecn3ε
and n sufficiently large,
P(tmix ≥ t?) . exp(−cn2ε) as required.
261
Chapter 8
Boundary conditions in
coexistence regions II: other
boundary conditions
In this chapter, we continue analyzing the sensitivity of mixing times to boundary
conditions in phase coexistence regimes, specifically when q is sufficiently large and
p = pc(q). As emphasized, the metastability of the FK Glauber and Swendsen–
Wang dynamics here on (Z/nZ)2 derives from the bottleneck between the ordered
and disordered phases. In Chapter 7, we saw that boundary conditions that pick out
the ordered or disordered phases substantially speed up mixing of the corresponding
FK and Potts models, and conjecturally lead to a picture governed by droplet
scaling and mean-curvature flow similar to the low-temperature Ising model. In this
section, we classify the mixing times under boundary conditions that are piecewise
ordered (i.e., wired or monochromatic—which we denote by R for red), disordered
(i.e., free), and periodic. We find that different combinations of these can lead to
262
either sub-exponential or exponential mixing depending on whether they preserve
or break the order-disorder phase symmetry.
We begin with some equilibrium estimates regarding interface fluctuations
in tilted domains in Section 8.0.1. Then, in Section 8.1, we prove Theorem 10,
showing sub-exponential mixing time upper bounds under Dobrushin boundary
conditions—those that induce a single macroscopic interface. In Section 8.2, we
prove Theorem 11, showing a similar upper bound on cylinders (c.f., torii where
mixing is exponentially slow). In Section 8.3, we show exponentially slow mixing
under mixed boundary conditions (e.g., wired on ∂n,sΛ and free on ∂e,wΛ) that
induce a pair of macroscopic interfaces (Theorem 9).
Finally, let us recall Remark 1.4.1, noting that some of these boundary conditions,
especially those on the cylinder, had not, to our knowledge, been studied in the
case of the low-temperature Ising Glauber dynamics: our results can be translated
to that setting (replacing wired and free by plus and minus) and the proofs would
only simplify.
8.0.1 Interface fluctuations on tilted subsets of Λn,n
In this section, we extend the surface tension estimate of Proposition 7.1.4 to
tilted half-infinite and finite strips. We first define subsets of Sn = J0, nK×J−∞,∞K
we consider in obtaining the desired sub-exponential upper bounds of Theorems 9–10
(see Figure 8.1).
Definition 8.0.1. For every n, which will be understood by context and omitted
from the following notation, define a tilted strip Sb,h,φ as (for b ∈ R, h ∈ R+,
263
φ
2h
(0, b)
Sb,h,φ
Sn
•
•
(0, b)
0
H+b,φ
H−b+a,φ ∩H+b,φ
a
•
•
Figure 8.1: An illustration of some of the sets and interfaces considered in Defini-tion 8.0.1 and Proposition 8.0.2. The (1, 0, φ) boundary conditions are depictedwith bold lines being wired and dashed lines being free.
φ ∈ (−π2, π
2)),
Sb,h,φ = (x, y) ∈ Sn : y ∈ Jb− h+ x tanφ, b+ h+ x tanφK .
Also, define the half-infinite strips,
H+b,φ = (x, y) ∈ Sn : y ≥ b+ x tanφ , H−b,φ = (x, y) ∈ Sn : y ≤ b+ x tanφ .
The following estimate—extending an estimate of [72], which in turn adapts [30]
to the FK model, to half-infinite strips—is a consequence of monotonicity of the
FK model and Proposition 7.1.4.
Proposition 8.0.2. Fix δ > 0 and let q be sufficiently large. For any b ∈ R and
φ ∈ [−π2
+ δ, π2− δ], consider the critical FK model on H+
b,φ ⊂ Sn with boundary
264
conditions (1, 0, b, φ) denoting wired boundary conditions on ∂H+b,φ ∩H
+b+1,φ and free
boundary elsewhere on ∂H+b,φ. There exist constants A > 0 and c(δ, q) > 0 such that
the order-disorder interface I, satisfies
π1,0,b,φ
H+b,φ
(I 6⊂ H−b+a,φ ∩H+b,φ) ≤ An2 exp
(−ca2/n
).
Proof. The proposition was proven in the case φ = 0 in Proposition 7.1.7 combining
monotonicity of the FK model in boundary conditions with Proposition 7.1.4. The
same proof carries over to the case φ 6= 0 as long as φ is uniformly bounded away
from ±π2
as the surface tension estimate of Proposition 7.1.6 on Sn is expressed in
that setup.
The following is the main equilibrium estimate we will use in the proofs of
sub-exponential mixing for general Dobrushin boundary conditions.
Proposition 8.0.3. Fix δ > 0 and let q be sufficiently large. For any b ∈ R,
h ≤ m ≤ n, and φ ∈ [−π2
+ δ, π2− δ], consider the critical FK model on Sb,m,φ∩Λn,n
with (1, 0, b, φ) boundary conditions denoting wired on ∂(Sb,m,φ ∩Λn,n)∩H+b+1,φ and
free elsewhere on ∂(Sb,m,φ ∩ Λn,n). Then there exists c(δ, q) > 0 such that
π1,0,b,φSb,m,φ∩Λn,n
(I 6⊂ Sb,h,φ ∩ Λn,n) . n2 exp(−ch2/n) .
We need the following preliminary estimate (see Figure 8.2 as a guide).
Lemma 8.0.4. Fix δ > 0 and let q be sufficiently large. For any b ∈ R and
φ ∈ [−π2
+ δ, π2− δ], consider the critical FK model on H+
b,φ ∩ Λn,n with (1, 0, b, φ)
boundary conditions denoting wired on ∂(H+b,φ ∩ Λn,n) ∩H+
b+1,φ and free elsewhere.
265
m h
b
Sb,h,φ
Sb,m,φ
•
•
b •
•φ
0
H+b,φ ∩ Λn,n
a
Figure 8.2: Left: the sets considered in Proposition 8.0.3, where we bound theprobability of the interface exceeding height ±h. Right: the proof of Lemma 8.0.4uses monotonicity in b.c., to go from bounding the probability of the order-disorderinterface exceeding height a in the gray set to the union of the gray and purplesets, to all shaded regions.
There exists c(δ, q) > 0 such that
π1,0,b,φ
H+b,φ∩Λn,n
(I 6⊂ H−b+a,φ ∩H+b,φ ∩ Λn,n) . n2 exp(−ca2/n) .
Proof. Let B = H−b+a,φ ∩H+b,φ ∩ Λn,n and consider the region S ′ = H+
b,φ ∩H−n,0.
For a general domain D ⊂ Sn let the boundary conditions (1, 0, b, φ) on it denote
wired on ∂D ∩ H+b+1,φ and free elsewhere on ∂D. By monotonicity in boundary
conditions, and then inclusion of events,
π1,0,b,φ
H+b,φ∩Λn,n
(I 6⊂ B) ≤ π1,0,b,φS′ (IΛn,n 6⊂ B)
≤ π1,0,b,φS′ (I 6⊂ H−b+a,φ ∩H
+b,φ) .
266
The application of the FKG inequality here is valid as under (1, 0, b, φ) boundary
conditions, events of the sort “I exceeds some height a”, are decreasing events, as
adding edges only pushes the order-disorder interface down. This property will be
used throughout the paper. By monotonicity in boundary conditions again,
π1,0,b,φS′ (I 6⊂ H−b+a,φ ∩H
+b,φ) ≤ π1,0,b,φ
H+b,φ
(I 6⊂ H−b+a,φ ∩H+b,φ) .
By Proposition 8.0.2, there exists a c(δ, q) > 0 such that
π1,0,b,φ
H+b,φ
(I 6⊂ H−b+a,φ ∩H+b,φ) . n2e−ca
2/n .
Proof of Proposition 8.0.3. As before, let (1, 0, b, φ) boundary conditions on
D ⊂ Z2 denote those that are wired on ∂D ∩H+b+1,φ and free on ∂D ∩H−b+1,φ. By
a union bound, write
π1,0,b,φSb,m,φ∩Λn,n
(I 6⊂ Sb,h,φ) ≤ π1,0,b,φSb,m,φ∩Λn,n
(I 6⊂ H+b−h,φ) + π1,0,b,φ
Sb,m,φ∩Λn,n(I 6⊂ H−b+h,φ)
(8.0.1)
and consider the two quantities independently. Observe that the first event on
the right-hand side is an increasing event, while the second event is a decreasing
event. By reflection symmetry and self-duality, if we prove the desired bound on the
latter, for general b, h, φ, it implies the former also. By monotonicity in boundary
conditions,
π1,0,b,φSb,m,φ∩Λn,n
(I 6⊂ H−b+h,φ) ≤ π1,0,b,φ
Sb,m,φ∩H+b−1,φ∩Λn,n
(I 6⊂ H−b+h,φ)
≤ π1,0,b,φ
H+b−1,φ∩Λn,n
(I 6⊂ H−b+h,φ) .
267
The right-hand side above is exactly the probability bounded in Lemma 8.0.4, from
which, along with the symmetry noted above and (8.0.1), the desired upper bound
follows.
We will also need the following bound in order to prove the mixing time upper
bounds on cylinders in §8.2. It is an adaptation of the proof of [70, Lemma A.6]
from the Ising model to the FK model via Propositions 7.1.4–7.1.5, and we omit
some details.
Proposition 8.0.5. Fix q to be large enough and ε > 0, and consider the critical
FK model on Λn,h for n12
+ε ≤ h ≤ n with 1, 0 boundary conditions denoting wired on
∂sΛn,h and free elsewhere. For ρ ∈ (0, 1) and δ small enough, there exists c(ε) > 0
such that
π1,0Λn,h
(I ∩ Jbn2c − δn, bn
2c+ δnK× J0, ρhK = ∅) & e−c(ρh)2/n .
Proof. Denote by B = Jbn2c − δn, bn
2c+ δnK× J0, ρhK. Recall the definition of the
cigar-shaped region Uκ,d,φ in (7.1.2). Following [70], for every − log2 n+ 2 ≤ i ≤
log2 n− 2, let zi be the nearest vertex to
(xi, d
∣∣∣∣ xi(n− xi)n
∣∣∣∣ 12
+κ)
where xi =n
2+
sgn i
4
|i|−1∑j=1
2−j .
Let Uzi,zi+1be the cigar shaped region zi+Uε/2,(1−ρ)∧ρ,φzi,zi+1
where d = (1+ρ)h/n12
+ε
and φzi,zi+1is the angle of the vector from zi to zi+1 (see, e.g., [70, Fig. 8]).
By monotonicity in boundary conditions and I ∩B = ∅ being an increasing
event,
π1,0Λn,h
(I ∩B = ∅) ≥ π1,0
H−h,0(I ∩B = ∅) ,
268
where (1, 0) boundary conditions on ∂H−h,0 are wired on ∂H−h,0 ∩ H−0,0 and free
elsewhere. Since Uzi,zi+1∩ B = ∅ for all i, if I ⊂
⋃|i|≤log2 n
Ui, then the desired
property holds.
Lemma A.6 of [70] gives a lower bound on the partition function restricted to
interfaces contained in⋃Ui as defined above in the setting of the Ising model;
the same proof extends that lower bound to the partition function of such FK
interfaces, noting that Proposition 7.1.5 is an analogue of [30, Theorem 4.16]: there
exists c > 0 such that
∑I⊂H−h,0
λ|I|+∑C∩I6=∅ Φ(C,I)1I ⊂
⋃Ui ≥ exp(−βcτ1,0(0)n− c(ρh)2/n) ,
where an error of (log n)O(1) was absorbed into the term c(ρh)2/n via the assumption
that h ≥ n1/2+ε and a choice of a suitable constant c > 0.
Furthermore, Proposition 7.1.5, in particular (7.1.4), implies there exists c > 0
so that
∑I⊂H−h,0
λ|I|+∑C∩I6=∅ Φ(C,I) ≤ exp(−βcτ1,0(0)n+ c(log n)c) .
Then writing the desired probability as in [70, (A.36)] as
π1,0
H−h,0(I ∩B 6= ∅) ≥ π1,0
H−h,0
(I ⊂
⋃|i|≤log2 n−2 Ui
)=
∑I⊂H−h,0
λ|I|+∑C∩I6=∅ Φ(C,I)1I ⊂
⋃Ui∑
I⊂H−h,0λ|I|+
∑C∩I6=∅ Φ(C,I)
,
and plugging in the above lower bound on the numerator and upper bound on the
denominator, concludes the proof.
269
8.0.2 Canonical paths estimate under typical FK boundary
conditions
In Section 6.4, we showed a canonical paths bound on n × l rectangles with
arbitrary realizable boundary conditions on three sides (but crucially not on one of
the long sides). In the proof of Theorems 10–11, we will require a similar bound
when both long-sides of Λn,l may have realizable boundary conditions carrying
long-distance information. We leverage the fact that we are at a discontinuous
phase transition point to prove a similar canonical paths estimate, but now only
for typical instances of these boundary conditions.
Proposition 8.0.6. Let q > 4, p = pc(q), and φ ∈ (−π2, π
2). Consider the FK
model on S = Sb,h,φ ∩ Λn,n with boundary conditions ξ ∼ P, where P is arbitrary
on ∂S ∩ ∂e,wΛn,n, and piecewise dominates wired-at-infinity or is dominated by
free-at-infinity on each of the other sides of ∂S. There exists Cq > 0 such that for
every sequence fn →∞, the FK Glauber dynamics have
P(ξ : tξmix ≥ |E(S)|2 exp[2(4h+ fn) log q]
)≤ O(n2e−Cqfn) .
The proof will use the following straightforward comparison estimate.
Lemma 8.0.7. Fix any p ∈ (0, 1) and q ≥ 1 and consider the FK model on
S = Sb,h,φ ∩ Λn,n with arbitrary boundary conditions ξ. For a connected edge subset
F ⊂ E(S), we have for every ω ∈ 0, 1E(S) and every ζ, ζ ′ ∈ 0, 1F
πξS(ω | ωF = ζ) ≤ q|∂F−∂S|+k(ξ,F )−1πξS(ω | ωF = ζ ′)
where k(ξ, F ) is the number of components of ξ that intersect both ∂F and ∂S−∂F .
270
Proof. By the domain Markov property, the difference between πξS(· | ωF = ζ) and
πξS(· | ωF = ζ ′) is in the boundary conditions that ζ ∪ ξ and ζ ′∪ ξ induce on S−F .
The boundary partitions induced can differ arbitrarily on ∂F − ∂S contributing
a factor of q|∂F−∂S|; on the other hand, since both configurations use ξ boundary
conditions on S, the boundary partitions on the rest of ∂(S − F ) can only differ if
ζ induces additional boundary connections between distinct boundary components
of ξ that reach ∂S − ∂F ; this accounts for the factor of qk(ξ,F )−1.
Proof of Proposition 8.0.6. We modify the proof of the canonical path estimate
for spin-systems with short-range interactions to the present setup. Partition every
segment L of ∂S \ ∂e,wΛn,n on which ξ is independently sampled, into b |L|fnc sub-
segments `i of fn vertices each, and possibly an additional exceptional segment `0.
Recall that for every edge e ∈ L, we defined Be to be the set of components of ξ
that contain vertices on both sides of L− e, called bridges : see Section 3.2. Let
Efn =
ξ :⋂L
⋂i≥1
∃e ∈ `i s.t. |Be| ≤ 1. (8.0.2)
We first prove that for every ξ ∈ fn, the FK Glauber dynamics on S has
gapξ,S ≤ |E(S)|(1 +√q) exp[2(4h+ fn) log q] . (8.0.3)
This will follow from a standard application of the canonical paths argument
for spin-systems with short-range interactions. Namely, label the edges in S
lexicographically in their midpoint, first by horizontal coordinate, then by vertical
coordinate, and define the path ζ(ω, ω′) (identified with a sequence of edges in Ωrc
between FK configurations, ω and ω′) as follows: let el1 , ..., elk be the sequence of
271
edges on which ω(eli) 6= ω′(eli), labeled in their lexicographic ordering. The i’th
edge in ζ(ω, ω′) will then be between configuration
η = ω′e1,...,eli−1 ∪ ωeli ,...,e|E(S)|
and its neighbor η′ which also has η′(ei) = ω′(ei). Also, let η∗ be the configuration
that is instead given by η∗ = ωe1,...,eli−1 ∪ ω′eli ,...,e|E(S)|. Then by Lemma 8.0.7
applied with the choice of F being e1, ..., eli−1 or eli , ..., e|E(S)|,
πξS(ω)πξS(ω′) ≤ πξS(η)πξS(η∗)q2(4h+fn)
as |∂F − ∂S| ≤ 2h and k(ξ, F ) ≤ 2h+ fn. This follows from the fact that ξ ∈ Efn
and the nested structure of boundary bridges, and the fact that the sides with
arbitrary boundary conditions have height at most 2h. By construction, for every
transition (η, η′), the map (ω, ω′) 7→ (η, η∗) is injective. Moreover, the probability of
making any transition in Ωrc is bounded below by 11+√q. Putting all this together,
by the path method we see that for every ξ ∈ Efn , Eq. (8.0.3) holds and by (2.2.3),
the corresponding bound with an extra factor of O(|E(S)|) also holds for the mixing
time.
It remains to bound the P-probability that ξ ∈ Efn . Fix a segment L of
∂S \ ∂e,wΛn,n on which P is piecewise sampled, and fix a sub-segment `i, then
take a union bound over all such segments and all `i. Moreover, suppose that
the boundary conditions on the segment L are dominated by free-at-infinity (the
estimate for the case when the distribution of P on ξL dominates wired-at-infinity
follows by similar reasoning). Since |Be| ≥ 1 is an increasing event, it suffices to
272
show
π0Z2(∃e ∈ `i s.t. |Be| = 0) ≥ 1−O(n2e−Cqfn) .
However, by planarity of boundary conditions induced by π0Z2 on ∂S, the complement
of the left-hand side is the event that there exist x, y in the two parts of L− `i such
that x←→ y, which, if Cq is the constant from (5.1.1), has probability at most
|L− `i|e−Cqfn ≤ 2ne−Cqfn .
(For the wired-at-infinity boundary conditions, observe that in order for |Be| > 1
for every e ∈ `i, there must exist x, y in the two parts of L− `i such that x∗←→ y:
this is in turn a decreasing event with exponentially decaying probability under
π1Z2 .) Taking a union bound over at most 4n sub-segments `i of various segments
L, we obtain that
P(ξ ∈ EC) ≤ 8n2e−Cqfn .
Remark 8.0.8. By standard comparison estimates, one could allow arbitrary
boundary conditions on any boundary segment of size O(h) of ∂S, paying a cost
in the spectral gap of at most exp(ch log q). This would follow from bounding the
ratio of the Dirichlet forms and the Radon-Nikodym derivative between the two
(see e.g., [66, Lemma 2.8] and [42, Eq. (5.1)] for details).
273
8.1 Subexponential mixing time with Dobrushin
boundary conditions
In this section, we consider the mixing time of FK Glauber and Swendsen–Wang
dynamics with boundary conditions that are free on a subset of ∂Λ and wired/red
elsewhere. While §8.3 demonstrates that such boundary conditions can induce a
slow mixing (at least exp(cn)) by respecting the order-disorder phase symmetry, this
section will establish that the mixing time is faster (at most exp(n1/2+o(1))) under
boundary conditions that have a single order-disorder interface. Define a general
class of order-disorder Dobrushin boundary conditions, whose FK representation is
wired on one connected boundary arc and free elsewhere. Let an, bn be two distinct
points on ∂Λn,n. For marked boundary points (an, bn), FK Dobrushin boundary
conditions are those that are wired on the clockwise (starting from the origin) arc
(an, bn) and free on (bn, an).
Sketch of proof
Our proof of Theorem 10 adapts the proof of tmix . exp(c√n log n) in [66] for
the low-temperature Ising model with plus boundary conditions, but using the
censoring inequality Theorem 2.2.7 instead of the block dynamics approach of [66].
For Dobrushin boundary conditions between (a, b), we sequentially (at times ti)
censor all updates except those in the strip Bi of height√n log n parallel to the
line segment 〈a, b〉 such that Bi and Bi+1 overlap on half their height. On the one
hand, the canonical paths estimate Proposition 8.0.6 bounds the mixing time of Bi
by exponential in its height, so that we take ti − ti−1 = O(ec√n logn). On the other
hand, by Theorem 2.2.7, if the censored chains started from all wired and all free are
274
coupled with high probability, that bounds the mixing time of the original chain. To
couple these two chains, we systematically push the interfaces of the chains started
from these initial configurations down (resp., up), until they are within O(√n log n)
of each other. This is possible because with high probability (see Lemma 8.0.4) the
interface of the bottom boundary of Bi (where the free initial configuration is seen)
never reaches the top of Bi−1 and the censored dynamics continues pushing the
interface down to O(√n log n) distance of 〈a, b〉 (see Figure 8.4).
Proof of Theorem 10. If an, bn are on the same side of ∂Λn,n, rotate Λn,n so that
they are both on ∂sΛn,n and the angle of the interface between them will be zero.
Then the proof below when an and bn are on different sides applies identically; the
only difference is the boundary conditions in the last step of recursion, where the
identity coupling still couples all FK chains with probability 1 in the mixing time
of that last block.
Now suppose an = (a1n, a
2n) and bn = (b1
n, b2n) are on different sides of ∂Λn,n; by
rotational and reflective symmetry and self-duality, we can take an ∈ ∂wΛn,n to
be the first point encountered clockwise from the origin, and ensure that φn =
tan−1(a2n−b2na1n−b1n
) is such that φn ∈ [−π4, π
4]. Fix any such choice of an, bn ∈ ∂Λn,n and
let φ = φn.
We establish the theorem for FK Glauber dynamics with (an, bn) Dobrushin
boundary conditions. Throughout this proof, let c1 = c1(q) > 0 be a large enough
constant (e.g., 5/Cq for Cq from Proposition 8.0.6 would suffice). Define the
275
overlapping blocks
Bi := San+(N−i)`,`,φ ∩ Λn,n (i = 1, . . . , N) and
Bi := San−(N−i)`,`,φ ∩ Λn,n (i = −1, . . . ,−N + 1) for N =⌈n`
⌉,
where we choose ` = c3
√n log n for c3 = 4/
√c2, with c2(q) as given by Proposi-
tion 8.0.2, so N ∼ c−13
√n/ log n (see Figure 8.3). Because of our choice of (B±j)
Nj=1,
as many as N of the Bj may be empty, and henceforth if Bj is empty, we say any
associated events hold trivially. Let
ti = i ·K log n ·N2 · |E(Bi ∪B−i)|2e4(4`+c1 logn) log q for 0 ≤ i ≤ N ,
for K to be chosen large later. Define the censored chain Xt: between times ti−1
and ti censor all updates except those in Bi ∪B−i. Let X1t be the censored chain
started from X0 = 1 and X0t be the censored chain started from X0 = 0. By
Theorem 2.2.7 and (2.2.7), it suffices to show that there exists a coupling of X0t
and X1t such that
P(X1tN6= X0
tN) = o(n−2) , (8.1.1)
since for any K, c1 fixed, we have tN . exp(O(√n log n)) as desired.
We now define a monotone coupling of X1t and X0
t which satisfies the above.
For each i = 1, . . . , N , define the event
A±i =X0tiE(R±i ) 6= X1
tiE(R±i )
, where R±i =
i⋃j=1
B±j \B±(i+1) .
276
an
bn
: Bi
: Bi+1
: R+i
BN
2`
2`
•
•
Figure 8.3: The blocks Bi in Λ, each of which are wired on all sides except possibly∂sBi and the region R+
i on which we have coupled.
We can then write under our coupling,
P(X0tN6= X1
tN) ≤ P
( N⋃i=1
(A+i ∪ A−i )
)≤ P(A+
N−1) + P(A−N−1) + P(AN | (A+N−1)c, (A−N−1)c) .
The bounds on P(A+N−1) and P(A−N−1) are analogous (using the duality of the FK
model) and therefore we only bound the former. Abusing notation slightly, when
we consider the restriction of the chain X1/0t to a boundary ∂S , we mean the
boundary conditions induced on that line by X1/0t Λ−S. We will prove the following
inductively.
Claim 8.1.1. There exists c(q) > 3 so that, for large enough K, c1, the following
holds.
(1) For every 1 ≤ i ≤ N−1, there exists an event Fi measurable w.r.t. (X0t , X
1t )t≤ti
such that P(Fi) ≥ 1−O(in−c) and P(X0ti∂nBi+1
∈ · | Fi) dominates the boundary
conditions induced by π1R+N−1
(·R+i
) on ∂nBi+1.
277
(2) For every 1 ≤ i ≤ N − 1,
P(A+i ) = P(X1
tiR+
i6= X0
tiR+
i) . in−c .
Proof of Claim 8.1.1. For the base case, (1) holds trivially because the outer
boundary on B1 is just given by the boundary of ∂Λ which will be all-wired there.
The base case proof of (2) is just a simplification of the proof of the inductive step
for (2) so we do not repeat it here. Now suppose both (1) and (2) hold for some
i− 1 and prove they hold for i for a c(q) > 0 we will pick later. By the inductive
hypothesis, with probability 1−O((i−1)n−c), we have X0tiR+
i−1= X1
tiR+
i−1, and the
boundary conditions on ∂nBi dominate wired-at-infinity. In particular, since on the
other sides of Bi both chains’ boundary conditions are either all-free or all-wired, by
Proposition 8.0.6, with probability 1−O((i− 1)n−c)−O(n−Cqc1+2) (where Cq > 0
is the constant from that proposition) the mixing time of FK Glauber dynamics
with boundary conditions given by X1ti∂Bi is at most |E(Bi∪B−i)|2e4(4`+c1 logn) log q.
Now suppose that both of these events hold and consider the probability that
X1tiR+
i6= X0
tiR+
i.
By submultiplicativity of total-variation distance, with probability 1−O(n−K)
there exists a coupling of X1/0ti Bi to π
X1/0ti−1
Bi. It remains to compute the probability
of (1) obtaining boundary conditions under πX0ti−1
Bithat dominate those induced by
π1R+N−1
(·R+i
) on ∂nBi+1, and (2) succeeding in coupling πX1ti−1
Bito π
X0ti−1
Bion Bi−Bi+1
(and therefore on all of R+i ).
Under the above events, let (ζ, 0) be the boundary conditions induced on ∂Bi by
X0ti−1
and (ζ, 1) be those induced by X1ti−1
where ζ is a random boundary condition
sampled from a distribution dominating π1R+N−1
(·R+i−1
) on ∂nBi. Then, the monotone
278
an
bn
1/0
BN
“1”
•
•
•
•
an
bn
1/0
BN
“1”
Figure 8.4: The green and blue blocks Bk, Bk+1 are updated by the censoreddynamics in two consecutive steps; with high probability, the chain (X0
tk)k pushes
its interface down toward 〈a, b〉 by ` at every step, and is subsequently coupled to(X1
tk)k on the growing gray region.
coupling of πζ,0Bi to πζ,1Bi couples the two on Bi − Bi+1 whenever the bottom-most
horizontal crossing I in the sample from πζ,0Bi has I ⊂ Bi+1 (see Figure 8.4). In
that case, by revealing dual-edges from the bottom up, the configurations from πζ,0Bi
and πζ,1Bi could be coupled above that interface and in particular on all of Bi−Bi+1.
(Observe that conditioning on the configuration below the interface, in order to
reveal I, cannot affect the boundary conditions above it because on each side
of ∂(R+i ∪ Bi+1) the boundary conditions are all-wired or all-free and additional
connections cannot be induced (cf. the boundary bridges of [43]).)
Observe, also, that when considering ζ dominating the boundary conditions
induced by π1R+N−1
(·R+i−1
), since the boundary on ∂Bi+1 ∩ ∂Λ is wired, by the
domain Markov property, the boundary conditions induced by X0ti+1
on ∂nBi+1
when I ⊂ Bi+1 holds will dominate π1R+N−1
(·R+i
). This implies part (1) if we can
bound πζ,0∂nBi(I ⊂ Bi+1).
Thus, for both (1) and (2) of the induction, it only remains to bound the
279
probability
E[πζ,0Bi(I 6⊂ Bi+1
) ∣∣ Fi−1
]≤ Eπ1
R+N−1
[πωR+i−1
,0
Bi(I 6⊂ Bi+1)
]≤ π1,0
R+i ∪Bi
(I 6⊂ Bi+1) , (8.1.2)
where (1, 0) boundary conditions on R+i ∪Bi are free on ∂sBi and wired elsewhere.
In that case, Lemma 8.0.4 (noting that the estimate there was independent of
b) with the choice of a = 12c3
√n log n implies that the probability in (8.1.2) is at
most O(n−6). Combining all of the above, the probability that items (1) and (2)
hold is at least
1−O((i− 1)n−c) +O(n−Cqc1+2) +O(n−K) +O(n−6) ,
which concludes the proof of the induction as long as we take c1, K large enough
that that the latter three terms are all o(n−c).
By Claim 8.1.1, we see that both P(A+N−1) and P(A−N−1) have probability at most
O(Nn−3) which is o(n−2). It remains to bound P(AN | (A+N−1)
c, (A−N−1)c) using
similar reasoning to the above. First of all, by part (2), with probability 1− o(n−2)
the chains X0tN−1
and X1tN−1
are coupled on both ∂nBN and ∂sBN . Moreover, by
part (1), the boundary conditions they induce dominates wired-at-infinity on ∂nBN
with probability at least 1− o(n−2) and likewise, are dominated by free-at-infinity
on ∂sBN with similar probability. Therefore, by time tN , by submultiplicativity of
total-variation distance and Proposition 8.0.6, we have
P(AN | (A+N−1)c, (A−N−1)c) ≤ 1− o(n−2)−O(n−Cqc1+2)−O(n−K) ,
280
and for c1, K large enough, the right-hand side is 1− o(n−2).
8.2 Sub-exponential mixing on cylinders
For the rectangle Λn,n define FK boundary conditions (p, 1) (resp. (p, 0) or
(p, 1, 0) boundary conditions) to be periodic boundary conditions on ∂n,sΛn,n and
wired boundary conditions on ∂e,wΛn,n (resp. free on ∂e,wΛn,n or wired on ∂wΛn,n
and free on ∂eΛn,n). We prove the mixing time upper bounds on cylinders with the
above boundary conditions (Theorem 11) at the same time. In what follows, we
use c > 0 to denote the existence of a constant (possibly depending on q), where
different appearances of c at different places may refer to different values.
The proof builds on the proof of Theorem 10 in that we use the censoring
inequalities to push the FK order-disorder across Λn,n in order to couple the
chains X1t and X0
t . We consider the censored dynamics that sequentially update
N = O(n12−ε) overlapping vertical strips of width n
12
+ε, ordered from left to right.
However, unlike the case in Theorem 10, our strips do not have wired boundary
conditions on three sides (their boundary conditions on the top and bottom are
periodic), and therefore, the interface is pushed to the next strip to be updated
with probability exp(−cn2ε) (rather than 1 − o(1)): see Figure 8.5. Thus, with
probability exp(−cn 12
+ε) the interface moves to next strip in N consecutive time
steps, so that one will succeed, with high probability, at pushing the interface
completely across Λn,n after exp(n12
+2ε) attempts.
Proof of Theorem 11. We again prove the upper bound for the FK Glauber
dynamics which translates to an upper bound on Swendsen–Wang dynamics by
Theorem 1.2.2. Fix any ε, δ > 0 small and consider blocks Bi for i = 1, ..., N where
281
N = dn`e − 1, given by
B2i−1 = J(i− 1)`, (i+ 1)`K× Jδn, (1− δ)nK
B2i = J(i− 1)`, (i+ 1)`K× J0, bn2c − δnK ∪ Jbn
2c+ δn, nK
where ` = n
12
+ε ,
and B2N+1 = B0 = Λ−∪Ni=1B2i−1 ∪B2i. Since the boundary conditions on ∂n,sΛn,n
are periodic, each B2i can be viewed as a single connected rectangle with boundary
∂n,sB2i = J(i− 1)`, (i+ 1)`K× bn2c+ δn, bn
2c − δn .
We prove the mixing time upper bound for (p, 1, 0) boundary conditions. We will
pause to comment where the (p, 1) boundary conditions would behave differently
(namely only when updating block B2N+1), and on why this does not affect the
proof. The (p, 0) boundary conditions can be treated by the dual version of the
argument we present.
We will cycle through the blocks Bi periodically, so define
Bj = Bj mod (2N+1) for all j > 2N .
Define the following censored Markov chain X1t (resp., X0
t ) started from initial
configuration 1 (resp., 0): for all i ≥ 0, Let fn = n12
+3ε and let
ti = i · n ·N2 · |E(Bi)|2e2(4`+fn) log q ;
during times [ti−1, ti), censor all updates outside block Bi. Let
T := t2N+1 exp(n12
+2ε) = exp(O(n12
+3ε)) ; (8.2.1)
282
by Theorem 2.2.7 and (2.2.7), it will suffice to show that
P(X1T 6= X0
T ) = o(n−2) .
We begin with a uniform upper bound on the mixing times of Bi.
Claim 8.2.1. Let m ≤ (2N + 1) exp(n12
+2ε) and, for every i ≤ m, define the event
Υi =(tX0ti−1
mix (Bi) ∨ tX1ti−1
mix (Bi))≤ |E(Bi)|2e2(4`+fn) log q
, (8.2.2)
where the superscript Xω0ti−1
denotes boundary conditions induced by Xω0ti−1
on Bi.
Then
P( ⋃i≤m
Υci
). m(n2e−Cqfn + e−n/2) , (8.2.3)
where Cq > 0 is the constant given by Proposition 8.0.6.
Proof of Claim 8.2.1. Let Ξi be the event that the law of the boundary conditions
on Bi under X0ti−1
piecewise dominate/are dominated by wired/free-at-infinity, resp.,
and likewise for X1ti−1
. We prove inductively that for every m ≤ (2N+1) exp(n12
+2ε),
P( ⋃i≤m
(Υci ∪ Ξc
i)
). m(n2e−Cqfn + e−n/2) . (8.2.4)
The base case, m = 1, has boundary conditions that are wired on ∂wB1 and
free/wired on ∂n,s,eB1, and thus Υ1 ∩ Ξ1 holds with probability 1 by a canonical
paths estimate (there are no distinct bridges). Suppose now that (8.2.4) holds for
some m; to show that it holds for m + 1, it suffices to show that the boundary
conditions induced by X0tm , i.e., the chain X0
t (the bound for the chain X1t follows
283
symmetrically).
Assume that the event⋂i≤m(Υi ∩ Ξi) holds. First of all, we notice that for any
i satisfying Υi, by the sub-multiplicativity of total-variation distance and definition
of ti,
‖P(X0tiBi ∈ ·)− π
X0ti−1
Bi‖tv ≤ e−n ; (8.2.5)
thus, a union bound over all i ≤ m = O(N exp(n12
+2ε)) implies that we may
construct a coupling of (X0t ) and some random variables Z1, . . . , Zm such that
P( ⋂i≤m
X0tiBi = Zi
)≥ 1− e−n/2 where Zi ∼ π
X0ti−1
Bifor each i .
We now claim that the boundary conditions induced by X0tm on Bm+1 are such
that they piecewise dominate/are dominated by wired/free-at-infinity on ∂Bm+1.
Consider the case wherem is even (the casem is odd follows analogously). According
to Ξm−1, if we denote by ζ the boundary conditions induced by X0tm−2
on ∂Bm−1,
then ζ piecewise dominates/is dominated by wired/free-at-infinity. Hence, when
sampling from πζBm−1, there would be a well-defined FK order-disorder interface I
between the boundary subsets that are alternately wired and free. For every such
interface, by the domain Markov property, the marginal under πζBm−1on each of
the connected components of E(Bm) \ I either dominates wired-at-infinity or is
dominated by free-at-infinity.
As a consequence, the boundary conditions on ∂n,s,wBm are piecewise sampled
from distributions dominating/dominated by wired/free-at-infinity, as are the
boundary conditions on the vertical bisector of (Bm−1 \Bm), a subset of ∂wBm+1.
Repeating this reasoning for the update on Bm, we see that the boundary conditions
284
on ∂n,s,wBm+1 are all sampled from distributions that piecewise dominate/are
dominated by wired/free-at-infinity. Finally, the same is true of ∂eBm+1 as it is
either completely free if m ≤ 2N + 1 or it is similarly sampled from πX0tj−1
Bjfor some
j < m which likewise satisfied Ξj. Thus we deduce that, except with probability
e−n/2, Ξm+1 holds. Then by Proposition 8.0.6, the boundary condition on Bm+1
induced by X0tm is such that
P(tX0tm
mix (Bm+1) ≥ |E(Bm+1)|2e2(4`+fn) log q) ≤ O(n2e−Cqfn) .
A union bound over the above errors concludes the proof.
Henceforth, we suppose that the event Υi holds for all i ≤ (2N + 1) exp(n12
+2ε),
which is the case with probability 1− exp(−Ω(n12
+3ε)) since fn = n12
+3ε.
Note that every time increment of t2N+1 we make an independent attempt
at coupling X1t to X0
t , albeit with initial configurations induced by the chains
at the end of the last sweep, and once the two chains are coupled on all of Λ,
they will remain coupled for all subsequent times. We will show that there exists
c(δ, q) > 0 such that for every k and every two configurations ω1k = X1
t(k−1)(2N+1)and
ω0k = X0
t(k−1)(2N+1),
P(X0tk(2N+1)
= X1tk(2N+1)
| ω1k, ω
0k)
≥ P(Ac2kN+k−1 | ω1k, ω
0k)P(Ack(2N+1) | Ac2kN+k−1, ω
1k, ω
0k)
& exp(−cn12
+ε) , (8.2.6)
285
where, in analogy to the proof of Theorem 10, if Ri =⋃j≤i mod (2N+1)Bj,
A2i =X0t2iR2i−B2i+1−B2i+2
6= X1t2iR2i−B2i+1−B2i+2
.
Eq. (8.2.6) is sufficient because the probability of not coupling X0T and X1
T by
time T (as defined in (8.2.1)) would then be bounded by
P(
Bin(exp(n12
+2ε), exp(−cn12
+ε)) = 0)
= o(n−2) .
In order to lower bound the probability in (8.2.6), we will construct a monotone
coupling of the two chains; therefore it suffices to consider the wired and free initial
configurations; by the Markov property, it also suffices to only consider the first
sweep k = 1.
Recall from (8.0.2) that for any rectangle Efn is the set of boundary conditions
on Bi such that in every boundary segment of length fn, there is an edge with
at most one boundary component containing vertices on both sides of that edge
(bridge).
Claim 8.2.2. There exists c(δ, q) > 0 such that, for large K and c1, the following
holds.
(1) For every 1 ≤ i ≤ N , there exists an event Fi measurable w.r.t. (X0t , X
1t )t≤t2i
such that P(X0t2i∂w(B2i+1∪B2i+2) ∈ · | Fi) is in Efn and dominates the wired-at-
infinity distribution on boundary conditions on ∂w(B2i+1 ∪B2i+2).
(2) For every 1 ≤ i ≤ N , the above defined event Fi satisfies
P(Fi, Ac2i) & exp(−cin2ε) .
286
Proof of Claim 8.2.2. For the base case of (1), observe that the boundary condi-
tions on ∂w(B1 ∪B2) dominate wired-at-infinity as they are all-wired; the base case
of (2) follows as the proof of the inductive step does, so we do not repeat it here.
Now assume that items (1) and (2) hold for i− 1 (for c(q) > 0 to be determined
later) and show that they hold for i for the same choice of c. Consider the middle
rectangle
Di = J(i− 1)`, (i+ 12)`K× Jbn
2c − δn, bn
2c+ δnK ,
and let I be the interface bounding the cluster(s) of X0tj∂wBj . Define the events
Γ2i =X0t2iB2i
: I ∩ (B2i −B2i+2) = ∅,
and
Γ2i−1 =X0t2i−1B2i−1
: I ∩Di = ∅∩ Γ2i−1 ,
where Γ2i−1 is the event that I is connected to e1, e2 in B2i−1−Di, where e1, e2 are
a pair of edges in either side of ∂wB2i−1− ∂wDi, with at most one bridge over them
(such a pair of edges exist since by assumption (1), the boundary conditions on
B2i−1 are in Efn). It is clear that both Γ2i−1 and Γ2i are increasing events. Then,
we can lower bound
P(Ac2i, Fi) ≥ P(Ac2i−2, Fi−1)P(Ac2i ∩ Fi ∩ Γ2i−1 ∩ Γ2i
∣∣ Ac2i−2, Fi−1
). (8.2.7)
By the inductive hypothesis, P(Ac2i−2, Fi−1) & exp(−c(i− 1)n2ε) and from now
on work in the probability space conditioned on Fi−1 ∩ Ac2i−2. Under Fi−1 ∩ Ac2i−2,
the boundary conditions X0t2i−2∂w(B2i−1∪B2i)
and X1t2i−2∂w(B2i−1∪B2i)
are coupled and
287
“1”“1”
“1”Di
0
0
B2i
“1”
e2
e1
Di
B2i−1
0
Figure 8.5: Left: the event Γ2i−1 where the interface of the component of ∂wB2i−1
does not intersect Di, and is connected to two edges e1, e2 that have no bridges.Right: the event Γ2i where the interface of the component of ∂wB2i∪∂n,sDi does notintersect the dashed line. Overall, the intersection Γ2i−1 ∩ Γ2i pushes the interfaceforward by `n.
dominate wired-at-infinity (let ζ denote that random boundary condition). In
the following time increment [t2i−2, t2i−1), only updates on B2i−1 are permitted;
since we are working on the event Υ2i−1, we can couple X0t2i−1B2i−1
to πζ,0B2i−1and
X1t2i−1B2i−1
to πζ,1B2i−1with probability 1−O(e−n) by (8.2.5).
Thus, by monotonicity, we consider the probability of the event I ∩Di = ∅ in
Γ2i−1 holding for a sample from πζ,0B2i−1. We claim that for some c(δ, q) > 0,
E[πζ,0B2i−1(I ∩Di = ∅) | Fi−1]
≥ Eπ1Z2
[πξ,0B2i−1(I ∩Di = ∅)]
& π1,0Λ(1−2δ)n,3`
(I ∩ Jn2− δn, n
2+ δnK× J0, 5`
2K = ∅)− e−c` ,
where the expectation is over all boundary conditions ξ induced by π1Z2 on ∂wB2i−1
288
and (1, 0) boundary conditions denote wired on ∂sΛ1,0(1−2δ)n,3`. Indeed, the second
inequality follows from considering the `-enlargement E`,2i−1 of B2i−1 which is
its concentric rectangle with extra side length `. If there is a wired circuit in
E`,2i−1 −B2i−1 under π1Z2 (by (5.1.1) this has probability 1− e−c`), we can replace
the expectation over b.c. induced by π1Z2 with an expectation over b.c. induced
by π1E`,2i−1
. Then extending the free boundary conditions on the other three sides
of B2i−1 all the way to ∂wE`,2i−1 and rotating yields the second inequality. By
Proposition 8.0.5 with the choices h = 3` and ρ = 56, there exists c(δ, q) > 0 so
that the probability in the right-hand side above is at least order e−cn2ε
(see e.g.,
Lemma 7.2.8 and Fig. 7.2 for a similar monotonicity argument).
Moreover, by the exponential decay of dual-connectivities, it is clear that for any
ζ ∈ Efn , we have πζ,0B2i−1(Γ2i−1) ≥ η for some η(q) > 0. Thus by the FKG inequality,
E[πζ,0B2i−1(Γ2i−1)] & e−cN
2ε
.
Let I2i−1 be the interface revealed by the component of ∂wB2i−1. Observe that
because Γ2i−1 is an increasing event, conditioned on Γ2i−1, if we reveal I2i−1 from
east to west, under the monotone coupling of πζ,1B2i−1to πζ,0B2i−1
the same edges would
also be open under πζ,1B2i−1; the same is also true of the edges that constitute Γ2i−1.
Having revealed these sets of open edges under both πζ,1B2i−1and πζ,0B2i−1
, by the
domain Markov property (there can not be distinct bridges over the interface we
have revealed), X1t2i−1Di = X0
t2i−1Di with probability at least (1−O(e−n))e−cn
2ε.
Now consider the next time increment [t2i−1, t2i) on B2i. Under the above
events, the configuration X0t2i−1Di π1
Z2(·Di), whence by (5.1.1), with probability
at least 1 − e−cδn, there is a pair of primal horizontal crossings of the top and
289
bottom halves of Di connecting ∂wB2i−1 to I2i−1. In that case, the distribution on
boundary conditions induced by X0t2i−1
(as well as X1t2i−1
) on ∂n,sB2i∩Di dominates
wired-at-infinity. Again, since we are working under the event Υ2i, we just consider
the event in Γ2i under πζ,0B2i. By applying (5.1.1) and enlarging the domains under
consideration as in the earlier bound on Γ2i−1, we obtain for some c(δ, q) > 0,
P(
Γ2i | Ac2i−2,Γ2i−1, Fi−1
)≥ (1−O(e−n))Eπ1
Z2
[πξ,0,(i+1/2)`B2i
(Γ2i)]
≥ (1−O(e−n))Eπ1Z2
[π0,ξ
Λ(1−2δ)n,3`/2(I ∩ Λ(1−2δ)n,` = ∅)
]≥ (1−O(e−n))(π0,1
Λn,2`(I ∩ Λn,3`/2 = ∅)− e−c`) .
Here, the boundary conditions (ξ, 0) in the first line denote ξ (over which we take
an expectation) induced on ∂B2i ∩ (x, y) : x = (i + 12)`, and free elsewhere on
∂B2i, and the boundary conditions in the second and third lines denote free on ∂n
of the boundary and respectively ξ and wired elsewhere. The second inequality is a
simple consequence of monotonicity in boundary conditions and the third inequality
follows from enlarging B2i by `/2 up to an error of e−c` coming from (5.1.1). By
Lemma 8.0.4 with φ = 0 and b = 1, there exists c(δ, q) > 0 such that the probability
on the right-hand side above is bounded below by (1−O(e−n))(1− e−cn2ε). In that
case, revealing the interface from east to west, we can couple X0t2i
to X1t2i
beyond
the interface (see also Fig. 8.5), so
P(Ac2i | A2i−2, Fi−1) ≥ (1−O(e−n))(1− e−cn2ε
)e−cn2ε
& exp(−cn2ε) . (8.2.8)
Finally, we claim that under the intersection of all the above events, with
probability 1−O(e−c`)−O(n2e−Cqfn), the boundary conditions induced by X1/0t2i
290
on ∂wB2i+1 and ∂wB2i+2 are in Efn and dominate wired-at-infinity, which combined
with (8.2.8) defines the desired set set Fi such that
P(Ac2i, Fi | A2i−2, Fi−1) & e−cn2ε
.
Recall that the configuration on Di under Γ2i−1 and Fi−1 dominates π1Z2Di . Then,
with probability 1 − 2e−cδn12 +ε
, Di contains two horizontal crossings connecting
∂wDi to I2i−1; since we are also conditioning on Γ2i, averaging over configurations
on Di, with probability 1− 3e−cδn12 +ε
, ∂wB2i+2 is surrounded by a wired circuit in
X0t2i
. Similarly, under X0t2i
, conditional on Γ2i−1, Γ2i and Fi−1, the configuration on
D′i = Ji`, (i+ 2)`K× J0, δnK ∪ J(1− δ)n, nK
below the interface revealed by Γ2i dominates π1Z2 so that with probability 1 −
2e−cδn12 +ε
, there are horizontal primal connections to that interface in both halves of
D′i. In that case, averaging over configurations in D′i with probability 1− 3e−cδn12 +ε
,
there is a wired circuit around ∂wB2i+1 in X0t2i
so that the distribution over boundary
conditions induced on ∂wB2i+1 also dominates wired-at-infinity. Moreover, as seen
in the proof of Proposition 8.0.6, a boundary condition dominating wired-at-infinity
is in Efn with probability 1−O(n2e−Cqfn). A union bound over the above concludes
the proof.
As a result, by item (2) of Claim 8.2.2, there exists some c(δ, q) > 0 for which
P(Ac2N) & exp(− cNn2ε
)& exp
(− cn
12
+ε).
Moreover, on that event, with high probability, the boundary conditions on ∂wB2N+1
291
induced by both X12N and X0
2N dominate wired-at-infinity. On the event Υ2N+1,
with probability 1−O(e−n) one can couple X1t2N+1
and X0t2N+1
to agree on B2N+1,
leading the two chains to be coupled on all of Λ. (It is only at this final step where
there is a difference between the (p, 1, 0) and (p, 1) boundary conditions; clearly,
if the coupling on B2N+1 succeeds in the former situation, it also succeeds in the
latter.)
8.3 Slow mixing with phase-symmetric bound-
ary conditions
Although the proof of slow mixing on the torus at a discontinuous phase
transition relied heavily on the topology of the torus (see proof of Theorem 2 in
[42]) we can—at least for sufficiently large q—also prove slow mixing in the presence
of mixed wired-free boundary conditions. Exploiting the self-duality, we see that
such boundary conditions still exhibit an exponential bottleneck, slowing down the
FK Glauber and Swendsen–Wang dynamics.
Definition 8.3.1. Let an, bn, cn, dn ∈ ∂Λn,n be a set of marked vertices ordered
clockwise from the origin around ∂Λn,n (by rotational symmetry, without loss of
generality assume an ∈ ∂wΛn,n). The mixed boundary conditions on (an, bn, cn, dn)
are those that are wired on the clockwise boundary arcs (an, bn) and (cn, dn) and free
on (bn, cn) and (dn, an)—all connected subsets of ∂Λn,n. We say that (an, bn, cn, dn)
are ε-separated if an ∈ ∂wΛn,n, at least one of bn, cn, dn is not contained in
J0, εnK× J0, nK, and
mini,j∈a,b,c,d; i 6=j
‖in − jn‖∞ ≥ εn .
292
For a choice of (an, bn, cn, dn), denote the FK measure with mixed boundary, πmixedΛ .
Remark 8.3.2. The requirement of ε-separation in our consideration of mixed
boundary conditions arises from the fact that if (an, bn, cn, dn) were all on ∂wΛn,n
repeating the proof of Theorem 10 with such boundary conditions would yield that
the mixing time is in fact sub-exponential. Clearly, if the four marked vertices
are sufficiently close to being on one side or to each other, a similar picture would
emerge. The requirement of macroscopic separation ensures that the bottleneck is
exponential in n.
With Definition 8.3.1 in hand, to prove Theorem 9, by rotational symmetry, we
wish to prove the following: let q be large, ε > 0, and consider the FK Glauber
dynamics for the critical Potts model on Λ = Λn,n with mixed boundary on
(an, bn, cn, dn) that are ε-separated. Then there exists c(ε, q) > 0 such that
tmix & exp(cn) .
Proof of Theorem 9. By (1.2.2) and Fact 2.2.6, it suffices to prove the bound
for the FK Glauber dynamics with mixed FK boundary conditions to deduce the
matching bound for Swendsen–Wang with Potts boundary conditions that are red
on the boundary arcs (an, bn) and (cn, dn) ⊂ ∂Λn,n and free elsewhere. Observe
that by planarity,
(an, bn)←→ (cn, dn)
=
(bn, cn)∗←→ (dn, an)
c,
293
and therefore, either
πmixedΛ
((an, bn)←→ (cn, dn)
)≤ 1
2, or πmixed
Λ
((bn, cn)
∗←→ (dn, an))≤ 1
2.
By self-duality of the class of ε-separated, mixed boundary conditions, we can
suppose without loss of generality that we are in the former case.
Recall the definition of the strips Sb,h,φ,H±b,φ in Definition 8.0.1, and let an =
(a1n, a
2n), and likewise for bn, cn, dn. Then let φa,d = tan−1(d
2n−a2
n
d1n−a1
n) and φb,c =
tan−1( c2n−b2nc1n−b1n
). Observe that by the ε-separation of (an, bn, cn, dn), one of φa,d and
φb,c is in [−π2
+ δ, π2− δ] for some small enough δ > 0 depending only on ε. Suppose
without loss of generality that for some δ(ε) > 0, φa,d ∈ [−π2
+ δ, π2− δ] and consider
the strip
S = H+a1n,φa,d
∩H−a1n+ε2n,φa,d
∩ Λ ,
Geometrically, by definition of ε-separation, S satisfies S ∩ ∂Λ ⊂ (an, bn) ∪ (cn, dn).
There is some x, h, φ = φa,d such that S = Sx,h,φa,d ∩ Λ; fix that x ∈ R+, h = ε2/2.
Define ∂nS = S ∩H+x+h−1,φ and ∂sS = S ∩H−x−h+1,φ, and let
A =
(an, bn)S←→ (cn, dn)
be the bottleneck set whose conductance Q(A,Ac)/(π(A)π(Ac)) we bound. Since
A ⊂ (an, bn)←→ (cn, dn) ,
294
we have that πmixedΛ (Ac) > 1
2. Therefore, we can write
gaprc ≤ 2Φ? ≤2Q(A,Ac)
πmixedΛ (A)πmixed
Λ (Ac)≤ 4πmixed
Λ (∂A | A) ,
(where ∂A := ω : P (ω,Ac) > 0 and we used a worst-case bound of 1 on the
transition rates in Q(A,Ac)), in which case it suffices to prove that for some
c(q) > 0,
πmixedΛ (∂A | A) . exp(−cε4n) .
For ω ∈ A, in order for P (ω,Ac) to be positive (ω ∈ ∂A), there must exist an
edge e in S that is pivotal to A, i.e., ω(e) = 1 and ω′ = ω − e /∈ A. We estimate
the probability πmixedΛ (∂A | A) by taking a union bound over the probability of any
edge, e, in E(S) being pivotal to S.
First examine whether e is closer in its y coordinate to ∂nS or ∂sS. Suppose
without loss of generality, we are in the former case, whence we expose the north-
most primal crossing of S, under πmixedΛ (ω | A) (revealing, first, the configuration
on Λ ∩H+x+h−1,φ, then the dual-components of ∂nS in S). Such a crossing exists by
conditioning on A.
Denote by ζ the horizontal crossing we have revealed as such. By the conditioning
on S, it is clear that ζ must connect (an, bn) to (cn, dn) in S. In order for e to be
pivotal to S, e must be an open edge in ζ and there must exist a dual crossing
connecting e to ∂sS. Let D be the southern connected component of E(Λ)− ζ; we
wish to bound
πmixedΛ
(e
D∗←→ ∂sS | A, ζ, ωζ = 1).
295
By monotonicity in boundary conditions, if we let R = D ∪ S, for every such ζ,
πmixedΛ (ωD | A, ζ, ωζ = 1) = π1,0
D π1,0R (ωD) ,
where (1, 0) boundary conditions denote free on ∂R− S and wired elsewhere.
We can decompose the probability
π1,0R
(e
D∗←→ ∂sS)≤ π1,0
R
(e
∗←→ ∂sS)
into the event Γ1 that the dual-component of ∂sS (and thus the interface of R with
(1, 0) boundary conditions) is a subset of S ∩H−x−h/2,φ, and Γc1. Under Γc1, since e
is closer in its y-coordinate to ∂nS, the vertical distance between e and ∂sS is at
least h/2 so that e /∈ S ∩H−x−h/2,φ and e cannot be dual-connected to ∂sS.
Bounding the probability of Γ1 by monotonicity in boundary conditions and
Proposition 8.0.3, there exists c(q) > 0 such that for every ζ,
πmixedΛ (Γ1 | A, ζ, ωζ = 1) ≤ π
1,0,a1n,φ
Sa1n,h/2,φ
∩Λ(I 6⊂ Sa1n,h/4,φ
∩ Λ) . n2 exp(−ch2n) .
Under Γ1 we can take a worst case bound of one on the probability of eD∗←→ ∂sS.
Therefore, for some c(q) > 0, we have πmixedΛ (∂A | A) . exp(−cε4n).
Using the above as a bound on Q(A,Ac)/πmixedΛ (A) in Eq. (2.2.14) and plugging
into Eq. (2.2.14) implies gap−1rc & exp(cε4n) for the FK Glauber dynamics with
mixed order-disorder boundary conditions.
296
Bibliography
[1] M. Aizenman and R. Holley. Rapid convergence to equilibrium of stochastic
Ising models in the Dobrushin Shlosman regime. In Percolation theory and
ergodic theory of infinite particle systems (Minneapolis, Minn., 1984–1985),
volume 8 of IMA Vol. Math. Appl., pages 1–11. Springer, New York, 1987.
[2] K. S. Alexander. On weak mixing in lattice models. Probab. Theory Related
Fields, 110(4):441–471, 1998.
[3] K. S. Alexander. Mixing properties and exponential decay for lattice systems
in finite volumes. Ann. Probab., 32(1A):441–487, 01 2004.
[4] R. J. Baxter. Exactly solved models in statistical mechanics. Academic Press
Inc., London, 1989.
[5] V. Beffara and H. Duminil-Copin. The self-dual point of the two-dimensional
random-cluster model is critical for q ≥ 1. Probab. Theory Related Fields,
153(3-4):511–542, 2012.
[6] A. Belavin, A. Polyakov, and A. Zamolodchikov. Infinite conformal symmetry
in two-dimensional quantum field theory. Nuclear Physics B, 241(2):333 – 380,
1984.
297
[7] A. Blanca, P. Caputo, A. Sinclair, and E. Vigoda. Spatial mixing and non-
local markov chains. In Proceedings of the Twenty-Ninth Annual ACM-SIAM
Symposium on Discrete Algorithms, SODA ’18, pages 1965–1980, Philadelphia,
PA, USA, 2018. Society for Industrial and Applied Mathematics.
[8] A. Blanca, R. Gheissari, and E. Vigoda. Random-cluster dynamics in
Z2: rapid mixing with general boundary conditions. preprint available at
arXiv:1807.08722., 2018.
[9] A. Blanca and A. Sinclair. Random-cluster dynamics in Z2. Probab. Theory
Related Fields. to appear. Extended abstract appeared in Proc. of the 27th
Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2016), pp.
498–513.
[10] A. Blanca and A. Sinclair. Dynamics for the mean-field random-cluster
model. In Proc. of the 19th International Workshop on Randomization and
Computation (RANDOM 2015), pages 528–543, 2015.
[11] T. Bodineau and F. Martinelli. Some new results on the kinetic Ising model
in a pure phase. Journal of Statistical Physics, 109(1):207–235, Oct 2002.
[12] C. Borgs, J. Chayes, A. Frieze, J. H. Kim, P. Tetali, E. Vigoda, and V. H. Vu.
Torpid mixing of some Monte Carlo Markov chain algorithms in statistical
physics. In Proc. of the 40th Annual Symposium on Foundations of Computer
Science (FOCS 1999), pages 218–229, 1999.
[13] C. Borgs, J. T. Chayes, and P. Tetali. Tight bounds for mixing of the Swendsen-
Wang algorithm at the Potts transition point. Probab. Theory Related Fields,
152(3-4):509–557, 2012.
298
[14] F. Camia, C. Garban, and C. M. Newman. Planar Ising magnetization field I.
uniqueness of the critical scaling limit. Ann. Probab., 43(2):528–571, 03 2015.
[15] F. Camia and C. M. Newman. Two-dimensional critical percolation: the full
scaling limit. Communications in Mathematical Physics, 268(1):1–38, Nov
2006.
[16] F. Camia and C. M. Newman. Critical percolation exploration path and SLE6:
a proof of convergence. Probability Theory and Related Fields, 139(3):473–519,
Nov 2007.
[17] P. Caputo, F. Martinelli, F. Simenhaus, and F. L. Toninelli. Zero temperature
stochastic 3D Ising model and dimer covering fluctuations: A first step towards
interface mean curvature motion. Communications on Pure and Applied
Mathematics, 64(6):778–831, 2011.
[18] F. Cesi. Quasi-factorization of the entropy and logarithmic Sobolev inequalities
for Gibbs random fields. Probability Theory and Related Fields, 120(4):569–584,
Aug 2001.
[19] F. Cesi, G. Guadagni, F. Martinelli, and R. H. Schonmann. On the two-
dimensional stochastic Ising model in the phase coexistence region near the
critical point. J. Statist. Phys., 85(1-2):55–102, 1996.
[20] J. T. Chayes, L. Chayes, and R. H. Schonmann. Exponential decay of connec-
tivities in the two-dimensional Ising model. J. Statist. Phys., 49(3-4):433–445,
1987.
[21] L. Chayes and J. Machta. Graphical representations and cluster algorithms I.
299
Discrete spin systems. Physica A: Statistical Mechanics and its Applications,
239(4):542–601, 1997.
[22] D. Chelkak, H. Duminil-Copin, C. Hongler, A. Kemppainen, and S. Smirnov.
Convergence of Ising interfaces to Schramm’s SLE curves. Comptes Rendus
Mathmatiques de l’Acadmie des Sciences, 352(2):157–161, 2014.
[23] D. Chelkak, C. Hongler, and K. Izyurov. Conformal invariance of spin correla-
tions in the planar Ising model. Ann. of Math., 181(3):1087–1138, 2015.
[24] C. Cooper and A. M. Frieze. Mixing properties of the Swendsen–Wang process
on classes of graphs. Random Structures & Algorithms, 15(34):242–261, 1999.
[25] R. da Silva, N. A. Alves, and J. D. de Felcio. Mixed initial conditions to
estimate the dynamic critical exponent in short-time Monte Carlo simulation.
Physics Letters A, 298(5):325 – 329, 2002.
[26] L. de Arcangelis and N. Jan. The dynamic critical exponent of the q=3
and 4 state Potts model. Journal of Physics A: Mathematical and General,
19(18):L1179–L1183, dec 1986.
[27] P. Di Francesco, P. Mathieu, and D. Senechal. Conformal Field Theory.
Graduate texts in contemporary physics. Island Press, 1996.
[28] P. Diaconis and L. Saloff-Coste. Comparison theorems for reversible Markov
chains. Ann. Appl. Probab., 3(3):696–730, 1993.
[29] P. Diaconis and D. Stroock. Geometric bounds for eigenvalues of Markov
chains. Ann. Appl. Probab., 1(1):36–61, 1991.
300
[30] R. Dobrushin, R. Kotecky, and S. Shlosman. Wulff construction, volume 104
of Transl. Math. Monogr. Amer. Math. Soc., Providence, RI, 1992.
[31] E. Domany. Exact results for two- and three-dimensional Ising and Potts
models. Phys. Rev. Lett., 52:1731–1731, May 1984.
[32] H. Duminil Copin, M. Gagnebin, M. Harel, I. Manolescu, and V. Tassion.
Discontinuity of the phase transition for the planar random-cluster and Potts
models with q > 4. Preprint, available at arXiv:1611.09877 (2016).
[33] H. Duminil-Copin, C. Hongler, and P. Nolin. Connection probabilities and
RSW-type bounds for the two-dimensional FK Ising model. Comm. Pure Appl.
Math., 64(9):1165–1198, 2011.
[34] H. Duminil-Copin, V. Sidoravicius, and V. Tassion. Continuity of the phase
transition for planar random-cluster and Potts models with 1 ≤ q ≤ 4. Comm.
Math. Phys. to appear.
[35] R. G. Edwards and A. D. Sokal. Generalization of the Fortuin-Kasteleyn-
Swendsen-Wang representation and Monte Carlo algorithm. Phys. Rev. D (3),
38(6):2009–2012, 1988.
[36] C. M. Fortuin and P. W. Kasteleyn. On the random-cluster model. I. Intro-
duction and relation to other models. Physica, 57:536–564, 1972.
[37] C. M. Fortuin, P. W. Kasteleyn, and J. Ginibre. Correlation inequalities on
some partially ordered sets. Comm. Math. Phys., 22:89–103, 1971.
[38] A. Galanis, D. Stefankovic, and E. Vigoda. Swendsen-Wang Algorithm on
301
the Mean-Field Potts Model. In Proc. of the 19th International Workshop on
Randomization and Computation (RANDOM 2015), pages 815–828, 2015.
[39] T. M. Garoni, G. Ossola, M. Polin, and A. D. Sokal. Dynamic critical behavior
of the chayes–machta algorithm for the random-cluster model, i. two dimensions.
Journal of Statistical Physics, 144(3):459–518, 2011.
[40] R. Gheissari, C. Hongler, and S. C. Park. Ising model: Local spin correlations
and conformal invariance. Communications in Mathematical Physics, 2019.
[41] R. Gheissari and E. Lubetzky. The effect of boundary conditions on mixing
of 2D Potts models at discontinuous phase transitions. Electron. J. Probab.,
23:30 pp., 2018.
[42] R. Gheissari and E. Lubetzky. Mixing times of critical two-dimensional Potts
models. Comm. Pure Appl. Math, 71(5):994–1046, 2018.
[43] R. Gheissari and E. Lubetzky. Quasi-polynomial mixing of critical 2D random
cluster models. Random Structures and Algorithms, 2019. to appear.
[44] R. Gheissari, E. Lubetzky, and Y. Peres. Exponentially slow mixing in the
mean-field Swendsen–Wang dynamics. Annales de l’Institut Henri Poincare
(B). to appear. Extended abstract appeared in Twenty-Ninth Annual ACM-
SIAM Symposium on Discrete Algorithms (SODA 2018), pp. 1981–1988.
[45] R. J. Glauber. Time-dependent statistics of the Ising model. J. Mathematical
Phys., 4:294–307, 1963.
[46] V. K. Gore and M. R. Jerrum. The Swendsen-Wang process does not always
mix rapidly. J. Statist. Phys., 97(1-2):67–86, 1999.
302
[47] G. Grimmett. The random-cluster model. In Probability on discrete structures,
volume 110 of Encyclopaedia Math. Sci., pages 73–123. Springer, Berlin, 2004.
[48] H. Guo and M. Jerrum. Random cluster dynamics for the Ising model is
rapidly mixing. In Proc. of the 28th Annual ACM-SIAM Symposium on
Discrete Algorithms (SODA 2017). To appear.
[49] P. C. Hohenberg and B. I. Halperin. Theory of dynamic critical phenomena.
Rev. Mod. Phys., 49:435–479, Jul 1977.
[50] R. Holley. On the Asymptotics of the Spin-Spin Autocorrelation Function
In Stochastic Ising Models Near the Critical Temperature, pages 89–104.
Birkhauser Boston, Boston, MA, 1991.
[51] C. Hongler and S. Smirnov. The energy density in the planar Ising model.
Acta Math., 211(2):191–225, 2013.
[52] D. A. Huse and D. S. Fisher. Dynamics of droplet fluctuations in pure and
random Ising systems. Phys. Rev. B, 35:6841–6846, May 1987.
[53] M. Jerrum and A. Sinclair. Approximating the permanent. SIAM J. Comput.,
18(6):1149–1178, 1989.
[54] H. Kesten. The critical probability of bond percolation on the square lattice
equals 1/2. Communications in Mathematical Physics, 74(1):41–59, Feb 1980.
[55] R. Kotecky and S. B. Shlosman. First-order phase transitions in large entropy
lattice models. Comm. Math. Phys., 83(4):493–515, 1982.
[56] H. A. Kramers and G. H. Wannier. Statistics of the two-dimensional ferromag-
net. part I. Phys. Rev., 60:252–262, Aug 1941.
303
[57] L. Laanait, A. Messager, S. Miracle-Sole, J. Ruiz, and S. Shlosman. Inter-
faces in the Potts model. I. Pirogov-Sinai theory of the Fortuin-Kasteleyn
representation. Comm. Math. Phys., 140(1):81–91, 1991.
[58] L. Laanait, A. Messager, and J. Ruiz. Phases coexistence and surface tensions
for the Potts model. Comm. Math. Phys., 105(4):527–545, 1986.
[59] K. B. Lauritsen and H. C. Fogedby. Critical exponents from power spectra.
Journal of Statistical Physics, 72(1):189–205, Jul 1993.
[60] D. A. Levin, Y. Peres, and E. L. Wilmer. Markov chains and mixing times.
American Mathematical Society, Providence, RI, 2009. With a chapter by
James G. Propp and David B. Wilson.
[61] X.-J. Li and A. D. Sokal. Rigorous lower bound on the dynamic critical
exponent of some multilevel Swendsen-Wang algorithms. Phys. Rev. Lett.,
67(12):1482–1485, 1991.
[62] S. L. Lu and H.-T. Yau. Spectral gap and logarithmic Sobolev inequality for
Kawasaki and Glauber dynamics. Comm. Math. Phys., 156(2):399–433, 1993.
[63] E. Lubetzky, F. Martinelli, A. Sly, and F. L. Toninelli. Quasi-polynomial
mixing of the 2D stochastic Ising model with “plus” boundary up to criticality.
J. Eur. Math. Soc. (JEMS), 15(2):339–386, 2013.
[64] E. Lubetzky and A. Sly. Critical Ising on the square lattice mixes in polynomial
time. Comm. Math. Phys., 313(3):815–836, 2012.
[65] E. Lubetzky and A. Sly. Cutoff for the Ising model on the lattice. Invent.
Math., 191(3):719–755, 2013.
304
[66] F. Martinelli. On the two-dimensional dynamical Ising model in the phase
coexistence region. J. Statist. Phys., 76(5-6):1179–1246, 1994.
[67] F. Martinelli. Lectures on Glauber dynamics for discrete spin models. In
Lectures on probability theory and statistics (Saint-Flour, 1997), volume 1717
of Lecture Notes in Math., pages 93–191. Springer, Berlin, 1999.
[68] F. Martinelli and E. Olivieri. Approach to equilibrium of Glauber dynamics in
the one phase region. I. The attractive case. Comm. Math. Phys., 161(3):447–
486, 1994.
[69] F. Martinelli, E. Olivieri, and R. H. Schonmann. For 2-D lattice spin systems
weak mixing implies strong mixing. Comm. Math. Phys., 165(1):33–47, 1994.
[70] F. Martinelli and F. L. Toninelli. On the mixing time of the 2D stochastic
Ising model with “plus” boundary conditions at low temperature. Comm.
Math. Phys., 296(1):175–213, 2010.
[71] B. McCoy and T. Wu. The two-dimensional Ising model. Harvard University
Press, 1973.
[72] A. Messager, S. Miracle-Sole, J. Ruiz, and S. Shlosman. Interfaces in the Potts
model. II. Antonov’s rule and rigidity of the order disorder interface. Comm.
Math. Phys., 140(2):275–290, 1991.
[73] E. Mossel and A. Sly. Exact thresholds for Ising–Gibbs samplers on general
graphs. Ann. Probab., 41(1):294–328, 01 2013.
[74] M. P. Nightingale and H. W. J. Blote. Monte carlo computation of correlation
305
times of independent relaxation modes at criticality. Phys. Rev. B, 62:1089–
1101, Jul 2000.
[75] L. Onsager. Crystal statistics. I. a two-dimensional model with an order-
disorder transition. Phys. Rev., 65:117–149, Feb 1944.
[76] Y. Peres and P. Winkler. Can extra updates delay mixing? Comm. Math.
Phys., 323(3):1007–1016, 2013.
[77] S. A. Pirogov and J. G. Sinaı. Phase diagrams of classical lattice systems.
(Continuation). Teoret. Mat. Fiz., 26(1):61–76, 1976.
[78] S. A. Pirogov and Y. G. Sinai. Phase diagrams of classical lattice systems.
Theoretical and Mathematical Physics, 25(3):1185–1192, Dec 1975.
[79] A. Ponitz and P. Tittmann. Improved upper bounds for self-avoiding walks in
Zd. Electron. J. Combin., 7:Research Paper 21, 10 pp. (electronic), 2000.
[80] S. Rohde and O. Schramm. Basic properties of SLE. Ann. of Math. (2),
161(2):883–924, 2005.
[81] L. Russo. A note on percolation. Z. Wahrscheinlichkeitstheorie und Verw.
Gebiete, 43(1):39–48, 1978.
[82] J. Salas and A. D. Sokal. Logarithmic corrections and finite-size scaling in the
two-dimensional 4-state Potts model. J. Statist. Phys., 88(3-4):567–615, 1997.
[83] O. Schramm. Scaling limits of loop-erased random walks and uniform spanning
trees. Israel Journal of Mathematics, 118(1):221–288, Dec 2000.
306
[84] O. Schramm. Conformally invariant scaling limits: an overview and a collection
of problems. In International Congress of Mathematicians. Vol. I, pages 513–
543. Eur. Math. Soc., Zurich, 2007.
[85] P. D. Seymour and D. J. A. Welsh. Percolation probabilities on the square
lattice. Ann. Discrete Math., 3:227–245, 1978. Advances in graph theory
(Cambridge Combinatorial Conf., Trinity College, Cambridge, 1977).
[86] A. Sinclair. Improved bounds for mixing rates of Markov chains and multi-
commodity flow. Combin. Probab. Comput., 1(4):351–370, 1992.
[87] S. Smirnov. Critical percolation in the plane: conformal invariance, cardy’s
formula, scaling limits. Comptes Rendus de l’Acadmie des Sciences - Series I -
Mathematics, 333(3):239 – 244, 2001.
[88] S. Smirnov. Conformal invariance in random cluster models. I. Holomorphic
fermions in the Ising model. Ann. of Math. (2), 172(2):1435–1467, 2010.
[89] A. Sokal. Monte Carlo methods in statistical mechanics: foundations and new
algorithms. In Functional integration (Cargese, 1996), volume 361 of NATO
Adv. Sci. Inst. Ser. B Phys., pages 131–192. Plenum, New York, 1997.
[90] R. H. Swendsen and J.-S. Wang. Nonuniversal critical dynamics in Monte
Carlo simulations. Phys. Rev. Lett., 58:86–88, Jan 1987.
[91] M. Ullrich. Comparison of Swendsen-Wang and heat-bath dynamics. Random
Structures Algorithms, 42(4):520–535, 2013.
[92] M. Ullrich. Rapid mixing of Swendsen-Wang dynamics in two dimensions.
Dissertationes Math. (Rozprawy Mat.), 502:64, 2014.
307
[93] F.-G. Wang and C.-K. Hu. Universality in dynamic critical phenomena. Phys.
Rev. E, 56:2310–2313, Aug 1997.
[94] J.-S. Wang and R. H. Swendsen. Cluster Monte Carlo algorithms. Phys. A,
167(3):565–579, 1990.
[95] F. Y. Wu. The Potts model. Rev. Mod. Phys., 54:235–268, Jan 1982.
308