-
Critical wetting in the (2+1)D Solid-On-Solid model
Authors:
Joseph Chen,
Reza Gheissari,
Eyal Lubetzky
Abstract:
In this note, we study the low temperature $(2+1)$D SOS interface above a hard floor with critical pinning potential $λ_w= \log (\frac{1}{1-e^{-4β}})$. At $λ<λ_w$ entropic repulsion causes the surface to delocalize and be rigid at height $\frac1{4β}\log n+O(1)$; at $λ>λ_w$ it is localized at some $O(1)$ height. We show that at $λ=λ_w$, there is delocalization, with rigidity now at height…
▽ More
In this note, we study the low temperature $(2+1)$D SOS interface above a hard floor with critical pinning potential $λ_w= \log (\frac{1}{1-e^{-4β}})$. At $λ<λ_w$ entropic repulsion causes the surface to delocalize and be rigid at height $\frac1{4β}\log n+O(1)$; at $λ>λ_w$ it is localized at some $O(1)$ height. We show that at $λ=λ_w$, there is delocalization, with rigidity now at height $\lfloor \frac1{6β}\log n+\frac13\rfloor$, confirming a conjecture of Lacoin.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Mean-field Potts and random-cluster dynamics from high-entropy initializations
Authors:
Antonio Blanca,
Reza Gheissari,
Xusheng Zhang
Abstract:
A common obstruction to efficient sampling from high-dimensional distributions is the multimodality of the target distribution because Markov chains may get trapped far from stationarity. Still, one hopes that this is only a barrier to the mixing of Markov chains from worst-case initializations and can be overcome by choosing high-entropy initializations, e.g., a product or weakly correlated distr…
▽ More
A common obstruction to efficient sampling from high-dimensional distributions is the multimodality of the target distribution because Markov chains may get trapped far from stationarity. Still, one hopes that this is only a barrier to the mixing of Markov chains from worst-case initializations and can be overcome by choosing high-entropy initializations, e.g., a product or weakly correlated distribution. Ideally, from such initializations, the dynamics would escape from the saddle points separating modes quickly and spread its mass between the dominant modes.
In this paper, we study convergence from high-entropy initializations for the random-cluster and Potts models on the complete graph -- two extensively studied high-dimensional landscapes that pose many complexities like discontinuous phase transitions and asymmetric metastable modes. We study the Chayes--Machta and Swendsen--Wang dynamics for the mean-field random-cluster model and the Glauber dynamics for the Potts model. We sharply characterize the set of product measure initializations from which these Markov chains mix rapidly, even though their mixing times from worst-case initializations are exponentially slow. Our proofs require careful approximations of projections of high-dimensional Markov chains (which are not themselves Markovian) by tractable 1-dimensional random processes, followed by analysis of the latter's escape from saddle points separating stable modes.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
Finding planted cliques using Markov chain Monte Carlo
Authors:
Reza Gheissari,
Aukosh Jagannath,
Yiming Xu
Abstract:
The planted clique problem is a paradigmatic model of statistical-to-computational gaps: the planted clique is information-theoretically detectable if its size $k\ge 2\log_2 n$ but polynomial-time algorithms only exist for the recovery task when $k= Ω(\sqrt{n})$. By now, there are many simple and fast algorithms that succeed as soon as $k = Ω(\sqrt{n})$. Glaringly, however, no MCMC approach to the…
▽ More
The planted clique problem is a paradigmatic model of statistical-to-computational gaps: the planted clique is information-theoretically detectable if its size $k\ge 2\log_2 n$ but polynomial-time algorithms only exist for the recovery task when $k= Ω(\sqrt{n})$. By now, there are many simple and fast algorithms that succeed as soon as $k = Ω(\sqrt{n})$. Glaringly, however, no MCMC approach to the problem had been shown to work, including the Metropolis process on cliques studied by Jerrum since 1992. In fact, Chen, Mossel, and Zadik recently showed that any Metropolis process whose state space is the set of cliques fails to find any sub-linear sized planted clique in polynomial time if initialized naturally from the empty set. Here, we redeem MCMC performance for the planted clique problem by relaxing the state space to all vertex subsets and adding a corresponding energy penalty for missing edges. With that, we prove that energy-minimizing Markov chains (gradient descent and a low-temperature relaxation of it) succeed at recovering planted cliques of size $k = Ω(\sqrt{n})$ if initialized from the full graph. Importantly, initialized from the empty set, the relaxation still does not help the gradient descent find sub-linear planted cliques. We also demonstrate robustness of these Markov chain approaches under a natural contamination model.
△ Less
Submitted 13 November, 2023;
originally announced November 2023.
-
Fast relaxation of the random field Ising dynamics
Authors:
Ahmed El Alaoui,
Ronen Eldan,
Reza Gheissari,
Arianna Piana
Abstract:
We study the convergence properties of Glauber dynamics for the random field Ising model (RFIM) with ferromagnetic interactions on finite domains of $\mathbb{Z}^d$, $d \ge 2$. Of particular interest is the Griffiths phase where correlations decay exponentially fast in expectation over the quenched disorder, but there exist arbitrarily large islands of weak fields where low-temperature behavior is…
▽ More
We study the convergence properties of Glauber dynamics for the random field Ising model (RFIM) with ferromagnetic interactions on finite domains of $\mathbb{Z}^d$, $d \ge 2$. Of particular interest is the Griffiths phase where correlations decay exponentially fast in expectation over the quenched disorder, but there exist arbitrarily large islands of weak fields where low-temperature behavior is observed. Our results are twofold:
1. Under weak spatial mixing (boundary-to-bulk exponential decay of correlations) in expectation, we show that the dynamics satisfy a weak Poincaré inequality -- equivalent to large-set expansion -- implying algebraic relaxation to equilibrium over timescales polynomial in the volume $N$ of the domain, and polynomial time mixing from a warm start. From this we construct a polynomial-time approximate sampling algorithm based on running Glauber dynamics over an increasing sequence of approximations of the domain.
2. Under strong spatial mixing (exponential decay of correlations even near boundary pinnings) in expectation, we prove a full Poincaré inequality, implying exponential relaxation to equilibrium and $N^{o(1)}$-mixing time. Note by way of example, both weak and strong spatial mixing hold at any temperature, provided the external fields are strong enough.
Our proofs combine a stochastic localization technique which has the effect of increasing the variance of the field, with a field-dependent coarse graining which controls the resulting sub-critical percolation process of sites with weak fields.
△ Less
Submitted 10 November, 2023;
originally announced November 2023.
-
High-dimensional SGD aligns with emerging outlier eigenspaces
Authors:
Gerard Ben Arous,
Reza Gheissari,
Jiaoyang Huang,
Aukosh Jagannath
Abstract:
We rigorously study the joint evolution of training dynamics via stochastic gradient descent (SGD) and the spectra of empirical Hessian and gradient matrices. We prove that in two canonical classification tasks for multi-class high-dimensional mixtures and either 1 or 2-layer neural networks, the SGD trajectory rapidly aligns with emerging low-rank outlier eigenspaces of the Hessian and gradient m…
▽ More
We rigorously study the joint evolution of training dynamics via stochastic gradient descent (SGD) and the spectra of empirical Hessian and gradient matrices. We prove that in two canonical classification tasks for multi-class high-dimensional mixtures and either 1 or 2-layer neural networks, the SGD trajectory rapidly aligns with emerging low-rank outlier eigenspaces of the Hessian and gradient matrices. Moreover, in multi-layer settings this alignment occurs per layer, with the final layer's outlier eigenspace evolving over the course of training, and exhibiting rank deficiency when the SGD converges to sub-optimal classifiers. This establishes some of the rich predictions that have arisen from extensive numerical studies in the last decade about the spectra of Hessian and information matrices over the course of training in overparametrized networks.
△ Less
Submitted 4 October, 2023;
originally announced October 2023.
-
Metastability cascades and prewetting in the SOS model
Authors:
Reza Gheissari,
Eyal Lubetzky
Abstract:
We study Glauber dynamics for the low temperature $(2+1)$D Solid-On-Solid model on a box of side-length $n$ with a floor at height $0$ (inducing entropic repulsion) and a competing bulk external field $λ$ pointing down (the prewetting problem). In 1996, Cesi and Martinelli showed that if the inverse-temperature $β$ is large enough, then along a decreasing sequence of critical points…
▽ More
We study Glauber dynamics for the low temperature $(2+1)$D Solid-On-Solid model on a box of side-length $n$ with a floor at height $0$ (inducing entropic repulsion) and a competing bulk external field $λ$ pointing down (the prewetting problem). In 1996, Cesi and Martinelli showed that if the inverse-temperature $β$ is large enough, then along a decreasing sequence of critical points $(λ_c^{(k)})_{k=0}^{K_β}$ the dynamics is torpid: its inverse spectral gap is $O(1)$ when $λ\in (λ_c^{(k+1)},λ_c^{(k)})$ whereas it is $\exp[Θ(n)]$ at each $λ_c^{(k)}$ for each $k\leq K_β$, due to a coexistence of rigid phases at heights $k+1$ and $k$. Our focus is understanding (a) the onset of metastability as $λ_n\uparrowλ_c^{(k)}$; and (b) the effect of an unbounded number of layers, as we remove the restriction $k\le K_β$, and even allow for $λ_n\to 0$ towards the $λ= 0$ case which has $O(\log n)$ layers and was studied by Caputo et al. (2014). We show that for any $k$, possibly growing with $n$, the inverse gap is $\exp[\tildeΘ(1/|λ_n-λ_c^{(k)}|)]$ as $λ\uparrow λ_c^{(k)}$ up to distance $n^{-1+o(1)}$ from this critical point, due to a metastable layer at height $k$ on the way to forming the desired layer at height $k+1$. By taking $λ_n = n^{-α}$ (corresponding to $k_n\asymp \log n$), this also interpolates down to the behavior of the dynamics when $λ=0$. We compliment this by extending the fast mixing to all $λ$ uniformly bounded away from $(λ_c^{(k)})_{k=0}^\infty$. Together, these results provide a sharp understanding of the predicted infinite sequence of dynamical phase transitions governed by the layering phenomenon.
△ Less
Submitted 31 July, 2023;
originally announced July 2023.
-
On the tractability of sampling from the Potts model at low temperatures via Swendsen--Wang dynamics
Authors:
Antonio Blanca,
Reza Gheissari
Abstract:
Sampling from the $q$-state ferromagnetic Potts model is a fundamental question in statistical physics, probability theory, and theoretical computer science. On general graphs, this problem is computationally hard, and this hardness holds at arbitrarily low temperatures. At the same time, in recent years, there has been significant progress showing the existence of low-temperature sampling algorit…
▽ More
Sampling from the $q$-state ferromagnetic Potts model is a fundamental question in statistical physics, probability theory, and theoretical computer science. On general graphs, this problem is computationally hard, and this hardness holds at arbitrarily low temperatures. At the same time, in recent years, there has been significant progress showing the existence of low-temperature sampling algorithms in various specific families of graphs. Our aim in this paper is to understand the minimal structural properties of general graphs that enable polynomial-time sampling from the $q$-state ferromagnetic Potts model at low temperatures. We study this problem from the perspective of the widely-used Swendsen--Wang dynamics and the closely related random-cluster dynamics.
Our results demonstrate that the key graph property behind fast or slow convergence time for these dynamics is whether the independent edge-percolation on the graph admits a strongly supercritical phase. By this, we mean that at large $p<1$, it has a unique giant component of linear size, and the complement of that giant component is comprised of only small components. Specifically, we prove that such a condition implies fast mixing of the Swendsen--Wang and random-cluster dynamics on two general families of bounded-degree graphs: (a) graphs of at most stretched-exponential volume growth and (b) locally treelike graphs. In the other direction, we show that, even among graphs in those families, these Markov chains can converge exponentially slowly at arbitrarily low temperatures if the edge-percolation condition does not hold. In the process, we develop new tools for the analysis of non-local Markov chains, including a framework to bound the speed of disagreement propagation in the presence of long-range correlations, and an understanding of spatial mixing properties on trees with random boundary conditions.
△ Less
Submitted 6 April, 2023;
originally announced April 2023.
-
Spatial mixing and the random-cluster dynamics on lattices
Authors:
Reza Gheissari,
Alistair Sinclair
Abstract:
An important paradigm in the understanding of mixing times of Glauber dynamics for spin systems is the correspondence between spatial mixing properties of the models and bounds on the mixing time of the dynamics. This includes, in particular, the classical notions of weak and strong spatial mixing, which have been used to show the best known mixing time bounds in the high-temperature regime for th…
▽ More
An important paradigm in the understanding of mixing times of Glauber dynamics for spin systems is the correspondence between spatial mixing properties of the models and bounds on the mixing time of the dynamics. This includes, in particular, the classical notions of weak and strong spatial mixing, which have been used to show the best known mixing time bounds in the high-temperature regime for the Glauber dynamics for the Ising and Potts models.
Glauber dynamics for the random-cluster model does not naturally fit into this spin systems framework because its transition rules are not local. In this paper, we present various implications between weak spatial mixing, strong spatial mixing, and the newer notion of spatial mixing within a phase, and mixing time bounds for the random-cluster dynamics in finite subsets of $\mathbb Z^d$ for general $d\ge 2$. These imply a host of new results, including optimal $O(N\log N)$ mixing for the random cluster dynamics on torii and boxes on $N$ vertices in $\mathbb Z^d$ at all high temperatures and at sufficiently low temperatures, and for large values of $q$ quasi-polynomial (or quasi-linear when $d=2$) mixing time bounds from random phase initializations on torii at the critical point (where by contrast the mixing time from worst-case initializations is exponentially large). In the same parameter regimes, these results translate to fast sampling algorithms for the Potts model on $\mathbb Z^d$ for general $d$.
△ Less
Submitted 4 October, 2023; v1 submitted 22 July, 2022;
originally announced July 2022.
-
High-dimensional limit theorems for SGD: Effective dynamics and critical scaling
Authors:
Gerard Ben Arous,
Reza Gheissari,
Aukosh Jagannath
Abstract:
We study the scaling limits of stochastic gradient descent (SGD) with constant step-size in the high-dimensional regime. We prove limit theorems for the trajectories of summary statistics (i.e., finite-dimensional functions) of SGD as the dimension goes to infinity. Our approach allows one to choose the summary statistics that are tracked, the initialization, and the step-size. It yields both ball…
▽ More
We study the scaling limits of stochastic gradient descent (SGD) with constant step-size in the high-dimensional regime. We prove limit theorems for the trajectories of summary statistics (i.e., finite-dimensional functions) of SGD as the dimension goes to infinity. Our approach allows one to choose the summary statistics that are tracked, the initialization, and the step-size. It yields both ballistic (ODE) and diffusive (SDE) limits, with the limit depending dramatically on the former choices. We show a critical scaling regime for the step-size, below which the effective ballistic dynamics matches gradient flow for the population loss, but at which, a new correction term appears which changes the phase diagram. About the fixed points of this effective dynamics, the corresponding diffusive limits can be quite complex and even degenerate. We demonstrate our approach on popular examples including estimation for spiked matrix and tensor models and classification via two-layer networks for binary and XOR-type Gaussian mixture models. These examples exhibit surprising phenomena including multimodal timescales to convergence as well as convergence to sub-optimal solutions with probability bounded away from zero from random (e.g., Gaussian) initializations. At the same time, we demonstrate the benefit of overparametrization by showing that the latter probability goes to zero as the second layer width grows.
△ Less
Submitted 17 August, 2023; v1 submitted 8 June, 2022;
originally announced June 2022.
-
Entropic repulsion of 3D Ising interfaces conditioned to stay above a floor
Authors:
Reza Gheissari,
Eyal Lubetzky
Abstract:
We study the interface of the Ising model in a box of side-length $n$ in $\mathbb Z^3$ at low temperature $1/β$ under Dobrushin's boundary conditions, conditioned to stay in a half-space above height $h$ (a hard floor). Without this conditioning, Dobrushin showed in 1972 that typically most of the interface is flat at height $0$. With the floor, for small $h$, the model is expected to exhibit {\it…
▽ More
We study the interface of the Ising model in a box of side-length $n$ in $\mathbb Z^3$ at low temperature $1/β$ under Dobrushin's boundary conditions, conditioned to stay in a half-space above height $h$ (a hard floor). Without this conditioning, Dobrushin showed in 1972 that typically most of the interface is flat at height $0$. With the floor, for small $h$, the model is expected to exhibit {\it entropic repulsion}, where the typical height of the interface lifts off of $0$. Detailed understanding of the SOS model -- a more tractable height function approximation of 3D Ising -- due to Caputo et al., suggests that there is a single integer value $-h_n^* \sim -c\log n$ of the floor height, delineating the transition between rigidity at height $0$ and entropic repulsion.
We identify an explicit $h_n^*=( c_\star+o(1))\log n$ such that, for the typical Ising interface above a hard floor at $h$, all but an $ε(β)$-fraction of the sites are propelled to be above height $0$ if $h < h_n^*-1$, whereas all but an $ε(β)$-fraction of the sites remain at height $0$ if $h\geq h_n^*$. Further, $c_\star$ is such that the typical height of the unconditional maximum is $(2c_\star + o(1))\log n$; this confirms scaling predictions from the SOS approximation.
△ Less
Submitted 31 July, 2023; v1 submitted 9 December, 2021;
originally announced December 2021.
-
Cutoff for the Glauber dynamics of the lattice free field
Authors:
Shirshendu Ganguly,
Reza Gheissari
Abstract:
The Gaussian Free Field (GFF) is a canonical random surface in probability theory generalizing Brownian motion to higher dimensions. In two dimensions, it is critical in several senses, and is expected to be the universal scaling limit of a host of random surface models in statistical physics. It also arises naturally as the stationary solution to the stochastic heat equation with additive noise.…
▽ More
The Gaussian Free Field (GFF) is a canonical random surface in probability theory generalizing Brownian motion to higher dimensions. In two dimensions, it is critical in several senses, and is expected to be the universal scaling limit of a host of random surface models in statistical physics. It also arises naturally as the stationary solution to the stochastic heat equation with additive noise. Focusing on the dynamical aspects of the corresponding universality class, we study the mixing time, i.e., the rate of convergence to stationarity, for the canonical prelimiting object, namely the discrete Gaussian free field (DGFF), evolving along the (heat-bath) Glauber dynamics. While there have been significant breakthroughs made in the study of cutoff for Glauber dynamics of random curves, analogous sharp mixing bounds for random surface evolutions have remained elusive. In this direction, we establish that on a box of side-length $n$ in $\mathbb Z^2$, when started out of equilibrium, the Glauber dynamics for the DGFF exhibit cutoff at time $\frac{2}{π^2}n^2 \log n$.
△ Less
Submitted 24 February, 2023; v1 submitted 17 August, 2021;
originally announced August 2021.
-
Sampling from Potts on random graphs of unbounded degree via random-cluster dynamics
Authors:
Antonio Blanca,
Reza Gheissari
Abstract:
We consider the problem of sampling from the ferromagnetic Potts and random-cluster models on a general family of random graphs via the Glauber dynamics for the random-cluster model. The random-cluster model is parametrized by an edge probability $p \in (0,1)$ and a cluster weight $q > 0$. We establish that for every $q\ge 1$, the random-cluster Glauber dynamics mixes in optimal $Θ(n\log n)$ steps…
▽ More
We consider the problem of sampling from the ferromagnetic Potts and random-cluster models on a general family of random graphs via the Glauber dynamics for the random-cluster model. The random-cluster model is parametrized by an edge probability $p \in (0,1)$ and a cluster weight $q > 0$. We establish that for every $q\ge 1$, the random-cluster Glauber dynamics mixes in optimal $Θ(n\log n)$ steps on $n$-vertex random graphs having a prescribed degree sequence with bounded average branching $γ$ throughout the full high-temperature uniqueness regime $p<p_u(q,γ)$.
The family of random graph models we consider includes the Erdős--Rényi random graph $G(n,γ/n)$, and so we provide the first polynomial-time sampling algorithm for the ferromagnetic Potts model on Erdős--Rényi random graphs for the full tree uniqueness regime. We accompany our results with mixing time lower bounds (exponential in the largest degree) for the Potts Glauber dynamics, in the same settings where our $Θ(n \log n)$ bounds for the random-cluster Glauber dynamics apply. This reveals a novel and significant computational advantage of random-cluster based algorithms for sampling from the Potts model at high temperatures.
△ Less
Submitted 24 February, 2023; v1 submitted 21 July, 2021;
originally announced July 2021.
-
Low-temperature Ising dynamics with random initializations
Authors:
Reza Gheissari,
Alistair Sinclair
Abstract:
It is well known that Glauber dynamics on spin systems typically suffer exponential slowdowns at low temperatures. This is due to the emergence of multiple metastable phases in the state space, separated by narrow bottlenecks that are hard for the dynamics to cross. It is a folklore belief that if the dynamics is initialized from an appropriate random mixture of ground states, one for each phase,…
▽ More
It is well known that Glauber dynamics on spin systems typically suffer exponential slowdowns at low temperatures. This is due to the emergence of multiple metastable phases in the state space, separated by narrow bottlenecks that are hard for the dynamics to cross. It is a folklore belief that if the dynamics is initialized from an appropriate random mixture of ground states, one for each phase, then convergence to the Gibbs distribution should be much faster. However, such phenomena have largely evaded rigorous analysis, as most tools in the study of Markov chain mixing times are tailored to worst-case initializations.
In this paper we develop a general framework towards establishing this conjectured behavior for the Ising model. In the classical setting of the Ising model on an $N$-vertex torus in $\mathbb Z^d$, our framework implies that the mixing time for the Glauber dynamics, initialized from a $\frac 12$-$\frac 12$ mixture of the all-plus and all-minus configurations, is $N^{1+o(1)}$ in dimension $d=2$, and at most quasi-polynomial in all dimensions $d\ge 3$, at all temperatures below the critical one. The key innovation in our analysis is the introduction of the notion of "weak spatial mixing within a phase", a low-temperature adaptation of the classical concept of weak spatial mixing. We show both that this new notion is strong enough to control the mixing time from the above random initialization (by relating it to the mixing time with plus boundary condition at $O(\log N)$ scales), and that it holds at all low temperatures in all dimensions.
This framework naturally extends to much more general families of graphs. To illustrate this, we also use the same approach to establish optimal $O(N\log N)$ mixing for the Ising Glauber dynamics on random regular graphs at sufficiently low temperatures, when initialized from the same random mixture.
△ Less
Submitted 23 November, 2022; v1 submitted 21 June, 2021;
originally announced June 2021.
-
Approximate domain Markov property for rigid Ising interfaces
Authors:
Reza Gheissari,
Eyal Lubetzky
Abstract:
Consider the Ising model on a centered box of side length $n$ in $\mathbb Z^d$ with $\mp$-boundary conditions that are minus in the upper half-space and plus in the lower half-space. Dobrushin famously showed that in dimensions $d\ge 3$, at low-temperatures the Ising interface (dual-surface separating the plus/minus phases) is rigid, i.e., it has $O(1)$ height fluctuations. Recently, the authors d…
▽ More
Consider the Ising model on a centered box of side length $n$ in $\mathbb Z^d$ with $\mp$-boundary conditions that are minus in the upper half-space and plus in the lower half-space. Dobrushin famously showed that in dimensions $d\ge 3$, at low-temperatures the Ising interface (dual-surface separating the plus/minus phases) is rigid, i.e., it has $O(1)$ height fluctuations. Recently, the authors decomposed these oscillations into pillars and identified their typical shape, leading to a law of large numbers and tightness of their maximum.
Suppose we condition on a height-$h$ level curve of the interface, bounding a set $S \subset \mathbb Z^{d-1}$, along with the entire interface outside the cylinder $S\times \mathbb Z$: what does the interface in $S\times \mathbb Z$ look like? Many models of random surfaces (e.g., SOS and DGFF) fundamentally satisfy the domain Markov property, whereby their heights on $S$ only depend on the heights on $S^c$ through the heights on $\partial S$. The Ising interface importantly does not satisfy this property; the law of the interface depends on the full spin configuration outside $S\times \mathbb Z$.
Here we establish an approximate domain Markov property inside the level curves of the Ising interface. We first extend Dobrushin's result to this setting, showing the interface in $S\times \mathbb Z$ is rigid about height $h$, with exponential tails on its height oscillations. Then we show that the typical tall pillars in $S\times \mathbb Z$ are uniformly absolutely continuous with respect to tall pillars of the unconditional Ising interface. Using this we identify the law of large numbers, tightness, and Gumbel tail bounds on the maximum oscillations in $S\times \mathbb Z$ about height $h$, showing that these only depend on the conditioning through the cardinality of $S$.
△ Less
Submitted 9 December, 2021; v1 submitted 20 October, 2020;
originally announced October 2020.
-
Random-cluster dynamics on random regular graphs in tree uniqueness
Authors:
Antonio Blanca,
Reza Gheissari
Abstract:
We establish rapid mixing of the random-cluster Glauber dynamics on random $Δ$-regular graphs for all $q\ge 1$ and $p<p_u(q,Δ)$, where the threshold $p_u(q,Δ)$ corresponds to a uniqueness/non-uniqueness phase transition for the random-cluster model on the (infinite) $Δ$-regular tree. It is expected that this threshold is sharp, and for $q>2$ the Glauber dynamics on random $Δ$-regular graphs underg…
▽ More
We establish rapid mixing of the random-cluster Glauber dynamics on random $Δ$-regular graphs for all $q\ge 1$ and $p<p_u(q,Δ)$, where the threshold $p_u(q,Δ)$ corresponds to a uniqueness/non-uniqueness phase transition for the random-cluster model on the (infinite) $Δ$-regular tree. It is expected that this threshold is sharp, and for $q>2$ the Glauber dynamics on random $Δ$-regular graphs undergoes an exponential slowdown at $p_u(q,Δ)$.
More precisely, we show that for every $q\ge 1$, $Δ\ge 3$, and $p<p_u(q,Δ)$, with probability $1-o(1)$ over the choice of a random $Δ$-regular graph on $n$ vertices, the Glauber dynamics for the random-cluster model has $Θ(n \log n)$ mixing time. As a corollary, we deduce fast mixing of the Swendsen--Wang dynamics for the Potts model on random $Δ$-regular graphs for every $q\ge 2$, in the tree uniqueness region. Our proof relies on a sharp bound on the "shattering time", i.e., the number of steps required to break up any configuration into $O(\log n)$ sized clusters. This is established by analyzing a delicate and novel iterative scheme to simultaneously reveal the underlying random graph with clusters of the Glauber dynamics configuration on it, at a given time.
△ Less
Submitted 10 April, 2021; v1 submitted 5 August, 2020;
originally announced August 2020.
-
Diffusions interacting through a random matrix: universality via stochastic Taylor expansion
Authors:
Amir Dembo,
Reza Gheissari
Abstract:
Consider $(X_{i}(t))$ solving a system of $N$ stochastic differential equations interacting through a random matrix $\mathbf J = (J_{ij})$ with independent (not necessarily identically distributed) random coefficients. We show that the trajectories of averaged observables of $(X_i(t))$, initialized from some $μ$ independent of $\mathbf J$, are universal, i.e., only depend on the choice of the dist…
▽ More
Consider $(X_{i}(t))$ solving a system of $N$ stochastic differential equations interacting through a random matrix $\mathbf J = (J_{ij})$ with independent (not necessarily identically distributed) random coefficients. We show that the trajectories of averaged observables of $(X_i(t))$, initialized from some $μ$ independent of $\mathbf J$, are universal, i.e., only depend on the choice of the distribution $\mathbf{J}$ through its first and second moments (assuming e.g., sub-exponential tails). We take a general combinatorial approach to proving universality for dynamical systems with random coefficients, combining a stochastic Taylor expansion with a moment matching-type argument. Concrete settings for which our results imply universality include aging in the spherical SK spin glass, and Langevin dynamics and gradient flows for symmetric and asymmetric Hopfield networks.
△ Less
Submitted 2 February, 2021; v1 submitted 23 June, 2020;
originally announced June 2020.
-
Local and global geometry of the 2D Ising interface in critical pre-wetting
Authors:
Shirshendu Ganguly,
Reza Gheissari
Abstract:
Consider the Ising model at low-temperatures and positive external field $λ$ on an $N\times N$ box with Dobrushin boundary conditions that are plus on the north, east, and west boundaries and minus on the south boundary. If $λ= 0$, the interface separating the plus and minus phases is diffusive, having $O(\sqrt N)$ height fluctuations, and the model is fully wetted. Under an order one field, the i…
▽ More
Consider the Ising model at low-temperatures and positive external field $λ$ on an $N\times N$ box with Dobrushin boundary conditions that are plus on the north, east, and west boundaries and minus on the south boundary. If $λ= 0$, the interface separating the plus and minus phases is diffusive, having $O(\sqrt N)$ height fluctuations, and the model is fully wetted. Under an order one field, the interface fluctuations are $O(1)$ and the interface is only partially wetted, being pinned to its southern boundary. We study the critical pre-wetting regime of $λ_N \downarrow 0$, where the height fluctuations are expected to scale as $λ^{ -1/3}$ and the rescaled interface is predicted to converge to the Ferrari--Spohn diffusion. Velenik (2004) identified the order of the area under the interface up to logarithmic corrections. Since then, more refined features of such interfaces have only been identified in simpler models of random walks under area tilts.
In this paper, we resolve several conjectures of Velenik regarding the refined features of the Ising interface in the critical pre-wetting regime. Our main result is a sharp bound on the one-point height fluctuation, proving $e^{ - Θ(x^{3/2})}$ upper tails reminiscent of the Tracy--Widom distribution, capturing a tradeoff between the locally Brownian oscillations and the global field effect. We further prove a concentration estimate for the number of points above which the interface attains a large height. These are used to deduce various geometric properties of the interface, including the order and tails of the area it confines, and the poly-logarithmic pre-factor governing its maximum height fluctuation. Our arguments combine classical inputs from the random-line representation of the Ising interface, with novel local resampling and coupling schemes.
△ Less
Submitted 2 February, 2021; v1 submitted 22 April, 2020;
originally announced April 2020.
-
Online stochastic gradient descent on non-convex losses from high-dimensional inference
Authors:
Gerard Ben Arous,
Reza Gheissari,
Aukosh Jagannath
Abstract:
Stochastic gradient descent (SGD) is a popular algorithm for optimization problems arising in high-dimensional inference tasks. Here one produces an estimator of an unknown parameter from independent samples of data by iteratively optimizing a loss function. This loss function is random and often non-convex. We study the performance of the simplest version of SGD, namely online SGD, from a random…
▽ More
Stochastic gradient descent (SGD) is a popular algorithm for optimization problems arising in high-dimensional inference tasks. Here one produces an estimator of an unknown parameter from independent samples of data by iteratively optimizing a loss function. This loss function is random and often non-convex. We study the performance of the simplest version of SGD, namely online SGD, from a random start in the setting where the parameter space is high-dimensional.
We develop nearly sharp thresholds for the number of samples needed for consistent estimation as one varies the dimension. Our thresholds depend only on an intrinsic property of the population loss which we call the information exponent. In particular, our results do not assume uniform control on the loss itself, such as convexity or uniform derivative bounds. The thresholds we obtain are polynomial in the dimension and the precise exponent depends explicitly on the information exponent. As a consequence of our results, we find that except for the simplest tasks, almost all of the data is used simply in the initial search phase to obtain non-trivial correlation with the ground truth. Upon attaining non-trivial correlation, the descent is rapid and exhibits law of large numbers type behavior.
We illustrate our approach by applying it to a wide set of inference tasks such as phase retrieval, and parameter estimation for generalized linear models, online PCA, and spiked tensor models, as well as to supervised learning for single-layer networks with general activation functions.
△ Less
Submitted 10 May, 2021; v1 submitted 23 March, 2020;
originally announced March 2020.
-
Local minima in disordered mean-field ferromagnets
Authors:
Eric Yilun Song,
Reza Gheissari,
Charles M. Newman,
Daniel L. Stein
Abstract:
We consider the complexity of random ferromagnetic landscapes on the hypercube $\{\pm 1\}^N$ given by Ising models on the complete graph with i.i.d. non-negative edge-weights. This includes, in particular, the case of Bernoulli disorder corresponding to the Ising model on a dense random graph $\mathcal G(N,p)$. Previous results had shown that, with high probability as $N\to\infty$, the gradient se…
▽ More
We consider the complexity of random ferromagnetic landscapes on the hypercube $\{\pm 1\}^N$ given by Ising models on the complete graph with i.i.d. non-negative edge-weights. This includes, in particular, the case of Bernoulli disorder corresponding to the Ising model on a dense random graph $\mathcal G(N,p)$. Previous results had shown that, with high probability as $N\to\infty$, the gradient search (energy-lowering) algorithm, initialized uniformly at random, converges to one of the homogeneous global minima (all-plus or all-minus). Here, we devise two modified algorithms tailored to explore the landscape at near-zero magnetizations (where the effect of the ferromagnetic drift is minimized). With these, we numerically verify the landscape complexity of random ferromagnets, finding a diverging number of (1-spin-flip-stable) local minima as $N\to\infty$. We then investigate some of the properties of these local minima (e.g., typical energy and magnetization) and compare to the situation where the edge-weights are drawn from a heavy-tailed distribution.
△ Less
Submitted 25 October, 2019;
originally announced October 2019.
-
Nature vs. Nurture: Dynamical Evolution in Disordered Ising Ferromagnets
Authors:
Lily Z. Wang,
Reza Gheissari,
Charles M. Newman,
Daniel L. Stein
Abstract:
We study the predictability of zero-temperature Glauber dynamics in various models of disordered ferromagnets. This is analyzed using two independent dynamical realizations with the same random initialization (called twins). We derive, theoretically and numerically, trajectories for the evolution of the normalized magnetization and twin overlap as the system size tends to infinity. The systems we…
▽ More
We study the predictability of zero-temperature Glauber dynamics in various models of disordered ferromagnets. This is analyzed using two independent dynamical realizations with the same random initialization (called twins). We derive, theoretically and numerically, trajectories for the evolution of the normalized magnetization and twin overlap as the system size tends to infinity. The systems we treat include mean-field ferromagnets with light-tailed and heavy-tailed coupling distributions, as well as highly-disordered models with a variety of other geometries. In the mean-field setting with light-tailed couplings, the disorder averages out and the limiting trajectories of the magnetization and twin overlap match those of the homogenous Curie--Weiss model. On the other hand, when the coupling distribution has heavy tails, or the geometry changes, the effect of the disorder persists in the thermodynamic limit. Nonetheless, qualitatively all such random ferromagnets share a similar time evolution for their twin overlap, wherein the two twins initially decorrelate, before either partially or fully converging back together due to the ferromagnetic drift.
△ Less
Submitted 9 October, 2019;
originally announced October 2019.
-
Tightness and tails of the maximum in 3D Ising interfaces
Authors:
Reza Gheissari,
Eyal Lubetzky
Abstract:
Consider the 3D Ising model on a box of side length $n$ with minus boundary conditions above the $xy$-plane and plus boundary conditions below it. At low temperatures, Dobrushin (1972) showed that the interface separating the predominantly plus and predominantly minus regions is localized: its height above a fixed point has exponential tails. Recently, the authors proved a law of large numbers for…
▽ More
Consider the 3D Ising model on a box of side length $n$ with minus boundary conditions above the $xy$-plane and plus boundary conditions below it. At low temperatures, Dobrushin (1972) showed that the interface separating the predominantly plus and predominantly minus regions is localized: its height above a fixed point has exponential tails. Recently, the authors proved a law of large numbers for the maximum height $M_n$ of this interface: for every $β$ large, $M_n/ \log n\to c_β$ in probability as $n\to\infty$.
Here we show that the laws of the centered maxima $(M_n - \mathbb{E}[M_n])_{n\geq 1}$ are uniformly tight. Moreover, even though this sequence does not converge, we prove that it has uniform upper and lower Gumbel tails (exponential right tails and doubly exponential left tails). Key to the proof is a sharp (up to $O(1)$ precision) understanding of the surface large deviations. This includes, in particular, the shape of a pillar that reaches near-maximum height, even at its base, where the interactions with neighboring pillars are dominant.
△ Less
Submitted 12 May, 2020; v1 submitted 16 July, 2019;
originally announced July 2019.
-
Maximum and shape of interfaces in 3D Ising crystals
Authors:
Reza Gheissari,
Eyal Lubetzky
Abstract:
Dobrushin (1972) showed that the interface of a 3D Ising model with minus boundary conditions above the $xy$-plane and plus below is rigid (has $O(1)$-fluctuations) at every sufficiently low temperature. Since then, basic features of this interface -- such as the asymptotics of its maximum -- were only identified in more tractable random surface models that approximate the Ising interface at low t…
▽ More
Dobrushin (1972) showed that the interface of a 3D Ising model with minus boundary conditions above the $xy$-plane and plus below is rigid (has $O(1)$-fluctuations) at every sufficiently low temperature. Since then, basic features of this interface -- such as the asymptotics of its maximum -- were only identified in more tractable random surface models that approximate the Ising interface at low temperatures, e.g., for the (2+1)D Solid-On-Solid model. Here we study the large deviations of the interface of the 3D Ising model in a cube of side-length $n$ with Dobrushin's boundary conditions, and in particular obtain a law of large numbers for $M_n$, its maximum: if the inverse-temperature $β$ is large enough, then $M_n / \log n \to 2/α_β$ as $n\to\infty$, in probability, where $α_β$ is given by a large deviation rate in infinite volume.
We further show that, on the large deviation event that the interface connects the origin to height $h$, it consists of a 1D spine that behaves like a random walk, in that it decomposes into a linear (in $h$) number of asymptotically-stationary weakly-dependent increments that have exponential tails. As the number $T$ of increments diverges, properties of the interface such as its surface area, volume, and the location of its tip, all obey CLTs with variances linear in $T$. These results generalize to every dimension $d\geq 3$.
△ Less
Submitted 9 April, 2020; v1 submitted 15 January, 2019;
originally announced January 2019.
-
Bounding flows for spherical spin glass dynamics
Authors:
Gerard Ben Arous,
Reza Gheissari,
Aukosh Jagannath
Abstract:
We introduce a new approach to studying spherical spin glass dynamics based on differential inequalities for one-time observables. Using this approach, we obtain an approximate phase diagram for the evolution of the energy $H$ and its gradient under Langevin dynamics for spherical $p$-spin models. We then derive several consequences of this phase diagram. For example, at any temperature, uniformly…
▽ More
We introduce a new approach to studying spherical spin glass dynamics based on differential inequalities for one-time observables. Using this approach, we obtain an approximate phase diagram for the evolution of the energy $H$ and its gradient under Langevin dynamics for spherical $p$-spin models. We then derive several consequences of this phase diagram. For example, at any temperature, uniformly over all starting points, the process must reach and remain in an absorbing region of large negative values of $H$ and large (in norm) gradients in order 1 time. Furthermore, if the process starts in a neighborhood of a critical point of $H$ with negative energy, then both the gradient and energy must increase macroscopically under this evolution, even if this critical point is a saddle with index of order $N$. As a key technical tool, we estimate Sobolev norms of spin glass Hamiltonians, which are of independent interest.
△ Less
Submitted 24 October, 2019; v1 submitted 2 August, 2018;
originally announced August 2018.
-
Algorithmic thresholds for tensor PCA
Authors:
Gerard Ben Arous,
Reza Gheissari,
Aukosh Jagannath
Abstract:
We study the algorithmic thresholds for principal component analysis of Gaussian $k$-tensors with a planted rank-one spike, via Langevin dynamics and gradient descent. In order to efficiently recover the spike from natural initializations, the signal to noise ratio must diverge in the dimension. Our proof shows that the mechanism for the success/failure of recovery is the strength of the "curvatur…
▽ More
We study the algorithmic thresholds for principal component analysis of Gaussian $k$-tensors with a planted rank-one spike, via Langevin dynamics and gradient descent. In order to efficiently recover the spike from natural initializations, the signal to noise ratio must diverge in the dimension. Our proof shows that the mechanism for the success/failure of recovery is the strength of the "curvature" of the spike on the maximum entropy region of the initial data. To demonstrate this, we study the dynamics on a generalized family of high-dimensional landscapes with planted signals, containing the spiked tensor models as specific instances. We identify thresholds of signal-to-noise ratios above which order 1 time recovery succeeds; in the case of the spiked tensor model these match the thresholds conjectured for algorithms such as Approximate Message Passing. Below these thresholds, where the curvature of the signal on the maximal entropy region is weak, we show that recovery from certain natural initializations takes at least stretched exponential time. Our approach combines global regularity estimates for spin glasses with point-wise estimates, to study the recovery problem by a perturbative approach.
△ Less
Submitted 10 September, 2019; v1 submitted 2 August, 2018;
originally announced August 2018.
-
Random-cluster dynamics in $\mathbb Z^2$: rapid mixing with general boundary conditions
Authors:
Antonio Blanca,
Reza Gheissari,
Eric Vigoda
Abstract:
The random-cluster model with parameters $(p,q)$ is a random graph model that generalizes bond percolation ($q=1$) and the Ising and Potts models ($q\geq 2$). We study its Glauber dynamics on $n\times n$ boxes $Λ_{n}$ of the integer lattice graph $\mathbb Z^2$, where the model exhibits a sharp phase transition at $p=p_c(q)$. Unlike traditional spin systems like the Ising and Potts models, the rand…
▽ More
The random-cluster model with parameters $(p,q)$ is a random graph model that generalizes bond percolation ($q=1$) and the Ising and Potts models ($q\geq 2$). We study its Glauber dynamics on $n\times n$ boxes $Λ_{n}$ of the integer lattice graph $\mathbb Z^2$, where the model exhibits a sharp phase transition at $p=p_c(q)$. Unlike traditional spin systems like the Ising and Potts models, the random-cluster model has non-local interactions. Long-range interactions can be imposed as external connections in the boundary of $Λ_n$, known as boundary conditions. For select boundary conditions that do not carry long-range information (namely, wired and free), Blanca and Sinclair proved that when $q>1$ and $p\neq p_c(q)$, the Glauber dynamics on $Λ_n$ mixes in optimal $O(n^2 \log n)$ time. In this paper, we prove that this mixing time is polynomial in $n$ for every boundary condition that is realizable as a configuration on $\mathbb Z^2 \setminus Λ_{n}$. We then use this to prove near-optimal $\tilde O(n^2)$ mixing time for "typical'' boundary conditions. As a complementary result, we construct classes of non-realizable (non-planar) boundary conditions inducing slow (stretched-exponential) mixing at $p\ll p_c(q)$.
△ Less
Submitted 6 May, 2019; v1 submitted 23 July, 2018;
originally announced July 2018.
-
Zero-temperature dynamics in the dilute Curie-Weiss model
Authors:
Reza Gheissari,
Charles M. Newman,
Daniel L. Stein
Abstract:
We consider the Ising model on a dense Erdős--Rényi random graph, $\mathcal G(N,p)$, with $p>0$ fixed---equivalently, a disordered Curie--Weiss Ising model with $\mbox{Ber}(p)$ couplings---at zero temperature. The disorder may induce local energy minima in addition to the two uniform ground states. In this paper we prove that, starting from a typical initial configuration, the zero-temperature dyn…
▽ More
We consider the Ising model on a dense Erdős--Rényi random graph, $\mathcal G(N,p)$, with $p>0$ fixed---equivalently, a disordered Curie--Weiss Ising model with $\mbox{Ber}(p)$ couplings---at zero temperature. The disorder may induce local energy minima in addition to the two uniform ground states. In this paper we prove that, starting from a typical initial configuration, the zero-temperature dynamics avoids all such local minima and absorbs into a predetermined one of the two uniform ground states. We relate this to the local MINCUT problem on dense random graphs; namely with high probability, the greedy search for a local MINCUT of $\mathcal G(N,p)$ with $p>0$ fixed, started from a uniform random partition, fails to find a non-trivial cut. In contrast, in the disordered Curie--Weiss model with heavy-tailed couplings, we demonstrate that zero-temperature dynamics has positive probability of absorbing in a random local minimum different from the two homogenous ground states.
△ Less
Submitted 27 July, 2017;
originally announced July 2017.
-
Concentration inequalities for polynomials of contracting Ising models
Authors:
Reza Gheissari,
Eyal Lubetzky,
Yuval Peres
Abstract:
We study the concentration of a degree-$d$ polynomial of the $N$ spins of a general Ising model, in the regime where single-site Glauber dynamics is contracting. For $d=1$, Gaussian concentration was shown by Marton (1996) and Samson (2000) as a special case of concentration for convex Lipschitz functions, and extended to a variety of related settings by e.g., Chazottes et al. (2007) and Kontorovi…
▽ More
We study the concentration of a degree-$d$ polynomial of the $N$ spins of a general Ising model, in the regime where single-site Glauber dynamics is contracting. For $d=1$, Gaussian concentration was shown by Marton (1996) and Samson (2000) as a special case of concentration for convex Lipschitz functions, and extended to a variety of related settings by e.g., Chazottes et al. (2007) and Kontorovich and Ramanan (2008). For $d=2$, exponential concentration was shown by Marton (2003) on lattices. We treat a general fixed degree $d$ with $O(1)$ coefficients, and show that the polynomial has variance $O(N^d)$ and, after rescaling it by $N^{-d/2}$, its tail probabilities decay as $\exp(- c\, r^{2/d})$ for deviations of $r \geq C \log N$.
△ Less
Submitted 1 September, 2017; v1 submitted 31 May, 2017;
originally announced June 2017.
-
Exponentially slow mixing in the mean-field Swendsen-Wang dynamics
Authors:
Reza Gheissari,
Eyal Lubetzky,
Yuval Peres
Abstract:
Swendsen-Wang dynamics for the Potts model was proposed in the late 1980's as an alternative to single-site heat-bath dynamics, in which global updates allow this MCMC sampler to switch between metastable states and ideally mix faster. Gore and Jerrum (1999) found that this dynamics may in fact exhibit slow mixing: they showed that, for the Potts model with $q\geq 3$ colors on the complete graph o…
▽ More
Swendsen-Wang dynamics for the Potts model was proposed in the late 1980's as an alternative to single-site heat-bath dynamics, in which global updates allow this MCMC sampler to switch between metastable states and ideally mix faster. Gore and Jerrum (1999) found that this dynamics may in fact exhibit slow mixing: they showed that, for the Potts model with $q\geq 3$ colors on the complete graph on $n$ vertices at the critical point $β_c(q)$, Swendsen-Wang dynamics has $t_{\mathrm{mix}}\geq \exp(c\sqrt n)$. The same lower bound was extended to the critical window $(β_s,β_S)$ around $β_c$ by Galanis et al. (2015), as well as to the corresponding mean-field FK model by Blanca and Sinclair (2015). In both cases, an upper bound of $t_{\mathrm{mix}} \leq \exp(c' n)$ was known. Here we show that the mixing time is truly exponential in $n$: namely, $t_{\mathrm{mix}} \geq \exp (cn)$ for Swendsen-Wang dynamics when $q\geq 3$ and $β\in(β_s,β_S)$, and the same bound holds for the related MCMC samplers for the mean-field FK model when $q>2$.
△ Less
Submitted 2 May, 2017; v1 submitted 19 February, 2017;
originally announced February 2017.
-
The effect of boundary conditions on mixing of 2D Potts models at discontinuous phase transitions
Authors:
Reza Gheissari,
Eyal Lubetzky
Abstract:
We study Swendsen--Wang dynamics for the critical $q$-state Potts model on the square lattice. For $q=2,3,4$, where the phase transition is continuous, the mixing time $t_{\textrm{mix}}$ is expected to obey a universal power-law independent of the boundary conditions. On the other hand, for large $q$, where the phase transition is discontinuous, the authors recently showed that $t_{\textrm{mix}}$…
▽ More
We study Swendsen--Wang dynamics for the critical $q$-state Potts model on the square lattice. For $q=2,3,4$, where the phase transition is continuous, the mixing time $t_{\textrm{mix}}$ is expected to obey a universal power-law independent of the boundary conditions. On the other hand, for large $q$, where the phase transition is discontinuous, the authors recently showed that $t_{\textrm{mix}}$ is highly sensitive to boundary conditions: $t_{\textrm{mix}} \geq \exp(cn)$ on an $n\times n$ box with periodic boundary, yet under free or monochromatic boundary conditions, $t_{\textrm{mix}} \leq\exp(n^{o(1)})$.
In this work we classify this effect under boundary conditions that interpolate between these two (torus vs. free/monochromatic). Specifically, if one of the $q$ colors is red, mixed boundary conditions such as red-free-red-free on the 4 sides of the box induce $t_{\textrm{mix}} \geq \exp(cn)$, yet Dobrushin boundary conditions such as red-red-free-free, as well as red-periodic-red-periodic, induce sub-exponential mixing.
△ Less
Submitted 4 June, 2018; v1 submitted 31 December, 2016;
originally announced January 2017.
-
Quasi-polynomial mixing of critical 2D random cluster models
Authors:
Reza Gheissari,
Eyal Lubetzky
Abstract:
We study the Glauber dynamics for the random cluster (FK) model on the torus $(\mathbb{Z}/n\mathbb{Z})^2$ with parameters $(p,q)$, for $q \in (1,4]$ and $p$ the critical point $p_c$. The dynamics is believed to undergo a critical slowdown, with its continuous-time mixing time transitioning from $O(\log n)$ for $p\neq p_c$ to a power-law in $n$ at $p=p_c$. This was verified at $p\neq p_c$ by Blanca…
▽ More
We study the Glauber dynamics for the random cluster (FK) model on the torus $(\mathbb{Z}/n\mathbb{Z})^2$ with parameters $(p,q)$, for $q \in (1,4]$ and $p$ the critical point $p_c$. The dynamics is believed to undergo a critical slowdown, with its continuous-time mixing time transitioning from $O(\log n)$ for $p\neq p_c$ to a power-law in $n$ at $p=p_c$. This was verified at $p\neq p_c$ by Blanca and Sinclair, whereas at the critical $p=p_c$, with the exception of the special integer points $q=2,3,4$ (where the model corresponds to the Ising/Potts models) the best-known upper bound on mixing was exponential in $n$. Here we prove an upper bound of $n^{O(\log n)}$ at $p=p_c$ for all $q\in (1,4]$, where a key ingredient is bounding the number of nested long-range crossings at criticality.
△ Less
Submitted 30 March, 2019; v1 submitted 3 November, 2016;
originally announced November 2016.
-
On the Spectral Gap of Spherical Spin Glass Dynamics
Authors:
Reza Gheissari,
Aukosh Jagannath
Abstract:
We consider the time to equilibrium for the Langevin dynamics of the spherical $p$-spin glass model of system size $N$. We show that the log-Sobolev constant and spectral gap are order $1$ in $N$ at sufficiently high temperature whereas the spectral gap decays exponentially in $N$ at sufficiently low temperatures. These verify the existence of a dynamical high temperature phase and a dynamical gla…
▽ More
We consider the time to equilibrium for the Langevin dynamics of the spherical $p$-spin glass model of system size $N$. We show that the log-Sobolev constant and spectral gap are order $1$ in $N$ at sufficiently high temperature whereas the spectral gap decays exponentially in $N$ at sufficiently low temperatures. These verify the existence of a dynamical high temperature phase and a dynamical glass phase at the level of the spectral gap. Key to these results are the understanding of the extremal process and restricted free energy of Subag--Zeitouni and Subag.
△ Less
Submitted 4 June, 2018; v1 submitted 23 August, 2016;
originally announced August 2016.
-
Mixing times of critical 2D Potts models
Authors:
Reza Gheissari,
Eyal Lubetzky
Abstract:
We study dynamical aspects of the $q$-state Potts model on an $n\times n$ box at its critical $β_c(q)$. Heat-bath Glauber dynamics and cluster dynamics such as Swendsen--Wang (that circumvent low-temperature bottlenecks) are all expected to undergo "critical slowdowns" in the presence of periodic boundary conditions: the inverse spectral gap, which in the subcritical regime is $O(1)$, should at cr…
▽ More
We study dynamical aspects of the $q$-state Potts model on an $n\times n$ box at its critical $β_c(q)$. Heat-bath Glauber dynamics and cluster dynamics such as Swendsen--Wang (that circumvent low-temperature bottlenecks) are all expected to undergo "critical slowdowns" in the presence of periodic boundary conditions: the inverse spectral gap, which in the subcritical regime is $O(1)$, should at criticality be polynomial in $n$ for $1< q \leq 4$, and exponential in $n$ for $q>4$ in accordance with the predicted discontinuous phase transition. This was confirmed for $q=2$ (the Ising model) by the second author and Sly, and for sufficiently large $q$ by Borgs et al.
Here we show that the following holds for the critical Potts model on the torus: for $q=3$, the inverse gap of Glauber dynamics is $n^{O(1)}$; for $q=4$, it is at most $n^{O(\log n)}$; and for every $q>4$ in the phase-coexistence regime, the inverse gaps of both Glauber dynamics and Swendsen--Wang dynamics are exponential in $n$.
For free or monochromatic boundary conditions and large $q$, we show that the dynamics at criticality is faster than on the torus (unlike the Ising model where free/periodic boundary conditions induce similar dynamical behavior at all temperatures): the inverse gap of Swendsen--Wang dynamics is $\exp(n^{o(1)})$.
△ Less
Submitted 31 August, 2017; v1 submitted 7 July, 2016;
originally announced July 2016.
-
Long-Time Predictability in Disordered Spin Systems Following a Deep Quench
Authors:
J. Ye,
R. Gheissari,
J. Machta,
C. M. Newman,
D. L. Stein
Abstract:
We study the problem of predictability, or "nature vs. nurture", in several disordered Ising spin systems evolving at zero temperature from a random initial state: how much does the final state depend on the information contained in the initial state, and how much depends on the detailed history of the system? Our numerical studies of the "dynamical order parameter" in Edwards-Anderson Ising spin…
▽ More
We study the problem of predictability, or "nature vs. nurture", in several disordered Ising spin systems evolving at zero temperature from a random initial state: how much does the final state depend on the information contained in the initial state, and how much depends on the detailed history of the system? Our numerical studies of the "dynamical order parameter" in Edwards-Anderson Ising spin glasses and random ferromagnets indicate that the influence of the initial state decays as dimension increases. Similarly, this same order parameter for the Sherrington-Kirkpatrick infinite-range spin glass indicates that this information decays as the number of spins increases. Based on these results, we conjecture that the influence of the initial state on the final state decays to zero in finite-dimensional random-bond spin systems as dimension goes to infinity, regardless of the presence of frustration. We also study the rate at which spins "freeze out" to a final state as a function of dimensionality and number of spins; here the results indicate that the number of "active" spins at long times increases with dimension (for short-range systems) or number of spins (for infinite-range systems). We provide theoretical arguments to support these conjectures, and also study analytically several mean-field models: the random energy model, the uniform Curie-Weiss ferromagnet, and the disordered Curie-Weiss ferromagnet. We find that for these models, the information contained in the initial state does not decay in the thermodynamic limit-- in fact, it fully determines the final state. Unlike in short-range models, the presence of frustration in mean-field models dramatically alters the dynamical behavior with respect to the issue of predictability.
△ Less
Submitted 17 April, 2017; v1 submitted 1 January, 2016;
originally announced January 2016.
-
Asymptotics of height change on toroidal Temperleyan dimer models
Authors:
Julien Dubédat,
Reza Gheissari
Abstract:
The dimer model is an exactly solvable model of planar statistical mechanics. In its critical phase, various aspects of its scaling limit are known to be described by the Gaussian free field. For periodic graphs, criticality is an algebraic condition on the spectral curve of the model, determined by the edge weights; isoradial graphs provide another class of critical dimer models, in which the edg…
▽ More
The dimer model is an exactly solvable model of planar statistical mechanics. In its critical phase, various aspects of its scaling limit are known to be described by the Gaussian free field. For periodic graphs, criticality is an algebraic condition on the spectral curve of the model, determined by the edge weights; isoradial graphs provide another class of critical dimer models, in which the edge weights are determined by the local geometry. In the present article, we consider another class of graphs: general Temperleyan graphs, i.e. graphs arising in the (generalized) Temperley bijection between spanning trees and dimer models. Building in particular on Forman's formula and representations of Laplacian determinants in terms of Poisson operators, and under a minimal assumption - viz. that the underlying random walk converges to Brownian motion - we show that the natural topological observable on macroscopic tori converges in law to its universal limit, i.e. the law of the periods of the dimer height function converges to that of the periods of a compactified free field.
△ Less
Submitted 20 November, 2014; v1 submitted 23 July, 2014;
originally announced July 2014.
-
Ising Model: Local Spin Correlations and Conformal Invariance
Authors:
Reza Gheissari,
Clément Hongler,
S. C. Park
Abstract:
We study the 2-dimensional Ising model at critical temperature on a simply connected subset $Ω_δ$ of the square grid $δ\mathbb{Z}^{2}$. The scaling limit of the critical Ising model is conjectured to be described by Conformal Field Theory; in particular, there is expected to be a precise correspondence between local lattice fields of the Ising model and the local fields of Conformal Field Theory.…
▽ More
We study the 2-dimensional Ising model at critical temperature on a simply connected subset $Ω_δ$ of the square grid $δ\mathbb{Z}^{2}$. The scaling limit of the critical Ising model is conjectured to be described by Conformal Field Theory; in particular, there is expected to be a precise correspondence between local lattice fields of the Ising model and the local fields of Conformal Field Theory.
Towards the proof of this correspondence, we analyze arbitrary spin pattern probabilities (probabilities of finite spin configurations occurring at the origin), explicitly obtain their infinite-volume limits, and prove their conformal covariance at the first (non-trivial) order. We formulate these probabilities in terms of discrete fermionic observables, enabling the study of their scaling limits. This generalizes results of [Hon10,HoSm13] and [CHI15] to one-point functions of any local spin correlations.
We introduce a collection of tools which allow one to exactly and explicitly translate any spin pattern probability (and hence any lattice local field correlation) in terms of discrete complex analysis quantities. The proof requires working with multipoint lattice spinors with monodromy (including construction of explicit formulae in the full plane), and refined analysis near their source points to prove convergence to the appropriate continuous conformally covariant functions.
△ Less
Submitted 21 November, 2018; v1 submitted 16 December, 2013;
originally announced December 2013.