HTML conversions sometimes display errors due to content that did not convert correctly from the source. This paper uses the following packages that are not yet supported by the HTML conversion tool. Feedback on these issues are not necessary; they are known and are being worked on.

  • failed: stickstootext

Authors: achieve the best HTML results from your LaTeX submissions by following these best practices.

License: CC BY 4.0
arXiv:2208.07462v4 [math.PR] 27 Jan 2024

Speeding up random walk mixing by starting from a uniform vertex

Alberto Espuny Díaz [email protected] Institut für Mathematik, Technische Universität Ilmenau, 98684 Ilmenau, Germany. Patrick Morris [email protected] Departament de Matemàtiques and IMTECH, Universitat Politècnica de Catalunya (UPC), Barcelona, Spain. Guillem Perarnau [email protected] Centre de Recerca Matemàtica, Barcelona, Spain.  and  Oriol Serra [email protected]
(Date: January 27, 2024)
Abstract.

The theory of rapid mixing random walks plays a fundamental role in the study of modern randomised algorithms. Usually, the mixing time is measured with respect to the worst initial position. It is well known that the presence of bottlenecks in a graph hampers mixing and, in particular, starting inside a small bottleneck significantly slows down the diffusion of the walk in the first steps of the process. The average mixing time is defined to be the mixing time starting at a uniformly random vertex and hence is not sensitive to the slow diffusion caused by these bottlenecks.

In this paper we provide a general framework to show logarithmic average mixing time for random walks on graphs with small bottlenecks. The framework is especially effective on certain families of random graphs with heterogeneous properties. We demonstrate its applicability on two random models for which the mixing time was known to be of order (logn)2superscript𝑛2(\log n)^{2}( roman_log italic_n ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, speeding up the mixing to order logn𝑛\log nroman_log italic_n. First, in the context of smoothed analysis on connected graphs, we show logarithmic average mixing time for randomly perturbed graphs of bounded degeneracy. A particular instance is the Newman-Watts small-world model. Second, we show logarithmic average mixing time for supercritically percolated expander graphs. When the host graph is complete, this application gives an alternative proof that the average mixing time of the giant component in the supercritical Erdős-Rényi graph is logarithmic.

This research has been supported by the Spanish Agencia Estatal de Investigación under projects PID2020-113082GB-I00 and the Severo Ochoa and María de Maeztu Program for Centers and Units of Excellence in R&D (CEX2020-001084-M). Alberto Espuny Díaz was partially supported by the Carl Zeiss Foundation and by DFG (German Research Foundation) grant PE 2299/3-1. Patrick Morris was supported by the DFG Walter Benjamin program - project number 504502205.

1. Introduction

Random walks on graphs are one of the fundamental tools for sampling (see, e.g., [38]). Applications are numerous in areas such as computer science, discrete mathematics and statistical physics. Prominent examples include the polynomial-time algorithm to estimate the volume of a convex body [19], computing the matrix permanent [28] or the use of Glauber dynamics to sample from Gibbs distributions, in particular from proper colourings [42].

Most usually, the size of the sampling space is exponential in the input size, and fully exploring this space is computationally intractable. The Markov chain Monte Carlo (MCMC) method consists of running a random walk in an appropriately chosen graph, whose vertex set is the sample space, until its distribution is arbitrarily close to equilibrium, regardless of the initial state. At that time we say the walk has mixed, and the time until it does is called the (worst-case) mixing time. To obtain efficient sampling algorithms it suffices to prove that the mixing time is poly-logarithmic in the input size.

The connection between rapid mixing and expanders is well-established. In the context of random walks, expansion is measured by means of a graph parameter called conductance; see Section 2.2 for the precise definition. Jerrum and Sinclair [28] gave an upper bound on the mixing time depending on the conductance and the logarithm of the minimum stationary value. This bound is central in the theory of Markov chains.

Random environments are particularly interesting sampling spaces and, in the last 20 years, researchers have developed the theory of random walks on random graphs. As expected, the good expansion properties of random graphs ensure rapid mixing. By the Jerrum-Sinclair bound, graphs with conductance bounded away from zero mix in logarithmically many steps and usually exhibit cut-off, that is, the distribution converges rapidly to the stationary distribution in a small window of time. Good examples are random graph models with control on the degrees, such as random regular graphs [34], random graphs with given degree sequences [6, 4], their directed analogues [9, 12], or graphs perturbed by random perfect matchings [27].

Nonetheless, the presence of small obstructions slows down the mixing. A canonical example is the giant component of a sparse Erdős-Rényi graph G(n,c/n)𝐺𝑛𝑐𝑛G(n,c/n)italic_G ( italic_n , italic_c / italic_n ) with c>1𝑐1c>1italic_c > 1. This component contains relatively small bottlenecks, that is, connected sets that only have few edges connecting them to the rest of the graph. In such cases, tools like the Jerrum-Sinclair bound fail to pin down the correct order of the mixing time. Fountoulakis and Reed [23] introduced a strengthening of the bound that is sensitive to small bottlenecks and used it to show that the mixing time of the largest component in G(n,c/n)𝐺𝑛𝑐𝑛G(n,c/n)italic_G ( italic_n , italic_c / italic_n ) is asymptotically almost surely (a.a.s. for short) O(log2n)𝑂superscript2𝑛O(\log^{2}n)italic_O ( roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n ) [24]. Indeed, this is the correct order as the component contains paths of degree 2222 vertices (also referred to as bare paths) whose length is of order logn𝑛\log{n}roman_log italic_n. Starting at the centre of such paths, a random walk takes Ω(log2n)Ωsuperscript2𝑛\Omega(\log^{2}n)roman_Ω ( roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n ) steps in expectation to escape from it. We remark that the mixing time in the supercritical random graph G(n,c/n)𝐺𝑛𝑐𝑛G(n,c/n)italic_G ( italic_n , italic_c / italic_n ) was also bounded independently by Benjamini, Kozma and Wormald [5], using a different approach investigating the anatomy of the giant component.

However, these local bottlenecks are a negligible part of the giant component and the rest of the component has good expansion properties. This suggests that, if the random walk started outside the bottlenecks, the mixing time would decrease. This was implicit in the work of Benjamini, Kozma and Wormald [5] and their description of the giant component, and such a speeding up of mixing time was also conjectured explicitly by Fountoulakis and Reed [24]. Berestycki, Lubetzky, Peres and Sly [6] confirmed their prediction, showing that there exists a=a(c)𝑎𝑎𝑐a=a(c)italic_a = italic_a ( italic_c ) such that the mixing time starting at a uniformly random vertex is asymptotically alogn𝑎𝑛a\log{n}italic_a roman_log italic_n with high probability (they in fact proved much more, establishing the value of a(c)𝑎𝑐a(c)italic_a ( italic_c ) precisely as well as cut-off for the random walk). This result reinforces the idea that, in certain heterogeneous scenarios, averaging over the starting position yields more efficient sampling algorithms.

The goal of this paper is to provide a general framework to show logarithmic average-case mixing time for random walks on graphs with small bottlenecks.

1.1. Average mixing times

Given an n𝑛nitalic_n-vertex graph G𝐺Gitalic_G, the lazy random walk over G𝐺Gitalic_G is a Markov chain with state space V(G)𝑉𝐺V(G)italic_V ( italic_G ) which can be defined as follows. If at any given time we are in a vertex uV(G)𝑢𝑉𝐺u\in V(G)italic_u ∈ italic_V ( italic_G ), the lazy random walk stays in u𝑢uitalic_u with probability 1/2121/21 / 2, and with probability 1/2121/21 / 2 it moves to a uniformly random neighbour of u𝑢uitalic_u in G𝐺Gitalic_G. If G𝐺Gitalic_G is a connected graph, it is well known that the lazy random walk over G𝐺Gitalic_G is ergodic and its distribution converges to the (unique) stationary distribution πGsubscript𝜋𝐺\pi_{G}italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT (see, e.g., [33] for a comprehensive review of random walks and mixing times).

The total variation distance dTV(μ,ν)subscript𝑑TV𝜇𝜈d_{\mathrm{TV}}(\mu,\nu)italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_μ , italic_ν ) between two probability distributions ν𝜈\nuitalic_ν and μ𝜇\muitalic_μ on the vertex set V(G)𝑉𝐺V(G)italic_V ( italic_G ) of a graph G𝐺Gitalic_G is defined as

dTV(μ,ν)maxAV(G)|μ(A)ν(A)|=12vV(G)|μ(v)ν(v)|.subscript𝑑TV𝜇𝜈subscript𝐴𝑉𝐺𝜇𝐴𝜈𝐴12subscript𝑣𝑉𝐺𝜇𝑣𝜈𝑣d_{\mathrm{TV}}(\mu,\nu)\coloneqq\max_{A\subseteq V(G)}|\mu(A)-\nu(A)|=\frac{1% }{2}\sum_{v\in V(G)}|\mu(v)-\nu(v)|.italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_μ , italic_ν ) ≔ roman_max start_POSTSUBSCRIPT italic_A ⊆ italic_V ( italic_G ) end_POSTSUBSCRIPT | italic_μ ( italic_A ) - italic_ν ( italic_A ) | = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V ( italic_G ) end_POSTSUBSCRIPT | italic_μ ( italic_v ) - italic_ν ( italic_v ) | . (1.1)

Let PGsubscript𝑃𝐺P_{G}italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT be the transition matrix of the lazy random walk over G𝐺Gitalic_G. For ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0, the ϵitalic-ϵ\epsilonitalic_ϵ-mixing time tmix(G,ϵ)subscript𝑡mix𝐺italic-ϵt_{\mathrm{mix}}(G,\epsilon)italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G , italic_ϵ ) of this lazy random walk is defined as

tmix(G,ϵ)min{t0:maxuV(G)dTV(μ0uPGt,πG)ϵ},subscript𝑡mix𝐺italic-ϵ:𝑡subscript0subscript𝑢𝑉𝐺subscript𝑑TVsuperscriptsubscript𝜇0𝑢superscriptsubscript𝑃𝐺𝑡subscript𝜋𝐺italic-ϵt_{\mathrm{mix}}(G,\epsilon)\coloneqq\min\left\{t\in\mathbb{N}_{0}:\max_{u\in V% (G)}d_{\mathrm{TV}}(\mu_{0}^{u}P_{G}^{t},\pi_{G})\leq\epsilon\right\},italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G , italic_ϵ ) ≔ roman_min { italic_t ∈ blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : roman_max start_POSTSUBSCRIPT italic_u ∈ italic_V ( italic_G ) end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT , italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) ≤ italic_ϵ } ,

where μ0usuperscriptsubscript𝜇0𝑢\mu_{0}^{u}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT is the distribution supported entirely on uV(G)𝑢𝑉𝐺u\in V(G)italic_u ∈ italic_V ( italic_G ).

If instead of considering the worst-case initial vertex we consider a uniformly random vertex vV(G)𝑣𝑉𝐺v\in V(G)italic_v ∈ italic_V ( italic_G ), then the quantity dTV(μ0vPGt,πG)subscript𝑑TVsuperscriptsubscript𝜇0𝑣superscriptsubscript𝑃𝐺𝑡subscript𝜋𝐺d_{\mathrm{TV}}(\mu_{0}^{v}P_{G}^{t},\pi_{G})italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT , italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) is a random variable. We define the average ϵitalic-ϵ\epsilonitalic_ϵ-mixing time t¯mix(G,ϵ)subscript¯𝑡mix𝐺italic-ϵ\bar{t}_{\mathrm{mix}}(G,\epsilon)over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G , italic_ϵ ) of the lazy random walk, to be the time at which the expectation of this random variable falls below the ϵitalic-ϵ\epsilonitalic_ϵ. That is,

t¯mix(G,ϵ)min{t0:1nuV(G)dTV(μ0uPGt,πG)ϵ}.subscript¯𝑡mix𝐺italic-ϵ:𝑡subscript01𝑛subscript𝑢𝑉𝐺subscript𝑑TVsuperscriptsubscript𝜇0𝑢superscriptsubscript𝑃𝐺𝑡subscript𝜋𝐺italic-ϵ\bar{t}_{\mathrm{mix}}(G,\epsilon)\coloneqq\min\left\{t\in\mathbb{N}_{0}:\frac% {1}{n}\sum_{u\in V(G)}d_{\mathrm{TV}}(\mu_{0}^{u}P_{G}^{t},\pi_{G})\leq% \epsilon\right\}.over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G , italic_ϵ ) ≔ roman_min { italic_t ∈ blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_u ∈ italic_V ( italic_G ) end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT , italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) ≤ italic_ϵ } .
Remark 1.1.

In this work, we will focus on the quantity t¯mix(G,ϵ)subscriptnormal-¯𝑡normal-mix𝐺italic-ϵ\bar{t}_{\mathrm{mix}}(G,\epsilon)over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G , italic_ϵ ), which we believe is a natural candidate for tracking mixing times starting from a uniform vertex. Nonetheless, other related quantities have been used to measure the mixing time from a uniform starting point.

Indeed, for a vertex uV(G)𝑢𝑉𝐺u\in V(G)italic_u ∈ italic_V ( italic_G ), define

tmix(u)(G,ϵ)min{t0:dTV(μ0uPGt,πG)ϵ}superscriptsubscript𝑡mix𝑢𝐺italic-ϵ:𝑡subscript0subscript𝑑TVsuperscriptsubscript𝜇0𝑢superscriptsubscript𝑃𝐺𝑡subscript𝜋𝐺italic-ϵt_{\mathrm{mix}}^{(u)}(G,\epsilon)\coloneqq\min\left\{t\in\mathbb{N}_{0}:d_{% \mathrm{TV}}(\mu_{0}^{u}P_{G}^{t},\pi_{G})\leq\epsilon\right\}italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_u ) end_POSTSUPERSCRIPT ( italic_G , italic_ϵ ) ≔ roman_min { italic_t ∈ blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT , italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) ≤ italic_ϵ }

and consider the random variable tmix(Un)=tmix(Un)(G,ϵ)superscriptsubscript𝑡normal-mixsubscript𝑈𝑛superscriptsubscript𝑡normal-mixsubscript𝑈𝑛𝐺italic-ϵt_{\mathrm{mix}}^{(U_{n})}=t_{\mathrm{mix}}^{(U_{n})}(G,\epsilon)italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_U start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT = italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_U start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_G , italic_ϵ ), where Unsubscript𝑈𝑛U_{n}italic_U start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is a vertex chosen uniformly at random from V(G)𝑉𝐺V(G)italic_V ( italic_G ). This notion was the one studied by Berestycki, Lubetzky, Peres and Sly [6]. It is natural to compare t¯mixsubscriptnormal-¯𝑡normal-mix\bar{t}_{\mathrm{mix}}over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT to 𝔼(tmix(Un))𝔼superscriptsubscript𝑡normal-mixsubscript𝑈𝑛\mathbb{E}(t_{\mathrm{mix}}^{(U_{n})})blackboard_E ( italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_U start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ): in the first case, we average the total variation distance over starting vertices and take the smallest time t𝑡titalic_t when this average is smaller than ϵitalic-ϵ\epsilonitalic_ϵ; in the second one, we average the mixing times over the starting vertices (see Figure 1). In general as functions, neither of these notions is stronger than the other, in that one can design examples of trajectories for total variation distances dTV(μ0uPGt,πG)subscript𝑑normal-TVsuperscriptsubscript𝜇0𝑢superscriptsubscript𝑃𝐺𝑡subscript𝜋𝐺d_{\mathrm{TV}}(\mu_{0}^{u}P_{G}^{t},\pi_{G})italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT , italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) for different vertices u𝑢uitalic_u, showing that t¯mixsubscriptnormal-¯𝑡normal-mix\bar{t}_{\mathrm{mix}}over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT cannot be bounded by a function of 𝔼(tmix(Un))𝔼superscriptsubscript𝑡normal-mixsubscript𝑈𝑛\mathbb{E}(t_{\mathrm{mix}}^{(U_{n})})blackboard_E ( italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_U start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) and vice versa. However, bounding either 𝔼(tmix(Un))𝔼superscriptsubscript𝑡normal-mixsubscript𝑈𝑛\mathbb{E}(t_{\mathrm{mix}}^{(U_{n})})blackboard_E ( italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_U start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) or t¯mixsubscriptnormal-¯𝑡normal-mix\bar{t}_{\mathrm{mix}}over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT implies that tmix(Un)superscriptsubscript𝑡normal-mixsubscript𝑈𝑛t_{\mathrm{mix}}^{(U_{n})}italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_U start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT is small with high probability. In the first case this is a direct application of Markov’s inequality. In the second one, define du(t)dTV(μ0uPGt,πG)normal-≔subscript𝑑𝑢𝑡subscript𝑑normal-TVsuperscriptsubscript𝜇0𝑢superscriptsubscript𝑃𝐺𝑡subscript𝜋𝐺d_{u}(t)\coloneqq d_{\mathrm{TV}}(\mu_{0}^{u}P_{G}^{t},\pi_{G})italic_d start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( italic_t ) ≔ italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT , italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ), for a vertex u𝑢uitalic_u, then t¯mix(G,ϵ)subscriptnormal-¯𝑡normal-mix𝐺italic-ϵ\bar{t}_{\mathrm{mix}}(G,\epsilon)over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G , italic_ϵ ) is the time t𝑡titalic_t at which the expected value of du(t)subscript𝑑𝑢𝑡d_{u}(t)italic_d start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ( italic_t ) (averaged over starting points) is less than ϵitalic-ϵ\epsilonitalic_ϵ. By Markov’s inequality, dUn(t¯mix(G,ϵ2))ϵsubscript𝑑subscript𝑈𝑛subscriptnormal-¯𝑡normal-mix𝐺superscriptitalic-ϵ2italic-ϵd_{U_{n}}(\bar{t}_{\mathrm{mix}}(G,\epsilon^{2}))\leq\epsilonitalic_d start_POSTSUBSCRIPT italic_U start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G , italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ) ≤ italic_ϵ with probability at least 1ϵ1italic-ϵ1-\epsilon1 - italic_ϵ.

Refer to caption
Figure 1. Schematic plot of the total variation distance starting at different vertices and the two average mixing times for ϵ=0.05italic-ϵ0.05\epsilon=0.05italic_ϵ = 0.05. In red, the function 1nuV(G)dTV(μ0uPGt,πG)1𝑛subscript𝑢𝑉𝐺subscript𝑑normal-TVsuperscriptsubscript𝜇0𝑢superscriptsubscript𝑃𝐺𝑡subscript𝜋𝐺\frac{1}{n}\sum_{u\in V(G)}d_{\mathrm{TV}}(\mu_{0}^{u}P_{G}^{t},\pi_{G})divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_u ∈ italic_V ( italic_G ) end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT , italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) and the dot representing t¯mix(G,ϵ)subscriptnormal-¯𝑡normal-mix𝐺italic-ϵ\bar{t}_{\mathrm{mix}}(G,\epsilon)over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G , italic_ϵ ). In blue, the average of mixing times at different thresholds and the dot representing 𝔼(tmix(Un)(G,ϵ))𝔼superscriptsubscript𝑡normal-mixsubscript𝑈𝑛𝐺italic-ϵ\mathbb{E}(t_{\mathrm{mix}}^{(U_{n})}(G,\epsilon))blackboard_E ( italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_U start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ( italic_G , italic_ϵ ) ).

A related but very different notion is the time it takes to mix starting at μVsubscript𝜇𝑉\mu_{V}italic_μ start_POSTSUBSCRIPT italic_V end_POSTSUBSCRIPT, the uniform distribution over V𝑉Vitalic_V:

tmix(V)(G,ϵ)min{t0:dTV(μVPGt,πG)ϵ}.subscriptsuperscript𝑡𝑉mix𝐺italic-ϵ:𝑡subscript0subscript𝑑TVsubscript𝜇𝑉superscriptsubscript𝑃𝐺𝑡subscript𝜋𝐺italic-ϵt^{(V)}_{\mathrm{mix}}(G,\epsilon)\coloneqq\min\left\{t\in\mathbb{N}_{0}:d_{% \mathrm{TV}}(\mu_{V}P_{G}^{t},\pi_{G})\leq\epsilon\right\}.italic_t start_POSTSUPERSCRIPT ( italic_V ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G , italic_ϵ ) ≔ roman_min { italic_t ∈ blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT italic_V end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT , italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) ≤ italic_ϵ } .

A similar notion has been studied for directed graphs, where the initial distribution is the in-degree one; see, e.g., [9, Theorem 3]. In general, this latter notion of average mixing time is much smaller than the previous notions and we expect this to also be the case in the settings studied here, although we do not explore this direction.

Remark 1.2.

In the literature, the mixing time of the random walk is often defined as tmix(G)tmix(G,1/4)normal-≔subscript𝑡normal-mix𝐺subscript𝑡normal-mix𝐺14t_{\mathrm{mix}}(G)\coloneqq t_{\mathrm{mix}}(G,1/4)italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G ) ≔ italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G , 1 / 4 ), since the distance to the stationary distribution is contractive after this time. However, this might not be the case for t¯mixsubscriptnormal-¯𝑡normal-mix\bar{t}_{\mathrm{mix}}over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT. Consider for instance the lollipop graph Ln,ksubscript𝐿𝑛𝑘L_{n,k}italic_L start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT: a clique on k𝑘kitalic_k vertices and a path on nk𝑛𝑘n-kitalic_n - italic_k vertices joined by an edge incident to one of the endpoints of the path. If k𝑘kitalic_k and nk𝑛𝑘n-kitalic_n - italic_k are both very large, then, after one step, the total variation distance is roughly 00 if we start at the clique (almost all the mass of πLn,ksubscript𝜋subscript𝐿𝑛𝑘\pi_{L_{n,k}}italic_π start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT is supported on the clique), and roughly 1111 if we start at the path. Taking k=αn𝑘𝛼𝑛k=\lceil\alpha n\rceilitalic_k = ⌈ italic_α italic_n ⌉, then

1nuV(G)dTV(μ0uPLn,k,πLn,k)1α.similar-to1𝑛subscript𝑢𝑉𝐺subscript𝑑TVsuperscriptsubscript𝜇0𝑢subscript𝑃subscript𝐿𝑛𝑘subscript𝜋subscript𝐿𝑛𝑘1𝛼\frac{1}{n}\sum_{u\in V(G)}d_{\mathrm{TV}}(\mu_{0}^{u}P_{L_{n,k}},\pi_{L_{n,k}% })\sim 1-\alpha.divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_u ∈ italic_V ( italic_G ) end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ∼ 1 - italic_α . (1.2)

If α>3/4𝛼34\alpha>3/4italic_α > 3 / 4, then t¯mix(Ln,k,1/4)=1subscriptnormal-¯𝑡normal-mixsubscript𝐿𝑛𝑘141\bar{t}_{\mathrm{mix}}(L_{n,k},1/4)=1over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_L start_POSTSUBSCRIPT italic_n , italic_k end_POSTSUBSCRIPT , 1 / 4 ) = 1. However, the time required to further decrease the distance to the stationary distribution is of order Ω(n2)normal-Ωsuperscript𝑛2\Omega(n^{2})roman_Ω ( italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ), as this is the time required for the walk starting at a typical vertex in the path to hit the clique.

1.2. Our results

Our results will apply to graphs satisfying certain natural structural conditions, which we formalise in the following definition.

Definition 1.3.

Let G𝐺Gitalic_G be an n𝑛nitalic_n-vertex graph. For α>0𝛼0\alpha>0italic_α > 0, we say that a set SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) is α𝛼\alphaitalic_α-thin in G𝐺Gitalic_G if

|G(S)|eG(S,V(G)S)<α|S|.subscript𝐺𝑆subscript𝑒𝐺𝑆𝑉𝐺𝑆𝛼𝑆|\partial_{G}(S)|\coloneqq e_{G}(S,V(G)\setminus S)<\alpha|S|.| ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | ≔ italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S , italic_V ( italic_G ) ∖ italic_S ) < italic_α | italic_S | .

For D>0𝐷0D>0italic_D > 0, we say that a set SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) is D𝐷Ditalic_D-loaded in G𝐺Gitalic_G if

eG(S)>D|S|.subscript𝑒𝐺𝑆𝐷𝑆e_{G}(S)>D|S|.italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) > italic_D | italic_S | .

We say that G𝐺Gitalic_G is an (α,D)𝛼𝐷(\alpha,D)( italic_α , italic_D )-spreader graph if it satisfies the following three properties:

  1. (S1)S1(\mathrm{S}1)( S1 )

    For all (logn)1/5k(11/D2)nsuperscript𝑛15𝑘11superscript𝐷2𝑛(\log n)^{1/5}\leq k\leq(1-1/D^{2})n( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT ≤ italic_k ≤ ( 1 - 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) italic_n, the number of G𝐺Gitalic_G-connected α𝛼\alphaitalic_α-thin sets SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) with |S|=k𝑆𝑘|S|=k| italic_S | = italic_k is less than nek𝑛superscripte𝑘n\mathrm{e}^{-\sqrt{k}}italic_n roman_e start_POSTSUPERSCRIPT - square-root start_ARG italic_k end_ARG end_POSTSUPERSCRIPT.

  2. (S2)S2(\mathrm{S}2)( S2 )

    For all (logn)1/5k(11/D2)nsuperscript𝑛15𝑘11superscript𝐷2𝑛(\log n)^{1/5}\leq k\leq(1-1/D^{2})n( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT ≤ italic_k ≤ ( 1 - 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) italic_n, the number of G𝐺Gitalic_G-connected α1superscript𝛼1\alpha^{-1}italic_α start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT-loaded sets SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) with |S|=k𝑆𝑘|S|=k| italic_S | = italic_k is less than nek𝑛superscripte𝑘n\mathrm{e}^{-\sqrt{k}}italic_n roman_e start_POSTSUPERSCRIPT - square-root start_ARG italic_k end_ARG end_POSTSUPERSCRIPT.

  3. (S3)S3(\mathrm{S}3)( S3 )

    No set SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) with |S|αn𝑆𝛼𝑛|S|\geq\alpha n| italic_S | ≥ italic_α italic_n is D𝐷Ditalic_D-loaded in G𝐺Gitalic_G.

Note that, for k>log2n𝑘superscript2𝑛k>\log^{2}nitalic_k > roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n, one has that nek<1𝑛superscripte𝑘1n\mathrm{e}^{-\sqrt{k}}<1italic_n roman_e start_POSTSUPERSCRIPT - square-root start_ARG italic_k end_ARG end_POSTSUPERSCRIPT < 1, and thus the conditions on an n𝑛nitalic_n-vertex graph G𝐺Gitalic_G being an (α,D)𝛼𝐷(\alpha,D)( italic_α , italic_D )-spreader graph guarantee that there are no G𝐺Gitalic_G-connected vertex subsets of size between log2nsuperscript2𝑛\log^{2}nroman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n and (11/D2)n11superscript𝐷2𝑛(1-1/D^{2})n( 1 - 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) italic_n that have too few edges leaving the set (S1)S1(\mathrm{S}1)( S1 ) or too many edges contained inside the set (S2)S2(\mathrm{S}2)( S2 ). These pseudo-random conditions on expansion and edge distribution arise naturally in the context of random graph models. Indeed, the density of a random graph within any vertex set and across any vertex partition is expected to be the same as the density of the whole graph, and concentration inequalities in conjunction with union bounds can be used to derive the non-existence of such bad connected vertex sets with high probability. Moreover, the conditions of Definition 1.3 bound the number of bad vertex sets of size between (logn)1/5superscript𝑛15(\log n)^{1/5}( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT and log2nsuperscript2𝑛\log^{2}nroman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n, with exponential decay as one can expect from concentration inequalities on binomial random variables. In the context of the current work, the conditions on spreader graphs will guarantee that all bottlenecks are small and they are scarce in the graph.

To digest the notion of spreader graphs, one can think of α>0𝛼0\alpha>0italic_α > 0 as an arbitrarily small constant and D𝐷Ditalic_D as arbitrarily large. The parameter α>0𝛼0\alpha>0italic_α > 0 controls conditions (S1)S1(\mathrm{S}1)( S1 ) and (S2)S2(\mathrm{S}2)( S2 ) in the sense that, as α𝛼\alphaitalic_α shrinks, these conditions become easier to satisfy and thus the definition of spreader graphs captures more graphs. Similarly, the parameter D𝐷Ditalic_D controls (S3)S3(\mathrm{S}3)( S3 ) and imposes in particular that the spreader graphs are sparse with bounded average degree. It should be noted that, due to D𝐷Ditalic_D appearing in (S1)S1(\mathrm{S}1)( S1 ) and (S2)S2(\mathrm{S}2)( S2 ) and α𝛼\alphaitalic_α appearing in (S3)S3(\mathrm{S}3)( S3 ), our definition is not actually monotone in these parameters. This is a technical subtlety that is needed in our proof to guarantee a trade-off between the conditions. However, in all applications, the restraints given by D𝐷Ditalic_D in (S1)S1(\mathrm{S}1)( S1 ) and (S2)S2(\mathrm{S}2)( S2 ) and α𝛼\alphaitalic_α in (S3)S3(\mathrm{S}3)( S3 ) are never critical, as we have very good control over the edge distribution in all linear sets.

We also remark that the constant 1/5151/51 / 5 could be replaced by any constant ζ<1/4𝜁14\zeta<1/4italic_ζ < 1 / 4. Indeed, for sets smaller than (logn)ζsuperscript𝑛𝜁(\log n)^{\zeta}( roman_log italic_n ) start_POSTSUPERSCRIPT italic_ζ end_POSTSUPERSCRIPT, we impose no restriction. The point is that, as we will focus on connected spreader graphs G𝐺Gitalic_G, even if a small set is an extreme bottleneck, the random walk will not get stuck there for too long before exploring the set enough to escape. In our proof, these bottlenecks contribute a factor (logn)4ζsuperscript𝑛4𝜁(\log n)^{4\zeta}( roman_log italic_n ) start_POSTSUPERSCRIPT 4 italic_ζ end_POSTSUPERSCRIPT (due to a connected set of size s𝑠sitalic_s having conductance at least 1/s21superscript𝑠21/s^{2}1 / italic_s start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and Theorem 2.1 giving a quadratic dependence on conductance, see Section 2.2 for details), hence a choice of ζ<1/4𝜁14\zeta<1/4italic_ζ < 1 / 4 guarantees that this contribution is negligible. It may be possible to replace this constraint of 1/4141/41 / 4 by 1/2121/21 / 2 but not beyond this.

Finally, we remark that the constants α𝛼\alphaitalic_α and D𝐷Ditalic_D could be replaced with functions that depend on n𝑛nitalic_n and the definition of spreader graphs could be adjusted so that our main theorem would still give bounds on average mixing times. However, as our focus is on sparse graphs with constant average degree, we do not pursue this direction here.

Remark 1.4.

The definition of (α,D)𝛼𝐷(\alpha,D)( italic_α , italic_D )-spreader graphs bears resemblance with that of α𝛼\alphaitalic_α-AN graphs (or α𝛼\alphaitalic_α-decorated expanders) introduced in [5]. An α𝛼\alphaitalic_α-AN graph G𝐺Gitalic_G is defined in terms of the existence of an expander subgraph B𝐵Bitalic_B whose complement is formed by a small number of small components, similar to what can be deduced from (S1)italic-S1(\mathrm{S}1)( S1 )-(S3)italic-S3(\mathrm{S}3)( S3 ), and additionally requiring that not too many components of GB𝐺𝐵G-Bitalic_G - italic_B are connected to each vV(B)𝑣𝑉𝐵v\in V(B)italic_v ∈ italic_V ( italic_B ). The backbone of the main result in [5] is to show that random walks on α𝛼\alphaitalic_α-AN graphs mix in O(log2n)𝑂superscript2𝑛O(\log^{2}n)italic_O ( roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n ) steps.

Our main theorem provides a tool to prove logarithmic average mixing time for (α,D)𝛼𝐷(\alpha,D)( italic_α , italic_D )-spreader graphs.

Theorem 1.5.

For all ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0, D4𝐷4D\geq 4italic_D ≥ 4 and 0<α<1/D20𝛼1superscript𝐷20<\alpha<1/D^{2}0 < italic_α < 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, there exists a C>0𝐶0C>0italic_C > 0 such that the following holds for all n𝑛nitalic_n sufficiently large. Suppose G𝐺Gitalic_G is an n𝑛nitalic_n-vertex connected (α,D)𝛼𝐷(\alpha,D)( italic_α , italic_D )-spreader graph. Then,

t¯mix(G,ϵ)Clogn.subscript¯𝑡mix𝐺italic-ϵ𝐶𝑛\bar{t}_{\mathrm{mix}}(G,\epsilon)\leq C\log n.over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G , italic_ϵ ) ≤ italic_C roman_log italic_n .

We believe that in many cases, as in our two applications below, this theorem can be used to quickly derive optimal bounds for average mixing times in settings where worst-case mixing times are established via conductance bounds.

The proof of Theorem 1.5 bears some similarities with the proof in [6]. Both use the idea of contracting badly connected sets and coupling the random walks in the original and the contracted graphs. However, our proof is conceptually simpler as it does not use the anatomy of the giant component [16], a powerful description of the largest component in the supercritical regime. Instead, we rely on the Fountoulakis-Reed bound for mixing [23] and recent progress on hitting time lemmas [35].

1.3. Application 1: Smoothed analysis on connected graphs

The idea of studying the effect of random perturbations on a given structure arose naturally in several distinct settings. In theoretical computer science, Spielman and Teng [40] (see also [41]) introduced the notion of smoothed analysis of algorithms. By randomly perturbing an input to an algorithm, they could interpolate between a worst-time case analysis and an average case analysis, leading to a better understanding of the practical performance of algorithms on real life instances. This has been hugely influential, leading to the study of smoothed analysis in a host of different settings, including numerical analysis [39, 43], satisfiability [22, 14], data clustering [3], multilinear algebra [7] and machine learning [29]. Almost simultaneously, in graph theory, Bohman, Frieze and Martin [8] introduced the model of randomly perturbed graphs which, as with smoothed analysis, allows one to understand the interplay between an extremal and probabilistic viewpoint. The majority of work on the subject has focused on dense graphs [10, 11, 26].

In the context of random walk mixing, it can be seen that small random perturbations cannot speed up the mixing time on dense graphs significantly. Indeed, the canonical examples leading to torpid mixing (e.g., two cliques connected by a long path) are robust with respect to that property. Smoothed analysis of sparse graphs was introduced by Krivelevich, Reichman and Samotij [31]. Here one starts with a connected graph of bounded degree (in fact, bounded degeneracy often suffices) and applies a small random perturbation by adding a copy of the binomial random graph RG(n,δ/n)similar-to𝑅𝐺𝑛𝛿𝑛R\sim G(n,{\delta}/{n})italic_R ∼ italic_G ( italic_n , italic_δ / italic_n ) for small δ>0𝛿0\delta>0italic_δ > 0. Although this perturbation is very slight, they showed that it greatly improves the expansion properties of the graph. A graph G𝐺Gitalic_G is said to be ΔΔ\Deltaroman_Δ-degenerate if there is some ordering of the vertices of G𝐺Gitalic_G such that each vertex has at most ΔΔ\Deltaroman_Δ neighbours in G𝐺Gitalic_G that precede it in the ordering. To be precise, Krivelevich, Reichman and Samotij proved that, for any ΔΔ\Delta\in\mathbb{N}roman_Δ ∈ blackboard_N and δ>0𝛿0\delta>0italic_δ > 0, if G𝐺Gitalic_G is an n𝑛nitalic_n-vertex ΔΔ\Deltaroman_Δ-degenerate connected graph and RG(n,δ/n)similar-to𝑅𝐺𝑛𝛿𝑛R\sim G(n,{\delta}/{n})italic_R ∼ italic_G ( italic_n , italic_δ / italic_n ), then GGRsuperscript𝐺𝐺𝑅G^{\prime}\coloneqq G\cup Ritalic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≔ italic_G ∪ italic_R a.a.s. satisfies that tmix(G)=O(log2n)subscript𝑡mixsuperscript𝐺𝑂superscript2𝑛t_{\mathrm{mix}}(G^{\prime})=O(\log^{2}n)italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = italic_O ( roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n ). By considering, for example, a path on n𝑛nitalic_n vertices, which has mixing time Ω(n2)Ωsuperscript𝑛2\Omega(n^{2})roman_Ω ( italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ), we see a vast improvement after a slight random perturbation. We also note that the result is tight on such examples, as the randomly perturbed path a.a.s. contains bare paths of length Ω(logn)Ω𝑛\Omega(\log n)roman_Ω ( roman_log italic_n ).

Our first application of Theorem 1.5 shows that we can improve the mixing time yet further in this model by starting from a uniformly chosen vertex, as in this case we avoid the small bottlenecks that remain if the initial graph had poor expansion.

Theorem 1.6.

For any ϵ,δ>0italic-ϵ𝛿0\epsilon,\delta>0italic_ϵ , italic_δ > 0 and Δnormal-Δ\Delta\in\mathbb{N}roman_Δ ∈ blackboard_N, there exists a C>0𝐶0C>0italic_C > 0 such that the following holds. Let G𝐺Gitalic_G be an n𝑛nitalic_n-vertex Δnormal-Δ\Deltaroman_Δ-degenerate connected graph, choose RG(n,δ/n)similar-to𝑅𝐺𝑛𝛿𝑛R\sim G(n,{\delta}/{n})italic_R ∼ italic_G ( italic_n , italic_δ / italic_n ) and let GGRnormal-≔superscript𝐺normal-′𝐺𝑅G^{\prime}\coloneqq G\cup Ritalic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≔ italic_G ∪ italic_R. Then, a.a.s.

t¯mix(G,ϵ)Clogn.subscript¯𝑡mixsuperscript𝐺italic-ϵ𝐶𝑛\bar{t}_{\mathrm{mix}}(G^{\prime},\epsilon)\leq C\log n.over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_ϵ ) ≤ italic_C roman_log italic_n .
Remark 1.7.

Theorem 1.6 is tight, up to the constant factor C𝐶Citalic_C, for all graphs with maximum degree Δnormal-Δ\Deltaroman_Δ. Indeed, this follows from the fact that Nk(v)=O((2d¯)klogn)superscript𝑁𝑘𝑣𝑂superscript2normal-¯𝑑𝑘𝑛N^{k}(v)=O((2\bar{d})^{k}\log n)italic_N start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ( italic_v ) = italic_O ( ( 2 over¯ start_ARG italic_d end_ARG ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT roman_log italic_n ) for all vertices vV(G)𝑣𝑉superscript𝐺normal-′v\in V(G^{\prime})italic_v ∈ italic_V ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ), where Nk(v)superscript𝑁𝑘𝑣N^{k}(v)italic_N start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ( italic_v ) denotes the number of vertices that are at distance at most k𝑘kitalic_k from v𝑣vitalic_v in Gsuperscript𝐺normal-′G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT and d¯Δ+δnormal-≔normal-¯𝑑normal-Δ𝛿\bar{d}\coloneqq\Delta+\deltaover¯ start_ARG italic_d end_ARG ≔ roman_Δ + italic_δ is an upper bound on the average degree in Gsuperscript𝐺normal-′G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. Such an upper bound can be shown easily by induction, see for example [13], and setting k=clogn𝑘𝑐𝑛k=c\log nitalic_k = italic_c roman_log italic_n for c>0𝑐0c>0italic_c > 0 sufficiently small shows that at least half of the vertices cannot be reached from v𝑣vitalic_v in k𝑘kitalic_k steps and hence t¯mix(G,ϵ)ksubscriptnormal-¯𝑡normal-mixsuperscript𝐺normal-′italic-ϵ𝑘\bar{t}_{\mathrm{mix}}(G^{\prime},\epsilon)\geq kover¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_ϵ ) ≥ italic_k. Nonetheless, the converse of the inequality in Theorem 1.6 is not true for all Δnormal-Δ\Deltaroman_Δ-degenerate graphs. Consider for instance a star: it is 1111-degenerate, but the mixing time of the randomly perturbed star is O(1)𝑂1O(1)italic_O ( 1 ) as we mix in the step after visiting the centre of the star for the first time.

Some time before the systematic study of random perturbations in the combinatorial and theoretical computer science communities discussed above, the notion appeared in physics literature with the study of so-called small-world networks. Here we will concentrate on a model introduced by Newman and Watts [37, 36] where, for some fixed k𝑘k\in\mathbb{N}italic_k ∈ blackboard_N, δ>0𝛿0\delta>0italic_δ > 0 and n𝑛n\in\mathbb{N}italic_n ∈ blackboard_N large, one starts with n𝑛nitalic_n-vertices of the graph ordered as v1,,vnsubscript𝑣1subscript𝑣𝑛v_{1},\ldots,v_{n}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, adds all edges vivjsubscript𝑣𝑖subscript𝑣𝑗v_{i}v_{j}italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT for which i+1ji+k𝑖1𝑗𝑖𝑘i+1\leq j\leq i+kitalic_i + 1 ≤ italic_j ≤ italic_i + italic_k (with addition modulo n𝑛nitalic_n), and then adds all remaining edges independently with probability p=δ/n𝑝𝛿𝑛p=\delta/nitalic_p = italic_δ / italic_n. We denote the resulting random graph as Hn,k,δsubscript𝐻𝑛𝑘𝛿H_{n,k,\delta}italic_H start_POSTSUBSCRIPT italic_n , italic_k , italic_δ end_POSTSUBSCRIPT. It is easy to see that this graph fits into the framework of Krivelevich, Reichman and Samotij [31], and so their result implies that, for any k𝑘k\in\mathbb{N}italic_k ∈ blackboard_N and δ>0𝛿0\delta>0italic_δ > 0, the Newman-Watts small world network Hn,k,δsubscript𝐻𝑛𝑘𝛿H_{n,k,\delta}italic_H start_POSTSUBSCRIPT italic_n , italic_k , italic_δ end_POSTSUBSCRIPT a.a.s. satisfies tmix(Hn,k,δ)=O(log2n)subscript𝑡mixsubscript𝐻𝑛𝑘𝛿𝑂superscript2𝑛t_{\mathrm{mix}}(H_{n,k,\delta})=O(\log^{2}n)italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_H start_POSTSUBSCRIPT italic_n , italic_k , italic_δ end_POSTSUBSCRIPT ) = italic_O ( roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n ). In fact, this was established before their work by Addario-Berry and Lei [1], improving on a previous bound of O(log3n)𝑂superscript3𝑛O(\log^{3}n)italic_O ( roman_log start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_n ) due to Durrett [18]. Here, as a direct consequence of Theorem 1.6, we conclude that the average mixing time on the Newman-Watts small world network is of order O(logn)𝑂𝑛O(\log n)italic_O ( roman_log italic_n ).

Corollary 1.8.

For all k𝑘k\in\mathbb{N}italic_k ∈ blackboard_N and ϵ,δ>0italic-ϵ𝛿0\epsilon,\delta>0italic_ϵ , italic_δ > 0, there exists a C>0𝐶0C>0italic_C > 0 such that the following holds. The Newman-Watts small world network Hn,k,δsubscript𝐻𝑛𝑘𝛿H_{n,k,\delta}italic_H start_POSTSUBSCRIPT italic_n , italic_k , italic_δ end_POSTSUBSCRIPT a.a.s. satisfies

t¯mix(Hn,k,δ,ϵ)Clogn.subscript¯𝑡mixsubscript𝐻𝑛𝑘𝛿italic-ϵ𝐶𝑛\bar{t}_{\mathrm{mix}}(H_{n,k,\delta},\epsilon)\leq C\log n.over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_H start_POSTSUBSCRIPT italic_n , italic_k , italic_δ end_POSTSUBSCRIPT , italic_ϵ ) ≤ italic_C roman_log italic_n .

1.4. Application 2: Giant components in random subgraphs of expanders

For p[0,1]𝑝01p\in[0,1]italic_p ∈ [ 0 , 1 ] and a graph G𝐺Gitalic_G, we define Gpsubscript𝐺𝑝G_{p}italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT to be the graph with the same vertex set where each edge of G𝐺Gitalic_G is retained in Gpsubscript𝐺𝑝G_{p}italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT independently with probability p𝑝pitalic_p. The graph G𝐺Gitalic_G is called the host graph, and the random subgraph Gpsubscript𝐺𝑝G_{p}italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT, the p𝑝pitalic_p-percolated one. Percolation on graphs is a well-established topic in probability theory. Most classically, if the host graph is the complete graph on n𝑛nitalic_n vertices Knsubscript𝐾𝑛K_{n}italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, then its p𝑝pitalic_p-percolated subgraph is the Erdős-Rényi graph G(n,p)𝐺𝑛𝑝G(n,p)italic_G ( italic_n , italic_p ). For any graph G𝐺Gitalic_G, let L1(G)subscript𝐿1𝐺L_{1}(G)italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_G ) denote a largest connected component in G𝐺Gitalic_G and let 1(G)subscript1𝐺\ell_{1}(G)roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_G ) denote its order. In their seminal paper [21], Erdős and Rényi proved a phase transition for 1(G(n,p))subscript1𝐺𝑛𝑝\ell_{1}(G(n,p))roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_G ( italic_n , italic_p ) ). Namely, writing p=c/n𝑝𝑐𝑛p=c/nitalic_p = italic_c / italic_n for some constant c𝑐citalic_c, if c<1𝑐1c<1italic_c < 1 then a.a.s. 1(G(n,p))=O(logn)subscript1𝐺𝑛𝑝𝑂𝑛\ell_{1}(G(n,p))=O(\log n)roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_G ( italic_n , italic_p ) ) = italic_O ( roman_log italic_n ), while if c>1𝑐1c>1italic_c > 1 then a.a.s. 1(G(n,p))=Ω(n)subscript1𝐺𝑛𝑝Ω𝑛\ell_{1}(G(n,p))=\Omega(n)roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_G ( italic_n , italic_p ) ) = roman_Ω ( italic_n ) and the largest component, which is the unique component of linear size, is known as the giant component.

A central question in random graph theory is whether other host graphs exhibit the same phenomenon [2]. One quickly observes that, in order for Gpsubscript𝐺𝑝G_{p}italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT to have a sharp threshold for the component structure, the host graph G𝐺Gitalic_G should satisfy some additional properties. A natural property to consider is the pseudo-random notion of expansion. There is a strong connection between expansion and the graph spectrum. Given the eigenvalues of the adjacency matrix of a d𝑑ditalic_d-regular graph G𝐺Gitalic_G, say d=λ1λ2λn𝑑subscript𝜆1subscript𝜆2subscript𝜆𝑛d=\lambda_{1}\geq\lambda_{2}\geq\ldots\geq\lambda_{n}italic_d = italic_λ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≥ italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≥ … ≥ italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, we let λ(G)max{|λ2|,|λn|}𝜆𝐺subscript𝜆2subscript𝜆𝑛\lambda(G)\coloneqq\max\{|\lambda_{2}|,|\lambda_{n}|\}italic_λ ( italic_G ) ≔ roman_max { | italic_λ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | , | italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | } be the second largest eigenvalue. We then define an (n,d,λ)𝑛𝑑𝜆(n,d,\lambda)( italic_n , italic_d , italic_λ )-graph to be a d𝑑ditalic_d-regular graph G𝐺Gitalic_G on n𝑛nitalic_n vertices with λ(G)=λ𝜆𝐺𝜆\lambda(G)=\lambdaitalic_λ ( italic_G ) = italic_λ. When λ𝜆\lambdaitalic_λ is small compared to d𝑑ditalic_d, an (n,d,λ)𝑛𝑑𝜆(n,d,\lambda)( italic_n , italic_d , italic_λ )-graph is said to be an expander and it enjoys many of the same properties as a random graph with the same density. We refer the reader to the excellent survey of Krivelevich and Sudakov [32] on the subject.

In terms of percolation, Frieze, Krivelevich and Martin [25] proved that, if G𝐺Gitalic_G is an (n,d,λ)𝑛𝑑𝜆(n,d,\lambda)( italic_n , italic_d , italic_λ )-graph with λ=o(d)𝜆𝑜𝑑\lambda=o(d)italic_λ = italic_o ( italic_d ), then 1(Gp)subscript1subscript𝐺𝑝\ell_{1}(G_{p})roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) undergoes a phase transition at p=1/d𝑝1𝑑p=1/ditalic_p = 1 / italic_d. They obtained the following description of the supercritical regime: for δ>0𝛿0\delta>0italic_δ > 0 and p=(1+δ)/d𝑝1𝛿𝑑p=(1+\delta)/ditalic_p = ( 1 + italic_δ ) / italic_d, and provided that λδ4d𝜆superscript𝛿4𝑑\lambda\leq\delta^{4}ditalic_λ ≤ italic_δ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_d, a.a.s. 1(Gp)y(δ)nsimilar-tosubscript1subscript𝐺𝑝𝑦𝛿𝑛\ell_{1}(G_{p})\sim y(\delta)nroman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) ∼ italic_y ( italic_δ ) italic_n for some y(δ)(0,1)𝑦𝛿01y(\delta)\in(0,1)italic_y ( italic_δ ) ∈ ( 0 , 1 ). Moreover, as in G(n,p)𝐺𝑛𝑝G(n,p)italic_G ( italic_n , italic_p ), the largest component L1(Gp)subscript𝐿1subscript𝐺𝑝L_{1}(G_{p})italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) is a.a.s. the unique component of linear size. Very recently, Diskin and Krivelevich [17] studied the mixing time of percolated (n,d,λ)𝑛𝑑𝜆(n,d,\lambda)( italic_n , italic_d , italic_λ )-graphs. More precisely, they showed that, in the supercritical regime, there exists C=C(δ)𝐶𝐶𝛿C=C(\delta)italic_C = italic_C ( italic_δ ) such that a.a.s. tmix(L1(Gp))Clog2nsubscript𝑡mixsubscript𝐿1subscript𝐺𝑝𝐶superscript2𝑛t_{\mathrm{mix}}(L_{1}(G_{p}))\leq C\log^{2}nitalic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) ) ≤ italic_C roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n. This is indeed optimal for some graphs, in particular for Erdős-Rényi random graphs [24, 5], as discussed above.

Our next application of our main result shows that for percolated pseudo-random graphs the average mixing time is logarithmic.

Theorem 1.9.

For all δ>0𝛿0\delta>0italic_δ > 0 sufficiently small and all ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0, there exists a C>0𝐶0C>0italic_C > 0 such that, if p=(1+δ)/d𝑝1𝛿𝑑p=(1+\delta)/{d}italic_p = ( 1 + italic_δ ) / italic_d and G𝐺Gitalic_G is an (n,d,λ)𝑛𝑑𝜆(n,d,\lambda)( italic_n , italic_d , italic_λ )-graph with λδ4d𝜆superscript𝛿4𝑑\lambda\leq\delta^{4}ditalic_λ ≤ italic_δ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_d, then a.a.s.

t¯mix(L1(Gp),ϵ)Clogn.subscript¯𝑡mixsubscript𝐿1subscript𝐺𝑝italic-ϵ𝐶𝑛\bar{t}_{\mathrm{mix}}(L_{1}(G_{p}),\epsilon)\leq C\log n.over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) , italic_ϵ ) ≤ italic_C roman_log italic_n .

Similarly as in Remark 1.7, it can be proven that Theorem 1.9 is tight up to multiplicative constant for all (n,d,λ)𝑛𝑑𝜆(n,d,\lambda)( italic_n , italic_d , italic_λ )-graphs.

As a consequence, for G=Kn𝐺subscript𝐾𝑛G=K_{n}italic_G = italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT we obtain the following.

Corollary 1.10.

For all δ>0𝛿0\delta>0italic_δ > 0 sufficiently small and all ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0, there exists a C>0𝐶0C>0italic_C > 0 such that, for p=(1+δ)/n𝑝1𝛿𝑛p=(1+\delta)/nitalic_p = ( 1 + italic_δ ) / italic_n, a.a.s.

t¯mix(L1(G(n,p)),ϵ)Clogn.subscript¯𝑡mixsubscript𝐿1𝐺𝑛𝑝italic-ϵ𝐶𝑛\bar{t}_{\mathrm{mix}}(L_{1}(G(n,p)),\epsilon)\leq C\log n.over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_G ( italic_n , italic_p ) ) , italic_ϵ ) ≤ italic_C roman_log italic_n .

By Remark 1.1, tmix(Un)=O(logn)subscriptsuperscript𝑡subscript𝑈𝑛mix𝑂𝑛t^{(U_{n})}_{\mathrm{mix}}=O(\log{n})italic_t start_POSTSUPERSCRIPT ( italic_U start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT = italic_O ( roman_log italic_n ) a.a.s., where Unsubscript𝑈𝑛U_{n}italic_U start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is chosen uniformly at random from V(G)𝑉𝐺V(G)italic_V ( italic_G ). This result aligns with [6], although theirs is much stronger, showing cut-off for tmix(Un)subscriptsuperscript𝑡subscript𝑈𝑛mixt^{(U_{n})}_{\mathrm{mix}}italic_t start_POSTSUPERSCRIPT ( italic_U start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT as previously mentioned.

1.5. Organisation

The rest of this paper is organised as follows. In Section 2 we introduce all the necessary notation, definitions and tools for our proofs. We use these to prove Theorem 1.5 in Section 3. This section is structured in subsections where we build different tools to be used in our main proof; in particular, we discuss the main ideas of the proof in Section 3.1. Sections 4 and 5 are devoted to proving Theorems 1.6 and 1.9, respectively. Finally, we discuss some open problems in Section 6.

2. Preliminaries

2.1. Basic notation

Let 0={0,1,2,}subscript0012\mathbb{N}_{0}=\{0,1,2,\ldots\}blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = { 0 , 1 , 2 , … } denote the set of non-negative integers. If n𝑛nitalic_n is a positive integer, we set [n]{1,,n}delimited-[]𝑛1𝑛[n]\coloneqq\{1,\ldots,n\}[ italic_n ] ≔ { 1 , … , italic_n }. Throughout, we will consider both simple graphs and multigraphs. The word graph will refer to simple graphs, that is, each pair of vertices forms at most one edge. Our multigraphs, which will be allowed to have parallel edges but no loops, will be clearly identified as such. All our graphs are labelled, so whenever we discuss an n𝑛nitalic_n-vertex (multi)graph G𝐺Gitalic_G, we implicitly assume that V(G)=[n]𝑉𝐺delimited-[]𝑛V(G)=[n]italic_V ( italic_G ) = [ italic_n ]. Given a (multi)graph G=(V,E)𝐺𝑉𝐸G=(V,E)italic_G = ( italic_V , italic_E ) and disjoint sets A,BV(G)𝐴𝐵𝑉𝐺A,B\subseteq V(G)italic_A , italic_B ⊆ italic_V ( italic_G ), we write EG(A)subscript𝐸𝐺𝐴E_{G}(A)italic_E start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_A ) for the (multi)set of edges of G𝐺Gitalic_G contained in A𝐴Aitalic_A, and EG(A,B)subscript𝐸𝐺𝐴𝐵E_{G}(A,B)italic_E start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_A , italic_B ) for the (multi)set of edges with one endpoint in A𝐴Aitalic_A and the other in B𝐵Bitalic_B. We set eG(A)|EG(A)|subscript𝑒𝐺𝐴subscript𝐸𝐺𝐴e_{G}(A)\coloneqq|E_{G}(A)|italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_A ) ≔ | italic_E start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_A ) | and eG(A,B)|EG(A,B)|subscript𝑒𝐺𝐴𝐵subscript𝐸𝐺𝐴𝐵e_{G}(A,B)\coloneqq|E_{G}(A,B)|italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_A , italic_B ) ≔ | italic_E start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_A , italic_B ) |. If A={a}𝐴𝑎A=\{a\}italic_A = { italic_a }, we write EG(a,B)EG({a},B)subscript𝐸𝐺𝑎𝐵subscript𝐸𝐺𝑎𝐵E_{G}(a,B)\coloneqq E_{G}(\{a\},B)italic_E start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_a , italic_B ) ≔ italic_E start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( { italic_a } , italic_B ), and similarly in all related notation. For simplicity, we write e(G)eG(V(G))𝑒𝐺subscript𝑒𝐺𝑉𝐺e(G)\coloneqq e_{G}(V(G))italic_e ( italic_G ) ≔ italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_V ( italic_G ) ). We write G[A](A,EG(A))𝐺delimited-[]𝐴𝐴subscript𝐸𝐺𝐴G[A]\coloneqq(A,E_{G}(A))italic_G [ italic_A ] ≔ ( italic_A , italic_E start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_A ) ). We say that A𝐴Aitalic_A is G𝐺Gitalic_G-connected if G[A]𝐺delimited-[]𝐴G[A]italic_G [ italic_A ] is connected. For each vertex vV(G)𝑣𝑉𝐺v\in V(G)italic_v ∈ italic_V ( italic_G ), we write degG(v)eG({v},V(G){v})subscriptdeg𝐺𝑣subscript𝑒𝐺𝑣𝑉𝐺𝑣\operatorname{deg}_{G}(v)\coloneqq e_{G}(\{v\},V(G)\setminus\{v\})roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ) ≔ italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( { italic_v } , italic_V ( italic_G ) ∖ { italic_v } ) for its degree. We denote G(A)EG(A,V(G)A)subscript𝐺𝐴subscript𝐸𝐺𝐴𝑉𝐺𝐴\partial_{G}(A)\coloneqq E_{G}(A,V(G)\setminus A)∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_A ) ≔ italic_E start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_A , italic_V ( italic_G ) ∖ italic_A ) and degG(A)vAdegG(v)=2eG(A)+|G(A)|subscriptdeg𝐺𝐴subscript𝑣𝐴subscriptdeg𝐺𝑣2subscript𝑒𝐺𝐴subscript𝐺𝐴\operatorname{deg}_{G}(A)\coloneqq\sum_{v\in A}\operatorname{deg}_{G}(v)=2e_{G% }(A)+|\partial_{G}(A)|roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_A ) ≔ ∑ start_POSTSUBSCRIPT italic_v ∈ italic_A end_POSTSUBSCRIPT roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ) = 2 italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_A ) + | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_A ) |.

In many of our statements we will consider an n𝑛nitalic_n-vertex graph satisfying a set of conditions or a conclusion, which are often asymptotic in nature. This is in fact an abuse of notation. To be precise, one must consider a sequence (Gk)k1subscriptsubscript𝐺𝑘𝑘1(G_{k})_{k\geq 1}( italic_G start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k ≥ 1 end_POSTSUBSCRIPT of graphs on an increasing number of vertices so that the graphs in the sequence satisfy the conditions. This abuse of notation greatly simplifies the statements, so we will assume it throughout. (This also includes any asymptotic statements about random graphs.) For any sequence of graphs (Gk)k1subscriptsubscript𝐺𝑘𝑘1(G_{k})_{k\geq 1}( italic_G start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k ≥ 1 end_POSTSUBSCRIPT with |V(Gk)|𝑉subscript𝐺𝑘|V(G_{k})|\to\infty| italic_V ( italic_G start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) | → ∞, we say that a graph property 𝒫𝒫\mathcal{P}caligraphic_P holds asymptotically almost surely (a.a.s.) if limk[Gk𝒫]=1subscript𝑘delimited-[]subscript𝐺𝑘𝒫1\lim_{k\to\infty}\mathbb{P}[G_{k}\in\mathcal{P}]=1roman_lim start_POSTSUBSCRIPT italic_k → ∞ end_POSTSUBSCRIPT blackboard_P [ italic_G start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ caligraphic_P ] = 1.

2.2. Random walks

Given an arbitrary connected multigraph G𝐺Gitalic_G, the lazy random walk over G𝐺Gitalic_G is a Markov chain on state space V(G)𝑉𝐺V(G)italic_V ( italic_G ) defined by the transition matrix PG=(PG(i,j))i,jV(G)subscript𝑃𝐺subscriptsubscript𝑃𝐺𝑖𝑗𝑖𝑗𝑉𝐺P_{G}=(P_{G}(i,j))_{i,j\in V(G)}italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT = ( italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_i , italic_j ) ) start_POSTSUBSCRIPT italic_i , italic_j ∈ italic_V ( italic_G ) end_POSTSUBSCRIPT given by

PG(i,j)={1/2if i=j,eG(i,j)/(2degG(i))if ij.subscript𝑃𝐺𝑖𝑗cases12if 𝑖𝑗subscript𝑒𝐺𝑖𝑗2subscriptdeg𝐺𝑖if 𝑖𝑗P_{G}(i,j)=\begin{cases}1/2&\text{if }i=j,\\ e_{G}(i,j)/(2\operatorname{deg}_{G}(i))&\text{if }i\neq j.\end{cases}italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_i , italic_j ) = { start_ROW start_CELL 1 / 2 end_CELL start_CELL if italic_i = italic_j , end_CELL end_ROW start_ROW start_CELL italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_i , italic_j ) / ( 2 roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_i ) ) end_CELL start_CELL if italic_i ≠ italic_j . end_CELL end_ROW (2.1)

That is, the lazy random walk is a sequence of random variables (Xt)t0subscriptsubscript𝑋𝑡𝑡0(X_{t})_{t\geq 0}( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ≥ 0 end_POSTSUBSCRIPT with probability distributions (μt)t0subscriptsubscript𝜇𝑡𝑡0(\mu_{t})_{t\geq 0}( italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ≥ 0 end_POSTSUBSCRIPT, respectively, over V(G)𝑉𝐺V(G)italic_V ( italic_G ), where μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is the starting distribution and, for each t1𝑡1t\geq 1italic_t ≥ 1, the distribution of μtsubscript𝜇𝑡\mu_{t}italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is obtained from the distribution of μt1subscript𝜇𝑡1\mu_{t-1}italic_μ start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT as μt=μt1PG=μ0PGtsubscript𝜇𝑡subscript𝜇𝑡1subscript𝑃𝐺subscript𝜇0superscriptsubscript𝑃𝐺𝑡\mu_{t}=\mu_{t-1}P_{G}=\mu_{0}P_{G}^{t}italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_μ start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT = italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT. The sequence of distributions thus depends only on G𝐺Gitalic_G and the starting distribution. In the special case when there is a vertex xV(G)𝑥𝑉𝐺x\in V(G)italic_x ∈ italic_V ( italic_G ) such that μ0(x)=1subscript𝜇0𝑥1\mu_{0}(x)=1italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_x ) = 1, we will write (μtx)t0subscriptsuperscriptsubscript𝜇𝑡𝑥𝑡0(\mu_{t}^{x})_{t\geq 0}( italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ) start_POSTSUBSCRIPT italic_t ≥ 0 end_POSTSUBSCRIPT to denote the resulting sequence of distributions.

If G𝐺Gitalic_G is connected, the lazy random walk over G𝐺Gitalic_G converges to a stationary distribution πGsubscript𝜋𝐺\pi_{G}italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT (that is, a distribution satisfying πG=πGPGsubscript𝜋𝐺subscript𝜋𝐺subscript𝑃𝐺\pi_{G}=\pi_{G}P_{G}italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT = italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT), independently of the starting distribution μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. It is well known (see, e.g., [33]) that this stationary distribution satisfies

πG(u)=degG(u)2e(G)subscript𝜋𝐺𝑢subscriptdeg𝐺𝑢2𝑒𝐺\pi_{G}(u)=\frac{\operatorname{deg}_{G}(u)}{2e(G)}italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_u ) = divide start_ARG roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_u ) end_ARG start_ARG 2 italic_e ( italic_G ) end_ARG (2.2)

for all uV(G)𝑢𝑉𝐺u\in V(G)italic_u ∈ italic_V ( italic_G ). Given a set SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ), we define πG(S)vSπG(v)subscript𝜋𝐺𝑆subscript𝑣𝑆subscript𝜋𝐺𝑣\pi_{G}(S)\coloneqq\sum_{v\in S}\pi_{G}(v)italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) ≔ ∑ start_POSTSUBSCRIPT italic_v ∈ italic_S end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ). It follows from (2.2) that

πG(S)=degG(S)2e(G)=2eG(S)+|G(S)|2e(G).subscript𝜋𝐺𝑆subscriptdeg𝐺𝑆2𝑒𝐺2subscript𝑒𝐺𝑆subscript𝐺𝑆2𝑒𝐺\pi_{G}(S)=\frac{\operatorname{deg}_{G}(S)}{2e(G)}=\frac{2e_{G}(S)+|\partial_{% G}(S)|}{2e(G)}.italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) = divide start_ARG roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) end_ARG start_ARG 2 italic_e ( italic_G ) end_ARG = divide start_ARG 2 italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) + | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG start_ARG 2 italic_e ( italic_G ) end_ARG . (2.3)

We define πmin(G)minvV(G)πG(v)subscript𝜋𝐺subscript𝑣𝑉𝐺subscript𝜋𝐺𝑣\pi_{\min}(G)\coloneqq\min_{v\in V(G)}\pi_{G}(v)italic_π start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( italic_G ) ≔ roman_min start_POSTSUBSCRIPT italic_v ∈ italic_V ( italic_G ) end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ) and πmax(G)maxvV(G)πG(v)subscript𝜋𝐺subscript𝑣𝑉𝐺subscript𝜋𝐺𝑣\pi_{\max}(G)\coloneqq\max_{v\in V(G)}\pi_{G}(v)italic_π start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( italic_G ) ≔ roman_max start_POSTSUBSCRIPT italic_v ∈ italic_V ( italic_G ) end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ).

Recall the definition of mixing times in the introduction. The mixing time of a random walk on a connected (multi)graph G𝐺Gitalic_G is deeply tied with the concept of conductance. Given a set SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ), we define

QG(S)iSjV(G)SπG(i)PG(i,j)=|G(S)|4e(G),subscript𝑄𝐺𝑆subscript𝑖𝑆subscript𝑗𝑉𝐺𝑆subscript𝜋𝐺𝑖subscript𝑃𝐺𝑖𝑗subscript𝐺𝑆4𝑒𝐺Q_{G}(S)\coloneqq\sum_{i\in S}\sum_{j\in V(G)\setminus S}\pi_{G}(i)P_{G}(i,j)=% \frac{|\partial_{G}(S)|}{4e(G)},italic_Q start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) ≔ ∑ start_POSTSUBSCRIPT italic_i ∈ italic_S end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_j ∈ italic_V ( italic_G ) ∖ italic_S end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_i ) italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_i , italic_j ) = divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG start_ARG 4 italic_e ( italic_G ) end_ARG , (2.4)

where the equality follows from (2.1) and (2.2). Observe that QG(S)=QG(V(G)S)subscript𝑄𝐺𝑆subscript𝑄𝐺𝑉𝐺𝑆Q_{G}(S)=Q_{G}(V(G)\setminus S)italic_Q start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) = italic_Q start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_V ( italic_G ) ∖ italic_S ). Finally, we define the conductance ΦG(S)subscriptΦ𝐺𝑆\Phi_{G}(S)roman_Φ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) of S𝑆Sitalic_S as

ΦG(S)QG(S)πG(S)πG(V(G)S).subscriptΦ𝐺𝑆subscript𝑄𝐺𝑆subscript𝜋𝐺𝑆subscript𝜋𝐺𝑉𝐺𝑆\Phi_{G}(S)\coloneqq\frac{Q_{G}(S)}{\pi_{G}(S)\pi_{G}(V(G)\setminus S)}.roman_Φ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) ≔ divide start_ARG italic_Q start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) end_ARG start_ARG italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_V ( italic_G ) ∖ italic_S ) end_ARG . (2.5)

From the definitions in (2.3), (2.4) and (2.5) and the fact that degG(A)2e(G)subscriptdeg𝐺𝐴2𝑒𝐺\operatorname{deg}_{G}(A)\leq 2e(G)roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_A ) ≤ 2 italic_e ( italic_G ) for any set AV(G)𝐴𝑉𝐺A\subseteq V(G)italic_A ⊆ italic_V ( italic_G ), it follows that

ΦG(S)=e(G)|G(S)|degG(S)degG(V(G)S)|G(S)|2degG(S).subscriptΦ𝐺𝑆𝑒𝐺subscript𝐺𝑆subscriptdeg𝐺𝑆subscriptdeg𝐺𝑉𝐺𝑆subscript𝐺𝑆2subscriptdeg𝐺𝑆\Phi_{G}(S)=\frac{e(G)|\partial_{G}(S)|}{\operatorname{deg}_{G}(S)% \operatorname{deg}_{G}(V(G)\setminus S)}\geq\frac{|\partial_{G}(S)|}{2% \operatorname{deg}_{G}(S)}.roman_Φ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) = divide start_ARG italic_e ( italic_G ) | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG start_ARG roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_V ( italic_G ) ∖ italic_S ) end_ARG ≥ divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG start_ARG 2 roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) end_ARG . (2.6)

Our approach to estimate the mixing time of the lazy random walk over a multigraph G𝐺Gitalic_G is based on ideas of Fountoulakis and Reed [23, 24]. Roughly speaking, their main contribution is the fact that the mixing time of an abstract irreducible, reversible, aperiodic Markov chain (which we may represent using a weighted graph H𝐻Hitalic_H on its state space) can be bounded from above using the conductances of different H𝐻Hitalic_H-connected sets of states of various sizes. The fact that we may restrict ourselves to H𝐻Hitalic_H-connected sets is crucial to obtain tighter bounds than would be obtained through other classical means. For simplicity, here we only state a version of the result of Fountoulakis and Reed [23] which is applicable to our setting. For any p(πmin(G),1)𝑝subscript𝜋𝐺1p\in(\pi_{\min}(G),1)italic_p ∈ ( italic_π start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( italic_G ) , 1 ), we let ΦG(p)subscriptΦ𝐺𝑝\Phi_{G}(p)roman_Φ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_p ) be the minimum conductance ΦG(S)subscriptΦ𝐺𝑆\Phi_{G}(S)roman_Φ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) over all G𝐺Gitalic_G-connected sets SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) such that p/2πG(S)p𝑝2subscript𝜋𝐺𝑆𝑝p/2\leq\pi_{G}(S)\leq pitalic_p / 2 ≤ italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) ≤ italic_p (if no such set S𝑆Sitalic_S exists, we set ΦG(p)=1subscriptΦ𝐺𝑝1\Phi_{G}(p)=1roman_Φ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_p ) = 1).

Theorem 2.1 (Fountoulakis and Reed [23]).

Let G𝐺Gitalic_G be a connected multigraph. There exists an absolute constant C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT such that

tmix(G)C0j=1log2πmin(G)1ΦG2(2j).subscript𝑡mix𝐺subscript𝐶0superscriptsubscript𝑗1subscript2subscript𝜋superscript𝐺1superscriptsubscriptΦ𝐺2superscript2𝑗t_{\mathrm{mix}}(G)\leq C_{0}\sum_{j=1}^{\lceil\log_{2}\pi_{\min}(G)^{-1}% \rceil}\Phi_{G}^{-2}(2^{-j}).italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G ) ≤ italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⌈ roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( italic_G ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ⌉ end_POSTSUPERSCRIPT roman_Φ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT ( 2 start_POSTSUPERSCRIPT - italic_j end_POSTSUPERSCRIPT ) .

Another parameter of interest is the hitting time to a vertex (or set of vertices) in the random walk on a multigraph G𝐺Gitalic_G. Given any vV(G)𝑣𝑉𝐺v\in V(G)italic_v ∈ italic_V ( italic_G ) and the lazy random walk (Xt)t0subscriptsubscript𝑋𝑡𝑡0(X_{t})_{t\geq 0}( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ≥ 0 end_POSTSUBSCRIPT with starting distribution μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, we define the hitting time to v𝑣vitalic_v as

τG(μ0,v)inf{t0:Xt=v,X0μ0}.subscript𝜏𝐺subscript𝜇0𝑣infimumconditional-set𝑡subscript0formulae-sequencesubscript𝑋𝑡𝑣similar-tosubscript𝑋0subscript𝜇0\tau_{G}(\mu_{0},v)\coloneqq\inf\{t\in\mathbb{N}_{0}:X_{t}=v,X_{0}\sim\mu_{0}\}.italic_τ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_v ) ≔ roman_inf { italic_t ∈ blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_v , italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∼ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } .

In more generality, given any set SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ), we define the hitting time to S𝑆Sitalic_S as

τG(μ0,S)inf{t0:XtS,X0μ0}.subscript𝜏𝐺subscript𝜇0𝑆infimumconditional-set𝑡subscript0formulae-sequencesubscript𝑋𝑡𝑆similar-tosubscript𝑋0subscript𝜇0\tau_{G}(\mu_{0},S)\coloneqq\inf\{t\in\mathbb{N}_{0}:X_{t}\in S,X_{0}\sim\mu_{% 0}\}.italic_τ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_S ) ≔ roman_inf { italic_t ∈ blackboard_N start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ italic_S , italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∼ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } .

Given any vertex uV(G)𝑢𝑉𝐺u\in V(G)italic_u ∈ italic_V ( italic_G ), let Pusubscript𝑃𝑢P_{u}italic_P start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT be the matrix obtained from the transition matrix PGsubscript𝑃𝐺P_{G}italic_P start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT by removing the row and column corresponding to u𝑢uitalic_u. If Pusubscript𝑃𝑢P_{u}italic_P start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT is primitive (i.e., all entries of Pumsuperscriptsubscript𝑃𝑢𝑚P_{u}^{m}italic_P start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT are positive for some m1𝑚1m\geq 1italic_m ≥ 1), by Perron-Frobenius, the largest eigenvalue of Pusubscript𝑃𝑢P_{u}italic_P start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT, denoted by λusubscript𝜆𝑢\lambda_{u}italic_λ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT, is real, of multiplicity 1111 and satisfies λu<1subscript𝜆𝑢1\lambda_{u}<1italic_λ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT < 1.

We will make use of the first visit time lemma of Cooper and Frieze [15]. Here we state a more recent version with weaker hypotheses due to Manzo, Quattropani and Scoppola [35].

Theorem 2.2 (First Visit Time Lemma, Manzo, Quattropani and Scoppola [35]).

Let G𝐺Gitalic_G be an n𝑛nitalic_n-vertex connected multigraph. Suppose that there exist a real number c>2𝑐2c>2italic_c > 2 and a diverging sequence T=T(n)𝑇𝑇𝑛T=T(n)italic_T = italic_T ( italic_n ) such that the following conditions hold:

  1. (HP1)HP1(\mathrm{HP}1)( HP1 )

    Fast mixing: maxx,yV(G)|μTx(y)πG(y)|=o(nc)subscript𝑥𝑦𝑉𝐺superscriptsubscript𝜇𝑇𝑥𝑦subscript𝜋𝐺𝑦𝑜superscript𝑛𝑐\max_{x,y\in V(G)}|\mu_{T}^{x}(y)-\pi_{G}(y)|=o(n^{-c})roman_max start_POSTSUBSCRIPT italic_x , italic_y ∈ italic_V ( italic_G ) end_POSTSUBSCRIPT | italic_μ start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT ( italic_y ) - italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_y ) | = italic_o ( italic_n start_POSTSUPERSCRIPT - italic_c end_POSTSUPERSCRIPT ).

  2. (HP2)HP2(\mathrm{HP}2)( HP2 )

    Small πmaxsubscript𝜋\pi_{\max}italic_π start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT: Tπmax(G)=o(1)𝑇subscript𝜋𝐺𝑜1T\cdot\pi_{\max}(G)=o(1)italic_T ⋅ italic_π start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( italic_G ) = italic_o ( 1 ).

  3. (HP3)HP3(\mathrm{HP}3)( HP3 )

    Large πminsubscript𝜋\pi_{\min}italic_π start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT: πmin(G)=ω(n2)subscript𝜋𝐺𝜔superscript𝑛2\pi_{\min}(G)=\omega(n^{-2})italic_π start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( italic_G ) = italic_ω ( italic_n start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT ).

Then, for all uV(G)𝑢𝑉𝐺u\in V(G)italic_u ∈ italic_V ( italic_G ), we have

supt0|[τG(πG,u)>t]λut1|n0𝑛absentsubscriptsupremum𝑡0delimited-[]subscript𝜏𝐺subscript𝜋𝐺𝑢𝑡superscriptsubscript𝜆𝑢𝑡10\sup_{t\geq 0}\left\lvert\frac{\mathbb{P}[\tau_{G}(\pi_{G},u)>t]}{\lambda_{u}^% {t}}-1\right\rvert\xrightarrow[n\to\infty]{}0roman_sup start_POSTSUBSCRIPT italic_t ≥ 0 end_POSTSUBSCRIPT | divide start_ARG blackboard_P [ italic_τ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_u ) > italic_t ] end_ARG start_ARG italic_λ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT end_ARG - 1 | start_ARROW start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARROW start_OVERACCENT end_OVERACCENT → end_ARROW end_ARROW 0 (2.7)

and

|1λuπG(u)/RT(u)1|n0,𝑛absent1subscript𝜆𝑢subscript𝜋𝐺𝑢subscript𝑅𝑇𝑢10\left\lvert\frac{1-\lambda_{u}}{\pi_{G}(u)/R_{T}(u)}-1\right\rvert\xrightarrow% [n\to\infty]{}0,| divide start_ARG 1 - italic_λ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_ARG start_ARG italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_u ) / italic_R start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ( italic_u ) end_ARG - 1 | start_ARROW start_UNDERACCENT italic_n → ∞ end_UNDERACCENT start_ARROW start_OVERACCENT end_OVERACCENT → end_ARROW end_ARROW 0 , (2.8)

where

RT(u)t=0Tμtu(u)>1subscript𝑅𝑇𝑢superscriptsubscript𝑡0𝑇superscriptsubscript𝜇𝑡𝑢𝑢1R_{T}(u)\coloneqq\sum_{t=0}^{T}\mu_{t}^{u}(u)>1italic_R start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ( italic_u ) ≔ ∑ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_u end_POSTSUPERSCRIPT ( italic_u ) > 1

is the expected number of indices t[T]{0}𝑡delimited-[]𝑇0t\in[T]\cup\{0\}italic_t ∈ [ italic_T ] ∪ { 0 } for which the lazy random walk (Xt)t0subscriptsubscript𝑋𝑡𝑡0(X_{t})_{t\geq 0}( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ≥ 0 end_POSTSUBSCRIPT on G𝐺Gitalic_G starting at X0=usubscript𝑋0𝑢X_{0}=uitalic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_u satisfies Xt=usubscript𝑋𝑡𝑢X_{t}=uitalic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_u.

From the intuitive point of view, the theorem says that the hitting time to u𝑢uitalic_u is roughly distributed as a geometric random variable with success probability πG(u)/RT(u)subscript𝜋𝐺𝑢subscript𝑅𝑇𝑢\pi_{G}(u)/R_{T}(u)italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_u ) / italic_R start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ( italic_u ). If one wants to hit u𝑢uitalic_u by independently sampling vertices according to πGsubscript𝜋𝐺\pi_{G}italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT, then it would be a geometric random variable with success probability πG(u)subscript𝜋𝐺𝑢\pi_{G}(u)italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_u ). The factor RT(u)subscript𝑅𝑇𝑢R_{T}(u)italic_R start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ( italic_u ) is the price to pay for taking into account the geometry of the graph: the more likely it is to return from u𝑢uitalic_u to u𝑢uitalic_u, the less connected u𝑢uitalic_u is to the rest of the graph, and the smaller the probability to hit it at a given (large) time is.

Remark 2.3.

In the proof of Theorem 2.2, one can check that, if we only want (2.7) to hold for a given uV(G)𝑢𝑉𝐺u\in V(G)italic_u ∈ italic_V ( italic_G ), then (HP2)italic-HP2(\mathrm{HP}2)( HP2 ) can be replaced by

  1. (HP2)superscriptHP2(\mathrm{HP}2^{\prime})( HP2 start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT )

    Small πG(u)subscript𝜋𝐺𝑢\pi_{G}(u)italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_u ): TπG(u)=o(1)𝑇subscript𝜋𝐺𝑢𝑜1T\cdot\pi_{G}(u)=o(1)italic_T ⋅ italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_u ) = italic_o ( 1 ).

Remark 2.4.

The following holds as a corollary of Theorem 2.2. For any fixed D>0𝐷0D>0italic_D > 0 and n𝑛nitalic_n-vertex connected multigraph G𝐺Gitalic_G satisfying (HP1)italic-HP1(\mathrm{HP}1)( HP1 ), (HP2)italic-HP2(\mathrm{HP}2)( HP2 ) (or (HP2)superscriptitalic-HP2italic-′(\mathrm{HP}2^{\prime})( HP2 start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT )) and (HP3)italic-HP3(\mathrm{HP}3)( HP3 ) with the additional property that e(G)Dn𝑒𝐺𝐷𝑛e(G)\leq Dnitalic_e ( italic_G ) ≤ italic_D italic_n, if uV(G)𝑢𝑉𝐺u\in V(G)italic_u ∈ italic_V ( italic_G ) and t0=t0(n)subscript𝑡0subscript𝑡0𝑛t_{0}=t_{0}(n)italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_n ) is such that λut0=1o(1)superscriptsubscript𝜆𝑢subscript𝑡01𝑜1\lambda_{u}^{t_{0}}=1-o(1)italic_λ start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT = 1 - italic_o ( 1 ), then

1nvV(G)[τG(μ0v,u)t0]=o(1).1𝑛subscript𝑣𝑉𝐺delimited-[]subscript𝜏𝐺superscriptsubscript𝜇0𝑣𝑢subscript𝑡0𝑜1\frac{1}{n}\sum_{v\in V(G)}\mathbb{P}[\tau_{G}(\mu_{0}^{v},u)\leq t_{0}]=o(1).divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V ( italic_G ) end_POSTSUBSCRIPT blackboard_P [ italic_τ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT , italic_u ) ≤ italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ] = italic_o ( 1 ) . (2.9)

Indeed, for ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0, Theorem 2.2 implies that [τG(πG,u)t0]ϵ2delimited-[]subscript𝜏𝐺subscript𝜋𝐺𝑢subscript𝑡0superscriptitalic-ϵ2\mathbb{P}[\tau_{G}(\pi_{G},u)\leq t_{0}]\leq\epsilon^{2}blackboard_P [ italic_τ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_u ) ≤ italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ] ≤ italic_ϵ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Then, Bϵ{vV(G):[τG(μ0v,u)t0]ϵ}normal-≔subscript𝐵italic-ϵconditional-set𝑣𝑉𝐺delimited-[]subscript𝜏𝐺superscriptsubscript𝜇0𝑣𝑢subscript𝑡0italic-ϵB_{\epsilon}\coloneqq\{v\in V(G):\mathbb{P}[\tau_{G}(\mu_{0}^{v},u)\leq t_{0}]% \geq\epsilon\}italic_B start_POSTSUBSCRIPT italic_ϵ end_POSTSUBSCRIPT ≔ { italic_v ∈ italic_V ( italic_G ) : blackboard_P [ italic_τ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT , italic_u ) ≤ italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ] ≥ italic_ϵ } satisfies πG(B)ϵsubscript𝜋𝐺𝐵italic-ϵ\pi_{G}(B)\leq\epsilonitalic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_B ) ≤ italic_ϵ. Moreover, as G𝐺Gitalic_G is connected, we have that πG(B)|B|/(2e(G))|B|/(2Dn)subscript𝜋𝐺𝐵𝐵2𝑒𝐺𝐵2𝐷𝑛\pi_{G}(B)\geq|B|/(2e(G))\geq|B|/(2Dn)italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_B ) ≥ | italic_B | / ( 2 italic_e ( italic_G ) ) ≥ | italic_B | / ( 2 italic_D italic_n ) and so |B|2ϵDn𝐵2italic-ϵ𝐷𝑛|B|\leq 2\epsilon Dn| italic_B | ≤ 2 italic_ϵ italic_D italic_n. Therefore,

vV(G)[τG(μ0v,u)t0]subscript𝑣𝑉𝐺delimited-[]subscript𝜏𝐺superscriptsubscript𝜇0𝑣𝑢subscript𝑡0\displaystyle\sum_{v\in V(G)}\mathbb{P}[\tau_{G}(\mu_{0}^{v},u)\leq t_{0}]∑ start_POSTSUBSCRIPT italic_v ∈ italic_V ( italic_G ) end_POSTSUBSCRIPT blackboard_P [ italic_τ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT , italic_u ) ≤ italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ] =vV(G)B[τG(μ0v,u)t0]+vB[τG(μ0v,u)t0]absentsubscript𝑣𝑉𝐺𝐵delimited-[]subscript𝜏𝐺superscriptsubscript𝜇0𝑣𝑢subscript𝑡0subscript𝑣𝐵delimited-[]subscript𝜏𝐺superscriptsubscript𝜇0𝑣𝑢subscript𝑡0\displaystyle=\sum_{v\in V(G)\setminus B}\mathbb{P}[\tau_{G}(\mu_{0}^{v},u)% \leq t_{0}]+\sum_{v\in B}\mathbb{P}[\tau_{G}(\mu_{0}^{v},u)\leq t_{0}]= ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V ( italic_G ) ∖ italic_B end_POSTSUBSCRIPT blackboard_P [ italic_τ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT , italic_u ) ≤ italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ] + ∑ start_POSTSUBSCRIPT italic_v ∈ italic_B end_POSTSUBSCRIPT blackboard_P [ italic_τ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT , italic_u ) ≤ italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ]
(2D+1)ϵn,absent2𝐷1italic-ϵ𝑛\displaystyle\leq(2D+1)\epsilon n,≤ ( 2 italic_D + 1 ) italic_ϵ italic_n ,

which can be made arbitrarily small, by taking ϵitalic-ϵ\epsilonitalic_ϵ small with respect to D𝐷Ditalic_D. This establishes (2.9).

3. A general approach to average mixing times

3.1. Proof overview

As discussed in the introduction, the main tool we will use to bound the mixing times of random walks is the result of Fountoulakis and Reed (Theorem 2.1) which relates the (worst-case) mixing time of a random walk in a graph G𝐺Gitalic_G to the conductance (see (2.5)) of the G𝐺Gitalic_G-connected vertex subsets S𝑆Sitalic_S of G𝐺Gitalic_G. We think of vertex subsets S𝑆Sitalic_S whose conductance is poor (those for which ΦG(S)=o(1)subscriptΦ𝐺𝑆𝑜1\Phi_{G}(S)=o(1)roman_Φ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) = italic_o ( 1 )) as bottlenecks: they have more edges internally in S𝑆Sitalic_S than leaving S𝑆Sitalic_S and so the random walk is likely to get held up in S𝑆Sitalic_S. The spreader graphs (see Definition 1.3) we are interested in studying here have only few small bottlenecks. Indeed, any vertex subset which can lead to small bottlenecks must either be thin or loaded and our upper bounds on the number of these sets in a spreader graph readily imply that any vertex set with poor conductance is of at most polylogarithmic size (size (log2n)superscript2𝑛(\log^{2}n)( roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n ) to be precise, see Remark 3.3). Now, if a set with poor conductance is very small (size at most (logn)1/5superscript𝑛15(\log n)^{1/5}( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT), it will not slow down mixing significantly as our random walk will not get stuck for very long in these sets before leaving them. Therefore, it is the intermediate size sets which pose a problem, and we will first show in Section 3.2 (see Lemma 3.4) that the set U𝑈Uitalic_U of bad vertices contained in some intermediate set which has poor conductance contains a negligible proportion of the overall vertex set of our spreader graph.

Intuitively, we can then see how starting at an average vertex in G𝐺Gitalic_G speeds up the mixing time. Indeed, we are very unlikely to start at a bad vertex in U𝑈Uitalic_U and, moreover, we are in fact very unlikely to visit a vertex in U𝑈Uitalic_U in the first O(logn)𝑂𝑛O(\log n)italic_O ( roman_log italic_n ) time steps, by which time we aim to show that the distribution of the random walk is already well-mixed. In order to formalise this intuition, we adjust our spreader graph G𝐺Gitalic_G by shrinking the intermediate sets with poor conductance and thus removing troublesome small bottlenecks. The resulting (multi-)graph we will call G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT. Using that the number of bad vertices |U|𝑈|U|| italic_U | is negligible, or rather that the number of edges incident to U𝑈Uitalic_U, degG(U)subscriptdeg𝐺𝑈\operatorname{deg}_{G}(U)roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_U ), is negligible (Lemma 3.4), we show in Section 3.3 that switching from G𝐺Gitalic_G to G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT does not have a big effect on the edge distribution and that, in particular, the stationary distributions on G𝐺Gitalic_G and G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT are comparable. We then show in Section 3.4 that, after contracting intermediate sets with poor conductance, we can apply Theorem 2.1 of Fountoulakis and Reed to conclude that the worst-case mixing time in G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is logarithmic. Here we will need that G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is defined carefully to preserve connectivity between sets after contractions (see (3.7)). Finally, we will prove Theorem 1.5 by coupling the random walk from an average vertex on G𝐺Gitalic_G with the random walk in G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT. As the random walk in G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT from any starting point mixes rapidly, we can conclude that the random walk in G𝐺Gitalic_G also mixes rapidly as long as the two random walks stay coupled for long enough. For this, our final ingredient is to show that the random walk in G𝐺Gitalic_G is unlikely to hit our bad vertices U𝑈Uitalic_U, which we do in Section 3.5 by appealing to the First Visit Time Lemma (Theorem 2.2) of Manzo, Quattropani and Scoppola.

3.2. Badly connected sets

We will make use of the following simple definition.

Definition 3.1.

For γ>0𝛾0\gamma>0italic_γ > 0 and a connected multigraph G𝐺Gitalic_G, we say a set SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) is γ𝛾\gammaitalic_γ-bad in G𝐺Gitalic_G if

|G(S)|degG(S)<γ,subscript𝐺𝑆subscriptdeg𝐺𝑆𝛾\frac{|\partial_{G}(S)|}{\operatorname{deg}_{G}(S)}<\gamma,divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG start_ARG roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) end_ARG < italic_γ ,

and that it is γ𝛾\gammaitalic_γ-good otherwise.

The following lemma gives us a basic property of bad sets in a connected multigraph and quickly ties them with our notion of (α,D)𝛼𝐷(\alpha,D)( italic_α , italic_D )-spreader graphs. Note that the notions of thin and loaded sets extend naturally to multigraphs.

Lemma 3.2.

Let G𝐺Gitalic_G be a multigraph. For 0<α10𝛼10<\alpha\leq 10 < italic_α ≤ 1, if a set SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) is (α2/4)superscript𝛼24({\alpha^{2}}/{4})( italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 4 )-bad in G𝐺Gitalic_G, then

  1. (1)1(1)( 1 )

    |G(S)|2eG(S)subscript𝐺𝑆2subscript𝑒𝐺𝑆|\partial_{G}(S)|\leq 2e_{G}(S)| ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | ≤ 2 italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ), and

  2. (2)2(2)( 2 )

    either S𝑆Sitalic_S is α𝛼\alphaitalic_α-thin in G𝐺Gitalic_G or it is α1superscript𝛼1\alpha^{-1}italic_α start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT-loaded in G𝐺Gitalic_G (or both).

Proof.

The assertion (1)1(1)( 1 ) follows easily since, if |G(S)|>2eG(S)subscript𝐺𝑆2subscript𝑒𝐺𝑆|\partial_{G}(S)|>2e_{G}(S)| ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | > 2 italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ), then

|G(S)|degG(S)=|G(S)|2eG(S)+|G(S)||G(S)|2|G(S)|=12α24,subscript𝐺𝑆subscriptdeg𝐺𝑆subscript𝐺𝑆2subscript𝑒𝐺𝑆subscript𝐺𝑆subscript𝐺𝑆2subscript𝐺𝑆12superscript𝛼24\frac{|\partial_{G}(S)|}{\operatorname{deg}_{G}(S)}=\frac{|\partial_{G}(S)|}{2% e_{G}(S)+|\partial_{G}(S)|}\geq\frac{|\partial_{G}(S)|}{2|\partial_{G}(S)|}=% \frac{1}{2}\geq\frac{\alpha^{2}}{4},divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG start_ARG roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) end_ARG = divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG start_ARG 2 italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) + | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG ≥ divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG start_ARG 2 | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG = divide start_ARG 1 end_ARG start_ARG 2 end_ARG ≥ divide start_ARG italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 4 end_ARG ,

a contradiction.

For the second assertion, suppose that S𝑆Sitalic_S is neither α𝛼\alphaitalic_α-thin nor α1superscript𝛼1\alpha^{-1}italic_α start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT-loaded in G𝐺Gitalic_G. Then, we have that

|G(S)|degG(S)=|G(S)|2eG(S)+|G(S)||G(S)|4eG(S)α|S|4α1|S|=α24,subscript𝐺𝑆subscriptdeg𝐺𝑆subscript𝐺𝑆2subscript𝑒𝐺𝑆subscript𝐺𝑆subscript𝐺𝑆4subscript𝑒𝐺𝑆𝛼𝑆4superscript𝛼1𝑆superscript𝛼24\frac{|\partial_{G}(S)|}{\operatorname{deg}_{G}(S)}=\frac{|\partial_{G}(S)|}{2% e_{G}(S)+|\partial_{G}(S)|}\geq\frac{|\partial_{G}(S)|}{4e_{G}(S)}\geq\frac{% \alpha|S|}{4\alpha^{-1}|S|}=\frac{\alpha^{2}}{4},divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG start_ARG roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) end_ARG = divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG start_ARG 2 italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) + | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG ≥ divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG start_ARG 4 italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) end_ARG ≥ divide start_ARG italic_α | italic_S | end_ARG start_ARG 4 italic_α start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT | italic_S | end_ARG = divide start_ARG italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 4 end_ARG ,

a contradiction. Here we used that |G(S)|2eG(S)subscript𝐺𝑆2subscript𝑒𝐺𝑆|\partial_{G}(S)|\leq 2e_{G}(S)| ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | ≤ 2 italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) from (1)1(1)( 1 ) in the first inequality and the definitions of α𝛼\alphaitalic_α-thin and α1superscript𝛼1\alpha^{-1}italic_α start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT-loaded in the second. ∎

We will also make use of the following simple observation.

Remark 3.3.

Let α,D>0𝛼𝐷0\alpha,D>0italic_α , italic_D > 0, and let G𝐺Gitalic_G be an n𝑛nitalic_n-vertex graph satisfying (S1)italic-S1(\mathrm{S}1)( S1 ) and (S2)italic-S2(\mathrm{S}2)( S2 ). Then, there are no G𝐺Gitalic_G-connected α𝛼\alphaitalic_α-thin or α1superscript𝛼1\alpha^{-1}italic_α start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT-loaded sets S𝑆Sitalic_S of size (logn)2|S|(11/D2)nsuperscript𝑛2𝑆11superscript𝐷2𝑛(\log n)^{2}\leq|S|\leq(1-1/D^{2})n( roman_log italic_n ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ | italic_S | ≤ ( 1 - 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) italic_n.

Given any α>0𝛼0\alpha>0italic_α > 0 and D>0𝐷0D>0italic_D > 0, we define β1/D2𝛽1superscript𝐷2\beta\coloneqq 1/D^{2}italic_β ≔ 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and γα2/4𝛾superscript𝛼24\gamma\coloneqq{\alpha^{2}}/{4}italic_γ ≔ italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 4. Given any n𝑛nitalic_n-vertex graph G𝐺Gitalic_G, let

𝒮(α,D){SV(G):S is G-connected and γ-bad,(logn)1/5|S|(1β)n}.𝒮𝛼𝐷conditional-set𝑆𝑉𝐺𝑆 is G-connected and 𝛾-badsuperscript𝑛15𝑆1𝛽𝑛\mathcal{S}(\alpha,D)\coloneqq\big{\{}S\subseteq V(G):S\text{ is $G$-connected% and }\gamma\text{-bad},(\log n)^{1/5}\leq|S|\leq(1-\beta)n\big{\}}.caligraphic_S ( italic_α , italic_D ) ≔ { italic_S ⊆ italic_V ( italic_G ) : italic_S is italic_G -connected and italic_γ -bad , ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT ≤ | italic_S | ≤ ( 1 - italic_β ) italic_n } . (3.1)

Further, we define U(α,D)V(G)𝑈𝛼𝐷𝑉𝐺U(\alpha,D)\subseteq V(G)italic_U ( italic_α , italic_D ) ⊆ italic_V ( italic_G ) to be the set of vertices that lie in sets in 𝒮(α,D)𝒮𝛼𝐷\mathcal{S}(\alpha,D)caligraphic_S ( italic_α , italic_D ), that is,

U(α,D)S𝒮(α,D)S.𝑈𝛼𝐷subscript𝑆𝒮𝛼𝐷𝑆U(\alpha,D)\coloneqq\bigcup_{S\in\mathcal{S}(\alpha,D)}S.italic_U ( italic_α , italic_D ) ≔ ⋃ start_POSTSUBSCRIPT italic_S ∈ caligraphic_S ( italic_α , italic_D ) end_POSTSUBSCRIPT italic_S . (3.2)

We now give bounds on |U(α,D)|𝑈𝛼𝐷|U(\alpha,D)|| italic_U ( italic_α , italic_D ) | and degG(U(α,D))subscriptdeg𝐺𝑈𝛼𝐷\operatorname{deg}_{G}(U(\alpha,D))roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_U ( italic_α , italic_D ) ).

Lemma 3.4.

Let α(0,1]𝛼01\alpha\in(0,1]italic_α ∈ ( 0 , 1 ] and D>0𝐷0D>0italic_D > 0. Let G𝐺Gitalic_G be an n𝑛nitalic_n-vertex connected graph which satisfies (S1)italic-S1(\mathrm{S}1)( S1 ) and (S2)italic-S2(\mathrm{S}2)( S2 ). If n𝑛nitalic_n is sufficiently large, then

|U(α,D)|nexp((logn)1/11)𝑈𝛼𝐷𝑛superscript𝑛111|U(\alpha,D)|\leq n\exp(-(\log n)^{1/11})| italic_U ( italic_α , italic_D ) | ≤ italic_n roman_exp ( - ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 11 end_POSTSUPERSCRIPT ) (3.3)

and

degG(U(α,D))nexp((logn)1/11).subscriptdeg𝐺𝑈𝛼𝐷𝑛superscript𝑛111\operatorname{deg}_{G}(U(\alpha,D))\leq n\exp(-(\log n)^{1/11}).roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_U ( italic_α , italic_D ) ) ≤ italic_n roman_exp ( - ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 11 end_POSTSUPERSCRIPT ) . (3.4)

In particular,

πG(U(α,D))exp((logn)1/11).subscript𝜋𝐺𝑈𝛼𝐷superscript𝑛111\pi_{G}(U(\alpha,D))\leq\exp(-(\log n)^{1/11}).italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_U ( italic_α , italic_D ) ) ≤ roman_exp ( - ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 11 end_POSTSUPERSCRIPT ) . (3.5)
Proof.

Let 𝒮=𝒮(α,D)𝒮𝒮𝛼𝐷\mathcal{S}=\mathcal{S}(\alpha,D)caligraphic_S = caligraphic_S ( italic_α , italic_D ) and U=U(α,D)𝑈𝑈𝛼𝐷U=U(\alpha,D)italic_U = italic_U ( italic_α , italic_D ). By Lemma 3.2(2)2(2)( 2 ), all sets S𝒮𝑆𝒮S\in\mathcal{S}italic_S ∈ caligraphic_S must be α𝛼\alphaitalic_α-thin or α1superscript𝛼1\alpha^{-1}italic_α start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT-loaded. By (S1)S1(\mathrm{S}1)( S1 ) and (S2)S2(\mathrm{S}2)( S2 ), for each (logn)1/5k(1β)nsuperscript𝑛15𝑘1𝛽𝑛(\log n)^{1/5}\leq k\leq(1-\beta)n( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT ≤ italic_k ≤ ( 1 - italic_β ) italic_n, the number of α𝛼\alphaitalic_α-thin or α1superscript𝛼1\alpha^{-1}italic_α start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT-loaded sets of size k𝑘kitalic_k in G𝐺Gitalic_G is less than 2nexp(k)2𝑛𝑘2n\exp(-\sqrt{k})2 italic_n roman_exp ( - square-root start_ARG italic_k end_ARG ). Moreover, as stated in Remark 3.3, there are no α𝛼\alphaitalic_α-thin or α1superscript𝛼1\alpha^{-1}italic_α start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT-loaded sets of size k(logn)2𝑘superscript𝑛2k\geq(\log n)^{2}italic_k ≥ ( roman_log italic_n ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Thus,

|𝒮|<2nk=(logn)1/5(logn)2ek2n(logn)2e(logn)1/10.𝒮2𝑛superscriptsubscript𝑘superscript𝑛15superscript𝑛2superscripte𝑘2𝑛superscript𝑛2superscriptesuperscript𝑛110|\mathcal{S}|<2n\sum_{k=(\log n)^{1/5}}^{(\log n)^{2}}\mathrm{e}^{-\sqrt{k}}% \leq 2n(\log n)^{2}\mathrm{e}^{-(\log n)^{1/10}}.| caligraphic_S | < 2 italic_n ∑ start_POSTSUBSCRIPT italic_k = ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( roman_log italic_n ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT roman_e start_POSTSUPERSCRIPT - square-root start_ARG italic_k end_ARG end_POSTSUPERSCRIPT ≤ 2 italic_n ( roman_log italic_n ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_e start_POSTSUPERSCRIPT - ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 10 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT . (3.6)

Therefore, by the definition of U𝑈Uitalic_U, the bound on the size of the largest S𝒮𝑆𝒮S\in\mathcal{S}italic_S ∈ caligraphic_S, and assuming n𝑛nitalic_n is sufficiently large, we conclude that

|U|S𝒮|S||𝒮|(logn)2<2n(logn)4e(logn)1/10ne(logn)1/11.𝑈subscript𝑆𝒮𝑆𝒮superscript𝑛22𝑛superscript𝑛4superscriptesuperscript𝑛110𝑛superscriptesuperscript𝑛111|U|\leq\sum_{S\in\mathcal{S}}|S|\leq|\mathcal{S}|(\log n)^{2}<2n(\log n)^{4}% \mathrm{e}^{-(\log n)^{1/10}}\leq n\mathrm{e}^{-(\log n)^{1/11}}.| italic_U | ≤ ∑ start_POSTSUBSCRIPT italic_S ∈ caligraphic_S end_POSTSUBSCRIPT | italic_S | ≤ | caligraphic_S | ( roman_log italic_n ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT < 2 italic_n ( roman_log italic_n ) start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT roman_e start_POSTSUPERSCRIPT - ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 10 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ≤ italic_n roman_e start_POSTSUPERSCRIPT - ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 11 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT .

We now turn our attention to (3.4). By the definition of U𝑈Uitalic_U, we have that

degG(U)S𝒮degG(S),subscriptdeg𝐺𝑈subscript𝑆𝒮subscriptdeg𝐺𝑆\operatorname{deg}_{G}(U)\leq\sum_{S\in\mathcal{S}}\operatorname{deg}_{G}(S),roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_U ) ≤ ∑ start_POSTSUBSCRIPT italic_S ∈ caligraphic_S end_POSTSUBSCRIPT roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) ,

and using Lemma 3.2(1)1(1)( 1 ) this simplifies to

degG(U)S𝒮4eG(S).subscriptdeg𝐺𝑈subscript𝑆𝒮4subscript𝑒𝐺𝑆\operatorname{deg}_{G}(U)\leq\sum_{S\in\mathcal{S}}4e_{G}(S).roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_U ) ≤ ∑ start_POSTSUBSCRIPT italic_S ∈ caligraphic_S end_POSTSUBSCRIPT 4 italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) .

Now, for each S𝒮𝑆𝒮S\in\mathcal{S}italic_S ∈ caligraphic_S, let b(S)𝑏𝑆b(S)italic_b ( italic_S ) be some G𝐺Gitalic_G-connected set such that (logn)2|b(S)|(logn)3superscript𝑛2𝑏𝑆superscript𝑛3(\log n)^{2}\leq|b(S)|\leq(\log n)^{3}( roman_log italic_n ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ | italic_b ( italic_S ) | ≤ ( roman_log italic_n ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT and Sb(S)𝑆𝑏𝑆S\subseteq b(S)italic_S ⊆ italic_b ( italic_S ). Note that this is possible because G𝐺Gitalic_G is connected and every S𝒮𝑆𝒮S\in\mathcal{S}italic_S ∈ caligraphic_S has size less than (logn)2superscript𝑛2(\log n)^{2}( roman_log italic_n ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. By (S2)S2(\mathrm{S}2)( S2 ) and the bounds on |b(S)|𝑏𝑆|b(S)|| italic_b ( italic_S ) |, for each S𝒮𝑆𝒮S\in\mathcal{S}italic_S ∈ caligraphic_S we have that

eG(b(S))α1|b(S)|α1(logn)3.subscript𝑒𝐺𝑏𝑆superscript𝛼1𝑏𝑆superscript𝛼1superscript𝑛3e_{G}(b(S))\leq\alpha^{-1}|b(S)|\leq\alpha^{-1}(\log n)^{3}.italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_b ( italic_S ) ) ≤ italic_α start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT | italic_b ( italic_S ) | ≤ italic_α start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( roman_log italic_n ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT .

Therefore, using (3.6) and for n𝑛nitalic_n sufficiently large,

degG(U)S𝒮4eG(S)S𝒮4eG(b(S))4α1(logn)3|𝒮|ne(logn)1/11.subscriptdeg𝐺𝑈subscript𝑆𝒮4subscript𝑒𝐺𝑆subscript𝑆𝒮4subscript𝑒𝐺𝑏𝑆4superscript𝛼1superscript𝑛3𝒮𝑛superscriptesuperscript𝑛111\operatorname{deg}_{G}(U)\leq\sum_{S\in\mathcal{S}}4e_{G}(S)\leq\sum_{S\in% \mathcal{S}}4e_{G}(b(S))\leq 4\alpha^{-1}(\log n)^{3}|\mathcal{S}|\leq n% \mathrm{e}^{-(\log n)^{1/11}}.roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_U ) ≤ ∑ start_POSTSUBSCRIPT italic_S ∈ caligraphic_S end_POSTSUBSCRIPT 4 italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) ≤ ∑ start_POSTSUBSCRIPT italic_S ∈ caligraphic_S end_POSTSUBSCRIPT 4 italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_b ( italic_S ) ) ≤ 4 italic_α start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( roman_log italic_n ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT | caligraphic_S | ≤ italic_n roman_e start_POSTSUPERSCRIPT - ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 11 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT .

In particular, since G𝐺Gitalic_G is connected (so e(G)n/2𝑒𝐺𝑛2e(G)\geq n/2italic_e ( italic_G ) ≥ italic_n / 2), it follows from (2.3) that

πG(U)=degG(U)2e(G)e(logn)1/11.subscript𝜋𝐺𝑈subscriptdeg𝐺𝑈2𝑒𝐺superscriptesuperscript𝑛111\pi_{G}(U)=\frac{\operatorname{deg}_{G}(U)}{2e(G)}\leq\mathrm{e}^{-(\log n)^{1% /11}}.\qeditalic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_U ) = divide start_ARG roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_U ) end_ARG start_ARG 2 italic_e ( italic_G ) end_ARG ≤ roman_e start_POSTSUPERSCRIPT - ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 11 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT . italic_∎

3.3. Stationary distributions

Let α>0𝛼0\alpha>0italic_α > 0 and D>0𝐷0D>0italic_D > 0 be given, let G𝐺Gitalic_G be some n𝑛nitalic_n-vertex graph, and consider the set U=U(α,D)𝑈𝑈𝛼𝐷U=U(\alpha,D)italic_U = italic_U ( italic_α , italic_D ). We now define G*=G*(α,D)superscript𝐺superscript𝐺𝛼𝐷G^{*}=G^{*}(\alpha,D)italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_α , italic_D ) to be the multigraph obtained by contracting all the connected components of G[U]𝐺delimited-[]𝑈G[U]italic_G [ italic_U ] to single vertices. To be more precise, let U=U1Ut𝑈subscript𝑈1subscript𝑈𝑡U=U_{1}\cup\ldots\cup U_{t}italic_U = italic_U start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ … ∪ italic_U start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT be a partition of U𝑈Uitalic_U into sets each of which induces a connected component in G[U]𝐺delimited-[]𝑈G[U]italic_G [ italic_U ] and let U*{u1,,ut}superscript𝑈subscript𝑢1subscript𝑢𝑡U^{*}\coloneqq\{u_{1},\ldots,u_{t}\}italic_U start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ≔ { italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT } be a set of t𝑡titalic_t new vertices. Then, let V(G*)=(V(G)U)U*𝑉superscript𝐺𝑉𝐺𝑈superscript𝑈V(G^{*})=(V(G)\setminus U)\cup U^{*}italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = ( italic_V ( italic_G ) ∖ italic_U ) ∪ italic_U start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT and, for each xV(G)𝑥𝑉𝐺x\in V(G)italic_x ∈ italic_V ( italic_G ), let

f(x)={xif xU,uiif xUiU.𝑓𝑥cases𝑥if 𝑥𝑈subscript𝑢𝑖if 𝑥subscript𝑈𝑖𝑈f(x)=\begin{cases}x&\text{if }x\notin U,\\ u_{i}&\text{if }x\in U_{i}\subseteq U.\end{cases}italic_f ( italic_x ) = { start_ROW start_CELL italic_x end_CELL start_CELL if italic_x ∉ italic_U , end_CELL end_ROW start_ROW start_CELL italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL start_CELL if italic_x ∈ italic_U start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⊆ italic_U . end_CELL end_ROW (3.7)

In particular, f(U)=U*𝑓𝑈superscript𝑈f(U)=U^{*}italic_f ( italic_U ) = italic_U start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT. Finally, we define the multiset

E(G*){f(x)f(y):xyE(G),f(x)f(y)}.𝐸superscript𝐺conditional-set𝑓𝑥𝑓𝑦formulae-sequence𝑥𝑦𝐸𝐺𝑓𝑥𝑓𝑦E(G^{*})\coloneqq\{f(x)f(y):xy\in E(G),f(x)\neq f(y)\}.italic_E ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≔ { italic_f ( italic_x ) italic_f ( italic_y ) : italic_x italic_y ∈ italic_E ( italic_G ) , italic_f ( italic_x ) ≠ italic_f ( italic_y ) } .

Observe that G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is connected if and only if G𝐺Gitalic_G is connected, and that U*superscript𝑈U^{*}italic_U start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT must be an independent set in G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT.

Given some connected graph G𝐺Gitalic_G, we want to compare the behaviour of the lazy random walk on G𝐺Gitalic_G and its contracted form G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT. In particular, we wish to compare their stationary distributions. In order to do this, we need to make them comparable by having them on the same state space. Let us describe this in full generality. Let H1=(V1,E1)subscript𝐻1subscript𝑉1subscript𝐸1H_{1}=(V_{1},E_{1})italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ( italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) and H2=(V2,E2)subscript𝐻2subscript𝑉2subscript𝐸2H_{2}=(V_{2},E_{2})italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = ( italic_V start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) be two connected multigraphs (possibly with V1V2subscript𝑉1subscript𝑉2V_{1}\cap V_{2}\neq\varnothingitalic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∩ italic_V start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≠ ∅). Then, we define an auxiliary multigraph HH1H2=(V1V2,E1E2)𝐻subscript𝐻1subscript𝐻2subscript𝑉1subscript𝑉2subscript𝐸1subscript𝐸2H\coloneqq H_{1}\cup H_{2}=(V_{1}\cup V_{2},E_{1}\cup E_{2})italic_H ≔ italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = ( italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ italic_V start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_E start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ italic_E start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) as the union of H1subscript𝐻1H_{1}italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and H2subscript𝐻2H_{2}italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. Given the stationary distributions πH1subscript𝜋subscript𝐻1\pi_{H_{1}}italic_π start_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT and πH2subscript𝜋subscript𝐻2\pi_{H_{2}}italic_π start_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, we define two distributions σ1subscript𝜎1\sigma_{1}italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and σ2subscript𝜎2\sigma_{2}italic_σ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT on H𝐻Hitalic_H, where, for each vV(H)𝑣𝑉𝐻v\in V(H)italic_v ∈ italic_V ( italic_H ),

σ1(v)subscript𝜎1𝑣\displaystyle\sigma_{1}(v)italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_v ) ={πH1(v)if vV1,0otherwise;absentcasessubscript𝜋subscript𝐻1𝑣if 𝑣subscript𝑉10otherwise\displaystyle=\begin{cases}\pi_{H_{1}}(v)&\text{if }v\in V_{1},\\ 0&\text{otherwise};\end{cases}= { start_ROW start_CELL italic_π start_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_v ) end_CELL start_CELL if italic_v ∈ italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL otherwise ; end_CELL end_ROW (3.8)
σ2(v)subscript𝜎2𝑣\displaystyle\sigma_{2}(v)italic_σ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_v ) ={πH2(v)if vV2,0otherwise.absentcasessubscript𝜋subscript𝐻2𝑣if 𝑣subscript𝑉20otherwise\displaystyle=\begin{cases}\pi_{H_{2}}(v)&\text{if }v\in V_{2},\\ 0&\text{otherwise}.\end{cases}= { start_ROW start_CELL italic_π start_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_v ) end_CELL start_CELL if italic_v ∈ italic_V start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL otherwise . end_CELL end_ROW

For the sake of notation, for any vertex vV(H)V1𝑣𝑉𝐻subscript𝑉1v\in V(H)\setminus V_{1}italic_v ∈ italic_V ( italic_H ) ∖ italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT we set degH1(v)=0subscriptdegsubscript𝐻1𝑣0\operatorname{deg}_{H_{1}}(v)=0roman_deg start_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_v ) = 0 and, similarly, for any vV(H)V2𝑣𝑉𝐻subscript𝑉2v\in V(H)\setminus V_{2}italic_v ∈ italic_V ( italic_H ) ∖ italic_V start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT we set degH2(v)=0subscriptdegsubscript𝐻2𝑣0\operatorname{deg}_{H_{2}}(v)=0roman_deg start_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_v ) = 0. With this setup, we abuse notation slightly and write dTV(πH1,πH2)subscript𝑑TVsubscript𝜋subscript𝐻1subscript𝜋subscript𝐻2d_{\mathrm{TV}}(\pi_{H_{1}},\pi_{H_{2}})italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) for dTV(σ1,σ2)subscript𝑑TVsubscript𝜎1subscript𝜎2d_{\mathrm{TV}}(\sigma_{1},\sigma_{2})italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_σ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_σ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ).

Lemma 3.5.

Let D1𝐷1D\geq 1italic_D ≥ 1 and 0<α<1/D20𝛼1superscript𝐷20<\alpha<1/D^{2}0 < italic_α < 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Let G𝐺Gitalic_G be an n𝑛nitalic_n-vertex connected graph which satisfies (S1)italic-S1(\mathrm{S}1)( S1 ) and (S2)italic-S2(\mathrm{S}2)( S2 ). If n𝑛nitalic_n is sufficiently large, we have that

dTV(πG,πG*)exp((logn)1/12).subscript𝑑TVsubscript𝜋𝐺subscript𝜋superscript𝐺superscript𝑛112d_{\mathrm{TV}}(\pi_{G},\pi_{G^{*}})\leq\exp(-(\log n)^{1/12}).italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) ≤ roman_exp ( - ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 12 end_POSTSUPERSCRIPT ) .
Proof.

Let U=U(α,D)𝑈𝑈𝛼𝐷U=U(\alpha,D)italic_U = italic_U ( italic_α , italic_D ), and let U*=f(U)V(G*)superscript𝑈𝑓𝑈𝑉superscript𝐺U^{*}=f(U)\subseteq V(G^{*})italic_U start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = italic_f ( italic_U ) ⊆ italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ). We also fix G~GG*~𝐺𝐺superscript𝐺\tilde{G}\coloneqq G\cup G^{*}over~ start_ARG italic_G end_ARG ≔ italic_G ∪ italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, and recall the distributions defined in (3.8). Observe that

vV(G~)|degG(v)degG*(v)|=vUdegG(v)+vU*degG*(v)2degG(U).subscript𝑣𝑉~𝐺subscriptdeg𝐺𝑣subscriptdegsuperscript𝐺𝑣subscript𝑣𝑈subscriptdeg𝐺𝑣subscript𝑣superscript𝑈subscriptdegsuperscript𝐺𝑣2subscriptdeg𝐺𝑈\sum_{v\in V(\tilde{G})}|{\operatorname{deg}_{G}(v)}-\operatorname{deg}_{G^{*}% }(v)|=\sum_{v\in U}\operatorname{deg}_{G}(v)+\sum_{v\in U^{*}}\operatorname{% deg}_{G^{*}}(v)\leq 2\operatorname{deg}_{G}(U).∑ start_POSTSUBSCRIPT italic_v ∈ italic_V ( over~ start_ARG italic_G end_ARG ) end_POSTSUBSCRIPT | roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ) - roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_v ) | = ∑ start_POSTSUBSCRIPT italic_v ∈ italic_U end_POSTSUBSCRIPT roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ) + ∑ start_POSTSUBSCRIPT italic_v ∈ italic_U start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_v ) ≤ 2 roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_U ) . (3.9)

The equality uses the fact that, for all vV(G)U𝑣𝑉𝐺𝑈v\in V(G)\setminus Uitalic_v ∈ italic_V ( italic_G ) ∖ italic_U, we have degG(v)=degG*(v)subscriptdeg𝐺𝑣subscriptdegsuperscript𝐺𝑣\operatorname{deg}_{G}(v)=\operatorname{deg}_{G^{*}}(v)roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ) = roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_v ). In the inequality, we simply note that vUdegG(v)=degG(U)subscript𝑣𝑈subscriptdeg𝐺𝑣subscriptdeg𝐺𝑈\sum_{v\in U}\operatorname{deg}_{G}(v)=\operatorname{deg}_{G}(U)∑ start_POSTSUBSCRIPT italic_v ∈ italic_U end_POSTSUBSCRIPT roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ) = roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_U ) by definition and that, by the definition of G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, this gives an upper bound for the second sum. Moreover, from (1.1) and the triangle inequality we have

dTV(πG,πG*)subscript𝑑TVsubscript𝜋𝐺subscript𝜋superscript𝐺\displaystyle d_{\mathrm{TV}}(\pi_{G},\pi_{G^{*}})italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) =12vV(G~)|degG(v)2e(G)degG*(v)2e(G*)|absent12subscript𝑣𝑉~𝐺subscriptdeg𝐺𝑣2𝑒𝐺subscriptdegsuperscript𝐺𝑣2𝑒superscript𝐺\displaystyle=\frac{1}{2}\sum_{v\in V(\tilde{G})}\left|\frac{\operatorname{deg% }_{G}(v)}{2e(G)}-\frac{\operatorname{deg}_{G^{*}}(v)}{2e(G^{*})}\right|= divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V ( over~ start_ARG italic_G end_ARG ) end_POSTSUBSCRIPT | divide start_ARG roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ) end_ARG start_ARG 2 italic_e ( italic_G ) end_ARG - divide start_ARG roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_v ) end_ARG start_ARG 2 italic_e ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) end_ARG |
12vV(G~)(|degG(v)degG*(v)2e(G)|+|degG*(v)2e(G)degG*(v)2e(G*)|).absent12subscript𝑣𝑉~𝐺subscriptdeg𝐺𝑣subscriptdegsuperscript𝐺𝑣2𝑒𝐺subscriptdegsuperscript𝐺𝑣2𝑒𝐺subscriptdegsuperscript𝐺𝑣2𝑒superscript𝐺\displaystyle\leq\frac{1}{2}\sum_{v\in V(\tilde{G})}\left(\left|\frac{% \operatorname{deg}_{G}(v)-\operatorname{deg}_{G^{*}}(v)}{2e(G)}\right|+\left|% \frac{\operatorname{deg}_{G^{*}}(v)}{2e(G)}-\frac{\operatorname{deg}_{G^{*}}(v% )}{2e(G^{*})}\right|\right).≤ divide start_ARG 1 end_ARG start_ARG 2 end_ARG ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V ( over~ start_ARG italic_G end_ARG ) end_POSTSUBSCRIPT ( | divide start_ARG roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ) - roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_v ) end_ARG start_ARG 2 italic_e ( italic_G ) end_ARG | + | divide start_ARG roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_v ) end_ARG start_ARG 2 italic_e ( italic_G ) end_ARG - divide start_ARG roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_v ) end_ARG start_ARG 2 italic_e ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) end_ARG | ) . (3.10)

The second term in the sum can be evaluated as

vV(G~)|degG*(v)2e(G)degG*(v)2e(G*)|subscript𝑣𝑉~𝐺subscriptdegsuperscript𝐺𝑣2𝑒𝐺subscriptdegsuperscript𝐺𝑣2𝑒superscript𝐺\displaystyle\sum_{v\in V(\tilde{G})}\left|\frac{\operatorname{deg}_{G^{*}}(v)% }{2e(G)}-\frac{\operatorname{deg}_{G^{*}}(v)}{2e(G^{*})}\right|∑ start_POSTSUBSCRIPT italic_v ∈ italic_V ( over~ start_ARG italic_G end_ARG ) end_POSTSUBSCRIPT | divide start_ARG roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_v ) end_ARG start_ARG 2 italic_e ( italic_G ) end_ARG - divide start_ARG roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_v ) end_ARG start_ARG 2 italic_e ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) end_ARG | =vV(G~)e(G)e(G*)2e(G)e(G*)degG*(v)absentsubscript𝑣𝑉~𝐺𝑒𝐺𝑒superscript𝐺2𝑒𝐺𝑒superscript𝐺subscriptdegsuperscript𝐺𝑣\displaystyle=\sum_{v\in V(\tilde{G})}\frac{e(G)-e(G^{*})}{2e(G)e(G^{*})}% \operatorname{deg}_{G^{*}}(v)= ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V ( over~ start_ARG italic_G end_ARG ) end_POSTSUBSCRIPT divide start_ARG italic_e ( italic_G ) - italic_e ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) end_ARG start_ARG 2 italic_e ( italic_G ) italic_e ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) end_ARG roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_v )
=e(G)e(G*)e(G)absent𝑒𝐺𝑒superscript𝐺𝑒𝐺\displaystyle=\frac{e(G)-e(G^{*})}{e(G)}= divide start_ARG italic_e ( italic_G ) - italic_e ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_e ( italic_G ) end_ARG
=12e(G)vV(G~)(degG(v)degG*(v))absent12𝑒𝐺subscript𝑣𝑉~𝐺subscriptdeg𝐺𝑣subscriptdegsuperscript𝐺𝑣\displaystyle=\frac{1}{2e(G)}\sum_{v\in V(\tilde{G})}({\operatorname{deg}_{G}(% v)}-\operatorname{deg}_{G^{*}}(v))= divide start_ARG 1 end_ARG start_ARG 2 italic_e ( italic_G ) end_ARG ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V ( over~ start_ARG italic_G end_ARG ) end_POSTSUBSCRIPT ( roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ) - roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_v ) )
12e(G)vV(G~)|degG(v)degG*(v)|.absent12𝑒𝐺subscript𝑣𝑉~𝐺subscriptdeg𝐺𝑣subscriptdegsuperscript𝐺𝑣\displaystyle\leq\frac{1}{2e(G)}\sum_{v\in V(\tilde{G})}|{\operatorname{deg}_{% G}(v)}-\operatorname{deg}_{G^{*}}(v)|.≤ divide start_ARG 1 end_ARG start_ARG 2 italic_e ( italic_G ) end_ARG ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V ( over~ start_ARG italic_G end_ARG ) end_POSTSUBSCRIPT | roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ) - roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_v ) | .

Introducing this in (3.3) together with (3.9) and using Lemma 3.4 and that e(G)n/2𝑒𝐺𝑛2e(G)\geq n/2italic_e ( italic_G ) ≥ italic_n / 2, we conclude that

dTV(πG,πG*)vV(G~)|degG(v)degG*(v)2e(G)|degG(U)e(G)exp((logn)1/12).subscript𝑑TVsubscript𝜋𝐺subscript𝜋superscript𝐺subscript𝑣𝑉~𝐺subscriptdeg𝐺𝑣subscriptdegsuperscript𝐺𝑣2𝑒𝐺subscriptdeg𝐺𝑈𝑒𝐺superscript𝑛112d_{\mathrm{TV}}(\pi_{G},\pi_{G^{*}})\leq\sum_{v\in V(\tilde{G})}\left|\frac{% \operatorname{deg}_{G}(v)-\operatorname{deg}_{G^{*}}(v)}{2e(G)}\right|\leq% \frac{\operatorname{deg}_{G}(U)}{e(G)}\leq\exp(-(\log n)^{1/12}).\qeditalic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) ≤ ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V ( over~ start_ARG italic_G end_ARG ) end_POSTSUBSCRIPT | divide start_ARG roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ) - roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_v ) end_ARG start_ARG 2 italic_e ( italic_G ) end_ARG | ≤ divide start_ARG roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_U ) end_ARG start_ARG italic_e ( italic_G ) end_ARG ≤ roman_exp ( - ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 12 end_POSTSUPERSCRIPT ) . italic_∎

3.4. Mixing time after contractions

The following result shows the mixing properties of contracted spreader graphs.

Proposition 3.6.

For all D4𝐷4D\geq 4italic_D ≥ 4, 0<α<1/D20𝛼1superscript𝐷20<\alpha<1/D^{2}0 < italic_α < 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and ϵ>0italic-ϵ0\epsilon>0italic_ϵ > 0, there exists a C>0𝐶0C>0italic_C > 0 such that the following holds for all n𝑛nitalic_n sufficiently large. Suppose G𝐺Gitalic_G is an n𝑛nitalic_n-vertex connected (α,D)𝛼𝐷(\alpha,D)( italic_α , italic_D )-spreader graph. Then,

tmix(G*(α,D),ϵ)Clogn.subscript𝑡mixsuperscript𝐺𝛼𝐷italic-ϵ𝐶𝑛t_{\mathrm{mix}}(G^{*}(\alpha,D),\epsilon)\leq C\log n.italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_α , italic_D ) , italic_ϵ ) ≤ italic_C roman_log italic_n .

In order to prove Proposition 3.6, we will rely on the following lemma.

Lemma 3.7.

For all D4𝐷4D\geq 4italic_D ≥ 4 and 0<α<β=1/D20𝛼𝛽1superscript𝐷20<\alpha<\beta=1/D^{2}0 < italic_α < italic_β = 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, the following holds for all n𝑛nitalic_n sufficiently large. Suppose G𝐺Gitalic_G is an n𝑛nitalic_n-vertex connected (α,D)𝛼𝐷(\alpha,D)( italic_α , italic_D )-spreader graph. Then, taking G*=G*(α,D)superscript𝐺superscript𝐺𝛼𝐷G^{*}=G^{*}(\alpha,D)italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_α , italic_D ), for all G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT-connected S*V(G*)superscript𝑆𝑉superscript𝐺S^{*}\subseteq V(G^{*})italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ⊆ italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) such that (logn)1/2/nπG*(S*)1/2superscript𝑛12𝑛subscript𝜋superscript𝐺superscript𝑆12{(\log n)^{1/2}}/{n}\leq\pi_{G^{*}}(S^{*})\leq{1}/{2}( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT / italic_n ≤ italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≤ 1 / 2 we have that ΦG*(S*)α2/8subscriptnormal-Φsuperscript𝐺superscript𝑆superscript𝛼28\Phi_{G^{*}}(S^{*})\geq{\alpha^{2}}/{8}roman_Φ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≥ italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 8.

Proof.

Let UU(α,D)𝑈𝑈𝛼𝐷U\coloneqq U(\alpha,D)italic_U ≔ italic_U ( italic_α , italic_D ) and γα2/4𝛾superscript𝛼24\gamma\coloneqq\alpha^{2}/4italic_γ ≔ italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 4. Recall our definition of f:V(G)V(G*):𝑓𝑉𝐺𝑉superscript𝐺f\colon V(G)\to V(G^{*})italic_f : italic_V ( italic_G ) → italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) from (3.7). The proof will make use of the following claim.

Claim 1.

Any G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT-connected set S*V(G*)superscript𝑆𝑉superscript𝐺S^{*}\subseteq V(G^{*})italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ⊆ italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) with |S*|2superscript𝑆2|S^{*}|\geq 2| italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT | ≥ 2 and such that (logn)1/5|f1(S*)|(1β)nsuperscript𝑛15superscript𝑓1superscript𝑆1𝛽𝑛(\log n)^{1/5}\leq|f^{-1}(S^{*})|\leq(1-\beta)n( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT ≤ | italic_f start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) | ≤ ( 1 - italic_β ) italic_n is γ𝛾\gammaitalic_γ-good in G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT.

Proof.

Take any such S*V(G*)superscript𝑆𝑉superscript𝐺S^{*}\subseteq V(G^{*})italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ⊆ italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) and let Sf1(S*)={vV(G):f(v)S*}𝑆superscript𝑓1superscript𝑆conditional-set𝑣𝑉𝐺𝑓𝑣superscript𝑆S\coloneqq f^{-1}(S^{*})=\{v\in V(G):f(v)\in S^{*}\}italic_S ≔ italic_f start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = { italic_v ∈ italic_V ( italic_G ) : italic_f ( italic_v ) ∈ italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT }. Note that S𝑆Sitalic_S must be G𝐺Gitalic_G-connected. We claim that the bounds on |S|𝑆|S|| italic_S | imply that it must be γ𝛾\gammaitalic_γ-good in G𝐺Gitalic_G. Indeed, if S𝑆Sitalic_S was γ𝛾\gammaitalic_γ-bad in G𝐺Gitalic_G, it would be a G𝐺Gitalic_G-connected subset of U𝑈Uitalic_U (recall (3.1) and (3.2)) and so would be mapped by f𝑓fitalic_f to a single vertex. As f𝑓fitalic_f is surjective and |S*|2superscript𝑆2|S^{*}|\geq 2| italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT | ≥ 2, this is clearly not possible. It also follows from the definition of G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT that eG(S)eG*(S*)subscript𝑒𝐺𝑆subscript𝑒superscript𝐺superscript𝑆e_{G}(S)\geq e_{G^{*}}(S^{*})italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) ≥ italic_e start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) and |G(S)|=|G*(S*)|subscript𝐺𝑆subscriptsuperscript𝐺superscript𝑆|\partial_{G}(S)|=|\partial_{G^{*}}(S^{*})|| ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | = | ∂ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) |. Therefore,

|G*(S*)|degG*(S*)=|G*(S*)|2eG*(S*)+|G*(S*)|=|G(S)|2eG*(S*)+|G(S)||G(S)|2eG(S)+|G(S)|γ,subscriptsuperscript𝐺superscript𝑆subscriptdegsuperscript𝐺superscript𝑆subscriptsuperscript𝐺superscript𝑆2subscript𝑒superscript𝐺superscript𝑆subscriptsuperscript𝐺superscript𝑆subscript𝐺𝑆2subscript𝑒superscript𝐺superscript𝑆subscript𝐺𝑆subscript𝐺𝑆2subscript𝑒𝐺𝑆subscript𝐺𝑆𝛾\frac{|\partial_{G^{*}}(S^{*})|}{\operatorname{deg}_{G^{*}}(S^{*})}=\frac{|% \partial_{G^{*}}(S^{*})|}{2e_{G^{*}}(S^{*})+|\partial_{G^{*}}(S^{*})|}=\frac{|% \partial_{G}(S)|}{2e_{G^{*}}(S^{*})+|\partial_{G}(S)|}\geq\frac{|\partial_{G}(% S)|}{2e_{G}(S)+|\partial_{G}(S)|}\geq\gamma,divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) | end_ARG start_ARG roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) end_ARG = divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) | end_ARG start_ARG 2 italic_e start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) + | ∂ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) | end_ARG = divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG start_ARG 2 italic_e start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) + | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG ≥ divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG start_ARG 2 italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) + | ∂ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) | end_ARG ≥ italic_γ ,

and so S*superscript𝑆S^{*}italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is γ𝛾\gammaitalic_γ-good in G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT. ∎

We will also need the fact that none of the sets from the statement are too large.

Claim 2.

Let S*V(G*)superscript𝑆𝑉superscript𝐺S^{*}\subseteq V(G^{*})italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ⊆ italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) be a G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT-connected set such that πG*(S*)1/2subscript𝜋superscript𝐺superscript𝑆12\pi_{G^{*}}(S^{*})\leq 1/2italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≤ 1 / 2. Then, |S*|(12β)nsuperscript𝑆12𝛽𝑛|S^{*}|\leq(1-2\beta)n| italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT | ≤ ( 1 - 2 italic_β ) italic_n.

Proof.

Assume for a contradiction that there is such a set with |S*|>(12β)nsuperscript𝑆12𝛽𝑛|S^{*}|>(1-2\beta)n| italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT | > ( 1 - 2 italic_β ) italic_n. Letting S¯*V(G*)S*superscript¯𝑆𝑉superscript𝐺superscript𝑆\bar{S}^{*}\coloneqq V(G^{*})\setminus S^{*}over¯ start_ARG italic_S end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ≔ italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ∖ italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, we have that πG*(S*)+πG*(S¯*)=1subscript𝜋superscript𝐺superscript𝑆subscript𝜋superscript𝐺superscript¯𝑆1\pi_{G^{*}}(S^{*})+\pi_{G^{*}}(\bar{S}^{*})=1italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) + italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( over¯ start_ARG italic_S end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = 1 and |G*(S*)|=|G*(S¯*)|subscriptsuperscript𝐺superscript𝑆subscriptsuperscript𝐺superscript¯𝑆|\partial_{G^{*}}(S^{*})|=|\partial_{G^{*}}(\bar{S}^{*})|| ∂ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) | = | ∂ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( over¯ start_ARG italic_S end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) |, implying that

eG*(S¯*)eG*(S*)|S*|1(13β)n>3n/4,subscript𝑒superscript𝐺superscript¯𝑆subscript𝑒superscript𝐺superscript𝑆superscript𝑆113𝛽𝑛3𝑛4e_{G^{*}}(\bar{S}^{*})\geq e_{G^{*}}(S^{*})\geq|S^{*}|-1\geq(1-3\beta)n>3n/4,italic_e start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( over¯ start_ARG italic_S end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≥ italic_e start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≥ | italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT | - 1 ≥ ( 1 - 3 italic_β ) italic_n > 3 italic_n / 4 ,

where we used here that S*superscript𝑆S^{*}italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT-connected and the fact that β=1/D21/16𝛽1superscript𝐷2116\beta=1/D^{2}\leq 1/16italic_β = 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ 1 / 16. Let TV(G)𝑇𝑉𝐺T\subseteq V(G)italic_T ⊆ italic_V ( italic_G ) be some set of size 3βn3𝛽𝑛3\beta n3 italic_β italic_n such that f1(S¯*)Tsuperscript𝑓1superscript¯𝑆𝑇f^{-1}(\bar{S}^{*})\subseteq Titalic_f start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( over¯ start_ARG italic_S end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ⊆ italic_T, noting that this is possible since, by Lemma 3.4, we have |f1(S¯*)||S¯*|+|U|3βnsuperscript𝑓1superscript¯𝑆superscript¯𝑆𝑈3𝛽𝑛|f^{-1}(\bar{S}^{*})|\leq|\bar{S}^{*}|+|U|\leq 3\beta n| italic_f start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( over¯ start_ARG italic_S end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) | ≤ | over¯ start_ARG italic_S end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT | + | italic_U | ≤ 3 italic_β italic_n. Now, by (S3)S3(\mathrm{S}3)( S3 ), we have that eG*(S¯*)eG(T)D|T|3Dβn3n/4subscript𝑒superscript𝐺superscript¯𝑆subscript𝑒𝐺𝑇𝐷𝑇3𝐷𝛽𝑛3𝑛4e_{G^{*}}(\bar{S}^{*})\leq e_{G}(T)\leq D|T|\leq 3D\beta n\leq 3n/4italic_e start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( over¯ start_ARG italic_S end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≤ italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_T ) ≤ italic_D | italic_T | ≤ 3 italic_D italic_β italic_n ≤ 3 italic_n / 4, a contradiction, where we again appealed to the facts that β=1/D2𝛽1superscript𝐷2\beta=1/D^{2}italic_β = 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and D4𝐷4D\geq 4italic_D ≥ 4. ∎

Now suppose that there exists some G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT-connected set S*V(G*)superscript𝑆𝑉superscript𝐺S^{*}\subseteq V(G^{*})italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ⊆ italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) with (logn)1/2/nπG*(S*)1/2superscript𝑛12𝑛subscript𝜋superscript𝐺superscript𝑆12{(\log n)^{1/2}}/{n}\leq\pi_{G^{*}}(S^{*})\leq 1/2( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT / italic_n ≤ italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≤ 1 / 2 and ΦG*(S*)<α2/8subscriptΦsuperscript𝐺superscript𝑆superscript𝛼28\Phi_{G^{*}}(S^{*})<\alpha^{2}/8roman_Φ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) < italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 8. By the bound on the conductance, it follows from (2.6) that S*superscript𝑆S^{*}italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is γ𝛾\gammaitalic_γ-bad in G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT and so, by Lemma 3.2(1)1(1)( 1 ), we have |G*(S*)|2eG*(S*)subscriptsuperscript𝐺superscript𝑆2subscript𝑒superscript𝐺superscript𝑆|\partial_{G^{*}}(S^{*})|\leq 2e_{G^{*}}(S^{*})| ∂ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) | ≤ 2 italic_e start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ). This implies that

4eG*(S*)degG*(S*)4subscript𝑒superscript𝐺superscript𝑆subscriptdegsuperscript𝐺superscript𝑆\displaystyle 4e_{G^{*}}(S^{*})\geq\operatorname{deg}_{G^{*}}(S^{*})4 italic_e start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≥ roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) =πG*(S*)2e(G*)absentsubscript𝜋superscript𝐺superscript𝑆2𝑒superscript𝐺\displaystyle=\pi_{G^{*}}(S^{*})\cdot 2e(G^{*})= italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ⋅ 2 italic_e ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT )
(logn)1/2n2e(G*)absentsuperscript𝑛12𝑛2𝑒superscript𝐺\displaystyle\geq\frac{(\log n)^{1/2}}{n}2e(G^{*})≥ divide start_ARG ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_n end_ARG 2 italic_e ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT )
(logn)1/2(2e(G)degG(U))nabsentsuperscript𝑛122𝑒𝐺subscriptdeg𝐺𝑈𝑛\displaystyle\geq\frac{(\log n)^{1/2}(2e(G)-\operatorname{deg}_{G}(U))}{n}≥ divide start_ARG ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ( 2 italic_e ( italic_G ) - roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_U ) ) end_ARG start_ARG italic_n end_ARG
(logn)1/22,absentsuperscript𝑛122\displaystyle\geq\frac{(\log n)^{1/2}}{2},≥ divide start_ARG ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG , (3.11)

where in the last inequality we used the fact that degG(U)=o(n)subscriptdeg𝐺𝑈𝑜𝑛\operatorname{deg}_{G}(U)=o(n)roman_deg start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_U ) = italic_o ( italic_n ) from Lemma 3.4 and the fact that e(G)n/2𝑒𝐺𝑛2e(G)\geq n/2italic_e ( italic_G ) ≥ italic_n / 2 as G𝐺Gitalic_G is connected.

Observe that, since γ<1𝛾1\gamma<1italic_γ < 1 and G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is connected, no set S*V(G*)superscript𝑆𝑉superscript𝐺S^{*}\subseteq V(G^{*})italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ⊆ italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) with |S*|=1superscript𝑆1|S^{*}|=1| italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT | = 1 can be γ𝛾\gammaitalic_γ-bad, so we must have |S*|2superscript𝑆2|S^{*}|\geq 2| italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT | ≥ 2. Then, Claim 1 implies that Sf1(S*)𝑆superscript𝑓1superscript𝑆S\coloneqq f^{-1}(S^{*})italic_S ≔ italic_f start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) has size |S|<(logn)1/5𝑆superscript𝑛15|S|<(\log n)^{1/5}| italic_S | < ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT or |S|>(1β)n𝑆1𝛽𝑛|S|>(1-\beta)n| italic_S | > ( 1 - italic_β ) italic_n. If |S|>(1β)n𝑆1𝛽𝑛|S|>(1-\beta)n| italic_S | > ( 1 - italic_β ) italic_n, then |S*||S||U|>(12β)nsuperscript𝑆𝑆𝑈12𝛽𝑛|S^{*}|\geq|S|-|U|>(1-2\beta)n| italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT | ≥ | italic_S | - | italic_U | > ( 1 - 2 italic_β ) italic_n (again by Lemma 3.4), and we know this cannot happen by Claim 2, so we must have |S|<(logn)1/5𝑆superscript𝑛15|S|<(\log n)^{1/5}| italic_S | < ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT. As, trivially, any vertex set SV(G)superscript𝑆𝑉𝐺S^{\prime}\subseteq V(G)italic_S start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ⊆ italic_V ( italic_G ) has eG(S)|S|2subscript𝑒𝐺superscript𝑆superscriptsuperscript𝑆2e_{G}(S^{\prime})\leq|S^{\prime}|^{2}italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ≤ | italic_S start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, we have that eG*(S*)eG(S)(logn)2/5subscript𝑒superscript𝐺superscript𝑆subscript𝑒𝐺𝑆superscript𝑛25e_{G^{*}}(S^{*})\leq e_{G}(S)\leq(\log n)^{2/5}italic_e start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≤ italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) ≤ ( roman_log italic_n ) start_POSTSUPERSCRIPT 2 / 5 end_POSTSUPERSCRIPT. This contradicts (3.4). ∎

With this, we can prove Proposition 3.6.

Proof of Proposition 3.6.

Let G*G*(α,D)superscript𝐺superscript𝐺𝛼𝐷G^{*}\coloneqq G^{*}(\alpha,D)italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ≔ italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_α , italic_D ). Observe that (S3)S3(\mathrm{S}3)( S3 ) implies e(G*)e(G)Dn𝑒superscript𝐺𝑒𝐺𝐷𝑛e(G^{*})\leq e(G)\leq Dnitalic_e ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≤ italic_e ( italic_G ) ≤ italic_D italic_n. By Lemma 3.7, for any G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT-connected set S*V(G*)superscript𝑆𝑉superscript𝐺S^{*}\subseteq V(G^{*})italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ⊆ italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) with (logn)1/2/nπG*(S*)1/2superscript𝑛12𝑛subscript𝜋superscript𝐺superscript𝑆12(\log n)^{1/2}/n\leq\pi_{G^{*}}(S^{*})\leq 1/2( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT / italic_n ≤ italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≤ 1 / 2 we have ΦG*(S*)α2/8subscriptΦsuperscript𝐺superscript𝑆superscript𝛼28\Phi_{G^{*}}(S^{*})\geq\alpha^{2}/8roman_Φ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≥ italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 8. Consider now any G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT-connected set S*V(G*)superscript𝑆𝑉superscript𝐺S^{*}\subseteq V(G^{*})italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ⊆ italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) with πG*(S*)(logn)1/2/nsubscript𝜋superscript𝐺superscript𝑆superscript𝑛12𝑛\pi_{G^{*}}(S^{*})\leq(\log n)^{1/2}/nitalic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≤ ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT / italic_n (in particular, S*V(G*)superscript𝑆𝑉superscript𝐺S^{*}\neq V(G^{*})italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ≠ italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT )). The fact that G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is connected together with (2.4) and (2.5) ensures that

ΦG*(S*)|G*(S*)|4e(G*)πG*(S*)14DnπG*(S*).subscriptΦsuperscript𝐺superscript𝑆subscriptsuperscript𝐺superscript𝑆4𝑒superscript𝐺subscript𝜋superscript𝐺superscript𝑆14𝐷𝑛subscript𝜋superscript𝐺superscript𝑆\Phi_{G^{*}}(S^{*})\geq\frac{|\partial_{G^{*}}(S^{*})|}{4e(G^{*})\cdot\pi_{G^{% *}}(S^{*})}\geq\frac{1}{4Dn\pi_{G^{*}}(S^{*})}.roman_Φ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≥ divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) | end_ARG start_ARG 4 italic_e ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ⋅ italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) end_ARG ≥ divide start_ARG 1 end_ARG start_ARG 4 italic_D italic_n italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) end_ARG .

Let J𝐽Jitalic_J denote the set of indices j1𝑗1j\geq 1italic_j ≥ 1 such that 2j(logn)1/2/nsuperscript2𝑗superscript𝑛12𝑛2^{-j}\leq(\log n)^{1/2}/n2 start_POSTSUPERSCRIPT - italic_j end_POSTSUPERSCRIPT ≤ ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT / italic_n, and note that min(J)logn𝐽𝑛\min(J)\leq\log nroman_min ( italic_J ) ≤ roman_log italic_n. It then follows that

j=1log2πmin(G*)1ΦG*2(2j)superscriptsubscript𝑗1subscript2subscript𝜋superscriptsuperscript𝐺1superscriptsubscriptΦsuperscript𝐺2superscript2𝑗\displaystyle\sum_{j=1}^{\lceil\log_{2}\pi_{\min}(G^{*})^{-1}\rceil}\Phi_{G^{*% }}^{-2}(2^{-j})∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⌈ roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ⌉ end_POSTSUPERSCRIPT roman_Φ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT ( 2 start_POSTSUPERSCRIPT - italic_j end_POSTSUPERSCRIPT ) logn64α4+jJ22j(4Dn)2absent𝑛64superscript𝛼4subscript𝑗𝐽superscript22𝑗superscript4𝐷𝑛2\displaystyle\leq\log n\cdot\frac{64}{\alpha^{4}}+\sum_{j\in J}2^{-2j}(4Dn)^{2}≤ roman_log italic_n ⋅ divide start_ARG 64 end_ARG start_ARG italic_α start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT end_ARG + ∑ start_POSTSUBSCRIPT italic_j ∈ italic_J end_POSTSUBSCRIPT 2 start_POSTSUPERSCRIPT - 2 italic_j end_POSTSUPERSCRIPT ( 4 italic_D italic_n ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
logn64α4+2maxjJ{22j}16D2n2(64α4+32D2)logn,absent𝑛64superscript𝛼42subscript𝑗𝐽superscript22𝑗16superscript𝐷2superscript𝑛264superscript𝛼432superscript𝐷2𝑛\displaystyle\leq\log n\cdot\frac{64}{\alpha^{4}}+2\max_{j\in J}\{2^{-2j}\}% \cdot 16D^{2}n^{2}\leq\left(\frac{64}{\alpha^{4}}+32D^{2}\right)\log n,≤ roman_log italic_n ⋅ divide start_ARG 64 end_ARG start_ARG italic_α start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT end_ARG + 2 roman_max start_POSTSUBSCRIPT italic_j ∈ italic_J end_POSTSUBSCRIPT { 2 start_POSTSUPERSCRIPT - 2 italic_j end_POSTSUPERSCRIPT } ⋅ 16 italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ ( divide start_ARG 64 end_ARG start_ARG italic_α start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT end_ARG + 32 italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) roman_log italic_n ,

where in the last inequality we use the definition of J𝐽Jitalic_J. By Theorem 2.1 we have

tmix(G*)C0(64α4+32D2)logn,subscript𝑡mixsuperscript𝐺subscript𝐶064superscript𝛼432superscript𝐷2𝑛t_{\textrm{mix}}(G^{*})\leq C_{0}\left(\frac{64}{\alpha^{4}}+32D^{2}\right)% \log n,italic_t start_POSTSUBSCRIPT mix end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≤ italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( divide start_ARG 64 end_ARG start_ARG italic_α start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT end_ARG + 32 italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) roman_log italic_n ,

where C0subscript𝐶0C_{0}italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is some absolute constant. Since the total variation distance decreases exponentially fast after the mixing time (see, e.g., [33, section 4.5]), we get

tmix(G*,ϵ)C0(64α4+32D2)log2(1/ϵ)logn,subscript𝑡mixsuperscript𝐺italic-ϵsubscript𝐶064superscript𝛼432superscript𝐷2subscript21italic-ϵ𝑛t_{\textrm{mix}}(G^{*},\epsilon)\leq C_{0}\left(\frac{64}{\alpha^{4}}+32D^{2}% \right)\lceil\log_{2}(1/\epsilon)\rceil\log n,italic_t start_POSTSUBSCRIPT mix end_POSTSUBSCRIPT ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_ϵ ) ≤ italic_C start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( divide start_ARG 64 end_ARG start_ARG italic_α start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT end_ARG + 32 italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ⌈ roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( 1 / italic_ϵ ) ⌉ roman_log italic_n ,

and the proposition holds by taking C𝐶Citalic_C appropriately. ∎

3.5. Hitting time of bad vertices

Let D4𝐷4D\geq 4italic_D ≥ 4 and 0<α<1/D20𝛼1superscript𝐷20<\alpha<1/D^{2}0 < italic_α < 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and consider a connected (α,D)𝛼𝐷(\alpha,D)( italic_α , italic_D )-spreader graph G𝐺Gitalic_G. We now wish to study the hitting time to the set of bad vertices U(α,D)𝑈𝛼𝐷U(\alpha,D)italic_U ( italic_α , italic_D ) in G𝐺Gitalic_G and show that a.a.s. it is not too small.

Lemma 3.8.

Let D4𝐷4D\geq 4italic_D ≥ 4 and 0<α<1/D20𝛼1superscript𝐷20<\alpha<1/D^{2}0 < italic_α < 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Let G𝐺Gitalic_G be an n𝑛nitalic_n-vertex connected (α,D)𝛼𝐷(\alpha,D)( italic_α , italic_D )-spreader graph, and let U=U(α,D)𝑈𝑈𝛼𝐷U=U(\alpha,D)italic_U = italic_U ( italic_α , italic_D ). Then,

1nvV(G)U[τG(μ0v,U)(logn)2]=o(1).1𝑛subscript𝑣𝑉𝐺𝑈delimited-[]subscript𝜏𝐺superscriptsubscript𝜇0𝑣𝑈superscript𝑛2𝑜1\frac{1}{n}\sum_{v\in V(G)\setminus U}\mathbb{P}[\tau_{G}(\mu_{0}^{v},U)\leq(% \log n)^{2}]=o(1).divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V ( italic_G ) ∖ italic_U end_POSTSUBSCRIPT blackboard_P [ italic_τ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT , italic_U ) ≤ ( roman_log italic_n ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] = italic_o ( 1 ) .
Proof.

In this proof we will use a new auxiliary multigraph G^^𝐺\hat{G}over^ start_ARG italic_G end_ARG. Let us introduce it here. Consider the multigraph G*=G*(α,D)superscript𝐺superscript𝐺𝛼𝐷G^{*}=G^{*}(\alpha,D)italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_α , italic_D ), and let U*f(U)V(G*)superscript𝑈𝑓𝑈𝑉superscript𝐺U^{*}\coloneqq f(U)\subseteq V(G^{*})italic_U start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ≔ italic_f ( italic_U ) ⊆ italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ). Recall that the definition of U*superscript𝑈U^{*}italic_U start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT implies that it is an independent set in G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT. Now, G^^𝐺\hat{G}over^ start_ARG italic_G end_ARG is obtained from G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT by contracting U*superscript𝑈U^{*}italic_U start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT to a single new vertex u*superscript𝑢u^{*}italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT. The fact that U*superscript𝑈U^{*}italic_U start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is an independent set in G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT guarantees that e(G*)=e(G^)𝑒superscript𝐺𝑒^𝐺e(G^{*})=e(\hat{G})italic_e ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = italic_e ( over^ start_ARG italic_G end_ARG ), and since vertices outside U*superscript𝑈U^{*}italic_U start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT do not see their degree changed by this operation, it follows that πG^(v)=πG*(v)subscript𝜋^𝐺𝑣subscript𝜋superscript𝐺𝑣\pi_{\hat{G}}(v)=\pi_{G^{*}}(v)italic_π start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( italic_v ) = italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_v ) for all vV(G)U𝑣𝑉𝐺𝑈v\in V(G)\setminus Uitalic_v ∈ italic_V ( italic_G ) ∖ italic_U and πG^(u*)=πG*(U*)subscript𝜋^𝐺superscript𝑢subscript𝜋superscript𝐺superscript𝑈\pi_{\hat{G}}(u^{*})=\pi_{G^{*}}(U^{*})italic_π start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_U start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ).

Let (Xt)t0subscriptsubscript𝑋𝑡𝑡0(X_{t})_{t\geq 0}( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ≥ 0 end_POSTSUBSCRIPT and (X^t)t0subscriptsubscript^𝑋𝑡𝑡0(\hat{X}_{t})_{t\geq 0}( over^ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ≥ 0 end_POSTSUBSCRIPT denote lazy random walks on G𝐺Gitalic_G and G^^𝐺\hat{G}over^ start_ARG italic_G end_ARG starting on some vertex vV(G)U𝑣𝑉𝐺𝑈v\in V(G)\setminus Uitalic_v ∈ italic_V ( italic_G ) ∖ italic_U. Let us denote τUvτG(μ0v,U)subscriptsuperscript𝜏𝑣𝑈subscript𝜏𝐺superscriptsubscript𝜇0𝑣𝑈\tau^{v}_{U}\coloneqq\tau_{G}(\mu_{0}^{v},U)italic_τ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT ≔ italic_τ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT , italic_U ) and τ^u*vτG^(μ0v,u*)subscriptsuperscript^𝜏𝑣superscript𝑢subscript𝜏^𝐺superscriptsubscript𝜇0𝑣superscript𝑢\hat{\tau}^{v}_{u^{*}}\coloneqq\tau_{\hat{G}}(\mu_{0}^{v},u^{*})over^ start_ARG italic_τ end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≔ italic_τ start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT , italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ). Define the natural coupling (Xt,X^t)t0subscriptsubscript𝑋𝑡subscript^𝑋𝑡𝑡0(X_{t},\hat{X}_{t})_{t\geq 0}( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , over^ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ≥ 0 end_POSTSUBSCRIPT as follows: for any t1𝑡1t\geq 1italic_t ≥ 1, while XtUsubscript𝑋𝑡𝑈X_{t}\notin Uitalic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∉ italic_U, let X^t=Xtsubscript^𝑋𝑡subscript𝑋𝑡\hat{X}_{t}=X_{t}over^ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT; if there is a t1𝑡1t\geq 1italic_t ≥ 1 such that XtUsubscript𝑋𝑡𝑈X_{t}\in Uitalic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∈ italic_U, then for the smallest such t𝑡titalic_t we let X^t=u*subscript^𝑋𝑡superscript𝑢\hat{X}_{t}=u^{*}over^ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT; otherwise (that is, for all t>τUv𝑡subscriptsuperscript𝜏𝑣𝑈t>\tau^{v}_{U}italic_t > italic_τ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT), we let Xtsubscript𝑋𝑡X_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and X^tsubscript^𝑋𝑡\hat{X}_{t}over^ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT evolve independently. Observe that this is indeed a valid coupling since for all vV(G)U𝑣𝑉𝐺𝑈v\in V(G)\setminus Uitalic_v ∈ italic_V ( italic_G ) ∖ italic_U we have eG(v,U)=eG^(v,u*)subscript𝑒𝐺𝑣𝑈subscript𝑒^𝐺𝑣superscript𝑢e_{G}(v,U)=e_{\hat{G}}(v,u^{*})italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v , italic_U ) = italic_e start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( italic_v , italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ). With this natural coupling, conditional on τUv>tsubscriptsuperscript𝜏𝑣𝑈𝑡\tau^{v}_{U}>titalic_τ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT > italic_t, we have Xt=X^tsubscript𝑋𝑡subscript^𝑋𝑡X_{t}=\hat{X}_{t}italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = over^ start_ARG italic_X end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and so τUv=τ^u*vsubscriptsuperscript𝜏𝑣𝑈subscriptsuperscript^𝜏𝑣superscript𝑢\tau^{v}_{U}=\hat{\tau}^{v}_{u^{*}}italic_τ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT = over^ start_ARG italic_τ end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT; in particular, for any T00subscript𝑇00T_{0}\geq 0italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≥ 0 we have

[τUv>T0]=[τ^u*v>T0].delimited-[]subscriptsuperscript𝜏𝑣𝑈subscript𝑇0delimited-[]subscriptsuperscript^𝜏𝑣superscript𝑢subscript𝑇0\mathbb{P}[\tau^{v}_{U}>T_{0}]=\mathbb{P}[\hat{\tau}^{v}_{u^{*}}>T_{0}].blackboard_P [ italic_τ start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT > italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ] = blackboard_P [ over^ start_ARG italic_τ end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT > italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ] . (3.12)

We will study the hitting times τ^u*vsubscriptsuperscript^𝜏𝑣superscript𝑢\hat{\tau}^{v}_{u^{*}}over^ start_ARG italic_τ end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT using Theorem 2.2 on G^^𝐺\hat{G}over^ start_ARG italic_G end_ARG with T=T(n)=(logn)6𝑇𝑇𝑛superscript𝑛6T=T(n)=(\log{n})^{6}italic_T = italic_T ( italic_n ) = ( roman_log italic_n ) start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT and c=3𝑐3c=3italic_c = 3. We start with the following claim (which we make no efforts to optimise).

Claim 3.

We have that tmix(G^)(logn)4subscript𝑡normal-mixnormal-^𝐺superscript𝑛4t_{\mathrm{mix}}(\hat{G})\leq(\log n)^{4}italic_t start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( over^ start_ARG italic_G end_ARG ) ≤ ( roman_log italic_n ) start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT.

Proof.

First we prove that, for any G^^𝐺\hat{G}over^ start_ARG italic_G end_ARG-connected set S^V(G^)^𝑆𝑉^𝐺\hat{S}\subseteq V(\hat{G})over^ start_ARG italic_S end_ARG ⊆ italic_V ( over^ start_ARG italic_G end_ARG ) such that πG^(S^)1/2subscript𝜋^𝐺^𝑆12\pi_{\hat{G}}(\hat{S})\leq 1/2italic_π start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( over^ start_ARG italic_S end_ARG ) ≤ 1 / 2, we have that ΦG^(S^)1/(2logn)subscriptΦ^𝐺^𝑆12𝑛\Phi_{\hat{G}}(\hat{S})\geq 1/(2\log n)roman_Φ start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( over^ start_ARG italic_S end_ARG ) ≥ 1 / ( 2 roman_log italic_n ). Indeed, fix such an S^^𝑆\hat{S}over^ start_ARG italic_S end_ARG and define S*V(G*)superscript𝑆𝑉superscript𝐺S^{*}\subseteq V(G^{*})italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ⊆ italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) as

S*={S^if u*S^,(S^{u*})U*if u*S^.superscript𝑆cases^𝑆if superscript𝑢^𝑆^𝑆superscript𝑢superscript𝑈if superscript𝑢^𝑆S^{*}=\begin{cases}\hat{S}&\text{if }u^{*}\notin\hat{S},\\ (\hat{S}\setminus\{u^{*}\})\cup U^{*}&\text{if }u^{*}\in\hat{S}.\end{cases}italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = { start_ROW start_CELL over^ start_ARG italic_S end_ARG end_CELL start_CELL if italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∉ over^ start_ARG italic_S end_ARG , end_CELL end_ROW start_ROW start_CELL ( over^ start_ARG italic_S end_ARG ∖ { italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT } ) ∪ italic_U start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_CELL start_CELL if italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∈ over^ start_ARG italic_S end_ARG . end_CELL end_ROW

Further, let S*=S1Srsuperscript𝑆subscript𝑆1subscript𝑆𝑟S^{*}=S_{1}\cup\ldots\cup S_{r}italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∪ … ∪ italic_S start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT for some r𝑟r\in\mathbb{N}italic_r ∈ blackboard_N be a decomposition of S*superscript𝑆S^{*}italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT into G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT-connected components (note that r=1𝑟1r=1italic_r = 1 if u*S^superscript𝑢^𝑆u^{*}\notin\hat{S}italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∉ over^ start_ARG italic_S end_ARG). Now, as degG^(S^)=degG*(S*)subscriptdeg^𝐺^𝑆subscriptdegsuperscript𝐺superscript𝑆\operatorname{deg}_{\hat{G}}(\hat{S})=\operatorname{deg}_{G^{*}}(S^{*})roman_deg start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( over^ start_ARG italic_S end_ARG ) = roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) and e(G^)=e(G*)𝑒^𝐺𝑒superscript𝐺e(\hat{G})=e(G^{*})italic_e ( over^ start_ARG italic_G end_ARG ) = italic_e ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ), we have that πG*(S*)1/2subscript𝜋superscript𝐺superscript𝑆12\pi_{G^{*}}(S^{*})\leq 1/2italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≤ 1 / 2 and hence π(Si)1/2𝜋subscript𝑆𝑖12\pi(S_{i})\leq 1/2italic_π ( italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ≤ 1 / 2 for all i[r]𝑖delimited-[]𝑟i\in[r]italic_i ∈ [ italic_r ]. Moreover, as U*superscript𝑈U^{*}italic_U start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is an independent set in G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, we have that |G^(S^)|=i=1r|G*(Si)|subscript^𝐺^𝑆superscriptsubscript𝑖1𝑟subscriptsuperscript𝐺subscript𝑆𝑖|\partial_{\hat{G}}(\hat{S})|=\sum_{i=1}^{r}|\partial_{G^{*}}(S_{i})|| ∂ start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( over^ start_ARG italic_S end_ARG ) | = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT | ∂ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) | and degG^(S^)=i=1rdegG*(Si)subscriptdeg^𝐺^𝑆superscriptsubscript𝑖1𝑟subscriptdegsuperscript𝐺subscript𝑆𝑖\operatorname{deg}_{\hat{G}}(\hat{S})=\sum_{i=1}^{r}\operatorname{deg}_{G^{*}}% (S_{i})roman_deg start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( over^ start_ARG italic_S end_ARG ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ).

Returning to analyse the conductance of S^^𝑆\hat{S}over^ start_ARG italic_S end_ARG, from (2.6), we have that

ΦG^(S^)|G^(S^)|2degG^(S^)=i=1r|G*(Si)|2i=1rdegG*(Si)12mini[r]|G*(Si)|degG*(Si)=|G*(Si0)|2degG*(Si0),subscriptΦ^𝐺^𝑆subscript^𝐺^𝑆2subscriptdeg^𝐺^𝑆superscriptsubscript𝑖1𝑟subscriptsuperscript𝐺subscript𝑆𝑖2superscriptsubscript𝑖1𝑟subscriptdegsuperscript𝐺subscript𝑆𝑖12subscript𝑖delimited-[]𝑟subscriptsuperscript𝐺subscript𝑆𝑖subscriptdegsuperscript𝐺subscript𝑆𝑖subscriptsuperscript𝐺subscript𝑆subscript𝑖02subscriptdegsuperscript𝐺subscript𝑆subscript𝑖0\Phi_{\hat{G}}(\hat{S})\geq\frac{|\partial_{\hat{G}}(\hat{S})|}{2\operatorname% {deg}_{\hat{G}}(\hat{S})}=\frac{\sum_{i=1}^{r}|\partial_{G^{*}}(S_{i})|}{2\sum% _{i=1}^{r}\operatorname{deg}_{G^{*}}(S_{i})}\geq\frac{1}{2}\min_{i\in[r]}\frac% {|\partial_{G^{*}}(S_{i})|}{\operatorname{deg}_{G^{*}}(S_{i})}=\frac{|\partial% _{G^{*}}(S_{i_{0}})|}{2\operatorname{deg}_{G^{*}}(S_{i_{0}})},roman_Φ start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( over^ start_ARG italic_S end_ARG ) ≥ divide start_ARG | ∂ start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( over^ start_ARG italic_S end_ARG ) | end_ARG start_ARG 2 roman_deg start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( over^ start_ARG italic_S end_ARG ) end_ARG = divide start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT | ∂ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) | end_ARG start_ARG 2 ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG ≥ divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_min start_POSTSUBSCRIPT italic_i ∈ [ italic_r ] end_POSTSUBSCRIPT divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) | end_ARG start_ARG roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG = divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) | end_ARG start_ARG 2 roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG , (3.13)

letting i0[r]subscript𝑖0delimited-[]𝑟i_{0}\in[r]italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ [ italic_r ] be a minimising index. If degG*(Si0)lognsubscriptdegsuperscript𝐺subscript𝑆subscript𝑖0𝑛\operatorname{deg}_{G^{*}}(S_{i_{0}})\leq\log nroman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ≤ roman_log italic_n, then we are done due to the fact that |G*(Si0)|1subscriptsuperscript𝐺subscript𝑆subscript𝑖01|\partial_{G^{*}}(S_{i_{0}})|\geq 1| ∂ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) | ≥ 1 as G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is connected. If degG*(Si0)>lognsubscriptdegsuperscript𝐺subscript𝑆subscript𝑖0𝑛\operatorname{deg}_{G^{*}}(S_{i_{0}})>\log nroman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) > roman_log italic_n, then πG*(Si0)=degG*(Si0)/(2e(G*))(logn)1/2/nsubscript𝜋superscript𝐺subscript𝑆subscript𝑖0subscriptdegsuperscript𝐺subscript𝑆subscript𝑖02𝑒superscript𝐺superscript𝑛12𝑛\pi_{G^{*}}(S_{i_{0}})=\operatorname{deg}_{G^{*}}(S_{i_{0}})/(2e(G^{*}))\geq(% \log n)^{1/2}/nitalic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) / ( 2 italic_e ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ) ≥ ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT / italic_n, using that e(G*)e(G)Dn𝑒superscript𝐺𝑒𝐺𝐷𝑛e(G^{*})\leq e(G)\leq Dnitalic_e ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≤ italic_e ( italic_G ) ≤ italic_D italic_n from (S3)S3(\mathrm{S}3)( S3 ). Therefore, using (2.6) and (3.13), we have that

ΦG^(S^)|G*(Si0)|2degG*(Si0)=ΦG*(Si0)degG*(V(G*)Si0)2e(G*)ΦG*(Si0)2α216,subscriptΦ^𝐺^𝑆subscriptsuperscript𝐺subscript𝑆subscript𝑖02subscriptdegsuperscript𝐺subscript𝑆subscript𝑖0subscriptΦsuperscript𝐺subscript𝑆subscript𝑖0subscriptdegsuperscript𝐺𝑉superscript𝐺subscript𝑆subscript𝑖02𝑒superscript𝐺subscriptΦsuperscript𝐺subscript𝑆subscript𝑖02superscript𝛼216\Phi_{\hat{G}}(\hat{S})\geq\frac{|\partial_{G^{*}}(S_{i_{0}})|}{2\operatorname% {deg}_{G^{*}}(S_{i_{0}})}=\frac{\Phi_{G^{*}}(S_{i_{0}})\operatorname{deg}_{G^{% *}}(V(G^{*})\setminus S_{i_{0}})}{2e(G^{*})}\geq\frac{\Phi_{G^{*}}(S_{i_{0}})}% {2}\geq\frac{\alpha^{2}}{16},roman_Φ start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( over^ start_ARG italic_S end_ARG ) ≥ divide start_ARG | ∂ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) | end_ARG start_ARG 2 roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG = divide start_ARG roman_Φ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) roman_deg start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ∖ italic_S start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG start_ARG 2 italic_e ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) end_ARG ≥ divide start_ARG roman_Φ start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG start_ARG 2 end_ARG ≥ divide start_ARG italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 16 end_ARG ,

where we used Lemma 3.7 in last inequality and the fact that πG*(Si0)=1πG*(V(G*)Si0)1/2subscript𝜋superscript𝐺subscript𝑆subscript𝑖01subscript𝜋superscript𝐺𝑉superscript𝐺subscript𝑆subscript𝑖012\pi_{G^{*}}(S_{i_{0}})=1-\pi_{G^{*}}(V(G^{*})\setminus S_{i_{0}})\leq 1/2italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_S start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = 1 - italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_V ( italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ∖ italic_S start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ≤ 1 / 2 in the penultimate inequality.

So we have established that ΦG^(S^)1/(2logn)subscriptΦ^𝐺^𝑆12𝑛\Phi_{\hat{G}}(\hat{S})\geq 1/(2\log n)roman_Φ start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( over^ start_ARG italic_S end_ARG ) ≥ 1 / ( 2 roman_log italic_n ) for all G^^𝐺\hat{G}over^ start_ARG italic_G end_ARG-connected sets S^^𝑆\hat{S}over^ start_ARG italic_S end_ARG with πG^(S^)1/2subscript𝜋^𝐺^𝑆12\pi_{\hat{G}}(\hat{S})\leq 1/2italic_π start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( over^ start_ARG italic_S end_ARG ) ≤ 1 / 2. Now notice that πmin(G^)1/(2e(G^))1/(2Dn)subscript𝜋^𝐺12𝑒^𝐺12𝐷𝑛\pi_{\min}(\hat{G})\geq 1/(2e(\hat{G}))\geq 1/(2Dn)italic_π start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( over^ start_ARG italic_G end_ARG ) ≥ 1 / ( 2 italic_e ( over^ start_ARG italic_G end_ARG ) ) ≥ 1 / ( 2 italic_D italic_n ) due to (S3)S3(\mathrm{S}3)( S3 ), and hence, when applying Theorem 2.1, there are logarithmically many terms in the sum. This establishes the desired upper bound on tmix(G^)subscript𝑡𝑚𝑖𝑥^𝐺t_{mix}(\hat{G})italic_t start_POSTSUBSCRIPT italic_m italic_i italic_x end_POSTSUBSCRIPT ( over^ start_ARG italic_G end_ARG ). ∎

Using (1.1) and Claim 3 and as the total variation distance decreases exponentially fast after the mixing time (see, e.g., [33, section 4.5]), we have

maxx,yV(G^)|μ0xPG^T(y)πG^(y)|2maxxV(G^)dTV(μ0xPG^T,πG^)=o(n3)subscript𝑥𝑦𝑉^𝐺superscriptsubscript𝜇0𝑥superscriptsubscript𝑃^𝐺𝑇𝑦subscript𝜋^𝐺𝑦2subscript𝑥𝑉^𝐺subscript𝑑TVsuperscriptsubscript𝜇0𝑥subscriptsuperscript𝑃𝑇^𝐺subscript𝜋^𝐺𝑜superscript𝑛3\max_{x,y\in V(\hat{G})}|\mu_{0}^{x}P_{\hat{G}}^{T}(y)-\pi_{\hat{G}}(y)|\leq 2% \max_{x\in V(\hat{G})}d_{\mathrm{TV}}(\mu_{0}^{x}P^{T}_{\hat{G}},\pi_{\hat{G}}% )=o(n^{-3})roman_max start_POSTSUBSCRIPT italic_x , italic_y ∈ italic_V ( over^ start_ARG italic_G end_ARG ) end_POSTSUBSCRIPT | italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_P start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_y ) - italic_π start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( italic_y ) | ≤ 2 roman_max start_POSTSUBSCRIPT italic_x ∈ italic_V ( over^ start_ARG italic_G end_ARG ) end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ) = italic_o ( italic_n start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT )

and (HP1)HP1(\mathrm{HP}1)( HP1 ) is satisfied.

Let us now prove that πG^(u*)subscript𝜋^𝐺superscript𝑢\pi_{\hat{G}}(u^{*})italic_π start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) is small. By Remark 2.3, to prove our statement it suffices to have (HP2)superscriptHP2(\mathrm{HP}2^{\prime})( HP2 start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) for u=u*𝑢superscript𝑢u=u^{*}italic_u = italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT. By Lemma 3.4 (3.5), πG(U)=o((logn)6)subscript𝜋𝐺𝑈𝑜superscript𝑛6\pi_{G}(U)=o((\log n)^{-6})italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_U ) = italic_o ( ( roman_log italic_n ) start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT ). Moreover, as mentioned earlier, πG^(u*)=πG*(U*)subscript𝜋^𝐺superscript𝑢subscript𝜋superscript𝐺superscript𝑈\pi_{\hat{G}}(u^{*})=\pi_{G^{*}}(U^{*})italic_π start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_U start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ). By Lemma 3.5, we have that

πG^(u*)=πG(U)+O(dTV(πG,πG*))=o((logn)6).subscript𝜋^𝐺superscript𝑢subscript𝜋𝐺𝑈𝑂subscript𝑑TVsubscript𝜋𝐺subscript𝜋superscript𝐺𝑜superscript𝑛6\displaystyle\pi_{\hat{G}}(u^{*})=\pi_{G}(U)+O(d_{\mathrm{TV}}(\pi_{G},\pi_{G^% {*}}))=o((\log n)^{-6}).italic_π start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_U ) + italic_O ( italic_d start_POSTSUBSCRIPT roman_TV end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) ) = italic_o ( ( roman_log italic_n ) start_POSTSUPERSCRIPT - 6 end_POSTSUPERSCRIPT ) . (3.14)

It follows that TπG^(u*)=o(1)𝑇subscript𝜋^𝐺superscript𝑢𝑜1T\cdot\pi_{\hat{G}}(u^{*})=o(1)italic_T ⋅ italic_π start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = italic_o ( 1 ) and (HP2)superscriptHP2(\mathrm{HP}2^{\prime})( HP2 start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) holds.

Finally, since G𝐺Gitalic_G is a connected (α,D)𝛼𝐷(\alpha,D)( italic_α , italic_D )-spreader graph with D𝐷Ditalic_D fixed and α<1𝛼1\alpha<1italic_α < 1, G^^𝐺\hat{G}over^ start_ARG italic_G end_ARG is a connected multigraph with e(G^)e(G)Dn𝑒^𝐺𝑒𝐺𝐷𝑛e(\hat{G})\leq e(G)\leq Dnitalic_e ( over^ start_ARG italic_G end_ARG ) ≤ italic_e ( italic_G ) ≤ italic_D italic_n, by (S3)S3(\mathrm{S}3)( S3 ). It follows by (2.2) that πmin(G^)1/2e(G^)=ω(n2)subscript𝜋^𝐺12𝑒^𝐺𝜔superscript𝑛2\pi_{\min}(\hat{G})\geq 1/2e(\hat{G})=\omega(n^{-2})italic_π start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( over^ start_ARG italic_G end_ARG ) ≥ 1 / 2 italic_e ( over^ start_ARG italic_G end_ARG ) = italic_ω ( italic_n start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT ) and (HP3)HP3(\mathrm{HP}3)( HP3 ) is satisfied.

Now, recalling the relevant definitions from Theorem 2.2 and letting T0(logn)2subscript𝑇0superscript𝑛2T_{0}\coloneqq\lceil(\log{n})^{2}\rceilitalic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≔ ⌈ ( roman_log italic_n ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⌉, as n𝑛nitalic_n goes to infinity we have

λu*T0=(1(1+o(1))πG^(u*)RT(u*))T01(1+o(1))T0πG^(u*)RT(u*)=1o(1),superscriptsubscript𝜆superscript𝑢subscript𝑇0superscript11𝑜1subscript𝜋^𝐺superscript𝑢subscript𝑅𝑇superscript𝑢subscript𝑇011𝑜1subscript𝑇0subscript𝜋^𝐺superscript𝑢subscript𝑅𝑇superscript𝑢1𝑜1\lambda_{u^{*}}^{T_{0}}=\left(1-(1+o(1))\frac{\pi_{\hat{G}}(u^{*})}{R_{T}(u^{*% })}\right)^{T_{0}}\geq 1-(1+o(1))\frac{T_{0}\pi_{\hat{G}}(u^{*})}{R_{T}(u^{*})% }=1-o(1),italic_λ start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT = ( 1 - ( 1 + italic_o ( 1 ) ) divide start_ARG italic_π start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_R start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ( italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) end_ARG ) start_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ≥ 1 - ( 1 + italic_o ( 1 ) ) divide start_ARG italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT over^ start_ARG italic_G end_ARG end_POSTSUBSCRIPT ( italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_R start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ( italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) end_ARG = 1 - italic_o ( 1 ) ,

where we used (1x)T01T0xsuperscript1𝑥subscript𝑇01subscript𝑇0𝑥(1-x)^{T_{0}}\geq 1-T_{0}x( 1 - italic_x ) start_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ≥ 1 - italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_x, (3.14) and RT(u*)1subscript𝑅𝑇superscript𝑢1R_{T}(u^{*})\geq 1italic_R start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ( italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≥ 1. Therefore, appealing to Remark 2.4, we have that

1nvV(G)U[τ^u*vT0]1𝑛subscript𝑣𝑉𝐺𝑈delimited-[]subscriptsuperscript^𝜏𝑣superscript𝑢subscript𝑇0\displaystyle\frac{1}{n}\sum_{v\in V(G)\setminus U}\mathbb{P}[\hat{\tau}^{v}_{% u^{*}}\leq T_{0}]divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V ( italic_G ) ∖ italic_U end_POSTSUBSCRIPT blackboard_P [ over^ start_ARG italic_τ end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≤ italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ] 1|V(G^)|vV(G)U[τ^u*vT0]absent1𝑉^𝐺subscript𝑣𝑉𝐺𝑈delimited-[]subscriptsuperscript^𝜏𝑣superscript𝑢subscript𝑇0\displaystyle\leq\frac{1}{|V(\hat{G})|}\sum_{v\in V(G)\setminus U}\mathbb{P}[% \hat{\tau}^{v}_{u^{*}}\leq T_{0}]≤ divide start_ARG 1 end_ARG start_ARG | italic_V ( over^ start_ARG italic_G end_ARG ) | end_ARG ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V ( italic_G ) ∖ italic_U end_POSTSUBSCRIPT blackboard_P [ over^ start_ARG italic_τ end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≤ italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ]
1|V(G^)|vV(G^)[τ^u*vT0]=o(1).absent1𝑉^𝐺subscript𝑣𝑉^𝐺delimited-[]subscriptsuperscript^𝜏𝑣superscript𝑢subscript𝑇0𝑜1\displaystyle\leq\frac{1}{|V(\hat{G})|}\sum_{v\in V(\hat{G})}\mathbb{P}[\hat{% \tau}^{v}_{u^{*}}\leq T_{0}]=o(1).≤ divide start_ARG 1 end_ARG start_ARG | italic_V ( over^ start_ARG italic_G end_ARG ) | end_ARG ∑ start_POSTSUBSCRIPT italic_v ∈ italic_V ( over^ start_ARG italic_G end_ARG ) end_POSTSUBSCRIPT blackboard_P [ over^ start_ARG italic_τ end_ARG start_POSTSUPERSCRIPT italic_v end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_u start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≤ italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ] = italic_o ( 1 ) .

Together with (3.12), this completes the proof of the lemma. ∎

3.6. Proof of the main theorem

We are finally ready to prove Theorem 1.5.

Proof of Theorem 1.5.

Let U=U(α,D)𝑈𝑈𝛼𝐷U=U(\alpha,D)italic_U = italic_U ( italic_α , italic_D ) and G*=G*(α,D)superscript𝐺superscript𝐺𝛼𝐷G^{*}=G^{*}(\alpha,D)italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_α , italic_D ). Let (Xt)t0subscriptsubscript𝑋𝑡𝑡0(X_{t})_{t\geq 0}( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ≥ 0 end_POSTSUBSCRIPT and (Xt*)t0subscriptsubscriptsuperscript𝑋𝑡𝑡0(X^{*}_{t})_{t\geq 0}( italic_X start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ≥ 0 end_POSTSUBSCRIPT denote lazy random walks on G𝐺Gitalic_G and G*superscript𝐺G^{*}italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, respectively. For xV(G)𝑥𝑉𝐺x\in V(G)italic_x ∈ italic_V ( italic_G ), let τUxτG(μ0x,U)subscriptsuperscript𝜏𝑥𝑈subscript𝜏𝐺superscriptsubscript𝜇0𝑥𝑈\tau^{x}_{U}\coloneqq\tau_{G}(\mu_{0}^{x},U)italic_τ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT ≔ italic_τ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT , italic_U ). Similarly as in the proof of Lemma 3.8, we consider a natural coupling (Xt,Xt*)t0subscriptsubscript𝑋𝑡subscriptsuperscript𝑋𝑡𝑡0(X_{t},X^{*}_{t})_{t\geq 0}( italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_X start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_t ≥ 0 end_POSTSUBSCRIPT of the random walks so that for t<τU𝑡subscript𝜏𝑈t<\tau_{U}italic_t < italic_τ start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT we let Xt*=Xtsubscriptsuperscript𝑋𝑡subscript𝑋𝑡X^{*}_{t}=X_{t}italic_X start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and otherwise we let the walks evolve independently.

First, observe that, by Lemma 3.5 and the triangle inequality (and adopting the abuse of notation introduced in Section 3.3), for any xV(G)𝑥𝑉𝐺x\in V(G)italic_x ∈ italic_V ( italic_G ) we have

dTV(μ0xPGt,πG)dTV(μ0xPGt,πG*)+dTV(πG,πG*)=dTV(μ0xPGt,πG*)+o(1).subscript𝑑TVsuperscriptsubscript𝜇0𝑥subscriptsuperscript𝑃𝑡𝐺subscript𝜋𝐺subscript𝑑TVsuperscriptsubscript𝜇0𝑥subscriptsuperscript𝑃𝑡𝐺subscript𝜋superscript𝐺subscript𝑑TVsubscript𝜋𝐺subscript𝜋superscript𝐺subscript𝑑TVsuperscriptsubscript𝜇0𝑥subscriptsuperscript𝑃𝑡𝐺subscript𝜋superscript𝐺𝑜1\displaystyle d_{\textrm{TV}}(\mu_{0}^{x}P^{t}_{G},\pi_{G})\leq d_{\textrm{TV}% }(\mu_{0}^{x}P^{t}_{G},\pi_{G^{*}})+d_{\textrm{TV}}(\pi_{G},\pi_{G^{*}})=d_{% \textrm{TV}}(\mu_{0}^{x}P^{t}_{G},\pi_{G^{*}})+o(1).italic_d start_POSTSUBSCRIPT TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) ≤ italic_d start_POSTSUBSCRIPT TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) + italic_d start_POSTSUBSCRIPT TV end_POSTSUBSCRIPT ( italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) = italic_d start_POSTSUBSCRIPT TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) + italic_o ( 1 ) . (3.15)

For every xV(G)U𝑥𝑉𝐺𝑈x\in V(G)\setminus Uitalic_x ∈ italic_V ( italic_G ) ∖ italic_U and yV(G)𝑦𝑉𝐺y\in V(G)italic_y ∈ italic_V ( italic_G ), we can write

μ0xPGt(y)superscriptsubscript𝜇0𝑥subscriptsuperscript𝑃𝑡𝐺𝑦\displaystyle\mu_{0}^{x}P^{t}_{G}(y)italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_y ) =[Xt=yX0=x]absentdelimited-[]subscript𝑋𝑡conditional𝑦subscript𝑋0𝑥\displaystyle=\mathbb{P}[X_{t}=y\mid X_{0}=x]= blackboard_P [ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_y ∣ italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_x ]
=[Xt=y,τU>tX0=x]+[Xt=y,τUtX0=x]absentdelimited-[]formulae-sequencesubscript𝑋𝑡𝑦subscript𝜏𝑈conditional𝑡subscript𝑋0𝑥delimited-[]formulae-sequencesubscript𝑋𝑡𝑦subscript𝜏𝑈conditional𝑡subscript𝑋0𝑥\displaystyle=\mathbb{P}[X_{t}=y,\tau_{U}>t\mid X_{0}=x]+\mathbb{P}[X_{t}=y,% \tau_{U}\leq t\mid X_{0}=x]= blackboard_P [ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_y , italic_τ start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT > italic_t ∣ italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_x ] + blackboard_P [ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_y , italic_τ start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT ≤ italic_t ∣ italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_x ]
=[Xt*=y,τU>tX0=x]+[Xt=y,τUtX0=x]absentdelimited-[]formulae-sequencesubscriptsuperscript𝑋𝑡𝑦subscript𝜏𝑈conditional𝑡subscript𝑋0𝑥delimited-[]formulae-sequencesubscript𝑋𝑡𝑦subscript𝜏𝑈conditional𝑡subscript𝑋0𝑥\displaystyle=\mathbb{P}[X^{*}_{t}=y,\tau_{U}>t\mid X_{0}=x]+\mathbb{P}[X_{t}=% y,\tau_{U}\leq t\mid X_{0}=x]= blackboard_P [ italic_X start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_y , italic_τ start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT > italic_t ∣ italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_x ] + blackboard_P [ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_y , italic_τ start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT ≤ italic_t ∣ italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_x ]
μ0xPG*t(y)+[Xt=y,τUtX0=x],absentsuperscriptsubscript𝜇0𝑥subscriptsuperscript𝑃𝑡superscript𝐺𝑦delimited-[]formulae-sequencesubscript𝑋𝑡𝑦subscript𝜏𝑈conditional𝑡subscript𝑋0𝑥\displaystyle\leq\mu_{0}^{x}P^{t}_{G^{*}}(y)+\mathbb{P}[X_{t}=y,\tau_{U}\leq t% \mid X_{0}=x],≤ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_y ) + blackboard_P [ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_y , italic_τ start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT ≤ italic_t ∣ italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_x ] ,

where we let μ0xPG*t(y)=0superscriptsubscript𝜇0𝑥subscriptsuperscript𝑃𝑡superscript𝐺𝑦0\mu_{0}^{x}P^{t}_{G^{*}}(y)=0italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_y ) = 0 if yU𝑦𝑈y\in Uitalic_y ∈ italic_U. Let A{yV(G):μ0xPGt(y)πG*(y)}𝐴conditional-set𝑦𝑉𝐺superscriptsubscript𝜇0𝑥subscriptsuperscript𝑃𝑡𝐺𝑦subscript𝜋superscript𝐺𝑦A\coloneqq\{y\in V(G):\mu_{0}^{x}P^{t}_{G}(y)\geq\pi_{G^{*}}(y)\}italic_A ≔ { italic_y ∈ italic_V ( italic_G ) : italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_y ) ≥ italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_y ) }. It follows that

dTV(μ0xPGt,πG*)subscript𝑑TVsuperscriptsubscript𝜇0𝑥subscriptsuperscript𝑃𝑡𝐺subscript𝜋superscript𝐺\displaystyle d_{\textrm{TV}}(\mu_{0}^{x}P^{t}_{G},\pi_{G^{*}})italic_d start_POSTSUBSCRIPT TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) =yA(μ0xPGt(y)πG*(y))absentsubscript𝑦𝐴superscriptsubscript𝜇0𝑥subscriptsuperscript𝑃𝑡𝐺𝑦subscript𝜋superscript𝐺𝑦\displaystyle=\sum_{y\in A}(\mu_{0}^{x}P^{t}_{G}(y)-\pi_{G^{*}}(y))= ∑ start_POSTSUBSCRIPT italic_y ∈ italic_A end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_y ) - italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_y ) )
yA(μ0xPG*t(y)πG*(y))+yA[Xt=y,τUtX0=x]absentsubscript𝑦𝐴superscriptsubscript𝜇0𝑥subscriptsuperscript𝑃𝑡superscript𝐺𝑦subscript𝜋superscript𝐺𝑦subscript𝑦𝐴delimited-[]formulae-sequencesubscript𝑋𝑡𝑦subscript𝜏𝑈conditional𝑡subscript𝑋0𝑥\displaystyle\leq\sum_{y\in A}(\mu_{0}^{x}P^{t}_{G^{*}}(y)-\pi_{G^{*}}(y))+% \sum_{y\in A}\mathbb{P}[X_{t}=y,\tau_{U}\leq t\mid X_{0}=x]≤ ∑ start_POSTSUBSCRIPT italic_y ∈ italic_A end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_y ) - italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_y ) ) + ∑ start_POSTSUBSCRIPT italic_y ∈ italic_A end_POSTSUBSCRIPT blackboard_P [ italic_X start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_y , italic_τ start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT ≤ italic_t ∣ italic_X start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_x ]
dTV(μ0xPG*t,πG*)+[τUxt].absentsubscript𝑑TVsuperscriptsubscript𝜇0𝑥subscriptsuperscript𝑃𝑡superscript𝐺subscript𝜋superscript𝐺delimited-[]subscriptsuperscript𝜏𝑥𝑈𝑡\displaystyle\leq d_{\textrm{TV}}(\mu_{0}^{x}P^{t}_{G^{*}},\pi_{G^{*}})+% \mathbb{P}[\tau^{x}_{U}\leq t].≤ italic_d start_POSTSUBSCRIPT TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) + blackboard_P [ italic_τ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT ≤ italic_t ] . (3.16)

Let C>0𝐶0C>0italic_C > 0 be the constant given by Proposition 3.6 with ϵ/2italic-ϵ2\epsilon/2italic_ϵ / 2 playing the role of ϵitalic-ϵ\epsilonitalic_ϵ, and let T0Clognsubscript𝑇0𝐶𝑛T_{0}\coloneqq\lceil C\log{n}\rceilitalic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≔ ⌈ italic_C roman_log italic_n ⌉. By Proposition 3.6, we have that dTV(μ0xPG*T0,πG*)ϵ/2subscript𝑑TVsuperscriptsubscript𝜇0𝑥subscriptsuperscript𝑃subscript𝑇0superscript𝐺subscript𝜋superscript𝐺italic-ϵ2d_{\textrm{TV}}(\mu_{0}^{x}P^{T_{0}}_{G^{*}},\pi_{G^{*}})\leq\epsilon/2italic_d start_POSTSUBSCRIPT TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT italic_G start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) ≤ italic_ϵ / 2. Combining these bounds with (3.15) and (3.6), we obtain for all xV(G)U𝑥𝑉𝐺𝑈x\in V(G)\setminus Uitalic_x ∈ italic_V ( italic_G ) ∖ italic_U that

dTV(μ0xPGT0,πG)ϵ/2+[τUxT0]+o(1).subscript𝑑TVsuperscriptsubscript𝜇0𝑥subscriptsuperscript𝑃subscript𝑇0𝐺subscript𝜋𝐺italic-ϵ2delimited-[]subscriptsuperscript𝜏𝑥𝑈subscript𝑇0𝑜1\displaystyle d_{\textrm{TV}}(\mu_{0}^{x}P^{T_{0}}_{G},\pi_{G})\leq\epsilon/2+% \mathbb{P}[\tau^{x}_{U}\leq T_{0}]+o(1).italic_d start_POSTSUBSCRIPT TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) ≤ italic_ϵ / 2 + blackboard_P [ italic_τ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT ≤ italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ] + italic_o ( 1 ) .

By Lemma 3.4 (3.3) and Lemma 3.8, we conclude that

1nxV(G)dTV(μ0xPGT0,πG)|U|n+n|U|n(ϵ2+o(1))+1nxV(G)U[τUxT0]ϵ1𝑛subscript𝑥𝑉𝐺subscript𝑑TVsuperscriptsubscript𝜇0𝑥subscriptsuperscript𝑃subscript𝑇0𝐺subscript𝜋𝐺𝑈𝑛𝑛𝑈𝑛italic-ϵ2𝑜11𝑛subscript𝑥𝑉𝐺𝑈delimited-[]subscriptsuperscript𝜏𝑥𝑈subscript𝑇0italic-ϵ\displaystyle\frac{1}{n}\sum_{x\in V(G)}d_{\textrm{TV}}(\mu_{0}^{x}P^{T_{0}}_{% G},\pi_{G})\leq\frac{|U|}{n}+\frac{n-|U|}{n}\left(\frac{\epsilon}{2}+o(1)% \right)+\frac{1}{n}\sum_{x\in V(G)\setminus U}\mathbb{P}[\tau^{x}_{U}\leq T_{0% }]\leq\epsilondivide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_x ∈ italic_V ( italic_G ) end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT TV end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_P start_POSTSUPERSCRIPT italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT , italic_π start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ) ≤ divide start_ARG | italic_U | end_ARG start_ARG italic_n end_ARG + divide start_ARG italic_n - | italic_U | end_ARG start_ARG italic_n end_ARG ( divide start_ARG italic_ϵ end_ARG start_ARG 2 end_ARG + italic_o ( 1 ) ) + divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_x ∈ italic_V ( italic_G ) ∖ italic_U end_POSTSUBSCRIPT blackboard_P [ italic_τ start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT ≤ italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ] ≤ italic_ϵ

and t¯mix(G,ϵ)T0subscript¯𝑡mix𝐺italic-ϵsubscript𝑇0\bar{t}_{\mathrm{mix}}(G,\epsilon)\leq T_{0}over¯ start_ARG italic_t end_ARG start_POSTSUBSCRIPT roman_mix end_POSTSUBSCRIPT ( italic_G , italic_ϵ ) ≤ italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, concluding the proof. ∎

4. Smoothed analysis on connected graphs

We next want to show applications of Theorem 1.5. We use this section to prove Theorem 1.6.

Proof of Theorem 1.6.

Let D2(Δ+1+δ)𝐷2Δ1𝛿D\coloneqq 2(\Delta+1+\delta)italic_D ≔ 2 ( roman_Δ + 1 + italic_δ ) and 0<α<δ/D20𝛼𝛿superscript𝐷20<\alpha<\delta/D^{2}0 < italic_α < italic_δ / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT be a sufficiently small constant. If Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is a connected (α,D)𝛼𝐷(\alpha,D)( italic_α , italic_D )-spreader graph, the statement follows from Theorem 1.5. Since Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is connected by assumption, it suffices to show that a.a.s. Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is an (α,D)𝛼𝐷(\alpha,D)( italic_α , italic_D )-spreader graph.

We need to verify that (S1)S1(\mathrm{S}1)( S1 ), (S2)S2(\mathrm{S}2)( S2 ) and (S3)S3(\mathrm{S}3)( S3 ) hold a.a.s. in G=GRsuperscript𝐺𝐺𝑅G^{\prime}=G\cup Ritalic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_G ∪ italic_R. The fact that (S2)S2(\mathrm{S}2)( S2 ) and (S3)S3(\mathrm{S}3)( S3 ) hold a.a.s. follows directly from the fact that G𝐺Gitalic_G is ΔΔ\Deltaroman_Δ-degenerate and that e(R[S])<max{2,2δ}|S|𝑒𝑅delimited-[]𝑆22𝛿𝑆e(R[S])<\max\{2,2\delta\}|S|italic_e ( italic_R [ italic_S ] ) < roman_max { 2 , 2 italic_δ } | italic_S | for all SV(R)𝑆𝑉𝑅S\subseteq V(R)italic_S ⊆ italic_V ( italic_R ) (see, for example, [31, Lemma 8]). The fact that property (S1)S1(\mathrm{S}1)( S1 ) holds a.a.s. in Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT follows from the proof of [31, Theorem 3]. Indeed, for each (logn)1/5k(11/D2)nsuperscript𝑛15𝑘11superscript𝐷2𝑛(\log n)^{1/5}\leq k\leq(1-1/D^{2})n( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT ≤ italic_k ≤ ( 1 - 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) italic_n, let Xksubscript𝑋𝑘X_{k}italic_X start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT denote the number of Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT-connected α𝛼\alphaitalic_α-thin sets SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) with |S|=k𝑆𝑘|S|=k| italic_S | = italic_k. Then (see [31, eq. (4)] and the claim immediately after), one can check that the expected number of such sets satisfies

𝔼[Xk]m=1αkb=mαknmexp(Cαlog(1/α)k)(δn)m1[Bin(k(nk),δn)<αk],𝔼delimited-[]subscript𝑋𝑘superscriptsubscript𝑚1𝛼𝑘superscriptsubscript𝑏𝑚𝛼𝑘superscript𝑛𝑚𝐶𝛼1𝛼𝑘superscript𝛿𝑛𝑚1delimited-[]Bin𝑘𝑛𝑘𝛿𝑛𝛼𝑘\mathbb{E}[X_{k}]\leq\sum_{m=1}^{\alpha k}\sum_{b=m}^{\alpha k}n^{m}\exp\left(% C\alpha\log(1/\alpha)k\right)\left(\frac{\delta}{n}\right)^{m-1}\mathbb{P}% \left[\mathrm{Bin}\left(k(n-k),\frac{\delta}{n}\right)<\alpha k\right],blackboard_E [ italic_X start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ] ≤ ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α italic_k end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_b = italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α italic_k end_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT roman_exp ( italic_C italic_α roman_log ( 1 / italic_α ) italic_k ) ( divide start_ARG italic_δ end_ARG start_ARG italic_n end_ARG ) start_POSTSUPERSCRIPT italic_m - 1 end_POSTSUPERSCRIPT blackboard_P [ roman_Bin ( italic_k ( italic_n - italic_k ) , divide start_ARG italic_δ end_ARG start_ARG italic_n end_ARG ) < italic_α italic_k ] ,

where C𝐶Citalic_C is some absolute constant. Following again the proof in [31] and choosing α𝛼\alphaitalic_α sufficiently small and n𝑛nitalic_n sufficiently large, one concludes that

𝔼[Xk]k2nexp((Cαlog(1/α)δ8D2)k)nexp(δ20D2k).𝔼delimited-[]subscript𝑋𝑘superscript𝑘2𝑛𝐶𝛼1𝛼𝛿8superscript𝐷2𝑘𝑛𝛿20superscript𝐷2𝑘\mathbb{E}[X_{k}]\leq k^{2}n\exp\left(\left(C\alpha\log(1/\alpha)-\frac{\delta% }{8D^{2}}\right)k\right)\leq n\exp\left(-\frac{\delta}{20D^{2}}k\right).blackboard_E [ italic_X start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ] ≤ italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n roman_exp ( ( italic_C italic_α roman_log ( 1 / italic_α ) - divide start_ARG italic_δ end_ARG start_ARG 8 italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) italic_k ) ≤ italic_n roman_exp ( - divide start_ARG italic_δ end_ARG start_ARG 20 italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG italic_k ) .

Property (S1)S1(\mathrm{S}1)( S1 ) then follows by Markov’s inequality and a union bound over all (logn)1/5k(11/D2)nsuperscript𝑛15𝑘11superscript𝐷2𝑛(\log n)^{1/5}\leq k\leq(1-1/D^{2})n( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT ≤ italic_k ≤ ( 1 - 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) italic_n. ∎

5. Random subgraphs of expanders

In order to prove Theorem 1.9, we will rely on several known properties of the giant component of a random subgraph of an (n,d,λ)𝑛𝑑𝜆(n,d,\lambda)( italic_n , italic_d , italic_λ )-graph. Recall from Section 2.1 that when we refer to asymptotic statements holding in an (n,d,λ)𝑛𝑑𝜆(n,d,\lambda)( italic_n , italic_d , italic_λ )-graph, implicitly what is meant is that the statement holds for any sequence (Gn)n1subscriptsubscript𝐺𝑛𝑛1(G_{n})_{n\geq 1}( italic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ≥ 1 end_POSTSUBSCRIPT of (n,d,λ)𝑛𝑑𝜆(n,d,\lambda)( italic_n , italic_d , italic_λ )-graphs that satisfy the stated condition.

Lemma 5.1.

Let δ>0𝛿0\delta>0italic_δ > 0 be a sufficiently small constant and let G𝐺Gitalic_G be an (n,d,λ)𝑛𝑑𝜆(n,d,\lambda)( italic_n , italic_d , italic_λ )-graph with λδ4d𝜆superscript𝛿4𝑑\lambda\leq\delta^{4}ditalic_λ ≤ italic_δ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_d. Let p=(1+δ)/d𝑝1𝛿𝑑p=(1+\delta)/ditalic_p = ( 1 + italic_δ ) / italic_d and let L1subscript𝐿1L_{1}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT be a largest component in Gpsubscript𝐺𝑝G_{p}italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT. Then, a.a.s. the following properties hold:

  1. (a)

    L1subscript𝐿1L_{1}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT has (1+o(1))(2δ+g(δ))n1𝑜12𝛿𝑔𝛿𝑛(1+o(1))(2\delta+g(\delta))n( 1 + italic_o ( 1 ) ) ( 2 italic_δ + italic_g ( italic_δ ) ) italic_n vertices, where g(δ)=o(δ)𝑔𝛿𝑜𝛿g(\delta)=o(\delta)italic_g ( italic_δ ) = italic_o ( italic_δ ) as δ𝛿\deltaitalic_δ tends to 00.

  2. (b)

    There exists some absolute constant c>0𝑐0c>0italic_c > 0 such that, for any Gpsubscript𝐺𝑝G_{p}italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT-connected SV(L1)𝑆𝑉subscript𝐿1S\subseteq V(L_{1})italic_S ⊆ italic_V ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) with 16logn/δ2|S|δ2n/5016𝑛superscript𝛿2𝑆superscript𝛿2𝑛5016\log n/\delta^{2}\leq|S|\leq\delta^{2}n/5016 roman_log italic_n / italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ | italic_S | ≤ italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n / 50, we have that

    |Gp(S)|cδ2|S|log(1/δ).subscriptsubscript𝐺𝑝𝑆𝑐superscript𝛿2𝑆1𝛿|\partial_{G_{p}}(S)|\geq\frac{c\delta^{2}|S|}{\log(1/\delta)}.| ∂ start_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_S ) | ≥ divide start_ARG italic_c italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_S | end_ARG start_ARG roman_log ( 1 / italic_δ ) end_ARG .
  3. (c)

    There exists some absolute constant c>0superscript𝑐0c^{\prime}>0italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT > 0 such that, for any SV(L1)𝑆𝑉subscript𝐿1S\subseteq V(L_{1})italic_S ⊆ italic_V ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) with δ2n/50|S|12δn/11superscript𝛿2𝑛50𝑆12𝛿𝑛11{\delta^{2}n}/{50}\leq|S|\leq{12\delta n}/{11}italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n / 50 ≤ | italic_S | ≤ 12 italic_δ italic_n / 11, we have that

    |Gp(S)|cδ2|S|log(1/δ).subscriptsubscript𝐺𝑝𝑆superscript𝑐superscript𝛿2𝑆1𝛿|\partial_{G_{p}}(S)|\geq\frac{c^{\prime}\delta^{2}|S|}{\log(1/\delta)}.| ∂ start_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_S ) | ≥ divide start_ARG italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_S | end_ARG start_ARG roman_log ( 1 / italic_δ ) end_ARG .
Proof.

Statement (a) follows from [25, Theorem 1]; see also the discussion following [17, Theorem 1.1]. Statement (b) is a consequence of [17, Theorem 1 (1)], while (c) is given in [17, Theorem 2]. ∎

With this, we can prove Theorem 1.9.

Proof of Theorem 1.9.

Let L1L1(Gp)subscript𝐿1subscript𝐿1subscript𝐺𝑝L_{1}\coloneqq L_{1}(G_{p})italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≔ italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ). Fix D12𝐷12D\coloneqq 12italic_D ≔ 12, c0min{c,c}subscript𝑐0𝑐superscript𝑐c_{0}\coloneqq\min\{c,c^{\prime}\}italic_c start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≔ roman_min { italic_c , italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT } (where c𝑐citalic_c and csuperscript𝑐c^{\prime}italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT are the absolute constants from Lemma 5.1(b) and (c)) and αc0δ2/(D2log(1/δ))𝛼subscript𝑐0superscript𝛿2superscript𝐷21𝛿\alpha\coloneqq c_{0}\delta^{2}/(D^{2}\log(1/\delta))italic_α ≔ italic_c start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / ( italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_log ( 1 / italic_δ ) ). By Theorem 1.5, it suffices to show that a.a.s. L1subscript𝐿1L_{1}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is an (α,D)𝛼𝐷(\alpha,D)( italic_α , italic_D )-spreader graph. That is, we need to show that (for sufficiently small δ>0𝛿0\delta>0italic_δ > 0) a.a.s. L1subscript𝐿1L_{1}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT satisfies properties (S1)S1(\mathrm{S}1)( S1 )(S3)S3(\mathrm{S}3)( S3 ).

We are first going to show that (S1)S1(\mathrm{S}1)( S1 ) and (S2)S2(\mathrm{S}2)( S2 ) hold a.a.s. in Gpsubscript𝐺𝑝G_{p}italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT, rather than L1subscript𝐿1L_{1}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, for sets of size at least (logn)1/6superscript𝑛16(\log n)^{1/6}( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 6 end_POSTSUPERSCRIPT. Similarly as happened in the proof of Theorem 1.6, in this case properties (S1)S1(\mathrm{S}1)( S1 ) (for |S|δ2n/50𝑆superscript𝛿2𝑛50|S|\leq\delta^{2}n/50| italic_S | ≤ italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n / 50) and (S2)S2(\mathrm{S}2)( S2 ) can be obtained by following the proofs of [17, Theorem 1 (1)] and [17, Lemma 2.4], respectively. Let us give here a brief sketch. Let us first consider (S1)S1(\mathrm{S}1)( S1 ) (for |S|δ2n/50𝑆superscript𝛿2𝑛50|S|\leq\delta^{2}n/50| italic_S | ≤ italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n / 50). For each (logn)1/6kδ2n/50superscript𝑛16𝑘superscript𝛿2𝑛50(\log n)^{1/6}\leq k\leq\delta^{2}n/50( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 6 end_POSTSUPERSCRIPT ≤ italic_k ≤ italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n / 50, let Xksubscript𝑋𝑘X_{k}italic_X start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT denote the number of Gpsubscript𝐺𝑝G_{p}italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT-connected sets SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) with |S|=k𝑆𝑘|S|=k| italic_S | = italic_k which are α𝛼\alphaitalic_α-thin. Then, following [17, Theorem 1 (1)], we have 𝔼[Xk]3nexp(δ2k/8)𝔼delimited-[]subscript𝑋𝑘3𝑛superscript𝛿2𝑘8\mathbb{E}[X_{k}]\leq 3n\exp(-\delta^{2}k/8)blackboard_E [ italic_X start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ] ≤ 3 italic_n roman_exp ( - italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_k / 8 ). By Markov’s inequality and a union bound over all (logn)1/6kδ2n/50superscript𝑛16𝑘superscript𝛿2𝑛50(\log n)^{1/6}\leq k\leq\delta^{2}n/50( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 6 end_POSTSUPERSCRIPT ≤ italic_k ≤ italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n / 50, we conclude that a.a.s.

  1. (d)

    for all (logn)1/6kδ2n/50superscript𝑛16𝑘superscript𝛿2𝑛50(\log n)^{1/6}\leq k\leq\delta^{2}n/50( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 6 end_POSTSUPERSCRIPT ≤ italic_k ≤ italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n / 50, the number of Gpsubscript𝐺𝑝G_{p}italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT-connected α𝛼\alphaitalic_α-thin sets SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) with |S|=k𝑆𝑘|S|=k| italic_S | = italic_k is less than nek𝑛superscripte𝑘n\mathrm{e}^{-\sqrt{k}}italic_n roman_e start_POSTSUPERSCRIPT - square-root start_ARG italic_k end_ARG end_POSTSUPERSCRIPT.

Similarly, for each k(logn)1/6𝑘superscript𝑛16k\geq(\log n)^{1/6}italic_k ≥ ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 6 end_POSTSUPERSCRIPT, one can check from the proof of [17, Lemma 2.4] that, if we let Yksubscript𝑌𝑘Y_{k}italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT denote the number of Gpsubscript𝐺𝑝G_{p}italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT-connected sets SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) with |S|=k𝑆𝑘|S|=k| italic_S | = italic_k such that eGp(S)10ksubscript𝑒subscript𝐺𝑝𝑆10𝑘e_{G_{p}}(S)\geq 10kitalic_e start_POSTSUBSCRIPT italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_S ) ≥ 10 italic_k, then 𝔼[Yk]nexp(2k)𝔼delimited-[]subscript𝑌𝑘𝑛2𝑘\mathbb{E}[Y_{k}]\leq n\exp(-2k)blackboard_E [ italic_Y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ] ≤ italic_n roman_exp ( - 2 italic_k ). Again by Markov’s inequality and a union bound over all k(logn)1/6𝑘superscript𝑛16k\geq(\log n)^{1/6}italic_k ≥ ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 6 end_POSTSUPERSCRIPT, we conclude that a.a.s.

  1. (e)

    for all (logn)1/6knsuperscript𝑛16𝑘𝑛(\log n)^{1/6}\leq k\leq n( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 6 end_POSTSUPERSCRIPT ≤ italic_k ≤ italic_n, the number of Gpsubscript𝐺𝑝G_{p}italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT-connected α1superscript𝛼1\alpha^{-1}italic_α start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT-loaded sets SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) with |S|=k𝑆𝑘|S|=k| italic_S | = italic_k is less than nek𝑛superscripte𝑘n\mathrm{e}^{-\sqrt{k}}italic_n roman_e start_POSTSUPERSCRIPT - square-root start_ARG italic_k end_ARG end_POSTSUPERSCRIPT.

We next work towards property (S3)S3(\mathrm{S}3)( S3 ). We are going to show that no set SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) with |S|δαn𝑆𝛿𝛼𝑛|S|\geq\delta\alpha n| italic_S | ≥ italic_δ italic_α italic_n is D𝐷Ditalic_D-loaded in Gpsubscript𝐺𝑝G_{p}italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT. Fix some δαnkn𝛿𝛼𝑛𝑘𝑛\delta\alpha n\leq k\leq nitalic_δ italic_α italic_n ≤ italic_k ≤ italic_n and let ηk/n𝜂𝑘𝑛\eta\coloneqq k/nitalic_η ≔ italic_k / italic_n. If δ>0𝛿0\delta>0italic_δ > 0 is sufficiently small, we have that δ4<αδηsuperscript𝛿4𝛼𝛿𝜂\delta^{4}<\alpha\delta\leq\etaitalic_δ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT < italic_α italic_δ ≤ italic_η. Now, by the expander mixing lemma (see, for example, [17, Lemma 2.1]), for any set SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) with |S|=k=ηn𝑆𝑘𝜂𝑛|S|=k=\eta n| italic_S | = italic_k = italic_η italic_n we have that

eG(S)dk2n+λk(η+δ4)dk2ηdk,subscript𝑒𝐺𝑆𝑑superscript𝑘2𝑛𝜆𝑘𝜂superscript𝛿4𝑑𝑘2𝜂𝑑𝑘e_{G}(S)\leq\frac{dk^{2}}{n}+\lambda k\leq(\eta+\delta^{4})dk\leq 2\eta dk,italic_e start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_S ) ≤ divide start_ARG italic_d italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_n end_ARG + italic_λ italic_k ≤ ( italic_η + italic_δ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ) italic_d italic_k ≤ 2 italic_η italic_d italic_k ,

using that λδ4d𝜆superscript𝛿4𝑑\lambda\leq\delta^{4}ditalic_λ ≤ italic_δ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT italic_d. Hence, the probability that S𝑆Sitalic_S contains Dk𝐷𝑘Dkitalic_D italic_k edges in Gpsubscript𝐺𝑝G_{p}italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT is at most

(2ηdkDk)pDk(2ηdeD)Dk(1+δd)Dk(6ηD)Dk.FRACOP2𝜂𝑑𝑘𝐷𝑘superscript𝑝𝐷𝑘superscript2𝜂𝑑e𝐷𝐷𝑘superscript1𝛿𝑑𝐷𝑘superscript6𝜂𝐷𝐷𝑘\left(\kern-1.00006pt\genfrac{}{}{0.0pt}{}{2\eta dk}{Dk}\kern-1.00006pt\right)% p^{Dk}\leq\left(\frac{2\eta d\mathrm{e}}{D}\right)^{Dk}\left(\frac{1+\delta}{d% }\right)^{Dk}\leq\left(\frac{6\eta}{D}\right)^{Dk}.( FRACOP start_ARG 2 italic_η italic_d italic_k end_ARG start_ARG italic_D italic_k end_ARG ) italic_p start_POSTSUPERSCRIPT italic_D italic_k end_POSTSUPERSCRIPT ≤ ( divide start_ARG 2 italic_η italic_d roman_e end_ARG start_ARG italic_D end_ARG ) start_POSTSUPERSCRIPT italic_D italic_k end_POSTSUPERSCRIPT ( divide start_ARG 1 + italic_δ end_ARG start_ARG italic_d end_ARG ) start_POSTSUPERSCRIPT italic_D italic_k end_POSTSUPERSCRIPT ≤ ( divide start_ARG 6 italic_η end_ARG start_ARG italic_D end_ARG ) start_POSTSUPERSCRIPT italic_D italic_k end_POSTSUPERSCRIPT .

Taking a union bound over all possible sets of size k𝑘kitalic_k, we have that the probability of there existing a D𝐷Ditalic_D-loaded set of size k𝑘kitalic_k in Gpsubscript𝐺𝑝G_{p}italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT is at most

(nk)(6ηD)Dk(nek)k(6ηD)Dk=(eη(6ηD)D)k=(eηD1(12)D)k(12)αδn.FRACOP𝑛𝑘superscript6𝜂𝐷𝐷𝑘superscript𝑛e𝑘𝑘superscript6𝜂𝐷𝐷𝑘superscripte𝜂superscript6𝜂𝐷𝐷𝑘superscriptesuperscript𝜂𝐷1superscript12𝐷𝑘superscript12𝛼𝛿𝑛\left(\kern-1.00006pt\genfrac{}{}{0.0pt}{}{n}{k}\kern-1.00006pt\right)\left(% \frac{6\eta}{D}\right)^{Dk}\leq\left(\frac{n\mathrm{e}}{k}\right)^{k}\left(% \frac{6\eta}{D}\right)^{Dk}=\left(\frac{\mathrm{e}}{\eta}\left(\frac{6\eta}{D}% \right)^{D}\right)^{k}=\left(\mathrm{e}\eta^{D-1}\left(\frac{1}{2}\right)^{D}% \right)^{k}\leq\left(\frac{1}{2}\right)^{\alpha\delta n}.( FRACOP start_ARG italic_n end_ARG start_ARG italic_k end_ARG ) ( divide start_ARG 6 italic_η end_ARG start_ARG italic_D end_ARG ) start_POSTSUPERSCRIPT italic_D italic_k end_POSTSUPERSCRIPT ≤ ( divide start_ARG italic_n roman_e end_ARG start_ARG italic_k end_ARG ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ( divide start_ARG 6 italic_η end_ARG start_ARG italic_D end_ARG ) start_POSTSUPERSCRIPT italic_D italic_k end_POSTSUPERSCRIPT = ( divide start_ARG roman_e end_ARG start_ARG italic_η end_ARG ( divide start_ARG 6 italic_η end_ARG start_ARG italic_D end_ARG ) start_POSTSUPERSCRIPT italic_D end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT = ( roman_e italic_η start_POSTSUPERSCRIPT italic_D - 1 end_POSTSUPERSCRIPT ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG ) start_POSTSUPERSCRIPT italic_D end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT ≤ ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG ) start_POSTSUPERSCRIPT italic_α italic_δ italic_n end_POSTSUPERSCRIPT .

By a union bound over all αδnkn𝛼𝛿𝑛𝑘𝑛\alpha\delta n\leq k\leq nitalic_α italic_δ italic_n ≤ italic_k ≤ italic_n we conclude that a.a.s.

  1. (f)

    no set SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) with |S|δαn𝑆𝛿𝛼𝑛|S|\geq\delta\alpha n| italic_S | ≥ italic_δ italic_α italic_n is D𝐷Ditalic_D-loaded in Gpsubscript𝐺𝑝G_{p}italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT.

Condition on the event that Lemma 5.1(a) and (c) as well as (d), (e) and (f) hold in Gpsubscript𝐺𝑝G_{p}italic_G start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT, which a.a.s. occurs. Let n|V(L1)|=(1+o(1))(2δ+g(δ))nsuperscript𝑛𝑉subscript𝐿11𝑜12𝛿𝑔𝛿𝑛n^{\prime}\coloneqq|V(L_{1})|=(1+o(1))(2\delta+g(\delta))nitalic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≔ | italic_V ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) | = ( 1 + italic_o ( 1 ) ) ( 2 italic_δ + italic_g ( italic_δ ) ) italic_n by (a). It follows that (logn)1/5(logn)1/6superscriptsuperscript𝑛15superscript𝑛16(\log n^{\prime})^{1/5}\geq(\log n)^{1/6}( roman_log italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT ≥ ( roman_log italic_n ) start_POSTSUPERSCRIPT 1 / 6 end_POSTSUPERSCRIPT and so, by (d), (S1)S1(\mathrm{S}1)( S1 ) holds in L1subscript𝐿1L_{1}italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT for sets of size at most δ2n/50superscript𝛿2𝑛50\delta^{2}n/50italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n / 50; similarly, (S2)S2(\mathrm{S}2)( S2 ) holds by (e), and (S3)S3(\mathrm{S}3)( S3 ) holds by (f). Thus, it only remains to establish (S1)S1(\mathrm{S}1)( S1 ) for sets SV(L1)𝑆𝑉subscript𝐿1S\subseteq V(L_{1})italic_S ⊆ italic_V ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) such that δ2n/50|S|(11/D2)nsuperscript𝛿2𝑛50𝑆11superscript𝐷2superscript𝑛\delta^{2}n/50\leq|S|\leq(1-1/D^{2})n^{\prime}italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n / 50 ≤ | italic_S | ≤ ( 1 - 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. For δ2n/50|S|12δn/11superscript𝛿2𝑛50𝑆12𝛿𝑛11\delta^{2}n/50\leq|S|\leq 12\delta n/11italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n / 50 ≤ | italic_S | ≤ 12 italic_δ italic_n / 11, this is immediate from (c), and we can also use (c) for larger sets S𝑆Sitalic_S. Indeed, suppose 12δn/11|S|(11/D2)n12𝛿𝑛11𝑆11superscript𝐷2superscript𝑛{12\delta n}/{11}\leq|S|\leq(1-1/D^{2})n^{\prime}12 italic_δ italic_n / 11 ≤ | italic_S | ≤ ( 1 - 1 / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT and let S¯V(L1)S¯𝑆𝑉subscript𝐿1𝑆\bar{S}\coloneqq V(L_{1})\setminus Sover¯ start_ARG italic_S end_ARG ≔ italic_V ( italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ∖ italic_S. It follows from (a), by taking a sufficiently small δ𝛿\deltaitalic_δ, that

|S¯|=|L1||S|n1211δn<n21211δn,¯𝑆subscript𝐿1𝑆superscript𝑛1211𝛿𝑛superscript𝑛21211𝛿𝑛|\bar{S}|=|L_{1}|-|S|\leq n^{\prime}-\frac{12}{11}\delta n<\frac{n^{\prime}}{2% }\leq\frac{12}{11}\delta n,| over¯ start_ARG italic_S end_ARG | = | italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | - | italic_S | ≤ italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - divide start_ARG 12 end_ARG start_ARG 11 end_ARG italic_δ italic_n < divide start_ARG italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG ≤ divide start_ARG 12 end_ARG start_ARG 11 end_ARG italic_δ italic_n ,

where in the last two inequalities we use that δ2δ+g(δ)𝛿2𝛿𝑔𝛿\frac{\delta}{2\delta+g(\delta)}divide start_ARG italic_δ end_ARG start_ARG 2 italic_δ + italic_g ( italic_δ ) end_ARG tends to 1/2121/21 / 2 as δ𝛿\deltaitalic_δ tends to 00 due to the fact that g(δ)=o(δ)𝑔𝛿𝑜𝛿g(\delta)=o(\delta)italic_g ( italic_δ ) = italic_o ( italic_δ ). We also have that |S¯|n/D2|S|/D2δ2n/50¯𝑆superscript𝑛superscript𝐷2𝑆superscript𝐷2superscript𝛿2𝑛50|\bar{S}|\geq n^{\prime}/D^{2}\geq|S|/D^{2}\geq\delta^{2}n/50| over¯ start_ARG italic_S end_ARG | ≥ italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≥ | italic_S | / italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≥ italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n / 50, using also here that δ𝛿\deltaitalic_δ is sufficiently small. Hence, we can apply (c) to S¯¯𝑆\bar{S}over¯ start_ARG italic_S end_ARG and we obtain that

|L1(S)|=|L1(S¯)|cδ2|S¯|log(1/δ)cδ2|S|D2log(1/δ)α|S|.subscriptsubscript𝐿1𝑆subscriptsubscript𝐿1¯𝑆superscript𝑐superscript𝛿2¯𝑆1𝛿superscript𝑐superscript𝛿2𝑆superscript𝐷21𝛿𝛼𝑆|\partial_{L_{1}}(S)|=|\partial_{L_{1}}(\bar{S})|\geq\frac{c^{\prime}\delta^{2% }|\bar{S}|}{\log(1/\delta)}\geq\frac{c^{\prime}\delta^{2}|S|}{D^{2}\log(1/% \delta)}\geq\alpha|S|.\qed| ∂ start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_S ) | = | ∂ start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( over¯ start_ARG italic_S end_ARG ) | ≥ divide start_ARG italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | over¯ start_ARG italic_S end_ARG | end_ARG start_ARG roman_log ( 1 / italic_δ ) end_ARG ≥ divide start_ARG italic_c start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_S | end_ARG start_ARG italic_D start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_log ( 1 / italic_δ ) end_ARG ≥ italic_α | italic_S | . italic_∎

6. Open problems

Theorem 1.5 is only effective on graphs where the mixing is slowed down by few small bottlenecks. This is the case in the two applications presented. Nevertheless, there are other cases where both small and large bottlenecks exist. It would be interesting to study average-case mixing times in such scenarios and determine which improvement with respect to the worst-case can be attained. One such example is the small-world model of Kleinberg [30], whose mixing time has been studied in [20].

In recent years, the theory of random walks in random directed graphs has attracted a considerable amount of attention. As in the case of random regular graphs, under mild conditions on the bidegree sequence, the mixing time is logarithmic [9, 12]. From the point of view of smoothed analysis, a natural question is whether randomly perturbing a deterministic strongly connected digraph can yield logarithmic mixing time. Conductance-based bounds such as Jerrum-Sinclair and Fountoulakis-Reed are not valid in the non-reversible setting, which requires new ideas. Finally, we mention an analogous result to the mixing time in randomly perturbed connected graphs by Krivelevich, Reichman and Samotij [31], for graphs perturbed by a random perfect matching, has been obtained by Hermon, Sly and Sousi [27]. Considering such a model in the directed setting would also be interesting.

Acknowledgements. The authors would like to thank Matteo Quattropani for fruitful discussions on the First Visit Time Lemma (FVTL). They would also like to thank the anonymous referees for their insightful comments, in particular for spotting a misuse of the FVTL and for pointing out the non-contractivity of the average mixing time (see Remark 1.2).

References

  • Addario-Berry and Lei [2015] L. Addario-Berry and T. Lei, The mixing time of the Newman-Watts small-world model. Adv. Appl. Probab. 47.1 (2015), 37–56, doi: 10.1239/aap/1427814580.
  • Ajtai, Komlós and Szemerédi [1982] M. Ajtai, J. Komlós and E. Szemerédi, Largest random component of a k𝑘kitalic_k-cube. Combinatorica 2 (1982), 1–7, doi: 10.1007/BF02579276.
  • Arthur, Manthey and Röglin [2011] D. Arthur, B. Manthey and H. Röglin, Smoothed analysis of the k𝑘kitalic_k-means method. J. ACM 58.5 (2011), No. 19, doi: 10.1145/2027216.2027217.
  • Ben-Hamou and Salez [2017] A. Ben-Hamou and J. Salez, Cutoff for nonbacktracking random walks on sparse random graphs. Ann. Probab. 45.3 (2017), 1752–1770, doi: 10.1214/16-AOP1100.
  • Benjamini, Kozma and Wormald [2014] I. Benjamini, G. Kozma and N. Wormald, The mixing time of the giant component of a random graph. Random Struct. Algorithms 45.3 (2014), 383–407, doi: 10.1002/rsa.20539.
  • Berestycki, Lubetzky, Peres and Sly [2018] N. Berestycki, E. Lubetzky, Y. Peres and A. Sly, Random walks on the random graph. Ann. Probab. 46.1 (2018), 456–490, doi: 10.1214/17-AOP1189.
  • Bhaskara, Charikar, Moitra and Vijayaraghavan [2014] A. Bhaskara, M. Charikar, A. Moitra and A. Vijayaraghavan, Smoothed analysis of tensor decompositions. Proceedings of the Forty-Sixth Annual ACM Symposium on Theory of Computing, STOC ’14, Association for Computing Machinery, New York, NY, USA (2014) 594–603, doi: 10.1145/2591796.2591881.
  • Bohman, Frieze and Martin [2003] T. Bohman, A. Frieze and R. Martin, How many random edges make a dense graph Hamiltonian? Random Struct. Algorithms 22.1 (2003), 33–42, doi: 10.1002/rsa.10070.
  • Bordenave, Caputo and Salez [2018] C. Bordenave, P. Caputo and J. Salez, Random walk on sparse random digraphs. Probab. Theory Relat. Fields 170.3 (2018), 933–960, doi: 10.1007/s00440-017-0796-7.
  • Böttcher, Han, Kohayakawa, Montgomery, Parczyk and Person [2019] J. Böttcher, J. Han, Y. Kohayakawa, R. Montgomery, O. Parczyk and Y. Person, Universality for bounded degree spanning trees in randomly perturbed graphs. Random Struct. Algorithms 55.4 (2019), 854–864, doi: 10.1002/rsa.20850.
  • Böttcher, Montgomery, Parczyk and Person [2020] J. Böttcher, R. Montgomery, O. Parczyk and Y. Person, Embedding spanning bounded degree graphs in randomly perturbed graphs. Mathematika 66.2 (2020), 422–447, doi: 10.1112/mtk.12005.
  • Cai, Caputo, Perarnau and Quattropani [2021] X. S. Cai, P. Caputo, G. Perarnau and M. Quattropani, Rankings in directed configuration models with heavy tailed in-degrees. arXiv e-prints (2021). arXiv: 2104.08389.
  • Chung and Lu [2001] F. Chung and L. Lu, The diameter of sparse random graphs. Adv. Appl. Math. 26.4 (2001), 257–279, doi: 10.1006/aama.2001.0720.
  • Coja-Oghlan, Feige, Frieze, Krivelevich and Vilenchik [2009] A. Coja-Oghlan, U. Feige, A. Frieze, M. Krivelevich and D. Vilenchik, On smoothed k𝑘kitalic_k-CNF formulas and the Walksat algorithm. Proceedings of the Twentieth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA ’09, Society for Industrial and Applied Mathematics, USA (2009) 451–460, doi: 10.1137/1.9781611973068.50.
  • Cooper and Frieze [2005] C. Cooper and A. Frieze, The cover time of random regular graphs. SIAM J. Discrete Math. 18.4 (2005), 728–740, doi: 10.1137/S0895480103428478.
  • Ding, Lubetzky and Peres [2014] J. Ding, E. Lubetzky and Y. Peres, Anatomy of the giant component: the strictly supercritical regime. Eur. J. Comb. 35 (2014), 155–168, doi: 10.1016/j.ejc.2013.06.004.
  • Diskin and Krivelevich [2022] S. Diskin and M. Krivelevich, Expansion in supercritical random subgraphs of expanders and its consequences. arXiv e-prints (2022). arXiv: 2205.04852.
  • Durrett [2006] R. Durrett, Random graph dynamics, Camb. Ser. Stat. Probab. Math., vol. 20. Cambridge University Press, Cambridge (2006), doi: 10.1017/CBO9780511546594.
  • Dyer, Frieze and Kannan [1991] M. Dyer, A. Frieze and R. Kannan, A random polynomial-time algorithm for approximating the volume of convex bodies. J. Assoc. Comput. Mach. 38.1 (1991), 1–17, doi: 10.1145/102782.102783.
  • Dyer, Galanis, Goldberg, Jerrum and Vigoda [2020] M. E. Dyer, A. Galanis, L. A. Goldberg, M. Jerrum and E. Vigoda, Random Walks on Small World Networks. ACM Trans. Algorithms 16.3 (2020), article 37, doi: 10.1145/3382208.
  • Erdős and Rényi [1960] P. Erdős and A. Rényi, On the evolution of random graphs. Publ. Math. Inst. Hung. Acad. Sci., Ser. A 5.1 (1960), 17–60.
  • Feige [2007] U. Feige, Refuting smoothed 3CNF formulas. 48th Annual IEEE Symposium on Foundations of Computer Science (FOCS’07), IEEE (2007) 407–417, doi: 10.1109/FOCS.2007.16.
  • Fountoulakis and Reed [2007] N. Fountoulakis and B. A. Reed, Faster mixing and small bottlenecks. Probab. Theory Related Fields 137.3-4 (2007), 475–486, doi: 10.1007/s00440-006-0003-8.
  • Fountoulakis and Reed [2008] ———, The evolution of the mixing rate of a simple random walk on the giant component of a random graph. Random Struct. Algorithms 33.1 (2008), 68–86, doi: 10.1002/rsa.20210.
  • Frieze, Krivelevich and Martin [2004] A. Frieze, M. Krivelevich and R. Martin, The emergence of a giant component in random subgraphs of pseudo-random graphs. Random Struct. Algorithms 24.1 (2004), 42–50, doi: 10.1002/rsa.10100.
  • Han, Morris and Treglown [2021] J. Han, P. Morris and A. Treglown, Tilings in randomly perturbed graphs: bridging the gap between Hajnal-Szemerédi and Johansson-Kahn-Vu. Random Struct. Algorithms 58.3 (2021), 480–516, doi: 10.1002/rsa.20981.
  • Hermon, Sly and Sousi [2022] J. Hermon, A. Sly and P. Sousi, Universality of cutoff for graphs with an added random matching. Ann. Probab. 50.1 (2022), 203–240, doi: 10.1214/21-AOP1532.
  • Jerrum and Sinclair [1988] M. Jerrum and A. Sinclair, Conductance and the rapid mixing property for Markov chains: the approximation of permanent resolved. Proceedings of the Twentieth Annual ACM Symposium on Theory of Computing, STOC ’88, Association for Computing Machinery, New York, NY, USA (1988) 235–244, doi: 10.1145/62212.62234.
  • Kalai, Samorodnitsky and Teng [2009] A. T. Kalai, A. Samorodnitsky and S.-H. Teng, Learning and smoothed analysis. 2009 50th Annual IEEE Symposium on Foundations of Computer Science, IEEE (2009) 395–404, doi: 10.1109/FOCS.2009.60.
  • Kleinberg [2000] J. Kleinberg, The Small-World Phenomenon: An Algorithmic Perspective. Proceedings of the Thirty-Second Annual ACM Symposium on Theory of Computing, STOC ’00, Association for Computing Machinery, New York, NY, USA (2000) 163–170, doi: 10.1145/335305.335325.
  • Krivelevich, Reichman and Samotij [2015] M. Krivelevich, D. Reichman and W. Samotij, Smoothed analysis on connected graphs. SIAM J. Discrete Math. 29.3 (2015), 1654–1669, doi: 10.1137/151002496.
  • Krivelevich and Sudakov [2006] M. Krivelevich and B. Sudakov, Pseudo-random graphs. More sets, graphs and numbers, 199–262, Springer (2006), doi: 10.1007/978-3-540-32439-3_10.
  • Levin, Peres and Wilmer [2009] D. A. Levin, Y. Peres and E. L. Wilmer, Markov chains and mixing times. American Mathematical Society, Providence, RI (2009), ISBN 978-0-8218-4739-8, doi: 10.1090/mbk/058. With a chapter by James G. Propp and David B. Wilson.
  • Lubetzky and Sly [2010] E. Lubetzky and A. Sly, Cutoff phenomena for random walks on random regular graphs. Duke Math. J. 153.3 (2010), 475–510, doi: 10.1215/00127094-2010-029.
  • Manzo, Quattropani and Scoppola [2021] F. Manzo, M. Quattropani and E. Scoppola, A probabilistic proof of Cooper and Frieze’s “First Visit Time Lemma”. arXiv e-prints (2021). arXiv: 2101.10748.
  • Newman and Watts [1999a] M. E. J. Newman and D. J. Watts, Renormalization group analysis of the small-world network model. Phys. Lett., A 263.4-6 (1999a), 341–346, doi: 10.1016/S0375-9601(99)00757-4.
  • Newman and Watts [1999b] ———, Scaling and percolation in the small-world network model. Phys. Rev. E 60 (1999b), 7332–7342, doi: 10.1103/PhysRevE.60.7332.
  • Randall [2006] D. Randall, Rapidly mixing Markov chains with applications in computer science and physics. Comput. Sci. Eng. 8.2 (2006), 30–41, doi: 10.1109/MCSE.2006.30.
  • Sankar, Spielman and Teng [2006] A. Sankar, D. A. Spielman and S.-H. Teng, Smoothed analysis of the condition numbers and growth factors of matrices. SIAM J. Matrix Anal. Appl. 28.2 (2006), 446–476, doi: 10.1137/S0895479803436202.
  • Spielman and Teng [2004] D. A. Spielman and S.-H. Teng, Smoothed analysis of algorithms: Why the simplex algorithm usually takes polynomial time. J. ACM 51.3 (2004), 385–463, doi: 10.1145/990308.990310.
  • Spielman and Teng [2009] ———, Smoothed analysis: an attempt to explain the behavior of algorithms in practice. Commun. ACM 52.10 (2009), 76–84, doi: 10.1145/1562764.1562785.
  • Vigoda [2000] E. Vigoda, Improved bounds for sampling colorings. J. Math. Phys. 41.3 (2000), 1555–1569, doi: 10.1063/1.533196.
  • Vu and Tao [2007] V. H. Vu and T. Tao, The condition number of a randomly perturbed matrix. Proceedings of the Thirty-Ninth Annual ACM Symposium on Theory of Computing, STOC ’07, Association for Computing Machinery, New York, NY, USA (2007) 248–255, doi: 10.1145/1250790.1250828.