Search | arXiv e-print repository

Sampling Balanced Forests of Grids in Polynomial Time

Authors: Sarah Cannon, Wesley Pegden, Jamie Tucker-Foltz

Abstract: We prove that a polynomial fraction of the set of $k$-component forests in the $m \times n$ grid graph have equal numbers of vertices in each component, for any constant $k$. This resolves a conjecture of Charikar, Liu, Liu, and Vuong, and establishes the first provably polynomial-time algorithm for (exactly or approximately) sampling balanced grid graph partitions according to the spanning tree distribution, which weights each $k$-partition according to the product, across its $k$ pieces, of the number of spanning trees of each piece. Our result follows from a careful analysis of the probability a uniformly random spanning tree of the grid can be cut into balanced pieces. Beyond grids, we show that for a broad family of lattice-like graphs, we achieve balance up to any multiplicative $(1 \pm \varepsilon)$ constant with constant probability, and up to an additive constant with polynomial probability. More generally, we show that, with constant probability, components derived from uniform spanning trees can approximate any given partition of a planar region specified by Jordan curves. These results imply polynomial time algorithms for sampling approximately balanced tree-weighted partitions for lattice-like graphs. Our results have applications to understanding political districtings, where there is an underlying graph of indivisible geographic units that must be partitioned into $k$ population-balanced connected subgraphs. In this setting, tree-weighted partitions have interesting geometric properties, and this has stimulated significant effort to develop methods to sample them.

Submitted 11 January, 2024; v1 submitted 23 October, 2023; originally announced October 2023.
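
The spanning tree distribution described in the abstract above weights a partition by the product of the spanning-tree counts of its pieces. The following is a minimal sketch of that weight, not the authors' sampling algorithm: it counts spanning trees with Kirchhoff's matrix-tree theorem using exact rational arithmetic. The function names (`grid_edges`, `count_spanning_trees`, `partition_weight`) are hypothetical helpers introduced only for illustration.

```python
from fractions import Fraction


def grid_edges(m, n):
    """Edges of the m x n grid graph on vertices (row, col)."""
    edges = []
    for r in range(m):
        for c in range(n):
            if r + 1 < m:
                edges.append(((r, c), (r + 1, c)))
            if c + 1 < n:
                edges.append(((r, c), (r, c + 1)))
    return edges


def count_spanning_trees(vertices, edges):
    """Spanning trees of the subgraph induced on `vertices` (matrix-tree theorem).

    Returns 0 when the induced subgraph is disconnected.
    """
    verts = sorted(vertices)
    size = len(verts)
    if size <= 1:
        return size  # 0 for an empty piece, 1 for a single vertex
    index = {v: i for i, v in enumerate(verts)}
    lap = [[Fraction(0)] * size for _ in range(size)]
    for u, v in edges:
        if u in index and v in index:
            i, j = index[u], index[v]
            lap[i][i] += 1
            lap[j][j] += 1
            lap[i][j] -= 1
            lap[j][i] -= 1
    # Delete the last row and column, then take the determinant
    # by Gaussian elimination over the rationals (exact, no rounding).
    a = [row[:-1] for row in lap[:-1]]
    dim = size - 1
    det = Fraction(1)
    for col in range(dim):
        pivot = next((r for r in range(col, dim) if a[r][col] != 0), None)
        if pivot is None:
            return 0
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det
        det *= a[col][col]
        for r in range(col + 1, dim):
            factor = a[r][col] / a[col][col]
            for c in range(col, dim):
                a[r][c] -= factor * a[col][c]
    return int(det)


def partition_weight(pieces, edges):
    """Product over the pieces of the number of spanning trees of each piece."""
    weight = 1
    for piece in pieces:
        weight *= count_spanning_trees(piece, edges)
    return weight
```

For instance, splitting the $2 \times 4$ grid into its left and right $2 \times 2$ blocks gives two 4-cycles, each with 4 spanning trees, so the weight of that partition is 16.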

arXiv:2310.03222 [pdf, ps, other]

The bright side of simple heuristics for the TSP

Authors: Alan Frieze, Wesley Pegden

Abstract: The greedy and nearest-neighbor TSP heuristics can both have $\log n$ approximation factors from optimal in the worst case, even just for $n$ points in Euclidean space. In this note, we show that this approximation factor is only realized when the optimal tour is unusually short. In particular, for points from any fixed $d$-Ahlfors regular metric space (which includes any $d$-manifold like the $d$-cube $[0,1]^d$ in the case $d$ is an integer but also fractals of dimension $d$ when $d$ is real-valued), our results imply that the greedy and nearest-neighbor heuristics have \emph{additive} errors from optimal on the order of the \emph{optimal} tour length through \emph{random} points in the same space, for $d>1$.

Submitted 4 October, 2023; originally announced October 2023.
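
For readers unfamiliar with the nearest-neighbor heuristic named in the abstract above, here is a minimal sketch of its standard textbook form for points in Euclidean space. The abstract itself does not spell the heuristic out, so the details here (starting at index 0, breaking ties by lowest index, the function names) are assumptions made for illustration.

```python
import math


def nearest_neighbor_tour(points, start=0):
    """Visit points greedily: always move to the closest unvisited point.

    Ties are broken by the lower index (an arbitrary choice for this sketch).
    """
    unvisited = set(range(len(points)))
    unvisited.remove(start)
    tour = [start]
    while unvisited:
        here = points[tour[-1]]
        nxt = min(unvisited, key=lambda i: (math.dist(here, points[i]), i))
        unvisited.remove(nxt)
        tour.append(nxt)
    return tour


def tour_length(points, tour):
    """Length of the closed tour, including the edge back to the start."""
    return sum(
        math.dist(points[tour[i]], points[tour[(i + 1) % len(tour)]])
        for i in range(len(tour))
    )
```

A small usage example: for the collinear points `[(0, 0), (1, 0), (3, 0), (6, 0)]` the heuristic visits them in index order, and the closed tour has length 1 + 2 + 3 + 6 = 12.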

arXiv:2307.05042 [pdf, other]

Direct sampling of short paths for contiguous partitioning

Authors: Wesley Pegden, Anish Sevekari

Abstract: In this paper, we provide a family of dynamic programming based algorithms to sample nearly-shortest self-avoiding walks between two points of the integer lattice $\mathbb{Z}^2$. We show that if the shortest path between two points has length $n$, then we can sample paths (self-avoiding walks) of length $n+O(n^{1-δ})$ in polynomial time. As an example of an application, we will show that the Glauber dynamics Markov chain for partitions of the Aztec Diamonds in $\mathbb{Z}^2$ into two contiguous regions with nearly tight perimeter constraints has exponential mixing time, while the algorithm provided in this paper can be used to uniformly (and exactly) sample such partitions efficiently.

Submitted 11 July, 2023; originally announced July 2023.

Comments: 19 pages, 4 figures

arXiv:2206.00579 [pdf, other]

Subexponential mixing for partition chains on grid-like graphs

Authors: Alan Frieze, Wesley Pegden

Abstract: We consider the problem of generating uniformly random partitions of the vertex set of a graph such that every piece induces a connected subgraph. For the case where we want to have partitions with linearly many pieces of bounded size, we obtain approximate sampling algorithms based on Glauber dynamics which are fixed-parameter tractable with respect to the bandwidth of $G$, with simple-exponential dependence on the bandwidth. For example, for rectangles of constant or logarithmic width this gives polynomial-time sampling algorithms. More generally, this gives sub-exponential algorithms for bounded-degree graphs without large expander subgraphs (for example, we obtain $O(2^{\sqrt n})$ time algorithms for square grids). In the case where we instead want partitions with a small number of pieces of linear size, we show that Glauber dynamics can have exponential mixing time, even just for the case of 2 pieces, and even for 2-connected subgraphs of the grid with bounded bandwidth.

Submitted 1 June, 2022; originally announced June 2022.

Comments: 24 pages, 4 figures

MSC Class: 60J10

arXiv:2105.01718 [pdf, other]

Spanners in randomly weighted graphs: independent edge lengths

Authors: Alan Frieze, Wesley Pegden

Abstract: Given a connected graph $G=(V,E)$ and a length function $\ell:E\to {\mathbb R}$ we let $d_{v,w}$ denote the shortest distance between vertex $v$ and vertex $w$. A $t$-spanner is a subset $E'\subseteq E$ such that if $d'_{v,w}$ denotes shortest distances in the subgraph $G'=(V,E')$ then $d'_{v,w}\leq t d_{v,w}$ for all $v,w\in V$. We show that for a large class of graphs with suitable degree and expansion properties with independent exponential mean one edge lengths, there is w.h.p. a 1-spanner that uses $\approx \frac12n\log n$ edges and that this is best possible. In particular, our result applies to the random graphs $G_{n,p}$ for $np\gg \log n$.

Submitted 25 October, 2021; v1 submitted 4 May, 2021; originally announced May 2021.
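
The $t$-spanner definition in the abstract above can be checked directly on small examples. The sketch below is only an illustration of that definition, not a construction from the paper: it computes all-pairs shortest distances with Floyd-Warshall and tests $d'_{v,w}\leq t\, d_{v,w}$ for every pair. It assumes vertices are labelled `0..n-1`, edges are `(u, v, length)` triples, and lengths are non-negative (as for exponential edge lengths); the function names are hypothetical.

```python
import math


def all_pairs_shortest(n, edges):
    """Floyd-Warshall on an undirected graph with non-negative edge lengths."""
    d = [[math.inf] * n for _ in range(n)]
    for v in range(n):
        d[v][v] = 0
    for u, v, length in edges:
        d[u][v] = min(d[u][v], length)
        d[v][u] = min(d[v][u], length)
    for k in range(n):
        for i in range(n):
            for j in range(n):
                if d[i][k] + d[k][j] < d[i][j]:
                    d[i][j] = d[i][k] + d[k][j]
    return d


def is_t_spanner(n, edges, sub_edges, t):
    """True if distances using only sub_edges are at most t times the true distances."""
    d = all_pairs_shortest(n, edges)
    d_sub = all_pairs_shortest(n, sub_edges)
    return all(d_sub[v][w] <= t * d[v][w] for v in range(n) for w in range(n))
```

As a usage example, take a triangle with edge lengths 1, 1, and 3. The two unit edges alone form a 1-spanner, because the length-3 edge is never on a shortest path; dropping one unit edge instead stretches one distance from 1 to 4, so the remaining two edges form a 4-spanner but not a 3-spanner.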

arXiv:2012.00292 [pdf, ps, other]

Comb inequalities for typical Euclidean TSP instances

Authors: Wesley Pegden, Anish Sevekari

Abstract: We prove that even in the average case, the Euclidean Traveling Salesman Problem exhibits an integrality gap of $(1+ε)$ for $ε>0$ when the Held-Karp Linear Programming relaxation is augmented by all comb inequalities of bounded size. This implies that large classes of branch-and-cut algorithms take exponential time for the Euclidean TSP, even on random inputs.

Submitted 30 March, 2023; v1 submitted 1 December, 2020; originally announced December 2020.

Comments: 37 pages, 4 figures

MSC Class: Primary: 90B05; secondary: 90C40; 90C90

arXiv:1904.09014 [pdf, other]

Semi-bandit Optimization in the Dispersed Setting

Authors: Maria-Florina Balcan, Travis Dick, Wesley Pegden

Abstract: The goal of data-driven algorithm design is to obtain high-performing algorithms for specific application domains using machine learning and data. Across many fields in AI, science, and engineering, practitioners will often fix a family of parameterized algorithms and then optimize those parameters to obtain good performance on example instances from the application domain. In the online setting, we must choose algorithm parameters for each instance as they arrive, and our goal is to be competitive with the best fixed algorithm in hindsight. There are two major challenges in online data-driven algorithm design. First, it can be computationally expensive to evaluate the loss functions that map algorithm parameters to performance, which often require the learner to run a combinatorial algorithm to measure its performance. Second, the losses can be extremely volatile and have sharp discontinuities. However, we show that in many applications, evaluating the loss function for one algorithm choice can sometimes reveal the loss for a range of similar algorithms, essentially for free. We develop online optimization algorithms capable of using this kind of extra information by working in the semi-bandit feedback setting. Our algorithms achieve regret bounds that are essentially as good as algorithms under full-information feedback and are significantly more computationally efficient. We apply our semi-bandit results to obtain the first provable guarantees for data-driven algorithm design for linkage-based clustering and we improve the best regret bounds for designing greedy knapsack algorithms.

Submitted 21 December, 2020; v1 submitted 18 April, 2019; originally announced April 2019.

arXiv:1901.07167 [pdf, ps, other]

On random multi-dimensional assignment problems

Authors: Alan Frieze, Wesley Pegden, Tomasz Tkocz

Abstract: We study random multidimensional assignment problems where the costs decompose into the sum of independent random variables. In particular, in three dimensions, we assume that the costs $W_{i,j,k}$ satisfy $W_{i,j,k}=a_{i,j}+b_{i,k}+c_{j,k}$ where the $a_{i,j},b_{i,k},c_{j,k}$ are independent exponential rate 1 random variables. Our objective is to minimize the total cost and we show that w.h.p. a simple greedy algorithm is a $(3+o(1))$-approximation. This is in contrast to the case where the $W_{i,j,k}$ are independent exponential rate 1 random variables. Here all that is known is an $n^{o(1)}$-approximation, due to Frieze and Sorkin.

Submitted 21 January, 2019; originally announced January 2019.

Journal ref: Discrete Appl. Math. 287 (2020), 1-9

arXiv:1810.04772 [pdf, ps, other]

On the cover time of dense graphs

Authors: Colin Cooper, Alan Frieze, Wesley Pegden

Abstract: We consider arbitrary graphs $G$ with $n$ vertices and minimum degree at least $δn$ where $δ>0$ is constant. If the conductance of $G$ is sufficiently large then we obtain an asymptotic expression for the cover time $C_G$ of $G$ as the solution to an explicit transcendental equation. Failing this, if the mixing time of a random walk on $G$ is of a lesser magnitude than the cover time, then we can obtain an asymptotic deterministic estimate via a decomposition into a bounded number of dense sub-graphs with high conductance. Failing this we give a deterministic asymptotic $(2+o(1))$-approximation of $C_G$.

Submitted 28 May, 2019; v1 submitted 10 October, 2018; originally announced October 2018.

arXiv:1710.08781 [pdf, ps, other]

A partisan districting protocol with provably nonpartisan outcomes

Authors: Wesley Pegden, Ariel D. Procaccia, Dingli Yu

Abstract: We design and analyze a protocol for dividing a state into districts, where parties take turns proposing a division, and freezing a district from the other party's proposed division. We show that our protocol has predictable and provable guarantees for both the number of districts in which each party has a majority of supporters, and the extent to which either party has the power to pack a specific population into a single district.

Submitted 24 October, 2017; originally announced October 2017.

Comments: 20 pages, 3 figures

MSC Class: 91B14 (primary); 91A05 (secondary)

arXiv:1605.06072 [pdf, ps, other]

Online purchasing under uncertainty

Authors: Alan Frieze, Wesley Pegden

Abstract: Suppose there is a collection $x_1,x_2,\dots,x_N$ of independent uniform $[0,1]$ random variables, and a hypergraph $\mathcal{F}$ of \emph{target structures} on the vertex set $\{1,\dots,N\}$. We would like to buy a target structure at small cost, but we do not know all the costs $x_i$ ahead of time. Instead, we inspect the random variables $x_i$ one at a time, and after each inspection, choose to either keep the vertex $i$ at cost $x_i$, or reject vertex $i$ forever. In the present paper, we consider the case where $\{1,\dots,N\}$ is the edge-set of some graph, and the target structures are the spanning trees of a graph, spanning arborescences of a digraph, the paths between a fixed pair of vertices, perfect matchings, Hamilton cycles or the cliques of some fixed size.

Submitted 10 January, 2017; v1 submitted 19 May, 2016; originally announced May 2016.

Comments: Minor changes

arXiv:1604.04549 [pdf, other]

Scalefree hardness of average-case Euclidean TSP approximation

Authors: Alan Frieze, Wesley Pegden

Abstract: We show that if P$\neq$NP, then a wide class of TSP heuristics fail to approximate the length of the TSP to asymptotic optimality, even for random Euclidean instances. Previously, this result was not even known for any heuristics (greedy, etc.) used in practice. As an application, we show that when using a heuristic from this class, a natural class of branch-and-bound algorithms takes exponential time to find an optimal tour (again, even on a random point-set), regardless of the particular branching strategy or lower-bound algorithm used.

Submitted 1 August, 2019; v1 submitted 15 April, 2016; originally announced April 2016.

Comments: 20 pages, 4 figures

arXiv:1501.01944 [pdf, other]

Separating subadditive Euclidean functionals

Authors: Alan Frieze, Wesley Pegden

Abstract: If we are given $n$ random points in the hypercube $[0,1]^d$, then the minimum length of a Traveling Salesperson Tour through the points, the minimum length of a spanning tree, and the minimum length of a matching, etc., are known to be asymptotically $βn^{\frac{d-1}{d}}$ a.s., where $β$ is an absolute constant in each case. We prove separation results for these constants. In particular, concerning the constants $β_{\mathrm{TSP}}^d$, $β_{\mathrm{MST}}^d$, $β_{\mathrm{MM}}^d$, and $β_{\mathrm{TF}}^d$ from the asymptotic formulas for the minimum length TSP, spanning tree, matching, and 2-factor, respectively, we prove that $β_{\mathrm{MST}}^d<β_{\mathrm{TSP}}^d$, $2β_{\mathrm{MM}}^d<β_{\mathrm{TSP}}^d$, and $β_{\mathrm{TF}}^d<β_{\mathrm{TSP}}^d$ for all $d\geq 2$. We also asymptotically separate the TSP from its linear programming relaxation in this setting. Our results have some computational relevance, showing that a certain natural class of simple algorithms cannot solve the random Euclidean TSP efficiently.

Submitted 21 May, 2015; v1 submitted 8 January, 2015; originally announced January 2015.

Comments: 32 pages, 5 figures. Branch and bound theorem is now unconditional

MSC Class: 60C05

arXiv:1408.6821 [pdf, ps, other]

Looking for vertex number one

Authors: Alan Frieze, Wesley Pegden

Abstract: Given an instance of the preferential attachment graph $G_n=([n],E_n)$, we would like to find vertex 1, using only 'local' information about the graph; that is, by exploring the neighborhoods of small sets of vertices. Borgs et al. gave an algorithm which runs in time $O(\log^4 n)$, which is local in the sense that at each step, it needs only to search the neighborhood of a set of vertices of size $O(\log^4 n)$. We give an algorithm to find vertex 1, which w.h.p. runs in time $O(ω\log n)$ and which is local in the strongest sense of operating only on neighborhoods of single vertices. Here $ω=ω(n)$ is any function that goes to infinity with $n$.

Submitted 19 May, 2016; v1 submitted 28 August, 2014; originally announced August 2014.

Comments: As accepted for AAP

arXiv:1102.2853 [pdf, ps, other]

An extension of the Moser-Tardos algorithmic local lemma

Authors: Wesley Pegden

Abstract: A recent theorem of Bissacot et al., proved using results about the cluster expansion in statistical mechanics, extends the Lovász Local Lemma by weakening the conditions under which its conclusions hold. In this note, we prove an algorithmic analog of this result, extending Moser and Tardos's recent algorithmic Local Lemma, and providing an alternative proof of the theorem of Bissacot et al. applicable in the Moser-Tardos algorithmic framework.

Submitted 12 March, 2011; v1 submitted 14 February, 2011; originally announced February 2011.

Comments: 8 pages

arXiv:1101.4420 [pdf, ps, other]

A finite goal set in the plane which is not a Winner

Authors: Wesley Pegden

Abstract: J. Beck has shown that if two players alternately select previously unchosen points from the plane, Player 1 can always build a congruent copy of any given finite goal set G, in spite of Player 2's efforts to stop him. We give a finite goal set G (it has 5 points) which Player 1 cannot construct before Player 2 in this achievement game played in the plane.

Submitted 23 January, 2011; originally announced January 2011.

Comments: 8 pages, 4 figures

Showing 1–16 of 16 results for author: Pegden, W