-
Sampling Balanced Forests of Grids in Polynomial Time
Authors:
Sarah Cannon,
Wesley Pegden,
Jamie Tucker-Foltz
Abstract:
We prove that a polynomial fraction of the set of $k$-component forests in the $m \times n$ grid graph have equal numbers of vertices in each component, for any constant $k$. This resolves a conjecture of Charikar, Liu, Liu, and Vuong, and establishes the first provably polynomial-time algorithm for (exactly or approximately) sampling balanced grid graph partitions according to the spanning tree distribution, which weights each $k$-partition according to the product, across its $k$ pieces, of the number of spanning trees of each piece. Our result follows from a careful analysis of the probability a uniformly random spanning tree of the grid can be cut into balanced pieces.
Beyond grids, we show that for a broad family of lattice-like graphs, we achieve balance up to any multiplicative $(1 \pm \varepsilon)$ constant with constant probability, and up to an additive constant with polynomial probability. More generally, we show that, with constant probability, components derived from uniform spanning trees can approximate any given partition of a planar region specified by Jordan curves. These results imply polynomial time algorithms for sampling approximately balanced tree-weighted partitions for lattice-like graphs.
Our results have applications to understanding political districtings, where there is an underlying graph of indivisible geographic units that must be partitioned into $k$ population-balanced connected subgraphs. In this setting, tree-weighted partitions have interesting geometric properties, and this has stimulated significant effort to develop methods to sample them.
Submitted 11 January, 2024; v1 submitted 23 October, 2023;
originally announced October 2023.
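As a small illustration of the spanning tree distribution described in this abstract, the sketch below computes the tree weight of a partition, that is, the product over its pieces of the number of spanning trees of each piece. It is not the sampling algorithm from the paper. The helper names (`grid_edges`, `spanning_tree_count`, `tree_weight`) are hypothetical, and the count uses the standard matrix-tree theorem with exact rational arithmetic.

```python
from fractions import Fraction
from itertools import product


def grid_edges(m, n):
    """Edges of the m x n grid graph on vertices (row, col). Hypothetical helper."""
    edges = []
    for r, c in product(range(m), range(n)):
        if r + 1 < m:
            edges.append(((r, c), (r + 1, c)))
        if c + 1 < n:
            edges.append(((r, c), (r, c + 1)))
    return edges


def spanning_tree_count(vertices, edges):
    """Number of spanning trees of the subgraph induced on `vertices`.

    Uses the matrix-tree theorem: the count equals the determinant of the
    graph Laplacian with one row and the matching column deleted.
    """
    verts = sorted(vertices)
    if len(verts) <= 1:
        return 1  # a single vertex has exactly one (empty) spanning tree
    index = {v: i for i, v in enumerate(verts)}
    size = len(verts)
    lap = [[Fraction(0)] * size for _ in range(size)]
    for u, v in edges:
        if u in index and v in index:
            i, j = index[u], index[v]
            lap[i][i] += 1
            lap[j][j] += 1
            lap[i][j] -= 1
            lap[j][i] -= 1
    # Drop the last row and column, then take an exact determinant
    # by Gaussian elimination over the rationals.
    mat = [row[:-1] for row in lap[:-1]]
    k = len(mat)
    det = Fraction(1)
    for col in range(k):
        pivot = next((r for r in range(col, k) if mat[r][col] != 0), None)
        if pivot is None:
            return 0  # singular minor: the piece is disconnected
        if pivot != col:
            mat[col], mat[pivot] = mat[pivot], mat[col]
            det = -det
        det *= mat[col][col]
        for r in range(col + 1, k):
            factor = mat[r][col] / mat[col][col]
            for c in range(col, k):
                mat[r][c] -= factor * mat[col][c]
    return int(det)


def tree_weight(pieces, edges):
    """Product, across the pieces, of the number of spanning trees of each piece."""
    weight = 1
    for piece in pieces:
        weight *= spanning_tree_count(piece, edges)
    return weight
```

For example, splitting the $2 \times 4$ grid into two $2 \times 2$ blocks gives weight $4 \cdot 4 = 16$, while splitting it into its two rows (each a path) gives weight $1$, so the spanning tree distribution favors the more compact partition.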
-
The bright side of simple heuristics for the TSP
Authors:
Alan Frieze,
Wesley Pegden
Abstract:
The greedy and nearest-neighbor TSP heuristics can both have $\log n$ approximation factors from optimal in the worst case, even just for $n$ points in Euclidean space. In this note, we show that this approximation factor is only realized when the optimal tour is unusually short. In particular, for points from any fixed $d$-Ahlfors regular metric space (which includes any $d$-manifold like the $d$-cube $[0,1]^d$ in the case $d$ is an integer but also fractals of dimension $d$ when $d$ is real-valued), our results imply that the greedy and nearest-neighbor heuristics have \emph{additive} errors from optimal on the order of the \emph{optimal} tour length through \emph{random} points in the same space, for $d>1$.
Submitted 4 October, 2023;
originally announced October 2023.
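For readers unfamiliar with the nearest-neighbor heuristic named in this abstract, a minimal sketch follows. It assumes points in Euclidean space given as coordinate tuples; the function names and the index-based tie-breaking rule are illustrative choices, not taken from the paper.

```python
import math


def tour_length(points, order):
    """Length of the closed tour visiting `points` in the given index order."""
    return sum(
        math.dist(points[order[i]], points[order[(i + 1) % len(order)]])
        for i in range(len(order))
    )


def nearest_neighbor_tour(points, start=0):
    """Nearest-neighbor heuristic: repeatedly move to the closest unvisited point.

    Ties are broken by the smaller index (an arbitrary choice for this sketch).
    """
    unvisited = set(range(len(points)))
    unvisited.remove(start)
    order = [start]
    while unvisited:
        last = points[order[-1]]
        nxt = min(unvisited, key=lambda i: (math.dist(last, points[i]), i))
        unvisited.remove(nxt)
        order.append(nxt)
    return order
```

On the four corners of the unit square, `nearest_neighbor_tour([(0, 0), (1, 0), (1, 1), (0, 1)])` walks around the perimeter, which here happens to be the optimal tour.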
-
Direct sampling of short paths for contiguous partitioning
Authors:
Wesley Pegden,
Anish Sevekari
Abstract:
In this paper, we provide a family of dynamic programming based algorithms to sample nearly-shortest self-avoiding walks between two points of the integer lattice $\mathbb{Z}^2$. We show that if the shortest path between two points has length $n$, then we can sample paths (self-avoiding walks) of length $n+O(n^{1-δ})$ in polynomial time. As an example of an application, we will show that the Glauber dynamics Markov chain for partitions of the Aztec Diamonds in $\mathbb{Z}^2$ into two contiguous regions with nearly tight perimeter constraints has exponential mixing time, while the algorithm provided in this paper can be used to uniformly (and exactly) sample such partitions efficiently.
Submitted 11 July, 2023;
originally announced July 2023.
-
Subexponential mixing for partition chains on grid-like graphs
Authors:
Alan Frieze,
Wesley Pegden
Abstract:
We consider the problem of generating uniformly random partitions of the vertex set of a graph such that every piece induces a connected subgraph. For the case where we want to have partitions with linearly many pieces of bounded size, we obtain approximate sampling algorithms based on Glauber dynamics which are fixed-parameter tractable with respect to the bandwidth of $G$, with simple-exponential dependence on the bandwidth. For example, for rectangles of constant or logarithmic width this gives polynomial-time sampling algorithms. More generally, this gives sub-exponential algorithms for bounded-degree graphs without large expander subgraphs (for example, we obtain $O(2^{\sqrt n})$ time algorithms for square grids).
In the case where we instead want partitions with a small number of pieces of linear size, we show that Glauber dynamics can have exponential mixing time, even just for the case of 2 pieces, and even for 2-connected subgraphs of the grid with bounded bandwidth.
Submitted 1 June, 2022;
originally announced June 2022.
-
Spanners in randomly weighted graphs: independent edge lengths
Authors:
Alan Frieze,
Wesley Pegden
Abstract:
Given a connected graph $G=(V,E)$ and a length function $\ell:E\to {\mathbb R}$ we let $d_{v,w}$ denote the shortest distance between vertex $v$ and vertex $w$. A $t$-spanner is a subset $E'\subseteq E$ such that if $d'_{v,w}$ denotes shortest distances in the subgraph $G'=(V,E')$ then $d'_{v,w}\leq t d_{v,w}$ for all $v,w\in V$. We show that for a large class of graphs with suitable degree and expansion properties with independent exponential mean one edge lengths, there is w.h.p. a 1-spanner that uses $\approx \frac12n\log n$ edges and that this is best possible. In particular, our result applies to the random graphs $G_{n,p}$ for $np\gg \log n$.
Submitted 25 October, 2021; v1 submitted 4 May, 2021;
originally announced May 2021.
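The $t$-spanner condition $d'_{v,w}\leq t\, d_{v,w}$ defined in this abstract can be checked directly on small examples. The sketch below is a brute-force verifier, not a construction from the paper. It assumes vertices labeled $0,\dots,n-1$, nonnegative edge lengths, and edges given as `(u, v, length)` triples; the function names and the small floating-point tolerance `tol` are hypothetical.

```python
import heapq


def shortest_distances(n, edges, source):
    """Dijkstra's algorithm on an undirected graph with nonnegative lengths."""
    adj = [[] for _ in range(n)]
    for u, v, length in edges:
        adj[u].append((v, length))
        adj[v].append((u, length))
    dist = [float("inf")] * n
    dist[source] = 0.0
    heap = [(0.0, source)]
    while heap:
        d, u = heapq.heappop(heap)
        if d > dist[u]:
            continue  # stale heap entry
        for v, length in adj[u]:
            nd = d + length
            if nd < dist[v]:
                dist[v] = nd
                heapq.heappush(heap, (nd, v))
    return dist


def is_t_spanner(n, edges, sub_edges, t, tol=1e-12):
    """Check d'(v, w) <= t * d(v, w) for all pairs, where d' uses only sub_edges."""
    for s in range(n):
        full = shortest_distances(n, edges, s)
        sub = shortest_distances(n, sub_edges, s)
        if any(sub[v] > t * full[v] + tol for v in range(n)):
            return False
    return True
```

For instance, in a triangle with lengths 1, 1, and 3, dropping the length-3 edge leaves a 1-spanner, because the two short edges already realize the shortest distance between its endpoints.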
-
Comb inequalities for typical Euclidean TSP instances
Authors:
Wesley Pegden,
Anish Sevekari
Abstract:
We prove that even in the average case, the Euclidean Traveling Salesman Problem exhibits an integrality gap of $(1+ε)$ for $ε>0$ when the Held-Karp Linear Programming relaxation is augmented by all comb inequalities of bounded size. This implies that large classes of branch-and-cut algorithms take exponential time for the Euclidean TSP, even on random inputs.
Submitted 30 March, 2023; v1 submitted 1 December, 2020;
originally announced December 2020.
-
Semi-bandit Optimization in the Dispersed Setting
Authors:
Maria-Florina Balcan,
Travis Dick,
Wesley Pegden
Abstract:
The goal of data-driven algorithm design is to obtain high-performing algorithms for specific application domains using machine learning and data. Across many fields in AI, science, and engineering, practitioners will often fix a family of parameterized algorithms and then optimize those parameters to obtain good performance on example instances from the application domain. In the online setting, we must choose algorithm parameters for each instance as they arrive, and our goal is to be competitive with the best fixed algorithm in hindsight.
There are two major challenges in online data-driven algorithm design. First, it can be computationally expensive to evaluate the loss functions that map algorithm parameters to performance, which often require the learner to run a combinatorial algorithm to measure its performance. Second, the losses can be extremely volatile and have sharp discontinuities. However, we show that in many applications, evaluating the loss function for one algorithm choice can sometimes reveal the loss for a range of similar algorithms, essentially for free. We develop online optimization algorithms capable of using this kind of extra information by working in the semi-bandit feedback setting. Our algorithms achieve regret bounds that are essentially as good as algorithms under full-information feedback and are significantly more computationally efficient. We apply our semi-bandit results to obtain the first provable guarantees for data-driven algorithm design for linkage-based clustering and we improve the best regret bounds for designing greedy knapsack algorithms.
Submitted 21 December, 2020; v1 submitted 18 April, 2019;
originally announced April 2019.
-
On random multi-dimensional assignment problems
Authors:
Alan Frieze,
Wesley Pegden,
Tomasz Tkocz
Abstract:
We study random multidimensional assignment problems where the costs decompose into the sum of independent random variables. In particular, in three dimensions, we assume that the costs $W_{i,j,k}$ satisfy $W_{i,j,k}=a_{i,j}+b_{i,k}+c_{j,k}$ where the $a_{i,j},b_{i,k},c_{j,k}$ are independent exponential rate 1 random variables. Our objective is to minimize the total cost and we show that w.h.p. a simple greedy algorithm is a $(3+o(1))$-approximation. This is in contrast to the case where the $W_{i,j,k}$ are independent exponential rate 1 random variables. Here all that is known is an $n^{o(1)}$-approximation, due to Frieze and Sorkin.
Submitted 21 January, 2019;
originally announced January 2019.
-
On the cover time of dense graphs
Authors:
Colin Cooper,
Alan Frieze,
Wesley Pegden
Abstract:
We consider arbitrary graphs $G$ with $n$ vertices and minimum degree at least $δn$ where $δ>0$ is constant. If the conductance of $G$ is sufficiently large then we obtain an asymptotic expression for the cover time $C_G$ of $G$ as the solution to an explicit transcendental equation. Failing this, if the mixing time of a random walk on $G$ is of a lesser magnitude than the cover time, then we can obtain an asymptotic deterministic estimate via a decomposition into a bounded number of dense sub-graphs with high conductance. Failing this we give a deterministic asymptotic $(2+o(1))$-approximation of $C_G$.
Submitted 28 May, 2019; v1 submitted 10 October, 2018;
originally announced October 2018.
-
A partisan districting protocol with provably nonpartisan outcomes
Authors:
Wesley Pegden,
Ariel D. Procaccia,
Dingli Yu
Abstract:
We design and analyze a protocol for dividing a state into districts, where parties take turns proposing a division, and freezing a district from the other party's proposed division. We show that our protocol has predictable and provable guarantees for both the number of districts in which each party has a majority of supporters, and the extent to which either party has the power to pack a specific population into a single district.
Submitted 24 October, 2017;
originally announced October 2017.
-
Online purchasing under uncertainty
Authors:
Alan Frieze,
Wesley Pegden
Abstract:
Suppose there is a collection $x_1,x_2,\dots,x_N$ of independent uniform $[0,1]$ random variables, and a hypergraph $\mathcal{F}$ of \emph{target structures} on the vertex set $\{1,\dots,N\}$. We would like to buy a target structure at small cost, but we do not know all the costs $x_i$ ahead of time. Instead, we inspect the random variables $x_i$ one at a time, and after each inspection, choose to either keep the vertex $i$ at cost $x_i$, or reject vertex $i$ forever.
In the present paper, we consider the case where $\{1,\dots,N\}$ is the edge-set of some graph, and the target structures are the spanning trees of a graph, spanning arborescences of a digraph, the paths between a fixed pair of vertices, perfect matchings, Hamilton cycles or the cliques of some fixed size.
Submitted 10 January, 2017; v1 submitted 19 May, 2016;
originally announced May 2016.
-
Scalefree hardness of average-case Euclidean TSP approximation
Authors:
Alan Frieze,
Wesley Pegden
Abstract:
We show that if P$\neq$NP, then a wide class of TSP heuristics fail to approximate the length of the TSP to asymptotic optimality, even for random Euclidean instances. Previously, this result was not even known for any heuristics (greedy, etc) used in practice. As an application, we show that when using a heuristic from this class, a natural class of branch-and-bound algorithms takes exponential time to find an optimal tour (again, even on a random point-set), regardless of the particular branching strategy or lower-bound algorithm used.
Submitted 1 August, 2019; v1 submitted 15 April, 2016;
originally announced April 2016.
-
Separating subadditive Euclidean functionals
Authors:
Alan Frieze,
Wesley Pegden
Abstract:
If we are given $n$ random points in the hypercube $[0,1]^d$, then the minimum length of a Traveling Salesperson Tour through the points, the minimum length of a spanning tree, and the minimum length of a matching, etc., are known to be asymptotically $βn^{\frac{d-1}{d}}$ a.s., where $β$ is an absolute constant in each case. We prove separation results for these constants. In particular, concerning the constants $β_{\mathrm{TSP}}^d$, $β_{\mathrm{MST}}^d$, $β_{\mathrm{MM}}^d$, and $β_{\mathrm{TF}}^d$ from the asymptotic formulas for the minimum length TSP, spanning tree, matching, and 2-factor, respectively, we prove that $β_{\mathrm{MST}}^d<β_{\mathrm{TSP}}^d$, $2β_{\mathrm{MM}}^d<β_{\mathrm{TSP}}^d$, and $β_{\mathrm{TF}}^d<β_{\mathrm{TSP}}^d$ for all $d\geq 2$. We also asymptotically separate the TSP from its linear programming relaxation in this setting. Our results have some computational relevance, showing that a certain natural class of simple algorithms cannot solve the random Euclidean TSP efficiently.
Submitted 21 May, 2015; v1 submitted 8 January, 2015;
originally announced January 2015.
-
Looking for vertex number one
Authors:
Alan Frieze,
Wesley Pegden
Abstract:
Given an instance of the preferential attachment graph $G_n=([n],E_n)$, we would like to find vertex 1, using only 'local' information about the graph; that is, by exploring the neighborhoods of small sets of vertices. Borgs et al. gave an algorithm which runs in time $O(\log^4 n)$, which is local in the sense that at each step, it needs only to search the neighborhood of a set of vertices of size $O(\log^4 n)$. We give an algorithm to find vertex 1, which w.h.p. runs in time $O(ω\log n)$ and which is local in the strongest sense of operating only on neighborhoods of single vertices. Here $ω=ω(n)$ is any function that goes to infinity with $n$.
Submitted 19 May, 2016; v1 submitted 28 August, 2014;
originally announced August 2014.
-
An extension of the Moser-Tardos algorithmic local lemma
Authors:
Wesley Pegden
Abstract:
A recent theorem of Bissacot, et al., proved using results about the cluster expansion in statistical mechanics, extends the Lovász Local Lemma by weakening the conditions under which its conclusion holds. In this note, we prove an algorithmic analog of this result, extending Moser and Tardos's recent algorithmic Local Lemma, and providing an alternative proof of the theorem of Bissacot, et al. applicable in the Moser-Tardos algorithmic framework.
Submitted 12 March, 2011; v1 submitted 14 February, 2011;
originally announced February 2011.
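As background for the Moser-Tardos algorithmic framework mentioned in this abstract, here is a minimal sketch of the classical resampling procedure, specialized to CNF satisfiability. It shows only the basic scheme (resample the variables of a violated event until none remain), not the extension proved in the paper. The clause encoding (signed integers as literals), the function names, and the `max_steps` safety budget are assumptions made for this example.

```python
import random


def violated(clause, assignment):
    """A clause is violated when every one of its literals evaluates to false."""
    return all(assignment[abs(lit)] != (lit > 0) for lit in clause)


def moser_tardos(num_vars, clauses, rng, max_steps=100_000):
    """Resampling in the style of Moser and Tardos for a CNF formula.

    Variables are 1..num_vars; a literal is +v or -v. Starts from a uniformly
    random assignment and, while some clause is violated, resamples the
    variables of one violated clause independently and uniformly.
    """
    assignment = {v: rng.random() < 0.5 for v in range(1, num_vars + 1)}
    for _ in range(max_steps):
        bad = [c for c in clauses if violated(c, assignment)]
        if not bad:
            return assignment
        for lit in bad[0]:  # resample only the variables of one violated clause
            assignment[abs(lit)] = rng.random() < 0.5
    raise RuntimeError("step budget exhausted before all clauses were satisfied")
```

A call such as `moser_tardos(6, [(1, 2, 3), (-1, 4, 5), (-2, -4, 6), (-3, -5, -6)], random.Random(0))` returns a dictionary mapping each variable to a truth value under which no clause is violated.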
-
A finite goal set in the plane which is not a Winner
Authors:
Wesley Pegden
Abstract:
J. Beck has shown that if two players alternately select previously unchosen points from the plane, Player 1 can always build a congruent copy of any given finite goal set G, in spite of Player 2's efforts to stop him. We give a finite goal set G (it has 5 points) which Player 1 cannot construct before Player 2 in this achievement game played in the plane.
Submitted 23 January, 2011;
originally announced January 2011.