Search | arXiv e-print repository

A Polynomial-Time Approximation for Pairwise Fair $k$-Median Clustering

Authors: Sayan Bandyapadhyay, Eden Chlamtáč, Yury Makarychev, Ali Vakilian

Abstract: In this work, we study pairwise fair clustering with $\ell \ge 2$ groups, where for every cluster $C$ and every group $i \in [\ell]$, the number of points in $C$ from group $i$ must be at most $t$ times the number of points in $C$ from any other group $j \in [\ell]$, for a given integer $t$. To the best of our knowledge, only bi-criteria approximation and exponential-time algorithms follow for thi… ▽ More In this work, we study pairwise fair clustering with $\ell \ge 2$ groups, where for every cluster $C$ and every group $i \in [\ell]$, the number of points in $C$ from group $i$ must be at most $t$ times the number of points in $C$ from any other group $j \in [\ell]$, for a given integer $t$. To the best of our knowledge, only bi-criteria approximation and exponential-time algorithms follow for this problem from the prior work on fair clustering problems when $\ell > 2$. In our work, focusing on the $\ell > 2$ case, we design the first polynomial-time $(t^{\ell}\cdot \ell\cdot k)^{O(\ell)}$-approximation for this problem with $k$-median cost that does not violate the fairness constraints. We complement our algorithmic result by providing hardness of approximation results, which show that our problem even when $\ell=2$ is almost as hard as the popular uniform capacitated $k$-median, for which no polynomial-time algorithm with an approximation factor of $o(\log k)$ is known. △ Less

Submitted 16 May, 2024; originally announced May 2024.

arXiv:2404.17669 [pdf, ps, other]

Approximation Algorithms for $\ell_p$-Shortest Path and $\ell_p$-Group Steiner Tree

Authors: Yury Makarychev, Max Ovsiankin, Erasmo Tani

Abstract: We present polylogarithmic approximation algorithms for variants of the Shortest Path, Group Steiner Tree, and Group ATSP problems with vector costs. In these problems, each edge e has a non-negative vector cost $c_e \in \mathbb{R}^{\ell}_{\ge 0}$. For a feasible solution - a path, subtree, or tour (respectively) - we find the total vector cost of all the edges in the solution and then compute the… ▽ More We present polylogarithmic approximation algorithms for variants of the Shortest Path, Group Steiner Tree, and Group ATSP problems with vector costs. In these problems, each edge e has a non-negative vector cost $c_e \in \mathbb{R}^{\ell}_{\ge 0}$. For a feasible solution - a path, subtree, or tour (respectively) - we find the total vector cost of all the edges in the solution and then compute the $\ell_p$-norm of the obtained cost vector (we assume that $p \ge 1$ is an integer). Our algorithms for series-parallel graphs run in polynomial time and those for arbitrary graphs run in quasi-polynomial time. To obtain our results, we introduce and use new flow-based Sum-of-Squares relaxations. We also obtain a number of hardness results. △ Less

Submitted 26 April, 2024; originally announced April 2024.

arXiv:2403.02212 [pdf, ps, other]

Constraint Satisfaction Problems with Advice

Authors: Suprovat Ghoshal, Konstantin Makarychev, Yury Makarychev

Abstract: We initiate the study of algorithms for constraint satisfaction problems with ML oracle advice. We introduce two models of advice and then design an approximation algorithm for Max Cut and Max 2-Lin in these models. We initiate the study of algorithms for constraint satisfaction problems with ML oracle advice. We introduce two models of advice and then design an approximation algorithm for Max Cut and Max 2-Lin in these models. △ Less

Submitted 4 March, 2024; originally announced March 2024.

arXiv:2311.09460 [pdf, other]

Near-Optimal Streaming Ellipsoidal Rounding for General Convex Polytopes

Authors: Yury Makarychev, Naren Sarayu Manoj, Max Ovsiankin

Abstract: We give near-optimal algorithms for computing an ellipsoidal rounding of a convex polytope whose vertices are given in a stream. The approximation factor is linear in the dimension (as in John's theorem) and only loses an excess logarithmic factor in the aspect ratio of the polytope. Our algorithms are nearly optimal in two senses: first, their runtimes nearly match those of the most efficient kno… ▽ More We give near-optimal algorithms for computing an ellipsoidal rounding of a convex polytope whose vertices are given in a stream. The approximation factor is linear in the dimension (as in John's theorem) and only loses an excess logarithmic factor in the aspect ratio of the polytope. Our algorithms are nearly optimal in two senses: first, their runtimes nearly match those of the most efficient known algorithms for the offline version of the problem. Second, their approximation factors nearly match a lower bound we show against a natural class of geometric streaming algorithms. In contrast to existing works in the streaming setting that compute ellipsoidal roundings only for centrally symmetric convex polytopes, our algorithms apply to general convex polytopes. We also show how to use our algorithms to construct coresets from a stream of points that approximately preserve both the ellipsoidal rounding and the convex hull of the original set of points. △ Less

Submitted 15 November, 2023; originally announced November 2023.

arXiv:2308.10160 [pdf, other]

Higher-Order Cheeger Inequality for Partitioning with Buffers

Authors: Konstantin Makarychev, Yury Makarychev, Liren Shan, Aravindan Vijayaraghavan

Abstract: We prove a new generalization of the higher-order Cheeger inequality for partitioning with buffers. Consider a graph $G=(V,E)$. The buffered expansion of a set $S \subseteq V$ with a buffer $B \subseteq V \setminus S$ is the edge expansion of $S$ after removing all the edges from set $S$ to its buffer $B$. An $\varepsilon$-buffered $k$-partitioning is a partitioning of a graph into disjoint compon… ▽ More We prove a new generalization of the higher-order Cheeger inequality for partitioning with buffers. Consider a graph $G=(V,E)$. The buffered expansion of a set $S \subseteq V$ with a buffer $B \subseteq V \setminus S$ is the edge expansion of $S$ after removing all the edges from set $S$ to its buffer $B$. An $\varepsilon$-buffered $k$-partitioning is a partitioning of a graph into disjoint components $P_i$ and buffers $B_i$, in which the size of buffer $B_i$ for $P_i$ is small relative to the size of $P_i$: $|B_i| \le \varepsilon |P_i|$. The buffered expansion of a buffered partition is the maximum of buffered expansions of the $k$ sets $P_i$ with buffers $B_i$. Let $h^{k,\varepsilon}_G$ be the buffered expansion of the optimal $\varepsilon$-buffered $k$-partitioning, then for every $δ>0$, $$h_G^{k,\varepsilon} \le O_δ(1) \cdot \Big( \frac{\log k}{ \varepsilon}\Big) \cdot λ_{\lfloor (1+δ) k\rfloor},$$ where $λ_{\lfloor (1+δ)k\rfloor}$ is the $\lfloor (1+δ)k\rfloor$-th smallest eigenvalue of the normalized Laplacian of $G$. Our inequality is constructive and avoids the ``square-root loss'' that is present in the standard Cheeger inequalities (even for $k=2$). We also provide a complementary lower bound, and a novel generalization to the setting with arbitrary vertex weights and edge costs. Moreover our result implies and generalizes the standard higher-order Cheeger inequalities and another recent Cheeger-type inequality by Kwok, Lau, and Lee (2017) involving robust vertex expansion. △ Less

Submitted 20 August, 2023; originally announced August 2023.

Comments: 45 pages

arXiv:2308.08373 [pdf, ps, other]

Approximation Algorithms for Norm Multiway Cut

Authors: Charlie Carlson, Jafar Jafarov, Konstantin Makarychev, Yury Makarychev, Liren Shan

Abstract: We consider variants of the classic Multiway Cut problem. Multiway Cut asks to partition a graph $G$ into $k$ parts so as to separate $k$ given terminals. Recently, Chandrasekaran and Wang (ESA 2021) introduced $\ell_p$-norm Multiway, a generalization of the problem, in which the goal is to minimize the $\ell_p$ norm of the edge boundaries of $k$ parts. We provide an… ▽ More We consider variants of the classic Multiway Cut problem. Multiway Cut asks to partition a graph $G$ into $k$ parts so as to separate $k$ given terminals. Recently, Chandrasekaran and Wang (ESA 2021) introduced $\ell_p$-norm Multiway, a generalization of the problem, in which the goal is to minimize the $\ell_p$ norm of the edge boundaries of $k$ parts. We provide an $O(\log^{1/2} n\log^{1/2+1/p} k)$ approximation algorithm for this problem, improving upon the approximation guarantee of $O(\log^{3/2} n \log^{1/2} k)$ due to Chandrasekaran and Wang. We also introduce and study Norm Multiway Cut, a further generalization of Multiway Cut. We assume that we are given access to an oracle, which answers certain queries about the norm. We present an $O(\log^{1/2} n \log^{7/2} k)$ approximation algorithm with a weaker oracle and an $O(\log^{1/2} n \log^{5/2} k)$ approximation algorithm with a stronger oracle. Additionally, we show that without any oracle access, there is no $n^{1/4-\varepsilon}$ approximation algorithm for every $\varepsilon > 0$ assuming the Hypergraph Dense-vs-Random Conjecture. △ Less

Submitted 16 August, 2023; originally announced August 2023.

Comments: 25 pages, ESA 2023

arXiv:2302.00213 [pdf, other]

Approximating Red-Blue Set Cover and Minimum Monotone Satisfying Assignment

Authors: Eden Chlamtáč, Yury Makarychev, Ali Vakilian

Abstract: We provide new approximation algorithms for the Red-Blue Set Cover and Circuit Minimum Monotone Satisfying Assignment (MMSA) problems. Our algorithm for Red-Blue Set Cover achieves $\tilde O(m^{1/3})$-approximation improving on the $\tilde O(m^{1/2})$-approximation due to Elkin and Peleg (where $m$ is the number of sets). Our approximation algorithm for MMSA$_t$ (for circuits of depth $t$) gives a… ▽ More We provide new approximation algorithms for the Red-Blue Set Cover and Circuit Minimum Monotone Satisfying Assignment (MMSA) problems. Our algorithm for Red-Blue Set Cover achieves $\tilde O(m^{1/3})$-approximation improving on the $\tilde O(m^{1/2})$-approximation due to Elkin and Peleg (where $m$ is the number of sets). Our approximation algorithm for MMSA$_t$ (for circuits of depth $t$) gives an $\tilde O(N^{1-δ})$ approximation for $δ= \frac{1}{3}2^{3-\lceil t/2\rceil}$, where $N$ is the number of gates and variables. No non-trivial approximation algorithms for MMSA$_t$ with $t\geq 4$ were previously known. We complement these results with lower bounds for these problems: For Red-Blue Set Cover, we provide a nearly approximation preserving reduction from Min $k$-Union that gives an $\tildeΩ(m^{1/4 - \varepsilon})$ hardness under the Dense-vs-Random conjecture, while for MMSA we sketch a proof that an SDP relaxation strengthened by Sherali--Adams has an integrality gap of $N^{1-\varepsilon}$ where $\varepsilon \to 0$ as the circuit depth $t\to \infty$. △ Less

Submitted 7 July, 2023; v1 submitted 31 January, 2023; originally announced February 2023.

Comments: APPROX 2023

arXiv:2206.07250 [pdf, ps, other]

Streaming Algorithms for Ellipsoidal Approximation of Convex Polytopes

Authors: Yury Makarychev, Naren Sarayu Manoj, Max Ovsiankin

Abstract: We give efficient deterministic one-pass streaming algorithms for finding an ellipsoidal approximation of a symmetric convex polytope. The algorithms are near-optimal in that their approximation factors differ from that of the optimal offline solution only by a factor sub-logarithmic in the aspect ratio of the polytope. We give efficient deterministic one-pass streaming algorithms for finding an ellipsoidal approximation of a symmetric convex polytope. The algorithms are near-optimal in that their approximation factors differ from that of the optimal offline solution only by a factor sub-logarithmic in the aspect ratio of the polytope. △ Less

Submitted 14 June, 2022; originally announced June 2022.

Comments: Accepted to COLT 2022

arXiv:2202.01391 [pdf, ps, other]

Fair Representation Clustering with Several Protected Classes

Authors: Zhen Dai, Yury Makarychev, Ali Vakilian

Abstract: We study the problem of fair $k$-median where each cluster is required to have a fair representation of individuals from different groups. In the fair representation $k$-median problem, we are given a set of points $X$ in a metric space. Each point $x\in X$ belongs to one of $\ell$ groups. Further, we are given fair representation parameters $α_j$ and $β_j$ for each group $j\in [\ell]$. We say tha… ▽ More We study the problem of fair $k$-median where each cluster is required to have a fair representation of individuals from different groups. In the fair representation $k$-median problem, we are given a set of points $X$ in a metric space. Each point $x\in X$ belongs to one of $\ell$ groups. Further, we are given fair representation parameters $α_j$ and $β_j$ for each group $j\in [\ell]$. We say that a $k$-clustering $C_1, \cdots, C_k$ fairly represents all groups if the number of points from group $j$ in cluster $C_i$ is between $α_j |C_i|$ and $β_j |C_i|$ for every $j\in[\ell]$ and $i\in [k]$. The goal is to find a set $\mathcal{C}$ of $k$ centers and an assignment $φ: X\rightarrow \mathcal{C}$ such that the clustering defined by $(\mathcal{C}, φ)$ fairly represents all groups and minimizes the $\ell_1$-objective $\sum_{x\in X} d(x, φ(x))$. We present an $O(\log k)$-approximation algorithm that runs in time $n^{O(\ell)}$. Note that the known algorithms for the problem either (i) violate the fairness constraints by an additive term or (ii) run in time that is exponential in both $k$ and $\ell$. We also consider an important special case of the problem where $α_j = β_j = \frac{f_j}{f}$ and $f_j, f \in \mathbb{N}$ for all $j\in [\ell]$. For this special case, we present an $O(\log k)$-approximation algorithm that runs in $(kf)^{O(\ell)}\log n + poly(n)$ time. △ Less

Submitted 2 February, 2022; originally announced February 2022.

arXiv:2111.04804 [pdf, ps, other]

Approximating Fair Clustering with Cascaded Norm Objectives

Authors: Eden Chlamtáč, Yury Makarychev, Ali Vakilian

Abstract: We introduce the $(p,q)$-Fair Clustering problem. In this problem, we are given a set of points $P$ and a collection of different weight functions $W$. We would like to find a clustering which minimizes the $\ell_q$-norm of the vector over $W$ of the $\ell_p$-norms of the weighted distances of points in $P$ from the centers. This generalizes various clustering problems, including Socially Fair… ▽ More We introduce the $(p,q)$-Fair Clustering problem. In this problem, we are given a set of points $P$ and a collection of different weight functions $W$. We would like to find a clustering which minimizes the $\ell_q$-norm of the vector over $W$ of the $\ell_p$-norms of the weighted distances of points in $P$ from the centers. This generalizes various clustering problems, including Socially Fair $k$-Median and $k$-Means, and is closely connected to other problems such as Densest $k$-Subgraph and Min $k$-Union. We utilize convex programming techniques to approximate the $(p,q)$-Fair Clustering problem for different values of $p$ and $q$. When $p\geq q$, we get an $O(k^{(p-q)/(2pq)})$, which nearly matches a $k^{Ω((p-q)/(pq))}$ lower bound based on conjectured hardness of Min $k$-Union and other problems. When $q\geq p$, we get an approximation which is independent of the size of the input for bounded $p,q$, and also matches the recent $O((\log n/(\log\log n))^{1/p})$-approximation for $(p, \infty)$-Fair Clustering by Makarychev and Vakilian (COLT 2021). △ Less

Submitted 8 November, 2021; originally announced November 2021.

Comments: SODA 2022

arXiv:2108.05697 [pdf, ps, other]

Local Correlation Clustering with Asymmetric Classification Errors

Authors: Jafar Jafarov, Sanchit Kalhan, Konstantin Makarychev, Yury Makarychev

Abstract: In the Correlation Clustering problem, we are given a complete weighted graph $G$ with its edges labeled as "similar" and "dissimilar" by a noisy binary classifier. For a clustering $\mathcal{C}$ of graph $G$, a similar edge is in disagreement with $\mathcal{C}$, if its endpoints belong to distinct clusters; and a dissimilar edge is in disagreement with $\mathcal{C}$ if its endpoints belong to the… ▽ More In the Correlation Clustering problem, we are given a complete weighted graph $G$ with its edges labeled as "similar" and "dissimilar" by a noisy binary classifier. For a clustering $\mathcal{C}$ of graph $G$, a similar edge is in disagreement with $\mathcal{C}$, if its endpoints belong to distinct clusters; and a dissimilar edge is in disagreement with $\mathcal{C}$ if its endpoints belong to the same cluster. The disagreements vector, $\text{dis}$, is a vector indexed by the vertices of $G$ such that the $v$-th coordinate $\text{dis}_v$ equals the weight of all disagreeing edges incident on $v$. The goal is to produce a clustering that minimizes the $\ell_p$ norm of the disagreements vector for $p\geq 1$. We study the $\ell_p$ objective in Correlation Clustering under the following assumption: Every similar edge has weight in the range of $[α\mathbf{w},\mathbf{w}]$ and every dissimilar edge has weight at least $α\mathbf{w}$ (where $α\leq 1$ and $\mathbf{w}>0$ is a scaling parameter). We give an $O\left((\frac{1}α)^{\frac{1}{2}-\frac{1}{2p}}\cdot \log\frac{1}α\right)$ approximation algorithm for this problem. Furthermore, we show an almost matching convex programming integrality gap. △ Less

Submitted 11 August, 2021; originally announced August 2021.

Comments: 24 pages, 2 figures. The conference version of this paper appeared in the proceedings of ICML 2021

arXiv:2108.05696 [pdf, ps, other]

Correlation Clustering with Asymmetric Classification Errors

Authors: Jafar Jafarov, Sanchit Kalhan, Konstantin Makarychev, Yury Makarychev

Abstract: In the Correlation Clustering problem, we are given a weighted graph $G$ with its edges labeled as "similar" or "dissimilar" by a binary classifier. The goal is to produce a clustering that minimizes the weight of "disagreements": the sum of the weights of "similar" edges across clusters and "dissimilar" edges within clusters. We study the correlation clustering problem under the following assumpt… ▽ More In the Correlation Clustering problem, we are given a weighted graph $G$ with its edges labeled as "similar" or "dissimilar" by a binary classifier. The goal is to produce a clustering that minimizes the weight of "disagreements": the sum of the weights of "similar" edges across clusters and "dissimilar" edges within clusters. We study the correlation clustering problem under the following assumption: Every "similar" edge $e$ has weight $\mathbf{w}_e\in[α\mathbf{w}, \mathbf{w}]$ and every "dissimilar" edge $e$ has weight $\mathbf{w}_e\geq α\mathbf{w}$ (where $α\leq 1$ and $\mathbf{w}>0$ is a scaling parameter). We give a $(3 + 2 \log_e (1/α))$ approximation algorithm for this problem. This assumption captures well the scenario when classification errors are asymmetric. Additionally, we show an asymptotically matching Linear Programming integrality gap of $Ω(\log 1/α)$. △ Less

Submitted 11 August, 2021; originally announced August 2021.

Comments: 24 pages, 2 figures. The conference version of this paper appeared in the proceedings of ICML 2020

arXiv:2103.02512 [pdf, ps, other]

Approximation Algorithms for Socially Fair Clustering

Authors: Yury Makarychev, Ali Vakilian

Abstract: We present an $(e^{O(p)} \frac{\log \ell}{\log\log\ell})$-approximation algorithm for socially fair clustering with the $\ell_p$-objective. In this problem, we are given a set of points in a metric space. Each point belongs to one (or several) of $\ell$ groups. The goal is to find a $k$-medians, $k$-means, or, more generally, $\ell_p$-clustering that is simultaneously good for all of the groups. M… ▽ More We present an $(e^{O(p)} \frac{\log \ell}{\log\log\ell})$-approximation algorithm for socially fair clustering with the $\ell_p$-objective. In this problem, we are given a set of points in a metric space. Each point belongs to one (or several) of $\ell$ groups. The goal is to find a $k$-medians, $k$-means, or, more generally, $\ell_p$-clustering that is simultaneously good for all of the groups. More precisely, we need to find a set of $k$ centers $C$ so as to minimize the maximum over all groups $j$ of $\sum_{u \text{ in group }j} d(u,C)^p$. The socially fair clustering problem was independently proposed by Ghadiri, Samadi, and Vempala [2021] and Abbasi, Bhaskara, and Venkatasubramanian [2021]. Our algorithm improves and generalizes their $O(\ell)$-approximation algorithms for the problem. The natural LP relaxation for the problem has an integrality gap of $Ω(\ell)$. In order to obtain our result, we introduce a strengthened LP relaxation and show that it has an integrality gap of $Θ(\frac{\log \ell}{\log\log\ell})$ for a fixed $p$. Additionally, we present a bicriteria approximation algorithm, which generalizes the bicriteria approximation of Abbasi et al. [2021]. △ Less

Submitted 15 July, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

Comments: COLT 2021

arXiv:1905.11930 [pdf, other]

Efficient Kirszbraun Extension with Applications to Regression

Authors: Hanan Zaichyk, Armin Biess, Aryeh Kontorovich, Yury Makarychev

Abstract: We introduce a framework for performing regression between two Hilbert spaces. This is done based on Kirszbraun's extension theorem, to the best of our knowledge, the first application of this technique to supervised learning. We analyze the statistical and computational aspects of this method. We decompose this task into two stages: training (which corresponds operationally to smoothing/regulariz… ▽ More We introduce a framework for performing regression between two Hilbert spaces. This is done based on Kirszbraun's extension theorem, to the best of our knowledge, the first application of this technique to supervised learning. We analyze the statistical and computational aspects of this method. We decompose this task into two stages: training (which corresponds operationally to smoothing/regularization) and prediction (which is achieved via Kirszbraun extension). Both are solved algorithmically via a novel multiplicative weight updates (MWU) scheme, which, for our problem formulation, achieves a quadratic runtime improvement over the state of the art. Our empirical results indicate a dramatic improvement over standard off-the-shelf solvers in our setting. △ Less

Submitted 8 March, 2022; v1 submitted 28 May, 2019; originally announced May 2019.

arXiv:1811.03591 [pdf, other]

Nonlinear Dimension Reduction via Outer Bi-Lipschitz Extensions

Authors: Sepideh Mahabadi, Konstantin Makarychev, Yury Makarychev, Ilya Razenshteyn

Abstract: We introduce and study the notion of an outer bi-Lipschitz extension of a map between Euclidean spaces. The notion is a natural analogue of the notion of a Lipschitz extension of a Lipschitz map. We show that for every map $f$ there exists an outer bi-Lipschitz extension $f'$ whose distortion is greater than that of $f$ by at most a constant factor. This result can be seen as a counterpart of the… ▽ More We introduce and study the notion of an outer bi-Lipschitz extension of a map between Euclidean spaces. The notion is a natural analogue of the notion of a Lipschitz extension of a Lipschitz map. We show that for every map $f$ there exists an outer bi-Lipschitz extension $f'$ whose distortion is greater than that of $f$ by at most a constant factor. This result can be seen as a counterpart of the classic Kirszbraun theorem for outer bi-Lipschitz extensions. We also study outer bi-Lipschitz extensions of near-isometric maps and show upper and lower bounds for them. Then, we present applications of our results to prioritized and terminal dimension reduction problems. * We prove a prioritized variant of the Johnson-Lindenstrauss lemma: given a set of points $X\subset \mathbb{R}^d$ of size $N$ and a permutation ("priority ranking") of $X$, there exists an embedding $f$ of $X$ into $\mathbb{R}^{O(\log N)}$ with distortion $O(\log \log N)$ such that the point of rank $j$ has only $O(\log^{3 + \varepsilon} j)$ non-zero coordinates - more specifically, all but the first $O(\log^{3+\varepsilon} j)$ coordinates are equal to $0$; the distortion of $f$ restricted to the first $j$ points (according to the ranking) is at most $O(\log\log j)$. The result makes a progress towards answering an open question by Elkin, Filtser, and Neiman about prioritized dimension reductions. * We prove that given a set $X$ of $N$ points in $\mathbb{R}^d$, there exists a terminal dimension reduction embedding of $\mathbb{R}^d$ into $\mathbb{R}^{d'}$, where $d' = O\left(\frac{\log N}{\varepsilon^4}\right)$, which preserves distances $\|x-y\|$ between points $x\in X$ and $y \in \mathbb{R}^{d}$, up to a multiplicative factor of $1 \pm \varepsilon$. This improves a recent result by Elkin, Filtser, and Neiman. The dimension reductions that we obtain are nonlinear, and this nonlinearity is necessary. △ Less

Submitted 8 November, 2018; originally announced November 2018.

Comments: 27 pages, 6 figures; an extended abstract appeared in the proceedings of STOC 2018

arXiv:1811.03195 [pdf, ps, other]

Performance of Johnson-Lindenstrauss Transform for k-Means and k-Medians Clustering

Authors: Konstantin Makarychev, Yury Makarychev, Ilya Razenshteyn

Abstract: Consider an instance of Euclidean $k$-means or $k$-medians clustering. We show that the cost of the optimal solution is preserved up to a factor of $(1+\varepsilon)$ under a projection onto a random $O(\log(k / \varepsilon) / \varepsilon^2)$-dimensional subspace. Further, the cost of every clustering is preserved within $(1+\varepsilon)$. More generally, our result applies to any dimension reducti… ▽ More Consider an instance of Euclidean $k$-means or $k$-medians clustering. We show that the cost of the optimal solution is preserved up to a factor of $(1+\varepsilon)$ under a projection onto a random $O(\log(k / \varepsilon) / \varepsilon^2)$-dimensional subspace. Further, the cost of every clustering is preserved within $(1+\varepsilon)$. More generally, our result applies to any dimension reduction map satisfying a mild sub-Gaussian-tail condition. Our bound on the dimension is nearly optimal. Additionally, our result applies to Euclidean $k$-clustering with the distances raised to the $p$-th power for any constant $p$. For $k$-means, our result resolves an open problem posed by Cohen, Elder, Musco, Musco, and Persu (STOC 2015); for $k$-medians, it answers a question raised by Kannan. △ Less

Submitted 8 April, 2020; v1 submitted 7 November, 2018; originally announced November 2018.

Comments: 31 pages, an extended abstract appeared in the proceedings of STOC 2019

arXiv:1611.07866 [pdf, ps, other]

Minimizing the Union: Tight Approximations for Small Set Bipartite Vertex Expansion

Authors: Eden Chlamtáč, Michael Dinitz, Yury Makarychev

Abstract: In the Minimum k-Union problem (MkU) we are given a set system with n sets and are asked to select k sets in order to minimize the size of their union. Despite being a very natural problem, it has received surprisingly little attention: the only known approximation algorithm is an $O(\sqrt{n})$-approximation due to [Chlamtáč et al APPROX '16]. This problem can also be viewed as the bipartite versi… ▽ More In the Minimum k-Union problem (MkU) we are given a set system with n sets and are asked to select k sets in order to minimize the size of their union. Despite being a very natural problem, it has received surprisingly little attention: the only known approximation algorithm is an $O(\sqrt{n})$-approximation due to [Chlamtáč et al APPROX '16]. This problem can also be viewed as the bipartite version of the Small Set Vertex Expansion problem (SSVE), which we call the Small Set Bipartite Vertex Expansion problem (SSBVE). SSVE, in which we are asked to find a set of k nodes to minimize their vertex expansion, has not been as well studied as its edge-based counterpart Small Set Expansion (SSE), but has recently received significant attention, e.g. [Louis-Makarychev APPROX '15]. However, due to the connection to Unique Games and hardness of approximation the focus has mostly been on sets of size $k = Ω(n)$, while we focus on the case of general $k$, for which no polylogarithmic approximation is known. We improve the upper bound for this problem by giving an $n^{1/4+\varepsilon}$ approximation for SSBVE for any constant $\varepsilon > 0$. Our algorithm follows in the footsteps of Densest $k$-Subgraph (DkS) and related problems, by designing a tight algorithm for random models, and then extending it to give the same guarantee for arbitrary instances. Moreover, we show that this is tight under plausible complexity conjectures: it cannot be approximated better than $O(n^{1/4})$ assuming an extension of the so-called "Dense vs Random" conjecture for DkS to hypergraphs. We show that the same bound is also matched by an integrality gap for a super-constant number of rounds of the Sherali-Adams LP hierarchy, and an even worse integrality gap for the natural SDP relaxation. Finally, we design a simple bicriteria $\tilde O(\sqrt{n})$ approximation for the more general SSVE problem. △ Less

Submitted 23 November, 2016; originally announced November 2016.

Comments: To appear in SODA 2017

arXiv:1611.06605 [pdf, other]

Algorithmic and Hardness Results for the Hub Labeling Problem

Authors: Haris Angelidakis, Yury Makarychev, Vsevolod Oparin

Abstract: There has been significant success in designing highly efficient algorithms for distance and shortest-path queries in recent years; many of the state-of-the-art algorithms use the hub labeling framework. In this paper, we study the approximability of the Hub Labeling problem. We prove a hardness of $Ω(\log n)$ for Hub Labeling, matching known approximation guarantees. The hardness result applies t… ▽ More There has been significant success in designing highly efficient algorithms for distance and shortest-path queries in recent years; many of the state-of-the-art algorithms use the hub labeling framework. In this paper, we study the approximability of the Hub Labeling problem. We prove a hardness of $Ω(\log n)$ for Hub Labeling, matching known approximation guarantees. The hardness result applies to graphs that have multiple shortest paths between some pairs of vertices. No hardness of approximation results were known previously. Then, we focus on graphs that have a unique shortest path between each pair of vertices. This is a very natural family of graphs, and much research on the Hub Labeling problem has studied such graphs. We give an $O(\log D)$ approximation algorithm for graphs of diameter $D$ with unique shortest paths. In particular, we get an $O(\log \log n)$ approximation for graphs of polylogarithmic diameter, while previously known algorithms gave an $O(\log n)$ proximation. Finally, we present a polynomial-time approximation scheme (PTAS) and quasi-polynomial time algorithms for Hub Labeling on trees; additionally, we analyze a simple combinatorial heuristic for Hub Labeling on trees, proposed by Peleg in 2000. We show that this heuristic gives an approximation factor of 2. △ Less

Submitted 20 November, 2016; originally announced November 2016.

Comments: To appear in SODA17

arXiv:1611.05530 [pdf, ps, other]

An Improved Integrality Gap for the Calinescu-Karloff-Rabani Relaxation for Multiway Cut

Authors: Haris Angelidakis, Yury Makarychev, Pasin Manurangsi

Abstract: We construct an improved integrality gap instance for the Calinescu-Karloff-Rabani LP relaxation of the Multiway Cut problem. In particular, for $k \geqslant 3$ terminals, our instance has an integrality ratio of $6 / (5 + \frac{1}{k - 1}) - \varepsilon$, for every constant $\varepsilon > 0$. For every $k \geqslant 4$, our result improves upon a long-standing lower bound of… ▽ More We construct an improved integrality gap instance for the Calinescu-Karloff-Rabani LP relaxation of the Multiway Cut problem. In particular, for $k \geqslant 3$ terminals, our instance has an integrality ratio of $6 / (5 + \frac{1}{k - 1}) - \varepsilon$, for every constant $\varepsilon > 0$. For every $k \geqslant 4$, our result improves upon a long-standing lower bound of $8 / (7 + \frac{1}{k - 1})$ by Freund and Karloff (2000). Due to Manokaran et al.'s result (2008), our integrality gap also implies Unique Games hardness of approximating Multiway Cut of the same ratio. △ Less

Submitted 16 November, 2016; originally announced November 2016.

Comments: 18 pages, 6 figures

arXiv:1607.06442 [pdf, ps, other]

Metric Perturbation Resilience

Authors: Konstantin Makarychev, Yury Makarychev

Abstract: We study the notion of perturbation resilience introduced by Bilu and Linial (2010) and Awasthi, Blum, and Sheffet (2012). A clustering problem is $α$-perturbation resilient if the optimal clustering does not change when we perturb all distances by a factor of at most $α$. We consider a class of clustering problems with center-based objectives, which includes such problems as k-means, k-median, an… ▽ More We study the notion of perturbation resilience introduced by Bilu and Linial (2010) and Awasthi, Blum, and Sheffet (2012). A clustering problem is $α$-perturbation resilient if the optimal clustering does not change when we perturb all distances by a factor of at most $α$. We consider a class of clustering problems with center-based objectives, which includes such problems as k-means, k-median, and k-center, and give an exact algorithm for clustering 2-perturbation resilient instances. Our result improves upon the result of Balcan and Liang (2016), who gave an algorithm for clustering $1+\sqrt{2}\approx 2.41$ perturbation resilient instances. Our result is tight in the sense that no polynomial-time algorithm can solve $(2-\varepsilon)$-perturbation resilient instances unless NP = RP, as was shown by Balcan, Haghtalab, and White (2016). We show that the algorithm works on instances satisfying a slightly weaker and more natural condition than perturbation resilience, which we call metric perturbation resilience. △ Less

Submitted 21 July, 2016; originally announced July 2016.

arXiv:1607.04787 [pdf, ps, other]

Robust algorithms with polynomial loss for near-unanimity CSPs

Authors: Víctor Dalmau, Marcin Kozik, Andrei Krokhin, Konstantin Makarychev, Yury Makarychev, Jakub Opršal

Abstract: An instance of the Constraint Satisfaction Problem (CSP) is given by a family of constraints on overlap** sets of variables, and the goal is to assign values from a fixed domain to the variables so that all constraints are satisfied. In the optimization version, the goal is to maximize the number of satisfied constraints. An approximation algorithm for CSP is called robust if it outputs an assig… ▽ More An instance of the Constraint Satisfaction Problem (CSP) is given by a family of constraints on overlap** sets of variables, and the goal is to assign values from a fixed domain to the variables so that all constraints are satisfied. In the optimization version, the goal is to maximize the number of satisfied constraints. An approximation algorithm for CSP is called robust if it outputs an assignment satisfying a $(1-g(\varepsilon))$-fraction of constraints on any $(1-\varepsilon)$-satisfiable instance, where the loss function $g$ is such that $g(\varepsilon)\rightarrow 0$ as $\varepsilon\rightarrow 0$. We study how the robust approximability of CSPs depends on the set of constraint relations allowed in instances, the so-called constraint language. All constraint languages admitting a robust polynomial-time algorithm (with some $g$) have been characterised by Barto and Kozik, with the general bound on the loss $g$ being doubly exponential, specifically $g(\varepsilon)=O((\log\log(1/\varepsilon))/\log(1/\varepsilon))$. It is natural to ask when a better loss can be achieved: in particular, polynomial loss $g(\varepsilon)=O(\varepsilon^{1/k})$ for some constant $k$. In this paper, we consider CSPs with a constraint language having a near-unanimity polymorphism. We give two randomized robust algorithms with polynomial loss for such CSPs: one works for any near-unanimity polymorphism and the parameter $k$ in the loss depends on the size of the domain and the arity of the relations in $Γ$, while the other works for a special ternary near-unanimity operation called dual discriminator with $k=2$ for any domain size. In the latter case, the CSP is a common generalisation of Unique Games with a fixed domain and 2-SAT. In the former case, we use the algebraic approach to the CSP. Both cases use the standard semidefinite programming relaxation for CSP. △ Less

Submitted 4 December, 2018; v1 submitted 16 July, 2016; originally announced July 2016.

Comments: A preliminary version of this paper appeared in SODA 2017. Journal referees' comments are incorporated

arXiv:1602.08426 [pdf, other]

doi 10.19086/da.876

A Union of Euclidean Metric Spaces is Euclidean

Authors: Konstantin Makarychev, Yury Makarychev

Abstract: Suppose that a metric space $X$ is the union of two metric subspaces $A$ and $B$ that embed into Euclidean space with distortions $D_A$ and $D_B$, respectively. We prove that then $X$ embeds into Euclidean space with a bounded distortion (namely, with distortion at most $7D_A D_B + 2(D_A+D_B)$). Our result settles an open problem posed by Naor. Additionally, we present some corollaries and extensi… ▽ More Suppose that a metric space $X$ is the union of two metric subspaces $A$ and $B$ that embed into Euclidean space with distortions $D_A$ and $D_B$, respectively. We prove that then $X$ embeds into Euclidean space with a bounded distortion (namely, with distortion at most $7D_A D_B + 2(D_A+D_B)$). Our result settles an open problem posed by Naor. Additionally, we present some corollaries and extensions of this result. In particular, we introduce and study a new concept of an "external bi-Lipschitz extension". In the end of the paper, we list a few related open problems. △ Less

Submitted 23 January, 2017; v1 submitted 26 February, 2016; originally announced February 2016.

Comments: Reformatted for Discrete Analysis, updated metadata, and edited the title. This version is otherwise identical to the previous one

Journal ref: A Union of Euclidean Metric Spaces is Euclidean, Discrete Analysis 2016:14

arXiv:1511.03229 [pdf, ps, other]

Learning Communities in the Presence of Errors

Authors: Konstantin Makarychev, Yury Makarychev, Aravindan Vijayaraghavan

Abstract: We study the problem of learning communities in the presence of modeling errors and give robust recovery algorithms for the Stochastic Block Model (SBM). This model, which is also known as the Planted Partition Model, is widely used for community detection and graph partitioning in various fields, including machine learning, statistics, and social sciences. Many algorithms exist for learning commu… ▽ More We study the problem of learning communities in the presence of modeling errors and give robust recovery algorithms for the Stochastic Block Model (SBM). This model, which is also known as the Planted Partition Model, is widely used for community detection and graph partitioning in various fields, including machine learning, statistics, and social sciences. Many algorithms exist for learning communities in the Stochastic Block Model, but they do not work well in the presence of errors. In this paper, we initiate the study of robust algorithms for partial recovery in SBM with modeling errors or noise. We consider graphs generated according to the Stochastic Block Model and then modified by an adversary. We allow two types of adversarial errors, Feige---Kilian or monotone errors, and edge outlier errors. Mossel, Neeman and Sly (STOC 2015) posed an open question about whether an almost exact recovery is possible when the adversary is allowed to add $o(n)$ edges. Our work answers this question affirmatively even in the case of $k>2$ communities. We then show that our algorithms work not only when the instances come from SBM, but also work when the instances come from any distribution of graphs that is $εm$ close to SBM in the Kullback---Leibler divergence. This result also works in the presence of adversarial errors. Finally, we present almost tight lower bounds for two communities. △ Less

Submitted 24 June, 2016; v1 submitted 10 November, 2015; originally announced November 2015.

Comments: 34 pages. Appearing in the Conference on Learning Theory (COLT)'16

arXiv:1507.04227 [pdf, ps, other]

A bi-criteria approximation algorithm for $k$ Means

Authors: Konstantin Makarychev, Yury Makarychev, Maxim Sviridenko, Justin Ward

Abstract: We consider the classical $k$-means clustering problem in the setting bi-criteria approximation, in which an algoithm is allowed to output $βk > k$ clusters, and must produce a clustering with cost at most $α$ times the to the cost of the optimal set of $k$ clusters. We argue that this approach is natural in many settings, for which the exact number of clusters is a priori unknown, or unimportant… ▽ More We consider the classical $k$-means clustering problem in the setting bi-criteria approximation, in which an algoithm is allowed to output $βk > k$ clusters, and must produce a clustering with cost at most $α$ times the to the cost of the optimal set of $k$ clusters. We argue that this approach is natural in many settings, for which the exact number of clusters is a priori unknown, or unimportant up to a constant factor. We give new bi-criteria approximation algorithms, based on linear programming and local search, respectively, which attain a guarantee $α(β)$ depending on the number $βk$ of clusters that may be opened. Our gurantee $α(β)$ is always at most $9 + ε$ and improves rapidly with $β$ (for example: $α(2)<2.59$, and $α(3) < 1.4$). Moreover, our algorithms have only polynomial dependence on the dimension of the input data, and so are applicable in high-dimensional settings. △ Less

Submitted 3 August, 2015; v1 submitted 15 July, 2015; originally announced July 2015.

arXiv:1503.03851 [pdf, ps, other]

Satisfiability of Ordering CSPs Above Average

Authors: Konstantin Makarychev, Yury Makarychev, Yuan Zhou

Abstract: We study the satisfiability of ordering constraint satisfaction problems (CSPs) above average. We prove the conjecture of Gutin, van Iersel, Mnich, and Yeo that the satisfiability above average of ordering CSPs of arity $k$ is fixed-parameter tractable for every $k$. Previously, this was only known for $k=2$ and $k=3$. We also generalize this result to more general classes of CSPs, including CSPs… ▽ More We study the satisfiability of ordering constraint satisfaction problems (CSPs) above average. We prove the conjecture of Gutin, van Iersel, Mnich, and Yeo that the satisfiability above average of ordering CSPs of arity $k$ is fixed-parameter tractable for every $k$. Previously, this was only known for $k=2$ and $k=3$. We also generalize this result to more general classes of CSPs, including CSPs with predicates defined by linear inequalities. To obtain our results, we prove a new Bonami-type inequality for the Efron-Stein decomposition. The inequality applies to functions defined on arbitrary product probability spaces. In contrast to other variants of the Bonami Inequality, it does not depend on the mass of the smallest atom in the probability space. We believe that this inequality is of independent interest. △ Less

Submitted 30 October, 2015; v1 submitted 12 March, 2015; originally announced March 2015.

arXiv:1406.5667 [pdf, ps, other]

Correlation Clustering with Noisy Partial Information

Authors: Konstantin Makarychev, Yury Makarychev, Aravindan Vijayaraghavan

Abstract: In this paper, we propose and study a semi-random model for the Correlation Clustering problem on arbitrary graphs G. We give two approximation algorithms for Correlation Clustering instances from this model. The first algorithm finds a solution of value $(1+ δ) optcost + O_δ(n\log^3 n)$ with high probability, where $optcost$ is the value of the optimal solution (for every $δ> 0$). The second algo… ▽ More In this paper, we propose and study a semi-random model for the Correlation Clustering problem on arbitrary graphs G. We give two approximation algorithms for Correlation Clustering instances from this model. The first algorithm finds a solution of value $(1+ δ) optcost + O_δ(n\log^3 n)$ with high probability, where $optcost$ is the value of the optimal solution (for every $δ> 0$). The second algorithm finds the ground truth clustering with an arbitrarily small classification error $η$ (under some additional assumptions on the instance). △ Less

Submitted 12 May, 2015; v1 submitted 21 June, 2014; originally announced June 2014.

Comments: To appear at Conference on Learning Theory (COLT) 2015. Substantial changes from previous version, including a new section on recovery of the ground truth clustering. 20 pages

arXiv:1406.5665 [pdf, ps, other]

Constant Factor Approximation for Balanced Cut in the PIE model

Authors: Konstantin Makarychev, Yury Makarychev, Aravindan Vijayaraghavan

Abstract: We propose and study a new semi-random semi-adversarial model for Balanced Cut, a planted model with permutation-invariant random edges (PIE). Our model is much more general than planted models considered previously. Consider a set of vertices V partitioned into two clusters $L$ and $R$ of equal size. Let $G$ be an arbitrary graph on $V$ with no edges between $L$ and $R$. Let $E_{random}$ be a set… ▽ More We propose and study a new semi-random semi-adversarial model for Balanced Cut, a planted model with permutation-invariant random edges (PIE). Our model is much more general than planted models considered previously. Consider a set of vertices V partitioned into two clusters $L$ and $R$ of equal size. Let $G$ be an arbitrary graph on $V$ with no edges between $L$ and $R$. Let $E_{random}$ be a set of edges sampled from an arbitrary permutation-invariant distribution (a distribution that is invariant under permutation of vertices in $L$ and in $R$). Then we say that $G + E_{random}$ is a graph with permutation-invariant random edges. We present an approximation algorithm for the Balanced Cut problem that finds a balanced cut of cost $O(|E_{random}|) + n \text{polylog}(n)$ in this model. In the regime when $|E_{random}| = Ω(n \text{polylog}(n))$, this is a constant factor approximation with respect to the cost of the planted cut. △ Less

Submitted 21 June, 2014; originally announced June 2014.

Comments: Full version of the paper at the 46th ACM Symposium on the Theory of Computing (STOC 2014). 32 pages

arXiv:1405.3167 [pdf, ps, other]

Clustering, Hamming Embedding, Generalized LSH and the Max Norm

Authors: Behnam Neyshabur, Yury Makarychev, Nathan Srebro

Abstract: We study the convex relaxation of clustering and hamming embedding, focusing on the asymmetric case (co-clustering and asymmetric hamming embedding), understanding their relationship to LSH as studied by (Charikar 2002) and to the max-norm ball, and the differences between their symmetric and asymmetric versions. We study the convex relaxation of clustering and hamming embedding, focusing on the asymmetric case (co-clustering and asymmetric hamming embedding), understanding their relationship to LSH as studied by (Charikar 2002) and to the max-norm ball, and the differences between their symmetric and asymmetric versions. △ Less

Submitted 13 May, 2014; originally announced May 2014.

Comments: 17 pages

arXiv:1404.4575 [pdf, other]

Approximation Algorithms for Hypergraph Small Set Expansion and Small Set Vertex Expansion

Authors: Anand Louis, Yury Makarychev

Abstract: The expansion of a hypergraph, a natural extension of the notion of expansion in graphs, is defined as the minimum over all cuts in the hypergraph of the ratio of the number of the hyperedges cut to the size of the smaller side of the cut. We study the Hypergraph Small Set Expansion problem, which, for a parameter $δ\in (0,1/2]$, asks to compute the cut having the least expansion while having at m… ▽ More The expansion of a hypergraph, a natural extension of the notion of expansion in graphs, is defined as the minimum over all cuts in the hypergraph of the ratio of the number of the hyperedges cut to the size of the smaller side of the cut. We study the Hypergraph Small Set Expansion problem, which, for a parameter $δ\in (0,1/2]$, asks to compute the cut having the least expansion while having at most $δ$ fraction of the vertices on the smaller side of the cut. We present two algorithms. Our first algorithm gives an $\tilde O(δ^{-1} \sqrt{\log n})$ approximation. The second algorithm finds a set with expansion $\tilde O(δ^{-1}(\sqrt{d_{\text{max}}r^{-1}\log r\, φ^*} + φ^*))$ in a $r$--uniform hypergraph with maximum degree $d_{\text{max}}$ (where $φ^*$ is the expansion of the optimal solution). Using these results, we also obtain algorithms for the Small Set Vertex Expansion problem: we get an $\tilde O(δ^{-1} \sqrt{\log n})$ approximation algorithm and an algorithm that finds a set with vertex expansion $O\left(δ^{-1}\sqrt{φ^V \log d_{\text{max}} } + δ^{-1} φ^V\right)$ (where $φ^V$ is the vertex expansion of the optimal solution). For $δ=1/2$, Hypergraph Small Set Expansion is equivalent to the hypergraph expansion problem. In this case, our approximation factor of $O(\sqrt{\log n})$ for expansion in hypergraphs matches the corresponding approximation factor for expansion in graphs due to ARV. △ Less

Submitted 17 April, 2014; originally announced April 2014.

arXiv:1401.0699 [pdf, ps, other]

Nonuniform Graph Partitioning with Unrelated Weights

Authors: Konstantin Makarychev, Yury Makarychev

Abstract: We give a bi-criteria approximation algorithm for the Minimum Nonuniform Partitioning problem, recently introduced by Krauthgamer, Naor, Schwartz and Talwar (2014). In this problem, we are given a graph $G=(V,E)$ on $n$ vertices and $k$ numbers $ρ_1,\dots, ρ_k$. The goal is to partition the graph into $k$ disjoint sets $P_1,\dots, P_k$ satisfying $|P_i|\leq ρ_i n$ so as to minimize the number of e… ▽ More We give a bi-criteria approximation algorithm for the Minimum Nonuniform Partitioning problem, recently introduced by Krauthgamer, Naor, Schwartz and Talwar (2014). In this problem, we are given a graph $G=(V,E)$ on $n$ vertices and $k$ numbers $ρ_1,\dots, ρ_k$. The goal is to partition the graph into $k$ disjoint sets $P_1,\dots, P_k$ satisfying $|P_i|\leq ρ_i n$ so as to minimize the number of edges cut by the partition. Our algorithm has an approximation ratio of $O(\sqrt{\log n \log k})$ for general graphs, and an $O(1)$ approximation for graphs with excluded minors. This is an improvement upon the $O(\log n)$ algorithm of Krauthgamer, Naor, Schwartz and Talwar (2014). Our approximation ratio matches the best known ratio for the Minimum (Uniform) $k$-Partitioning problem. We extend our results to the case of "unrelated weights" and to the case of "unrelated $d$-dimensional weights". In the former case, different vertices may have different weights and the weight of a vertex may depend on the set $P_i$ the vertex is assigned to. In the latter case, each vertex $u$ has a $d$-dimensional weight $r(u,i) = (r_1(u,i), \dots, r_d(u,i))$ if $u$ is assigned to $P_i$. Each set $P_i$ has a $d$-dimensional capacity $c(i) = (c_1(i),\dots, c_d(i))$. The goal is to find a partition such that $\sum_{u\in {P_i}} r(u,i) \leq c(i)$ coordinate-wise. △ Less

Submitted 28 April, 2014; v1 submitted 3 January, 2014; originally announced January 2014.

arXiv:1311.7662 [pdf, other]

The Power of Asymmetry in Binary Hashing

Authors: Behnam Neyshabur, Payman Yadollahpour, Yury Makarychev, Ruslan Salakhutdinov, Nathan Srebro

Abstract: When approximating binary similarity using the hamming distance between short binary hashes, we show that even if the similarity is symmetric, we can have shorter and more accurate hashes by using two distinct code maps. I.e. by approximating the similarity between $x$ and $x'$ as the hamming distance between $f(x)$ and $g(x')$, for two distinct binary codes $f,g$, rather than as the hamming dista… ▽ More When approximating binary similarity using the hamming distance between short binary hashes, we show that even if the similarity is symmetric, we can have shorter and more accurate hashes by using two distinct code maps. I.e. by approximating the similarity between $x$ and $x'$ as the hamming distance between $f(x)$ and $g(x')$, for two distinct binary codes $f,g$, rather than as the hamming distance between $f(x)$ and $f(x')$. △ Less

Submitted 29 November, 2013; originally announced November 2013.

Comments: Accepted to NIPS 2013, 9 pages, 5 figures

arXiv:1305.1681 [pdf, ps, other]

Bilu-Linial Stable Instances of Max Cut and Minimum Multiway Cut

Authors: Konstantin Makarychev, Yury Makarychev, Aravindan Vijayaraghavan

Abstract: We investigate the notion of stability proposed by Bilu and Linial. We obtain an exact polynomial-time algorithm for $γ$-stable Max Cut instances with $γ\geq c\sqrt{\log n}\log\log n$ for some absolute constant $c > 0$. Our algorithm is robust: it never returns an incorrect answer; if the instance is $γ$-stable, it finds the maximum cut, otherwise, it either finds the maximum cut or certifies that… ▽ More We investigate the notion of stability proposed by Bilu and Linial. We obtain an exact polynomial-time algorithm for $γ$-stable Max Cut instances with $γ\geq c\sqrt{\log n}\log\log n$ for some absolute constant $c > 0$. Our algorithm is robust: it never returns an incorrect answer; if the instance is $γ$-stable, it finds the maximum cut, otherwise, it either finds the maximum cut or certifies that the instance is not $γ$-stable. We prove that there is no robust polynomial-time algorithm for $γ$-stable instances of Max Cut when $γ< α_{SC}(n/2)$, where $α_{SC}$ is the best approximation factor for Sparsest Cut with non-uniform demands. Our algorithm is based on semidefinite programming. We show that the standard SDP relaxation for Max Cut (with $\ell_2^2$ triangle inequalities) is integral if $γ\geq D_{\ell_2^2\to \ell_1}(n)$, where $D_{\ell_2^2\to \ell_1}(n)$ is the least distortion with which every $n$ point metric space of negative type embeds into $\ell_1$. On the negative side, we show that the SDP relaxation is not integral when $γ< D_{\ell_2^2\to \ell_1}(n/2)$. Moreover, there is no tractable convex relaxation for $γ$-stable instances of Max Cut when $γ< α_{SC}(n/2)$. That suggests that solving $γ$-stable instances with $γ=o(\sqrt{\log n})$ might be difficult or impossible. Our results significantly improve previously known results. The best previously known algorithm for $γ$-stable instances of Max Cut required that $γ\geq c\sqrt{n}$ (for some $c > 0$) [Bilu, Daniely, Linial, and Saks]. No hardness results were known for the problem. Additionally, we present an algorithm for 4-stable instances of Minimum Multiway Cut. We also study a relaxed notion of weak stability. △ Less

Submitted 11 November, 2013; v1 submitted 7 May, 2013; originally announced May 2013.

Comments: 24 pages

arXiv:1206.4898 [pdf, ps, other]

Planarizing an Unknown Surface

Authors: Yury Makarychev, Anastasios Sidiropoulos

Abstract: It has been recently shown that any graph of genus g>0 can be stochastically embedded into a distribution over planar graphs, with distortion Olog (g+1)) [Sidiropoulos, FOCS 2010]. This embedding can be computed in polynomial time, provided that a drawing of the input graph into a genus-g surface is given. We show how to compute the above embedding without having such a drawing. This implies a g… ▽ More It has been recently shown that any graph of genus g>0 can be stochastically embedded into a distribution over planar graphs, with distortion Olog (g+1)) [Sidiropoulos, FOCS 2010]. This embedding can be computed in polynomial time, provided that a drawing of the input graph into a genus-g surface is given. We show how to compute the above embedding without having such a drawing. This implies a general reduction for solving problems on graphs of small genus, even when the drawing into a small genus surface is unknown. To the best of our knowledge, this is the first result of this type. △ Less

Submitted 21 June, 2012; originally announced June 2012.

Comments: The conference version of this paper will appear in the Proceedings of APPROX 2012

arXiv:1206.3603 [pdf, other]

Approximation Algorithm for Non-Boolean MAX k-CSP

Authors: Konstantin Makarychev, Yury Makarychev

Abstract: In this paper, we present a randomized polynomial-time approximation algorithm for k-CSPd. In k-CSPd, we are given a set of predicates of arity k over an alphabet of size d. Our goal is to find an assignment that maximizes the number of satisfied constraints. Our algorithm has approximation factor Omega(kd/d^k) (when k > Ω(log d)). This bound is asymptotically optimal assuming the Unique Games C… ▽ More In this paper, we present a randomized polynomial-time approximation algorithm for k-CSPd. In k-CSPd, we are given a set of predicates of arity k over an alphabet of size d. Our goal is to find an assignment that maximizes the number of satisfied constraints. Our algorithm has approximation factor Omega(kd/d^k) (when k > Ω(log d)). This bound is asymptotically optimal assuming the Unique Games Conjecture. The best previously known algorithm has approximation factor Omega(k log d/d^k). We also give an approximation algorithm for the boolean MAX k-CSP2 problem with a slightly improved approximation guarantee. △ Less

Submitted 15 June, 2012; originally announced June 2012.

Comments: The conference version of this paper will appear in the Proceedings of APPROX 2012

arXiv:1205.2234 [pdf, ps, other]

Approximation Algorithms for Semi-random Graph Partitioning Problems

Authors: Konstantin Makarychev, Yury Makarychev, Aravindan Vijayaraghavan

Abstract: In this paper, we propose and study a new semi-random model for graph partitioning problems. We believe that it captures many properties of real--world instances. The model is more flexible than the semi-random model of Feige and Kilian and planted random model of Bui, Chaudhuri, Leighton and Sipser. We develop a general framework for solving semi-random instances and apply it to several problem… ▽ More In this paper, we propose and study a new semi-random model for graph partitioning problems. We believe that it captures many properties of real--world instances. The model is more flexible than the semi-random model of Feige and Kilian and planted random model of Bui, Chaudhuri, Leighton and Sipser. We develop a general framework for solving semi-random instances and apply it to several problems of interest. We present constant factor bi-criteria approximation algorithms for semi-random instances of the Balanced Cut, Multicut, Min Uncut, Sparsest Cut and Small Set Expansion problems. We also show how to almost recover the optimal solution if the instance satisfies an additional expanding condition. Our algorithms work in a wider range of parameters than most algorithms for previously studied random and semi-random models. Additionally, we study a new planted algebraic expander model and develop constant factor bi-criteria approximation algorithms for graph partitioning problems in this model. △ Less

Submitted 10 May, 2012; originally announced May 2012.

Comments: To appear at the 44th ACM Symposium on Theory of Computing (STOC 2012)

arXiv:1112.3611 [pdf, other]

Approximation Algorithms and Hardness of the k-Route Cut Problem

Authors: Julia Chuzhoy, Yury Makarychev, Aravindan Vijayaraghavan, Yuan Zhou

Abstract: We study the k-route cut problem: given an undirected edge-weighted graph G=(V,E), a collection {(s_1,t_1),(s_2,t_2),...,(s_r,t_r)} of source-sink pairs, and an integer connectivity requirement k, the goal is to find a minimum-weight subset E' of edges to remove, such that the connectivity of every pair (s_i, t_i) falls below k. Specifically, in the edge-connectivity version, EC-kRC, the requireme… ▽ More We study the k-route cut problem: given an undirected edge-weighted graph G=(V,E), a collection {(s_1,t_1),(s_2,t_2),...,(s_r,t_r)} of source-sink pairs, and an integer connectivity requirement k, the goal is to find a minimum-weight subset E' of edges to remove, such that the connectivity of every pair (s_i, t_i) falls below k. Specifically, in the edge-connectivity version, EC-kRC, the requirement is that there are at most (k-1) edge-disjoint paths connecting s_i to t_i in G \ E', while in the vertex-connectivity version, NC-kRC, the same requirement is for vertex-disjoint paths. Prior to our work, poly-logarithmic approximation algorithms have been known for the special case where k >= 3, but no non-trivial approximation algorithms were known for any value k>3, except in the single-source setting. We show an O(k log^{3/2}r)-approximation algorithm for EC-kRC with uniform edge weights, and several polylogarithmic bi-criteria approximation algorithms for EC-kRC and NC-kRC, where the connectivity requirement k is violated by a constant factor. We complement these upper bounds by proving that NC-kRC is hard to approximate to within a factor of k^{eps} for some fixed eps>0. We then turn to study a simpler version of NC-kRC, where only one source-sink pair is present. We give a simple bi-criteria approximation algorithm for this case, and show evidence that even this restricted version of the problem may be hard to approximate. For example, we prove that the single source-sink pair version of NC-kRC has no constant-factor approximation, assuming Feige's Random k-AND assumption. △ Less

Submitted 15 December, 2011; v1 submitted 15 December, 2011; originally announced December 2011.

Comments: To appear in the Symposium on Discrete Algorithms (SODA) 2012. 44 pages

arXiv:1104.3806 [pdf, ps, other]

How to Play Unique Games against a Semi-Random Adversary

Authors: Alexandra Kolla, Konstantin Makarychev, Yury Makarychev

Abstract: In this paper, we study the average case complexity of the Unique Games problem. We propose a natural semi-random model, in which a unique game instance is generated in several steps. First an adversary selects a completely satisfiable instance of Unique Games, then she chooses an epsilon-fraction of all edges, and finally replaces ("corrupts") the constraints corresponding to these edges with new… ▽ More In this paper, we study the average case complexity of the Unique Games problem. We propose a natural semi-random model, in which a unique game instance is generated in several steps. First an adversary selects a completely satisfiable instance of Unique Games, then she chooses an epsilon-fraction of all edges, and finally replaces ("corrupts") the constraints corresponding to these edges with new constraints. If all steps are adversarial, the adversary can obtain any (1-epsilon) satisfiable instance, so then the problem is as hard as in the worst case. In our semi-random model, one of the steps is random, and all other steps are adversarial. We show that known algorithms for unique games (in particular, all algorithms that use the standard SDP relaxation) fail to solve semi-random instances of Unique Games. We present an algorithm that with high probability finds a solution satisfying a (1-delta) fraction of all constraints in semi-random instances (we require that the average degree of the graph is Omega(log k). To this end, we consider a new non-standard SDP program for Unique Games, which is not a relaxation for the problem, and show how to analyze it. We present a new rounding scheme that simultaneously uses SDP and LP solutions, which we believe is of independent interest. Our result holds only for epsilon less than some absolute constant. We prove that if epsilon > 1/2, then the problem is hard in one of the models, the result assumes the 2-to-2 conjecture. Finally, we study semi-random instances of Unique Games that are at most (1-epsilon) satisfiable. We present an algorithm that with high probability, distinguishes between the case when the instance is a semi-random instance and the case when the instance is an (arbitrary) (1-delta) satisfiable instance if epsilon > c delta. △ Less

Submitted 19 April, 2011; originally announced April 2011.

arXiv:1103.6161 [pdf, other]

The Grothendieck constant is strictly smaller than Krivine's bound

Authors: Mark Braverman, Konstantin Makarychev, Yury Makarychev, Assaf Naor

Abstract: We prove that $K_G<\fracπ{2\log(1+\sqrt{2})}$, where $K_G$ is the Grothendieck constant. We prove that $K_G<\fracπ{2\log(1+\sqrt{2})}$, where $K_G$ is the Grothendieck constant. △ Less

Submitted 17 August, 2011; v1 submitted 31 March, 2011; originally announced March 2011.

Comments: An extended abstract describing the contents of this work will appear in FOCS 2011. Suggestions of the FOCS reviewers have been addressed

arXiv:1010.3976 [pdf, other]

On Graph Crossing Number and Edge Planarization

Authors: Julia Chuzhoy, Yury Makarychev, Anastasios Sidiropoulos

Abstract: Given an n-vertex graph G, a drawing of G in the plane is a map** of its vertices into points of the plane, and its edges into continuous curves, connecting the images of their endpoints. A crossing in such a drawing is a point where two such curves intersect. In the Minimum Crossing Number problem, the goal is to find a drawing of G with minimum number of crossings. The value of the optimal sol… ▽ More Given an n-vertex graph G, a drawing of G in the plane is a map** of its vertices into points of the plane, and its edges into continuous curves, connecting the images of their endpoints. A crossing in such a drawing is a point where two such curves intersect. In the Minimum Crossing Number problem, the goal is to find a drawing of G with minimum number of crossings. The value of the optimal solution, denoted by OPT, is called the graph's crossing number. This is a very basic problem in topological graph theory, that has received a significant amount of attention, but is still poorly understood algorithmically. The best currently known efficient algorithm produces drawings with $O(\log^2 n)(n + OPT)$ crossings on bounded-degree graphs, while only a constant factor hardness of approximation is known. A closely related problem is Minimum Edge Planarization, in which the goal is to remove a minimum-cardinality subset of edges from G, such that the remaining graph is planar. Our main technical result establishes the following connection between the two problems: if we are given a solution of cost k to the Minimum Edge Planarization problem on graph G, then we can efficiently find a drawing of G with at most $\poly(d)\cdot k\cdot (k+OPT)$ crossings, where $d$ is the maximum degree in G. This result implies an $O(n\cdot \poly(d)\cdot \log^{3/2}n)$-approximation for Minimum Crossing Number, as well as improved algorithms for special cases of the problem, such as, for example, k-apex and bounded-genus graphs. △ Less

Submitted 19 October, 2010; originally announced October 2010.

arXiv:1006.4607 [pdf, ps, other]

Metric Extension Operators, Vertex Sparsifiers and Lipschitz Extendability

Authors: Konstantin Makarychev, Yury Makarychev

Abstract: We study vertex cut and flow sparsifiers that were recently introduced by Moitra, and Leighton and Moitra. We improve and generalize their results. We give a new polynomial-time algorithm for constructing O(log k / log log k) cut and flow sparsifiers, matching the best existential upper bound on the quality of a sparsifier, and improving the previous algorithmic upper bound of O(log^2 k / log log… ▽ More We study vertex cut and flow sparsifiers that were recently introduced by Moitra, and Leighton and Moitra. We improve and generalize their results. We give a new polynomial-time algorithm for constructing O(log k / log log k) cut and flow sparsifiers, matching the best existential upper bound on the quality of a sparsifier, and improving the previous algorithmic upper bound of O(log^2 k / log log k). We show that flow sparsifiers can be obtained from linear operators approximating minimum metric extensions. We introduce the notion of (linear) metric extension operators, prove that they exist, and give an exact polynomial-time algorithm for finding optimal operators. We then establish a direct connection between flow and cut sparsifiers and Lipschitz extendability of maps in Banach spaces, a notion studied in functional analysis since 1930s. Using this connection, we prove a lower bound of Omega(sqrt{log k/log log k}) for flow sparsifiers and a lower bound of Omega(sqrt{log k}/log log k) for cut sparsifiers. We show that if a certain open question posed by Ball in 1992 has a positive answer, then there exist \tilde O(sqrt{log k}) cut sparsifiers. On the other hand, any lower bound on cut sparsifiers better than \tilde Omega(sqrt{log k}) would imply a negative answer to this question. △ Less

Submitted 8 December, 2010; v1 submitted 23 June, 2010; originally announced June 2010.

Comments: Appeared at FOCS 2010

arXiv:0912.1623 [pdf, ps, other]

Subgraph Sparsification and Nearly Optimal Ultrasparsifiers

Authors: Alexandra Kolla, Yury Makarychev, Amin Saberi, Shanghua Teng

Abstract: We consider a variation of the spectral sparsification problem where we are required to keep a subgraph of the original graph. Formally, given a union of two weighted graphs $G$ and $W$ and an integer $k$, we are asked to find a $k$-edge weighted graph $W_k$ such that $G+W_k$ is a good spectral sparsifer of $G+W$. We will refer to this problem as the subgraph (spectral) sparsification. We presen… ▽ More We consider a variation of the spectral sparsification problem where we are required to keep a subgraph of the original graph. Formally, given a union of two weighted graphs $G$ and $W$ and an integer $k$, we are asked to find a $k$-edge weighted graph $W_k$ such that $G+W_k$ is a good spectral sparsifer of $G+W$. We will refer to this problem as the subgraph (spectral) sparsification. We present a nontrivial condition on $G$ and $W$ such that a good sparsifier exists and give a polynomial time algorithm to find the sparsifer. %$O(\frac{n}{k})\log n \tilde{O}(\log \log n)$ As a significant application of our technique, we show that for each positive integer $k$, every $n$-vertex weighted graph has an $(n-1+k)$-edge spectral sparsifier with relative condition number at most $\frac{n}{k} \log n \tilde{O}(\log\log n)$ where $\tilde{O}()$ hides lower order terms. Our bound is within a factor of $\tilde{O}(\log \log n)$ from optimal. This nearly settles a question left open by Spielman and Teng about ultrasparsifiers, which is a key component in their nearly linear-time algorithms for solving diagonally dominant symmetric linear systems. We also present another application of our technique to spectral optimization in which the goal is to maximize the algebraic connectivity of a graph (e.g. turn it into an expander) with a limited number of edges. △ Less

Submitted 8 December, 2009; originally announced December 2009.

arXiv:0903.0367 [pdf, ps, other]

How to Play Unique Games on Expanders

Authors: Konstantin Makarychev, Yury Makarychev

Abstract: In this note we improve a recent result by Arora, Khot, Kolla, Steurer, Tulsiani, and Vishnoi on solving the Unique Games problem on expanders. Given a $(1-\varepsilon)$-satisfiable instance of Unique Games with the constraint graph $G$, our algorithm finds an assignment satisfying at least a $1- C \varepsilon/h_G$ fraction of all constraints if $\varepsilon < c λ_G$ where $h_G$ is the edge ex… ▽ More In this note we improve a recent result by Arora, Khot, Kolla, Steurer, Tulsiani, and Vishnoi on solving the Unique Games problem on expanders. Given a $(1-\varepsilon)$-satisfiable instance of Unique Games with the constraint graph $G$, our algorithm finds an assignment satisfying at least a $1- C \varepsilon/h_G$ fraction of all constraints if $\varepsilon < c λ_G$ where $h_G$ is the edge expansion of $G$, $λ_G$ is the second smallest eigenvalue of the Laplacian of $G$, and $C$ and $c$ are some absolute constants. △ Less

Submitted 2 March, 2009; originally announced March 2009.

arXiv:0901.1155 [pdf, ps, other]

doi 10.1214/11-AAP804

Balanced allocation: Memory performance tradeoffs

Authors: Itai Benjamini, Yury Makarychev

Abstract: Suppose we sequentially put $n$ balls into $n$ bins. If we put each ball into a random bin then the heaviest bin will contain ${\sim}\log n/\log\log n$ balls with high probability. However, Azar, Broder, Karlin and Upfal [SIAM J. Comput. 29 (1999) 180--200] showed that if each time we choose two bins at random and put the ball in the least loaded bin among the two, then the heaviest bin will conta… ▽ More Suppose we sequentially put $n$ balls into $n$ bins. If we put each ball into a random bin then the heaviest bin will contain ${\sim}\log n/\log\log n$ balls with high probability. However, Azar, Broder, Karlin and Upfal [SIAM J. Comput. 29 (1999) 180--200] showed that if each time we choose two bins at random and put the ball in the least loaded bin among the two, then the heaviest bin will contain only ${\sim}\log\log n$ balls with high probability. How much memory do we need to implement this scheme? We need roughly $\log\log\log n$ bits per bin, and $n\log\log\log n$ bits in total. Let us assume now that we have limited amount of memory. For each ball, we are given two random bins and we have to put the ball into one of them. Our goal is to minimize the load of the heaviest bin. We prove that if we have $n^{1-δ}$ bits then the heaviest bin will contain at least $Ω(δ\log n/\log\log n)$ balls with high probability. The bound is tight in the communication complexity model. △ Less

Submitted 12 September, 2012; v1 submitted 8 January, 2009; originally announced January 2009.

Comments: Published in at http://dx.doi.org/10.1214/11-AAP804 the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AAP-AAP804

Journal ref: Annals of Applied Probability 2012, Vol. 22, No. 4, 1642-1649

arXiv:0806.1745 [pdf, ps, other]

Eigenvalue multiplicity and volume growth

Authors: James R. Lee, Yury Makarychev

Abstract: Let $G$ be a finite group with symmetric generating set $S$, and let $c = \max_{R > 0} |B(2R)|/|B(R)|$ be the doubling constant of the corresponding Cayley graph, where $B(R)$ denotes an $R$-ball in the word-metric with respect to $S$. We show that the multiplicity of the $k$th eigenvalue of the Laplacian on the Cayley graph of $G$ is bounded by a function of only $c$ and $k$. More specifically,… ▽ More Let $G$ be a finite group with symmetric generating set $S$, and let $c = \max_{R > 0} |B(2R)|/|B(R)|$ be the doubling constant of the corresponding Cayley graph, where $B(R)$ denotes an $R$-ball in the word-metric with respect to $S$. We show that the multiplicity of the $k$th eigenvalue of the Laplacian on the Cayley graph of $G$ is bounded by a function of only $c$ and $k$. More specifically, the multiplicity is at most $\exp((\log c)(\log c + \log k))$. Similarly, if $X$ is a compact, $n$-dimensional Riemannian manifold with non-negative Ricci curvature, then the multiplicity of the $k$th eigenvalue of the Laplace-Beltrami operator on $X$ is at most $\exp(n^2 + n log k)$. The first result (for $k=2$) yields the following group-theoretic application. There exists a normal subgroup $N$ of $G$, with $[G : N] \leq α(c)$, and such that $N$ admits a homomorphism onto the cyclic group $Z_M$, where $M \geq |G|^{δ(c)}$ and $α(c), δ(c) > 0$ are explicit functions depending only on $c$. This is the finitary analog of a theorem of Gromov which states that every infinite group of polynomial growth has a subgroup of finite index which admits a homomorphism onto the integers. This addresses a question of Trevisan, and is proved by scaling down Kleiner's proof of Gromov's theorem. In particular, we replace the space of harmonic functions of fixed polynomial growth by the second eigenspace of the Laplacian on the Cayley graph of $G$. △ Less

Submitted 26 March, 2009; v1 submitted 10 June, 2008; originally announced June 2008.

arXiv:0710.1343 [pdf, ps, other]

Dimension Reduction for the Hyperbolic Space

Authors: itai benjamini, Yury Makarychev

Abstract: A dimension reduction for the hyperbolic space is established. When points are far apart an embedding with bounded distortion into the hyperbolic plane is achieved. A dimension reduction for the hyperbolic space is established. When points are far apart an embedding with bounded distortion into the hyperbolic plane is achieved. △ Less

Submitted 6 October, 2007; originally announced October 2007.

arXiv:cs/0510029 [pdf, ps, other]

Conditionally independent random variables

Authors: Konstantin Makarychev, Yury Makarychev

Abstract: In this paper we investigate the notion of conditional independence and prove several information inequalities for conditionally independent random variables. In this paper we investigate the notion of conditional independence and prove several information inequalities for conditionally independent random variables. △ Less

Submitted 11 October, 2005; originally announced October 2005.

Comments: 8 pages

Showing 1–46 of 46 results for author: Makarychev, Y