-
Krivine diffusions attain the Goemans--Williamson approximation ratio
Authors:
Ronen Eldan,
Assaf Naor
Abstract:
Answering a question of Abbasi-Zadeh, Bansal, Guruganesh, Nikolov, Schwartz and Singh (2018), we prove the existence of a slowed-down sticky Brownian motion whose induced rounding for MAXCUT attains the Goemans--Williamson approximation ratio. This is an especially simple particular case of the general rounding framework of Krivine diffusions that we investigate elsewhere.
Answering a question of Abbasi-Zadeh, Bansal, Guruganesh, Nikolov, Schwartz and Singh (2018), we prove the existence of a slowed-down sticky Brownian motion whose induced rounding for MAXCUT attains the Goemans--Williamson approximation ratio. This is an especially simple particular case of the general rounding framework of Krivine diffusions that we investigate elsewhere.
△ Less
Submitted 25 June, 2019;
originally announced June 2019.
-
The Andoni--Krauthgamer--Razenshteyn characterization of sketchable norms fails for sketchable metrics
Authors:
Subhash Khot,
Assaf Naor
Abstract:
Andoni, Krauthgamer and Razenshteyn (AKR) proved (STOC 2015) that a finite-dimensional normed space $(X,\|\cdot\|_X)$ admits a $O(1)$ sketching algorithm (namely, with $O(1)$ sketch size and $O(1)$ approximation) if and only if for every $\varepsilon\in (0,1)$ there exist $α\geqslant 1$ and an embedding $f:X\to \ell_{1-\varepsilon}$ such that…
▽ More
Andoni, Krauthgamer and Razenshteyn (AKR) proved (STOC 2015) that a finite-dimensional normed space $(X,\|\cdot\|_X)$ admits a $O(1)$ sketching algorithm (namely, with $O(1)$ sketch size and $O(1)$ approximation) if and only if for every $\varepsilon\in (0,1)$ there exist $α\geqslant 1$ and an embedding $f:X\to \ell_{1-\varepsilon}$ such that $\|x-y\|_X\leqslant \|f(x)-f(y)\|_{1-\varepsilon}\leqslant α\|x-y\|_X$ for all $x,y\in X$. The "if part" of this theorem follows from a sketching algorithm of Indyk (FOCS 2000). The contribution of AKR is therefore to demonstrate that the mere availability of a sketching algorithm implies the existence of the aforementioned geometric realization. Indyk's algorithm shows that the "if part" of the AKR characterization holds true for any metric space whatsoever, i.e., the existence of an embedding as above implies sketchability even when $X$ is not a normed space. Due to this, a natural question that AKR posed was whether the assumption that the underlying space is a normed space is needed for their characterization of sketchability. We resolve this question by proving that for arbitrarily large $n\in \mathbb{N}$ there is an $n$-point metric space $(M(n),d_{M(n)})$ which is $O(1)$-sketchable yet for every $\varepsilon\in (0,\frac12)$, if $α(n)\geqslant 1$ and $f_n:M(n)\to \ell_{1-\varepsilon}$ are such that $d_{M(n)}(x,y)\leqslant \|f_n(x)-f_n(y)\|_{1-\varepsilon}\leqslant α(n) d_{M(n)}(x,y)$ for all $x,y\in M(n)$, then necessarily $\lim_{n\to \infty} α(n)= \infty$.
△ Less
Submitted 9 October, 2018;
originally announced October 2018.
-
Metric dimension reduction: A snapshot of the Ribe program
Authors:
Assaf Naor
Abstract:
The purpose of this article is to survey some of the context, achievements, challenges and mysteries of the field of metric dimension reduction, including new perspectives on major older results as well as recent advances.
The purpose of this article is to survey some of the context, achievements, challenges and mysteries of the field of metric dimension reduction, including new perspectives on major older results as well as recent advances.
△ Less
Submitted 7 September, 2018;
originally announced September 2018.
-
Impossibility of dimension reduction in the nuclear norm
Authors:
Assaf Naor,
Gilles Pisier,
Gideon Schechtman
Abstract:
Let $\mathsf{S}_1$ (the Schatten--von Neumann trace class) denote the Banach space of all compact linear operators $T:\ell_2\to \ell_2$ whose nuclear norm $\|T\|_{\mathsf{S}_1}=\sum_{j=1}^\inftyσ_j(T)$ is finite, where $\{σ_j(T)\}_{j=1}^\infty$ are the singular values of $T$. We prove that for arbitrarily large $n\in \mathbb{N}$ there exists a subset $\mathcal{C}\subseteq \mathsf{S}_1$ with…
▽ More
Let $\mathsf{S}_1$ (the Schatten--von Neumann trace class) denote the Banach space of all compact linear operators $T:\ell_2\to \ell_2$ whose nuclear norm $\|T\|_{\mathsf{S}_1}=\sum_{j=1}^\inftyσ_j(T)$ is finite, where $\{σ_j(T)\}_{j=1}^\infty$ are the singular values of $T$. We prove that for arbitrarily large $n\in \mathbb{N}$ there exists a subset $\mathcal{C}\subseteq \mathsf{S}_1$ with $|\mathcal{C}|=n$ that cannot be embedded with bi-Lipschitz distortion $O(1)$ into any $n^{o(1)}$-dimensional linear subspace of $\mathsf{S}_1$. $\mathcal{C}$ is not even a $O(1)$-Lipschitz quotient of any subset of any $n^{o(1)}$-dimensional linear subspace of $\mathsf{S}_1$. Thus, $\mathsf{S}_1$ does not admit a dimension reduction result á la Johnson and Lindenstrauss (1984), which complements the work of Harrow, Montanaro and Short (2011) on the limitations of quantum dimension reduction under the assumption that the embedding into low dimensions is a quantum channel. Such a statement was previously known with $\mathsf{S}_1$ replaced by the Banach space $\ell_1$ of absolutely summable sequences via the work of Brinkman and Charikar (2003). In fact, the above set $\mathcal{C}$ can be taken to be the same set as the one that Brinkman and Charikar considered, viewed as a collection of diagonal matrices in $\mathsf{S}_1$. The challenge is to demonstrate that $\mathcal{C}$ cannot be faithfully realized in an arbitrary low-dimensional subspace of $\mathsf{S}_1$, while Brinkman and Charikar obtained such an assertion only for subspaces of $\mathsf{S}_1$ that consist of diagonal operators (i.e., subspaces of $\ell_1$). We establish this by proving that the Markov 2-convexity constant of any finite dimensional linear subspace $X$ of $\mathsf{S}_1$ is at most a universal constant multiple of $\sqrt{\log \mathrm{dim}(X)}$.
△ Less
Submitted 24 October, 2017;
originally announced October 2017.
-
The integrality gap of the Goemans--Linial SDP relaxation for Sparsest Cut is at least a constant multiple of $\sqrt{\log n}$
Authors:
Assaf Naor,
Robert Young
Abstract:
We prove that the integrality gap of the Goemans--Linial semidefinite programming relaxation for the Sparsest Cut Problem is $Ω(\sqrt{\log n})$ on inputs with $n$ vertices.
We prove that the integrality gap of the Goemans--Linial semidefinite programming relaxation for the Sparsest Cut Problem is $Ω(\sqrt{\log n})$ on inputs with $n$ vertices.
△ Less
Submitted 4 April, 2017;
originally announced April 2017.
-
Vertical perimeter versus horizontal perimeter
Authors:
Assaf Naor,
Robert Young
Abstract:
The discrete Heisenberg group $\mathbb{H}_{\mathbb{Z}}^{2k+1}$ is the group generated by $a_1,b_1,\ldots,a_k,b_k,c$, subject to the relations $[a_1,b_1]=\ldots=[a_k,b_k]=c$ and $[a_i,a_j]=[b_i,b_j]=[a_i,b_j]=[a_i,c]=[b_i,c]=1$ for every distinct $i,j\in \{1,\ldots,k\}$. Denote $S=\{a_1^{\pm 1},b_1^{\pm 1},\ldots,a_k^{\pm 1},b_k^{\pm 1}\}$. The horizontal boundary of…
▽ More
The discrete Heisenberg group $\mathbb{H}_{\mathbb{Z}}^{2k+1}$ is the group generated by $a_1,b_1,\ldots,a_k,b_k,c$, subject to the relations $[a_1,b_1]=\ldots=[a_k,b_k]=c$ and $[a_i,a_j]=[b_i,b_j]=[a_i,b_j]=[a_i,c]=[b_i,c]=1$ for every distinct $i,j\in \{1,\ldots,k\}$. Denote $S=\{a_1^{\pm 1},b_1^{\pm 1},\ldots,a_k^{\pm 1},b_k^{\pm 1}\}$. The horizontal boundary of $Ω\subset \mathbb{H}_{\mathbb{Z}}^{2k+1}$, denoted $\partial_{h}Ω$, is the set of all $(x,y)\in Ω\times (\mathbb{H}_{\mathbb{Z}}^{2k+1}\setminus Ω)$ such that $x^{-1}y\in S$. The horizontal perimeter of $Ω$ is $|\partial_{h}Ω|$. For $t\in \mathbb{N}$, define $\partial^t_{v} Ω$ to be the set of all $(x,y)\in Ω\times (\mathbb{H}_{\mathsf{Z}}^{2k+1}\setminus Ω)$ such that $x^{-1}y\in \{c^t,c^{-t}\}$. The vertical perimeter of $Ω$ is defined by $|\partial_{v}Ω|= \sqrt{\sum_{t=1}^\infty |\partial^t_{v}Ω|^2/t^2}$. It is shown here that if $k\ge 2$, then $|\partial_{v}Ω|\lesssim \frac{1}{k} |\partial_{h}Ω|$. The proof of this "vertical versus horizontal isoperimetric inequality" uses a new structural result that decomposes sets of finite perimeter in the Heisenberg group into pieces that admit an "intrinsic corona decomposition." This allows one to deduce an endpoint $W^{1,1}\to L_2(L_1)$ boundedness of a certain singular integral operator from a corresponding lower-dimensional $W^{1,2}\to L_2(L_2)$ boundedness. The above inequality has several applications, including that any embedding into $L_1$ of a ball of radius $n$ in the word metric on $\mathbb{H}_{\mathbb{Z}}^{5}$ incurs bi-Lipschitz distortion that is at least a constant multiple of $\sqrt{\log n}$. It follows that the integrality gap of the Goemans--Linial semidefinite program for the Sparsest Cut Problem on inputs of size $n$ is at least a constant multiple of $\sqrt{\log n}$.
△ Less
Submitted 13 March, 2018; v1 submitted 3 January, 2017;
originally announced January 2017.
-
A spectral gap precludes low-dimensional embeddings
Authors:
Assaf Naor
Abstract:
We prove that there is a universal constant $C>0$ with the following property. Suppose that $n\in \mathbb{N}$ and that $\mathsf{A}=(a_{ij})\in M_n(\mathbb{R})$ is a symmetric stochastic matrix. Denote the second-largest eigenvalue of $\mathsf{A}$ by $λ_2(\mathsf{A})$. Then for $\mathrm{\it any}$ finite-dimensional normed space $(X,\|\cdot\|)$ we have…
▽ More
We prove that there is a universal constant $C>0$ with the following property. Suppose that $n\in \mathbb{N}$ and that $\mathsf{A}=(a_{ij})\in M_n(\mathbb{R})$ is a symmetric stochastic matrix. Denote the second-largest eigenvalue of $\mathsf{A}$ by $λ_2(\mathsf{A})$. Then for $\mathrm{\it any}$ finite-dimensional normed space $(X,\|\cdot\|)$ we have $$ \forall\, x_1,\ldots,x_n\in X,\qquad \mathrm{dim}(X)\ge \frac12 \exp\left(C\frac{1-λ_2(\mathsf{A})}{\sqrt{n}}\bigg(\frac{\sum_{i=1}^n\sum_{j=1}^n\|x_i-x_j\|^2}{\sum_{i=1}^n\sum_{j=1}^na_{ij}\|x_i-x_j\|^2}\bigg)^{\frac12}\right). $$ This implies that if an $n$-vertex $O(1)$-expander embeds with average distortion $D\ge 1$ into $X$, then necessarily $\mathrm{dim}(X)\gtrsim n^{c/D}$ for some universal constant $c>0$, thus improving over the previously best-known estimate $\mathrm{dim}(X)\gtrsim (\log n)^2/D^2$ of Linial, London and Rabinovich, strengthening a theorem of Matoušek, and answering a question of Andoni, Nikolov, Razenshteyn and Waingarten.
△ Less
Submitted 27 November, 2016;
originally announced November 2016.
-
Expanders with respect to Hadamard spaces and random graphs
Authors:
Manor Mendel,
Assaf Naor
Abstract:
It is shown that there exists a sequence of 3-regular graphs $\{G_n\}_{n=1}^\infty$ and a Hadamard space $X$ such that $\{G_n\}_{n=1}^\infty$ forms an expander sequence with respect to $X$, yet random regular graphs are not expanders with respect to $X$. This answers a question of \cite{NS11}. $\{G_n\}_{n=1}^\infty$ are also shown to be expanders with respect to random regular graphs, yielding a d…
▽ More
It is shown that there exists a sequence of 3-regular graphs $\{G_n\}_{n=1}^\infty$ and a Hadamard space $X$ such that $\{G_n\}_{n=1}^\infty$ forms an expander sequence with respect to $X$, yet random regular graphs are not expanders with respect to $X$. This answers a question of \cite{NS11}. $\{G_n\}_{n=1}^\infty$ are also shown to be expanders with respect to random regular graphs, yielding a deterministic sublinear time constant factor approximation algorithm for computing the average squared distance in subsets of a random graph. The proof uses the Euclidean cone over a random graph, an auxiliary continuous geometric object that allows for the implementation of martingale methods.
△ Less
Submitted 18 July, 2014; v1 submitted 23 June, 2013;
originally announced June 2013.
-
Component Games on Regular Graphs
Authors:
Rani Hod,
Alon Naor
Abstract:
We study the (1:b) Maker-Breaker component game, played on the edge set of a d-regular graph. Maker's aim in this game is to build a large connected component, while Breaker's aim is to not let him do so. For all values of Breaker's bias b, we determine whether Breaker wins (on any d-regular graph) or Maker wins (on almost every d-regular graph) and provide explicit winning strategies for both pla…
▽ More
We study the (1:b) Maker-Breaker component game, played on the edge set of a d-regular graph. Maker's aim in this game is to build a large connected component, while Breaker's aim is to not let him do so. For all values of Breaker's bias b, we determine whether Breaker wins (on any d-regular graph) or Maker wins (on almost every d-regular graph) and provide explicit winning strategies for both players.
To this end, we prove an extension of a theorem by Gallai-Hasse-Roy-Vitaver about graph orientations without long directed simple paths.
△ Less
Submitted 2 January, 2013;
originally announced January 2013.
-
Efficient Rounding for the Noncommutative Grothendieck Inequality
Authors:
Assaf Naor,
Oded Regev,
Thomas Vidick
Abstract:
$ \newcommand{\cclass}[1]{\textsf{#1}} $The classical Grothendieck inequality has applications to the design of approximation algorithms for $\cclass{NP}$-hard optimization problems. We show that an algorithmic interpretation may also be given for a noncommutative generalization of the Grothendieck inequality due to Pisier and Haagerup. Our main result, an efficient rounding procedure for this ine…
▽ More
$ \newcommand{\cclass}[1]{\textsf{#1}} $The classical Grothendieck inequality has applications to the design of approximation algorithms for $\cclass{NP}$-hard optimization problems. We show that an algorithmic interpretation may also be given for a noncommutative generalization of the Grothendieck inequality due to Pisier and Haagerup. Our main result, an efficient rounding procedure for this inequality, leads to a polynomial-time constant-factor approximation algorithm for an optimization problem which generalizes the Cut Norm problem of Frieze and Kannan, and is shown here to have additional applications to robust principal component analysis and the orthogonal Procrustes problem.
△ Less
Submitted 22 February, 2022; v1 submitted 29 October, 2012;
originally announced October 2012.
-
Locally decodable codes and the failure of cotype for projective tensor products
Authors:
Jop Briet,
Assaf Naor,
Oded Regev
Abstract:
It is shown that for every $p\in (1,\infty)$ there exists a Banach space $X$ of finite cotype such that the projective tensor product $\ell_p\tp X$ fails to have finite cotype. More generally, if $p_1,p_2,p_3\in (1,\infty)$ satisfy $\frac{1}{p_1}+\frac{1}{p_2}+\frac{1}{p_3}\le 1$ then $\ell_{p_1}\tp\ell_{p_2}\tp\ell_{p_3}$ does not have finite cotype. This is a proved via a connection to the theor…
▽ More
It is shown that for every $p\in (1,\infty)$ there exists a Banach space $X$ of finite cotype such that the projective tensor product $\ell_p\tp X$ fails to have finite cotype. More generally, if $p_1,p_2,p_3\in (1,\infty)$ satisfy $\frac{1}{p_1}+\frac{1}{p_2}+\frac{1}{p_3}\le 1$ then $\ell_{p_1}\tp\ell_{p_2}\tp\ell_{p_3}$ does not have finite cotype. This is a proved via a connection to the theory of locally decodable codes.
△ Less
Submitted 2 August, 2012;
originally announced August 2012.
-
Solution of the propeller conjecture in $\mathbb{R}^3$
Authors:
Steven Heilman,
Aukosh Jagannath,
Assaf Naor
Abstract:
It is shown that every measurable partition ${A_1,..., A_k}$ of $\mathbb{R}^3$ satisfies $$\sum_{i=1}^k||\int_{A_i} xe^{-\frac12||x||_2^2}dx||_2^2\le 9π^2.\qquad(*)$$ Let ${P_1,P_2,P_3}$ be the partition of $\mathbb{R}^2$ into $120^\circ$ sectors centered at the origin. The bound is sharp, with equality holding if $A_i=P_i\times \mathbb{R}$ for $i\in {1,2,3}$ and $A_i=\emptyset$ for…
▽ More
It is shown that every measurable partition ${A_1,..., A_k}$ of $\mathbb{R}^3$ satisfies $$\sum_{i=1}^k||\int_{A_i} xe^{-\frac12||x||_2^2}dx||_2^2\le 9π^2.\qquad(*)$$ Let ${P_1,P_2,P_3}$ be the partition of $\mathbb{R}^2$ into $120^\circ$ sectors centered at the origin. The bound is sharp, with equality holding if $A_i=P_i\times \mathbb{R}$ for $i\in {1,2,3}$ and $A_i=\emptyset$ for $i\in \{4,...,k\}$ (up to measure zero corrections, orthogonal transformations and renumbering of the sets $\{A_1,...,A_k\}$). This settles positively the 3-dimensional Propeller Conjecture of Khot and Naor (FOCS 2008). The proof of reduces the problem to a finite set of numerical inequalities which are then verified with full rigor in a computer-assisted fashion. The main consequence (and motivation) of $(*)$ is complexity-theoretic: the Unique Games hardness threshold of the Kernel Clustering problem with $4 \times 4$ centered and spherical hypothesis matrix equals $\frac{2π}{3}$.
△ Less
Submitted 5 April, 2014; v1 submitted 13 December, 2011;
originally announced December 2011.
-
Grothendieck-type inequalities in combinatorial optimization
Authors:
Subhash Khot,
Assaf Naor
Abstract:
We survey connections of the Grothendieck inequality and its variants to combinatorial optimization and computational complexity.
We survey connections of the Grothendieck inequality and its variants to combinatorial optimization and computational complexity.
△ Less
Submitted 11 August, 2011;
originally announced August 2011.
-
The Grothendieck constant is strictly smaller than Krivine's bound
Authors:
Mark Braverman,
Konstantin Makarychev,
Yury Makarychev,
Assaf Naor
Abstract:
We prove that $K_G<\fracπ{2\log(1+\sqrt{2})}$, where $K_G$ is the Grothendieck constant.
We prove that $K_G<\fracπ{2\log(1+\sqrt{2})}$, where $K_G$ is the Grothendieck constant.
△ Less
Submitted 17 August, 2011; v1 submitted 31 March, 2011;
originally announced March 2011.
-
Overlap properties of geometric expanders
Authors:
Jacob Fox,
Mikhail Gromov,
Vincent Lafforgue,
Assaf Naor,
Janos Pach
Abstract:
The {\em overlap number} of a finite $(d+1)$-uniform hypergraph $H$ is defined as the largest constant $c(H)\in (0,1]$ such that no matter how we map the vertices of $H$ into $\R^d$, there is a point covered by at least a $c(H)$-fraction of the simplices induced by the images of its hyperedges. In~\cite{Gro2}, motivated by the search for an analogue of the notion of graph expansion for higher dim…
▽ More
The {\em overlap number} of a finite $(d+1)$-uniform hypergraph $H$ is defined as the largest constant $c(H)\in (0,1]$ such that no matter how we map the vertices of $H$ into $\R^d$, there is a point covered by at least a $c(H)$-fraction of the simplices induced by the images of its hyperedges. In~\cite{Gro2}, motivated by the search for an analogue of the notion of graph expansion for higher dimensional simplicial complexes, it was asked whether or not there exists a sequence $\{H_n\}_{n=1}^\infty$ of arbitrarily large $(d+1)$-uniform hypergraphs with bounded degree, for which $\inf_{n\ge 1} c(H_n)>0$. Using both random methods and explicit constructions, we answer this question positively by constructing infinite families of $(d+1)$-uniform hypergraphs with bounded degree such that their overlap numbers are bounded from below by a positive constant $c=c(d)$. We also show that, for every $d$, the best value of the constant $c=c(d)$ that can be achieved by such a construction is asymptotically equal to the limit of the overlap numbers of the complete $(d+1)$-uniform hypergraphs with $n$ vertices, as $n\rightarrow\infty$. For the proof of the latter statement, we establish the following geometric partitioning result of independent interest. For any $d$ and any $ε>0$, there exists $K=K(ε,d)\ge d+1$ satisfying the following condition. For any $k\ge K$, for any point $q \in \mathbb{R}^d$ and for any finite Borel measure $μ$ on $\mathbb{R}^d$ with respect to which every hyperplane has measure $0$, there is a partition $\mathbb{R}^d=A_1 \cup \ldots \cup A_{k}$ into $k$ measurable parts of equal measure such that all but at most an $ε$-fraction of the $(d+1)$-tuples $A_{i_1},\ldots,A_{i_{d+1}}$ have the property that either all simplices with one vertex in each $A_{i_j}$ contain $q$ or none of these simplices contain $q$.
△ Less
Submitted 9 May, 2010;
originally announced May 2010.
-
L_1 embeddings of the Heisenberg group and fast estimation of graph isoperimetry
Authors:
Assaf Naor
Abstract:
We survey connections between the theory of bi-Lipschitz embeddings and the Sparsest Cut Problem in combinatorial optimization. The story of the Sparsest Cut Problem is a striking example of the deep interplay between analysis, geometry, and probability on the one hand, and computational issues in discrete mathematics on the other. We explain how the key ideas evolved over the past 20 years, empha…
▽ More
We survey connections between the theory of bi-Lipschitz embeddings and the Sparsest Cut Problem in combinatorial optimization. The story of the Sparsest Cut Problem is a striking example of the deep interplay between analysis, geometry, and probability on the one hand, and computational issues in discrete mathematics on the other. We explain how the key ideas evolved over the past 20 years, emphasizing the interactions with Banach space theory, geometric measure theory, and geometric group theory. As an important illustrative example, we shall examine recently established connections to the the structure of the Heisenberg group, and the incompatibility of its Carnot-Carathéodory geometry with the geometry of the Lebesgue space $L_1$.
△ Less
Submitted 22 March, 2010;
originally announced March 2010.
-
Compression bounds for Lipschitz maps from the Heisenberg group to $L_1$
Authors:
Jeff Cheeger,
Bruce Kleiner,
Assaf Naor
Abstract:
We prove a quantitative bi-Lipschitz nonembedding theorem for the Heisenberg group with its Carnot-Carathéodory metric and apply it to give a lower bound on the integrality gap of the Goemans-Linial semidefinite relaxation of the Sparsest Cut problem.
We prove a quantitative bi-Lipschitz nonembedding theorem for the Heisenberg group with its Carnot-Carathéodory metric and apply it to give a lower bound on the integrality gap of the Goemans-Linial semidefinite relaxation of the Sparsest Cut problem.
△ Less
Submitted 11 October, 2009;
originally announced October 2009.
-
A $(\log n)^{Ω(1)}$ integrality gap for the Sparsest Cut SDP
Authors:
Jeff Cheeger,
Bruce Kleiner,
Assaf Naor
Abstract:
We show that the Goemans-Linial semidefinite relaxation of the Sparsest Cut problem with general demands has integrality gap $(\log n)^{Ω(1)}$. This is achieved by exhibiting $n$-point metric spaces of negative type whose $L_1$ distortion is $(\log n)^{Ω(1)}$. Our result is based on quantitative bounds on the rate of degeneration of Lipschitz maps from the Heisenberg group to $L_1$ when restrict…
▽ More
We show that the Goemans-Linial semidefinite relaxation of the Sparsest Cut problem with general demands has integrality gap $(\log n)^{Ω(1)}$. This is achieved by exhibiting $n$-point metric spaces of negative type whose $L_1$ distortion is $(\log n)^{Ω(1)}$. Our result is based on quantitative bounds on the rate of degeneration of Lipschitz maps from the Heisenberg group to $L_1$ when restricted to cosets of the center.
△ Less
Submitted 18 November, 2009; v1 submitted 11 October, 2009;
originally announced October 2009.
-
Sharp kernel clustering algorithms and their associated Grothendieck inequalities
Authors:
Subhash Khot,
Assaf Naor
Abstract:
In the kernel clustering problem we are given a (large) $n\times n$ symmetric positive semidefinite matrix $A=(a_{ij})$ with $\sum_{i=1}^n\sum_{j=1}^n a_{ij}=0$ and a (small) $k\times k$ symmetric positive semidefinite matrix $B=(b_{ij})$. The goal is to find a partition $\{S_1,...,S_k\}$ of $\{1,... n\}$ which maximizes $ \sum_{i=1}^k\sum_{j=1}^k (\sum_{(p,q)\in S_i\times S_j}a_{pq})b_{ij}$.…
▽ More
In the kernel clustering problem we are given a (large) $n\times n$ symmetric positive semidefinite matrix $A=(a_{ij})$ with $\sum_{i=1}^n\sum_{j=1}^n a_{ij}=0$ and a (small) $k\times k$ symmetric positive semidefinite matrix $B=(b_{ij})$. The goal is to find a partition $\{S_1,...,S_k\}$ of $\{1,... n\}$ which maximizes $ \sum_{i=1}^k\sum_{j=1}^k (\sum_{(p,q)\in S_i\times S_j}a_{pq})b_{ij}$.
We design a polynomial time approximation algorithm that achieves an approximation ratio of $\frac{R(B)^2}{C(B)}$, where $R(B)$ and $C(B)$ are geometric parameters that depend only on the matrix $B$, defined as follows: if $b_{ij} = < v_i, v_j>$ is the Gram matrix representation of $B$ for some $v_1,...,v_k\in \R^k$ then $R(B)$ is the minimum radius of a Euclidean ball containing the points $\{v_1, ..., v_k\}$. The parameter $C(B)$ is defined as the maximum over all measurable partitions $\{A_1,...,A_k\}$ of $\R^{k-1}$ of the quantity $\sum_{i=1}^k\sum_{j=1}^k b_{ij}< z_i,z_j>$, where for $i\in \{1,...,k\}$ the vector $z_i\in \R^{k-1}$ is the Gaussian moment of $A_i$, i.e., $z_i=\frac{1}{(2π)^{(k-1)/2}}\int_{A_i}xe^{-\|x\|_2^2/2}dx$. We also show that for every $\eps > 0$, achieving an approximation guarantee of $(1-\e)\frac{R(B)^2}{C(B)}$ is Unique Games hard.
△ Less
Submitted 25 June, 2009;
originally announced June 2009.
-
Approximate kernel clustering
Authors:
Subhash Khot,
Assaf Naor
Abstract:
In the kernel clustering problem we are given a large $n\times n$ positive semi-definite matrix $A=(a_{ij})$ with $\sum_{i,j=1}^na_{ij}=0$ and a small $k\times k$ positive semi-definite matrix $B=(b_{ij})$. The goal is to find a partition $S_1,...,S_k$ of $\{1,... n\}$ which maximizes the quantity $$ \sum_{i,j=1}^k (\sum_{(i,j)\in S_i\times S_j}a_{ij})b_{ij}. $$ We study the computational comple…
▽ More
In the kernel clustering problem we are given a large $n\times n$ positive semi-definite matrix $A=(a_{ij})$ with $\sum_{i,j=1}^na_{ij}=0$ and a small $k\times k$ positive semi-definite matrix $B=(b_{ij})$. The goal is to find a partition $S_1,...,S_k$ of $\{1,... n\}$ which maximizes the quantity $$ \sum_{i,j=1}^k (\sum_{(i,j)\in S_i\times S_j}a_{ij})b_{ij}. $$ We study the computational complexity of this generic clustering problem which originates in the theory of machine learning. We design a constant factor polynomial time approximation algorithm for this problem, answering a question posed by Song, Smola, Gretton and Borgwardt. In some cases we manage to compute the sharp approximation threshold for this problem assuming the Unique Games Conjecture (UGC). In particular, when $B$ is the $3\times 3$ identity matrix the UGC hardness threshold of this problem is exactly $\frac{16π}{27}$. We present and study a geometric conjecture of independent interest which we show would imply that the UGC threshold when $B$ is the $k\times k$ identity matrix is $\frac{8π}{9}(1-\frac{1}{k})$ for every $k\ge 3$.
△ Less
Submitted 9 December, 2008; v1 submitted 29 July, 2008;
originally announced July 2008.
-
The Johnson-Lindenstrauss lemma almost characterizes Hilbert space, but not quite
Authors:
William B. Johnson,
Assaf Naor
Abstract:
Let $X$ be a normed space that satisfies the Johnson-Lindenstrauss lemma (J-L lemma, in short) in the sense that for any integer $n$ and any $x_1,\ldots,x_n\in X$ there exists a linear map** $L:X\to F$, where $F\subseteq X$ is a linear subspace of dimension $O(\log n)$, such that $\|x_i-x_j\|\le\|L(x_i)-L(x_j)\|\le O(1)\cdot\|x_i-x_j\|$ for all $i,j\in \{1,\ldots, n\}$. We show that this impli…
▽ More
Let $X$ be a normed space that satisfies the Johnson-Lindenstrauss lemma (J-L lemma, in short) in the sense that for any integer $n$ and any $x_1,\ldots,x_n\in X$ there exists a linear map** $L:X\to F$, where $F\subseteq X$ is a linear subspace of dimension $O(\log n)$, such that $\|x_i-x_j\|\le\|L(x_i)-L(x_j)\|\le O(1)\cdot\|x_i-x_j\|$ for all $i,j\in \{1,\ldots, n\}$. We show that this implies that $X$ is almost Euclidean in the following sense: Every $n$-dimensional subspace of $X$ embeds into Hilbert space with distortion $2^{2^{O(\log^*n)}}$. On the other hand, we show that there exists a normed space $Y$ which satisfies the J-L lemma, but for every $n$ there exists an $n$-dimensional subspace $E_n\subseteq Y$ whose Euclidean distortion is at least $2^{Ω(α(n))}$, where $α$ is the inverse Ackermann function.
△ Less
Submitted 11 July, 2008;
originally announced July 2008.
-
Maximum gradient embeddings and monotone clustering
Authors:
Manor Mendel,
Assaf Naor
Abstract:
Let (X,d_X) be an n-point metric space. We show that there exists a distribution D over non-contractive embeddings into trees f:X-->T such that for every x in X, the expectation with respect to D of the maximum over y in X of the ratio d_T(f(x),f(y)) / d_X(x,y) is at most C (log n)^2, where C is a universal constant. Conversely we show that the above quadratic dependence on log n cannot be improve…
▽ More
Let (X,d_X) be an n-point metric space. We show that there exists a distribution D over non-contractive embeddings into trees f:X-->T such that for every x in X, the expectation with respect to D of the maximum over y in X of the ratio d_T(f(x),f(y)) / d_X(x,y) is at most C (log n)^2, where C is a universal constant. Conversely we show that the above quadratic dependence on log n cannot be improved in general. Such embeddings, which we call maximum gradient embeddings, yield a framework for the design of approximation algorithms for a wide range of clustering problems with monotone costs, including fault-tolerant versions of k-median and facility location.
△ Less
Submitted 29 August, 2010; v1 submitted 26 June, 2006;
originally announced June 2006.
-
Ramsey partitions and proximity data structures
Authors:
Manor Mendel,
Assaf Naor
Abstract:
This paper addresses two problems lying at the intersection of geometric analysis and theoretical computer science: The non-linear isomorphic Dvoretzky theorem and the design of good approximate distance oracles for large distortion. We introduce the notion of Ramsey partitions of a finite metric space, and show that the existence of good Ramsey partitions implies a solution to the metric Ramsey…
▽ More
This paper addresses two problems lying at the intersection of geometric analysis and theoretical computer science: The non-linear isomorphic Dvoretzky theorem and the design of good approximate distance oracles for large distortion. We introduce the notion of Ramsey partitions of a finite metric space, and show that the existence of good Ramsey partitions implies a solution to the metric Ramsey problem for large distortion (a.k.a. the non-linear version of the isomorphic Dvoretzky theorem, as introduced by Bourgain, Figiel, and Milman). We then proceed to construct optimal Ramsey partitions, and use them to show that for every e\in (0,1), any n-point metric space has a subset of size n^{1-e} which embeds into Hilbert space with distortion O(1/e). This result is best possible and improves part of the metric Ramsey theorem of Bartal, Linial, Mendel and Naor, in addition to considerably simplifying its proof. We use our new Ramsey partitions to design the best known approximate distance oracles when the distortion is large, closing a gap left open by Thorup and Zwick. Namely, we show that for any $n$ point metric space X, and k>1, there exists an O(k)-approximate distance oracle whose storage requirement is O(n^{1+1/k}), and whose query time is a universal constant. We also discuss applications of Ramsey partitions to various other geometric data structure problems, such as the design of efficient data structures for approximate ranking.
△ Less
Submitted 10 May, 2006; v1 submitted 23 November, 2005;
originally announced November 2005.
-
Lower bounds on Locality Sensitive Hashing
Authors:
Rajeev Motwani,
Assaf Naor,
Rina Panigrahy
Abstract:
Given a metric space $(X,d_X)$, $c\ge 1$, $r>0$, and $p,q\in [0,1]$, a distribution over map**s $\h:X\to \mathbb N$ is called a $(r,cr,p,q)$-sensitive hash family if any two points in $X$ at distance at most $r$ are mapped by $\h$ to the same value with probability at least $p$, and any two points at distance greater than $cr$ are mapped by $\h$ to the same value with probability at most $q$.…
▽ More
Given a metric space $(X,d_X)$, $c\ge 1$, $r>0$, and $p,q\in [0,1]$, a distribution over map**s $\h:X\to \mathbb N$ is called a $(r,cr,p,q)$-sensitive hash family if any two points in $X$ at distance at most $r$ are mapped by $\h$ to the same value with probability at least $p$, and any two points at distance greater than $cr$ are mapped by $\h$ to the same value with probability at most $q$. This notion was introduced by Indyk and Motwani in 1998 as the basis for an efficient approximate nearest neighbor search algorithm, and has since been used extensively for this purpose. The performance of these algorithms is governed by the parameter $ρ=\frac{\log(1/p)}{\log(1/q)}$, and constructing hash families with small $ρ$ automatically yields improved nearest neighbor algorithms. Here we show that for $X=\ell_1$ it is impossible to achieve $ρ\le \frac{1}{2c}$. This almost matches the construction of Indyk and Motwani which achieves $ρ\le \frac{1}{c}$.
△ Less
Submitted 26 November, 2005; v1 submitted 29 October, 2005;
originally announced October 2005.
-
Planar Earthmover is not in $L_1$
Authors:
Assaf Naor,
Gideon Schechtman
Abstract:
We show that any $L_1$ embedding of the transportation cost (a.k.a. Earthmover) metric on probability measures supported on the grid $\{0,1,...,n\}^2\subseteq \R^2$ incurs distortion $Ω(\sqrt{\log n})$. We also use Fourier analytic techniques to construct a simple $L_1$ embedding of this space which has distortion $O(\log n)$.
We show that any $L_1$ embedding of the transportation cost (a.k.a. Earthmover) metric on probability measures supported on the grid $\{0,1,...,n\}^2\subseteq \R^2$ incurs distortion $Ω(\sqrt{\log n})$. We also use Fourier analytic techniques to construct a simple $L_1$ embedding of this space which has distortion $O(\log n)$.
△ Less
Submitted 26 September, 2005;
originally announced September 2005.
-
Measured descent: A new embedding method for finite metrics
Authors:
Robert Krauthgamer,
James R. Lee,
Manor Mendel,
Assaf Naor
Abstract:
We devise a new embedding technique, which we call measured descent, based on decomposing a metric space locally, at varying speeds, according to the density of some probability measure. This provides a refined and unified framework for the two primary methods of constructing Frechet embeddings for finite metrics, due to [Bourgain, 1985] and [Rao, 1999]. We prove that any n-point metric space (X…
▽ More
We devise a new embedding technique, which we call measured descent, based on decomposing a metric space locally, at varying speeds, according to the density of some probability measure. This provides a refined and unified framework for the two primary methods of constructing Frechet embeddings for finite metrics, due to [Bourgain, 1985] and [Rao, 1999]. We prove that any n-point metric space (X,d) embeds in Hilbert space with distortion O(sqrt{alpha_X log n}), where alpha_X is a geometric estimate on the decomposability of X. As an immediate corollary, we obtain an O(sqrt{(log lambda_X) \log n}) distortion embedding, where λ_X is the doubling constant of X. Since λ_X\le n, this result recovers Bourgain's theorem, but when the metric X is, in a sense, ``low-dimensional,'' improved bounds are achieved.
Our embeddings are volume-respecting for subsets of arbitrary size. One consequence is the existence of (k, O(log n)) volume-respecting embeddings for all 1 \leq k \leq n, which is the best possible, and answers positively a question posed by U. Feige. Our techniques are also used to answer positively a question of Y. Rabinovich, showing that any weighted n-point planar graph embeds in l_\infty^{O(log n)} with O(1) distortion. The O(log n) bound on the dimension is optimal, and improves upon the previously known bound of O((log n)^2).
△ Less
Submitted 18 August, 2005; v1 submitted 2 December, 2004;
originally announced December 2004.
-
On metric Ramsey-type phenomena
Authors:
Yair Bartal,
Nathan Linial,
Manor Mendel,
Assaf Naor
Abstract:
The main question studied in this article may be viewed as a nonlinear analogue of Dvoretzky's theorem in Banach space theory or as part of Ramsey theory in combinatorics. Given a finite metric space on n points, we seek its subspace of largest cardinality which can be embedded with a given distortion in Hilbert space. We provide nearly tight upper and lower bounds on the cardinality of this sub…
▽ More
The main question studied in this article may be viewed as a nonlinear analogue of Dvoretzky's theorem in Banach space theory or as part of Ramsey theory in combinatorics. Given a finite metric space on n points, we seek its subspace of largest cardinality which can be embedded with a given distortion in Hilbert space. We provide nearly tight upper and lower bounds on the cardinality of this subspace in terms of n and the desired distortion. Our main theorem states that for any epsilon>0, every n point metric space contains a subset of size at least n^{1-ε} which is embeddable in Hilbert space with O(\frac{\log(1/ε)}ε) distortion. The bound on the distortion is tight up to the log(1/ε) factor. We further include a comprehensive study of various other aspects of this problem.
△ Less
Submitted 20 June, 2007; v1 submitted 17 June, 2004;
originally announced June 2004.