Skip to main content

Showing 1–45 of 45 results for author: Kothari, P K

.
  1. arXiv:2405.10238  [pdf, other

    cs.DS cs.CC

    Rounding Large Independent Sets on Expanders

    Authors: Mitali Bafna, Jun-Ting Hsieh, Pravesh K. Kothari

    Abstract: We develop a new approach for approximating large independent sets when the input graph is a one-sided spectral expander - that is, the uniform random walk matrix of the graph has the second eigenvalue bounded away from 1. Consequently, we obtain a polynomial time algorithm to find linear-sized independent sets in one-sided expanders that are almost $3$-colorable or are promised to contain an inde… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 57 pages, 3 figures

  2. arXiv:2404.14159  [pdf, ps, other

    cs.DS

    Semirandom Planted Clique and the Restricted Isometry Property

    Authors: Jarosław Błasiok, Rares-Darius Buhai, Pravesh K. Kothari, David Steurer

    Abstract: We give a simple, greedy $O(n^{ω+0.5})=O(n^{2.872})$-time algorithm to list-decode planted cliques in a semirandom model introduced in [CSV17] (following [FK01]) that succeeds whenever the size of the planted clique is $k\geq O(\sqrt{n} \log^2 n)$. In the model, the edges touching the vertices in the planted $k$-clique are drawn independently with probability $p=1/2$ while the edges not touching t… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 21 pages

  3. arXiv:2404.06513  [pdf, ps, other

    cs.CC

    Superpolynomial Lower Bounds for Smooth 3-LCCs and Sharp Bounds for Designs

    Authors: Pravesh K. Kothari, Peter Manohar

    Abstract: We give improved lower bounds for binary $3$-query locally correctable codes (3-LCCs) $C \colon \{0,1\}^k \rightarrow \{0,1\}^n$. Specifically, we prove: (1) If $C$ is a linear design 3-LCC, then $n \geq 2^{(1 - o(1))\sqrt{k} }$. A design 3-LCC has the additional property that the correcting sets for every codeword bit form a perfect matching and every pair of codeword bits is queried an equal n… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  4. arXiv:2401.11590  [pdf, ps, other

    cs.CC math.CO

    Small Even Covers, Locally Decodable Codes and Restricted Subgraphs of Edge-Colored Kikuchi Graphs

    Authors: Jun-Ting Hsieh, Pravesh K. Kothari, Sidhanth Mohanty, David Munhá Correia, Benny Sudakov

    Abstract: Given a $k$-uniform hypergraph $H$ on $n$ vertices, an even cover in $H$ is a collection of hyperedges that touch each vertex an even number of times. Even covers are a generalization of cycles in graphs and are equivalent to linearly dependent subsets of a system of linear equations modulo $2$. As a result, they arise naturally in the context of well-studied questions in coding theory and refutin… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: 19 pages

  5. arXiv:2311.00558  [pdf, other

    cs.CC

    An Exponential Lower Bound for Linear 3-Query Locally Correctable Codes

    Authors: Pravesh K. Kothari, Peter Manohar

    Abstract: We prove that the blocklength $n$ of a linear $3$-query locally correctable code (LCC) $\mathcal{L} \colon {\mathbb F}^k \to {\mathbb F}^n$ with distance $δ$ must be at least $n \geq 2^{Ω\left(\left(\frac{δ^2 k}{(|{\mathbb F}|-1)^2}\right)^{1/8}\right)}$. In particular, the blocklength of a linear $3$-query LCC with constant distance over any small field grows exponentially with $k$. This improves… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  6. arXiv:2310.00393  [pdf, ps, other

    cs.DS cs.CC

    New SDP Roundings and Certifiable Approximation for Cubic Optimization

    Authors: Jun-Ting Hsieh, Pravesh K. Kothari, Lucas Pesenti, Luca Trevisan

    Abstract: We give new rounding schemes for SDP relaxations for the problems of maximizing cubic polynomials over the unit sphere and the $n$-dimensional hypercube. In both cases, the resulting algorithms yield a $O(\sqrt{n/k})$ multiplicative approximation in $2^{O(k)} \text{poly}(n)$ time. In particular, we obtain a $O(\sqrt{n/\log n})$ approximation in polynomial time. For the unit sphere, this improves o… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

  7. arXiv:2309.16897  [pdf, other

    cs.CC cs.DS

    Efficient Algorithms for Semirandom Planted CSPs at the Refutation Threshold

    Authors: Venkatesan Guruswami, Jun-Ting Hsieh, Pravesh K. Kothari, Peter Manohar

    Abstract: We present an efficient algorithm to solve semirandom planted instances of any Boolean constraint satisfaction problem (CSP). The semirandom model is a hybrid between worst-case and average-case input models, where the input is generated by (1) choosing an arbitrary planted assignment $x^*$, (2) choosing an arbitrary clause structure, and (3) choosing literal negations for each clause from an arbi… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: FOCS 2023

  8. arXiv:2308.15403  [pdf, ps, other

    cs.CC cs.IT

    A Near-Cubic Lower Bound for 3-Query Locally Decodable Codes from Semirandom CSP Refutation

    Authors: Omar Alrabiah, Venkatesan Guruswami, Pravesh K. Kothari, Peter Manohar

    Abstract: A code $C \colon \{0,1\}^k \to \{0,1\}^n$ is a $q$-locally decodable code ($q$-LDC) if one can recover any chosen bit $b_i$ of the message $b \in \{0,1\}^k$ with good confidence by randomly querying the encoding $x := C(b)$ on at most $q$ coordinates. Existing constructions of $2$-LDCs achieve $n = \exp(O(k))$, and lower bounds show that this is in fact tight. However, when $q = 3$, far less is kn… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  9. arXiv:2307.05954  [pdf, other

    math.PR cs.CC cs.DS

    Ellipsoid Fitting Up to a Constant

    Authors: Jun-Ting Hsieh, Pravesh K. Kothari, Aaron Potechin, Jeff Xu

    Abstract: In [Sau11,SPW13], Saunderson, Parrilo and Willsky asked the following elegant geometric question: what is the largest $m= m(d)$ such that there is an ellipsoid in $\mathbb{R}^d$ that passes through $v_1, v_2, \ldots, v_m$ with high probability when the $v_i$s are chosen independently from the standard Gaussian distribution $N(0,I_{d})$. The existence of such an ellipsoid is equivalent to the exist… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: ICALP 2023

  10. arXiv:2303.00252  [pdf, ps, other

    cs.CC cs.DS

    Is Planted Coloring Easier than Planted Clique?

    Authors: Pravesh K. Kothari, Santosh S. Vempala, Alexander S. Wein, Jeff Xu

    Abstract: We study the computational complexity of two related problems: recovering a planted $q$-coloring in $G(n,1/2)$, and finding efficiently verifiable witnesses of non-$q$-colorability (a.k.a. refutations) in $G(n,1/2)$. Our main results show hardness for both these problems in a restricted-but-powerful class of algorithms based on computing low-degree polynomials in the inputs. The problem of recov… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: 23 pages

  11. arXiv:2302.12289  [pdf, other

    cs.DS cs.LG math.ST stat.ML

    Beyond Moments: Robustly Learning Affine Transformations with Asymptotically Optimal Error

    Authors: He Jia, Pravesh K . Kothari, Santosh S. Vempala

    Abstract: We present a polynomial-time algorithm for robustly learning an unknown affine transformation of the standard hypercube from samples, an important and well-studied setting for independent component analysis (ICA). Specifically, given an $ε$-corrupted sample from a distribution $D$ obtained by applying an unknown affine transformation $x \rightarrow Ax+s$ to the uniform distribution on a $d$-dimens… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

  12. arXiv:2212.08018  [pdf, ps, other

    cs.DS cs.CR cs.IT stat.ML

    Privately Estimating a Gaussian: Efficient, Robust and Optimal

    Authors: Daniel Alabi, Pravesh K. Kothari, Pranay Tankala, Prayaag Venkat, Fred Zhang

    Abstract: In this work, we give efficient algorithms for privately estimating a Gaussian distribution in both pure and approximate differential privacy (DP) models with optimal dependence on the dimension in the sample complexity. In the pure DP setting, we give an efficient algorithm that estimates an unknown $d$-dimensional Gaussian distribution up to an arbitrary tiny total variation error using… ▽ More

    Submitted 1 June, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

  13. arXiv:2212.05619  [pdf, ps, other

    cs.DS

    Algorithms approaching the threshold for semi-random planted clique

    Authors: Rares-Darius Buhai, Pravesh K. Kothari, David Steurer

    Abstract: We design new polynomial-time algorithms for recovering planted cliques in the semi-random graph model introduced by Feige and Kilian 2001. The previous best algorithms for this model succeed if the planted clique has size at least $n^{2/3}$ in a graph with $n$ vertices (Mehta, Mckenzie, Trevisan 2019 and Charikar, Steinhardt, Valiant 2017). Our algorithms work for planted-clique sizes approaching… ▽ More

    Submitted 6 June, 2023; v1 submitted 11 December, 2022; originally announced December 2022.

    Comments: 51 pages, the arxiv landing page contains a shortened abstract

    ACM Class: F.2

  14. arXiv:2211.13312  [pdf, ps, other

    cs.LG cs.CC stat.ML

    A Moment-Matching Approach to Testable Learning and a New Characterization of Rademacher Complexity

    Authors: Aravind Gollakota, Adam R. Klivans, Pravesh K. Kothari

    Abstract: A remarkable recent paper by Rubinfeld and Vasilyan (2022) initiated the study of \emph{testable learning}, where the goal is to replace hard-to-verify distributional assumptions (such as Gaussianity) with efficiently testable ones and to require that the learner succeed whenever the unknown distribution passes the corresponding test. In this model, they gave an efficient algorithm for learning ha… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: 34 pages

  15. arXiv:2208.00122  [pdf, ps, other

    cs.DS cs.CC

    Polynomial-Time Power-Sum Decomposition of Polynomials

    Authors: Mitali Bafna, Jun-Ting Hsieh, Pravesh K. Kothari, Jeff Xu

    Abstract: We give efficient algorithms for finding power-sum decomposition of an input polynomial $P(x)= \sum_{i\leq m} p_i(x)^d$ with component $p_i$s. The case of linear $p_i$s is equivalent to the well-studied tensor decomposition problem while the quadratic case occurs naturally in studying identifiability of non-spherical Gaussian mixtures from low-order moments. Unlike tensor decomposition, both the… ▽ More

    Submitted 29 July, 2022; originally announced August 2022.

    Comments: To appear in FOCS 2022

  16. arXiv:2207.10850  [pdf, other

    math.CO cs.DM cs.DS

    A simple and sharper proof of the hypergraph Moore bound

    Authors: Jun-Ting Hsieh, Pravesh K. Kothari, Sidhanth Mohanty

    Abstract: The hypergraph Moore bound is an elegant statement that characterizes the extremal trade-off between the girth - the number of hyperedges in the smallest cycle or even cover (a subhypergraph with all degrees even) and size - the number of hyperedges in a hypergraph. For graphs (i.e., $2$-uniform hypergraphs), a bound tight up to the leading constant was proven in a classical work of Alon, Hoory an… ▽ More

    Submitted 21 July, 2022; originally announced July 2022.

  17. arXiv:2206.10942  [pdf, ps, other

    cs.DS cs.LG math.ST stat.ML

    List-Decodable Covariance Estimation

    Authors: Misha Ivkov, Pravesh K. Kothari

    Abstract: We give the first polynomial time algorithm for \emph{list-decodable covariance estimation}. For any $α> 0$, our algorithm takes input a sample $Y \subseteq \mathbb{R}^d$ of size $n\geq d^{\mathsf{poly}(1/α)}$ obtained by adversarially corrupting an $(1-α)n$ points in an i.i.d. sample $X$ of size $n$ from the Gaussian distribution with unknown mean $μ_*$ and covariance $Σ_*$. In… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: Abstract slightly clipped. To appear at STOC 2022

    ACM Class: F.2.1

  18. arXiv:2206.09204  [pdf, ps, other

    cs.DS cs.CC

    Approximating Max-Cut on Bounded Degree Graphs: Tighter Analysis of the FKL Algorithm

    Authors: Jun-Ting Hsieh, Pravesh K. Kothari

    Abstract: In this note, we describe a $α_{GW} + \tildeΩ(1/d^2)$-factor approximation algorithm for Max-Cut on weighted graphs of degree $\leq d$. Here, $α_{GW}\approx 0.878$ is the worst-case approximation ratio of the Goemans-Williamson rounding for Max-Cut. This improves on previous results for unweighted graphs by Feige, Karpinski, and Langberg and Florén. Our guarantee is obtained by a tighter analysis… ▽ More

    Submitted 18 June, 2022; originally announced June 2022.

  19. arXiv:2205.06739  [pdf, ps, other

    cs.DS

    Bypassing the XOR Trick: Stronger Certificates for Hypergraph Clique Number

    Authors: Venkatesan Guruswami, Pravesh K. Kothari, Peter Manohar

    Abstract: Let $\mathcal{H}(k,n,p)$ be the distribution on $k$-uniform hypergraphs where every subset of $[n]$ of size $k$ is included as an hyperedge with probability $p$ independently. In this work, we design and analyze a simple spectral algorithm that certifies a bound on the size of the largest clique, $ω(H)$, in hypergraphs $H \sim \mathcal{H}(k,n,p)$. For example, for any constant $p$, with high proba… ▽ More

    Submitted 13 May, 2022; originally announced May 2022.

  20. arXiv:2112.03548  [pdf, ps, other

    stat.ML cs.CR cs.DS cs.IT cs.LG

    Private Robust Estimation by Stabilizing Convex Relaxations

    Authors: Pravesh K. Kothari, Pasin Manurangsi, Ameya Velingker

    Abstract: We give the first polynomial time and sample $(ε, δ)$-differentially private (DP) algorithm to estimate the mean, covariance and higher moments in the presence of a constant fraction of adversarial outliers. Our algorithm succeeds for families of distributions that satisfy two well-studied properties in prior works on robust estimation: certifiable subgaussianity of directional moments and certifi… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

  21. arXiv:2110.11853  [pdf, ps, other

    cs.DS math.ST

    Polynomial-Time Sum-of-Squares Can Robustly Estimate Mean and Covariance of Gaussians Optimally

    Authors: Pravesh K. Kothari, Peter Manohar, Brian Hu Zhang

    Abstract: In this work, we revisit the problem of estimating the mean and covariance of an unknown $d$-dimensional Gaussian distribution in the presence of an $\varepsilon$-fraction of adversarial outliers. The pioneering work of [DKK+16] gave a polynomial time algorithm for this task with optimal $\tilde{O}(\varepsilon)$ error using $n = \textrm{poly}(d, 1/\varepsilon)$ samples. On the other hand, [KS17b… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

  22. arXiv:2110.08677  [pdf, ps, other

    cs.CC cs.DS

    Algorithmic Thresholds for Refuting Random Polynomial Systems

    Authors: Jun-Ting Hsieh, Pravesh K. Kothari

    Abstract: Consider a system of $m$ polynomial equations $\{p_i(x) = b_i\}_{i \leq m}$ of degree $D\geq 2$ in $n$-dimensional variable $x \in \mathbb{R}^n$ such that each coefficient of every $p_i$ and $b_i$s are chosen at random and independently from some continuous distribution. We study the basic question of determining the smallest $m$ -- the algorithmic threshold -- for which efficient algorithms can f… ▽ More

    Submitted 16 October, 2021; originally announced October 2021.

  23. arXiv:2109.04415  [pdf, other

    cs.CC cs.DS

    Algorithms and Certificates for Boolean CSP Refutation: "Smoothed is no harder than Random"

    Authors: Venkatesan Guruswami, Pravesh K. Kothari, Peter Manohar

    Abstract: We present an algorithm for strongly refuting smoothed instances of all Boolean CSPs. The smoothed model is a hybrid between worst and average-case input models, where the input is an arbitrary instance of the CSP with only the negation patterns of the literals re-randomized with some small probability. For an $n$-variable smoothed instance of a $k$-arity CSP, our algorithm runs in $n^{O(\ell)}$ t… ▽ More

    Submitted 3 September, 2023; v1 submitted 9 September, 2021; originally announced September 2021.

  24. arXiv:2107.02320  [pdf, ps, other

    cs.LG cs.CC

    Memory-Sample Lower Bounds for Learning Parity with Noise

    Authors: Sumegha Garg, Pravesh K. Kothari, Pengda Liu, Ran Raz

    Abstract: In this work, we show, for the well-studied problem of learning parity under noise, where a learner tries to learn $x=(x_1,\ldots,x_n) \in \{0,1\}^n$ from a stream of random linear equations over $\mathrm{F}_2$ that are correct with probability $\frac{1}{2}+\varepsilon$ and flipped with probability $\frac{1}{2}-\varepsilon$, that any learning algorithm requires either a memory of size… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: 19 pages. To appear in RANDOM 2021. arXiv admin note: substantial text overlap with arXiv:1708.02639

    ACM Class: F.2.3

  25. arXiv:2105.07517  [pdf, ps, other

    cs.CC

    A Stress-Free Sum-of-Squares Lower Bound for Coloring

    Authors: Pravesh K. Kothari, Peter Manohar

    Abstract: We prove that with high probability over the choice of a random graph $G$ from the Erdős-Rényi distribution $G(n,1/2)$, a natural $n^{O(\varepsilon^2 \log n)}$-time, degree $O(\varepsilon^2 \log n)$ sum-of-squares semidefinite program cannot refute the existence of a valid $k$-coloring of $G$ for $k = n^{1/2 +\varepsilon}$. Our result implies that the refutation guarantee of the basic semidefinite… ▽ More

    Submitted 16 May, 2021; originally announced May 2021.

  26. arXiv:2012.02119  [pdf, other

    cs.DS cs.LG math.ST stat.ML

    Robustly Learning Mixtures of $k$ Arbitrary Gaussians

    Authors: Ainesh Bakshi, Ilias Diakonikolas, He Jia, Daniel M. Kane, Pravesh K. Kothari, Santosh S. Vempala

    Abstract: We give a polynomial-time algorithm for the problem of robustly estimating a mixture of $k$ arbitrary Gaussians in $\mathbb{R}^d$, for any fixed $k$, in the presence of a constant fraction of arbitrary corruptions. This resolves the main open problem in several previous works on algorithmic robust statistics, which addressed the special cases of robustly estimating (a) a single Gaussian, (b) a mix… ▽ More

    Submitted 7 June, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: This version extends the previous one to yield 1) robust proper learning algorithm with poly(eps) error and 2) an information theoretic argument proving that the same algorithms in fact also yield parameter recovery guarantees. The updates are included in Sections 7,8, and 9 and the main result from the previous version (Thm 1.4) is presented and proved in Section 6

  27. arXiv:2011.06585  [pdf, ps, other

    cs.LG cs.DS

    Sparse PCA: Algorithms, Adversarial Perturbations and Certificates

    Authors: Tommaso d'Orsi, Pravesh K. Kothari, Gleb Novikov, David Steurer

    Abstract: We study efficient algorithms for Sparse PCA in standard statistical models (spiked covariance in its Wishart form). Our goal is to achieve optimal recovery guarantees while being resilient to small perturbations. Despite a long history of prior works, including explicit studies of perturbation resilience, the best known algorithmic guarantees for Sparse PCA are fragile and break down under small… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

  28. arXiv:2009.08032  [pdf, ps, other

    cs.CC cs.DS

    Strongly refuting all semi-random Boolean CSPs

    Authors: Jackson Abascal, Venkatesan Guruswami, Pravesh K. Kothari

    Abstract: We give an efficient algorithm to strongly refute \emph{semi-random} instances of all Boolean constraint satisfaction problems. The number of constraints required by our algorithm matches (up to polylogarithmic factors) the best-known bounds for efficient refutation of fully random instances. Our main technical contribution is an algorithm to strongly refute semi-random instances of the Boolean… ▽ More

    Submitted 16 September, 2020; originally announced September 2020.

    Comments: 31 Pages

    ACM Class: F.2.2

  29. arXiv:2002.07235  [pdf, ps, other

    cs.CC cs.CR

    Time-Space Tradeoffs for Distinguishing Distributions and Applications to Security of Goldreich's PRG

    Authors: Sumegha Garg, Pravesh K. Kothari, Ran Raz

    Abstract: In this work, we establish lower-bounds against memory bounded algorithms for distinguishing between natural pairs of related distributions from samples that arrive in a streaming setting. In our first result, we show that any algorithm that distinguishes between uniform distribution on $\{0,1\}^n$ and uniform distribution on an $n/2$-dimensional linear subspace of $\{0,1\}^n$ with non-negligibl… ▽ More

    Submitted 17 February, 2020; originally announced February 2020.

    Comments: 35 pages

  30. arXiv:2002.05139  [pdf, ps, other

    cs.DS cs.LG stat.ML

    List-Decodable Subspace Recovery: Dimension Independent Error in Polynomial Time

    Authors: Ainesh Bakshi, Pravesh K. Kothari

    Abstract: In list-decodable subspace recovery, the input is a collection of $n$ points $αn$ (for some $α\ll 1/2$) of which are drawn i.i.d. from a distribution $\mathcal{D}$ with a isotropic rank $r$ covariance $Π_*$ (the \emph{inliers}) and the rest are arbitrary, potential adversarial outliers. The goal is to recover a $O(1/α)$ size list of candidate covariances that contains a $\hatΠ$ close to $Π_*$. Two… ▽ More

    Submitted 7 January, 2021; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: To appear in SODA 2021. This version fixes an issue in a technical claim bounding the variance of degree 2 polynomials and improves exposition

    ACM Class: F.2.2

  31. arXiv:1905.05679  [pdf, ps, other

    cs.DS cs.LG stat.ML

    List-Decodable Linear Regression

    Authors: Sushrut Karmalkar, Adam R. Klivans, Pravesh K. Kothari

    Abstract: We give the first polynomial-time algorithm for robust regression in the list-decodable setting where an adversary can corrupt a greater than $1/2$ fraction of examples. For any $α< 1$, our algorithm takes as input a sample $\{(x_i,y_i)\}_{i \leq n}$ of $n$ linear equations where $αn$ of the equations satisfy $y_i = \langle x_i,\ell^*\rangle +ζ$ for some small noise $ζ$ and $(1-α)n$ of the equat… ▽ More

    Submitted 30 May, 2019; v1 submitted 14 May, 2019; originally announced May 2019.

    Comments: 28 Pages

  32. arXiv:1902.04782  [pdf, ps, other

    cs.LG stat.ML

    On the Expressive Power of Kernel Methods and the Efficiency of Kernel Learning by Association Schemes

    Authors: Pravesh K. Kothari, Roi Livni

    Abstract: We study the expressive power of kernel methods and the algorithmic feasibility of multiple kernel learning for a special rich class of kernels. Specifically, we define \emph{Euclidean kernels}, a diverse class that includes most, if not all, families of kernels studied in literature such as polynomial kernels and radial basis functions. We then describe the geometric and spectral structure of t… ▽ More

    Submitted 13 February, 2019; originally announced February 2019.

  33. arXiv:1806.09426  [pdf, ps, other

    cs.CC

    Sum-of-Squares meets Nash: Optimal Lower Bounds for Finding any Equilibrium

    Authors: Pravesh K. Kothari, Ruta Mehta

    Abstract: Several works have shown unconditional hardness (via integrality gaps) of computing equilibria using strong hierarchies of convex relaxations. Such results however only apply to the problem of computing equilibria that optimize a certain objective function and not to the (arguably more fundamental) task of finding \emph{any} equilibrium. We present an algorithmic model based on the sum-of-square… ▽ More

    Submitted 25 June, 2018; originally announced June 2018.

    ACM Class: F.2.2

    Journal ref: Proceedings of STOC 2018

  34. arXiv:1804.08662  [pdf, ps, other

    cs.CC

    Small-Set Expansion in Shortcode Graph and the 2-to-2 Conjecture

    Authors: Boaz Barak, Pravesh K. Kothari, David Steurer

    Abstract: Dinur, Khot, Kindler, Minzer and Safra (2016) recently showed that the (imperfect completeness variant of) Khot's 2 to 2 games conjecture follows from a combinatorial hypothesis about the soundness of a certain "Grassmanian agreement tester". In this work, we show that the hypothesis of Dinur et. al. follows from a conjecture we call the "Inverse Shortcode Hypothesis" characterizing the non-expand… ▽ More

    Submitted 23 April, 2018; originally announced April 2018.

    Comments: 13 pages

  35. arXiv:1803.03241  [pdf, ps, other

    cs.LG cs.AI cs.DS stat.ML

    Efficient Algorithms for Outlier-Robust Regression

    Authors: Adam Klivans, Pravesh K. Kothari, Raghu Meka

    Abstract: We give the first polynomial-time algorithm for performing linear or polynomial regression resilient to adversarial corruptions in both examples and labels. Given a sufficiently large (polynomial-size) training set drawn i.i.d. from distribution D and subsequently corrupted on some fraction of points, our algorithm outputs a linear function whose squared error is close to the squared error of th… ▽ More

    Submitted 4 June, 2020; v1 submitted 8 March, 2018; originally announced March 2018.

    Comments: 27 pages. Appeared in COLT 2018. This update removes Lemma 6.2 that erroneously claimed an information-theoretic lower bound on error rate as a function of fraction of outliers

  36. arXiv:1803.01768  [pdf, other

    cs.LG

    An Analysis of the t-SNE Algorithm for Data Visualization

    Authors: Sanjeev Arora, Wei Hu, Pravesh K. Kothari

    Abstract: A first line of attack in exploratory data analysis is data visualization, i.e., generating a 2-dimensional representation of data that makes clusters of similar points visually identifiable. Standard Johnson-Lindenstrauss dimensionality reduction does not produce data visualizations. The t-SNE heuristic of van der Maaten and Hinton, which is based on non-convex optimization, has become the de fac… ▽ More

    Submitted 6 June, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

    Comments: In Conference on Learning Theory (COLT) 2018

  37. arXiv:1801.10110  [pdf, ps, other

    cs.GT

    Surprise in Elections

    Authors: Palash Dey, Pravesh K. Kothari, Swaprava Nath

    Abstract: Elections involving a very large voter population often lead to outcomes that surprise many. This is particularly important for the elections in which results affect the economy of a sizable population. A better prediction of the true outcome helps reduce the surprise and keeps the voters prepared. This paper starts from the basic observation that individuals in the underlying population build est… ▽ More

    Submitted 30 January, 2018; originally announced January 2018.

    Comments: 18 pages, 6 figures

  38. arXiv:1711.11581  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Outlier-robust moment-estimation via sum-of-squares

    Authors: Pravesh K. Kothari, David Steurer

    Abstract: We develop efficient algorithms for estimating low-degree moments of unknown distributions in the presence of adversarial outliers. The guarantees of our algorithms improve in many cases significantly over the best previous ones, obtained in recent works of Diakonikolas et al, Lai et al, and Charikar et al. We also show that the guarantees of our algorithms match information-theoretic lower-bounds… ▽ More

    Submitted 23 December, 2017; v1 submitted 30 November, 2017; originally announced November 2017.

    Comments: Fix references for robust mean estimation without exploiting higher-order moments

  39. arXiv:1711.07465  [pdf, ps, other

    cs.LG

    Better Agnostic Clustering Via Relaxed Tensor Norms

    Authors: Pravesh K. Kothari, Jacob Steinhardt

    Abstract: We develop a new family of convex relaxations for $k$-means clustering based on sum-of-squares norms, a relaxation of the injective tensor norm that is efficiently computable using the Sum-of-Squares algorithm. We give an algorithm based on this relaxation that recovers a faithful approximation to the true means in the given data whenever the low-degree moments of the points in each cluster have b… ▽ More

    Submitted 20 November, 2017; originally announced November 2017.

    ACM Class: F.2.0

  40. arXiv:1710.05017  [pdf, ps, other

    cs.DS cs.CC

    The power of sum-of-squares for detecting hidden structures

    Authors: Samuel B. Hopkins, Pravesh K. Kothari, Aaron Potechin, Prasad Raghavendra, Tselil Schramm, David Steurer

    Abstract: We study planted problems---finding hidden structures in random noisy inputs---through the lens of the sum-of-squares semidefinite programming hierarchy (SoS). This family of powerful semidefinite programs has recently yielded many new algorithms for planted problems, often achieving the best known polynomial-time guarantees in terms of accuracy of recovered solutions and robustness to noise. One… ▽ More

    Submitted 13 October, 2017; originally announced October 2017.

  41. arXiv:1709.03871  [pdf, ps, other

    cs.LG

    Agnostic Learning by Refuting

    Authors: Pravesh K. Kothari, Roi Livni

    Abstract: The sample complexity of learning a Boolean-valued function class is precisely characterized by its Rademacher complexity. This has little bearing, however, on the sample complexity of \emph{efficient} agnostic learning. We introduce \emph{refutation complexity}, a natural computational analog of Rademacher complexity of a Boolean concept class and show that it exactly characterizes the sample c… ▽ More

    Submitted 30 November, 2017; v1 submitted 12 September, 2017; originally announced September 2017.

  42. arXiv:1701.04521  [pdf, other

    cs.CC

    Sum of squares lower bounds for refuting any CSP

    Authors: Pravesh K. Kothari, Ryuhei Mori, Ryan O'Donnell, David Witmer

    Abstract: Let $P:\{0,1\}^k \to \{0,1\}$ be a nontrivial $k$-ary predicate. Consider a random instance of the constraint satisfaction problem $\mathrm{CSP}(P)$ on $n$ variables with $Δn$ constraints, each being $P$ applied to $k$ randomly chosen literals. Provided the constraint density satisfies $Δ\gg 1$, such an instance is unsatisfiable with high probability. The \emph{refutation} problem is to efficientl… ▽ More

    Submitted 16 January, 2017; originally announced January 2017.

    Comments: 39 pages, 1 figure

    MSC Class: 68Q17 ACM Class: G.1.6; F.4.1

  43. arXiv:1610.02704  [pdf, other

    cs.CC cs.DM cs.DS math.CO

    Approximating Rectangles by Juntas and Weakly-Exponential Lower Bounds for LP Relaxations of CSPs

    Authors: Pravesh K. Kothari, Raghu Meka, Prasad Raghavendra

    Abstract: We show that for constraint satisfaction problems (CSPs), sub-exponential size linear programming relaxations are as powerful as $n^{Ω(1)}$-rounds of the Sherali-Adams linear programming hierarchy. As a corollary, we obtain sub-exponential size lower bounds for linear programming relaxations that beat random guessing for many CSPs such as MAX-CUT and MAX-3SAT. This is a nearly-exponential improvem… ▽ More

    Submitted 30 December, 2017; v1 submitted 9 October, 2016; originally announced October 2016.

    Comments: Fixed bug in the statement of Theorem 1.7

    ACM Class: F.2.0

  44. arXiv:1604.03084  [pdf, other

    cs.CC

    A Nearly Tight Sum-of-Squares Lower Bound for the Planted Clique Problem

    Authors: Boaz Barak, Samuel B. Hopkins, Jonathan Kelner, Pravesh K. Kothari, Ankur Moitra, Aaron Potechin

    Abstract: We prove that with high probability over the choice of a random graph $G$ from the Erdős-Rényi distribution $G(n,1/2)$, the $n^{O(d)}$-time degree $d$ Sum-of-Squares semidefinite programming relaxation for the clique problem will give a value of at least $n^{1/2-c(d/\log n)^{1/2}}$ for some constant $c>0$. This yields a nearly tight $n^{1/2 - o(1)}$ bound on the value of this program for any degre… ▽ More

    Submitted 12 April, 2016; v1 submitted 11 April, 2016; originally announced April 2016.

    Comments: 55 pages

    ACM Class: F.2.0

  45. arXiv:1507.05230  [pdf, ps, other

    cs.CC

    SoS and Planted Clique: Tight Analysis of MPW Moments at all Degrees and an Optimal Lower Bound at Degree Four

    Authors: Samuel B. Hopkins, Pravesh K. Kothari, Aaron Potechin

    Abstract: The problem of finding large cliques in random graphs and its "planted" variant, where one wants to recover a clique of size $ω\gg \log{(n)}$ added to an \Erdos-\Renyi graph $G \sim G(n,\frac{1}{2})$, have been intensely studied. Nevertheless, existing polynomial time algorithms can only recover planted cliques of size $ω= Ω(\sqrt{n})$. By contrast, information theoretically, one can recover plant… ▽ More

    Submitted 18 July, 2015; originally announced July 2015.

    Comments: 67 pages, 2 figures

    ACM Class: F.2.0