Showing 1–44 of 44 results for author: Moitra, A

  1. arXiv:2406.11686

    cs.LG cs.AI stat.ML

    The Role of Inherent Bellman Error in Offline Reinforcement Learning with Linear Function Approximation

    Authors: Noah Golowich, Ankur Moitra

    Abstract: In this paper, we study the offline RL problem with linear function approximation. Our main structural assumption is that the MDP has low inherent Bellman error, which stipulates that linear value functions have linear Bellman backups with respect to the greedy policy. This assumption is natural in that it is essentially the minimal assumption required for value iteration to succeed. We give a com…

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: RLC 2024

  2. arXiv:2406.11640

    cs.LG cs.AI stat.ML

    Linear Bellman Completeness Suffices for Efficient Online Reinforcement Learning with Few Actions

    Authors: Noah Golowich, Ankur Moitra

    Abstract: One of the most natural approaches to reinforcement learning (RL) with function approximation is value iteration, which inductively generates approximations to the optimal value function by solving a sequence of regression problems. To ensure the success of value iteration, it is typically assumed that Bellman completeness holds, which ensures that these regression problems are well-specified. We…

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: COLT 2024

  3. arXiv:2311.00289

    math.ST stat.ML

    Precise Error Rates for Computationally Efficient Testing

    Authors: Ankur Moitra, Alexander S. Wein

    Abstract: We revisit the fundamental question of simple-versus-simple hypothesis testing with an eye towards computational complexity, as the statistically optimal likelihood ratio test is often computationally intractable in high-dimensional settings. In the classical spiked Wigner model (with a general i.i.d. spike prior) we show that an existing test based on linear spectral statistics achieves the best…

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 30 pages, 1 figure

  4. arXiv:2309.09457

    cs.LG cs.AI cs.DS math.OC stat.ML

    Exploring and Learning in Sparse Linear MDPs without Computationally Intractable Oracles

    Authors: Noah Golowich, Ankur Moitra, Dhruv Rohatgi

    Abstract: The key assumption underlying linear Markov Decision Processes (MDPs) is that the learner has access to a known feature map $φ(x, a)$ that maps state-action pairs to $d$-dimensional vectors, and that the rewards and transitions are linear functions in this representation. But where do these features come from? In the absence of expert domain knowledge, a tempting strategy is to use the "kitchen s…

    Submitted 18 September, 2023; v1 submitted 17 September, 2023; originally announced September 2023.

  5. arXiv:2307.06538

    cs.LG cs.DS math.OC stat.ML

    Tensor Decompositions Meet Control Theory: Learning General Mixtures of Linear Dynamical Systems

    Authors: Ainesh Bakshi, Allen Liu, Ankur Moitra, Morris Yau

    Abstract: Recently Chen and Poor initiated the study of learning mixtures of linear dynamical systems. While linear dynamical systems already have wide-ranging applications in modeling time-series data, using mixture models can lead to a better fit or even a richer understanding of underlying subpopulations represented in the data. In this work we give a new approach to learning mixtures of linear dynamical…

    Submitted 23 July, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: ICML 2023

  6. arXiv:2306.01993

    cs.LG cs.DS stat.ML

    Provable benefits of score matching

    Authors: Chirag Pabbaraju, Dhruv Rohatgi, Anish Sevekari, Holden Lee, Ankur Moitra, Andrej Risteski

    Abstract: Score matching is an alternative to maximum likelihood (ML) for estimating a probability distribution parametrized up to a constant of proportionality. By fitting the "score" of the distribution, it sidesteps the need to compute this constant of proportionality (which is often intractable). While score matching and variants thereof are popular in practice, precise theoretical understanding of th…

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: 25 Pages

  7. arXiv:2301.09519

    math.OC cs.DS cs.LG stat.ML

    A New Approach to Learning Linear Dynamical Systems

    Authors: Ainesh Bakshi, Allen Liu, Ankur Moitra, Morris Yau

    Abstract: Linear dynamical systems are the foundational statistical model upon which control theory is built. Both the celebrated Kalman filter and the linear quadratic regulator require knowledge of the system dynamics to provide analytic guarantees. Naturally, learning the dynamics of a linear dynamical system from linear measurements has been intensively studied since Rudolf Kalman's pioneering work in…

    Submitted 23 January, 2023; originally announced January 2023.

  8. arXiv:2207.11903

    cs.DS cs.LG cs.SI math.PR stat.ML

    Minimax Rates for Robust Community Detection

    Authors: Allen Liu, Ankur Moitra

    Abstract: In this work, we study the problem of community detection in the stochastic block model with adversarial node corruptions. Our main result is an efficient algorithm that can tolerate an $ε$-fraction of corruptions and achieves error $O(ε) + e^{-\frac{C}{2} (1 \pm o(1))}$ where $C = (\sqrt{a} - \sqrt{b})^2$ is the signal-to-noise ratio and $a/n$ and $b/n$ are the inter-community and intra-community…

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: To appear in FOCS 2022

  9. arXiv:2206.03446

    cs.LG cs.AI cs.DS math.OC stat.ML

    Learning in Observable POMDPs, without Computationally Intractable Oracles

    Authors: Noah Golowich, Ankur Moitra, Dhruv Rohatgi

    Abstract: Much of reinforcement learning theory is built on top of oracles that are computationally hard to implement. Specifically for learning near-optimal policies in Partially Observable Markov Decision Processes (POMDPs), existing algorithms either need to make strong assumptions about the model dynamics (e.g. deterministic transitions) or assume access to an oracle for solving a hard optimistic planni…

    Submitted 7 June, 2022; originally announced June 2022.

  10. arXiv:2205.14284

    stat.ML cs.DS cs.LG econ.EM

    Provably Auditing Ordinary Least Squares in Low Dimensions

    Authors: Ankur Moitra, Dhruv Rohatgi

    Abstract: Measuring the stability of conclusions derived from Ordinary Least Squares linear regression is critically important, but most metrics either only measure local stability (i.e. against infinitesimal changes in the data), or are only interpretable under statistical assumptions. Recent work proposes a simple, global, finite-sample stability metric: the minimum number of samples that need to be remov…

    Submitted 5 June, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: 32 pages, 4 figures. Added acknowledgments/funding

  11. arXiv:2201.04735

    cs.LG cs.DS math.OC stat.ML

    Planning in Observable POMDPs in Quasipolynomial Time

    Authors: Noah Golowich, Ankur Moitra, Dhruv Rohatgi

    Abstract: Partially Observable Markov Decision Processes (POMDPs) are a natural and general model in reinforcement learning that take into account the agent's uncertainty about its current state. In the literature on POMDPs, it is customary to assume access to a planning oracle that computes an optimal policy when the parameters are known, even though the problem is known to be computationally hard. Almost…

    Submitted 23 March, 2022; v1 submitted 12 January, 2022; originally announced January 2022.

    Comments: 52 pages

  12. arXiv:2112.06380

    cs.DS cs.LG stat.ML

    Robust Voting Rules from Algorithmic Robust Statistics

    Authors: Allen Liu, Ankur Moitra

    Abstract: Maximum likelihood estimation furnishes powerful insights into voting theory, and the design of voting rules. However, the MLE can usually be badly corrupted by a single outlying sample. This means that a single voter or a group of colluding voters can vote strategically and drastically affect the outcome. Motivated by recent progress in algorithmic robust statistics, we revisit the fundamental pro…

    Submitted 16 July, 2022; v1 submitted 12 December, 2021; originally announced December 2021.

  13. arXiv:2111.06395

    stat.ML cs.DS cs.LG eess.SY

    Kalman Filtering with Adversarial Corruptions

    Authors: Sitan Chen, Frederic Koehler, Ankur Moitra, Morris Yau

    Abstract: Here we revisit the classic problem of linear quadratic estimation, i.e. estimating the trajectory of a linear dynamical system from noisy measurements. The celebrated Kalman filter gives an optimal estimator when the measurement noise is Gaussian, but is widely known to break down when one deviates from this assumption, e.g. when the noise is heavy-tailed. Many ad hoc heuristics have been employe…

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: 57 pages, comments welcome

  14. arXiv:2110.13052

    cs.LG cs.AI cs.DS math.OC stat.ML

    Can Q-Learning be Improved with Advice?

    Authors: Noah Golowich, Ankur Moitra

    Abstract: Despite rapid progress in theoretical reinforcement learning (RL) over the last few years, most of the known guarantees are worst-case in nature, failing to take advantage of structure that may be known a priori about a given RL problem at hand. In this paper we address the question of whether worst-case lower bounds for regret in online learning of Markov decision processes (MDPs) can be circumve…

    Submitted 25 October, 2021; originally announced October 2021.

  15. arXiv:2106.02774

    cs.DS cs.LG math.ST stat.ML

    Robust Model Selection and Nearly-Proper Learning for GMMs

    Authors: Jerry Li, Allen Liu, Ankur Moitra

    Abstract: In learning theory, a standard assumption is that the data is generated from a finite mixture model. But what happens when the number of components is not known in advance? The problem of estimating the number of components, also called model selection, is important in its own right but there are essentially no known efficient algorithms with provable guarantees let alone ones that can tolerate ad…

    Submitted 22 April, 2023; v1 submitted 4 June, 2021; originally announced June 2021.

  16. arXiv:2106.02680

    cs.DS cs.LG stat.ML

    Algorithms from Invariants: Smoothed Analysis of Orbit Recovery over $SO(3)$

    Authors: Allen Liu, Ankur Moitra

    Abstract: In this work we study orbit recovery over $SO(3)$, where the goal is to recover a function on the sphere from noisy, randomly rotated copies of it. We assume that the function is a linear combination of low-degree spherical harmonics. This is a natural abstraction for the problem of recovering the three-dimensional structure of a molecule through cryo-electron tomography. For provably learning the…

    Submitted 1 May, 2022; v1 submitted 4 June, 2021; originally announced June 2021.

  17. arXiv:2104.09665

    cs.LG cs.DS math.ST stat.ML

    Learning GMMs with Nearly Optimal Robustness Guarantees

    Authors: Allen Liu, Ankur Moitra

    Abstract: In this work we solve the problem of robustly learning a high-dimensional Gaussian mixture model with $k$ components from $ε$-corrupted samples up to accuracy $\widetilde{O}(ε)$ in total variation distance for any constant $k$ and with mild assumptions on the mixture. This robustness guarantee is optimal up to polylogarithmic factors. The main challenge is that most earlier works rely on learning…

    Submitted 14 November, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

  18. arXiv:2101.05657

    math.OC cs.DS cs.LG stat.ML

    No-go Theorem for Acceleration in the Hyperbolic Plane

    Authors: Linus Hamilton, Ankur Moitra

    Abstract: In recent years there has been significant effort to adapt the key tools and ideas in convex optimization to the Riemannian setting. One key challenge has remained: Is there a Nesterov-like accelerated gradient method for geodesically convex functions on a Riemannian manifold? Recent work has given partial answers and the hope was that this ought to be possible. Here we dash these hopes. We prov…

    Submitted 16 January, 2021; v1 submitted 14 January, 2021; originally announced January 2021.

    Comments: 12 pages

  19. arXiv:2011.03622

    cs.DS cs.LG math.ST stat.ML

    Settling the Robust Learnability of Mixtures of Gaussians

    Authors: Allen Liu, Ankur Moitra

    Abstract: This work represents a natural coalescence of two important lines of work: learning mixtures of Gaussians and algorithmic robust statistics. In particular we give the first provably robust algorithm for learning mixtures of any constant number of Gaussians. We require only mild assumptions on the mixing weights (bounded fractionality) and that the total variation distance between components is bou…

    Submitted 25 July, 2021; v1 submitted 6 November, 2020; originally announced November 2020.

  20. arXiv:2010.04157

    cs.LG cs.DS stat.ML

    Online and Distribution-Free Robustness: Regression and Contextual Bandits with Huber Contamination

    Authors: Sitan Chen, Frederic Koehler, Ankur Moitra, Morris Yau

    Abstract: In this work we revisit two classic high-dimensional online learning problems, namely linear regression and contextual bandits, from the perspective of adversarial robustness. Existing works in algorithmic robust statistics make strong distributional assumptions that ensure that the input data is evenly spread out or comes from a nice generative model. Is it possible to achieve strong robustness g…

    Submitted 10 June, 2021; v1 submitted 8 October, 2020; originally announced October 2020.

    Comments: 66 pages, 1 figure, v3: refined exposition and improved rates

  21. arXiv:2006.04787

    cs.LG cs.DS math.ST stat.ML

    Classification Under Misspecification: Halfspaces, Generalized Linear Models, and Connections to Evolvability

    Authors: Sitan Chen, Frederic Koehler, Ankur Moitra, Morris Yau

    Abstract: In this paper we revisit some classic problems on classification under misspecification. In particular, we study the problem of learning halfspaces under Massart noise with rate $η$. In a recent work, Diakonikolas, Gouleakis, and Tzamos resolved a long-standing problem by giving the first efficient algorithm for learning to accuracy $η+ ε$ for any $ε> 0$. However, their algorithm outputs a compli…

    Submitted 20 September, 2023; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: 52 pages, v2: updated references

  22. arXiv:2006.03134

    cs.DS cs.LG math.ST stat.ML

    Tensor Completion Made Practical

    Authors: Allen Liu, Ankur Moitra

    Abstract: Tensor completion is a natural higher-order generalization of matrix completion where the goal is to recover a low-rank tensor from sparse observations of its entries. Existing algorithms are either heuristic without provable guarantees, based on solving large semidefinite programs which are impractical to run, or make strong assumptions such as requiring the factors to be nearly orthogonal. In th…

    Submitted 25 December, 2021; v1 submitted 4 June, 2020; originally announced June 2020.

    Comments: NeurIPS 2020

  23. arXiv:2002.10435

    cs.LG cs.DS stat.ML

    Learning Structured Distributions From Untrusted Batches: Faster and Simpler

    Authors: Sitan Chen, Jerry Li, Ankur Moitra

    Abstract: We revisit the problem of learning from untrusted batches introduced by Qiao and Valiant [QV17]. Recently, Jain and Orlitsky [JO19] gave a simple semidefinite programming approach based on the cut-norm that achieves essentially information-theoretically optimal error in polynomial time. Concurrently, Chen et al. [CLM19] considered a variant of the problem where $μ$ is assumed to be structured, e.g…

    Submitted 7 June, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: 37 pages, version 2 includes experiments

  24. arXiv:2002.05576

    math.PR cs.DS cs.LG stat.ML

    Fast Convergence for Langevin Diffusion with Manifold Structure

    Authors: Ankur Moitra, Andrej Risteski

    Abstract: In this paper, we study the problem of sampling from distributions of the form $p(x) \propto e^{-βf(x)}$ for some function $f$ whose values and gradients we can query. This mode of access to $f$ is natural in the scenarios in which such problems arise, for instance sampling from posteriors in parametric Bayesian models. Classical results show that a natural random walk, Langevin diffusion, mixes rapidly…

    Submitted 21 September, 2020; v1 submitted 13 February, 2020; originally announced February 2020.

    Comments: 52 pages, in submission to NeurIPS 2020. This version: various typos fixed, minor reorganization

  25. arXiv:1912.01745

    math.OC stat.ML

    Polynomial time guarantees for the Burer-Monteiro method

    Authors: Diego Cifuentes, Ankur Moitra

    Abstract: The Burer-Monteiro method is one of the most widely used techniques for solving large-scale semidefinite programs (SDP). The basic idea is to solve a nonconvex program in $Y$, where $Y$ is an $n \times p$ matrix such that $X = Y Y^T$. In this paper, we show that this method can solve SDPs in polynomial time in a smoothed analysis setting. More precisely, we consider an SDP whose domain satisfies s…

    Submitted 7 May, 2021; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: 26 pages, 2 figures

    MSC Class: 90C22 (Primary) 90C26 (Secondary)

  26. arXiv:1911.02035

    cs.DS cs.LG stat.ML

    Efficiently Learning Structured Distributions from Untrusted Batches

    Authors: Sitan Chen, Jerry Li, Ankur Moitra

    Abstract: We study the problem, introduced by Qiao and Valiant, of learning from untrusted batches. Here, we assume $m$ users, all of whom have samples from some underlying distribution $p$ over $1, \ldots, n$. Each user sends a batch of $k$ i.i.d. samples from this distribution; however an $ε$-fraction of users are untrustworthy and can send adversarially chosen responses. The goal is then to learn $p$ in…

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: 46 pages

  27. arXiv:1905.01282

    cs.LG cs.DS math.ST stat.ML

    Learning Some Popular Gaussian Graphical Models without Condition Number Bounds

    Authors: Jonathan Kelner, Frederic Koehler, Raghu Meka, Ankur Moitra

    Abstract: Gaussian Graphical Models (GGMs) have wide-ranging applications in machine learning and the natural and social sciences. In most of the settings in which they are applied, the number of observed samples is much smaller than the dimension and they are assumed to be sparse. While there are a variety of algorithms (e.g. Graphical Lasso, CLIME) that provably recover the graph structure with a logarith…

    Submitted 8 March, 2020; v1 submitted 3 May, 2019; originally announced May 2019.

    Comments: V2: Updated version with some new results

  28. arXiv:1811.00944

    cs.DS cs.LG stat.ML

    Spectral Methods from Tensor Networks

    Authors: Ankur Moitra, Alexander S. Wein

    Abstract: A tensor network is a diagram that specifies a way to "multiply" a collection of tensors together to produce another tensor (or matrix). Many existing algorithms for tensor problems (such as tensor decomposition and tensor PCA), although they are not presented this way, can be viewed as spectral methods on matrices built from simple tensor networks. In this work we leverage the full power of this…

    Submitted 2 November, 2018; originally announced November 2018.

    Comments: 30 pages, 8 figures

  29. arXiv:1808.05731

    cs.DS cs.LG stat.ML

    Efficiently Learning Mixtures of Mallows Models

    Authors: Allen Liu, Ankur Moitra

    Abstract: Mixtures of Mallows models are a popular generative model for ranking data coming from a heterogeneous population. They have a variety of applications including social choice, recommendation systems and natural language processing. Here we give the first polynomial time algorithm for provably learning the parameters of a mixture of Mallows models with any constant number of components. Prior to ou…

    Submitted 16 August, 2018; originally announced August 2018.

    Comments: 35 pages

    Journal ref: FOCS 2018

  30. arXiv:1807.00891

    math.ST cs.DS cs.IT math.PR stat.ML

    Optimality and Sub-optimality of PCA I: Spiked Random Matrix Models

    Authors: Amelia Perry, Alexander S. Wein, Afonso S. Bandeira, Ankur Moitra

    Abstract: A central problem of random matrix theory is to understand the eigenvalues of spiked random matrix models, introduced by Johnstone, in which a prominent eigenvector (or "spike") is planted into a random matrix. These distributions form natural statistical models for principal component analysis (PCA) problems throughout the sciences. Baik, Ben Arous and Péché showed that the spiked Wishart ensembl…

    Submitted 12 July, 2018; v1 submitted 2 July, 2018; originally announced July 2018.

    Comments: 67 pages, 3 figures. This is the journal version of part I of arXiv:1609.05573, accepted to the Annals of Statistics. This version includes the supplementary material as appendices

    MSC Class: 62H15; 62B15

    Journal ref: Ann. Statist., Volume 46, Number 5 (2018), 2416-2451

  31. arXiv:1805.10262

    cs.LG cs.DS math.PR stat.ML

    Learning Restricted Boltzmann Machines via Influence Maximization

    Authors: Guy Bresler, Frederic Koehler, Ankur Moitra, Elchanan Mossel

    Abstract: Graphical models are a rich language for describing high-dimensional distributions in terms of their dependence structure. While there are algorithms with provable guarantees for learning undirected graphical models in a variety of settings, there has been much less progress in the important scenario when there are latent variables. Here we study Restricted Boltzmann Machines (or RBMs), which are…

    Submitted 5 November, 2018; v1 submitted 25 May, 2018; originally announced May 2018.

    Comments: 29 pages

  32. arXiv:1803.06521

    cs.LG cs.CC cs.DS stat.ML

    Beyond the Low-Degree Algorithm: Mixtures of Subcubes and Their Applications

    Authors: Sitan Chen, Ankur Moitra

    Abstract: We introduce the problem of learning mixtures of $k$ subcubes over $\{0,1\}^n$, which contains many classic learning theory problems as a special case (and is itself a special case of others). We give a surprising $n^{O(\log k)}$-time learning algorithm based on higher-order multilinear moments. It is not possible to learn the parameters because the same distribution can be represented by quite di…

    Submitted 19 February, 2019; v1 submitted 17 March, 2018; originally announced March 2018.

    Comments: 62 pages; to appear in STOC 2019

  33. arXiv:1704.03866

    cs.DS cs.IT cs.LG math.ST stat.ML

    Robustly Learning a Gaussian: Getting Optimal Error, Efficiently

    Authors: Ilias Diakonikolas, Gautam Kamath, Daniel M. Kane, Jerry Li, Ankur Moitra, Alistair Stewart

    Abstract: We study the fundamental problem of learning the parameters of a high-dimensional Gaussian in the presence of noise -- where an $\varepsilon$-fraction of our samples were chosen by an adversary. We give robust estimators that achieve estimation error $O(\varepsilon)$ in the total variation distance, which is optimal up to a universal constant that is independent of the dimension. In the case whe…

    Submitted 5 November, 2017; v1 submitted 12 April, 2017; originally announced April 2017.

    Comments: To appear in SODA 2018

  34. arXiv:1703.00893

    cs.LG cs.DS cs.IT stat.ML

    Being Robust (in High Dimensions) Can Be Practical

    Authors: Ilias Diakonikolas, Gautam Kamath, Daniel M. Kane, Jerry Li, Ankur Moitra, Alistair Stewart

    Abstract: Robust estimation is much more challenging in high dimensions than it is in one dimension: Most techniques either lead to intractable optimization problems or estimators that can tolerate only a tiny fraction of errors. Recent work in theoretical computer science has shown that, in appropriate distributional models, it is possible to robustly estimate the mean and covariance with polynomial time a…

    Submitted 13 March, 2018; v1 submitted 2 March, 2017; originally announced March 2017.

    Comments: Appeared in ICML 2017

  35. arXiv:1610.04583

    cs.IT cs.CV cs.DS math.OC stat.ML

    Message-passing algorithms for synchronization problems over compact groups

    Authors: Amelia Perry, Alexander S. Wein, Afonso S. Bandeira, Ankur Moitra

    Abstract: Various alignment problems arising in cryo-electron microscopy, community detection, time synchronization, computer vision, and other fields fall into a common framework of synchronization problems over compact groups such as Z/L, U(1), or SO(3). The goal of such problems is to estimate an unknown vector of group elements given noisy relative observations. We present an efficient iterative algorit…

    Submitted 14 October, 2016; originally announced October 2016.

    Comments: 35 pages, 11 figures

  36. arXiv:1609.05573

    math.ST cs.DS cs.IT math.PR stat.ML

    Optimality and Sub-optimality of PCA for Spiked Random Matrices and Synchronization

    Authors: Amelia Perry, Alexander S. Wein, Afonso S. Bandeira, Ankur Moitra

    Abstract: A central problem of random matrix theory is to understand the eigenvalues of spiked random matrix models, in which a prominent eigenvector is planted into a random matrix. These distributions form natural statistical models for principal component analysis (PCA) problems throughout the sciences. Baik, Ben Arous and Péché showed that the spiked Wishart ensemble exhibits a sharp phase transition as…

    Submitted 23 December, 2016; v1 submitted 18 September, 2016; originally announced September 2016.

    Comments: 58 pages, 5 figures. This version adds improved results for the Wishart model

    MSC Class: 62H15; 62B15

  37. arXiv:1605.08491

    cs.LG stat.ML

    Provable Algorithms for Inference in Topic Models

    Authors: Sanjeev Arora, Rong Ge, Frederic Koehler, Tengyu Ma, Ankur Moitra

    Abstract: Recently, there has been considerable progress on designing algorithms with provable guarantees -- typically using linear algebraic methods -- for parameter learning in latent variable models. But designing provable algorithms for inference has proven to be more challenging. Here we take a first step towards provable inference in topic models. We leverage a property of topic models that enables us…

    Submitted 26 May, 2016; originally announced May 2016.

    Comments: to appear at ICML'2016

  38. arXiv:1604.06443

    cs.DS cs.IT cs.LG math.ST stat.ML

    Robust Estimators in High Dimensions without the Computational Intractability

    Authors: Ilias Diakonikolas, Gautam Kamath, Daniel Kane, Jerry Li, Ankur Moitra, Alistair Stewart

    Abstract: We study high-dimensional distribution learning in an agnostic setting where an adversary is allowed to arbitrarily corrupt an $\varepsilon$-fraction of the samples. Such questions have a rich history spanning statistics, machine learning and theoretical computer science. Even in the most basic settings, the only known approaches are either computationally inefficient or lose dimension-dependent f…

    Submitted 14 March, 2019; v1 submitted 21 April, 2016; originally announced April 2016.

  39. arXiv:1511.01473

    cs.DS cs.IT cs.LG math.PR stat.ML

    How Robust are Reconstruction Thresholds for Community Detection?

    Authors: Ankur Moitra, William Perry, Alexander S. Wein

    Abstract: The stochastic block model is one of the oldest and most ubiquitous models for studying clustering and community detection. In an exciting sequence of developments, motivated by deep but non-rigorous ideas from statistical physics, Decelle et al. conjectured a sharp threshold for when community detection is possible in the sparse regime. Mossel, Neeman and Sly and Massoulie proved the conjecture a…

    Submitted 21 March, 2016; v1 submitted 4 November, 2015; originally announced November 2015.

    Comments: 36 pages, 3 figures

  40. arXiv:1503.00778

    cs.LG cs.DS cs.NE stat.ML

    Simple, Efficient, and Neural Algorithms for Sparse Coding

    Authors: Sanjeev Arora, Rong Ge, Tengyu Ma, Ankur Moitra

    Abstract: Sparse coding is a basic task in many fields including signal processing, neuroscience and machine learning where the goal is to learn a basis that enables a sparse representation of a given set of data, if one exists. Its standard formulation is as a non-convex optimization problem which is solved in practice by heuristics based on alternating minimization. Recent work has resulted in several a…

    Submitted 2 March, 2015; originally announced March 2015.

    Comments: 37 pages, 1 figure

  41. arXiv:1501.06521

    cs.LG cs.DS stat.ML

    Noisy Tensor Completion via the Sum-of-Squares Hierarchy

    Authors: Boaz Barak, Ankur Moitra

    Abstract: In the noisy tensor completion problem we observe $m$ entries (whose location is chosen uniformly at random) from an unknown $n_1 \times n_2 \times n_3$ tensor $T$. We assume that $T$ is entry-wise close to being rank $r$. Our goal is to fill in its missing entries using as few observations as possible. Let $n = \max(n_1, n_2, n_3)$. We show that if $m = n^{3/2} r$ then there is a polynomial time…

    Submitted 18 February, 2016; v1 submitted 26 January, 2015; originally announced January 2015.

    Comments: 24 pages

  42. arXiv:1311.3651

    cs.DS cs.LG stat.ML

    Smoothed Analysis of Tensor Decompositions

    Authors: Aditya Bhaskara, Moses Charikar, Ankur Moitra, Aravindan Vijayaraghavan

    Abstract: Low rank tensor decompositions are a powerful tool for learning generative models, and uniqueness results give them a significant advantage over matrix decomposition methods. However, tensors pose significant algorithmic challenges and tensor analogs of much of the matrix algebra toolkit are unlikely to exist because of hardness results. Efficient decomposition in the overcomplete case (where ran…

    Submitted 20 January, 2014; v1 submitted 14 November, 2013; originally announced November 2013.

    Comments: 32 pages (including appendix)

  43. arXiv:1308.6273

    cs.DS cs.LG stat.ML

    New Algorithms for Learning Incoherent and Overcomplete Dictionaries

    Authors: Sanjeev Arora, Rong Ge, Ankur Moitra

    Abstract: In sparse recovery we are given a matrix $A$ (the dictionary) and a vector of the form $A X$ where $X$ is sparse, and the goal is to recover $X$. This is a central notion in signal processing, statistics and machine learning. But in applications such as sparse coding, edge detection, compression and super resolution, the dictionary $A$ is unknown and has to be learned from random examples of the f…

    Submitted 26 May, 2014; v1 submitted 28 August, 2013; originally announced August 2013.

  44. arXiv:1212.4777

    cs.LG cs.DS stat.ML

    A Practical Algorithm for Topic Modeling with Provable Guarantees

    Authors: Sanjeev Arora, Rong Ge, Yoni Halpern, David Mimno, Ankur Moitra, David Sontag, Yichen Wu, Michael Zhu

    Abstract: Topic models provide a useful method for dimensionality reduction and exploratory data analysis in large text corpora. Most approaches to topic model inference have been based on a maximum likelihood objective. Efficient algorithms exist that approximate this objective, but they have no provable guarantees. Recently, algorithms have been introduced that provide provable bounds, but these algorithm…

    Submitted 19 December, 2012; originally announced December 2012.

    Comments: 26 pages