Showing 1–38 of 38 results for author: Wein, A

  1. arXiv:2404.18735

    math.ST cs.CC cs.DS math.PR stat.ML

    Tensor cumulants for statistical inference on invariant distributions

    Authors: Dmitriy Kunisky, Cristopher Moore, Alexander S. Wein

    Abstract: Many problems in high-dimensional statistics appear to have a statistical-computational gap: a range of values of the signal-to-noise ratio where inference is information-theoretically possible, but (conjecturally) computationally intractable. A canonical such problem is Tensor PCA, where we observe a tensor $Y$ consisting of a rank-one signal plus Gaussian noise. Multiple lines of work suggest th…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 72 pages, 12 figures

  2. arXiv:2402.05451

    cs.DS cs.CC stat.ML

    Low-degree phase transitions for detecting a planted clique in sublinear time

    Authors: Jay Mardia, Kabir Aladin Verchand, Alexander S. Wein

    Abstract: We consider the problem of detecting a planted clique of size $k$ in a random graph on $n$ vertices. When the size of the clique exceeds $Θ(\sqrt{n})$, polynomial-time algorithms for detection proliferate. We study faster -- namely, sublinear time -- algorithms in the high-signal regime when $k = Θ(n^{1/2 + δ})$, for some $δ > 0$. To this end, we consider algorithms that non-adaptively query a subs…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 23 pages, 2 figures

  3. arXiv:2402.00305

    math.ST cs.IT cs.SI stat.ML

    Information-Theoretic Thresholds for Planted Dense Cycles

    Authors: Cheng Mao, Alexander S. Wein, Shenduo Zhang

    Abstract: We study a random graph model for small-world networks, which are ubiquitous in social and biological sciences. In this model, a dense cycle of expected bandwidth $n τ$, representing the hidden one-dimensional geometry of vertices, is planted in an ambient random graph on $n$ vertices. For both detection and recovery of the planted dense cycle, we characterize the information-theoretic thresholds i…

    Submitted 31 January, 2024; originally announced February 2024.

    Comments: 31 pages, 1 figure

    MSC Class: 94A15; 62B10; 68Q87; 05C80; 05C60

  4. arXiv:2312.13554

    cs.DS math.OC math.PR

    Time Lower Bounds for the Metropolis Process and Simulated Annealing

    Authors: Zongchen Chen, Dan Mikulincer, Daniel Reichman, Alexander S. Wein

    Abstract: The Metropolis process (MP) and Simulated Annealing (SA) are stochastic local search heuristics that are often used in solving combinatorial optimization problems. Despite significant interest, there are very few theoretical results regarding the quality of approximation obtained by MP and SA (with polynomially many iterations) for NP-hard optimization problems. We provide rigorous lower bounds…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 44 pages

  5. arXiv:2304.08135

    cs.DS cs.CC math.ST stat.ML

    Detection of Dense Subhypergraphs by Low-Degree Polynomials

    Authors: Abhishek Dhawan, Cheng Mao, Alexander S. Wein

    Abstract: Detection of a planted dense subgraph in a random graph is a fundamental statistical and computational problem that has been extensively studied in recent years. We study a hypergraph version of the problem. Let $G^r(n,p)$ denote the $r$-uniform Erdős-Rényi hypergraph model with $n$ vertices and edge density $p$. We consider detecting the presence of a planted $G^r(n^γ, n^{-α})$ subhypergraph in a…

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: 31 pages

  6. arXiv:2303.00252

    cs.CC cs.DS

    Is Planted Coloring Easier than Planted Clique?

    Authors: Pravesh K. Kothari, Santosh S. Vempala, Alexander S. Wein, Jeff Xu

    Abstract: We study the computational complexity of two related problems: recovering a planted $q$-coloring in $G(n,1/2)$, and finding efficiently verifiable witnesses of non-$q$-colorability (a.k.a. refutations) in $G(n,1/2)$. Our main results show hardness for both these problems in a restricted-but-powerful class of algorithms based on computing low-degree polynomials in the inputs. The problem of recov…

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: 23 pages

  7. arXiv:2302.06737

    math.ST cs.DS stat.ML

    Detection-Recovery Gap for Planted Dense Cycles

    Authors: Cheng Mao, Alexander S. Wein, Shenduo Zhang

    Abstract: Planted dense cycles are a type of latent structure that appears in many applications, such as small-world networks in social sciences and sequence assembly in computational biology. We consider a model where a dense cycle with expected bandwidth $n τ$ and edge density $p$ is planted in an Erdős-Rényi graph $G(n,q)$. We characterize the computational thresholds for the associated detection and rec…

    Submitted 20 June, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: 41 pages, 1 figure

  8. arXiv:2212.10872

    math.ST cs.CC cs.DS math.CO stat.ML

    Is it easier to count communities than find them?

    Authors: Cynthia Rush, Fiona Skerman, Alexander S. Wein, Dana Yang

    Abstract: Random graph models with community structure have been studied extensively in the literature. For both the problems of detecting and recovering community structure, an interesting landscape of statistical and computational phase transitions has emerged. A natural unanswered question is: might it be possible to infer properties of the community structure (for instance, the number and sizes of commu…

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: Accepted to Innovations in Theoretical Computer Science (ITCS) 2023

    MSC Class: 05C80; 62F03; 68Q25

    ACM Class: F.2; G.2

  9. arXiv:2211.05274

    cs.CC cs.LG stat.ML

    Average-Case Complexity of Tensor Decomposition for Low-Degree Polynomials

    Authors: Alexander S. Wein

    Abstract: Suppose we are given an $n$-dimensional order-3 symmetric tensor $T \in (\mathbb{R}^n)^{\otimes 3}$ that is the sum of $r$ random rank-1 terms. The problem of recovering the rank-1 components is possible in principle when $r \lesssim n^2$ but polynomial-time algorithms are only known in the regime $r \ll n^{3/2}$. Similar "statistical-computational gaps" occur in many high-dimensional inference ta…

    Submitted 26 March, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

    Comments: 42 pages; STOC 2023

  10. arXiv:2208.09493

    cs.DS math.OC math.PR math.ST stat.ML

    Near-optimal fitting of ellipsoids to random points

    Authors: Aaron Potechin, Paxton Turner, Prayaag Venkat, Alexander S. Wein

    Abstract: Given independent standard Gaussian points $v_1, \ldots, v_n$ in dimension $d$, for what values of $(n, d)$ does there exist with high probability an origin-symmetric ellipsoid that simultaneously passes through all of the points? This basic problem of fitting an ellipsoid to random points has connections to low-rank matrix decompositions, independent component analysis, and principal component an…

    Submitted 1 June, 2023; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: An earlier version of this paper contained an error in the proof of Proposition 5.2. The current version contains a corrected proof of the original result

  11. arXiv:2206.07640

    stat.ML cs.DS cs.IT cs.LG math.ST

    Statistical and Computational Phase Transitions in Group Testing

    Authors: Amin Coja-Oghlan, Oliver Gebhard, Max Hahn-Klimroth, Alexander S. Wein, Ilias Zadik

    Abstract: We study the group testing problem where the goal is to identify a set of k infected individuals carrying a rare disease within a population of size n, based on the outcomes of pooled tests which return positive whenever there is at least one infected individual in the tested group. We consider two different simple random procedures for assigning individuals to tests: the constant-column design an…

    Submitted 15 June, 2022; originally announced June 2022.

    Comments: Accepted for presentation at the Conference on Learning Theory (COLT) 2022

  12. arXiv:2205.09727

    math.ST cond-mat.stat-mech cs.CC cs.DS stat.ML

    The Franz-Parisi Criterion and Computational Trade-offs in High Dimensional Statistics

    Authors: Afonso S. Bandeira, Ahmed El Alaoui, Samuel B. Hopkins, Tselil Schramm, Alexander S. Wein, Ilias Zadik

    Abstract: Many high-dimensional statistical inference problems are believed to possess inherent computational hardness. Various frameworks have been proposed to give rigorous evidence for such hardness, including lower bounds against restricted models of computation (such as low-degree functions), as well as methods rooted in statistical physics that are based on free energy landscapes. This paper aims to m…

    Submitted 13 October, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: 52 pages, 1 figure

  13. arXiv:2112.03898

    cs.LG cs.CC cs.DS math.ST stat.ML

    Lattice-Based Methods Surpass Sum-of-Squares in Clustering

    Authors: Ilias Zadik, Min Jae Song, Alexander S. Wein, Joan Bruna

    Abstract: Clustering is a fundamental primitive in unsupervised learning which gives rise to a rich class of computationally-challenging inference tasks. In this work, we focus on the canonical task of clustering d-dimensional Gaussian mixtures with unknown (and possibly degenerate) covariance. Recent works (Ghosh et al. '20; Mao, Wein '21; Davis, Diaz, Wang '21) have established lower bounds against the cl…

    Submitted 7 January, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: Added a new tight information-theoretic lower bound for label recovery

  14. arXiv:2109.01342

    cs.CC math.PR

    Circuit Lower Bounds for the p-Spin Optimization Problem

    Authors: David Gamarnik, Aukosh Jagannath, Alexander S. Wein

    Abstract: We consider the problem of finding a near ground state of a $p$-spin model with Rademacher couplings by means of a low-depth circuit. As a direct extension of the authors' recent work [Gamarnik, Jagannath, Wein 2020], we establish that any poly-size $n$-output circuit that produces a spin assignment with objective value within a certain constant factor of optimality, must have depth at least…

    Submitted 21 January, 2022; v1 submitted 3 September, 2021; originally announced September 2021.

    Comments: 14 pages

  15. arXiv:2105.15081

    math.ST cs.DS stat.ML

    Optimal Spectral Recovery of a Planted Vector in a Subspace

    Authors: Cheng Mao, Alexander S. Wein

    Abstract: Recovering a planted vector $v$ in an $n$-dimensional random subspace of $\mathbb{R}^N$ is a generic task related to many problems in machine learning and statistics, such as dictionary learning, subspace recovery, principal component analysis, and non-Gaussian component analysis. In this work, we study computationally efficient estimation and detection of a planted vector $v$ whose $\ell_4$ norm…

    Submitted 13 October, 2022; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: 54 pages

  16. arXiv:2012.02243

    cs.DS cs.CC math.OC

    Average-Case Integrality Gap for Non-Negative Principal Component Analysis

    Authors: Afonso S. Bandeira, Dmitriy Kunisky, Alexander S. Wein

    Abstract: Montanari and Richard (2015) asked whether a natural semidefinite programming (SDP) relaxation can effectively optimize $\mathbf{x}^{\top}\mathbf{W} \mathbf{x}$ over $\|\mathbf{x}\| = 1$ with $x_i \geq 0$ for all coordinates $i$, where $\mathbf{W} \in \mathbb{R}^{n \times n}$ is drawn from the Gaussian orthogonal ensemble (GOE) or a spiked matrix model. In small numerical experiments, this SDP app…

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: 12 pages, 3 figures

  17. arXiv:2010.06563

    cs.CC cs.DS math.PR stat.ML

    Optimal Low-Degree Hardness of Maximum Independent Set

    Authors: Alexander S. Wein

    Abstract: We study the algorithmic task of finding a large independent set in a sparse Erdős-Rényi random graph with $n$ vertices and average degree $d$. The maximum independent set is known to have size $(2 \log d / d)n$ in the double limit $n \to \infty$ followed by $d \to \infty$, but the best known polynomial-time algorithms can only find an independent set of half-optimal size $(\log d / d)n$. We show…

    Submitted 12 November, 2020; v1 submitted 13 October, 2020; originally announced October 2020.

    Comments: 19 pages

  18. arXiv:2008.12237

    cs.CC cs.DS cs.SI math.CO math.PR

    Spectral Planting and the Hardness of Refuting Cuts, Colorability, and Communities in Random Graphs

    Authors: Afonso S. Bandeira, Jess Banks, Dmitriy Kunisky, Cristopher Moore, Alexander S. Wein

    Abstract: We study the problem of efficiently refuting the k-colorability of a graph, or equivalently certifying a lower bound on its chromatic number. We give formal evidence of average-case computational hardness for this problem in sparse random regular graphs, showing optimality of a simple spectral certificate. This evidence takes the form of a computationally-quiet planting: we construct a distributio…

    Submitted 27 August, 2020; originally announced August 2020.

    Comments: 59 pages

  19. arXiv:2008.02269

    math.ST cs.CC cs.DS stat.ML

    Computational Barriers to Estimation from Low-Degree Polynomials

    Authors: Tselil Schramm, Alexander S. Wein

    Abstract: One fundamental goal of high-dimensional statistics is to detect or recover planted structure (such as a low-rank matrix) hidden in noisy data. A growing body of work studies low-degree polynomials as a restricted model of computation for such problems: it has been demonstrated in various settings that low-degree polynomials of the data can match the statistical performance of the best known polyn…

    Submitted 18 June, 2022; v1 submitted 5 August, 2020; originally announced August 2020.

    Comments: v2 adds new results on planted clique

    Journal ref: Annals of Statistics 2022, Vol. 50, No. 3, 1833-1858

  20. arXiv:2006.10689

    math.PR cs.DS cs.LG math.OC math.ST

    Free Energy Wells and Overlap Gap Property in Sparse PCA

    Authors: Gérard Ben Arous, Alexander S. Wein, Ilias Zadik

    Abstract: We study a variant of the sparse PCA (principal component analysis) problem in the "hard" regime, where the inference task is possible yet no polynomial-time algorithm is known to exist. Prior work, based on the low-degree likelihood ratio, has conjectured a precise expression for the best possible (sub-exponential) runtime throughout the hard regime. Following instead a statistical physics inspir…

    Submitted 18 June, 2020; originally announced June 2020.

    Comments: 63 pages. Accepted for presentation at the Conference on Learning Theory (COLT) 2020

  21. arXiv:2005.11270

    math.ST cs.CC cs.LG stat.ML

    The Average-Case Time Complexity of Certifying the Restricted Isometry Property

    Authors: Yunzi Ding, Dmitriy Kunisky, Alexander S. Wein, Afonso S. Bandeira

    Abstract: In compressed sensing, the restricted isometry property (RIP) on $M \times N$ sensing matrices (where $M < N$) guarantees efficient reconstruction of sparse vectors. A matrix has the $(s,δ)$-$\mathsf{RIP}$ property if it behaves as a $δ$-approximate isometry on $s$-sparse vectors. It is well known that an $M\times N$ matrix with i.i.d. $\mathcal{N}(0,1/M)$ entries is $(s,δ)$-$\mathsf{RIP}$ with high…

    Submitted 22 April, 2021; v1 submitted 22 May, 2020; originally announced May 2020.

    Comments: 14 pages

  22. arXiv:2005.10817

    math.ST cs.CC cs.LG stat.ML

    Computationally efficient sparse clustering

    Authors: Matthias Löffler, Alexander S. Wein, Afonso S. Bandeira

    Abstract: We study statistical and computational limits of clustering when the means of the centres are sparse and their dimension is possibly much larger than the sample size. Our theoretical analysis focuses on the model $X_i = z_i θ + \varepsilon_i, ~z_i \in \{-1,1\}, ~\varepsilon_i \thicksim \mathcal{N}(0,I)$, which has two clusters with centres $θ$ and $-θ$. We provide a finite sample analysis of a new…

    Submitted 22 March, 2021; v1 submitted 21 May, 2020; originally announced May 2020.

    Comments: 33 pages

    MSC Class: 62H30

  23. arXiv:2004.12063

    cs.CC cs.DS math-ph math.PR stat.ML

    Hardness of Random Optimization Problems for Boolean Circuits, Low-Degree Polynomials, and Langevin Dynamics

    Authors: David Gamarnik, Aukosh Jagannath, Alexander S. Wein

    Abstract: We consider the problem of finding nearly optimal solutions of optimization problems with random objective functions. Two concrete problems we consider are (a) optimizing the Hamiltonian of a spherical or Ising $p$-spin glass model, and (b) finding a large independent set in a sparse Erdős-Rényi graph. The following families of algorithms are considered: (a) low-degree polynomials of the input; (b…

    Submitted 26 January, 2022; v1 submitted 25 April, 2020; originally announced April 2020.

    Comments: 41 pages; v1 is the conference paper "Low-Degree Hardness of Random Optimization Problems" (FOCS 2020); v2 is a journal version which adds circuit lower bounds for max independent set, based on ideas from our note arXiv:2109.01342

  24. arXiv:2004.08454

    cs.CC cs.DS stat.ML

    Counterexamples to the Low-Degree Conjecture

    Authors: Justin Holmgren, Alexander S. Wein

    Abstract: A conjecture of Hopkins (2018) posits that for certain high-dimensional hypothesis testing problems, no polynomial-time algorithm can outperform so-called "simple statistics", which are low-degree polynomials in the data. This conjecture formalizes the beliefs surrounding a line of recent work that seeks to understand statistical-versus-computational tradeoffs via the low-degree likelihood ratio.…

    Submitted 17 April, 2020; originally announced April 2020.

    Comments: 10 pages

  25. arXiv:1907.11636

    math.ST cs.CC cs.DS stat.ML

    Notes on Computational Hardness of Hypothesis Testing: Predictions using the Low-Degree Likelihood Ratio

    Authors: Dmitriy Kunisky, Alexander S. Wein, Afonso S. Bandeira

    Abstract: These notes survey and explore an emerging method, which we call the low-degree method, for predicting and understanding statistical-versus-computational tradeoffs in high-dimensional inference problems. In short, the method posits that a certain quantity -- the second moment of the low-degree likelihood ratio -- gives insight into how much computational time is required to solve a given hypothesi…

    Submitted 26 July, 2019; originally announced July 2019.

    Comments: 44 pages

  26. arXiv:1907.11635

    math.ST cs.CC cs.DS stat.ML

    Subexponential-Time Algorithms for Sparse PCA

    Authors: Yunzi Ding, Dmitriy Kunisky, Alexander S. Wein, Afonso S. Bandeira

    Abstract: We study the computational cost of recovering a unit-norm sparse principal component $x \in \mathbb{R}^n$ planted in a random matrix, in either the Wigner or Wishart spiked model (observing either $W + λ x x^\top$ with $W$ drawn from the Gaussian orthogonal ensemble, or $N$ independent samples from $\mathcal{N}(0, I_n + β x x^\top)$, respectively). Prior work has shown that when the signal-to-noise ra…

    Submitted 23 June, 2022; v1 submitted 26 July, 2019; originally announced July 2019.

    Comments: 44 pages

  27. arXiv:1904.03858

    cs.DS cond-mat.stat-mech cs.LG math.ST stat.ML

    The Kikuchi Hierarchy and Tensor PCA

    Authors: Alexander S. Wein, Ahmed El Alaoui, Cristopher Moore

    Abstract: For the tensor PCA (principal component analysis) problem, we propose a new hierarchy of increasingly powerful algorithms with increasing runtime. Our hierarchy is analogous to the sum-of-squares (SOS) hierarchy but is instead inspired by statistical physics and related algorithms such as belief propagation and AMP (approximate message passing). Our level-$\ell$ algorithm can be thought of as a li…

    Submitted 1 October, 2019; v1 submitted 8 April, 2019; originally announced April 2019.

    Comments: 42 pages. This version adds results on odd-order tensor PCA and even-arity XOR refutation

    MSC Class: 68Q87

    ACM Class: F.2.2

  28. arXiv:1902.07324

    cs.DS cs.CC math.ST

    Computational Hardness of Certifying Bounds on Constrained PCA Problems

    Authors: Afonso S. Bandeira, Dmitriy Kunisky, Alexander S. Wein

    Abstract: Given a random $n \times n$ symmetric matrix $\boldsymbol W$ drawn from the Gaussian orthogonal ensemble (GOE), we consider the problem of certifying an upper bound on the maximum value of the quadratic form $\boldsymbol x^\top \boldsymbol W \boldsymbol x$ over all vectors $\boldsymbol x$ in a constraint set $\mathcal{S} \subset \mathbb{R}^n$. For a certain class of normalized constraint sets…

    Submitted 6 April, 2019; v1 submitted 19 February, 2019; originally announced February 2019.

    Comments: Submitted version (minor text revisions)

  29. arXiv:1901.08334

    stat.ML cs.LG

    Overcomplete Independent Component Analysis via SDP

    Authors: Anastasia Podosinnikova, Amelia Perry, Alexander Wein, Francis Bach, Alexandre d'Aspremont, David Sontag

    Abstract: We present a novel algorithm for overcomplete independent components analysis (ICA), where the number of latent sources k exceeds the dimension p of observed variables. Previous algorithms either suffer from high computational complexity or make strong assumptions about the form of the mixing matrix. Our algorithm does not make any sparsity assumption yet enjoys favorable computational and theoret…

    Submitted 24 January, 2019; originally announced January 2019.

    Comments: Appears in: Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS 2019). 21 pages

  30. arXiv:1811.00944

    cs.DS cs.LG stat.ML

    Spectral Methods from Tensor Networks

    Authors: Ankur Moitra, Alexander S. Wein

    Abstract: A tensor network is a diagram that specifies a way to "multiply" a collection of tensors together to produce another tensor (or matrix). Many existing algorithms for tensor problems (such as tensor decomposition and tensor PCA), although they are not presented this way, can be viewed as spectral methods on matrices built from simple tensor networks. In this work we leverage the full power of this…

    Submitted 2 November, 2018; originally announced November 2018.

    Comments: 30 pages, 8 figures

  31. arXiv:1807.00891

    math.ST cs.DS cs.IT math.PR stat.ML

    Optimality and Sub-optimality of PCA I: Spiked Random Matrix Models

    Authors: Amelia Perry, Alexander S. Wein, Afonso S. Bandeira, Ankur Moitra

    Abstract: A central problem of random matrix theory is to understand the eigenvalues of spiked random matrix models, introduced by Johnstone, in which a prominent eigenvector (or "spike") is planted into a random matrix. These distributions form natural statistical models for principal component analysis (PCA) problems throughout the sciences. Baik, Ben Arous and Péché showed that the spiked Wishart ensembl…

    Submitted 12 July, 2018; v1 submitted 2 July, 2018; originally announced July 2018.

    Comments: 67 pages, 3 figures. This is the journal version of part I of arXiv:1609.05573, accepted to the Annals of Statistics. This version includes the supplementary material as appendices

    MSC Class: 62H15; 62B15

    Journal ref: Ann. Statist., Volume 46, Number 5 (2018), 2416-2451

  32. arXiv:1803.11132

    stat.ML cs.DS cs.LG

    Notes on computational-to-statistical gaps: predictions using statistical physics

    Authors: Afonso S. Bandeira, Amelia Perry, Alexander S. Wein

    Abstract: In these notes we describe heuristics to predict computational-to-statistical gaps in certain statistical problems. These are regimes in which the underlying statistical problem is information-theoretically possible although no efficient algorithm exists, rendering the problem essentially unsolvable for large instances. The methods we describe here are based on mature, albeit non-rigorous, tools f…

    Submitted 20 April, 2018; v1 submitted 29 March, 2018; originally announced March 2018.

    Comments: 22 pages, 2 figures

  33. arXiv:1712.10163

    math.ST cs.DS cs.IT math.AC

    Estimation under group actions: recovering orbits from invariants

    Authors: Afonso S. Bandeira, Ben Blum-Smith, Joe Kileel, Amelia Perry, Jonathan Niles-Weed, Alexander S. Wein

    Abstract: We study a class of orbit recovery problems in which we observe independent copies of an unknown element of $\mathbb{R}^p$, each linearly acted upon by a random element of some group (such as $\mathbb{Z}/p$ or $\mathrm{SO}(3)$) and then corrupted by additive Gaussian noise. We prove matching upper and lower bounds on the number of samples required to approximately recover the group orbit of this u…

    Submitted 13 June, 2023; v1 submitted 29 December, 2017; originally announced December 2017.

    Comments: 81 pages. Minor revisions since previous version, reflecting peer review feedback. To be published in Applied and Computational Harmonic Analysis

    MSC Class: 62F10; 92C55; 16W22

    Journal ref: Applied and Computational Harmonic Analysis 66 (2023) 236–319

  34. arXiv:1612.07728

    math.PR cs.IT math.ST stat.ML

    Statistical limits of spiked tensor models

    Authors: Amelia Perry, Alexander S. Wein, Afonso S. Bandeira

    Abstract: We study the statistical limits of both detecting and estimating a rank-one deformation of a symmetric random Gaussian tensor. We establish upper and lower bounds on the critical signal-to-noise ratio, under a variety of priors for the planted vector: (i) a uniformly sampled unit vector, (ii) i.i.d. $\pm 1$ entries, and (iii) a sparse vector where a constant fraction $ρ$ of entries are i.i.d.…

    Submitted 24 January, 2017; v1 submitted 22 December, 2016; originally announced December 2016.

    Comments: 39 pages, 5 figures

  35. arXiv:1610.04583

    cs.IT cs.CV cs.DS math.OC stat.ML

    Message-passing algorithms for synchronization problems over compact groups

    Authors: Amelia Perry, Alexander S. Wein, Afonso S. Bandeira, Ankur Moitra

    Abstract: Various alignment problems arising in cryo-electron microscopy, community detection, time synchronization, computer vision, and other fields fall into a common framework of synchronization problems over compact groups such as Z/L, U(1), or SO(3). The goal of such problems is to estimate an unknown vector of group elements given noisy relative observations. We present an efficient iterative algorit…

    Submitted 14 October, 2016; originally announced October 2016.

    Comments: 35 pages, 11 figures

  36. arXiv:1609.05573

    math.ST cs.DS cs.IT math.PR stat.ML

    Optimality and Sub-optimality of PCA for Spiked Random Matrices and Synchronization

    Authors: Amelia Perry, Alexander S. Wein, Afonso S. Bandeira, Ankur Moitra

    Abstract: A central problem of random matrix theory is to understand the eigenvalues of spiked random matrix models, in which a prominent eigenvector is planted into a random matrix. These distributions form natural statistical models for principal component analysis (PCA) problems throughout the sciences. Baik, Ben Arous and Péché showed that the spiked Wishart ensemble exhibits a sharp phase transition as…

    Submitted 23 December, 2016; v1 submitted 18 September, 2016; originally announced September 2016.

    Comments: 58 pages, 5 figures. This version adds improved results for the Wishart model

    MSC Class: 62H15; 62B15

  37. arXiv:1511.01473

    cs.DS cs.IT cs.LG math.PR stat.ML

    How Robust are Reconstruction Thresholds for Community Detection?

    Authors: Ankur Moitra, William Perry, Alexander S. Wein

    Abstract: The stochastic block model is one of the oldest and most ubiquitous models for studying clustering and community detection. In an exciting sequence of developments, motivated by deep but non-rigorous ideas from statistical physics, Decelle et al. conjectured a sharp threshold for when community detection is possible in the sparse regime. Mossel, Neeman and Sly and Massoulié proved the conjecture a…

    Submitted 21 March, 2016; v1 submitted 4 November, 2015; originally announced November 2015.

    Comments: 36 pages, 3 figures

  38. arXiv:1507.05605

    cs.DS math.PR stat.ML

    A semidefinite program for unbalanced multisection in the stochastic block model

    Authors: Amelia Perry, Alexander S. Wein

    Abstract: We propose a semidefinite programming (SDP) algorithm for community detection in the stochastic block model, a popular model for networks with latent community structure. We prove that our algorithm achieves exact recovery of the latent communities, up to the information-theoretic limits determined by Abbe and Sandon (2015). Our result extends prior SDP approaches by allowing for many communities…

    Submitted 2 December, 2016; v1 submitted 20 July, 2015; originally announced July 2015.

    Comments: 29 pages

    MSC Class: 68