Showing 1–15 of 15 results for author: Donoho, D L

Searching in archive cs.
  1. arXiv:2404.01413  [pdf, other]

    cs.LG cs.AI cs.CL cs.ET stat.ML

    Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data

    Authors: Matthias Gerstgrasser, Rylan Schaeffer, Apratim Dey, Rafael Rafailov, Henry Sleight, John Hughes, Tomasz Korbak, Rajashree Agrawal, Dhruv Pai, Andrey Gromov, Daniel A. Roberts, Diyi Yang, David L. Donoho, Sanmi Koyejo

    Abstract: The proliferation of generative models, combined with pretraining on web-scale data, raises a timely question: what happens when these models are trained on their own generated outputs? Recent investigations into model-data feedback loops proposed that such loops would lead to a phenomenon termed model collapse, under which performance progressively degrades with each model-data feedback iteration…

    Submitted 29 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  2. arXiv:2106.02073  [pdf, other]

    cs.LG cs.AI math.DG math.OC stat.ML

    Neural Collapse Under MSE Loss: Proximity to and Dynamics on the Central Path

    Authors: X. Y. Han, Vardan Papyan, David L. Donoho

    Abstract: The recently discovered Neural Collapse (NC) phenomenon occurs pervasively in today's deep net training paradigm of driving cross-entropy (CE) loss towards zero. During NC, last-layer features collapse to their class-means, both classifiers and class-means collapse to the same Simplex Equiangular Tight Frame, and classifier behavior collapses to the nearest-class-mean decision rule. Recent works d…

    Submitted 9 May, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: ICLR 2022 Outstanding Paper Prize & Oral. Appendix contains [A] empirical experiments, [B-D] proofs of theoretical results, and [E] survey of related works examining Neural Collapse

  3. arXiv:2008.08186  [pdf, other]

    cs.LG cs.CV stat.ML

    Prevalence of Neural Collapse during the terminal phase of deep learning training

    Authors: Vardan Papyan, X. Y. Han, David L. Donoho

    Abstract: Modern practice for training classification deepnets involves a Terminal Phase of Training (TPT), which begins at the epoch where training error first vanishes; during TPT, the training error stays effectively zero while training loss is pushed towards zero. Direct measurements of TPT, for three prototypical deepnet architectures and across seven canonical classification datasets, expose a pervasi…

    Submitted 21 August, 2020; v1 submitted 18 August, 2020; originally announced August 2020.

  4. arXiv:1901.08705  [pdf, other]

    cs.DC

    Ambitious Data Science Can Be Painless

    Authors: Hatef Monajemi, Riccardo Murri, Eric Jonas, Percy Liang, Victoria Stodden, David L. Donoho

    Abstract: Modern data science research can involve massive computational experimentation; an ambitious PhD in computational fields may do experiments consuming several million CPU hours. Traditional computing practices, in which researchers use laptops or shared campus-resident resources, are inadequate for experiments at the massive scale and varied scope that we now see in data science. On the other hand,…

    Submitted 24 January, 2019; originally announced January 2019.

    Comments: Submitted to Harvard Data Science Review

  5. arXiv:1702.03062  [pdf, other]

    cs.IT

    Sparsity/Undersampling Tradeoffs in Anisotropic Undersampling, with Applications in MR Imaging/Spectroscopy

    Authors: Hatef Monajemi, David L. Donoho

    Abstract: We study anisotropic undersampling schemes like those used in multi-dimensional NMR spectroscopy and MR imaging, which sample exhaustively in certain time dimensions and randomly in others. Our analysis shows that anisotropic undersampling schemes are equivalent to certain block-diagonal measurement systems. We develop novel exact formulas for the sparsity/undersampling tradeoffs in such measure…

    Submitted 16 March, 2018; v1 submitted 9 February, 2017; originally announced February 2017.

  6. The Phase Transition of Matrix Recovery from Gaussian Measurements Matches the Minimax MSE of Matrix Denoising

    Authors: David L. Donoho, Matan Gavish, Andrea Montanari

    Abstract: Let $X_0$ be an unknown $M$ by $N$ matrix. In matrix recovery, one takes $n < MN$ linear measurements $y_1, \ldots, y_n$ of $X_0$, where $y_i = \mathrm{Tr}(a_i^T X_0)$ and each $a_i$ is an $M$ by $N$ matrix. For measurement matrices with Gaussian i.i.d. entries, it is known that if $X_0$ is of low rank, it is recoverable from just a few measurements. A popular approach for matrix recovery is Nuclear Norm Minimizat…

    Submitted 10 February, 2013; originally announced February 2013.

  7. arXiv:1112.0708  [pdf, other]

    cs.IT cond-mat.stat-mech math.ST

    Information-Theoretically Optimal Compressed Sensing via Spatial Coupling and Approximate Message Passing

    Authors: David L. Donoho, Adel Javanmard, Andrea Montanari

    Abstract: We study the compressed sensing reconstruction problem for a broad class of random, band-diagonal sensing matrices. This construction is inspired by the idea of spatial coupling in coding theory. As demonstrated heuristically and numerically by Krzakala et al. \cite{KrzakalaEtAl}, message passing algorithms can effectively solve the reconstruction problem for spatially coupled measurements with un…

    Submitted 18 January, 2013; v1 submitted 3 December, 2011; originally announced December 2011.

    Comments: 60 pages, 7 figures, Sections 3,5 and Appendices A,B are added. The stability constant is quantified (cf Theorem 1.7)

  8. arXiv:1004.3006  [pdf, ps, other]

    math.FA cs.IT math.NA

    Microlocal Analysis of the Geometric Separation Problem

    Authors: David L. Donoho, Gitta Kutyniok

    Abstract: Image data are often composed of two or more geometrically distinct constituents; in galaxy catalogs, for instance, one sees a mixture of pointlike structures (galaxy superclusters) and curvelike structures (filaments). It would be ideal to process a single image and extract two geometrically 'pure' images, each one containing features from only one of the two geometric constituents. This seems t…

    Submitted 18 April, 2010; originally announced April 2010.

    Comments: 59 pages, 9 figures

    Report number: Technical Report No. 2010-01, Statistics Department, Stanford University

  9. arXiv:1004.1218  [pdf, other]

    math.ST cs.IT

    The Noise-Sensitivity Phase Transition in Compressed Sensing

    Authors: David L. Donoho, Arian Maleki, Andrea Montanari

    Abstract: Consider the noisy underdetermined system of linear equations: y=Ax0 + z0, with n x N measurement matrix A, n < N, and Gaussian white noise z0 ~ N(0,σ^2 I). Both y and A are known, both x0 and z0 are unknown, and we seek an approximation to x0. When x0 has few nonzeros, useful approximations are obtained by l1-penalized l2 minimization, in which the reconstruction \hxl solves min || y - Ax||^2/2…

    Submitted 7 April, 2010; originally announced April 2010.

    Comments: 40 pages, 13 pdf figures

  10. arXiv:0911.4222  [pdf, other]

    cs.IT

    Message Passing Algorithms for Compressed Sensing: II. Analysis and Validation

    Authors: David L. Donoho, Arian Maleki, Andrea Montanari

    Abstract: In a recent paper, the authors proposed a new class of low-complexity iterative thresholding algorithms for reconstructing sparse signals from a small set of linear measurements \cite{DMM}. The new algorithms are broadly referred to as AMP, for approximate message passing. This is the second of two conference papers describing the derivation of these algorithms, connection with related literatur…

    Submitted 21 November, 2009; originally announced November 2009.

    Comments: 5 pages, 3 pdf figures, IEEE Information Theory Workshop, Cairo 2010

  11. arXiv:0911.4219  [pdf, ps, other]

    cs.IT

    Message Passing Algorithms for Compressed Sensing: I. Motivation and Construction

    Authors: David L. Donoho, Arian Maleki, Andrea Montanari

    Abstract: In a recent paper, the authors proposed a new class of low-complexity iterative thresholding algorithms for reconstructing sparse signals from a small set of linear measurements \cite{DMM}. The new algorithms are broadly referred to as AMP, for approximate message passing. This is the first of two conference papers describing the derivation of these algorithms, connection with the related litera…

    Submitted 21 November, 2009; originally announced November 2009.

    Comments: 5 pages, IEEE Information Theory Workshop, Cairo 2010

  12. arXiv:0909.0777  [pdf, other]

    math.NA cs.IT cs.MS

    Optimally Tuned Iterative Reconstruction Algorithms for Compressed Sensing

    Authors: Arian Maleki, David L. Donoho

    Abstract: We conducted an extensive computational experiment, lasting multiple CPU-years, to optimally select parameters for two important classes of algorithms for finding sparse solutions of underdetermined systems of linear equations. We make the optimally tuned implementations available at sparselab.stanford.edu; they run 'out of the box' with no user tuning: it is not necessary to select thresh…

    Submitted 3 September, 2009; originally announced September 2009.

    Comments: 12 pages, 14 figures

  13. arXiv:0907.3574  [pdf, ps, other]

    cs.IT cond-mat.dis-nn stat.CO

    Message Passing Algorithms for Compressed Sensing

    Authors: David L. Donoho, Arian Maleki, Andrea Montanari

    Abstract: Compressed sensing aims to undersample certain high-dimensional signals, yet accurately reconstruct them by exploiting signal characteristics. Accurate reconstruction is possible when the object to be recovered is sufficiently sparse in a known basis. Currently, the best known sparsity-undersampling tradeoff is achieved when reconstructing by convex optimization -- which is expensive in importan…

    Submitted 21 July, 2009; originally announced July 2009.

    Comments: 6 pages paper + 9 pages supplementary information, 13 eps figures. Submitted to Proc. Natl. Acad. Sci. USA

  14. arXiv:0906.2530  [pdf, other]

    math.ST cs.IT physics.data-an stat.CO

    Observed Universality of Phase Transitions in High-Dimensional Geometry, with Implications for Modern Data Analysis and Signal Processing

    Authors: David L. Donoho, Jared Tanner

    Abstract: We review connections between phase transitions in high-dimensional combinatorial geometry and phase transitions occurring in modern high-dimensional data analysis and signal processing. In data analysis, such transitions arise as abrupt breakdown of linear model selection, robust data fitting or compressed sensing reconstructions, when the complexity of the model or the number of outliers incre…

    Submitted 14 June, 2009; originally announced June 2009.

    Comments: 47 pages, 24 figures, 10 tables

  15. arXiv:0807.3590  [pdf, ps, other]

    math.MG cs.IT math.OC math.PR

    Counting the Faces of Randomly-Projected Hypercubes and Orthants, with Applications

    Authors: David L. Donoho, Jared Tanner

    Abstract: Let $A$ be an $n$ by $N$ real valued random matrix, and $\h$ denote the $N$-dimensional hypercube. For numerous random matrix ensembles, the expected number of $k$-dimensional faces of the random $n$-dimensional zonotope $A\h$ obeys the formula $E f_k(A\h) /f_k(\h) = 1-P_{N-n,N-k}$, where $P_{N-n,N-k}$ is a fair-coin-tossing probability. The formula applies, for example, where the columns of…

    Submitted 22 July, 2008; originally announced July 2008.

    Comments: 21 pages, 3 figures

    MSC Class: 52A22; 52B05; 52B11; 52B12; 62E20; 68P30; 68P25; 68W20; 68W40; 94B20; 94B35; 94B65; 94B70