Skip to main content

Showing 1–20 of 20 results for author: Baraniuk, R G

Searching in archive math. Search in all archives.
.
  1. arXiv:2211.03751  [pdf, other

    math.NA cs.DS math.ST

    Asymptotics of the Sketched Pseudoinverse

    Authors: Daniel LeJeune, Pratik Patil, Hamid Javadi, Richard G. Baraniuk, Ryan J. Tibshirani

    Abstract: We take a random matrix theory approach to random sketching and show an asymptotic first-order equivalence of the regularized sketched pseudoinverse of a positive semidefinite matrix to a certain evaluation of the resolvent of the same matrix. We focus on real-valued regularization and extend previous results on an asymptotic equivalence of random matrices to the real setting, providing a precise… ▽ More

    Submitted 6 October, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: 45 pages, 9 figures

    MSC Class: 15B52; 46L54; 62J07

  2. arXiv:2208.00579  [pdf, other

    cs.LG math.NA

    Momentum Transformer: Closing the Performance Gap Between Self-attention and Its Linearization

    Authors: Tan Nguyen, Richard G. Baraniuk, Robert M. Kirby, Stanley J. Osher, Bao Wang

    Abstract: Transformers have achieved remarkable success in sequence modeling and beyond but suffer from quadratic computational and memory complexities with respect to the length of the input sequence. Leveraging techniques include sparse and linear attention and hashing tricks; efficient transformers have been proposed to reduce the quadratic complexity of transformers but significantly degrade the accurac… ▽ More

    Submitted 31 July, 2022; originally announced August 2022.

    Comments: 22 pages, 5 figures. arXiv admin note: substantial text overlap with arXiv:2110.07034

    MSC Class: 65Pxx

  3. Singular Value Perturbation and Deep Network Optimization

    Authors: Rudolf H. Riedi, Randall Balestriero, Richard G. Baraniuk

    Abstract: We develop new theoretical results on matrix perturbation to shed light on the impact of architecture on the performance of a deep network. In particular, we explain analytically what deep learning practitioners have long observed empirically: the parameters of some deep architectures (e.g., residual networks, ResNets, and Dense networks, DenseNets) are easier to optimize than others (e.g., convol… ▽ More

    Submitted 5 December, 2022; v1 submitted 6 March, 2022; originally announced March 2022.

    Comments: Constr Approx (2022)

  4. arXiv:2006.06919  [pdf, other

    cs.LG math.DS stat.ML

    MomentumRNN: Integrating Momentum into Recurrent Neural Networks

    Authors: Tan M. Nguyen, Richard G. Baraniuk, Andrea L. Bertozzi, Stanley J. Osher, Bao Wang

    Abstract: Designing deep neural networks is an art that often involves an expensive search over candidate architectures. To overcome this for recurrent neural nets (RNNs), we establish a connection between the hidden state dynamics in an RNN and gradient descent (GD). We then integrate momentum into this framework and propose a new family of RNNs, called {\em MomentumRNNs}. We theoretically prove and numeri… ▽ More

    Submitted 11 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 21 pages, 11 figures, Accepted for publication at Advances in Neural Information Processing Systems (NeurIPS) 2020

    MSC Class: 68T07 ACM Class: I.2

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS) 2020

  5. arXiv:1511.01017  [pdf, ps, other

    math.ST cs.IT math.OC stat.ML

    Consistent Parameter Estimation for LASSO and Approximate Message Passing

    Authors: Ali Mousavi, Arian Maleki, Richard G. Baraniuk

    Abstract: We consider the problem of recovering a vector $β_o \in \mathbb{R}^p$ from $n$ random and noisy linear observations $y= Xβ_o + w$, where $X$ is the measurement matrix and $w$ is noise. The LASSO estimate is given by the solution to the optimization problem $\hatβ_λ = \arg \min_β \frac{1}{2} \|y-Xβ\|_2^2 + λ\| β\|_1$. Among the iterative algorithms that have been proposed for solving this optimizat… ▽ More

    Submitted 4 November, 2015; v1 submitted 3 November, 2015; originally announced November 2015.

    Comments: arXiv admin note: text overlap with arXiv:1309.5979

  6. arXiv:1406.4175  [pdf, other

    cs.IT math.ST stat.ML

    From Denoising to Compressed Sensing

    Authors: Christopher A. Metzler, Arian Maleki, Richard G. Baraniuk

    Abstract: A denoising algorithm seeks to remove noise, errors, or perturbations from a signal. Extensive research has been devoted to this arena over the last several decades, and as a result, today's denoisers can effectively remove large amounts of additive white Gaussian noise. A compressed sensing (CS) reconstruction algorithm seeks to recover a structured signal acquired using a small number of randomi… ▽ More

    Submitted 17 April, 2016; v1 submitted 16 June, 2014; originally announced June 2014.

  7. arXiv:1404.4104  [pdf, other

    math.OC cs.CV cs.LG

    Sparse Bilinear Logistic Regression

    Authors: Jianing V. Shi, Yangyang Xu, Richard G. Baraniuk

    Abstract: In this paper, we introduce the concept of sparse bilinear logistic regression for decision problems involving explanatory variables that are two-dimensional matrices. Such problems are common in computer vision, brain-computer interfaces, style/content factorization, and parallel factor analysis. The underlying optimization problem is bi-convex; we study its solution and develop an efficient algo… ▽ More

    Submitted 15 April, 2014; originally announced April 2014.

    Comments: 27 pages, 5 figures

    MSC Class: 65K10; 68W40; 68Q32

  8. arXiv:1404.3418  [pdf, ps, other

    stat.ML cs.IT math.ST

    Active Learning for Undirected Graphical Model Selection

    Authors: Divyanshu Vats, Robert D. Nowak, Richard G. Baraniuk

    Abstract: This paper studies graphical model selection, i.e., the problem of estimating a graph of statistical relationships among a collection of random variables. Conventional graphical model selection algorithms are passive, i.e., they require all the measurements to have been collected before processing begins. We propose an active learning algorithm that uses junction tree representations to adapt futu… ▽ More

    Submitted 13 April, 2014; originally announced April 2014.

    Comments: AISTATS 2014

    Journal ref: Proceedings of the 17th International Conference on Artificial Intelligence and Statistics (AISTATS) 2014, Reykjavik, Iceland. JMLR: W&CP volume 33

  9. arXiv:1402.5584  [pdf, ps, other

    math.ST cs.IT stat.ML

    Path Thresholding: Asymptotically Tuning-Free High-Dimensional Sparse Regression

    Authors: Divyanshu Vats, Richard G. Baraniuk

    Abstract: In this paper, we address the challenging problem of selecting tuning parameters for high-dimensional sparse regression. We propose a simple and computationally efficient method, called path thresholding (PaTh), that transforms any tuning parameter-dependent sparse regression algorithm into an asymptotically tuning-free sparse regression algorithm. More specifically, we prove that, as the problem… ▽ More

    Submitted 23 February, 2014; originally announced February 2014.

    Comments: AISTATS 2014

    Journal ref: Proceedings of the 17th International Conference on Artificial Intelligence and Statistics (AISTATS) 2014, Reykjavik, Iceland. JMLR: W&CP volume 33

  10. arXiv:1401.7715  [pdf, other

    cs.CV math.OC

    Video Compressive Sensing for Dynamic MRI

    Authors: Jianing V. Shi, Wotao Yin, Aswin C. Sankaranarayanan, Richard G. Baraniuk

    Abstract: We present a video compressive sensing framework, termed kt-CSLDS, to accelerate the image acquisition process of dynamic magnetic resonance imaging (MRI). We are inspired by a state-of-the-art model for video compressive sensing that utilizes a linear dynamical system (LDS) to model the motion manifold. Given compressive measurements, the state sequence of an LDS can be first estimated using syst… ▽ More

    Submitted 1 February, 2014; v1 submitted 29 January, 2014; originally announced January 2014.

    Comments: 30 pages, 9 figures

    MSC Class: 90-08; 90C25; 65P99; 65K10; 93E10; 93E12

  11. arXiv:1312.5734  [pdf, ps, other

    stat.ML cs.LG math.OC stat.AP

    Time-varying Learning and Content Analytics via Sparse Factor Analysis

    Authors: Andrew S. Lan, Christoph Studer, Richard G. Baraniuk

    Abstract: We propose SPARFA-Trace, a new machine learning-based framework for time-varying learning and content analytics for education applications. We develop a novel message passing-based, blind, approximate Kalman filter for sparse factor analysis (SPARFA), that jointly (i) traces learner concept knowledge over time, (ii) analyzes learner concept knowledge state transitions (induced by interacting with… ▽ More

    Submitted 19 December, 2013; originally announced December 2013.

  12. arXiv:1312.1706  [pdf, ps, other

    math.ST cs.IT stat.ML

    Swap** Variables for High-Dimensional Sparse Regression with Correlated Measurements

    Authors: Divyanshu Vats, Richard G. Baraniuk

    Abstract: We consider the high-dimensional sparse linear regression problem of accurately estimating a sparse vector using a small number of linear measurements that are contaminated by noise. It is well known that the standard cadre of computationally tractable sparse regression algorithms---such as the Lasso, Orthogonal Matching Pursuit (OMP), and their extensions---perform poorly when the measurement mat… ▽ More

    Submitted 22 February, 2014; v1 submitted 5 December, 2013; originally announced December 2013.

    Comments: Parts of this paper have appeared in NIPS 2013

  13. arXiv:1311.0035  [pdf, ps, other

    cs.IT math.ST stat.ML

    Parameterless Optimal Approximate Message Passing

    Authors: Ali Mousavi, Arian Maleki, Richard G. Baraniuk

    Abstract: Iterative thresholding algorithms are well-suited for high-dimensional problems in sparse recovery and compressive sensing. The performance of this class of algorithms depends heavily on the tuning of certain threshold parameters. In particular, both the final reconstruction error and the convergence rate of the algorithm crucially rely on how the threshold parameter is set at each step of the alg… ▽ More

    Submitted 31 October, 2013; originally announced November 2013.

  14. arXiv:1309.5979  [pdf, other

    math.ST cs.IT stat.ML

    Asymptotic Analysis of LASSOs Solution Path with Implications for Approximate Message Passing

    Authors: Ali Mousavi, Arian Maleki, Richard G. Baraniuk

    Abstract: This paper concerns the performance of the LASSO (also knows as basis pursuit denoising) for recovering sparse signals from undersampled, randomized, noisy measurements. We consider the recovery of the signal $x_o \in \mathbb{R}^N$ from $n$ random and noisy linear observations $y= Ax_o + w$, where $A$ is the measurement matrix and $w$ is the noise. The LASSO estimate is given by the solution to th… ▽ More

    Submitted 23 September, 2013; originally announced September 2013.

  15. arXiv:1303.5685  [pdf, ps, other

    stat.ML cs.LG math.OC stat.AP

    Sparse Factor Analysis for Learning and Content Analytics

    Authors: Andrew S. Lan, Andrew E. Waters, Christoph Studer, Richard G. Baraniuk

    Abstract: We develop a new model and algorithms for machine learning-based learning analytics, which estimate a learner's knowledge of the concepts underlying a domain, and content analytics, which estimate the relationships among a collection of questions and those concepts. Our model represents the probability that a learner provides the correct response to a question in terms of three factors: their unde… ▽ More

    Submitted 19 July, 2013; v1 submitted 22 March, 2013; originally announced March 2013.

    Journal ref: Journal of Machine Learning Research, vol. 15, pp. 1959-2008, June, 2014

  16. arXiv:1303.4778  [pdf, other

    cs.LG math.NA stat.ML

    Greedy Feature Selection for Subspace Clustering

    Authors: Eva L. Dyer, Aswin C. Sankaranarayanan, Richard G. Baraniuk

    Abstract: Unions of subspaces provide a powerful generalization to linear subspace models for collections of high-dimensional data. To learn a union of subspaces from a collection of data, sets of signals in the collection that belong to the same subspace must be identified in order to obtain accurate estimates of the subspace structures present in the data. Recently, sparse recovery methods have been shown… ▽ More

    Submitted 3 July, 2013; v1 submitted 19 March, 2013; originally announced March 2013.

    Comments: 32 pages, 7 figures, 1 table

    Journal ref: Journal of Machine Learning Research, Vol.14, Issue 1, pp. 2487-2517, January 2013

  17. arXiv:1112.0311  [pdf, other

    math.ST cs.IT

    Anisotropic Nonlocal Means Denoising

    Authors: Arian Maleki, Manjari Narayan, Richard G. Baraniuk

    Abstract: It has recently been proved that the popular nonlocal means (NLM) denoising algorithm does not optimally denoise images with sharp edges. Its weakness lies in the isotropic nature of the neighborhoods it uses to set its smoothing weights. In response, in this paper we introduce several theoretical and practical anisotropic nonlocal means (ANLM) algorithms and prove that they are near minimax optim… ▽ More

    Submitted 30 November, 2012; v1 submitted 30 November, 2011; originally announced December 2011.

    Comments: Accepted for publication in Applied and Computational Harmonic Analysis (ACHA)

  18. arXiv:1111.5867  [pdf, other

    math.ST cs.CV cs.IT

    Suboptimality of Nonlocal Means for Images with Sharp Edges

    Authors: Arian Maleki, Manjari Narayan, Richard G. Baraniuk

    Abstract: We conduct an asymptotic risk analysis of the nonlocal means image denoising algorithm for the Horizon class of images that are piecewise constant with a sharp edge discontinuity. We prove that the mean square risk of an optimally tuned nonlocal means algorithm decays according to $n^{-1}\log^{1/2+ε} n$, for an $n$-pixel image with $ε>0$. This decay rate is an improvement over some of the predeces… ▽ More

    Submitted 24 November, 2011; originally announced November 2011.

    Comments: 33 pages, 3 figures

  19. arXiv:0911.0736  [pdf, ps, other

    math.NA cs.IT

    A simple proof that random matrices are democratic

    Authors: Mark A. Davenport, Jason N. Laska, Petros T. Boufounos, Richard G. Baraniuk

    Abstract: The recently introduced theory of compressive sensing (CS) enables the reconstruction of sparse or compressible signals from a small set of nonadaptive, linear measurements. If properly chosen, the number of measurements can be significantly smaller than the ambient dimension of the signal and yet preserve the significant signal information. Interestingly, it can be shown that random measurement… ▽ More

    Submitted 4 November, 2009; originally announced November 2009.

    Report number: Rice University Department of Electrical and Computer Engineering Technical Report TREE0906 MSC Class: 41A46; 68W20; 90C27

  20. Optimal sampling strategies for multiscale stochastic processes

    Authors: Vinay J. Ribeiro, Rudolf H. Riedi, Richard G. Baraniuk

    Abstract: In this paper, we determine which non-random sampling of fixed size gives the best linear predictor of the sum of a finite spatial population. We employ different multiscale superpopulation models and use the minimum mean-squared error as our optimality criterion. In multiscale superpopulation tree models, the leaves represent the units of the population, interior nodes represent partial sums of… ▽ More

    Submitted 7 November, 2006; originally announced November 2006.

    Comments: Published at http://dx.doi.org/10.1214/074921706000000509 in the IMS Lecture Notes--Monograph Series (http://www.imstat.org/publications/lecnotes.htm) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-LNMS49-LNMS4916 MSC Class: 94A20; 62M30; 60G18 (Primary) 62H11; 62H12; 78M50 (Secondary)

    Journal ref: IMS Lecture Notes--Monograph Series 2006, Vol. 49, 266-290