Skip to main content

Showing 1–21 of 21 results for author: Nowak, R D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2307.15772  [pdf, ps, other

    stat.ML cs.LG math.NA

    Weighted variation spaces and approximation by shallow ReLU networks

    Authors: Ronald DeVore, Robert D. Nowak, Rahul Parhi, Jonathan W. Siegel

    Abstract: We investigate the approximation of functions $f$ on a bounded domain $Ω\subset \mathbb{R}^d$ by the outputs of single-hidden-layer ReLU neural networks of width $n$. This form of nonlinear $n$-term dictionary approximation has been intensely studied since it is the simplest case of neural network approximation (NNA). There are several celebrated approximation results for this form of NNA that int… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

  2. arXiv:2305.16534  [pdf, other

    stat.ML cs.LG

    Variation Spaces for Multi-Output Neural Networks: Insights on Multi-Task Learning and Network Compression

    Authors: Joseph Shenouda, Rahul Parhi, Kangwook Lee, Robert D. Nowak

    Abstract: This paper introduces a novel theoretical framework for the analysis of vector-valued neural networks through the development of vector-valued variation spaces, a new class of reproducing kernel Banach spaces. These spaces emerge from studying the regularization effect of weight decay in training networks with activations like the rectified linear unit (ReLU). This framework offers a deeper unders… ▽ More

    Submitted 9 March, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

  3. arXiv:2301.09554  [pdf, other

    stat.ML cs.LG eess.SP

    Deep Learning Meets Sparse Regularization: A Signal Processing Perspective

    Authors: Rahul Parhi, Robert D. Nowak

    Abstract: Deep learning has been wildly successful in practice and most state-of-the-art machine learning methods are based on neural networks. Lacking, however, is a rigorous mathematical theory that adequately explains the amazing performance of deep neural networks. In this article, we present a relatively new mathematical framework that provides the beginning of a deeper understanding of deep learning.… ▽ More

    Submitted 8 June, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Journal ref: IEEE Signal Processing Magazine, vol. 40, no. 6, pp. 63-74, Sept. 2023

  4. arXiv:2109.08844  [pdf, other

    stat.ML cs.LG math.ST

    Near-Minimax Optimal Estimation With Shallow ReLU Neural Networks

    Authors: Rahul Parhi, Robert D. Nowak

    Abstract: We study the problem of estimating an unknown function from noisy data using shallow ReLU neural networks. The estimators we study minimize the sum of squared data-fitting errors plus a regularization term proportional to the squared Euclidean norm of the network weights. This minimization corresponds to the common approach of training a neural network with weight decay. We quantify the performanc… ▽ More

    Submitted 12 October, 2022; v1 submitted 18 September, 2021; originally announced September 2021.

    Comments: IEEE Transactions on Information Theory (in press)

    Journal ref: IEEE Transactions on Information Theory, vol. 69, no. 2, pp. 1125-1140, Feb. 2023

  5. arXiv:2105.03361  [pdf, other

    stat.ML cs.LG

    What Kinds of Functions do Deep Neural Networks Learn? Insights from Variational Spline Theory

    Authors: Rahul Parhi, Robert D. Nowak

    Abstract: We develop a variational framework to understand the properties of functions learned by fitting deep neural networks with rectified linear unit activations to data. We propose a new function space, which is reminiscent of classical bounded variation-type spaces, that captures the compositional structure associated with deep neural networks. We derive a representer theorem showing that deep ReLU ne… ▽ More

    Submitted 26 September, 2021; v1 submitted 7 May, 2021; originally announced May 2021.

    Journal ref: SIAM Journal on Mathematics of Data Science, vol. 4, no. 2, pp. 464-489, 2022

  6. arXiv:2006.05626  [pdf, other

    stat.ML cs.LG

    Banach Space Representer Theorems for Neural Networks and Ridge Splines

    Authors: Rahul Parhi, Robert D. Nowak

    Abstract: We develop a variational framework to understand the properties of the functions learned by neural networks fit to data. We propose and study a family of continuous-domain linear inverse problems with total variation-like regularization in the Radon domain subject to data fitting constraints. We derive a representer theorem showing that finite-width, single-hidden layer neural networks are solutio… ▽ More

    Submitted 11 February, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: update to published version

    Journal ref: Journal of Machine Learning Research, vol. 22, no. 43, pp. 1-40, 2021

  7. arXiv:2002.01044  [pdf, other

    stat.ML cs.IT cs.LG math.ST

    Optimal Confidence Regions for the Multinomial Parameter

    Authors: Matthew L. Malloy, Ardhendu Tripathy, Robert D. Nowak

    Abstract: Construction of tight confidence regions and intervals is central to statistical inference and decision making. This paper develops new theory showing minimum average volume confidence regions for categorical data. More precisely, consider an empirical distribution $\widehat{\boldsymbol{p}}$ generated from $n$ iid realizations of a random variable that takes one of $k$ possible values according to… ▽ More

    Submitted 29 January, 2021; v1 submitted 3 February, 2020; originally announced February 2020.

  8. The Role of Neural Network Activation Functions

    Authors: Rahul Parhi, Robert D. Nowak

    Abstract: A wide variety of activation functions have been proposed for neural networks. The Rectified Linear Unit (ReLU) is especially popular today. There are many practical reasons that motivate the use of the ReLU. This paper provides new theoretical characterizations that support the use of the ReLU, its variants such as the leaky ReLU, as well as other activation functions in the case of univariate, s… ▽ More

    Submitted 16 October, 2020; v1 submitted 5 October, 2019; originally announced October 2019.

    Comments: update to published version

    Journal ref: IEEE Signal Processing Letters, vol. 27, pp. 1779-1783, 2020

  9. arXiv:1905.12782  [pdf, other

    cs.LG cs.AI stat.ML

    MaxiMin Active Learning in Overparameterized Model Classes}

    Authors: Mina Karzand, Robert D. Nowak

    Abstract: Generating labeled training datasets has become a major bottleneck in Machine Learning (ML) pipelines. Active ML aims to address this issue by designing learning algorithms that automatically and adaptively select the most informative examples for labeling so that human time is not wasted labeling irrelevant, redundant, or trivial examples. This paper proposes a new approach to active ML with nonp… ▽ More

    Submitted 28 April, 2020; v1 submitted 29 May, 2019; originally announced May 2019.

    Comments: 43 pages, 12 figures

  10. arXiv:1804.10266  [pdf, other

    stat.ML cs.LG

    Tensor Methods for Nonlinear Matrix Completion

    Authors: Greg Ongie, Daniel Pimentel-Alarcón, Laura Balzano, Rebecca Willett, Robert D. Nowak

    Abstract: In the low-rank matrix completion (LRMC) problem, the low-rank assumption means that the columns (or rows) of the matrix to be completed are points on a low-dimensional linear algebraic variety. This paper extends this thinking to cases where the columns are points on a low-dimensional nonlinear algebraic variety, a problem we call Low Algebraic Dimension Matrix Completion (LADMC). Matrices whose… ▽ More

    Submitted 4 September, 2020; v1 submitted 26 April, 2018; originally announced April 2018.

  11. arXiv:1703.09631  [pdf, other

    stat.ML

    Algebraic Variety Models for High-Rank Matrix Completion

    Authors: Greg Ongie, Rebecca Willett, Robert D. Nowak, Laura Balzano

    Abstract: We consider a generalization of low-rank matrix completion to the case where the data belongs to an algebraic variety, i.e. each data point is a solution to a system of polynomial equations. In this case the original matrix is possibly high-rank, but it becomes low-rank after map** each column to a higher dimensional space of monomial features. Many well-studied extensions of linear models, incl… ▽ More

    Submitted 28 March, 2017; originally announced March 2017.

  12. arXiv:1503.02596  [pdf, ps, other

    stat.ML cs.LG math.AG

    A Characterization of Deterministic Sampling Patterns for Low-Rank Matrix Completion

    Authors: Daniel L. Pimentel-Alarcón, Nigel Boston, Robert D. Nowak

    Abstract: Low-rank matrix completion (LRMC) problems arise in a wide variety of applications. Previous theory mainly provides conditions for completion under missing-at-random samplings. This paper studies deterministic conditions for completion. An incomplete $d \times N$ matrix is finitely rank-$r$ completable if there are at most finitely many rank-$r$ matrices that agree with all its observed entries. F… ▽ More

    Submitted 11 October, 2016; v1 submitted 9 March, 2015; originally announced March 2015.

    Comments: This update corrects an error in version 2 of this paper, where we erroneously assumed that columns with more than r+1 observed entries would yield multiple independent constraints

    Journal ref: IEEE Journal of Selected Topics in Signal Processing, vol. 10, no. 4, pp. 623-636, June, 2016

  13. arXiv:1410.0633  [pdf, ps, other

    stat.ML cs.LG math.CO

    Deterministic Conditions for Subspace Identifiability from Incomplete Sampling

    Authors: Daniel L. Pimentel-Alarcón, Robert D. Nowak, Nigel Boston

    Abstract: Consider a generic $r$-dimensional subspace of $\mathbb{R}^d$, $r<d$, and suppose that we are only given projections of this subspace onto small subsets of the canonical coordinates. The paper establishes necessary and sufficient deterministic conditions on the subsets for subspace identifiability.

    Submitted 24 May, 2015; v1 submitted 2 October, 2014; originally announced October 2014.

    Comments: To appear in Proc. of IEEE ISIT, 2015

  14. arXiv:1409.4005  [pdf, ps, other

    stat.ML

    Sparse Estimation with Strongly Correlated Variables using Ordered Weighted L1 Regularization

    Authors: Mario A. T. Figueiredo, Robert D. Nowak

    Abstract: This paper studies ordered weighted L1 (OWL) norm regularization for sparse estimation problems with strongly correlated variables. We prove sufficient conditions for clustering based on the correlation/colinearity of variables using the OWL norm, of which the so-called OSCAR is a particular case. Our results extend previous ones for OSCAR in several ways: for the squared error loss, our condition… ▽ More

    Submitted 13 September, 2014; originally announced September 2014.

  15. arXiv:1404.3418  [pdf, ps, other

    stat.ML cs.IT math.ST

    Active Learning for Undirected Graphical Model Selection

    Authors: Divyanshu Vats, Robert D. Nowak, Richard G. Baraniuk

    Abstract: This paper studies graphical model selection, i.e., the problem of estimating a graph of statistical relationships among a collection of random variables. Conventional graphical model selection algorithms are passive, i.e., they require all the measurements to have been collected before processing begins. We propose an active learning algorithm that uses junction tree representations to adapt futu… ▽ More

    Submitted 13 April, 2014; originally announced April 2014.

    Comments: AISTATS 2014

    Journal ref: Proceedings of the 17th International Conference on Artificial Intelligence and Statistics (AISTATS) 2014, Reykjavik, Iceland. JMLR: W&CP volume 33

  16. arXiv:1306.6239  [pdf, ps, other

    cs.IT stat.ML

    Near-Optimal Adaptive Compressed Sensing

    Authors: Matthew L. Malloy, Robert D. Nowak

    Abstract: This paper proposes a simple adaptive sensing and group testing algorithm for sparse signal recovery. The algorithm, termed Compressive Adaptive Sense and Search (CASS), is shown to be near-optimal in that it succeeds at the lowest possible signal-to-noise-ratio (SNR) levels, improving on previous work in adaptive compressed sensing. Like traditional compressed sensing based on random non-adaptive… ▽ More

    Submitted 29 April, 2014; v1 submitted 26 June, 2013; originally announced June 2013.

  17. arXiv:1209.2434  [pdf, ps, other

    stat.ML cs.LG

    Query Complexity of Derivative-Free Optimization

    Authors: Kevin G. Jamieson, Robert D. Nowak, Benjamin Recht

    Abstract: This paper provides lower bounds on the convergence rate of Derivative Free Optimization (DFO) with noisy function evaluations, exposing a fundamental and unavoidable gap between the performance of algorithms with access to gradients and those with access to only function evaluations. However, there are situations in which DFO is unavoidable, and for such situations we propose a new DFO algorithm… ▽ More

    Submitted 11 September, 2012; originally announced September 2012.

  18. The Sample Complexity of Search over Multiple Populations

    Authors: Matthew L. Malloy, Gongguo Tang, Robert D. Nowak

    Abstract: This paper studies the sample complexity of searching over multiple populations. We consider a large number of populations, each corresponding to either distribution P0 or P1. The goal of the search problem studied here is to find one population corresponding to distribution P1 with as few samples as possible. The main contribution is to quantify the number of samples needed to correctly find one… ▽ More

    Submitted 1 May, 2013; v1 submitted 6 September, 2012; originally announced September 2012.

    Comments: To appear, IEEE Transactions on Information Theory

  19. arXiv:1109.3701  [pdf, other

    cs.LG cs.IT stat.ML

    Active Ranking using Pairwise Comparisons

    Authors: Kevin G. Jamieson, Robert D. Nowak

    Abstract: This paper examines the problem of ranking a collection of objects using pairwise comparisons (rankings of two objects). In general, the ranking of $n$ objects can be identified by standard sorting methods using $n log_2 n$ pairwise comparisons. We are interested in natural situations in which relationships among the objects may allow for ranking using far fewer pairwise comparisons. Specifically,… ▽ More

    Submitted 9 December, 2011; v1 submitted 16 September, 2011; originally announced September 2011.

    Comments: 17 pages, an extended version of our NIPS 2011 paper. The new version revises the argument of the robust section and slightly modifies the result there to give it more impact

  20. arXiv:1104.4385  [pdf, other

    cs.CV stat.ML

    Convex Approaches to Model Wavelet Sparsity Patterns

    Authors: Nikhil S Rao, Robert D. Nowak, Stephen J. Wright, Nick G. Kingsbury

    Abstract: Statistical dependencies among wavelet coefficients are commonly represented by graphical models such as hidden Markov trees(HMTs). However, in linear inverse problems such as deconvolution, tomography, and compressed sensing, the presence of a sensing or observation matrix produces a linear mixing of the simple Markovian dependency structure. This leads to reconstruction problems that are non-con… ▽ More

    Submitted 22 April, 2011; originally announced April 2011.

  21. arXiv:0910.4397  [pdf, other

    stat.ML cs.IT math.ST

    The Geometry of Generalized Binary Search

    Authors: Robert D. Nowak

    Abstract: This paper investigates the problem of determining a binary-valued function through a sequence of strategically selected queries. The focus is an algorithm called Generalized Binary Search (GBS). GBS is a well-known greedy algorithm for determining a binary-valued function through a sequence of strategically selected queries. At each step, a query is selected that most evenly splits the hypotheses… ▽ More

    Submitted 25 June, 2013; v1 submitted 22 October, 2009; originally announced October 2009.

    Comments: corrected typo in Thm 3