Skip to main content

Showing 1–15 of 15 results for author: Gamarnik, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2103.01887  [pdf, ps, other

    stat.ML cs.LG math.PR math.ST

    Self-Regularity of Non-Negative Output Weights for Overparameterized Two-Layer Neural Networks

    Authors: David Gamarnik, Eren C. Kızıldağ, Ilias Zadik

    Abstract: We consider the problem of finding a two-layer neural network with sigmoid, rectified linear unit (ReLU), or binary step activation functions that "fits" a training data set as accurately as possible as quantified by the training error; and study the following question: \emph{does a low training error guarantee that the norm of the output layer (outer norm) itself is small?} We answer affirmativel… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: 34 pages. Some of the results in the present paper are significantly strengthened versions of certain results appearing in arXiv:2003.10523

  2. arXiv:2004.12063  [pdf, ps, other

    cs.CC cs.DS math-ph math.PR stat.ML

    Hardness of Random Optimization Problems for Boolean Circuits, Low-Degree Polynomials, and Langevin Dynamics

    Authors: David Gamarnik, Aukosh Jagannath, Alexander S. Wein

    Abstract: We consider the problem of finding nearly optimal solutions of optimization problems with random objective functions. Two concrete problems we consider are (a) optimizing the Hamiltonian of a spherical or Ising $p$-spin glass model, and (b) finding a large independent set in a sparse Erdős-Rényi graph. The following families of algorithms are considered: (a) low-degree polynomials of the input; (b… ▽ More

    Submitted 26 January, 2022; v1 submitted 25 April, 2020; originally announced April 2020.

    Comments: 41 pages; v1 is the conference paper "Low-Degree Hardness of Random Optimization Problems" (FOCS 2020); v2 is a journal version which adds circuit lower bounds for max independent set, based on ideas from our note arXiv:2109.01342

  3. arXiv:2003.10523  [pdf, other

    stat.ML cs.LG cs.NE math.ST

    Neural Networks and Polynomial Regression. Demystifying the Overparametrization Phenomena

    Authors: Matt Emschwiller, David Gamarnik, Eren C. Kızıldağ, Ilias Zadik

    Abstract: In the context of neural network models, overparametrization refers to the phenomena whereby these models appear to generalize well on the unseen data, even though the number of parameters significantly exceeds the sample sizes, and the model perfectly fits the in-training data. A conventional explanation of this phenomena is based on self-regularization properties of algorithms used to train the… ▽ More

    Submitted 23 March, 2020; originally announced March 2020.

    Comments: 59 pages, 3 figures

  4. arXiv:1912.01599  [pdf, ps, other

    stat.ML cs.LG math.OC math.PR math.ST

    Stationary Points of Shallow Neural Networks with Quadratic Activation Function

    Authors: David Gamarnik, Eren C. Kızıldağ, Ilias Zadik

    Abstract: We consider the teacher-student setting of learning shallow neural networks with quadratic activations and planted weight matrix $W^*\in\mathbb{R}^{m\times d}$, where $m$ is the width of the hidden layer and $d\le m$ is the data dimension. We study the optimization landscape associated with the empirical and the population squared risk of the problem. Under the assumption the planted weights are f… ▽ More

    Submitted 9 July, 2020; v1 submitted 3 December, 2019; originally announced December 2019.

    Comments: 54 pages

  5. arXiv:1910.10890  [pdf, other

    math.ST math.PR stat.ML

    Inference in High-Dimensional Linear Regression via Lattice Basis Reduction and Integer Relation Detection

    Authors: David Gamarnik, Eren C. Kızıldağ, Ilias Zadik

    Abstract: We focus on the high-dimensional linear regression problem, where the algorithmic goal is to efficiently infer an unknown feature vector $β^*\in\mathbb{R}^p$ from its linear measurements, using a small number $n$ of samples. Unlike most of the literature, we make no sparsity assumption on $β^*$, but instead adopt a different regularization: In the noiseless setting, we assume $β^*$ consists of ent… ▽ More

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: 56 pages. Parts of the material of this manuscript were presented at NeurIPS 2018, and ISIT 2019. This submission subsumes the content of arXiv:1803.06716

    Journal ref: IEEE Transactions on Information Theory (Volume: 67, Issue: 12, December 2021)

  6. arXiv:1805.11238  [pdf, ps, other

    math.PR cs.IT stat.CO

    Explicit construction of RIP matrices is Ramsey-hard

    Authors: David Gamarnik

    Abstract: Matrices $Φ\in\R^{n\times p}$ satisfying the Restricted Isometry Property (RIP) are an important ingredient of the compressive sensing methods. While it is known that random matrices satisfy the RIP with high probability even for $n=\log^{O(1)}p$, the explicit construction of such matrices defied the repeated efforts, and the most known approaches hit the so-called $\sqrt{n}$ sparsity bottleneck.… ▽ More

    Submitted 15 November, 2018; v1 submitted 29 May, 2018; originally announced May 2018.

    Comments: 4 pages

  7. arXiv:1803.06716  [pdf, other

    math.ST math.PR stat.ML

    High Dimensional Linear Regression using Lattice Basis Reduction

    Authors: David Gamarnik, Ilias Zadik

    Abstract: We consider a high dimensional linear regression problem where the goal is to efficiently recover an unknown vector $β^*$ from $n$ noisy linear observations $Y=Xβ^*+W \in \mathbb{R}^n$, for known $X \in \mathbb{R}^{n \times p}$ and unknown $W \in \mathbb{R}^n$. Unlike most of the literature on this model we make no sparsity assumption on $β^*$. Instead we adopt a regularization based on assuming t… ▽ More

    Submitted 8 November, 2018; v1 submitted 18 March, 2018; originally announced March 2018.

  8. arXiv:1711.04952  [pdf, ps, other

    math.ST math.PR stat.ML

    Sparse High-Dimensional Linear Regression. Algorithmic Barriers and a Local Search Algorithm

    Authors: David Gamarnik, Ilias Zadik

    Abstract: We consider a sparse high dimensional regression model where the goal is to recover a $k$-sparse unknown vector $β^*$ from $n$ noisy linear observations of the form $Y=Xβ^*+W \in \mathbb{R}^n$ where $X \in \mathbb{R}^{n \times p}$ has iid $N(0,1)$ entries and $W \in \mathbb{R}^n$ has iid $N(0,σ^2)$ entries. Under certain assumptions on the parameters, an intriguing assymptotic gap appears between… ▽ More

    Submitted 22 September, 2019; v1 submitted 14 November, 2017; originally announced November 2017.

    Comments: Added a result on the failure of the LASSO recovery mechanism in the conjectured algorithmically hard regime $n<c n_{alg}$ and minor corrections

  9. arXiv:1702.02267  [pdf, ps, other

    stat.ML cs.DS cs.LG math.OC

    Matrix Completion from $O(n)$ Samples in Linear Time

    Authors: David Gamarnik, Quan Li, Hongyi Zhang

    Abstract: We consider the problem of reconstructing a rank-$k$ $n \times n$ matrix $M$ from a sampling of its entries. Under a certain incoherence assumption on $M$ and for the case when both the rank and the condition number of $M$ are bounded, it was shown in \cite{CandesRecht2009, CandesTao2010, keshavan2010, Recht2011, Jain2012, Hardt2014} that $M$ can be recovered exactly or approximately (depending on… ▽ More

    Submitted 22 August, 2017; v1 submitted 7 February, 2017; originally announced February 2017.

    Comments: 45 pages, 1 figure. Short version accepted for presentation at Conference on Learning Theory (COLT) 2017

  10. arXiv:1701.04455  [pdf, other

    stat.ML math.PR math.ST

    High-Dimensional Regression with Binary Coefficients. Estimating Squared Error and a Phase Transition

    Authors: David Gamarnik, Ilias Zadik

    Abstract: We consider a sparse linear regression model Y=Xβ^{*}+W where X has a Gaussian entries, W is the noise vector with mean zero Gaussian entries, and β^{*} is a binary vector with support size (sparsity) k. Using a novel conditional second moment method we obtain a tight up to a multiplicative constant approximation of the optimal squared error \min_β\|Y-Xβ\|_{2}, where the minimization is over all k… ▽ More

    Submitted 25 September, 2019; v1 submitted 16 January, 2017; originally announced January 2017.

    Comments: 36 pages, 5 figures

  11. arXiv:1603.06002  [pdf, ps, other

    cs.DS stat.ML

    A Message Passing Algorithm for the Problem of Path Packing in Graphs

    Authors: Patrick Eschenfeldt, David Gamarnik

    Abstract: We consider the problem of packing node-disjoint directed paths in a directed graph. We consider a variant of this problem where each path starts within a fixed subset of root nodes, subject to a given bound on the length of paths. This problem is motivated by the so-called kidney exchange problem, but has potential other applications and is interesting in its own right. We propose a new algorit… ▽ More

    Submitted 18 March, 2016; originally announced March 2016.

    Comments: 34 pages

  12. arXiv:1602.02164  [pdf, other

    stat.ML cs.LG math.NA

    A Note on Alternating Minimization Algorithm for the Matrix Completion Problem

    Authors: David Gamarnik, Sidhant Misra

    Abstract: We consider the problem of reconstructing a low rank matrix from a subset of its entries and analyze two variants of the so-called Alternating Minimization algorithm, which has been proposed in the past. We establish that when the underlying matrix has rank $r=1$, has positive bounded entries, and the graph $\mathcal{G}$ underlying the revealed entries has bounded degree and diameter which is at m… ▽ More

    Submitted 5 February, 2016; originally announced February 2016.

    Comments: 8 pages, 2 figures

  13. arXiv:1412.1443  [pdf, ps, other

    stat.ML cs.IT cs.LG

    Structure learning of antiferromagnetic Ising models

    Authors: Guy Bresler, David Gamarnik, Devavrat Shah

    Abstract: In this paper we investigate the computational complexity of learning the graph structure underlying a discrete undirected graphical model from i.i.d. samples. We first observe that the notoriously difficult problem of learning parities with noise can be captured as a special case of learning graphical models. This leads to an unconditional computational lower bound of $Ω(p^{d/2})$ for learning ge… ▽ More

    Submitted 3 December, 2014; originally announced December 2014.

    Comments: 15 pages. NIPS 2014

  14. arXiv:1410.7659  [pdf, ps, other

    cs.LG cs.IT stat.CO stat.ML

    Learning graphical models from the Glauber dynamics

    Authors: Guy Bresler, David Gamarnik, Devavrat Shah

    Abstract: In this paper we consider the problem of learning undirected graphical models from data generated according to the Glauber dynamics. The Glauber dynamics is a Markov chain that sequentially updates individual nodes (variables) in a graphical model and it is frequently used to sample from the stationary distribution (to which it converges given sufficient time). Additionally, the Glauber dynamics i… ▽ More

    Submitted 28 November, 2014; v1 submitted 28 October, 2014; originally announced October 2014.

    Comments: 9 pages. Appeared in Allerton Conference 2014

  15. arXiv:1409.3836  [pdf, ps, other

    cs.CC cs.AI cs.IT stat.CO

    Hardness of parameter estimation in graphical models

    Authors: Guy Bresler, David Gamarnik, Devavrat Shah

    Abstract: We consider the problem of learning the canonical parameters specifying an undirected graphical model (Markov random field) from the mean parameters. For graphical models representing a minimal exponential family, the canonical parameters are uniquely determined by the mean parameters, so the problem is feasible in principle. The goal of this paper is to investigate the computational feasibility o… ▽ More

    Submitted 17 September, 2014; v1 submitted 12 September, 2014; originally announced September 2014.

    Comments: 15 pages. To appear in NIPS 2014