Skip to main content

Showing 1–4 of 4 results for author: Buhai, R

Searching in archive stat. Search in all archives.
.
  1. arXiv:2402.14103  [pdf, ps, other

    cs.LG cs.CC math.ST stat.ML

    Computational-Statistical Gaps for Improper Learning in Sparse Linear Regression

    Authors: Rares-Darius Buhai, **gqiu Ding, Stefan Tiegel

    Abstract: We study computational-statistical gaps for improper learning in sparse linear regression. More specifically, given $n$ samples from a $k$-sparse linear model in dimension $d$, we ask what is the minimum sample complexity to efficiently (in time polynomial in $d$, $k$, and $n$) find a potentially dense estimate for the regression vector that achieves non-trivial prediction error on the $n$ samples… ▽ More

    Submitted 25 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 24 pages; updated typos, some explanations, and references

  2. arXiv:2112.05445  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Beyond Parallel Pancakes: Quasi-Polynomial Time Guarantees for Non-Spherical Gaussian Mixtures

    Authors: Rares-Darius Buhai, David Steurer

    Abstract: We consider mixtures of $k\geq 2$ Gaussian components with unknown means and unknown covariance (identical for all components) that are well-separated, i.e., distinct components have statistical overlap at most $k^{-C}$ for a large enough constant $C\ge 1$. Previous statistical-query [DKS17] and lattice-based [BRST21, GVV22] lower bounds give formal evidence that even distinguishing such mixtures… ▽ More

    Submitted 7 June, 2023; v1 submitted 10 December, 2021; originally announced December 2021.

    Comments: 67 pages, the arxiv landing page contains a shortened abstract

  3. arXiv:2006.04166  [pdf, other

    cs.LG cs.DS cs.IT stat.ML

    Learning Restricted Boltzmann Machines with Sparse Latent Variables

    Authors: Guy Bresler, Rares-Darius Buhai

    Abstract: Restricted Boltzmann Machines (RBMs) are a common family of undirected graphical models with latent variables. An RBM is described by a bipartite graph, with all observed variables in one layer and all latent variables in the other. We consider the task of learning an RBM given samples generated according to it. The best algorithms for this task currently have time complexity $\tilde{O}(n^2)$ for… ▽ More

    Submitted 17 October, 2020; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: 33 pages, to appear at NeurIPS 2020

  4. arXiv:1907.00030  [pdf, other

    stat.ML cs.LG

    Empirical Study of the Benefits of Overparameterization in Learning Latent Variable Models

    Authors: Rares-Darius Buhai, Yoni Halpern, Yoon Kim, Andrej Risteski, David Sontag

    Abstract: One of the most surprising and exciting discoveries in supervised learning was the benefit of overparameterization (i.e. training a very large model) to improving the optimization landscape of a problem, with minimal effect on statistical performance (i.e. generalization). In contrast, unsupervised settings have been under-explored, despite the fact that it was observed that overparameterization c… ▽ More

    Submitted 16 July, 2020; v1 submitted 28 June, 2019; originally announced July 2019.

    Comments: 22 pages, to appear at ICML 2020