Showing 1–10 of 10 results for author: Gollakota, A

  1. arXiv:2306.10615  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Agnostically Learning Single-Index Models using Omnipredictors

    Authors: Aravind Gollakota, Parikshit Gopalan, Adam R. Klivans, Konstantinos Stavropoulos

    Abstract: We give the first result for agnostically learning Single-Index Models (SIMs) with arbitrary monotone and Lipschitz activations. All prior work either held only in the realizable setting or required the activation to be known. Moreover, we only require the marginal to have bounded second moments, whereas all prior work required stronger distributional assumptions (such as anticoncentration or boun…

    Submitted 18 June, 2023; originally announced June 2023.

    Comments: 21 pages

  2. arXiv:2305.19256  [pdf, other]

    cs.LG cs.AI cs.CV cs.IT

    Ambient Diffusion: Learning Clean Distributions from Corrupted Data

    Authors: Giannis Daras, Kulin Shah, Yuval Dagan, Aravind Gollakota, Alexandros G. Dimakis, Adam Klivans

    Abstract: We present the first diffusion-based framework that can learn an unknown distribution using only highly-corrupted samples. This problem arises in scientific applications where access to uncorrupted samples is impossible or expensive to acquire. Another benefit of our approach is the ability to train generative models that are less likely to memorize individual training samples since they never obs…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: 24 pages, 11 figures

  3. arXiv:2305.11765  [pdf, other]

    cs.LG cs.DS stat.ML

    Tester-Learners for Halfspaces: Universal Algorithms

    Authors: Aravind Gollakota, Adam R. Klivans, Konstantinos Stavropoulos, Arsen Vasilyan

    Abstract: We give the first tester-learner for halfspaces that succeeds universally over a wide class of structured distributions. Our universal tester-learner runs in fully polynomial time and has the following guarantee: the learner achieves error $O(\mathrm{opt}) + ε$ on any labeled distribution that the tester accepts, and moreover, the tester accepts whenever the marginal is any distribution that satis…

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: 26 pages, 2 figures

  4. arXiv:2302.14853  [pdf, other]

    cs.LG stat.ML

    An Efficient Tester-Learner for Halfspaces

    Authors: Aravind Gollakota, Adam R. Klivans, Konstantinos Stavropoulos, Arsen Vasilyan

    Abstract: We give the first efficient algorithm for learning halfspaces in the testable learning model recently defined by Rubinfeld and Vasilyan (2023). In this model, a learner certifies that the accuracy of its output hypothesis is near optimal whenever the training set passes an associated test, and training sets drawn from some target distribution -- e.g., the Gaussian -- must pass the test. This model…

    Submitted 13 March, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: 26 pages, 3 figures, Version v2: strengthened the agnostic guarantee

  5. arXiv:2211.13312  [pdf, ps, other]

    cs.LG cs.CC stat.ML

    A Moment-Matching Approach to Testable Learning and a New Characterization of Rademacher Complexity

    Authors: Aravind Gollakota, Adam R. Klivans, Pravesh K. Kothari

    Abstract: A remarkable recent paper by Rubinfeld and Vasilyan (2022) initiated the study of \emph{testable learning}, where the goal is to replace hard-to-verify distributional assumptions (such as Gaussianity) with efficiently testable ones and to require that the learner succeed whenever the unknown distribution passes the corresponding test. In this model, they gave an efficient algorithm for learning ha…

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: 34 pages

  6. arXiv:2202.05258  [pdf, ps, other]

    cs.LG cs.CC stat.ML

    Hardness of Noise-Free Learning for Two-Hidden-Layer Neural Networks

    Authors: Sitan Chen, Aravind Gollakota, Adam R. Klivans, Raghu Meka

    Abstract: We give superpolynomial statistical query (SQ) lower bounds for learning two-hidden-layer ReLU networks with respect to Gaussian inputs in the standard (noise-free) model. No general SQ lower bounds were known for learning ReLU networks of any depth in this setting: previous SQ lower bounds held only for adversarial noise models (agnostic learning) or restricted models such as correlational SQ.…

    Submitted 13 November, 2022; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: 35 pages, v3: refined exposition

  7. arXiv:2102.05174  [pdf, other]

    quant-ph cs.DS cs.LG

    On the Hardness of PAC-learning Stabilizer States with Noise

    Authors: Aravind Gollakota, Daniel Liang

    Abstract: We consider the problem of learning stabilizer states with noise in the Probably Approximately Correct (PAC) framework of Aaronson (2007) for learning quantum states. In the noiseless setting, an algorithm for this problem was recently given by Rocchetto (2018), but the noisy case was left open. Motivated by approaches to noise tolerance from classical learning theory, we introduce the Statistical…

    Submitted 31 January, 2022; v1 submitted 9 February, 2021; originally announced February 2021.

    Report number: 2022-02-02, volume 6, page 640

    Journal ref: Quantum 6, 640 (2022)

  8. arXiv:2010.11925  [pdf, ps, other]

    cs.DS cs.LG

    The Polynomial Method is Universal for Distribution-Free Correlational SQ Learning

    Authors: Aravind Gollakota, Sushrut Karmalkar, Adam Klivans

    Abstract: We consider the problem of distribution-free learning for Boolean function classes in the PAC and agnostic models. Generalizing a beautiful work of Malach and Shalev-Shwartz (2022) that gave tight correlational SQ (CSQ) lower bounds for learning DNF formulas, we give new proofs that lower bounds on the threshold or approximate degree of any function class directly imply CSQ lower bounds for PAC or…

    Submitted 24 August, 2023; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: v3: Improved discussion of relation to prior work

  9. arXiv:2006.15812  [pdf, ps, other]

    cs.LG cs.DS stat.ML

    Statistical-Query Lower Bounds via Functional Gradients

    Authors: Surbhi Goel, Aravind Gollakota, Adam Klivans

    Abstract: We give the first statistical-query lower bounds for agnostically learning any non-polynomial activation with respect to Gaussian marginals (e.g., ReLU, sigmoid, sign). For the specific problem of ReLU regression (equivalently, agnostically learning a ReLU), we show that any statistical-query algorithm with tolerance $n^{-(1/ε)^b}$ must use at least $2^{n^c} ε$ queries for some constant…

    Submitted 22 October, 2020; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: 34 pages, NeurIPS 2020

  10. arXiv:2006.12011  [pdf, other]

    cs.LG cs.DS stat.ML

    Superpolynomial Lower Bounds for Learning One-Layer Neural Networks using Gradient Descent

    Authors: Surbhi Goel, Aravind Gollakota, Zhihan Jin, Sushrut Karmalkar, Adam Klivans

    Abstract: We prove the first superpolynomial lower bounds for learning one-layer neural networks with respect to the Gaussian distribution using gradient descent. We show that any classifier trained using gradient descent with respect to square-loss will fail to achieve small test error in polynomial time given access to samples labeled by a one-layer neural network. For classification, we give a stronger r…

    Submitted 22 October, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: 25 pages, ICML 2020