
Showing 1–18 of 18 results for author: Zarifis, N

  1. arXiv:2405.12958  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Online Learning of Halfspaces with Massart Noise

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the task of online learning in the presence of Massart noise. Instead of assuming that the online adversary chooses an arbitrary sequence of labels, we assume that the context $\mathbf{x}$ is selected adversarially but the label $y$ presented to the learner disagrees with the ground-truth label of $\mathbf{x}$ with unknown probability at most $η$. We study the fundamental class of $γ$-mar…

    Submitted 21 May, 2024; originally announced May 2024.

  2. arXiv:2403.02300  [pdf, other]

    cs.DS cs.LG math.ST stat.ML

    Statistical Query Lower Bounds for Learning Truncated Gaussians

    Authors: Ilias Diakonikolas, Daniel M. Kane, Thanasis Pittas, Nikos Zarifis

    Abstract: We study the problem of estimating the mean of an identity covariance Gaussian in the truncated setting, in the regime when the truncation set comes from a low-complexity family $\mathcal{C}$ of sets. Specifically, for a fixed but unknown truncation set $S \subseteq \mathbb{R}^d$, we are given access to samples from the distribution $\mathcal{N}(\boldsymbol{μ}, \mathbf{I})$ truncated to the set…

    Submitted 4 March, 2024; originally announced March 2024.

  3. arXiv:2402.17756  [pdf, other]

    cs.LG cs.DS math.OC math.ST stat.ML

    Robustly Learning Single-Index Models via Alignment Sharpness

    Authors: Nikos Zarifis, Puqian Wang, Ilias Diakonikolas, Jelena Diakonikolas

    Abstract: We study the problem of learning Single-Index Models under the $L_2^2$ loss in the agnostic model. We give an efficient learning algorithm, achieving a constant factor approximation to the optimal loss, that succeeds under a range of distributions (including log-concave distributions) and a broad class of monotone and Lipschitz link functions. This is the first efficient constant factor approximat…

    Submitted 27 February, 2024; originally announced February 2024.

  4. arXiv:2312.16616  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Agnostically Learning Multi-index Models with Queries

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the power of query access for the task of agnostic learning under the Gaussian distribution. In the agnostic model, no assumptions are made on the labels and the goal is to compute a hypothesis that is competitive with the {\em best-fit} function in a known class, i.e., it achieves error $\mathrm{opt}+ε$, where $\mathrm{opt}$ is the error of the best function in the class. We focus on a g…

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: abstract shortened due to arxiv requirements

  5. arXiv:2308.03142  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Self-Directed Linear Classification

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: In online classification, a learner is presented with a sequence of examples and aims to predict their labels in an online fashion so as to minimize the total number of mistakes. In the self-directed variant, the learner knows in advance the pool of examples and can adaptively choose the order in which predictions are made. Here we study the power of choosing the prediction order and establish the…

    Submitted 6 August, 2023; originally announced August 2023.

  6. arXiv:2307.08438  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Near-Optimal Bounds for Learning Gaussian Halfspaces with Random Classification Noise

    Authors: Ilias Diakonikolas, Jelena Diakonikolas, Daniel M. Kane, Puqian Wang, Nikos Zarifis

    Abstract: We study the problem of learning general (i.e., not necessarily homogeneous) halfspaces with Random Classification Noise under the Gaussian distribution. We establish nearly-matching algorithmic and Statistical Query (SQ) lower bound results revealing a surprising information-computation gap for this basic problem. Specifically, the sample complexity of this learning problem is $\widetilde{Θ}(d/ε)$,…

    Submitted 13 July, 2023; originally announced July 2023.

  7. arXiv:2306.16352  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Information-Computation Tradeoffs for Learning Margin Halfspaces with Random Classification Noise

    Authors: Ilias Diakonikolas, Jelena Diakonikolas, Daniel M. Kane, Puqian Wang, Nikos Zarifis

    Abstract: We study the problem of PAC learning $γ$-margin halfspaces with Random Classification Noise. We establish an information-computation tradeoff suggesting an inherent gap between the sample complexity of the problem and the sample complexity of computationally efficient algorithms. Concretely, the sample complexity of the problem is $\widetilde{Θ}(1/(γ^2 ε))$. We start by giving a simple efficient alg…

    Submitted 28 June, 2023; originally announced June 2023.

  8. arXiv:2306.13057  [pdf, other]

    cs.LG cs.DS math.ST stat.ML

    SQ Lower Bounds for Learning Bounded Covariance GMMs

    Authors: Ilias Diakonikolas, Daniel M. Kane, Thanasis Pittas, Nikos Zarifis

    Abstract: We study the complexity of learning mixtures of separated Gaussians with common unknown bounded covariance matrix. Specifically, we focus on learning Gaussian mixture models (GMMs) on $\mathbb{R}^d$ of the form $P= \sum_{i=1}^k w_i \mathcal{N}(\boldsymbol μ_i,\mathbf Σ_i)$, where $\mathbf Σ_i = \mathbf Σ\preceq \mathbf I$ and $\min_{i \neq j} \| \boldsymbol μ_i - \boldsymbol μ_j\|_2 \geq k^ε$ for…

    Submitted 22 June, 2023; originally announced June 2023.

  9. arXiv:2306.07892  [pdf, other]

    cs.LG cs.DS math.OC math.ST stat.ML

    Robustly Learning a Single Neuron via Sharpness

    Authors: Puqian Wang, Nikos Zarifis, Ilias Diakonikolas, Jelena Diakonikolas

    Abstract: We study the problem of learning a single neuron with respect to the $L_2^2$-loss in the presence of adversarial label noise. We give an efficient algorithm that, for a broad family of activations including ReLUs, approximates the optimal $L_2^2$-error within a constant factor. Our algorithm applies under much milder distributional assumptions compared to prior work. The key ingredient enabling ou…

    Submitted 13 June, 2023; originally announced June 2023.

  10. arXiv:2206.08918  [pdf, other]

    cs.LG cs.DS math.ST stat.ML

    Learning a Single Neuron with Adversarial Label Noise via Gradient Descent

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the fundamental problem of learning a single neuron, i.e., a function of the form $\mathbf{x}\mapsto σ(\mathbf{w}\cdot\mathbf{x})$ for monotone activations $σ:\mathbb{R}\mapsto\mathbb{R}$, with respect to the $L_2^2$-loss in the presence of adversarial label noise. Specifically, we are given labeled examples from a distribution $D$ on $(\mathbf{x}, y)\in\mathbb{R}^d \times \mathbb{R}$ such…

    Submitted 17 June, 2022; originally announced June 2022.

  11. arXiv:2108.08767  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Learning General Halfspaces with General Massart Noise under the Gaussian Distribution

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of PAC learning halfspaces on $\mathbb{R}^d$ with Massart noise under the Gaussian distribution. In the Massart model, an adversary is allowed to flip the label of each point $\mathbf{x}$ with unknown probability $η(\mathbf{x}) \leq η$, for some parameter $η\in [0,1/2]$. The goal is to find a hypothesis with misclassification error of $\mathrm{OPT} + ε$, where $\mathrm{OPT}$ i…

    Submitted 8 November, 2021; v1 submitted 19 August, 2021; originally announced August 2021.

    Comments: Revised presentation

  12. arXiv:2102.05629  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Agnostic Proper Learning of Halfspaces under Gaussian Marginals

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of agnostically learning halfspaces under the Gaussian distribution. Our main result is the {\em first proper} learning algorithm for this problem whose sample complexity and computational complexity qualitatively match those of the best known improper agnostic learner. Building on this result, we also obtain the first proper polynomial-time approximation scheme (PTAS) for agn…

    Submitted 10 February, 2021; originally announced February 2021.

  13. arXiv:2102.04401  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    The Optimality of Polynomial Regression for Agnostic Learning under Gaussian Marginals

    Authors: Ilias Diakonikolas, Daniel M. Kane, Thanasis Pittas, Nikos Zarifis

    Abstract: We study the problem of agnostic learning under the Gaussian distribution. We develop a method for finding hard families of examples for a wide class of problems by using LP duality. For Boolean-valued concept classes, we show that the $L^1$-regression algorithm is essentially best possible, and therefore that the computational difficulty of agnostically learning a concept class is closely related…

    Submitted 8 February, 2021; originally announced February 2021.

  14. arXiv:2010.01705  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    A Polynomial Time Algorithm for Learning Halfspaces with Tsybakov Noise

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of PAC learning homogeneous halfspaces in the presence of Tsybakov noise. In the Tsybakov noise model, the label of every sample is independently flipped with an adversarially controlled probability that can be arbitrarily close to $1/2$ for a fraction of the samples. {\em We give the first polynomial-time algorithm for this fundamental learning problem.} Our algorithm learns…

    Submitted 4 October, 2020; originally announced October 2020.

  15. arXiv:2006.16200  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Near-Optimal SQ Lower Bounds for Agnostically Learning Halfspaces and ReLUs under Gaussian Marginals

    Authors: Ilias Diakonikolas, Daniel M. Kane, Nikos Zarifis

    Abstract: We study the fundamental problems of agnostically learning halfspaces and ReLUs under Gaussian marginals. In the former problem, given labeled examples $(\mathbf{x}, y)$ from an unknown distribution on $\mathbb{R}^d \times \{ \pm 1\}$, whose marginal distribution on $\mathbf{x}$ is the standard Gaussian and the labels $y$ can be arbitrary, the goal is to output a hypothesis with 0-1 loss…

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: 19 pages

  16. arXiv:2006.12476  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Algorithms and SQ Lower Bounds for PAC Learning One-Hidden-Layer ReLU Networks

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Nikos Zarifis

    Abstract: We study the problem of PAC learning one-hidden-layer ReLU networks with $k$ hidden units on $\mathbb{R}^d$ under Gaussian marginals in the presence of additive label noise. For the case of positive coefficients, we give the first polynomial-time algorithm for this learning problem for $k$ up to $\tilde{O}(\sqrt{\log d})$. Previously, no polynomial time algorithm was known, even for $k=3$. This an…

    Submitted 22 June, 2020; originally announced June 2020.

  17. arXiv:2006.06467  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Learning Halfspaces with Tsybakov Noise

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the efficient PAC learnability of halfspaces in the presence of Tsybakov noise. In the Tsybakov noise model, each label is independently flipped with some probability which is controlled by an adversary. This noise model significantly generalizes the Massart noise model, by allowing the flipping probabilities to be arbitrarily close to $1/2$ for a fraction of the samples. Our main result…

    Submitted 11 June, 2020; originally announced June 2020.

  18. arXiv:2002.05632  [pdf, other]

    cs.LG cs.DS math.ST stat.ML

    Learning Halfspaces with Massart Noise Under Structured Distributions

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of learning halfspaces with Massart noise in the distribution-specific PAC model. We give the first computationally efficient algorithm for this problem with respect to a broad family of distributions, including log-concave distributions. This resolves an open question posed in a number of prior works. Our approach is extremely simple: We identify a smooth {\em non-convex} sur…

    Submitted 13 February, 2020; originally announced February 2020.