Skip to main content

Showing 1–23 of 23 results for author: Tzamos, C

Searching in archive math. Search in all archives.
.
  1. arXiv:2405.12958  [pdf, ps, other

    cs.LG cs.DS math.ST stat.ML

    Online Learning of Halfspaces with Massart Noise

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the task of online learning in the presence of Massart noise. Instead of assuming that the online adversary chooses an arbitrary sequence of labels, we assume that the context $\mathbf{x}$ is selected adversarially but the label $y$ presented to the learner disagrees with the ground-truth label of $\mathbf{x}$ with unknown probability at most $η$. We study the fundamental class of $γ$-mar… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  2. arXiv:2312.16616  [pdf, ps, other

    cs.LG cs.DS math.ST stat.ML

    Agnostically Learning Multi-index Models with Queries

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the power of query access for the task of agnostic learning under the Gaussian distribution. In the agnostic model, no assumptions are made on the labels and the goal is to compute a hypothesis that is competitive with the {\em best-fit} function in a known class, i.e., it achieves error $\mathrm{opt}+ε$, where $\mathrm{opt}$ is the error of the best function in the class. We focus on a g… ▽ More

    Submitted 27 December, 2023; originally announced December 2023.

    Comments: abstract shortened due to arxiv requirements

  3. arXiv:2309.11657  [pdf, other

    cs.DS cs.LG math.ST stat.ML

    Distribution-Independent Regression for Generalized Linear Models with Oblivious Corruptions

    Authors: Ilias Diakonikolas, Sushrut Karmalkar, Jongho Park, Christos Tzamos

    Abstract: We demonstrate the first algorithms for the problem of regression for generalized linear models (GLMs) in the presence of additive oblivious noise. We assume we have sample access to examples $(x, y)$ where $y$ is a noisy measurement of $g(w^* \cdot x)$. In particular, \new{the noisy labels are of the form} $y = g(w^* \cdot x) + ξ+ ε$, where $ξ$ is the oblivious noise drawn independently of $x$ \n… ▽ More

    Submitted 27 September, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: Published in COLT 2023

  4. arXiv:2308.03142  [pdf, ps, other

    cs.LG cs.DS math.ST stat.ML

    Self-Directed Linear Classification

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: In online classification, a learner is presented with a sequence of examples and aims to predict their labels in an online fashion so as to minimize the total number of mistakes. In the self-directed variant, the learner knows in advance the pool of examples and can adaptively choose the order in which predictions are made. Here we study the power of choosing the prediction order and establish the… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

  5. arXiv:2206.08918  [pdf, other

    cs.LG cs.DS math.ST stat.ML

    Learning a Single Neuron with Adversarial Label Noise via Gradient Descent

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the fundamental problem of learning a single neuron, i.e., a function of the form $\mathbf{x}\mapstoσ(\mathbf{w}\cdot\mathbf{x})$ for monotone activations $σ:\mathbb{R}\mapsto\mathbb{R}$, with respect to the $L_2^2$-loss in the presence of adversarial label noise. Specifically, we are given labeled examples from a distribution $D$ on $(\mathbf{x}, y)\in\mathbb{R}^d \times \mathbb{R}$ such… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

  6. arXiv:2108.08767  [pdf, ps, other

    cs.LG cs.DS math.ST stat.ML

    Learning General Halfspaces with General Massart Noise under the Gaussian Distribution

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of PAC learning halfspaces on $\mathbb{R}^d$ with Massart noise under the Gaussian distribution. In the Massart model, an adversary is allowed to flip the label of each point $\mathbf{x}$ with unknown probability $η(\mathbf{x}) \leq η$, for some parameter $η\in [0,1/2]$. The goal is to find a hypothesis with misclassification error of $\mathrm{OPT} + ε$, where $\mathrm{OPT}$ i… ▽ More

    Submitted 8 November, 2021; v1 submitted 19 August, 2021; originally announced August 2021.

    Comments: Revised presentation

  7. arXiv:2106.15908  [pdf, other

    math.ST math.PR

    A Statistical Taylor Theorem and Extrapolation of Truncated Densities

    Authors: Constantinos Daskalakis, Vasilis Kontonis, Christos Tzamos, Manolis Zampetakis

    Abstract: We show a statistical version of Taylor's theorem and apply this result to non-parametric density estimation from truncated samples, which is a classical challenge in Statistics \cite{woodroofe1985estimating, stute1993almost}. The single-dimensional version of our theorem has the following implication: "For any distribution $P$ on $[0, 1]$ with a smooth log-density function, given samples from the… ▽ More

    Submitted 30 June, 2021; originally announced June 2021.

    Comments: Appeared at COLT2021

  8. arXiv:2102.05629  [pdf, ps, other

    cs.LG cs.DS math.ST stat.ML

    Agnostic Proper Learning of Halfspaces under Gaussian Marginals

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of agnostically learning halfspaces under the Gaussian distribution. Our main result is the {\em first proper} learning algorithm for this problem whose sample complexity and computational complexity qualitatively match those of the best known improper agnostic learner. Building on this result, we also obtain the first proper polynomial-time approximation scheme (PTAS) for agn… ▽ More

    Submitted 10 February, 2021; originally announced February 2021.

  9. arXiv:2012.00732  [pdf, other

    cs.LG math.ST

    Convergence and Sample Complexity of SGD in GANs

    Authors: Vasilis Kontonis, Sihan Liu, Christos Tzamos

    Abstract: We provide theoretical convergence guarantees on training Generative Adversarial Networks (GANs) via SGD. We consider learning a target distribution modeled by a 1-layer Generator network with a non-linear activation function $φ(\cdot)$ parametrized by a $d \times d$ weight matrix $\mathbf W_*$, i.e., $f_*(\mathbf x) = φ(\mathbf W_* \mathbf x)$. Our main result is that by training the Generator… ▽ More

    Submitted 1 December, 2020; originally announced December 2020.

  10. arXiv:2011.06202  [pdf, ps, other

    math.ST cs.CR cs.DS math.PR

    Optimal Private Median Estimation under Minimal Distributional Assumptions

    Authors: Christos Tzamos, Emmanouil-Vasileios Vlatakis-Gkaragkounis, Ilias Zadik

    Abstract: We study the fundamental task of estimating the median of an underlying distribution from a finite number of samples, under pure differential privacy constraints. We focus on distributions satisfying the minimal assumption that they have a positive density at a small neighborhood around the median. In particular, the distribution is allowed to output unbounded values and is not required to have fi… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: 49 pages, NeurIPS 2020, Spotlight talk

    MSC Class: Primary 68P27; secondary 68Q32

  11. arXiv:2010.12000  [pdf, other

    math.ST cs.DS cs.LG

    Computationally and Statistically Efficient Truncated Regression

    Authors: Constantinos Daskalakis, Themis Gouleakis, Christos Tzamos, Manolis Zampetakis

    Abstract: We provide a computationally and statistically efficient estimator for the classical problem of truncated linear regression, where the dependent variable $y = w^T x + ε$ and its corresponding vector of covariates $x \in R^k$ are only revealed if the dependent variable falls in some subset $S \subseteq R$; otherwise the existence of the pair $(x, y)$ is hidden. This problem has remained a challenge… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted for presentation at the Conference on Learning Theory (COLT) 2019

  12. arXiv:2010.01705  [pdf, ps, other

    cs.LG cs.DS math.ST stat.ML

    A Polynomial Time Algorithm for Learning Halfspaces with Tsybakov Noise

    Authors: Ilias Diakonikolas, Daniel M. Kane, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of PAC learning homogeneous halfspaces in the presence of Tsybakov noise. In the Tsybakov noise model, the label of every sample is independently flipped with an adversarially controlled probability that can be arbitrarily close to $1/2$ for a fraction of the samples. {\em We give the first polynomial-time algorithm for this fundamental learning problem.} Our algorithm learns… ▽ More

    Submitted 4 October, 2020; originally announced October 2020.

  13. arXiv:2007.02392  [pdf, ps, other

    cs.LG cs.DS math.ST stat.CO stat.ML

    Efficient Parameter Estimation of Truncated Boolean Product Distributions

    Authors: Dimitris Fotakis, Alkis Kalavasis, Christos Tzamos

    Abstract: We study the problem of estimating the parameters of a Boolean product distribution in $d$ dimensions, when the samples are truncated by a set $S \subset \{0, 1\}^d$ accessible through a membership oracle. This is the first time that the computational and statistical complexity of learning from truncated samples is considered in a discrete setting. We introduce a natural notion of fatness of the… ▽ More

    Submitted 24 April, 2022; v1 submitted 5 July, 2020; originally announced July 2020.

    Comments: 33rd Conference on Learning Theory (COLT 2020)

  14. arXiv:2006.06467  [pdf, ps, other

    cs.LG cs.DS math.ST stat.ML

    Learning Halfspaces with Tsybakov Noise

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the efficient PAC learnability of halfspaces in the presence of Tsybakov noise. In the Tsybakov noise model, each label is independently flipped with some probability which is controlled by an adversary. This noise model significantly generalizes the Massart noise model, by allowing the flip** probabilities to be arbitrarily close to $1/2$ for a fraction of the samples. Our main result… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

  15. arXiv:2002.05632  [pdf, other

    cs.LG cs.DS math.ST stat.ML

    Learning Halfspaces with Massart Noise Under Structured Distributions

    Authors: Ilias Diakonikolas, Vasilis Kontonis, Christos Tzamos, Nikos Zarifis

    Abstract: We study the problem of learning halfspaces with Massart noise in the distribution-specific PAC model. We give the first computationally efficient algorithm for this problem with respect to a broad family of distributions, including log-concave distributions. This resolves an open question posed in a number of prior works. Our approach is extremely simple: We identify a smooth {\em non-convex} sur… ▽ More

    Submitted 13 February, 2020; originally announced February 2020.

  16. arXiv:1908.01034  [pdf, other

    math.ST cs.DS cs.LG stat.CO stat.ML

    Efficient Truncated Statistics with Unknown Truncation

    Authors: Vasilis Kontonis, Christos Tzamos, Manolis Zampetakis

    Abstract: We study the problem of estimating the parameters of a Gaussian distribution when samples are only shown if they fall in some (unknown) subset $S \subseteq \R^d$. This core problem in truncated statistics has long history going back to Galton, Lee, Pearson and Fisher. Recent work by Daskalakis et al. (FOCS'18), provides the first efficient algorithm that works for arbitrary sets in high dimension… ▽ More

    Submitted 2 August, 2019; originally announced August 2019.

    Comments: to appear at 60th Annual IEEE Symposium on Foundations of Computer Science (FOCS), 2019

  17. arXiv:1906.10075  [pdf, other

    cs.LG cs.DS math.ST stat.ML

    Distribution-Independent PAC Learning of Halfspaces with Massart Noise

    Authors: Ilias Diakonikolas, Themis Gouleakis, Christos Tzamos

    Abstract: We study the problem of {\em distribution-independent} PAC learning of halfspaces in the presence of Massart noise. Specifically, we are given a set of labeled examples $(\mathbf{x}, y)$ drawn from a distribution $\mathcal{D}$ on $\mathbb{R}^{d+1}$ such that the marginal distribution on the unlabeled points $\mathbf{x}$ is arbitrary and the labels $y$ are generated by an unknown halfspace corrupte… ▽ More

    Submitted 10 December, 2019; v1 submitted 24 June, 2019; originally announced June 2019.

  18. arXiv:1809.03986  [pdf, other

    math.ST cs.DS cs.LG stat.CO stat.ML

    Efficient Statistics, in High Dimensions, from Truncated Samples

    Authors: Constantinos Daskalakis, Themis Gouleakis, Christos Tzamos, Manolis Zampetakis

    Abstract: We provide an efficient algorithm for the classical problem, going back to Galton, Pearson, and Fisher, of estimating, with arbitrary accuracy the parameters of a multivariate normal distribution from truncated samples. Truncated samples from a $d$-variate normal ${\cal N}(\mathbfμ,\mathbfΣ)$ means a samples is only revealed if it falls in some subset $S \subseteq \mathbb{R}^d$; otherwise the samp… ▽ More

    Submitted 22 October, 2020; v1 submitted 11 September, 2018; originally announced September 2018.

    Comments: Appeared at 59th Annual IEEE Symposium on Foundations of Computer Science (FOCS), 2018

  19. arXiv:1807.06168  [pdf, ps, other

    cs.DS cs.IT cs.LG math.PR math.ST

    Anaconda: A Non-Adaptive Conditional Sampling Algorithm for Distribution Testing

    Authors: Gautam Kamath, Christos Tzamos

    Abstract: We investigate distribution testing with access to non-adaptive conditional samples. In the conditional sampling model, the algorithm is given the following access to a distribution: it submits a query set $S$ to an oracle, which returns a sample from the distribution conditioned on being from $S$. In the non-adaptive setting, all query sets must be specified in advance of viewing the outcomes.… ▽ More

    Submitted 5 November, 2018; v1 submitted 16 July, 2018; originally announced July 2018.

    Comments: SODA 2019

  20. arXiv:1702.07339  [pdf, ps, other

    cs.CC cs.LG math.GN stat.ML

    A Converse to Banach's Fixed Point Theorem and its CLS Completeness

    Authors: Constantinos Daskalakis, Christos Tzamos, Manolis Zampetakis

    Abstract: Banach's fixed point theorem for contraction maps has been widely used to analyze the convergence of iterative methods in non-convex problems. It is a common experience, however, that iterative maps fail to be globally contracting under the natural metric in their domain, making the applicability of Banach's theorem limited. We explore how generally we can apply Banach's fixed point theorem to est… ▽ More

    Submitted 13 February, 2018; v1 submitted 23 February, 2017; originally announced February 2017.

  21. arXiv:1609.00368  [pdf, other

    stat.ML cs.DS math.ST

    Ten Steps of EM Suffice for Mixtures of Two Gaussians

    Authors: Constantinos Daskalakis, Christos Tzamos, Manolis Zampetakis

    Abstract: The Expectation-Maximization (EM) algorithm is a widely used method for maximum likelihood estimation in models with latent variables. For estimating mixtures of Gaussians, its iteration can be viewed as a soft version of the k-means clustering algorithm. Despite its wide use and applications, there are essentially no known convergence guarantees for this method. We provide global convergence guar… ▽ More

    Submitted 5 June, 2017; v1 submitted 1 September, 2016; originally announced September 2016.

    Comments: Accepted for presentation at Conference on Learning Theory (COLT) 2017

  22. arXiv:1511.03641  [pdf, ps, other

    cs.DS cs.GT cs.LG math.PR math.ST

    A Size-Free CLT for Poisson Multinomials and its Applications

    Authors: Constantinos Daskalakis, Anindya De, Gautam Kamath, Christos Tzamos

    Abstract: An $(n,k)$-Poisson Multinomial Distribution (PMD) is the distribution of the sum of $n$ independent random vectors supported on the set ${\cal B}_k=\{e_1,\ldots,e_k\}$ of standard basis vectors in $\mathbb{R}^k$. We show that any $(n,k)$-PMD is ${\rm poly}\left({k\over σ}\right)$-close in total variation distance to the (appropriately discretized) multi-dimensional Gaussian with the same first two… ▽ More

    Submitted 16 June, 2016; v1 submitted 11 November, 2015; originally announced November 2015.

    Comments: To appear in STOC 2016

  23. arXiv:1504.08363  [pdf, ps, other

    cs.DS cs.LG math.PR math.ST

    On the Structure, Covering, and Learning of Poisson Multinomial Distributions

    Authors: Constantinos Daskalakis, Gautam Kamath, Christos Tzamos

    Abstract: An $(n,k)$-Poisson Multinomial Distribution (PMD) is the distribution of the sum of $n$ independent random vectors supported on the set ${\cal B}_k=\{e_1,\ldots,e_k\}$ of standard basis vectors in $\mathbb{R}^k$. We prove a structural characterization of these distributions, showing that, for all $\varepsilon >0$, any $(n, k)$-Poisson multinomial random vector is $\varepsilon$-close, in total vari… ▽ More

    Submitted 23 November, 2015; v1 submitted 30 April, 2015; originally announced April 2015.

    Comments: 49 pages, extended abstract appeared in FOCS 2015