Skip to main content

Showing 1–37 of 37 results for author: Awasthi, P

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.17989  [pdf, ps, other

    cs.LG stat.ML

    Learning Neural Networks with Sparse Activations

    Authors: Pranjal Awasthi, Nishanth Dikkala, Pritish Kamath, Raghu Meka

    Abstract: A core component present in many successful neural network architectures, is an MLP block of two fully connected layers with a non-linear activation in between. An intriguing phenomenon observed empirically, including in transformer architectures, is that, after training, the activations in the hidden layer of this MLP block tend to be extremely sparse on any given input. Unlike traditional forms… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Proceedings of the 37th Conference on Learning Theory (COLT 2024), 20 pages

  2. arXiv:2403.04978  [pdf, other

    cs.LG stat.ML

    Stacking as Accelerated Gradient Descent

    Authors: Naman Agarwal, Pranjal Awasthi, Satyen Kale, Eric Zhao

    Abstract: Stacking, a heuristic technique for training deep residual networks by progressively increasing the number of layers and initializing new layers by copying parameters from older layers, has proven quite successful in improving the efficiency of training deep neural networks. In this paper, we propose a theoretical explanation for the efficacy of stacking: viz., stacking implements a form of Nester… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  3. arXiv:2310.00726  [pdf, other

    cs.LG cs.AI stat.ML

    Improving Length-Generalization in Transformers via Task Hinting

    Authors: Pranjal Awasthi, Anupam Gupta

    Abstract: It has been observed in recent years that transformers have problems with length generalization for certain types of reasoning and arithmetic tasks. In particular, the performance of a transformer model trained on tasks (say addition) up to a certain length (e.g., 5 digit numbers) drops sharply when applied to longer instances of the same problem. This work proposes an approach based on task hinti… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

  4. arXiv:2307.12135  [pdf, ps, other

    cs.LG stat.ML

    The Sample Complexity of Multi-Distribution Learning for VC Classes

    Authors: Pranjal Awasthi, Nika Haghtalab, Eric Zhao

    Abstract: Multi-distribution learning is a natural generalization of PAC learning to settings with multiple data distributions. There remains a significant gap between the known upper and lower bounds for PAC-learnable classes. In particular, though we understand the sample complexity of learning a VC dimension d class on $k$ distributions to be… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

    Comments: 11 pages. Authors are ordered alphabetically. Open problem presented at the 36th Annual Conference on Learning Theory

  5. arXiv:2305.05816  [pdf, other

    cs.LG stat.ML

    Best-Effort Adaptation

    Authors: Pranjal Awasthi, Corinna Cortes, Mehryar Mohri

    Abstract: We study a problem of best-effort adaptation motivated by several applications and considerations, which consists of determining an accurate predictor for a target domain, for which a moderate amount of labeled samples are available, while leveraging information from another domain for which substantially more labeled samples are at one's disposal. We present a new and general discrepancy-based th… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

  6. arXiv:2301.09251  [pdf, other

    cs.LG stat.ML

    Congested Bandits: Optimal Routing via Short-term Resets

    Authors: Pranjal Awasthi, Kush Bhatia, Sreenivas Gollapudi, Kostas Kollias

    Abstract: For traffic routing platforms, the choice of which route to recommend to a user depends on the congestion on these routes -- indeed, an individual's utility depends on the number of people using the recommended route at that instance. Motivated by this, we introduce the problem of Congested Bandits where each arm's reward is allowed to depend on the number of times it was played in the past $Δ$ ti… ▽ More

    Submitted 22 January, 2023; originally announced January 2023.

    Comments: Published at ICML 2022

  7. arXiv:2208.02711  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Agnostic Learning of General ReLU Activation Using Gradient Descent

    Authors: Pranjal Awasthi, Alex Tang, Aravindan Vijayaraghavan

    Abstract: We provide a convergence analysis of gradient descent for the problem of agnostically learning a single ReLU function under Gaussian distributions. Unlike prior work that studies the setting of zero bias, we consider the more challenging scenario when the bias of the ReLU function is non-zero. Our main result establishes that starting from random initialization, in a polynomial number of iteration… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: 28 oages

  8. arXiv:2206.04777  [pdf, ps, other

    cs.LG stat.ML

    Trimmed Maximum Likelihood Estimation for Robust Learning in Generalized Linear Models

    Authors: Pranjal Awasthi, Abhimanyu Das, Weihao Kong, Rajat Sen

    Abstract: We study the problem of learning generalized linear models under adversarial corruptions. We analyze a classical heuristic called the iterative trimmed maximum likelihood estimator which is known to be effective against label corruptions in practice. Under label corruptions, we prove that this simple estimator achieves minimax near-optimal risk on a wide range of generalized linear models, includi… ▽ More

    Submitted 23 October, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

  9. arXiv:2205.08017  [pdf, other

    cs.LG stat.ML

    $\mathscr{H}$-Consistency Estimation Error of Surrogate Loss Minimizers

    Authors: Pranjal Awasthi, Anqi Mao, Mehryar Mohri, Yutao Zhong

    Abstract: We present a detailed study of estimation errors in terms of surrogate loss estimation errors. We refer to such guarantees as $\mathscr{H}$-consistency estimation error bounds, since they account for the hypothesis set $\mathscr{H}$ adopted. These guarantees are significantly stronger than $\mathscr{H}$-calibration or $\mathscr{H}$-consistency. They are also more informative than similar excess er… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: ICML 2022 (long presentation)

  10. arXiv:2205.01789  [pdf, other

    cs.LG stat.ML

    Do More Negative Samples Necessarily Hurt in Contrastive Learning?

    Authors: Pranjal Awasthi, Nishanth Dikkala, Pritish Kamath

    Abstract: Recent investigations in noise contrastive estimation suggest, both empirically as well as theoretically, that while having more "negative samples" in the contrastive loss improves downstream classification performance initially, beyond a threshold, it hurts downstream performance due to a "collision-coverage" trade-off. But is such a phenomenon inherent in contrastive learning? We show in a simpl… ▽ More

    Submitted 22 June, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

    Comments: 16 pages

  11. arXiv:2201.13419  [pdf, ps, other

    cs.LG math.OC stat.ML

    Agnostic Learnability of Halfspaces via Logistic Loss

    Authors: Ziwei Ji, Kwangjun Ahn, Pranjal Awasthi, Satyen Kale, Stefani Karp

    Abstract: We investigate approximation guarantees provided by logistic regression for the fundamental problem of agnostic learning of homogeneous halfspaces. Previously, for a certain broad class of "well-behaved" distributions on the examples, Diakonikolas et al. (2020) proved an $\tildeΩ(\textrm{OPT})$ lower bound, while Frei et al. (2021) proved an $\tilde{O}(\sqrt{\textrm{OPT}})$ upper bound, where… ▽ More

    Submitted 31 January, 2022; originally announced January 2022.

  12. arXiv:2112.01694  [pdf, other

    cs.LG stat.ML

    On the Existence of the Adversarial Bayes Classifier (Extended Version)

    Authors: Pranjal Awasthi, Natalie S. Frank, Mehryar Mohri

    Abstract: Adversarial robustness is a critical property in a variety of modern machine learning applications. While it has been the subject of several recent theoretical studies, many important questions related to adversarial robustness are still open. In this work, we study a fundamental question regarding Bayes optimality for adversarial robustness. We provide general sufficient conditions under which th… ▽ More

    Submitted 28 August, 2023; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: 27 pages, 3 figures. Version 2: Corrects 2 errors in the paper "On the Existence of the Adversarial Bayes Classifier" published in NeurIPS. Version 3: Update to acknowledgements

  13. arXiv:2107.10209  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Efficient Algorithms for Learning Depth-2 Neural Networks with General ReLU Activations

    Authors: Pranjal Awasthi, Alex Tang, Aravindan Vijayaraghavan

    Abstract: We present polynomial time and sample efficient algorithms for learning an unknown depth-2 feedforward neural network with general ReLU activations, under mild non-degeneracy assumptions. In particular, we consider learning an unknown network of the form $f(x) = {a}^{\mathsf{T}}σ({W}^\mathsf{T}x+b)$, where $x$ is drawn from the Gaussian distribution, and $σ(t) := \max(t,0)$ is the ReLU activation.… ▽ More

    Submitted 1 August, 2021; v1 submitted 21 July, 2021; originally announced July 2021.

    Comments: 45 pages (including appendix). This version fixes an error in the previous version of the paper

  14. arXiv:2106.10370  [pdf, other

    stat.ML cs.AI cs.LG

    On the benefits of maximum likelihood estimation for Regression and Forecasting

    Authors: Pranjal Awasthi, Abhimanyu Das, Rajat Sen, Ananda Theertha Suresh

    Abstract: We advocate for a practical Maximum Likelihood Estimation (MLE) approach towards designing loss functions for regression and forecasting, as an alternative to the typical approach of direct empirical risk minimization on a specific target metric. The MLE approach is better suited to capture inductive biases such as prior domain knowledge in datasets, and can output post-hoc estimators at inference… ▽ More

    Submitted 9 October, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

  15. arXiv:2105.09985  [pdf, other

    cs.LG stat.ML

    Measuring Model Fairness under Noisy Covariates: A Theoretical Perspective

    Authors: Flavien Prost, Pranjal Awasthi, Nick Blumm, Aditee Kumthekar, Trevor Potter, Li Wei, Xuezhi Wang, Ed H. Chi, Jilin Chen, Alex Beutel

    Abstract: In this work we study the problem of measuring the fairness of a machine learning model under noisy information. Focusing on group fairness metrics, we investigate the particular but common situation when the evaluation requires controlling for the confounding effect of covariate variables. In a practical setting, we might not be able to jointly observe the covariate and group information, and a s… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

  16. arXiv:2105.01550  [pdf, ps, other

    cs.LG stat.ML

    A Finer Calibration Analysis for Adversarial Robustness

    Authors: Pranjal Awasthi, Anqi Mao, Mehryar Mohri, Yutao Zhong

    Abstract: We present a more general analysis of $H$-calibration for adversarially robust classification. By adopting a finer definition of calibration, we can cover settings beyond the restricted hypothesis sets studied in previous work. In particular, our results hold for most common hypothesis sets used in machine learning. We both fix some previous calibration results (Bao et al., 2020) and generalize ot… ▽ More

    Submitted 6 May, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: text overlap with arXiv:2104.09658

  17. arXiv:2104.09658  [pdf, other

    cs.LG stat.ML

    Calibration and Consistency of Adversarial Surrogate Losses

    Authors: Pranjal Awasthi, Natalie Frank, Anqi Mao, Mehryar Mohri, Yutao Zhong

    Abstract: Adversarial robustness is an increasingly critical property of classifiers in applications. The design of robust algorithms relies on surrogate losses since the optimization of the adversarial loss with most hypothesis sets is NP-hard. But which surrogate losses should be used and when do they benefit from theoretical guarantees? We present an extensive study of this question, including a detailed… ▽ More

    Submitted 4 May, 2021; v1 submitted 19 April, 2021; originally announced April 2021.

  18. arXiv:2103.01276  [pdf, other

    cs.LG stat.ML

    A Multiclass Boosting Framework for Achieving Fast and Provable Adversarial Robustness

    Authors: Jacob Abernethy, Pranjal Awasthi, Satyen Kale

    Abstract: Alongside the well-publicized accomplishments of deep neural networks there has emerged an apparent bug in their success on tasks such as object recognition: with deep models trained using vanilla methods, input images can be slightly corrupted in order to modify output predictions, even when these corruptions are practically invisible. This apparent lack of robustness has led researchers to propo… ▽ More

    Submitted 3 March, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: Fixed misspelled first author name

  19. arXiv:2102.08410  [pdf, other

    cs.LG stat.ML

    Evaluating Fairness of Machine Learning Models Under Uncertain and Incomplete Information

    Authors: Pranjal Awasthi, Alex Beutel, Matthaeus Kleindessner, Jamie Morgenstern, Xuezhi Wang

    Abstract: Training and evaluation of fair classifiers is a challenging problem. This is partly due to the fact that most fairness metrics of interest depend on both the sensitive attribute information and label information of the data points. In many scenarios it is not possible to collect large datasets with such information. An alternate approach that is commonly used is to separately train an attribute c… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

  20. arXiv:2008.09490  [pdf, other

    cs.LG stat.ML

    Beyond Individual and Group Fairness

    Authors: Pranjal Awasthi, Corinna Cortes, Yishay Mansour, Mehryar Mohri

    Abstract: We present a new data-driven model of fairness that, unlike existing static definitions of individual or group fairness is guided by the unfairness complaints received by the system. Our model supports multiple fairness criteria and takes into account their potential incompatibilities. We consider both a stochastic and an adversarial setting of our model. In the stochastic setting, we show that ou… ▽ More

    Submitted 21 August, 2020; originally announced August 2020.

  21. arXiv:2007.11045  [pdf, other

    cs.LG stat.ML

    On the Rademacher Complexity of Linear Hypothesis Sets

    Authors: Pranjal Awasthi, Natalie Frank, Mehryar Mohri

    Abstract: Linear predictors form a rich class of hypotheses used in a variety of learning algorithms. We present a tight analysis of the empirical Rademacher complexity of the family of linear hypothesis classes with weight vectors bounded in $\ell_p$-norm for any $p \geq 1$. This provides a tight analysis of generalization using these hypothesis sets and helps derive sharp data-dependent learning guarantee… ▽ More

    Submitted 21 July, 2020; originally announced July 2020.

  22. arXiv:2007.06555  [pdf, other

    cs.LG cs.DS stat.ML

    Adversarial robustness via robust low rank representations

    Authors: Pranjal Awasthi, Himanshu Jain, Ankit Singh Rawat, Aravindan Vijayaraghavan

    Abstract: Adversarial robustness measures the susceptibility of a classifier to imperceptible perturbations made to the inputs at test time. In this work we highlight the benefits of natural low rank representations that often exist for real data such as images, for training neural networks with certified robustness guarantees. Our first contribution is for certified robustness to perturbations measured i… ▽ More

    Submitted 1 August, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: fixed a bug in the proof of Proposition B.2

  23. arXiv:2006.06879  [pdf, other

    stat.ML cs.LG

    Active Sampling for Min-Max Fairness

    Authors: Jacob Abernethy, Pranjal Awasthi, Matthäus Kleindessner, Jamie Morgenstern, Chris Russell, Jie Zhang

    Abstract: We propose simple active sampling and reweighting strategies for optimizing min-max fairness that can be applied to any classification or regression model learned via loss minimization. The key intuition behind our approach is to use at each timestep a datapoint from the group that is worst off under the current model for updating the model. The ease of implementation and the generality of our rob… ▽ More

    Submitted 17 June, 2022; v1 submitted 11 June, 2020; originally announced June 2020.

  24. arXiv:2006.04960  [pdf, other

    stat.ML cs.LG

    A Notion of Individual Fairness for Clustering

    Authors: Matthäus Kleindessner, Pranjal Awasthi, Jamie Morgenstern

    Abstract: A common distinction in fair machine learning, in particular in fair classification, is between group fairness and individual fairness. In the context of clustering, group fairness has been studied extensively in recent years; however, individual fairness for clustering has hardly been explored. In this paper, we propose a natural notion of individual fairness for clustering. Our notion asks that… ▽ More

    Submitted 8 June, 2020; originally announced June 2020.

  25. arXiv:2006.00602  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Estimating Principal Components under Adversarial Perturbations

    Authors: Pranjal Awasthi, Xue Chen, Aravindan Vijayaraghavan

    Abstract: Robustness is a key requirement for widespread deployment of machine learning algorithms, and has received much attention in both statistics and computer science. We study a natural model of robustness for high-dimensional statistical estimation problems that we call the adversarial perturbation model. An adversary can perturb every sample arbitrarily up to a specified magnitude $δ$ measured in so… ▽ More

    Submitted 1 June, 2020; v1 submitted 31 May, 2020; originally announced June 2020.

    Comments: It is to appear at COLT 2020

  26. arXiv:2004.13617  [pdf, other

    cs.LG stat.ML

    Adversarial Learning Guarantees for Linear Hypotheses and Neural Networks

    Authors: Pranjal Awasthi, Natalie Frank, Mehryar Mohri

    Abstract: Adversarial or test time robustness measures the susceptibility of a classifier to perturbations to the test input. While there has been a flurry of recent work on designing defenses against such perturbations, the theory of adversarial robustness is not well understood. In order to make progress on this, we focus on the problem of understanding generalization in adversarial settings, via the lens… ▽ More

    Submitted 28 April, 2020; originally announced April 2020.

  27. arXiv:2002.04840  [pdf, other

    cs.LG stat.ML

    Efficient active learning of sparse halfspaces with arbitrary bounded noise

    Authors: Chicheng Zhang, Jie Shen, Pranjal Awasthi

    Abstract: We study active learning of homogeneous $s$-sparse halfspaces in $\mathbb{R}^d$ under the setting where the unlabeled data distribution is isotropic log-concave and each label is flipped with probability at most $η$ for a parameter $η\in \big[0, \frac12\big)$, known as the bounded noise. Even in the presence of mild label noise, i.e. $η$ is a small constant, this is a challenging problem and only… ▽ More

    Submitted 13 August, 2021; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: 33 pages, 2 figures; NeurIPS 2020

  28. arXiv:2002.01523  [pdf, other

    cs.LG stat.ML

    A Deep Conditioning Treatment of Neural Networks

    Authors: Naman Agarwal, Pranjal Awasthi, Satyen Kale

    Abstract: We study the role of depth in training randomly initialized overparameterized neural networks. We give a general result showing that depth improves trainability of neural networks by improving the conditioning of certain kernel matrices of the input data. This result holds for arbitrary non-linear activation functions under a certain normalization. We provide versions of the result that hold for t… ▽ More

    Submitted 17 February, 2021; v1 submitted 4 February, 2020; originally announced February 2020.

    Comments: In proceedings of ALT 2021

  29. arXiv:1911.13268  [pdf, other

    cs.DS cs.LG math.ST stat.ML

    Adversarially Robust Low Dimensional Representations

    Authors: Pranjal Awasthi, Vaggos Chatziafratis, Xue Chen, Aravindan Vijayaraghavan

    Abstract: Many machine learning systems are vulnerable to small perturbations made to inputs either at test time or at training time. This has received much recent interest on the empirical front due to applications where reliability and security are critical. However, theoretical understanding of algorithms that are robust to adversarial perturbations is limited. In this work we focus on Principal Compon… ▽ More

    Submitted 13 August, 2021; v1 submitted 29 November, 2019; originally announced November 2019.

    Comments: 68 pages including references

  30. arXiv:1911.04681  [pdf, other

    cs.LG cs.DS stat.ML

    On Robustness to Adversarial Examples and Polynomial Optimization

    Authors: Pranjal Awasthi, Abhratanu Dutta, Aravindan Vijayaraghavan

    Abstract: We study the design of computationally efficient algorithms with provable guarantees, that are robust to adversarial (test time) perturbations. While there has been an proliferation of recent work on this topic due to its connections to test time robustness of deep networks, there is limited theoretical understanding of several basic questions like (i) when and how can one design provably robust l… ▽ More

    Submitted 12 November, 2019; originally announced November 2019.

    Comments: To appear at NeurIPS2019. 30 pages

  31. arXiv:1906.03284  [pdf, other

    stat.ML cs.LG

    Equalized odds postprocessing under imperfect group information

    Authors: Pranjal Awasthi, Matthäus Kleindessner, Jamie Morgenstern

    Abstract: Most approaches aiming to ensure a model's fairness with respect to a protected attribute (such as gender or race) assume to know the true value of the attribute for every data point. In this paper, we ask to what extent fairness interventions can be effective even when only imperfect information about the protected attribute is available. In particular, we study the prominent equalized odds postp… ▽ More

    Submitted 1 March, 2020; v1 submitted 7 June, 2019; originally announced June 2019.

  32. arXiv:1901.08668  [pdf, other

    stat.ML cs.DS cs.LG

    Guarantees for Spectral Clustering with Fairness Constraints

    Authors: Matthäus Kleindessner, Samira Samadi, Pranjal Awasthi, Jamie Morgenstern

    Abstract: Given the widespread popularity of spectral clustering (SC) for partitioning graph data, we study a version of constrained SC in which we try to incorporate the fairness notion proposed by Chierichetti et al. (2017). According to this notion, a clustering is fair if every demographic group is approximately proportionally represented in each cluster. To this end, we develop variants of both normali… ▽ More

    Submitted 10 May, 2019; v1 submitted 24 January, 2019; originally announced January 2019.

  33. arXiv:1901.08628  [pdf, other

    stat.ML cs.DS cs.LG

    Fair k-Center Clustering for Data Summarization

    Authors: Matthäus Kleindessner, Pranjal Awasthi, Jamie Morgenstern

    Abstract: In data summarization we want to choose $k$ prototypes in order to summarize a data set. We study a setting where the data set comprises several demographic groups and we are restricted to choose $k_i$ prototypes belonging to group $i$. A common approach to the problem without the fairness constraint is to optimize a centroid-based clustering objective such as $k$-center. A natural extension then… ▽ More

    Submitted 10 May, 2019; v1 submitted 24 January, 2019; originally announced January 2019.

  34. arXiv:1804.08603  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Towards Learning Sparsely Used Dictionaries with Arbitrary Supports

    Authors: Pranjal Awasthi, Aravindan Vijayaraghavan

    Abstract: Dictionary learning is a popular approach for inferring a hidden basis or dictionary in which data has a sparse representation. Data generated from the dictionary A (an n by m matrix, with m > n in the over-complete setting) is given by Y = AX where X is a matrix whose columns have supports chosen from a distribution over k-sparse vectors, and the non-zero values chosen from a symmetric distributi… ▽ More

    Submitted 8 May, 2018; v1 submitted 23 April, 2018; originally announced April 2018.

    Comments: 72 pages, fixed minor typos, and added a new reference in related work

  35. arXiv:1503.06567  [pdf, ps, other

    cs.LG cs.DS stat.ML

    On some provably correct cases of variational inference for topic models

    Authors: Pranjal Awasthi, Andrej Risteski

    Abstract: Variational inference is a very efficient and popular heuristic used in various forms in the context of latent variable models. It's closely related to Expectation Maximization (EM), and is applied when exact EM is computationally infeasible. Despite being immensely popular, current theoretical understanding of the effectiveness of variaitonal inference based algorithms is very limited. In this wo… ▽ More

    Submitted 22 August, 2015; v1 submitted 23 March, 2015; originally announced March 2015.

    Comments: 46 pages, Compared to previous version: clarified notation, a number of typos fixed throughout paper

  36. arXiv:1408.4045  [pdf, other

    stat.ML cs.DS cs.LG math.ST

    Relax, no need to round: integrality of clustering formulations

    Authors: Pranjal Awasthi, Afonso S. Bandeira, Moses Charikar, Ravishankar Krishnaswamy, Soledad Villar, Rachel Ward

    Abstract: We study exact recovery conditions for convex relaxations of point cloud clustering problems, focusing on two of the most common optimization problems for unsupervised clustering: $k$-means and $k$-median clustering. Motivations for focusing on convex relaxations are: (a) they come with a certificate of optimality, and (b) they are generic tools which are relatively parameter-free, not tailored to… ▽ More

    Submitted 14 April, 2015; v1 submitted 18 August, 2014; originally announced August 2014.

    Comments: 30 pages, ITCS 2015

  37. arXiv:1307.8371  [pdf, ps, other

    cs.LG cs.CC cs.DS stat.ML

    The Power of Localization for Efficiently Learning Linear Separators with Noise

    Authors: Pranjal Awasthi, Maria Florina Balcan, Philip M. Long

    Abstract: We introduce a new approach for designing computationally efficient learning algorithms that are tolerant to noise, and demonstrate its effectiveness by designing algorithms with improved noise tolerance guarantees for learning linear separators. We consider both the malicious noise model and the adversarial label noise model. For malicious noise, where the adversary can corrupt both the label a… ▽ More

    Submitted 3 June, 2018; v1 submitted 31 July, 2013; originally announced July 2013.

    Comments: Contains improved label complexity analysis communicated to us by Steve Hanneke

    ACM Class: F.2