Skip to main content

Showing 1–27 of 27 results for author: Kar, P

Searching in archive stat. Search in all archives.
.
  1. arXiv:2212.05430  [pdf, other

    cs.LG stat.ML

    Corruption-tolerant Algorithms for Generalized Linear Models

    Authors: Bhaskar P Mukhoty, Debojyoti Dey, Purushottam Kar

    Abstract: This paper presents SVAM (Sequential Variance-Altered MLE), a unified framework for learning generalized linear models under adversarial label corruption in training data. SVAM extends to tasks such as least squares regression, logistic regression, and gamma regression, whereas many existing works on learning with label corruptions focus only on least squares regression. SVAM is based on a novel v… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

    Comments: 46 pages, 5 figures, to appear in the 31st AAAI Conference on Artificial Intelligence (AAAI), 2023

  2. arXiv:2111.03932  [pdf, other

    math.OC cs.LG stat.ML

    AGGLIO: Global Optimization for Locally Convex Functions

    Authors: Debojyoti Dey, Bhaskar Mukhoty, Purushottam Kar

    Abstract: This paper presents AGGLIO (Accelerated Graduated Generalized LInear-model Optimization), a stage-wise, graduated optimization technique that offers global convergence guarantees for non-convex optimization problems whose objectives offer only local convexity and may fail to be even quasi-convex at a global scale. In particular, this includes learning problems that utilize popular activation funct… ▽ More

    Submitted 6 November, 2021; originally announced November 2021.

    Comments: 33 pages, 7 figures, to appear at 9th ACM IKDD Conference on Data Science (CODS) 2022. Code for AGGLIO is available at https://github.com/purushottamkar/agglio/

  3. arXiv:2105.10190  [pdf, other

    cs.LG cs.NE stat.ML

    AngularGrad: A New Optimization Technique for Angular Convergence of Convolutional Neural Networks

    Authors: S. K. Roy, M. E. Paoletti, J. M. Haut, S. R. Dubey, P. Kar, A. Plaza, B. B. Chaudhuri

    Abstract: Convolutional neural networks (CNNs) are trained using stochastic gradient descent (SGD)-based optimizers. Recently, the adaptive moment estimation (Adam) optimizer has become very popular due to its adaptive momentum, which tackles the dying gradient problem of SGD. Nevertheless, existing optimizers are still unable to exploit the optimization curvature information efficiently. This paper propose… ▽ More

    Submitted 9 September, 2023; v1 submitted 21 May, 2021; originally announced May 2021.

  4. arXiv:2006.14211  [pdf, other

    cs.LG math.OC stat.ME stat.ML

    Globally-convergent Iteratively Reweighted Least Squares for Robust Regression Problems

    Authors: Bhaskar Mukhoty, Govind Gopakumar, Prateek Jain, Purushottam Kar

    Abstract: We provide the first global model recovery results for the IRLS (iteratively reweighted least squares) heuristic for robust regression problems. IRLS is known to offer excellent performance, despite bad initializations and data corruption, for several parameter estimation problems. Existing analyses of IRLS frequently require careful initialization, thus offering only local convergence guarantees.… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: 30 pages, 5 figures, appeared as a publication in the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS), 2019

    Journal ref: Proceedings of Machine Learning Research (PMLR) 89:313-322, 2019

  5. arXiv:2005.14015  [pdf, other

    cs.SE cs.LG cs.PL stat.ML

    MACER: A Modular Framework for Accelerated Compilation Error Repair

    Authors: Darshak Chhatbar, Umair Z. Ahmed, Purushottam Kar

    Abstract: Automated compilation error repair, the problem of suggesting fixes to buggy programs that fail to compile, has generated significant interest in recent years. Apart from being a tool of general convenience, automated code repair has significant pedagogical applications for novice programmers who find compiler error messages cryptic and unhelpful. Existing approaches largely solve this problem usi… ▽ More

    Submitted 28 May, 2020; originally announced May 2020.

    Comments: 19 pages, 9 figures. A short version of this paper will appear at the 21st International Conference on Artificial Intelligence in Education (AIED). Code for the MACER tool-chain is available at https://github.com/purushottamkar/macer/

  6. arXiv:2005.11257  [pdf, other

    q-bio.PE cs.LG math.OC stat.ML

    Epidemiologically and Socio-economically Optimal Policies via Bayesian Optimization

    Authors: Amit Chandak, Debojyoti Dey, Bhaskar Mukhoty, Purushottam Kar

    Abstract: Mass public quarantining, colloquially known as a lock-down, is a non-pharmaceutical intervention to check spread of disease. This paper presents ESOP (Epidemiologically and Socio-economically Optimal Policies), a novel application of active machine learning techniques using Bayesian optimization, that interacts with an epidemiological model to arrive at lock-down schedules that optimally balance… ▽ More

    Submitted 14 June, 2020; v1 submitted 22 May, 2020; originally announced May 2020.

    Comments: Keywords: COVID-19, Optimal Policy, Lock-down, Epidemiology, Bayesian Optimization Code available at https://github.com/purushottamkar/esop

    MSC Class: 92D30 (Primary) 90C26; 90C56; 60G15 (Secondary)

  7. arXiv:1905.11769  [pdf, other

    cs.LG cs.AI stat.ML

    Accelerating Extreme Classification via Adaptive Feature Agglomeration

    Authors: Ankit Jalan, Purushottam Kar

    Abstract: Extreme classification seeks to assign each data point, the most relevant labels from a universe of a million or more labels. This task is faced with the dual challenge of high precision and scalability, with millisecond level prediction times being a benchmark. We propose DEFRAG, an adaptive feature agglomeration technique to accelerate extreme classification algorithms. Despite past works on fea… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

    Comments: A version of this paper without the appendices will appear at the 28th International Joint Conference on Artificial Intelligence (IJCAI 2019). Code for this paper is available at https://github.com/purushottamkar/defrag/

  8. DANTE: Deep AlterNations for Training nEural networks

    Authors: Vaibhav B Sinha, Sneha Kudugunta, Adepu Ravi Sankar, Surya Teja Chavali, Purushottam Kar, Vineeth N Balasubramanian

    Abstract: We present DANTE, a novel method for training neural networks using the alternating minimization principle. DANTE provides an alternate perspective to traditional gradient-based backpropagation techniques commonly used to train deep networks. It utilizes an adaptation of quasi-convexity to cast training a neural network as a bi-quasi-convex optimization problem. We show that for neural network con… ▽ More

    Submitted 9 August, 2020; v1 submitted 1 February, 2019; originally announced February 2019.

    Comments: 19 pages

    Journal ref: Neural Networks 131 (2020) 127-143

  9. Optimizing Non-decomposable Measures with Deep Networks

    Authors: Amartya Sanyal, Pawan Kumar, Purushottam Kar, Sanjay Chawla, Fabrizio Sebastiani

    Abstract: We present a class of algorithms capable of directly training deep neural networks with respect to large families of task-specific performance measures such as the F-measure and the Kullback-Leibler divergence that are structured and non-decomposable. This presents a departure from standard deep learning techniques that typically use squared or cross-entropy loss functions (that are decomposable)… ▽ More

    Submitted 31 January, 2018; originally announced February 2018.

    Journal ref: Final version published in Machine Learning, 107(8-10):1597-1620, 2018

  10. arXiv:1712.07897  [pdf, other

    stat.ML cs.LG math.OC

    Non-convex Optimization for Machine Learning

    Authors: Prateek Jain, Purushottam Kar

    Abstract: A vast majority of machine learning algorithms train their models and perform inference by solving optimization problems. In order to capture the learning and prediction problems accurately, structural constraints such as sparsity or low rank are frequently imposed or else the objective itself is designed to be a non-convex function. This is especially true of algorithms that operate in high-dimen… ▽ More

    Submitted 21 December, 2017; originally announced December 2017.

    Comments: The official publication is available from now publishers via http://dx.doi.org/10.1561/2200000058

    Journal ref: Foundations and Trends in Machine Learning: Vol. 10: No. 3-4, pp 142-336 (2017)

  11. arXiv:1608.03544  [pdf, other

    cs.LG cs.AI cs.IR stat.ML

    On Context-Dependent Clustering of Bandits

    Authors: Claudio Gentile, Shuai Li, Purushottam Kar, Alexandros Karatzoglou, Evans Etrue, Giovanni Zappella

    Abstract: We investigate a novel cluster-of-bandit algorithm CAB for collaborative recommendation tasks that implements the underlying feedback sharing mechanism by estimating the neighborhood of users in a context-dependent manner. CAB makes sharp departures from the state of the art by incorporating collaborative effects into inference as well as learning processes in a manner that seamlessly interleaving… ▽ More

    Submitted 27 February, 2017; v1 submitted 6 August, 2016; originally announced August 2016.

  12. arXiv:1607.00146  [pdf, ps, other

    cs.LG stat.ML

    Efficient and Consistent Robust Time Series Analysis

    Authors: Kush Bhatia, Prateek Jain, Parameswaran Kamalaruban, Purushottam Kar

    Abstract: We study the problem of robust time series analysis under the standard auto-regressive (AR) time series model in the presence of arbitrary outliers. We devise an efficient hard thresholding based algorithm which can obtain a consistent estimate of the optimal AR model despite a large fraction of the time series points being corrupted. Our algorithm alternately estimates the corrupted set of points… ▽ More

    Submitted 1 July, 2016; originally announced July 2016.

  13. arXiv:1605.04135  [pdf, other

    stat.ML cs.AI cs.IR cs.LG

    Online Optimization Methods for the Quantification Problem

    Authors: Purushottam Kar, Shuai Li, Harikrishna Narasimhan, Sanjay Chawla, Fabrizio Sebastiani

    Abstract: The estimation of class prevalence, i.e., the fraction of a population that belongs to a certain class, is a very useful tool in data analytics and learning, and finds applications in many domains such as sentiment analysis, epidemiology, etc. For example, in sentiment analysis, the objective is often not to estimate whether a specific text conveys a positive or a negative sentiment, but rather es… ▽ More

    Submitted 13 June, 2016; v1 submitted 13 May, 2016; originally announced May 2016.

    Comments: 26 pages, 6 figures. A short version of this manuscript will appear in the proceedings of the 22nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2016

    Journal ref: Final version published in Proceedings of the 22nd ACM Conference on Knowledge Discovery and Data Mining (KDD 2016), San Francisco, US, 2016, pp. 1625-1634

  14. arXiv:1510.03164   

    cs.LG cs.AI stat.ML

    Context-Aware Bandits

    Authors: Shuai Li, Purushottam Kar

    Abstract: We propose an efficient Context-Aware clustering of Bandits (CAB) algorithm, which can capture collaborative effects. CAB can be easily deployed in a real-world recommendation system, where multi-armed bandits have been shown to perform well in particular with respect to the cold-start problem. CAB utilizes a context-aware clustering augmented by exploration-exploitation strategies. CAB dynamicall… ▽ More

    Submitted 26 February, 2017; v1 submitted 12 October, 2015; originally announced October 2015.

    Comments: The paper has been withdrawn as the work has been superseded

  15. arXiv:1507.02743  [pdf, ps, other

    cs.LG cs.IR math.OC stat.ML

    Locally Non-linear Embeddings for Extreme Multi-label Learning

    Authors: Kush Bhatia, Himanshu Jain, Purushottam Kar, Prateek Jain, Manik Varma

    Abstract: The objective in extreme multi-label learning is to train a classifier that can automatically tag a novel data point with the most relevant subset of labels from an extremely large label set. Embedding based approaches make training and prediction tractable by assuming that the training label matrix is low-rank and hence the effective number of labels can be reduced by projecting the high dimensio… ▽ More

    Submitted 9 July, 2015; originally announced July 2015.

  16. arXiv:1506.02428  [pdf, other

    cs.LG stat.ML

    Robust Regression via Hard Thresholding

    Authors: Kush Bhatia, Prateek Jain, Purushottam Kar

    Abstract: We study the problem of Robust Least Squares Regression (RLSR) where several response variables can be adversarially corrupted. More specifically, for a data matrix X \in R^{p x n} and an underlying model w*, the response vector is generated as y = X'w* + b where b \in R^n is the corruption vector supported over at most C.n coordinates. Existing exact recovery results for RLSR focus solely on L1-p… ▽ More

    Submitted 8 June, 2015; originally announced June 2015.

    Comments: 24 pages, 3 figures

  17. arXiv:1505.06813  [pdf, other

    stat.ML cs.LG

    Surrogate Functions for Maximizing Precision at the Top

    Authors: Purushottam Kar, Harikrishna Narasimhan, Prateek Jain

    Abstract: The problem of maximizing precision at the top of a ranked list, often dubbed Precision@k (prec@k), finds relevance in myriad learning applications such as ranking, multi-label classification, and learning with severe label imbalance. However, despite its popularity, there exist significant gaps in our understanding of this problem and its associated performance measure. The most notable of thes… ▽ More

    Submitted 26 May, 2015; originally announced May 2015.

    Comments: To appear in the the proceedings of the 32nd International Conference on Machine Learning (ICML 2015)

    Journal ref: Journal of Machine Learning Research, W&CP 37 (2015)

  18. arXiv:1505.06812  [pdf, other

    stat.ML cs.LG

    Optimizing Non-decomposable Performance Measures: A Tale of Two Classes

    Authors: Harikrishna Narasimhan, Purushottam Kar, Prateek Jain

    Abstract: Modern classification problems frequently present mild to severe label imbalance as well as specific requirements on classification characteristics, and require optimizing performance measures that are non-decomposable over the dataset, such as F-measure. Such measures have spurred much interest and pose specific challenges to learning algorithms since their non-additive nature precludes a direct… ▽ More

    Submitted 26 May, 2015; originally announced May 2015.

    Comments: To appear in proceedings of the 32nd International Conference on Machine Learning (ICML 2015)

    Journal ref: Journal of Machine Learning Research, W&CP 37 (2015)

  19. arXiv:1410.6776  [pdf, other

    cs.LG stat.ML

    Online and Stochastic Gradient Methods for Non-decomposable Loss Functions

    Authors: Purushottam Kar, Harikrishna Narasimhan, Prateek Jain

    Abstract: Modern applications in sensitive domains such as biometrics and medicine frequently require the use of non-decomposable loss functions such as precision@k, F-measure etc. Compared to point loss functions such as hinge-loss, these offer much more fine grained control over prediction, but at the same time present novel challenges in terms of algorithm design and analysis. In this work we initiate a… ▽ More

    Submitted 24 October, 2014; originally announced October 2014.

    Comments: 25 pages, 3 figures, To appear in the proceedings of the 28th Annual Conference on Neural Information Processing Systems, NIPS 2014

  20. arXiv:1410.5137  [pdf, other

    cs.LG stat.ML

    On Iterative Hard Thresholding Methods for High-dimensional M-Estimation

    Authors: Prateek Jain, Ambuj Tewari, Purushottam Kar

    Abstract: The use of M-estimators in generalized linear regression models in high dimensional settings requires risk minimization with hard $L_0$ constraints. Of the known methods, the class of projected gradient descent (also known as iterative hard thresholding (IHT)) methods is known to offer the fastest and most scalable solutions. However, the current state-of-the-art is only able to analyze these meth… ▽ More

    Submitted 21 October, 2014; v1 submitted 19 October, 2014; originally announced October 2014.

    Comments: 20 pages, 3 figures, To appear in the proceedings of the 28th Annual Conference on Neural Information Processing Systems, NIPS 2014

  21. arXiv:1305.2505  [pdf, other

    cs.LG stat.ML

    On the Generalization Ability of Online Learning Algorithms for Pairwise Loss Functions

    Authors: Purushottam Kar, Bharath K Sriperumbudur, Prateek Jain, Harish C Karnick

    Abstract: In this paper, we study the generalization properties of online learning based stochastic methods for supervised learning problems where the loss function is dependent on more than one training sample (e.g., metric learning, ranking). We present a generic decoupling technique that enables us to provide Rademacher complexity-based generalization error bounds. Our bounds are in general tighter than… ▽ More

    Submitted 11 May, 2013; originally announced May 2013.

    Comments: To appear in proceedings of the 30th International Conference on Machine Learning (ICML 2013)

    Journal ref: Journal of Machine Learning Research, W&CP 28(3) (2013)

  22. arXiv:1302.4343  [pdf, ps, other

    math.FA cs.LG stat.ML

    On Translation Invariant Kernels and Screw Functions

    Authors: Purushottam Kar, Harish Karnick

    Abstract: We explore the connection between Hilbertian metrics and positive definite kernels on the real line. In particular, we look at a well-known characterization of translation invariant Hilbertian metrics on the real line by von Neumann and Schoenberg (1941). Using this result we are able to give an alternate proof of Bochner's theorem for translation invariant positive definite kernels on the real li… ▽ More

    Submitted 18 February, 2013; originally announced February 2013.

  23. arXiv:1302.0406  [pdf, ps, other

    cs.LG stat.ML

    Generalization Guarantees for a Binary Classification Framework for Two-Stage Multiple Kernel Learning

    Authors: Purushottam Kar

    Abstract: We present generalization bounds for the TS-MKL framework for two stage multiple kernel learning. We also present bounds for sparse kernel learning formulations within the TS-MKL framework.

    Submitted 2 February, 2013; originally announced February 2013.

  24. arXiv:1210.5840  [pdf, other

    cs.LG stat.ML

    Supervised Learning with Similarity Functions

    Authors: Purushottam Kar, Prateek Jain

    Abstract: We address the problem of general supervised learning when data can only be accessed through an (indefinite) similarity function between data points. Existing work on learning with indefinite kernels has concentrated solely on binary/multi-class classification problems. We propose a model that is generic enough to handle any supervised learning task and also subsumes the model previously proposed… ▽ More

    Submitted 22 October, 2012; originally announced October 2012.

    Comments: To appear in the proceedings of NIPS 2012, 30 pages

  25. arXiv:1201.6530  [pdf, other

    cs.LG cs.CG math.FA stat.ML

    Random Feature Maps for Dot Product Kernels

    Authors: Purushottam Kar, Harish Karnick

    Abstract: Approximating non-linear kernels using feature maps has gained a lot of interest in recent years due to applications in reducing training and testing times of SVM classifiers and other kernel based learning algorithms. We extend this line of work and present low distortion embeddings for dot product kernels into linear Euclidean spaces. We base our results on a classical result in harmonic analysi… ▽ More

    Submitted 26 March, 2012; v1 submitted 31 January, 2012; originally announced January 2012.

    Comments: To appear in the proceedings of the 15th International Conference on Artificial Intelligence and Statistics (AISTATS 2012). This version corrects a minor error with Lemma 10. Acknowledgements : Devanshu Bhimwal

    Journal ref: Journal of Machine Learning Research, W&CP 22 (2012) 583-591

  26. arXiv:1112.5404  [pdf, ps, other

    cs.LG stat.ML

    Similarity-based Learning via Data Driven Embeddings

    Authors: Purushottam Kar, Prateek Jain

    Abstract: We consider the problem of classification using similarity/distance functions over data. Specifically, we propose a framework for defining the goodness of a (dis)similarity function with respect to a given learning task and propose algorithms that have guaranteed generalization properties when working with such good functions. Our framework unifies and generalizes the frameworks proposed by [Balca… ▽ More

    Submitted 22 December, 2011; originally announced December 2011.

    Comments: To appear in the proceedings of NIPS 2011, 14 pages

  27. arXiv:1010.3812  [pdf, ps, other

    cs.DS cs.CG math.DG stat.ML

    Random Projection Trees Revisited

    Authors: Aman Dhesi, Purushottam Kar

    Abstract: The Random Projection Tree structures proposed in [Freund-Dasgupta STOC08] are space partitioning data structures that automatically adapt to various notions of intrinsic dimensionality of data. We prove new results for both the RPTreeMax and the RPTreeMean data structures. Our result for RPTreeMax gives a near-optimal bound on the number of levels required by this data structure to reduce the siz… ▽ More

    Submitted 20 October, 2010; v1 submitted 19 October, 2010; originally announced October 2010.

    Comments: Accepted for publication at NIPS 2010. This version corrects an incorrect usage of the term Assouad dimension - acknowledgments : James Lee