Skip to main content

Showing 1–11 of 11 results for author: Ramakrishnan, G

Searching in archive stat. Search in all archives.
.
  1. arXiv:2305.02997  [pdf, other

    cs.LG cs.AI stat.ML

    When Do Neural Nets Outperform Boosted Trees on Tabular Data?

    Authors: Duncan McElfresh, Sujay Khandagale, Jonathan Valverde, Vishak Prasad C, Benjamin Feuer, Chinmay Hegde, Ganesh Ramakrishnan, Micah Goldblum, Colin White

    Abstract: Tabular data is one of the most commonly used types of data in machine learning. Despite recent advances in neural nets (NNs) for tabular data, there is still an active discussion on whether or not NNs generally outperform gradient-boosted decision trees (GBDTs) on tabular data, with several recent works arguing either that GBDTs consistently outperform NNs on tabular data, or vice versa. In this… ▽ More

    Submitted 30 October, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

    Comments: NeurIPS Datasets and Benchmarks Track 2023

  2. arXiv:2210.03324  [pdf, other

    cs.LG cs.AI stat.ML

    AutoML for Climate Change: A Call to Action

    Authors: Renbo Tu, Nicholas Roberts, Vishak Prasad, Sibasis Nayak, Paarth Jain, Frederic Sala, Ganesh Ramakrishnan, Ameet Talwalkar, Willie Neiswanger, Colin White

    Abstract: The challenge that climate change poses to humanity has spurred a rapidly develo** field of artificial intelligence research focused on climate change applications. The climate change AI (CCAI) community works on a diverse, challenging set of problems which often involve physics-constrained ML or heterogeneous spatiotemporal data. It would be desirable to use automated machine learning (AutoML)… ▽ More

    Submitted 7 October, 2022; originally announced October 2022.

  3. arXiv:2106.12491  [pdf, other

    cs.LG stat.ML

    Training Data Subset Selection for Regression with Controlled Generalization Error

    Authors: Durga Sivasubramanian, Rishabh Iyer, Ganesh Ramakrishnan, Abir De

    Abstract: Data subset selection from a large number of training instances has been a successful approach toward efficient and cost-effective machine learning. However, models trained on a smaller subset may show poor generalization ability. In this paper, our goal is to design an algorithm for selecting a subset of the training data, so that the model can be trained quickly, without significantly sacrificin… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

    Journal ref: ICML 2021

  4. arXiv:2008.09887  [pdf, other

    cs.LG stat.ML

    Semi-Supervised Data Programming with Subset Selection

    Authors: Ayush Maheshwari, Oishik Chatterjee, KrishnaTeja Killamsetty, Ganesh Ramakrishnan, Rishabh Iyer

    Abstract: The paradigm of data programming, which uses weak supervision in the form of rules/labelling functions, and semi-supervised learning, which augments small amounts of labelled data with a large unlabelled dataset, have shown great promise in several text classification scenarios. In this work, we argue that by not using any labelled data, data programming based approaches can yield sub-optimal perf… ▽ More

    Submitted 12 June, 2021; v1 submitted 22 August, 2020; originally announced August 2020.

    Comments: Findings of ACL, 2021

  5. Backdoors in Neural Models of Source Code

    Authors: Goutham Ramakrishnan, Aws Albarghouthi

    Abstract: Deep neural networks are vulnerable to a range of adversaries. A particularly pernicious class of vulnerabilities are backdoors, where model predictions diverge in the presence of subtle triggers in inputs. An attacker can implant a backdoor by poisoning the training data to yield a desired target prediction on triggered inputs. We study backdoors in the context of deep-learning for source code. (… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

  6. arXiv:2003.09374  [pdf, other

    eess.SP cs.LG stat.ML

    A Novel Deep Learning Architecture for Decoding Imagined Speech from EEG

    Authors: Jerrin Thomas Panachakel, A. G. Ramakrishnan, T. V. Ananthapadmanabha

    Abstract: The recent advances in the field of deep learning have not been fully utilised for decoding imagined speech primarily because of the unavailability of sufficient training samples to train a deep network. In this paper, we present a novel architecture that employs deep neural network (DNN) for classifying the words "in" and "cooperate" from the corresponding EEG signals in the ASU imagined speech d… ▽ More

    Submitted 18 March, 2020; originally announced March 2020.

    Comments: Preprint of the paper presented at IEEE AIBEC 2019, Austria

  7. Semantic Robustness of Models of Source Code

    Authors: Goutham Ramakrishnan, Jordan Henkel, Zi Wang, Aws Albarghouthi, Somesh Jha, Thomas Reps

    Abstract: Deep neural networks are vulnerable to adversarial examples - small input perturbations that result in incorrect predictions. We study this problem for models of source code, where we want the network to be robust to source-code modifications that preserve code functionality. (1) We define a powerful adversary that can employ sequences of parametric, semantics-preserving program transformations; (… ▽ More

    Submitted 11 June, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

  8. arXiv:1911.09860  [pdf, other

    cs.LG cs.CL stat.ML

    Data Programming using Continuous and Quality-Guided Labeling Functions

    Authors: Oishik Chatterjee, Ganesh Ramakrishnan, Sunita Sarawagi

    Abstract: Scarcity of labeled data is a bottleneck for supervised learning models. A paradigm that has evolved for dealing with this problem is data programming. An existing data programming paradigm allows human supervision to be provided as a set of discrete labeling functions (LF) that output possibly noisy labels to input instances and a generative modelfor consolidating the weak labels. We enhance and… ▽ More

    Submitted 22 November, 2019; originally announced November 2019.

    Comments: Accepted paper at the 34th AAAI Conference on Artificial Intelligence (AAAI-18), New York, USA

  9. arXiv:1902.05411  [pdf, other

    cs.CV cs.LG stat.ML

    Improving Facial Emotion Recognition Systems Using Gradient and Laplacian Images

    Authors: Ram Krishna Pandey, Souvik Karmakar, A G Ramakrishnan, Nabagata Saha

    Abstract: In this work, we have proposed several enhancements to improve the performance of any facial emotion recognition (FER) system. We believe that the changes in the positions of the fiducial points and the intensities capture the crucial information regarding the emotion of a face image. We propose the use of the gradient and the Laplacian of the input image together with the original input into a co… ▽ More

    Submitted 12 February, 2019; originally announced February 2019.

  10. arXiv:1809.00961  [pdf, other

    cs.CV cs.LG stat.ML

    MSCE: An edge preserving robust loss function for improving super-resolution algorithms

    Authors: Ram Krishna Pandey, Nabagata Saha, Samarjit Karmakar, A G Ramakrishnan

    Abstract: With the recent advancement in the deep learning technologies such as CNNs and GANs, there is significant improvement in the quality of the images reconstructed by deep learning based super-resolution (SR) techniques. In this work, we propose a robust loss function based on the preservation of edges obtained by the Canny operator. This loss function, when combined with the existing loss function s… ▽ More

    Submitted 25 August, 2018; originally announced September 2018.

    Comments: Accepted in ICONIP-2018

  11. arXiv:1805.11191  [pdf, other

    cs.CV cs.LG stat.ML

    Learning From Less Data: Diversified Subset Selection and Active Learning in Image Classification Tasks

    Authors: Vishal Kaushal, Anurag Sahoo, Khoshrav Doctor, Narasimha Raju, Suyash Shetty, Pankaj Singh, Rishabh Iyer, Ganesh Ramakrishnan

    Abstract: Supervised machine learning based state-of-the-art computer vision techniques are in general data hungry and pose the challenges of not having adequate computing resources and of high costs involved in human labeling efforts. Training data subset selection and active learning techniques have been proposed as possible solutions to these challenges respectively. A special class of subset selection f… ▽ More

    Submitted 28 May, 2018; originally announced May 2018.

    Comments: 15 pages, 7 figures