Showing 1–7 of 7 results for author: Thulasidasan, S

Searching in archive cs.

  1. arXiv:2105.07107

    cs.LG cs.AI

    An Effective Baseline for Robustness to Distributional Shift

    Authors: Sunil Thulasidasan, Sushil Thapa, Sayera Dhaubhadel, Gopinath Chennupati, Tanmoy Bhattacharya, Jeff Bilmes

    Abstract: Refraining from confidently predicting when faced with categories of inputs different from those seen during training is an important requirement for the safe deployment of deep learning systems. While simple to state, this has been a particularly challenging problem in deep learning, where models often end up making overconfident predictions in such situations. In this work we present a simple, b…

    Submitted 14 May, 2021; originally announced May 2021.

  2. arXiv:2010.08141

    cs.AI cs.LG physics.acc-ph

    Autonomous Control of a Particle Accelerator using Deep Reinforcement Learning

    Authors: Xiaoying Pang, Sunil Thulasidasan, Larry Rybarcyk

    Abstract: We describe an approach to learning optimal control policies for a large, linear particle accelerator using deep reinforcement learning coupled with a high-fidelity physics engine. The framework consists of an AI controller that uses deep neural nets for state and action-space representation and learns optimal policies using reward signals that are provided by the physics simulator. For this work,…

    Submitted 19 December, 2020; v1 submitted 16 October, 2020; originally announced October 2020.

    Comments: NeurIPS 2020 Workshop on Machine Learning for Engineering, Modeling, Simulation and Design

  3. arXiv:2009.05094

    cs.LG

    Why I'm not Answering: Understanding Determinants of Classification of an Abstaining Classifier for Cancer Pathology Reports

    Authors: Sayera Dhaubhadel, Jamaludin Mohd-Yusof, Kumkum Ganguly, Gopinath Chennupati, Sunil Thulasidasan, Nicolas W. Hengartner, Brent J. Mumphrey, Eric B. Durbin, Jennifer A. Doherty, Mireille Lemieux, Noah Schaefferkoetter, Georgia Tourassi, Linda Coyle, Lynne Penberthy, Benjamin H. McMahon, Tanmoy Bhattacharya

    Abstract: Safe deployment of deep learning systems in critical real world applications requires models to make very few mistakes, and only under predictable circumstances. In this work, we address this problem using an abstaining classifier that is tuned to have >95% accuracy, and then identify the determinants of abstention using LIME. Essentially, we are training our model to learn the attributes of pat…

    Submitted 21 April, 2022; v1 submitted 10 September, 2020; originally announced September 2020.

  4. arXiv:1905.11001

    stat.ML cs.LG

    On Mixup Training: Improved Calibration and Predictive Uncertainty for Deep Neural Networks

    Authors: Sunil Thulasidasan, Gopinath Chennupati, Jeff Bilmes, Tanmoy Bhattacharya, Sarah Michalak

    Abstract: Mixup (Zhang et al., 2017) is a recently proposed method for training deep neural networks where additional samples are generated during training by convexly combining random pairs of images and their associated labels. While simple to implement, it has been shown to be a surprisingly effective method of data augmentation for image classification: DNNs trained with mixup show noticeable gains in…

    Submitted 6 January, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: NeurIPS 2019

  5. arXiv:1905.10964

    stat.ML cs.LG

    Combating Label Noise in Deep Learning Using Abstention

    Authors: Sunil Thulasidasan, Tanmoy Bhattacharya, Jeff Bilmes, Gopinath Chennupati, Jamal Mohd-Yusof

    Abstract: We introduce a novel method to combat label noise when training deep neural networks for classification. We propose a loss function that permits abstention during training, thereby allowing the DNN to abstain on confusing samples while continuing to learn and improve classification performance on the non-abstained samples. We show how such a deep abstaining classifier (DAC) can be used for robust l…

    Submitted 1 August, 2019; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: ICML 2019. Added source code link

  6. arXiv:1612.04899

    stat.ML cs.LG

    Semi-Supervised Phone Classification using Deep Neural Networks and Stochastic Graph-Based Entropic Regularization

    Authors: Sunil Thulasidasan, Jeffrey Bilmes

    Abstract: We describe a graph-based semi-supervised learning framework in the context of deep neural networks that uses a graph-based entropic regularizer to favor smooth solutions over a graph induced by the data. The main contribution of this work is a computationally efficient, stochastic graph-regularization technique that uses mini-batches that are consistent with the graph structure, but also provides…

    Submitted 30 May, 2018; v1 submitted 14 December, 2016; originally announced December 2016.

    Comments: InterSpeech Workshop on Machine Learning in Speech and Language Processing, 2016. Based on and extends work in arXiv:1612.04898

    Report number: LA-UR-16-24599

  7. arXiv:1612.04898

    stat.ML cs.DC cs.LG

    Efficient Distributed Semi-Supervised Learning using Stochastic Regularization over Affinity Graphs

    Authors: Sunil Thulasidasan, Jeffrey Bilmes, Garrett Kenyon

    Abstract: We describe a computationally efficient, stochastic graph-regularization technique that can be utilized for the semi-supervised training of deep neural networks in a parallel or distributed setting. We utilize a technique, first described in [13], for the construction of mini-batches for stochastic gradient descent (SGD) based on synthesized partitions of an affinity graph that are consistent with…

    Submitted 30 May, 2018; v1 submitted 14 December, 2016; originally announced December 2016.

    Comments: NIPS 2016 Workshop on Machine Learning Systems

    Report number: LA-UR-16-28681