Skip to main content

Showing 1–50 of 61 results for author: Thiagarajan, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.00529  [pdf, other

    cs.LG cs.CV stat.ML

    On the Use of Anchoring for Training Vision Models

    Authors: Vivek Narayanaswamy, Kowshik Thopalli, Rushil Anirudh, Yamen Mubarka, Wesam Sakla, Jayaraman J. Thiagarajan

    Abstract: Anchoring is a recent, architecture-agnostic principle for training deep neural networks that has been shown to significantly improve uncertainty estimation, calibration, and extrapolation capabilities. In this paper, we systematically explore anchoring as a general protocol for training vision models, providing fundamental insights into its training and inference processes and their implications… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  2. arXiv:2401.03350  [pdf, other

    cs.LG stat.ML

    Accurate and Scalable Estimation of Epistemic Uncertainty for Graph Neural Networks

    Authors: Puja Trivedi, Mark Heimann, Rushil Anirudh, Danai Koutra, Jayaraman J. Thiagarajan

    Abstract: While graph neural networks (GNNs) are widely used for node and graph representation learning tasks, the reliability of GNN uncertainty estimates under distribution shifts remains relatively under-explored. Indeed, while post-hoc calibration strategies can be used to improve in-distribution calibration, they need not also improve calibration under distribution shift. However, techniques which prod… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: 33 pages; 10 Figures. arXiv admin note: text overlap with arXiv:2309.10976

  3. arXiv:2309.10977  [pdf, other

    cs.LG stat.ML

    PAGER: A Framework for Failure Analysis of Deep Regression Models

    Authors: Jayaraman J. Thiagarajan, Vivek Narayanaswamy, Puja Trivedi, Rushil Anirudh

    Abstract: Safe deployment of AI models requires proactive detection of failures to prevent costly errors. To this end, we study the important problem of detecting failures in deep regression models. Existing approaches rely on epistemic uncertainty estimates or inconsistency w.r.t the training data to identify failure. Interestingly, we find that while uncertainties are necessary they are insufficient to ac… ▽ More

    Submitted 1 June, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: Published at ICML 2024

  4. arXiv:2303.13589  [pdf, other

    cs.LG stat.ML

    On the Efficacy of Generalization Error Prediction Scoring Functions

    Authors: Puja Trivedi, Danai Koutra, Jayaraman J. Thiagarajan

    Abstract: Generalization error predictors (GEPs) aim to predict model performance on unseen distributions by deriving dataset-level error estimates from sample-level scores. However, GEPs often utilize disparate mechanisms (e.g., regressors, thresholding functions, calibration datasets, etc), to derive such error estimates, which can obfuscate the benefits of a particular scoring function. Therefore, in thi… ▽ More

    Submitted 29 May, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted to ICASSP 2023. (Previous title: A Closer Look at Scoring Functions and Generalization Prediction.)

  5. arXiv:2210.16692  [pdf, other

    cs.CV cs.LG stat.ML

    Single-Shot Domain Adaptation via Target-Aware Generative Augmentation

    Authors: Rakshith Subramanyam, Kowshik Thopalli, Spring Berman, Pavan Turaga, Jayaraman J. Thiagarajan

    Abstract: The problem of adapting models from a source domain using data from any target domain of interest has gained prominence, thanks to the brittle generalization in deep neural networks. While several test-time adaptation techniques have emerged, they typically rely on synthetic data augmentations in cases of limited target data availability. In this paper, we consider the challenging setting of singl… ▽ More

    Submitted 29 October, 2022; originally announced October 2022.

  6. arXiv:2207.07235  [pdf, other

    cs.LG cs.CV stat.ML

    Single Model Uncertainty Estimation via Stochastic Data Centering

    Authors: Jayaraman J. Thiagarajan, Rushil Anirudh, Vivek Narayanaswamy, Peer-Timo Bremer

    Abstract: We are interested in estimating the uncertainties of deep neural networks, which play an important role in many scientific and engineering problems. In this paper, we present a striking new finding that an ensemble of neural networks with the same weight initialization, trained on datasets that are shifted by a constant bias gives rise to slightly inconsistent trained models, where the differences… ▽ More

    Submitted 1 December, 2022; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: Spotlight at NeurIPS 2022

  7. arXiv:2110.02197  [pdf, other

    cs.LG cs.CV stat.ML

    $Δ$-UQ: Accurate Uncertainty Quantification via Anchor Marginalization

    Authors: Rushil Anirudh, Jayaraman J. Thiagarajan

    Abstract: We present $Δ$-UQ -- a novel, general-purpose uncertainty estimator using the concept of anchoring in predictive models. Anchoring works by first transforming the input into a tuple consisting of an anchor point drawn from a prior distribution, and a combination of the input sample with the anchor using a pretext encoding scheme. This encoding is such that the original input can be perfectly recov… ▽ More

    Submitted 5 October, 2021; originally announced October 2021.

  8. arXiv:2103.03788  [pdf, other

    cs.LG stat.ML

    Loss Estimators Improve Model Generalization

    Authors: Vivek Narayanaswamy, Jayaraman J. Thiagarajan, Deepta Rajan, Andreas Spanias

    Abstract: With increased interest in adopting AI methods for clinical diagnosis, a vital step towards safe deployment of such tools is to ensure that the models not only produce accurate predictions but also do not generalize to data regimes where the training data provide no meaningful evidence. Existing approaches for ensuring the distribution of model predictions to be similar to that of the true distrib… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

  9. arXiv:2010.13749  [pdf, other

    stat.ML cs.LG physics.plasm-ph

    Meaningful uncertainties from deep neural network surrogates of large-scale numerical simulations

    Authors: Gemma J. Anderson, Jim A. Gaffney, Brian K. Spears, Peer-Timo Bremer, Rushil Anirudh, Jayaraman J. Thiagarajan

    Abstract: Large-scale numerical simulations are used across many scientific disciplines to facilitate experimental development and provide insights into underlying physical processes, but they come with a significant computational cost. Deep neural networks (DNNs) can serve as highly-accurate surrogate models, with the capacity to handle diverse datatypes, offering tremendous speed-ups for prediction and ma… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

  10. arXiv:2009.14455  [pdf, other

    stat.ML cs.CR cs.LG

    Uncertainty-Matching Graph Neural Networks to Defend Against Poisoning Attacks

    Authors: Uday Shankar Shanthamallu, Jayaraman J. Thiagarajan, Andreas Spanias

    Abstract: Graph Neural Networks (GNNs), a generalization of neural networks to graph-structured data, are often implemented using message passes between entities of a graph. While GNNs are effective for node classification, link prediction and graph classification, they are vulnerable to adversarial attacks, i.e., a small perturbation to the structure can lead to a non-trivial performance degradation. In th… ▽ More

    Submitted 30 September, 2020; originally announced September 2020.

  11. arXiv:2009.14454  [pdf, other

    stat.ML cs.LG

    Accurate and Robust Feature Importance Estimation under Distribution Shifts

    Authors: Jayaraman J. Thiagarajan, Vivek Narayanaswamy, Rushil Anirudh, Peer-Timo Bremer, Andreas Spanias

    Abstract: With increasing reliance on the outcomes of black-box models in critical applications, post-hoc explainability tools that do not require access to the model internals are often used to enable humans understand and trust these models. In particular, we focus on the class of methods that can reveal the influence of input features on the predicted outputs. Despite their wide-spread adoption, existing… ▽ More

    Submitted 30 September, 2020; originally announced September 2020.

  12. arXiv:2009.14448  [pdf, other

    stat.ML cs.CV cs.LG

    Ask-n-Learn: Active Learning via Reliable Gradient Representations for Image Classification

    Authors: Bindya Venkatesh, Jayaraman J. Thiagarajan

    Abstract: Deep predictive models rely on human supervision in the form of labeled training data. Obtaining large amounts of annotated training data can be expensive and time consuming, and this becomes a critical bottleneck while building such models in practice. In such scenarios, active learning (AL) strategies are used to achieve faster convergence in terms of labeling efforts. Existing active learning e… ▽ More

    Submitted 30 September, 2020; originally announced September 2020.

  13. arXiv:2005.13769  [pdf, other

    eess.AS cs.SD stat.ML

    Unsupervised Audio Source Separation using Generative Priors

    Authors: Vivek Narayanaswamy, Jayaraman J. Thiagarajan, Rushil Anirudh, Andreas Spanias

    Abstract: State-of-the-art under-determined audio source separation systems rely on supervised end-end training of carefully tailored neural network architectures operating either in the time or the spectral domain. However, these methods are severely challenged in terms of requiring access to expensive source level labeled data and being specific to a given set of sources and the mixing process, which dema… ▽ More

    Submitted 27 May, 2020; originally announced May 2020.

    Comments: 5 pages, 2 figures

  14. arXiv:2005.02328  [pdf, other

    stat.ML cs.LG physics.data-an

    Designing Accurate Emulators for Scientific Processes using Calibration-Driven Deep Models

    Authors: Jayaraman J. Thiagarajan, Bindya Venkatesh, Rushil Anirudh, Peer-Timo Bremer, Jim Gaffney, Gemma Anderson, Brian Spears

    Abstract: Predictive models that accurately emulate complex scientific processes can achieve exponential speed-ups over numerical simulators or experiments, and at the same time provide surrogates for improving the subsequent analysis. Consequently, there is a recent surge in utilizing modern machine learning (ML) methods, such as deep neural networks, to build data-driven emulators. While the majority of e… ▽ More

    Submitted 5 May, 2020; originally announced May 2020.

  15. arXiv:2004.14480  [pdf, other

    cs.LG stat.ML

    Calibrating Healthcare AI: Towards Reliable and Interpretable Deep Predictive Models

    Authors: Jayaraman J. Thiagarajan, Prasanna Sattigeri, Deepta Rajan, Bindya Venkatesh

    Abstract: The wide-spread adoption of representation learning technologies in clinical decision making strongly emphasizes the need for characterizing model reliability and enabling rigorous introspection of model behavior. While the former need is often addressed by incorporating uncertainty quantification strategies, the latter challenge is addressed using a broad class of interpretability techniques. In… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

  16. arXiv:2002.03875  [pdf, other

    stat.ML cs.LG

    Calibrate and Prune: Improving Reliability of Lottery Tickets Through Prediction Calibration

    Authors: Bindya Venkatesh, Jayaraman J. Thiagarajan, Kowshik Thopalli, Prasanna Sattigeri

    Abstract: The hypothesis that sub-network initializations (lottery) exist within the initializations of over-parameterized networks, which when trained in isolation produce highly generalizable models, has led to crucial insights into network initialization and has enabled efficient inferencing. Supervised models with uncalibrated confidences tend to be overconfident even when making wrong prediction. In th… ▽ More

    Submitted 30 September, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

  17. arXiv:1912.08113  [pdf, other

    cs.LG cs.CV physics.comp-ph stat.ML

    Improved Surrogates in Inertial Confinement Fusion with Manifold and Cycle Consistencies

    Authors: Rushil Anirudh, Jayaraman J. Thiagarajan, Peer-Timo Bremer, Brian K. Spears

    Abstract: Neural networks have become very popular in surrogate modeling because of their ability to characterize arbitrary, high dimensional functions in a data driven fashion. This paper advocates for the training of surrogates that are consistent with the physical manifold -- i.e., predictions are always physically meaningful, and are cyclically consistent -- i.e., when the predictions of the surrogate,… ▽ More

    Submitted 17 December, 2019; originally announced December 2019.

    Comments: 10 pages, 6 figures

  18. arXiv:1912.07748  [pdf, other

    cs.CV cs.LG stat.ML

    MimicGAN: Robust Projection onto Image Manifolds with Corruption Mimicking

    Authors: Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Timo Bremer

    Abstract: In the past few years, Generative Adversarial Networks (GANs) have dramatically advanced our ability to represent and parameterize high-dimensional, non-linear image manifolds. As a result, they have been widely adopted across a variety of applications, ranging from challenging inverse problems like image completion, to problems such as anomaly detection and adversarial defense. A recurring theme… ▽ More

    Submitted 30 April, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: International Journal on Computer Vision's (IJCV) Special Issue on GANs

  19. arXiv:1910.14179  [pdf, other

    stat.ML cs.LG

    Heteroscedastic Calibration of Uncertainty Estimators in Deep Learning

    Authors: Bindya Venkatesh, Jayaraman J. Thiagarajan

    Abstract: The role of uncertainty quantification (UQ) in deep learning has become crucial with growing use of predictive models in high-risk applications. Though a large class of methods exists for measuring deep uncertainties, in practice, the resulting estimates are found to be poorly calibrated, thus making it challenging to translate them into actionable insights. A common workaround is to utilize a sep… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

  20. arXiv:1910.14175  [pdf, other

    stat.ML cs.LG

    Learn-By-Calibrating: Using Calibration as a Training Objective

    Authors: Jayaraman J. Thiagarajan, Bindya Venkatesh, Deepta Rajan

    Abstract: Calibration error is commonly adopted for evaluating the quality of uncertainty estimators in deep neural networks. In this paper, we argue that such a metric is highly beneficial for training predictive models, even when we do not explicitly measure the uncertainties. This is conceptually similar to heteroscedastic neural networks that produce variance estimates for each prediction, with the key… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

  21. arXiv:1910.01666  [pdf, other

    physics.comp-ph cs.CV cs.LG stat.ML

    Exploring Generative Physics Models with Scientific Priors in Inertial Confinement Fusion

    Authors: Rushil Anirudh, Jayaraman J. Thiagarajan, Shusen Liu, Peer-Timo Bremer, Brian K. Spears

    Abstract: There is significant interest in using modern neural networks for scientific applications due to their effectiveness in modeling highly complex, non-linear problems in a data-driven fashion. However, a common challenge is to verify the scientific plausibility or validity of outputs predicted by a neural network. This work advocates the use of known scientific constraints as a lens into evaluating,… ▽ More

    Submitted 3 October, 2019; originally announced October 2019.

    Comments: Machine Learning for Physical Sciences Workshop at NeurIPS 2019

  22. arXiv:1910.01634  [pdf, other

    eess.IV cs.CV cs.LG stat.ML

    Improving Limited Angle CT Reconstruction with a Robust GAN Prior

    Authors: Rushil Anirudh, Hyo** Kim, Jayaraman J. Thiagarajan, K. Aditya Mohan, Kyle M. Champley

    Abstract: Limited angle CT reconstruction is an under-determined linear inverse problem that requires appropriate regularization techniques to be solved. In this work we study how pre-trained generative adversarial networks (GANs) can be used to clean noisy, highly artifact laden reconstructions from conventional techniques, by effectively projecting onto the inferred image manifold. In particular, we use a… ▽ More

    Submitted 29 January, 2020; v1 submitted 3 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019 Workshop on Deep Inverse Problems

  23. arXiv:1909.11804  [pdf, other

    cs.LG cs.NE stat.ML

    Function Preserving Projection for Scalable Exploration of High-Dimensional Data

    Authors: Shusen Liu, Rushil Anirudh, Jayaraman J. Thiagarajan, Peer-Timo Bremer

    Abstract: We present function preserving projections (FPP), a scalable linear projection technique for discovering interpretable relationships in high-dimensional data. Conventional dimension reduction methods aim to maximally preserve the global and/or local geometric structure of a dataset. However, in practice one is often more interested in determining how one or multiple user-selected response function… ▽ More

    Submitted 25 September, 2019; originally announced September 2019.

  24. arXiv:1909.04079  [pdf, other

    stat.ML cs.LG

    Building Calibrated Deep Models via Uncertainty Matching with Auxiliary Interval Predictors

    Authors: Jayaraman J. Thiagarajan, Bindya Venkatesh, Prasanna Sattigeri, Peer-Timo Bremer

    Abstract: With rapid adoption of deep learning in critical applications, the question of when and how much to trust these models often arises, which drives the need to quantify the inherent uncertainties. While identifying all sources that account for the stochasticity of models is challenging, it is common to augment predictions with confidence intervals to convey the expected variations in a model's behav… ▽ More

    Submitted 21 November, 2019; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: AAAI 2020

  25. arXiv:1907.12926  [pdf, other

    cs.CV cs.LG stat.ML

    Distill-to-Label: Weakly Supervised Instance Labeling Using Knowledge Distillation

    Authors: Jayaraman J. Thiagarajan, Satyananda Kashyap, Alexandros Karagyris

    Abstract: Weakly supervised instance labeling using only image-level labels, in lieu of expensive fine-grained pixel annotations, is crucial in several applications including medical image analysis. In contrast to conventional instance segmentation scenarios in computer vision, the problems that we consider are characterized by a small number of training images and non-local patterns that lead to the diagno… ▽ More

    Submitted 26 July, 2019; originally announced July 2019.

  26. arXiv:1907.08325  [pdf, other

    cs.LG cs.HC cs.NE stat.ML

    Scalable Topological Data Analysis and Visualization for Evaluating Data-Driven Models in Scientific Applications

    Authors: Shusen Liu, Di Wang, Dan Maljovec, Rushil Anirudh, Jayaraman J. Thiagarajan, Sam Ade Jacobs, Brian C. Van Essen, David Hysom, Jae-Seung Yeom, Jim Gaffney, Luc Peterson, Peter B. Robinson, Harsh Bhatia, Valerio Pascucci, Brian K. Spears, Peer-Timo Bremer

    Abstract: With the rapid adoption of machine learning techniques for large-scale applications in science and engineering comes the convergence of two grand challenges in visualization. First, the utilization of black box models (e.g., deep neural networks) calls for advanced techniques in exploring and interpreting model behaviors. Second, the rapid growth in computing has produced enormous datasets that re… ▽ More

    Submitted 18 July, 2019; originally announced July 2019.

  27. arXiv:1906.04338  [pdf, other

    stat.ML cs.CV cs.LG

    SALT: Subspace Alignment as an Auxiliary Learning Task for Domain Adaptation

    Authors: Kowshik Thopalli, Jayaraman J. Thiagarajan, Rushil Anirudh, Pavan Turaga

    Abstract: Unsupervised domain adaptation aims to transfer and adapt knowledge learned from a labeled source domain to an unlabeled target domain. Key components of unsupervised domain adaptation include: (a) maximizing performance on the target, and (b) aligning the source and target domains. Traditionally, these tasks have either been considered as separate, or assumed to be implicitly addressed together w… ▽ More

    Submitted 18 December, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

  28. arXiv:1906.02732  [pdf, other

    cs.LG stat.ML

    A Look at the Effect of Sample Design on Generalization through the Lens of Spectral Analysis

    Authors: Bhavya Kailkhura, Jayaraman J. Thiagarajan, Qunwei Li, Peer-Timo Bremer

    Abstract: This paper provides a general framework to study the effect of sampling properties of training data on the generalization error of the learned machine learning (ML) models. Specifically, we propose a new spectral analysis of the generalization error, expressed in terms of the power spectra of the sampling pattern and the function involved. The framework is build in the Euclidean space using Fourie… ▽ More

    Submitted 8 June, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

  29. arXiv:1904.04161  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Audio Source Separation via Multi-Scale Learning with Dilated Dense U-Nets

    Authors: Vivek Sivaraman Narayanaswamy, Sameeksha Katoch, Jayaraman J. Thiagarajan, Huan Song, Andreas Spanias

    Abstract: Modern audio source separation techniques rely on optimizing sequence model architectures such as, 1D-CNNs, on mixture recordings to generalize well to unseen mixtures. Specifically, recent focus is on time-domain based architectures such as Wave-U-Net which exploit temporal context by extracting multi-scale features. However, the optimality of the feature extraction process in these architectures… ▽ More

    Submitted 8 April, 2019; originally announced April 2019.

  30. arXiv:1811.12156  [pdf, other

    cs.SI cs.LG stat.ML

    Improved Deep Embeddings for Inferencing with Multi-Layered Networks

    Authors: Huan Song, Jayaraman J. Thiagarajan

    Abstract: Inferencing with network data necessitates the map** of its nodes into a vector space, where the relationships are preserved. However, with multi-layered networks, where multiple types of relationships exist for the same set of nodes, it is crucial to exploit the information shared between layers, in addition to the distinct aspects of each layer. In this paper, we propose a novel approach that… ▽ More

    Submitted 1 March, 2019; v1 submitted 20 September, 2018; originally announced November 2018.

    Comments: IJCAI 2019

  31. arXiv:1811.08484  [pdf, other

    cs.CV cs.AI stat.ML

    MimicGAN: Corruption-Mimicking for Blind Image Recovery & Adversarial Defense

    Authors: Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Timo Bremer

    Abstract: Solving inverse problems continues to be a central challenge in computer vision. Existing techniques either explicitly construct an inverse map** using prior knowledge about the corruption, or learn the inverse directly using a large collection of examples. However, in practice, the nature of corruption may be unknown, and thus it is challenging to regularize the problem of inferring a plausible… ▽ More

    Submitted 20 November, 2018; originally announced November 2018.

  32. arXiv:1811.00183  [pdf, other

    stat.ML cs.LG cs.SD eess.AS

    Designing an Effective Metric Learning Pipeline for Speaker Diarization

    Authors: Vivek Sivaraman Narayanaswamy, Jayaraman J. Thiagarajan, Huan Song, Andreas Spanias

    Abstract: State-of-the-art speaker diarization systems utilize knowledge from external data, in the form of a pre-trained distance metric, to effectively determine relative speaker identities to unseen data. However, much of recent focus has been on choosing the appropriate feature extractor, ranging from pre-trained $i-$vectors to representations learned via different sequence modeling architectures (e.g.… ▽ More

    Submitted 31 October, 2018; originally announced November 2018.

  33. arXiv:1811.00181  [pdf, other

    cs.LG stat.ML

    A Regularized Attention Mechanism for Graph Attention Networks

    Authors: Uday Shankar Shanthamallu, Jayaraman J. Thiagarajan, Andreas Spanias

    Abstract: Machine learning models that can exploit the inherent structure in data have gained prominence. In particular, there is a surge in deep learning solutions for graph-structured data, due to its wide-spread applicability in several fields. Graph attention networks (GAT), a recent addition to the broad class of feature learning models in graphs, utilizes the attention mechanism to efficiently learn c… ▽ More

    Submitted 10 February, 2020; v1 submitted 31 October, 2018; originally announced November 2018.

  34. arXiv:1810.13427  [pdf, other

    stat.ML cs.LG

    Unsupervised Dimension Selection using a Blue Noise Spectrum

    Authors: Jayaraman J. Thiagarajan, Rushil Anirudh, Rahul Sridhar, Peer-Timo Bremer

    Abstract: Unsupervised dimension selection is an important problem that seeks to reduce dimensionality of data, while preserving the most useful characteristics. While dimensionality reduction is commonly utilized to construct low-dimensional embeddings, they produce feature spaces that are hard to interpret. Further, in applications such as sensor design, one needs to perform reduction directly in the inpu… ▽ More

    Submitted 31 October, 2018; originally announced October 2018.

  35. arXiv:1810.13425  [pdf, other

    stat.ML cs.LG

    Understanding Deep Neural Networks through Input Uncertainties

    Authors: Jayaraman J. Thiagarajan, Irene Kim, Rushil Anirudh, Peer-Timo Bremer

    Abstract: Techniques for understanding the functioning of complex machine learning models are becoming increasingly popular, not only to improve the validation process, but also to extract new insights about the data via exploratory analysis. Though a large class of such tools currently exists, most assume that predictions are point estimates and use a sensitivity analysis of these estimates to interpret th… ▽ More

    Submitted 31 October, 2018; v1 submitted 31 October, 2018; originally announced October 2018.

  36. arXiv:1810.01405  [pdf, other

    stat.ML cs.LG

    GrAMME: Semi-Supervised Learning using Multi-layered Graph Attention Models

    Authors: Uday Shankar Shanthamallu, Jayaraman J. Thiagarajan, Huan Song, Andreas Spanias

    Abstract: Modern data analysis pipelines are becoming increasingly complex due to the presence of multi-view information sources. While graphs are effective in modeling complex relationships, in many scenarios a single graph is rarely sufficient to succinctly represent all interactions, and hence multi-layered graphs have become popular. Though this leads to richer representations, extending solutions from… ▽ More

    Submitted 27 March, 2019; v1 submitted 2 October, 2018; originally announced October 2018.

  37. arXiv:1809.07806  [pdf, other

    stat.ML cs.AI cs.LG

    Understanding Behavior of Clinical Models under Domain Shifts

    Authors: Jayaraman J. Thiagarajan, Deepta Rajan, Prasanna Sattigeri

    Abstract: The hypothesis that computational models can be reliable enough to be adopted in prognosis and patient care is revolutionizing healthcare. Deep learning, in particular, has been a game changer in building predictive models, thus leading to community-wide data curation efforts. However, due to inherent variabilities in population characteristics and biological systems, these models are often biased… ▽ More

    Submitted 13 June, 2019; v1 submitted 20 September, 2018; originally announced September 2018.

  38. arXiv:1809.01712  [pdf, other

    cs.LG stat.ML

    Coverage-Based Designs Improve Sample Mining and Hyper-Parameter Optimization

    Authors: Gowtham Muniraju, Bhavya Kailkhura, Jayaraman J. Thiagarajan, Peer-Timo Bremer, Cihan Tepedelenlioglu, Andreas Spanias

    Abstract: Sampling one or more effective solutions from large search spaces is a recurring idea in machine learning, and sequential optimization has become a popular solution. Typical examples include data summarization, sample mining for predictive modeling and hyper-parameter optimization. Existing solutions attempt to adaptively trade-off between global exploration and local exploitation, wherein the ini… ▽ More

    Submitted 4 April, 2019; v1 submitted 5 September, 2018; originally announced September 2018.

  39. arXiv:1808.01535  [pdf, other

    eess.AS cs.CL cs.LG stat.ML

    Triplet Network with Attention for Speaker Diarization

    Authors: Huan Song, Megan Willi, Jayaraman J. Thiagarajan, Visar Berisha, Andreas Spanias

    Abstract: In automatic speech processing systems, speaker diarization is a crucial front-end component to separate segments from different speakers. Inspired by the recent success of deep neural networks (DNNs) in semantic inferencing, triplet loss-based architectures have been successfully used for this problem. However, existing work utilizes conventional i-vectors as the input representation and builds s… ▽ More

    Submitted 4 August, 2018; originally announced August 2018.

    Comments: Interspeech2018

  40. arXiv:1805.07281  [pdf, other

    cs.CV stat.ML

    An Unsupervised Approach to Solving Inverse Problems using Generative Adversarial Networks

    Authors: Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Timo Bremer

    Abstract: Solving inverse problems continues to be a challenge in a wide array of applications ranging from deblurring, image inpainting, source separation etc. Most existing techniques solve such inverse problems by either explicitly or implicitly finding the inverse of the model. The former class of techniques require explicit knowledge of the measurement process which can be unrealistic, and rely on stro… ▽ More

    Submitted 4 June, 2018; v1 submitted 18 May, 2018; originally announced May 2018.

  41. arXiv:1802.06458  [pdf, other

    stat.ML cs.LG

    A Generative Modeling Approach to Limited Channel ECG Classification

    Authors: Deepta Rajan, Jayaraman J. Thiagarajan

    Abstract: Processing temporal sequences is central to a variety of applications in health care, and in particular multi-channel Electrocardiogram (ECG) is a highly prevalent diagnostic modality that relies on robust sequence modeling. While Recurrent Neural Networks (RNNs) have led to significant advances in automated diagnosis with time-series data, they perform poorly when models are trained using a limit… ▽ More

    Submitted 13 June, 2018; v1 submitted 18 February, 2018; originally announced February 2018.

  42. arXiv:1712.07106  [pdf, other

    stat.ML cs.LG

    Exploring High-Dimensional Structure via Axis-Aligned Decomposition of Linear Projections

    Authors: Jayaraman J. Thiagarajan, Shusen Liu, Karthikeyan Natesan Ramamurthy, Peer-Timo Bremer

    Abstract: Two-dimensional embeddings remain the dominant approach to visualize high dimensional data. The choice of embeddings ranges from highly non-linear ones, which can capture complex relationships but are difficult to interpret quantitatively, to axis-aligned projections, which are easy to interpret but are limited to bivariate relationships. Linear project can be considered as a compromise between co… ▽ More

    Submitted 19 December, 2017; v1 submitted 19 December, 2017; originally announced December 2017.

  43. arXiv:1712.06028  [pdf, other

    stat.ML cs.AI

    A Spectral Approach for the Design of Experiments: Design, Analysis and Algorithms

    Authors: Bhavya Kailkhura, Jayaraman J. Thiagarajan, Charvi Rastogi, Pramod K. Varshney, Peer-Timo Bremer

    Abstract: This paper proposes a new approach to construct high quality space-filling sample designs. First, we propose a novel technique to quantify the space-filling property and optimally trade-off uniformity and randomness in sample designs in arbitrary dimensions. Second, we connect the proposed metric (defined in the spatial domain) to the objective measure of the design performance (defined in the spe… ▽ More

    Submitted 16 December, 2017; originally announced December 2017.

  44. arXiv:1711.10388  [pdf, other

    cs.CV stat.ML

    Lose The Views: Limited Angle CT Reconstruction via Implicit Sinogram Completion

    Authors: Rushil Anirudh, Hyo** Kim, Jayaraman J. Thiagarajan, K. Aditya Mohan, Kyle Champley, Timo Bremer

    Abstract: Computed Tomography (CT) reconstruction is a fundamental component to a wide variety of applications ranging from security, to healthcare. The classical techniques require measuring projections, called sinograms, from a full 180$^\circ$ view of the object. This is impractical in a limited angle scenario, when the viewing angle is less than 180$^\circ$, which can occur due to different factors incl… ▽ More

    Submitted 11 July, 2018; v1 submitted 28 November, 2017; originally announced November 2017.

    Comments: Spotlight presentation at CVPR 2018

  45. arXiv:1711.05407  [pdf, other

    stat.ML cs.CV cs.LG

    MARGIN: Uncovering Deep Neural Networks using Graph Signal Analysis

    Authors: Rushil Anirudh, Jayaraman J. Thiagarajan, Rahul Sridhar, Peer-Timo Bremer

    Abstract: Interpretability has emerged as a crucial aspect of building trust in machine learning systems, aimed at providing insights into the working of complex neural networks that are otherwise opaque to a user. There are a plethora of existing solutions addressing various aspects of interpretability ranging from identifying prototypical samples in a dataset to explaining image predictions or explaining… ▽ More

    Submitted 15 January, 2021; v1 submitted 14 November, 2017; originally announced November 2017.

    Comments: Technical Report

  46. arXiv:1711.05374  [pdf, other

    stat.ML cs.LG

    Optimizing Kernel Machines using Deep Learning

    Authors: Huan Song, Jayaraman J. Thiagarajan, Prasanna Sattigeri, Andreas Spanias

    Abstract: Building highly non-linear and non-parametric models is central to several state-of-the-art machine learning systems. Kernel methods form an important class of techniques that induce a reproducing kernel Hilbert space (RKHS) for inferring non-linear models through the construction of similarity functions from data. These methods are particularly preferred in cases where the training data sizes are… ▽ More

    Submitted 14 November, 2017; originally announced November 2017.

  47. arXiv:1711.03905  [pdf, other

    stat.ML cs.LG

    Attend and Diagnose: Clinical Time Series Analysis using Attention Models

    Authors: Huan Song, Deepta Rajan, Jayaraman J. Thiagarajan, Andreas Spanias

    Abstract: With widespread adoption of electronic health records, there is an increased emphasis for predictive models that can effectively deal with clinical time-series data. Powered by Recurrent Neural Network (RNN) architectures with Long Short-Term Memory (LSTM) units, deep neural networks have achieved state-of-the-art results in several clinical prediction tasks. Despite the success of RNNs, its seque… ▽ More

    Submitted 19 November, 2017; v1 submitted 10 November, 2017; originally announced November 2017.

    Comments: AAAI 2018

  48. arXiv:1710.04329  [pdf, other

    cs.LG stat.ML

    Efficient Data-Driven Geologic Feature Detection from Pre-stack Seismic Measurements using Randomized Machine-Learning Algorithm

    Authors: Youzuo Lin, Shusen Wang, Jayaraman Thiagarajan, George Guthrie, David Coblentz

    Abstract: Conventional seismic techniques for detecting the subsurface geologic features are challenged by limited data coverage, computational inefficiency, and subjective human factors. We developed a novel data-driven geological feature detection approach based on pre-stack seismic measurements. Our detection method employs an efficient and accurate machine-learning detection approach to extract useful s… ▽ More

    Submitted 11 October, 2017; originally announced October 2017.

  49. arXiv:1708.00069  [pdf, other

    stat.ML cs.CV cs.LG

    Learning Robust Representations for Computer Vision

    Authors: Peng Zheng, Aleksandr Y. Aravkin, Karthikeyan Natesan Ramamurthy, Jayaraman Jayaraman Thiagarajan

    Abstract: Unsupervised learning techniques in computer vision often require learning latent representations, such as low-dimensional linear and non-linear subspaces. Noise and outliers in the data can frustrate these approaches by obscuring the latent spaces. Our main goal is deeper understanding and new development of robust approaches for representation learning. We provide a new interpretation for exis… ▽ More

    Submitted 31 July, 2017; originally announced August 2017.

    Comments: 8 pages, 7 pages

  50. arXiv:1705.00634  [pdf, other

    stat.ME

    Counterfactual-based Incrementality Measurement in a Digital Ad-Buying Platform

    Authors: Prasad Chalasani, Ari Buchalter, Jaynth Thiagarajan, Ezra Winston

    Abstract: The problem of measuring the true incremental effectiveness of a digital advertising campaign is of increasing importance to marketers. With a large and increasing percentage of digital advertising delivered via Demand-Side-Platforms (DSPs) executing campaigns via Real-Time-Bidding (RTB) auctions and programmatic approaches, a measurement solution that satisfies both advertiser concerns and the co… ▽ More

    Submitted 3 May, 2017; v1 submitted 1 May, 2017; originally announced May 2017.

    Comments: 44 pages, 6 figures, 1 table