Showing 1–50 of 58 results for author: Anirudh, R

  1. arXiv:2406.00529  [pdf, other]

    cs.LG cs.CV stat.ML

    On the Use of Anchoring for Training Vision Models

    Authors: Vivek Narayanaswamy, Kowshik Thopalli, Rushil Anirudh, Yamen Mubarka, Wesam Sakla, Jayaraman J. Thiagarajan

    Abstract: Anchoring is a recent, architecture-agnostic principle for training deep neural networks that has been shown to significantly improve uncertainty estimation, calibration, and extrapolation capabilities. In this paper, we systematically explore anchoring as a general protocol for training vision models, providing fundamental insights into its training and inference processes and their implications…

    Submitted 1 June, 2024; originally announced June 2024.

  2. arXiv:2404.08761  [pdf, ps, other]

    cs.CV cs.LG

    `Eyes of a Hawk and Ears of a Fox': Part Prototype Network for Generalized Zero-Shot Learning

    Authors: Joshua Feinglass, Jayaraman J. Thiagarajan, Rushil Anirudh, T. S. Jayram, Yezhou Yang

    Abstract: Current approaches in Generalized Zero-Shot Learning (GZSL) are built upon base models which consider only a single class attribute vector representation over the entire image. This is an oversimplification of the process of novel category recognition, where different regions of the image may have properties from different seen classes and thus have different predominant attributes. With this in m…

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted to the CVPR 2024 LIMIT Workshop

  3. arXiv:2401.03350  [pdf, other]

    cs.LG stat.ML

    Accurate and Scalable Estimation of Epistemic Uncertainty for Graph Neural Networks

    Authors: Puja Trivedi, Mark Heimann, Rushil Anirudh, Danai Koutra, Jayaraman J. Thiagarajan

    Abstract: While graph neural networks (GNNs) are widely used for node and graph representation learning tasks, the reliability of GNN uncertainty estimates under distribution shifts remains relatively under-explored. Indeed, while post-hoc calibration strategies can be used to improve in-distribution calibration, they need not also improve calibration under distribution shift. However, techniques which prod…

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: 33 pages; 10 Figures. arXiv admin note: text overlap with arXiv:2309.10976

  4. arXiv:2312.03642  [pdf, other]

    cs.LG

    Transformer-Powered Surrogates Close the ICF Simulation-Experiment Gap with Extremely Limited Data

    Authors: Matthew L. Olson, Shusen Liu, Jayaraman J. Thiagarajan, Bogdan Kustowski, Weng-Keen Wong, Rushil Anirudh

    Abstract: Recent advances in machine learning, specifically transformer architecture, have led to significant advancements in commercial domains. These powerful models have demonstrated superior capability to learn complex relationships and often generalize better to new data and problems. This paper presents a novel transformer-powered approach for enhancing prediction accuracy in multi-modal output scenar…

    Submitted 28 May, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: MLST

  5. arXiv:2311.02087  [pdf]

    cs.SD cs.AI cs.LG eess.AS eess.SP

    Design Of Rubble Analyzer Probe Using ML For Earthquake

    Authors: Abhishek Sebastian, R Pragna, K Vishal Vythianathan, Dasaraju Sohan Sai, U Shiva Sri Hari Al, R Anirudh, Apurv Choudhary

    Abstract: The earthquake rubble analyzer uses machine learning to detect human presence via ambient sounds, achieving 97.45% accuracy. It also provides real-time environmental data, aiding in assessing survival prospects for trapped individuals, crucial for post-earthquake rescue efforts.

    Submitted 24 October, 2023; originally announced November 2023.

  6. arXiv:2311.01508  [pdf, other]

    astro-ph.GA astro-ph.CO astro-ph.IM astro-ph.SR

    slick: Modeling a Universe of Molecular Line Luminosities in Hydrodynamical Simulations

    Authors: Karolina Garcia, Desika Narayanan, Gergö Popping, R. Anirudh, Sagan Sutherland, Melanie Kaasinen

    Abstract: We present slick (the Scalable Line Intensity Computation Kit), a software package that calculates realistic CO, [C I], and [C II] luminosities for clouds and galaxies formed in hydrodynamic simulations. Built on the radiative transfer code despotic, slick computes the thermal, radiative, and statistical equilibrium in concentric zones of model clouds, based on their…

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: 17 pages, 11 figures, comments are welcome

  7. arXiv:2309.10977  [pdf, other]

    cs.LG stat.ML

    PAGER: A Framework for Failure Analysis of Deep Regression Models

    Authors: Jayaraman J. Thiagarajan, Vivek Narayanaswamy, Puja Trivedi, Rushil Anirudh

    Abstract: Safe deployment of AI models requires proactive detection of failures to prevent costly errors. To this end, we study the important problem of detecting failures in deep regression models. Existing approaches rely on epistemic uncertainty estimates or inconsistency w.r.t the training data to identify failure. Interestingly, we find that while uncertainties are necessary they are insufficient to ac…

    Submitted 1 June, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: Published at ICML 2024

  8. arXiv:2309.10976  [pdf, other]

    cs.LG

    Accurate and Scalable Estimation of Epistemic Uncertainty for Graph Neural Networks

    Authors: Puja Trivedi, Mark Heimann, Rushil Anirudh, Danai Koutra, Jayaraman J. Thiagarajan

    Abstract: Safe deployment of graph neural networks (GNNs) under distribution shift requires models to provide accurate confidence indicators (CI). However, while it is well-known in computer vision that CI quality diminishes under distribution shift, this behavior remains understudied for GNNs. Hence, we begin with a case study on CI calibration under controlled structural and feature distribution shifts an…

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: 22 pages, 11 figures

  9. arXiv:2307.04838  [pdf, other]

    cs.CV cs.LG

    CREPE: Learnable Prompting With CLIP Improves Visual Relationship Prediction

    Authors: Rakshith Subramanyam, T. S. Jayram, Rushil Anirudh, Jayaraman J. Thiagarajan

    Abstract: In this paper, we explore the potential of Vision-Language Models (VLMs), specifically CLIP, in predicting visual object relationships, which involves interpreting visual features from images into language-based relations. Current state-of-the-art methods use complex graphical models that utilize language cues and visual features to address this challenge. We hypothesize that the strong language p…

    Submitted 19 July, 2023; v1 submitted 10 July, 2023; originally announced July 2023.

  10. The role of magnetic fields in the fragmentation of the Taurus B213 filament into Sun-type star-forming cores

    Authors: Anirudh R., Chakali Eswaraiah, Sihan Jiao, Jessy Jose

    Abstract: Fragmentation is a key step in the process of transforming clouds (and their substructures such as filaments, clumps, and cores) into protostars. The thermal gas pressure and gravitational collapse are believed to be the primary agents governing this process, referred to as the thermal Jeans fragmentation. However, the contributions of other factors (such as magnetic fields and turbulence) to the…

    Submitted 8 April, 2023; originally announced April 2023.

    Comments: 10 pages, 3 figures, and 5 tables ; Accepted for publication in JOAA

    Journal ref: J. Astrophys. Astr. (2023) 44:59

  11. arXiv:2303.10774  [pdf, other]

    cs.LG cs.CV

    Cross-GAN Auditing: Unsupervised Identification of Attribute Level Similarities and Differences between Pretrained Generative Models

    Authors: Matthew L. Olson, Shusen Liu, Rushil Anirudh, Jayaraman J. Thiagarajan, Peer-Timo Bremer, Weng-Keen Wong

    Abstract: Generative Adversarial Networks (GANs) are notoriously difficult to train especially for complex distributions and with limited data. This has driven the need for tools to audit trained networks in human intelligible format, for example, to identify biases or ensure fairness. Existing GAN audit tools are restricted to coarse-grained, model-data comparisons based on summary statistics such as FID o…

    Submitted 2 May, 2023; v1 submitted 19 March, 2023; originally announced March 2023.

    Comments: CVPR 2023. Source code is available at https://github.com/mattolson93/cross_gan_auditing

  12. arXiv:2211.12340  [pdf, other]

    eess.IV cs.CV

    DOLCE: A Model-Based Probabilistic Diffusion Framework for Limited-Angle CT Reconstruction

    Authors: Jiaming Liu, Rushil Anirudh, Jayaraman J. Thiagarajan, Stewart He, K. Aditya Mohan, Ulugbek S. Kamilov, Hyojin Kim

    Abstract: Limited-Angle Computed Tomography (LACT) is a non-destructive evaluation technique used in a variety of applications ranging from security to medicine. The limited angle coverage in LACT is often a dominant source of severe artifacts in the reconstructed images, making it a challenging inverse problem. We present DOLCE, a new deep model-based framework for LACT that uses a conditional diffusion mo…

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: 29 pages, 21 figures

  13. arXiv:2210.16742  [pdf, other]

    cs.CV cs.AI cs.LG

    On-the-fly Object Detection using StyleGAN with CLIP Guidance

    Authors: Yuzhe Lu, Shusen Liu, Jayaraman J. Thiagarajan, Wesam Sakla, Rushil Anirudh

    Abstract: We present a fully automated framework for building object detectors on satellite imagery without requiring any human annotation or intervention. We achieve this by leveraging the combined power of modern generative models (e.g., StyleGAN) and recent advances in multi-modal learning (e.g., CLIP). While deep generative models effectively encode the key semantics pertinent to a data distribution, th…

    Submitted 30 October, 2022; originally announced October 2022.

  14. arXiv:2207.12346  [pdf, other]

    cs.LG

    Contrastive Knowledge-Augmented Meta-Learning for Few-Shot Classification

    Authors: Rakshith Subramanyam, Mark Heimann, Jayram Thathachar, Rushil Anirudh, Jayaraman J. Thiagarajan

    Abstract: Model agnostic meta-learning algorithms aim to infer priors from several observed tasks that can then be used to adapt to a new task with few examples. Given the inherent diversity of tasks arising in existing benchmarks, recent methods use separate, learnable structure, such as hierarchies or graphs, for enabling task-specific adaptation of the prior. While these approaches have produced signific…

    Submitted 25 July, 2022; originally announced July 2022.

  15. arXiv:2207.07235  [pdf, other]

    cs.LG cs.CV stat.ML

    Single Model Uncertainty Estimation via Stochastic Data Centering

    Authors: Jayaraman J. Thiagarajan, Rushil Anirudh, Vivek Narayanaswamy, Peer-Timo Bremer

    Abstract: We are interested in estimating the uncertainties of deep neural networks, which play an important role in many scientific and engineering problems. In this paper, we present a striking new finding that an ensemble of neural networks with the same weight initialization, trained on datasets that are shifted by a constant bias gives rise to slightly inconsistent trained models, where the differences…

    Submitted 1 December, 2022; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: Spotlight at NeurIPS 2022

  16. arXiv:2207.05286  [pdf, other]

    cs.CV cs.LG

    Know Your Space: Inlier and Outlier Construction for Calibrating Medical OOD Detectors

    Authors: Vivek Narayanaswamy, Yamen Mubarka, Rushil Anirudh, Deepta Rajan, Andreas Spanias, Jayaraman J. Thiagarajan

    Abstract: We focus on the problem of producing well-calibrated out-of-distribution (OOD) detectors, in order to enable safe deployment of medical image classifiers. Motivated by the difficulty of curating suitable calibration datasets, synthetic augmentations have become highly prevalent for inlier/outlier specification. While there have been rapid advances in data augmentation techniques, this paper makes…

    Submitted 22 April, 2023; v1 submitted 11 July, 2022; originally announced July 2022.

  17. arXiv:2207.04125  [pdf, other]

    cs.LG cs.AI cs.CV

    Out of Distribution Detection via Neural Network Anchoring

    Authors: Rushil Anirudh, Jayaraman J. Thiagarajan

    Abstract: Our goal in this paper is to exploit heteroscedastic temperature scaling as a calibration strategy for out of distribution (OOD) detection. Heteroscedasticity here refers to the fact that the optimal temperature parameter for each sample can be different, as opposed to conventional approaches that use the same value for the entire distribution. To enable this, we propose a new training strategy ca…

    Submitted 1 December, 2022; v1 submitted 8 July, 2022; originally announced July 2022.

    Comments: ACML 2022

  18. arXiv:2206.07736  [pdf, other]

    cs.LG cs.CV

    Improving Diversity with Adversarially Learned Transformations for Domain Generalization

    Authors: Tejas Gokhale, Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Chitta Baral, Yezhou Yang

    Abstract: To be successful in single source domain generalization, maximizing diversity of synthesized domains has emerged as one of the most effective strategies. Many of the recent successes have come from methods that pre-specify the types of diversity that a model is exposed to during training, so that it can ultimately generalize well to new domains. However, naïve diversity based augmentations do not…

    Submitted 12 December, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: WACV 2023. Code: https://github.com/tejas-gokhale/ALT

  19. 2022 Review of Data-Driven Plasma Science

    Authors: Rushil Anirudh, Rick Archibald, M. Salman Asif, Markus M. Becker, Sadruddin Benkadda, Peer-Timo Bremer, Rick H. S. Budé, C. S. Chang, Lei Chen, R. M. Churchill, Jonathan Citrin, Jim A Gaffney, Ana Gainaru, Walter Gekelman, Tom Gibbs, Satoshi Hamaguchi, Christian Hill, Kelli Humbird, Sören Jalas, Satoru Kawaguchi, Gon-Ho Kim, Manuel Kirchen, Scott Klasky, John L. Kline, Karl Krushelnick, et al. (38 additional authors not shown)

    Abstract: Data science and technology offer transformative tools and methods to science. This review article highlights latest development and progress in the interdisciplinary field of data-driven plasma science (DDPS). A large amount of data and machine learning algorithms go hand in hand. Most plasma data, whether experimental, observational or computational, are generated or collected by machines today.…

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: 112 pages (including 700+ references), 44 figures, submitted to IEEE Transactions on Plasma Science as a part of the IEEE Golden Anniversary Special Issue

    Report number: Los Alamos Report number LA-UR-22-24834

    Journal ref: IEEE Transactions on Plasma Science 51, 1750 - 1838 (2023)

  20. arXiv:2201.01806  [pdf, other]

    cs.LG cs.CV

    Revisiting Deep Subspace Alignment for Unsupervised Domain Adaptation

    Authors: Kowshik Thopalli, Jayaraman J Thiagarajan, Rushil Anirudh, Pavan K Turaga

    Abstract: Unsupervised domain adaptation (UDA) aims to transfer and adapt knowledge from a labeled source domain to an unlabeled target domain. Traditionally, subspace-based methods form an important class of solutions to this problem. Despite their mathematical elegance and tractability, these methods are often found to be ineffective at producing domain-invariant features with complex, real-world datasets…

    Submitted 5 January, 2022; originally announced January 2022.

    Comments: arXiv admin note: text overlap with arXiv:1906.04338

  21. arXiv:2111.12798  [pdf, other]

    cs.LG cs.CV

    Geometric Priors for Scientific Generative Models in Inertial Confinement Fusion

    Authors: Ankita Shukla, Rushil Anirudh, Eugene Kur, Jayaraman J. Thiagarajan, Peer-Timo Bremer, Brian K. Spears, Tammy Ma, Pavan Turaga

    Abstract: In this paper, we develop a Wasserstein autoencoder (WAE) with a hyperspherical prior for multimodal data in the application of inertial confinement fusion. Unlike a typical hyperspherical generative model that requires computationally inefficient sampling from distributions like the von Mises-Fisher, we sample from a normal distribution followed by a projection layer before the generator. Finally,…

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: 5 pages, 4 figures, Fourth Workshop on Machine Learning and the Physical Sciences, NeurIPS 2021

  22. arXiv:2110.02197  [pdf, other]

    cs.LG cs.CV stat.ML

    Δ-UQ: Accurate Uncertainty Quantification via Anchor Marginalization

    Authors: Rushil Anirudh, Jayaraman J. Thiagarajan

    Abstract: We present Δ-UQ -- a novel, general-purpose uncertainty estimator using the concept of anchoring in predictive models. Anchoring works by first transforming the input into a tuple consisting of an anchor point drawn from a prior distribution, and a combination of the input sample with the anchor using a pretext encoding scheme. This encoding is such that the original input can be perfectly recov…

    Submitted 5 October, 2021; originally announced October 2021.

  23. arXiv:2104.11745  [pdf, other]

    eess.IV

    Dynamic CT Reconstruction from Limited Views with Implicit Neural Representations and Parametric Motion Fields

    Authors: Albert W. Reed, Hyojin Kim, Rushil Anirudh, K. Aditya Mohan, Kyle Champley, Jingu Kang, Suren Jayasuriya

    Abstract: Reconstructing dynamic, time-varying scenes with computed tomography (4D-CT) is a challenging and ill-posed problem common to industrial and medical settings. Existing 4D-CT reconstructions are designed for sparse sampling schemes that require fast CT scanners to capture multiple, rapid revolutions around the scene in order to generate high quality results. However, if the scene is moving too fast…

    Submitted 23 April, 2021; originally announced April 2021.

  24. arXiv:2104.09684  [pdf, other]

    cs.LG

    Suppressing simulation bias using multi-modal data

    Authors: Bogdan Kustowski, Jim A. Gaffney, Brian K. Spears, Gemma J. Anderson, Rushil Anirudh, Peer-Timo Bremer, Jayaraman J. Thiagarajan, Michael K. G. Kruse, Ryan C. Nora

    Abstract: Many problems in science and engineering require making predictions based on few observations. To build a robust predictive model, these sparse data may need to be augmented with simulated data, especially when the design space is multi-dimensional. Simulations, however, often suffer from an inherent bias. Estimation of this bias may be poorly constrained not only because of data sparsity, but als…

    Submitted 15 March, 2022; v1 submitted 19 April, 2021; originally announced April 2021.

    Report number: LLNL-JRNL-829622

  25. arXiv:2012.02043  [pdf, other]

    cs.CV cs.LG

    Recovering Trajectories of Unmarked Joints in 3D Human Actions Using Latent Space Optimization

    Authors: Suhas Lohit, Rushil Anirudh, Pavan Turaga

    Abstract: Motion capture (mocap) and time-of-flight based sensing of human actions are becoming increasingly popular modalities to perform robust activity analysis. Applications range from action recognition to quantifying movement quality for health applications. While marker-less motion capture has made great progress, in critical applications such as healthcare, marker-based systems, especially active ma…

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: Accepted at WACV 2021

  26. arXiv:2012.01806  [pdf, other]

    cs.CV cs.LG

    Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

    Authors: Tejas Gokhale, Rushil Anirudh, Bhavya Kailkhura, Jayaraman J. Thiagarajan, Chitta Baral, Yezhou Yang

    Abstract: While existing work in robust deep learning has focused on small pixel-level norm-based perturbations, this may not account for perturbations encountered in several real-world settings. In many such cases although test data might not be available, broad specifications about the types of perturbations (such as an unknown degree of rotation) may be known. We consider a setup where robustness is expe…

    Submitted 7 April, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: AAAI 2021. Camera Ready version + Appendix

  27. arXiv:2010.13749  [pdf, other]

    stat.ML cs.LG physics.plasm-ph

    Meaningful uncertainties from deep neural network surrogates of large-scale numerical simulations

    Authors: Gemma J. Anderson, Jim A. Gaffney, Brian K. Spears, Peer-Timo Bremer, Rushil Anirudh, Jayaraman J. Thiagarajan

    Abstract: Large-scale numerical simulations are used across many scientific disciplines to facilitate experimental development and provide insights into underlying physical processes, but they come with a significant computational cost. Deep neural networks (DNNs) can serve as highly-accurate surrogate models, with the capacity to handle diverse datatypes, offering tremendous speed-ups for prediction and ma…

    Submitted 26 October, 2020; originally announced October 2020.

  28. arXiv:2010.08478  [pdf, other]

    cs.LG cs.CY

    Machine Learning-Powered Mitigation Policy Optimization in Epidemiological Models

    Authors: Jayaraman J. Thiagarajan, Peer-Timo Bremer, Rushil Anirudh, Timothy C. Germann, Sara Y. Del Valle, Frederick H. Streitz

    Abstract: A crucial aspect of managing a public health crisis is to effectively balance prevention and mitigation strategies, while taking their socio-economic impact into account. In particular, determining the influence of different non-pharmaceutical interventions (NPIs) on the effective use of public resources is an important problem, given the uncertainties on when a vaccine will be made available. In…

    Submitted 16 October, 2020; originally announced October 2020.

  29. arXiv:2010.06558  [pdf, other]

    cs.LG

    Accurate Calibration of Agent-based Epidemiological Models with Neural Network Surrogates

    Authors: Rushil Anirudh, Jayaraman J. Thiagarajan, Peer-Timo Bremer, Timothy C. Germann, Sara Y. Del Valle, Frederick H. Streitz

    Abstract: Calibrating complex epidemiological models to observed data is a crucial step to provide both insights into the current disease dynamics, i.e., by estimating a reproductive number, as well as to provide reliable forecasts and scenario explorations. Here we present a new approach to calibrate an agent-based model -- EpiCast -- using a large set of simulation ensembles for different major metropolit…

    Submitted 13 October, 2020; originally announced October 2020.

  30. arXiv:2009.14454  [pdf, other]

    stat.ML cs.LG

    Accurate and Robust Feature Importance Estimation under Distribution Shifts

    Authors: Jayaraman J. Thiagarajan, Vivek Narayanaswamy, Rushil Anirudh, Peer-Timo Bremer, Andreas Spanias

    Abstract: With increasing reliance on the outcomes of black-box models in critical applications, post-hoc explainability tools that do not require access to the model internals are often used to enable humans to understand and trust these models. In particular, we focus on the class of methods that can reveal the influence of input features on the predicted outputs. Despite their wide-spread adoption, existing…

    Submitted 30 September, 2020; originally announced September 2020.

  31. arXiv:2006.10873  [pdf, other]

    cs.CV cs.LG

    Generative Patch Priors for Practical Compressive Image Recovery

    Authors: Rushil Anirudh, Suhas Lohit, Pavan Turaga

    Abstract: In this paper, we propose the generative patch prior (GPP) that defines a generative prior for compressive image recovery, based on patch-manifold models. Unlike learned, image-level priors that are restricted to the range space of a pre-trained generator, GPP can recover a wide variety of natural images using a pre-trained patch generator. Additionally, GPP retains the benefits of generative prio…

    Submitted 5 October, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

  32. arXiv:2005.13769  [pdf, other]

    eess.AS cs.SD stat.ML

    Unsupervised Audio Source Separation using Generative Priors

    Authors: Vivek Narayanaswamy, Jayaraman J. Thiagarajan, Rushil Anirudh, Andreas Spanias

    Abstract: State-of-the-art under-determined audio source separation systems rely on supervised end-end training of carefully tailored neural network architectures operating either in the time or the spectral domain. However, these methods are severely challenged in terms of requiring access to expensive source level labeled data and being specific to a given set of sources and the mixing process, which dema…

    Submitted 27 May, 2020; originally announced May 2020.

    Comments: 5 pages, 2 figures

  33. arXiv:2005.02328  [pdf, other]

    stat.ML cs.LG physics.data-an

    Designing Accurate Emulators for Scientific Processes using Calibration-Driven Deep Models

    Authors: Jayaraman J. Thiagarajan, Bindya Venkatesh, Rushil Anirudh, Peer-Timo Bremer, Jim Gaffney, Gemma Anderson, Brian Spears

    Abstract: Predictive models that accurately emulate complex scientific processes can achieve exponential speed-ups over numerical simulators or experiments, and at the same time provide surrogates for improving the subsequent analysis. Consequently, there is a recent surge in utilizing modern machine learning (ML) methods, such as deep neural networks, to build data-driven emulators. While the majority of e…

    Submitted 5 May, 2020; originally announced May 2020.

  34. arXiv:1912.08113  [pdf, other]

    cs.LG cs.CV physics.comp-ph stat.ML

    Improved Surrogates in Inertial Confinement Fusion with Manifold and Cycle Consistencies

    Authors: Rushil Anirudh, Jayaraman J. Thiagarajan, Peer-Timo Bremer, Brian K. Spears

    Abstract: Neural networks have become very popular in surrogate modeling because of their ability to characterize arbitrary, high dimensional functions in a data driven fashion. This paper advocates for the training of surrogates that are consistent with the physical manifold -- i.e., predictions are always physically meaningful, and are cyclically consistent -- i.e., when the predictions of the surrogate,…

    Submitted 17 December, 2019; originally announced December 2019.

    Comments: 10 pages, 6 figures

  35. arXiv:1912.07748  [pdf, other]

    cs.CV cs.LG stat.ML

    MimicGAN: Robust Projection onto Image Manifolds with Corruption Mimicking

    Authors: Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Timo Bremer

    Abstract: In the past few years, Generative Adversarial Networks (GANs) have dramatically advanced our ability to represent and parameterize high-dimensional, non-linear image manifolds. As a result, they have been widely adopted across a variety of applications, ranging from challenging inverse problems like image completion, to problems such as anomaly detection and adversarial defense. A recurring theme…

    Submitted 30 April, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: International Journal on Computer Vision's (IJCV) Special Issue on GANs

  36. arXiv:1912.02892  [pdf, other]

    cs.DC cs.LG physics.comp-ph physics.plasm-ph

    Enabling Machine Learning-Ready HPC Ensembles with Merlin

    Authors: J. Luc Peterson, Ben Bay, Joe Koning, Peter Robinson, Jessica Semler, Jeremy White, Rushil Anirudh, Kevin Athey, Peer-Timo Bremer, Francesco Di Natale, David Fox, Jim A. Gaffney, Sam A. Jacobs, Bhavya Kailkhura, Bogdan Kustowski, Steven Langer, Brian Spears, Jayaraman Thiagarajan, Brian Van Essen, Jae-Seung Yeom

    Abstract: With the growing complexity of computational and experimental facilities, many scientific researchers are turning to machine learning (ML) techniques to analyze large scale ensemble data. With complexities such as multi-component workflows, heterogeneous machine architectures, parallel file systems, and batch scheduling, care must be taken to facilitate this analysis in a high performance computin…

    Submitted 1 July, 2021; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: 28 pages, 9 figures; Submitted to FGCS

    Report number: LLNL-JRNL-821884

  37. arXiv:1910.05375  [pdf, other]

    eess.IV cs.CV cs.LG

    Extreme Few-view CT Reconstruction using Deep Inference

    Authors: Hyojin Kim, Rushil Anirudh, K. Aditya Mohan, Kyle Champley

    Abstract: Reconstruction of few-view x-ray Computed Tomography (CT) data is a highly ill-posed problem. It is often used in applications that require low radiation dose in clinical CT, rapid industrial scanning, or fixed-gantry CT. Existing analytic or iterative algorithms generally produce poorly reconstructed images, severely deteriorated by artifacts and noise, especially when the number of x-ray project…

    Submitted 11 October, 2019; originally announced October 2019.

    Comments: Deep Inverse NeurIPS 2019 Workshop

  38. arXiv:1910.02270  [pdf, other]

    cs.DC cs.LG hep-ex physics.comp-ph

    Parallelizing Training of Deep Generative Models on Massive Scientific Datasets

    Authors: Sam Ade Jacobs, Brian Van Essen, David Hysom, Jae-Seung Yeom, Tim Moon, Rushil Anirudh, Jayaraman J. Thiagarajan, Shusen Liu, Peer-Timo Bremer, Jim Gaffney, Tom Benson, Peter Robinson, Luc Peterson, Brian Spears

    Abstract: Training deep neural networks on large scientific data is a challenging task that requires enormous compute power, especially if no pre-trained models exist to initialize the process. We present a novel tournament method to train traditional as well as generative adversarial networks built on LBANN, a scalable deep learning framework optimized for HPC systems. LBANN combines multiple levels of par…

    Submitted 5 October, 2019; originally announced October 2019.

  39. arXiv:1910.01666  [pdf, other]

    physics.comp-ph cs.CV cs.LG stat.ML

    Exploring Generative Physics Models with Scientific Priors in Inertial Confinement Fusion

    Authors: Rushil Anirudh, Jayaraman J. Thiagarajan, Shusen Liu, Peer-Timo Bremer, Brian K. Spears

    Abstract: There is significant interest in using modern neural networks for scientific applications due to their effectiveness in modeling highly complex, non-linear problems in a data-driven fashion. However, a common challenge is to verify the scientific plausibility or validity of outputs predicted by a neural network. This work advocates the use of known scientific constraints as a lens into evaluating,…

    Submitted 3 October, 2019; originally announced October 2019.

    Comments: Machine Learning for Physical Sciences Workshop at NeurIPS 2019

  40. arXiv:1910.01634  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Improving Limited Angle CT Reconstruction with a Robust GAN Prior

    Authors: Rushil Anirudh, Hyojin Kim, Jayaraman J. Thiagarajan, K. Aditya Mohan, Kyle M. Champley

    Abstract: Limited angle CT reconstruction is an under-determined linear inverse problem that requires appropriate regularization techniques to be solved. In this work we study how pre-trained generative adversarial networks (GANs) can be used to clean noisy, highly artifact laden reconstructions from conventional techniques, by effectively projecting onto the inferred image manifold. In particular, we use a…

    Submitted 29 January, 2020; v1 submitted 3 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019 Workshop on Deep Inverse Problems

  41. arXiv:1909.11804  [pdf, other]

    cs.LG cs.NE stat.ML

    Function Preserving Projection for Scalable Exploration of High-Dimensional Data

    Authors: Shusen Liu, Rushil Anirudh, Jayaraman J. Thiagarajan, Peer-Timo Bremer

    Abstract: We present function preserving projections (FPP), a scalable linear projection technique for discovering interpretable relationships in high-dimensional data. Conventional dimension reduction methods aim to maximally preserve the global and/or local geometric structure of a dataset. However, in practice one is often more interested in determining how one or multiple user-selected response function…

    Submitted 25 September, 2019; originally announced September 2019.

  42. arXiv:1907.08325  [pdf, other]

    cs.LG cs.HC cs.NE stat.ML

    Scalable Topological Data Analysis and Visualization for Evaluating Data-Driven Models in Scientific Applications

    Authors: Shusen Liu, Di Wang, Dan Maljovec, Rushil Anirudh, Jayaraman J. Thiagarajan, Sam Ade Jacobs, Brian C. Van Essen, David Hysom, Jae-Seung Yeom, Jim Gaffney, Luc Peterson, Peter B. Robinson, Harsh Bhatia, Valerio Pascucci, Brian K. Spears, Peer-Timo Bremer

    Abstract: With the rapid adoption of machine learning techniques for large-scale applications in science and engineering comes the convergence of two grand challenges in visualization. First, the utilization of black box models (e.g., deep neural networks) calls for advanced techniques in exploring and interpreting model behaviors. Second, the rapid growth in computing has produced enormous datasets that re…

    Submitted 18 July, 2019; originally announced July 2019.

  43. arXiv:1906.04338  [pdf, other]

    stat.ML cs.CV cs.LG

    SALT: Subspace Alignment as an Auxiliary Learning Task for Domain Adaptation

    Authors: Kowshik Thopalli, Jayaraman J. Thiagarajan, Rushil Anirudh, Pavan Turaga

    Abstract: Unsupervised domain adaptation aims to transfer and adapt knowledge learned from a labeled source domain to an unlabeled target domain. Key components of unsupervised domain adaptation include: (a) maximizing performance on the target, and (b) aligning the source and target domains. Traditionally, these tasks have either been considered as separate, or assumed to be implicitly addressed together w…

    Submitted 18 December, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

  44. arXiv:1811.10427  [pdf, other]

    cs.LG cs.CV stat.ML

    MR-GAN: Manifold Regularized Generative Adversarial Networks

    Authors: Qunwei Li, Bhavya Kailkhura, Rushil Anirudh, Yi Zhou, Yingbin Liang, Pramod Varshney

    Abstract: Despite the growing interest in generative adversarial networks (GANs), training GANs remains a challenging problem, both from a theoretical and a practical standpoint. To address this challenge, in this paper, we propose a novel way to exploit the unique geometry of the real data, especially the manifold information. More specifically, we design a method to regularize GAN training by adding an ad…

    Submitted 22 November, 2018; originally announced November 2018.

    Comments: arXiv admin note: text overlap with arXiv:1706.04156 by other authors

  45. arXiv:1811.08484  [pdf, other]

    cs.CV cs.AI stat.ML

    MimicGAN: Corruption-Mimicking for Blind Image Recovery & Adversarial Defense

    Authors: Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Timo Bremer

    Abstract: Solving inverse problems continues to be a central challenge in computer vision. Existing techniques either explicitly construct an inverse mapping using prior knowledge about the corruption, or learn the inverse directly using a large collection of examples. However, in practice, the nature of corruption may be unknown, and thus it is challenging to regularize the problem of inferring a plausible…

    Submitted 20 November, 2018; originally announced November 2018.

  46. arXiv:1811.04491  [pdf, other]

    cs.CV

    Multiple Subspace Alignment Improves Domain Adaptation

    Authors: Kowshik Thopalli, Rushil Anirudh, Jayaraman J. Thiagarajan, Pavan Turaga

    Abstract: We present a novel unsupervised domain adaptation (DA) method for cross-domain visual recognition. Though subspace methods have found success in DA, their performance is often limited due to the assumption of approximating an entire dataset using a single low-dimensional subspace. Instead, we develop a method to effectively represent the source and target datasets via a collection of low-dimension…

    Submitted 11 November, 2018; originally announced November 2018.

    Comments: under review in ICASSP 2019

  47. arXiv:1810.13427  [pdf, other]

    stat.ML cs.LG

    Unsupervised Dimension Selection using a Blue Noise Spectrum

    Authors: Jayaraman J. Thiagarajan, Rushil Anirudh, Rahul Sridhar, Peer-Timo Bremer

    Abstract: Unsupervised dimension selection is an important problem that seeks to reduce dimensionality of data, while preserving the most useful characteristics. While dimensionality reduction is commonly utilized to construct low-dimensional embeddings, they produce feature spaces that are hard to interpret. Further, in applications such as sensor design, one needs to perform reduction directly in the inpu…

    Submitted 31 October, 2018; originally announced October 2018.

  48. arXiv:1810.13425  [pdf, other]

    stat.ML cs.LG

    Understanding Deep Neural Networks through Input Uncertainties

    Authors: Jayaraman J. Thiagarajan, Irene Kim, Rushil Anirudh, Peer-Timo Bremer

    Abstract: Techniques for understanding the functioning of complex machine learning models are becoming increasingly popular, not only to improve the validation process, but also to extract new insights about the data via exploratory analysis. Though a large class of such tools currently exists, most assume that predictions are point estimates and use a sensitivity analysis of these estimates to interpret th…

    Submitted 31 October, 2018; v1 submitted 31 October, 2018; originally announced October 2018.

  49. arXiv:1805.07281  [pdf, other]

    cs.CV stat.ML

    An Unsupervised Approach to Solving Inverse Problems using Generative Adversarial Networks

    Authors: Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Timo Bremer

    Abstract: Solving inverse problems continues to be a challenge in a wide array of applications ranging from deblurring, image inpainting, source separation etc. Most existing techniques solve such inverse problems by either explicitly or implicitly finding the inverse of the model. The former class of techniques require explicit knowledge of the measurement process which can be unrealistic, and rely on stro…

    Submitted 4 June, 2018; v1 submitted 18 May, 2018; originally announced May 2018.

  50. arXiv:1711.10388  [pdf, other]

    cs.CV stat.ML

    Lose The Views: Limited Angle CT Reconstruction via Implicit Sinogram Completion

    Authors: Rushil Anirudh, Hyojin Kim, Jayaraman J. Thiagarajan, K. Aditya Mohan, Kyle Champley, Timo Bremer

    Abstract: Computed Tomography (CT) reconstruction is a fundamental component to a wide variety of applications ranging from security, to healthcare. The classical techniques require measuring projections, called sinograms, from a full 180$^\circ$ view of the object. This is impractical in a limited angle scenario, when the viewing angle is less than 180$^\circ$, which can occur due to different factors incl…

    Submitted 11 July, 2018; v1 submitted 28 November, 2017; originally announced November 2017.

    Comments: Spotlight presentation at CVPR 2018