Skip to main content

Showing 1–25 of 25 results for author: Madireddy, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.05718  [pdf, other

    stat.ML cs.LG

    REMEDI: Corrective Transformations for Improved Neural Entropy Estimation

    Authors: Viktor Nilsson, Anirban Samaddar, Sandeep Madireddy, Pierre Nyquist

    Abstract: Information theoretic quantities play a central role in machine learning. The recent surge in the complexity of data and models has increased the demand for accurate estimation of these quantities. However, as the dimension grows the estimation presents significant challenges, with existing methods struggling already in relatively low dimensions. To address this issue, in this work, we introduce… ▽ More

    Submitted 19 May, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: To appear in ICML 2024. 30 pages, 18 figures

    MSC Class: 94A17 (Primary) 68T01; 94A08 (Secondary)

  2. arXiv:2312.03876  [pdf, other

    physics.ao-ph cs.AI cs.LG

    Scaling transformer neural networks for skillful and reliable medium-range weather forecasting

    Authors: Tung Nguyen, Rohan Shah, Hritik Bansal, Troy Arcomano, Sandeep Madireddy, Romit Maulik, Veerabhadra Kotamarthi, Ian Foster, Aditya Grover

    Abstract: Weather forecasting is a fundamental problem for anticipating and mitigating the impacts of climate change. Recently, data-driven approaches for weather forecasting based on deep learning have shown great promise, achieving accuracies that are competitive with operational systems. However, those methods often employ complex, customized architectures without sufficient ablation analysis, making it… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  3. arXiv:2311.08421  [pdf, other

    physics.ao-ph cs.LG

    Surrogate Neural Networks to Estimate Parametric Sensitivity of Ocean Models

    Authors: Yixuan Sun, Elizabeth Cucuzzella, Steven Brus, Sri Hari Krishna Narayanan, Balu Nadiga, Luke Van Roekel, Jan Hückelheim, Sandeep Madireddy

    Abstract: Modeling is crucial to understanding the effect of greenhouse gases, warming, and ice sheet melting on the ocean. At the same time, ocean processes affect phenomena such as hurricanes and droughts. Parameters in the models that cannot be physically measured have a significant effect on the model output. For an idealized ocean model, we generated perturbed parameter ensemble data and trained surrog… ▽ More

    Submitted 10 November, 2023; originally announced November 2023.

  4. arXiv:2310.16996  [pdf, other

    cs.LG cs.DC

    Towards Continually Learning Application Performance Models

    Authors: Ray A. O. Sinurat, Anurag Daram, Haryadi S. Gunawi, Robert B. Ross, Sandeep Madireddy

    Abstract: Machine learning-based performance models are increasingly being used to build critical job scheduling and application optimization decisions. Traditionally, these models assume that data distribution does not change as more samples are collected over time. However, owing to the complexity and heterogeneity of production HPC systems, they are susceptible to hardware degradation, replacement, and/o… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Presented at Workshop on Machine Learning for Systems at 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  5. arXiv:2308.04539  [pdf, other

    cs.LG cs.AI cs.NE

    Improving Performance in Continual Learning Tasks using Bio-Inspired Architectures

    Authors: Sandeep Madireddy, Angel Yanguas-Gil, Prasanna Balaprakash

    Abstract: The ability to learn continuously from an incoming data stream without catastrophic forgetting is critical to designing intelligent systems. Many approaches to continual learning rely on stochastic gradient descent and its variants that employ global error updates, and hence need to adopt strategies such as memory buffers or replay to circumvent its stability, greed, and short-term memory limitati… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  6. arXiv:2302.13210  [pdf, other

    cs.NE cs.AR cs.LG

    AutoML for neuromorphic computing and application-driven co-design: asynchronous, massively parallel optimization of spiking architectures

    Authors: Angel Yanguas-Gil, Sandeep Madireddy

    Abstract: In this work we have extended AutoML inspired approaches to the exploration and optimization of neuromorphic architectures. Through the integration of a parallel asynchronous model-based search approach with a simulation framework to simulate spiking architectures, we are able to efficiently explore the configuration space of neuromorphic architectures and identify the subset of conditions leading… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

  7. arXiv:2302.02005  [pdf, other

    astro-ph.GA cs.AI cs.CV

    DeepAstroUDA: Semi-Supervised Universal Domain Adaptation for Cross-Survey Galaxy Morphology Classification and Anomaly Detection

    Authors: A. Ćiprijanović, A. Lewis, K. Pedro, S. Madireddy, B. Nord, G. N. Perdue, S. M. Wild

    Abstract: Artificial intelligence methods show great promise in increasing the quality and speed of work with large astronomical datasets, but the high complexity of these methods leads to the extraction of dataset-specific, non-robust features. Therefore, such methods do not generalize well across multiple datasets. We present a universal domain adaptation method, \textit{DeepAstroUDA}, as an approach to o… ▽ More

    Submitted 22 March, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: Accepted in Machine Learning Science and Technology (MLST); 24 pages, 14 figures

    Report number: FERMILAB-PUB-23-034-CSAID

  8. A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems

    Authors: Megan M. Baker, Alexander New, Mario Aguilar-Simon, Ziad Al-Halah, Sébastien M. R. Arnold, Ese Ben-Iwhiwhu, Andrew P. Brna, Ethan Brooks, Ryan C. Brown, Zachary Daniels, Anurag Daram, Fabien Delattre, Ryan Dellana, Eric Eaton, Haotian Fu, Kristen Grauman, Jesse Hostetler, Shariq Iqbal, Cassandra Kent, Nicholas Ketz, Soheil Kolouri, George Konidaris, Dhireesha Kudithipudi, Erik Learned-Miller, Seungwon Lee , et al. (22 additional authors not shown)

    Abstract: Despite the advancement of machine learning techniques in recent years, state-of-the-art systems lack robustness to "real world" events, where the input distributions and tasks encountered by the deployed systems will not be limited to the original training context, and systems will instead need to adapt to novel distributions and tasks while deployed. This critical gap may be addressed through th… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: To appear in Neural Networks

  9. arXiv:2211.16759  [pdf, other

    cs.LG cs.AI

    General policy map**: online continual reinforcement learning inspired on the insect brain

    Authors: Angel Yanguas-Gil, Sandeep Madireddy

    Abstract: We have developed a model for online continual or lifelong reinforcement learning (RL) inspired on the insect brain. Our model leverages the offline training of a feature extraction and a common general policy layer to enable the convergence of RL algorithms in online settings. Sharing a common policy layer across tasks leads to positive backward transfer, where the agent continuously improved in… ▽ More

    Submitted 30 November, 2022; originally announced November 2022.

  10. arXiv:2211.00677  [pdf, other

    astro-ph.GA cs.AI cs.CV cs.LG

    Semi-Supervised Domain Adaptation for Cross-Survey Galaxy Morphology Classification and Anomaly Detection

    Authors: Aleksandra Ćiprijanović, Ashia Lewis, Kevin Pedro, Sandeep Madireddy, Brian Nord, Gabriel N. Perdue, Stefan M. Wild

    Abstract: In the era of big astronomical surveys, our ability to leverage artificial intelligence algorithms simultaneously for multiple datasets will open new avenues for scientific discovery. Unfortunately, simply training a deep neural network on images from one data domain often leads to very poor performance on any other dataset. Here we develop a Universal Domain Adaptation method DeepAstroUDA, capabl… ▽ More

    Submitted 11 November, 2022; v1 submitted 1 November, 2022; originally announced November 2022.

    Comments: 3 figures, 1 table; accepted to Machine Learning and the Physical Sciences - Workshop at the 36th conference on Neural Information Processing Systems (NeurIPS)

    Report number: FERMILAB-CONF-22-791-SCD

  11. arXiv:2210.04083  [pdf, other

    cs.LG cs.AI

    Unified Probabilistic Neural Architecture and Weight Ensembling Improves Model Robustness

    Authors: Sumegha Premchandar, Sandeep Madireddy, Sanket Jantre, Prasanna Balaprakash

    Abstract: Robust machine learning models with accurately calibrated uncertainties are crucial for safety-critical applications. Probabilistic machine learning and especially the Bayesian formalism provide a systematic framework to incorporate robustness through the distributional estimates and reason about uncertainty. Recent works have shown that approximate inference approaches that take the weight space… ▽ More

    Submitted 8 October, 2022; originally announced October 2022.

  12. HPC Storage Service Autotuning Using Variational-Autoencoder-Guided Asynchronous Bayesian Optimization

    Authors: Matthieu Dorier, Romain Egele, Prasanna Balaprakash, Jaehoon Koo, Sandeep Madireddy, Srinivasan Ramesh, Allen D. Malony, Rob Ross

    Abstract: Distributed data storage services tailored to specific applications have grown popular in the high-performance computing (HPC) community as a way to address I/O and storage challenges. These services offer a variety of specific interfaces, semantics, and data representations. They also expose many tuning parameters, making it difficult for their users to find the best configuration for a given wor… ▽ More

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: Accepted at IEEE Cluster 2022

  13. arXiv:2206.00794  [pdf, other

    stat.ML cs.LG math.ST

    Sequential Bayesian Neural Subnetwork Ensembles

    Authors: Sanket Jantre, Sandeep Madireddy, Shrijita Bhattacharya, Tapabrata Maiti, Prasanna Balaprakash

    Abstract: Deep neural network ensembles that appeal to model diversity have been used successfully to improve predictive performance and model robustness in several applications. Whereas, it has recently been shown that sparse subnetworks of dense models can match the performance of their dense counterparts and increase their robustness while effectively decreasing the model complexity. However, most ensemb… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

  14. arXiv:2204.08180  [pdf, other

    cs.DC cs.PF

    A Taxonomy of Error Sources in HPC I/O Machine Learning Models

    Authors: Mihailo Isakov, Mikaela Currier, Eliakin del Rosario, Sandeep Madireddy, Prasanna Balaprakash, Philip Carns, Robert B. Ross, Glenn K. Lockwood, Michel A. Kinsy

    Abstract: I/O efficiency is crucial to productivity in scientific computing, but the increasing complexity of the system and the applications makes it difficult for practitioners to understand and optimize I/O behavior at scale. Data-driven machine learning-based I/O throughput models offer a solution: they can be used to identify bottlenecks, automate I/O tuning, or optimize job scheduling with minimal hum… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

    Report number: STAM01

  15. arXiv:2203.02592  [pdf, other

    stat.ML cs.LG stat.ME

    Sparsity-Inducing Categorical Prior Improves Robustness of the Information Bottleneck

    Authors: Anirban Samaddar, Sandeep Madireddy, Prasanna Balaprakash, Tapabrata Maiti, Gustavo de los Campos, Ian Fischer

    Abstract: The information bottleneck framework provides a systematic approach to learning representations that compress nuisance information in the input and extract semantically meaningful information about predictions. However, the choice of a prior distribution that fixes the dimensionality across all the data can restrict the flexibility of this approach for learning robust representations. We present a… ▽ More

    Submitted 27 October, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

  16. arXiv:2112.14299  [pdf, other

    cs.LG astro-ph.GA cs.AI cs.CV

    DeepAdversaries: Examining the Robustness of Deep Learning Models for Galaxy Morphology Classification

    Authors: Aleksandra Ćiprijanović, Diana Kafkes, Gregory Snyder, F. Javier Sánchez, Gabriel Nathan Perdue, Kevin Pedro, Brian Nord, Sandeep Madireddy, Stefan M. Wild

    Abstract: With increased adoption of supervised deep learning methods for processing and analysis of cosmological survey data, the assessment of data perturbation effects (that can naturally occur in the data processing and analysis pipelines) and the development of methods that increase model robustness are increasingly important. In the context of morphological classification of galaxies, we study the eff… ▽ More

    Submitted 6 July, 2022; v1 submitted 28 December, 2021; originally announced December 2021.

    Comments: 20 pages, 6 figures, 5 tables; accepted in MLST

    Report number: FERMILAB-PUB-21-767-SCD

  17. arXiv:2111.00961  [pdf, other

    astro-ph.GA cs.CV cs.LG

    Robustness of deep learning algorithms in astronomy -- galaxy morphology studies

    Authors: A. Ćiprijanović, D. Kafkes, G. N. Perdue, K. Pedro, G. Snyder, F. J. Sánchez, S. Madireddy, S. M. Wild, B. Nord

    Abstract: Deep learning models are being increasingly adopted in wide array of scientific domains, especially to handle high-dimensionality and volume of the scientific data. However, these models tend to be brittle due to their complexity and overparametrization, especially to the inadvertent adversarial perturbations that can appear due to common image processing such as compression or blurring that are o… ▽ More

    Submitted 2 November, 2021; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: Accepted in: Fourth Workshop on Machine Learning and the Physical Sciences (35th Conference on Neural Information Processing Systems; NeurIPS2021); final version

    Report number: FERMILAB-CONF-21-561-SCD

  18. arXiv:2110.13041  [pdf, other

    cs.LG cs.AR physics.data-an physics.ins-det

    Applications and Techniques for Fast Machine Learning in Science

    Authors: Allison McCarn Deiana, Nhan Tran, Joshua Agar, Michaela Blott, Giuseppe Di Guglielmo, Javier Duarte, Philip Harris, Scott Hauck, Mia Liu, Mark S. Neubauer, Jennifer Ngadiuba, Seda Ogrenci-Memik, Maurizio Pierini, Thea Aarrestad, Steffen Bahr, Jurgen Becker, Anne-Sophie Berthold, Richard J. Bonventre, Tomas E. Muller Bravo, Markus Diefenthaler, Zhen Dong, Nick Fritzsche, Amir Gholami, Ekaterina Govorkova, Kyle J Hazelwood , et al. (62 additional authors not shown)

    Abstract: In this community review report, we discuss applications and techniques for fast machine learning (ML) in science -- the concept of integrating power ML methods into the real-time experimental data processing loop to accelerate scientific discovery. The material for the report builds on two workshops held by the Fast ML for Science community and covers three main areas: applications for fast ML ac… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: 66 pages, 13 figures, 5 tables

    Report number: FERMILAB-PUB-21-502-AD-E-SCD

    Journal ref: Front. Big Data 5, 787421 (2022)

  19. arXiv:2103.01373  [pdf, other

    astro-ph.IM astro-ph.GA cs.AI cs.CV cs.LG

    DeepMerge II: Building Robust Deep Learning Algorithms for Merging Galaxy Identification Across Domains

    Authors: A. Ćiprijanović, D. Kafkes, K. Downey, S. Jenkins, G. N. Perdue, S. Madireddy, T. Johnston, G. F. Snyder, B. Nord

    Abstract: In astronomy, neural networks are often trained on simulation data with the prospect of being used on telescope observations. Unfortunately, training a model on simulation data and then applying it to instrument data leads to a substantial and potentially even detrimental decrease in model accuracy on the new target dataset. Simulated and instrument data represent different data domains, and for a… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: Submitted to MNRAS; 21 pages, 9 figures, 9 tables

    Report number: FERMILAB-PUB-21-072-SCD

    Journal ref: MNRAS, Volume 506, Issue 1, September 2021, Page 677

  20. arXiv:2011.03591  [pdf, other

    astro-ph.IM astro-ph.GA cs.AI cs.LG

    Domain adaptation techniques for improved cross-domain study of galaxy mergers

    Authors: A. Ćiprijanović, D. Kafkes, S. Jenkins, K. Downey, G. N. Perdue, S. Madireddy, T. Johnston, B. Nord

    Abstract: In astronomy, neural networks are often trained on simulated data with the prospect of being applied to real observations. Unfortunately, simply training a deep neural network on images from one domain does not guarantee satisfactory performance on new images from a different domain. The ability to share cross-domain knowledge is the main advantage of modern deep domain adaptation techniques. Here… ▽ More

    Submitted 13 November, 2020; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: Accepted in: Machine Learning and the Physical Sciences - Workshop at the 34th Conference on Neural Information Processing Systems (NeurIPS); final version

    Report number: FERMILAB-CONF-20-582-SCD

  21. arXiv:2007.08159  [pdf, other

    cs.LG stat.ML

    Neuromodulated Neural Architectures with Local Error Signals for Memory-Constrained Online Continual Learning

    Authors: Sandeep Madireddy, Angel Yanguas-Gil, Prasanna Balaprakash

    Abstract: The ability to learn continuously from an incoming data stream without catastrophic forgetting is critical for designing intelligent systems. Many existing approaches to continual learning rely on stochastic gradient descent and its variants. However, these algorithms have to implement various strategies, such as memory buffers or replay, to overcome well-known shortcomings of stochastic gradient… ▽ More

    Submitted 13 March, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

  22. arXiv:1911.07630  [pdf, other

    cs.OH cs.LG stat.ML

    Value-Added Chemical Discovery Using Reinforcement Learning

    Authors: Peihong Jiang, Hieu Doan, Sandeep Madireddy, Rajeev Surendran Assary, Prasanna Balaprakash

    Abstract: Computer-assisted synthesis planning aims to help chemists find better reaction pathways faster. Finding viable and short pathways from sugar molecules to value-added chemicals can be modeled as a retrosynthesis planning problem with a catalyst allowed. This is a crucial step in efficient biomass conversion. The traditional computational chemistry approach to identifying possible reaction pathways… ▽ More

    Submitted 10 November, 2019; originally announced November 2019.

  23. arXiv:1911.03867  [pdf, other

    astro-ph.IM astro-ph.CO cs.LG

    A Modular Deep Learning Pipeline for Galaxy-Scale Strong Gravitational Lens Detection and Modeling

    Authors: Sandeep Madireddy, Nesar Ramachandra, Nan Li, James Butler, Prasanna Balaprakash, Salman Habib, Katrin Heitmann, The LSST Dark Energy Science Collaboration

    Abstract: Upcoming large astronomical surveys are expected to capture an unprecedented number of strong gravitational lensing systems. Deep learning is emerging as a promising practical tool for the detection and quantification of these galaxy-scale image distortions. The absence of large quantities of representative data from current astronomical surveys motivates the development of a robust forward-modeli… ▽ More

    Submitted 21 October, 2022; v1 submitted 10 November, 2019; originally announced November 2019.

  24. arXiv:1909.09144  [pdf, other

    cs.LG physics.comp-ph stat.ML

    Using recurrent neural networks for nonlinear component computation in advection-dominated reduced-order models

    Authors: Romit Maulik, Vishwas Rao, Sandeep Madireddy, Bethany Lusch, Prasanna Balaprakash

    Abstract: Rapid simulations of advection-dominated problems are vital for multiple engineering and geophysical applications. In this paper, we present a long short-term memory neural network to approximate the nonlinear component of the reduced-order model (ROM) of an advection-dominated partial differential equation. This is motivated by the fact that the nonlinear term is the most expensive component of a… ▽ More

    Submitted 1 November, 2019; v1 submitted 18 September, 2019; originally announced September 2019.

  25. arXiv:1906.01668  [pdf, other

    cs.LG cs.NE stat.ML

    Neuromorphic Architecture Optimization for Task-Specific Dynamic Learning

    Authors: Sandeep Madireddy, Angel Yanguas-Gil, Prasanna Balaprakash

    Abstract: The ability to learn and adapt in real time is a central feature of biological systems. Neuromorphic architectures demonstrating such versatility can greatly enhance our ability to efficiently process information at the edge. A key challenge, however, is to understand which learning rules are best suited for specific tasks and how the relevant hyperparameters can be fine-tuned. In this work, we in… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

    Report number: ANL/MCS-P9175-0419

    Journal ref: Proceedings of the International Conference on Neuromorphic Systems 2019. ACM, New York, NY, USA, Article 5, 5 pages