
Showing 1–50 of 190 results for author: Sarath

Searching in archive cs.
  1. arXiv:2407.04561

    cs.NI eess.SP

    Wireless Spectrum in Rural Farmlands: Status, Challenges and Opportunities

    Authors: Mukaram Shahid, Kunal Das, Taimoor Ul Islam, Christ Somiah, Daji Qiao, Arsalan Ahmad, Jimming Song, Zhengyuan Zhu, Sarath Babu, Yong Guan, Tusher Chakraborty, Suraj Jog, Ranveer Chandra, Hongwei Zhang

    Abstract: Due to factors such as low population density and expansive geographical distances, network deployment falls behind in rural regions, leading to a broadband divide. Wireless spectrum serves as the blood and flesh of wireless communications. Shared white spaces such as those in the TVWS and CBRS spectrum bands offer opportunities to expand connectivity, innovate, and provide affordable access to hi…

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2407.02678

    cs.AI cs.CL

    Reasoning in Large Language Models: A Geometric Perspective

    Authors: Romain Cosentino, Sarath Shekkizhar

    Abstract: The advancement of large language models (LLMs) for real-world applications hinges critically on enhancing their reasoning capabilities. In this work, we explore the reasoning abilities of large language models (LLMs) through their geometrical understanding. We establish a connection between the expressive power of LLMs and the density of their self-attention graphs. Our analysis demonstrates that…

    Submitted 2 July, 2024; originally announced July 2024.

  3. arXiv:2406.05918

    cs.CL cs.AI cs.CY cs.LG

    Why Don't Prompt-Based Fairness Metrics Correlate?

    Authors: Abdelrahman Zayed, Goncalo Mordido, Ioana Baldini, Sarath Chandar

    Abstract: The widespread use of large language models has brought up essential questions about the potential biases these models might learn. This led to the development of several metrics aimed at evaluating and mitigating these biases. In this paper, we first demonstrate that prompt-based fairness metrics exhibit poor agreement, as measured by correlation, raising important questions about the reliability…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: In Proceedings of ACL main 2024

  4. arXiv:2406.04879

    cs.CL

    A Deep Dive into the Trade-Offs of Parameter-Efficient Preference Alignment Techniques

    Authors: Megh Thakkar, Quentin Fournier, Matthew D Riemer, Pin-Yu Chen, Amal Zouaq, Payel Das, Sarath Chandar

    Abstract: Large language models are first pre-trained on trillions of tokens and then instruction-tuned or aligned to specific preferences. While pre-training remains out of reach for most researchers due to the compute required, fine-tuning has become affordable thanks to parameter-efficient methods such as LoRA and QLoRA. Alignment is known to be sensitive to the many factors involved, including the quant…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL (Main) 2024

  5. arXiv:2406.03686

    cs.LG

    BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning

    Authors: Artem Zholus, Maksim Kuznetsov, Roman Schutski, Rim Shayakhmetov, Daniil Polykovskiy, Sarath Chandar, Alex Zhavoronkov

    Abstract: Generating novel active molecules for a given protein is an extremely challenging task for generative models that requires an understanding of the complex physical interactions between the molecule and its environment. In this paper, we present a novel generative model, BindGPT which uses a conceptually simple but powerful approach to create 3D molecules within the protein's binding site. Our mode…

    Submitted 5 June, 2024; originally announced June 2024.

  6. arXiv:2405.15895

    cs.LG

    Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective

    Authors: Pranshu Malviya, Jerry Huang, Quentin Fournier, Sarath Chandar

    Abstract: The optimal model for a given task is often challenging to determine, requiring training multiple models from scratch which becomes prohibitive as dataset and model sizes grow. A more efficient alternative is to reuse smaller pre-trained models by expanding them, however, this is not widely adopted as how this impacts training dynamics remains poorly understood. While prior works have introduced s…

    Submitted 24 May, 2024; originally announced May 2024.

  7. arXiv:2405.15804

    cs.AI

    Explainable Human-AI Interaction: A Planning Perspective

    Authors: Sarath Sreedharan, Anagha Kulkarni, Subbarao Kambhampati

    Abstract: From its inception, AI has had a rather ambivalent relationship with humans -- swinging between their augmentation and replacement. Now, as AI technologies enter our everyday lives at an ever increasing pace, there is a greater need for AI systems to work synergistically with humans. One critical requirement for such synergistic human-AI interaction is that the AI systems be explainable to the hum…

    Submitted 19 May, 2024; originally announced May 2024.

  8. arXiv:2405.07773

    cs.RO cs.AI

    Human-Modeling in Sequential Decision-Making: An Analysis through the Lens of Human-Aware AI

    Authors: Silvia Tulli, Stylianos Loukas Vasileiou, Sarath Sreedharan

    Abstract: "Human-aware" has become a popular keyword used to describe a particular class of AI systems that are designed to work and interact with humans. While there exists a surprising level of consistency among the works that use the label human-aware, the term itself mostly remains poorly understood. In this work, we retroactively try to provide an account of what constitutes a human-aware AI system. We…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 9 pages, 1 figure, 1 table

    ACM Class: I.2

  9. arXiv:2405.05386

    cs.LG cs.CL cs.CV stat.ML

    Interpretability Needs a New Paradigm

    Authors: Andreas Madsen, Himabindu Lakkaraju, Siva Reddy, Sarath Chandar

    Abstract: Interpretability is the study of explaining models in understandable terms to humans. At present, interpretability is divided into two paradigms: the intrinsic paradigm, which believes that only models designed to be explained can be explained, and the post-hoc paradigm, which believes that black-box models can be explained. At the core of this debate is how each paradigm ensures its explanations…

    Submitted 8 May, 2024; originally announced May 2024.

  10. arXiv:2405.02749

    cs.LG

    Sub-goal Distillation: A Method to Improve Small Language Agents

    Authors: Maryam Hashemzadeh, Elias Stengel-Eskin, Sarath Chandar, Marc-Alexandre Cote

    Abstract: While Large Language Models (LLMs) have demonstrated significant promise as agents in interactive tasks, their substantial computational requirements and restricted number of calls constrain their practical utility, especially in long-horizon interactive tasks such as decision-making or in scenarios involving continuous ongoing tasks. To address these constraints, we propose a method for transferr…

    Submitted 4 May, 2024; originally announced May 2024.

  11. arXiv:2405.01684

    cs.LG cs.AI

    Intelligent Switching for Reset-Free RL

    Authors: Darshan Patil, Janarthanan Rajendran, Glen Berseth, Sarath Chandar

    Abstract: In the real world, the strong episode resetting mechanisms that are needed to train agents in simulation are unavailable. The resetting assumption limits the potential of reinforcement learning in the real world, as providing resets to an agent usually requires the creation of additional handcrafted mechanisms or human interventions. Recent work aims to train agents (forward) wit…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Published at ICLR 2024

  12. arXiv:2405.01114

    cs.LG cs.RO

    Continual Imitation Learning for Prosthetic Limbs

    Authors: Sharmita Dey, Benjamin Paassen, Sarath Ravindran Nair, Sabri Boughorbel, Arndt F. Schilling

    Abstract: Lower limb amputations and neuromuscular impairments severely restrict mobility, necessitating advancements beyond conventional prosthetics. Motorized bionic limbs offer promise, but their utility depends on mimicking the evolving synergy of human movement in various settings. In this context, we present a novel model for bionic prostheses' application that leverages camera-based motion capture an…

    Submitted 2 May, 2024; originally announced May 2024.

  13. arXiv:2404.17434

    cs.NI

    Exploring Wireless Channels in Rural Areas: A Comprehensive Measurement Study

    Authors: Tianyi Zhang, Guoying Zu, Taimoor Ul Islam, Evan Gossling, Sarath Babu, Daji Qiao, Hongwei Zhang

    Abstract: The study of wireless channel behavior has been an active research topic for many years. However, there exists a noticeable scarcity of studies focusing on wireless channel characteristics in rural areas. With the advancement of smart agriculture practices in rural regions, there has been an increasing demand for affordable, high-capacity, and low-latency wireless networks to support various preci…

    Submitted 26 April, 2024; originally announced April 2024.

  14. arXiv:2404.16063

    cs.HC cs.GR

    Chronological Outlooks of Globe Illustrated with Web-Based Visualization

    Authors: Tahmim Hossain, Sai Sarath Movva, Ritika Ritika

    Abstract: Developing visualizations with comprehensive annotations is crucial for research and educational purposes. We've been experimenting with various visualization tools like Plotly, Plotly.js, and D3.js to analyze global trends, focusing on areas such as Global Terrorism, the Global Air Quality Index (AQI), and Global Population dynamics. These visualizations help us gain insights into complex researc…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 4 pages, 10 figures

  15. arXiv:2404.15184

    cs.AI

    Reducing Human-Robot Goal State Divergence with Environment Design

    Authors: Kelsey Sikes, Sarah Keren, Sarath Sreedharan

    Abstract: One of the most difficult challenges in creating successful human-AI collaborations is aligning a robot's behavior with a human user's expectations. When this fails to occur, a robot may misinterpret their specified goals, prompting it to perform actions with unanticipated, potentially dangerous side effects. To avoid this, we propose a new metric we call Goal State Divergence $\mathcal{(GSD)}$, w…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 8 pages, 1 figure

    ACM Class: I.2.8; I.2.9

  16. arXiv:2404.09339

    cs.CL cs.AI cs.LG

    Towards Practical Tool Usage for Continually Learning LLMs

    Authors: Jerry Huang, Prasanna Parthasarathi, Mehdi Rezagholizadeh, Sarath Chandar

    Abstract: Large language models (LLMs) show an innate skill for solving language based tasks. But insights have suggested an inability to adjust for information or task-solving skills becoming outdated, as their knowledge, stored directly within their parameters, remains static in time. Tool use helps by offloading work to systems that the LLM can access through an interface, but LLMs that use them still mu…

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 20 pages, 11 tables, 7 figures

  17. arXiv:2404.08791

    cs.AI cs.LG

    Handling Reward Misspecification in the Presence of Expectation Mismatch

    Authors: Sarath Sreedharan, Malek Mechergui

    Abstract: Detecting and handling misspecified objectives, such as reward functions, has been widely recognized as one of the central challenges within the domain of Artificial Intelligence (AI) safety research. However, even with the recognition of the importance of this problem, we are unaware of any works that attempt to provide a clear definition for what constitutes (a) misspecified objectives and (b) s…

    Submitted 12 April, 2024; originally announced April 2024.

  18. arXiv:2403.11901

    cs.LG cs.AI

    Larimar: Large Language Models with Episodic Memory Control

    Authors: Payel Das, Subhajit Chaudhury, Elliot Nelson, Igor Melnyk, Sarath Swaminathan, Sihui Dai, Aurélie Lozano, Georgios Kollias, Vijil Chenthamarakshan, Jiří Navrátil, Soham Dan, Pin-Yu Chen

    Abstract: Efficient and accurate updating of knowledge stored in Large Language Models (LLMs) is one of the most pressing research challenges today. This paper presents Larimar - a novel, brain-inspired architecture for enhancing LLMs with a distributed episodic memory. Larimar's memory allows for dynamic, one-shot updates of knowledge without the need for computationally expensive re-training or fine-tunin…

    Submitted 6 July, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: ICML 2024

  19. arXiv:2403.06569

    cs.LG cs.RO

    Enhancing Joint Motion Prediction for Individuals with Limb Loss Through Model Reprogramming

    Authors: Sharmita Dey, Sarath R. Nair

    Abstract: Mobility impairment caused by limb loss is a significant challenge faced by millions of individuals worldwide. The development of advanced assistive technologies, such as prosthetic devices, has the potential to greatly improve the quality of life for amputee patients. A critical component in the design of such technologies is the accurate prediction of reference joint motion for the missing limb.…

    Submitted 12 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Journal ref: ICLR 2024 Workshop: Learning from Time Series for Health

  20. arXiv:2403.04253

    cs.LG

    Mastering Memory Tasks with World Models

    Authors: Mohammad Reza Samsami, Artem Zholus, Janarthanan Rajendran, Sarath Chandar

    Abstract: Current model-based reinforcement learning (MBRL) agents struggle with long-term dependencies. This limits their ability to effectively solve tasks involving extended time gaps between actions and outcomes, or tasks demanding the recalling of distant observations to inform current actions. To improve temporal coherence, we integrate a new family of state space models (SSMs) in world models of MBRL…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Published as a conference paper at The International Conference on Learning Representations 2024

  21. arXiv:2402.11005

    cs.CL cs.AI

    Exploring Value Biases: How LLMs Deviate Towards the Ideal

    Authors: Sarath Sivaprasad, Pramod Kaushik, Sahar Abdelnabi, Mario Fritz

    Abstract: Large-Language-Models (LLMs) are deployed in a wide range of applications, and their response has an increasing social impact. Understanding the non-deliberate(ive) mechanism of LLMs in giving responses is essential in explaining their performance and discerning their biases in real-world applications. This is analogous to human studies, where such inadvertent responses are referred to as sampling…

    Submitted 21 February, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  22. arXiv:2401.07927

    cs.CL cs.AI cs.LG

    Are self-explanations from Large Language Models faithful?

    Authors: Andreas Madsen, Sarath Chandar, Siva Reddy

    Abstract: Instruction-tuned Large Language Models (LLMs) excel at many tasks and will even explain their reasoning, so-called self-explanations. However, convincing and wrong self-explanations can lead to unsupported confidence in LLMs, thus increasing risk. Therefore, it's important to measure if self-explanations truly reflect the model's behavior. Such a measure is called interpretability-faithfulness an…

    Submitted 16 May, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: The 62nd Annual Meeting of the Association for Computational Linguistics

  23. arXiv:2312.15398

    cs.CL cs.CY cs.LG

    Fairness-Aware Structured Pruning in Transformers

    Authors: Abdelrahman Zayed, Goncalo Mordido, Samira Shabanian, Ioana Baldini, Sarath Chandar

    Abstract: The increasing size of large language models (LLMs) has introduced challenges in their training and inference. Removing model components is perceived as a solution to tackle the large model sizes, however, existing pruning methods solely focus on performance, without considering an essential aspect for the responsible use of LLMs: model fairness. It is crucial to address the fairness of LLMs towar…

    Submitted 23 December, 2023; originally announced December 2023.

    Comments: In Proceedings of AAAI 2024

  24. arXiv:2312.01648

    cs.AI cs.CL cs.LG

    Characterizing Large Language Model Geometry Solves Toxicity Detection and Generation

    Authors: Randall Balestriero, Romain Cosentino, Sarath Shekkizhar

    Abstract: Large Language Models (LLMs) drive current AI breakthroughs despite very little being known about their internal representations, e.g., how to extract a few informative features to solve various downstream tasks. To provide a practical and principled answer, we propose to characterize LLMs from a geometric perspective. We obtain in closed form (i) the intrinsic dimension in which the Multi-Head At…

    Submitted 10 December, 2023; v1 submitted 4 December, 2023; originally announced December 2023.

  25. arXiv:2311.13720

    cs.AI

    Can LLMs Fix Issues with Reasoning Models? Towards More Likely Models for AI Planning

    Authors: Turgay Caglar, Sirine Belhaj, Tathagata Chakraborti, Michael Katz, Sarath Sreedharan

    Abstract: This is the first work to look at the application of large language models (LLMs) for the purpose of model space edits in automated planning tasks. To set the stage for this union, we explore two different flavors of model space problems that have been studied in the AI planning literature and explore the effect of an LLM on those tasks. We empirically demonstrate how the performance of an LLM con…

    Submitted 4 March, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: 24 pages

  26. arXiv:2311.07687

    cs.CL cs.AI cs.LG

    Language Model-In-The-Loop: Data Optimal Approach to Learn-To-Recommend Actions in Text Games

    Authors: Arjun Vaithilingam Sudhakar, Prasanna Parthasarathi, Janarthanan Rajendran, Sarath Chandar

    Abstract: Large Language Models (LLMs) have demonstrated superior performance in language understanding benchmarks. CALM, a popular approach, leverages linguistic priors of LLMs -- GPT-2 -- for action candidate recommendations to improve the performance in text games in Jericho without environment-provided actions. However, CALM adapts GPT-2 with annotated human gameplays and keeps the LLM fixed during the…

    Submitted 13 November, 2023; originally announced November 2023.

  27. arXiv:2311.01022

    cs.CV cs.AI

    NeuroWrite: Predictive Handwritten Digit Classification using Deep Neural Networks

    Authors: Kottakota Asish, P. Sarath Teja, R. Kishan Chander, Dr. D. Deva Hema

    Abstract: The rapid evolution of deep neural networks has revolutionized the field of machine learning, enabling remarkable advancements in various domains. In this article, we introduce NeuroWrite, a unique method for predicting the categorization of handwritten digits using deep neural networks. Our model exhibits outstanding accuracy in identifying and categorising handwritten digits by utilising the str…

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: 6 pages, 10 figures

    MSC Class: 68T10; 68T45; 68T60 ACM Class: I.4.8; I.5.2; J.4

  28. arXiv:2311.00913

    cs.CL

    Self-Influence Guided Data Reweighting for Language Model Pre-training

    Authors: Megh Thakkar, Tolga Bolukbasi, Sriram Ganapathy, Shikhar Vashishth, Sarath Chandar, Partha Talukdar

    Abstract: Language Models (LMs) pre-trained with self-supervision on large text corpora have become the default starting point for developing models for various NLP tasks. Once the pre-training corpus has been assembled, all data samples in the corpus are treated with equal importance during LM pre-training. However, due to varying levels of relevance and quality of data, equal importance to all the data sa…

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted to EMNLP 2023

  29. arXiv:2310.15372

    cs.CL cs.AI

    EpiK-Eval: Evaluation for Language Models as Epistemic Models

    Authors: Gabriele Prato, Jerry Huang, Prasannna Parthasarathi, Shagun Sodhani, Sarath Chandar

    Abstract: In the age of artificial intelligence, the role of large language models (LLMs) is becoming increasingly central. Despite their growing prevalence, their capacity to consolidate knowledge from different training documents - a crucial ability in numerous applications - remains unexplored. This paper presents the first study examining the capability of LLMs to effectively combine such information wi…

    Submitted 22 February, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

  30. arXiv:2310.07819

    cs.CL cs.LG

    Faithfulness Measurable Masked Language Models

    Authors: Andreas Madsen, Siva Reddy, Sarath Chandar

    Abstract: A common approach to explaining NLP models is to use importance measures that express which tokens are important for a prediction. Unfortunately, such explanations are often wrong despite being persuasive. Therefore, it is essential to measure their faithfulness. One such metric is if tokens are truly important, then masking them should result in worse model performance. However, token masking int…

    Submitted 9 May, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  31. arXiv:2310.00797

    cs.LG

    Don't Miss Out on Novelty: Importance of Novel Features for Deep Anomaly Detection

    Authors: Sarath Sivaprasad, Mario Fritz

    Abstract: Anomaly Detection (AD) is a critical task that involves identifying observations that do not conform to a learned model of normality. Prior work in deep AD is predominantly based on a familiarity hypothesis, where familiar features serve as the reference in a pre-trained embedding space. While this strategy has proven highly successful, it turns out that it causes consistent false negatives when a…

    Submitted 26 February, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

  32. arXiv:2309.17234

    cs.CL cs.CY cs.LG

    Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive Negotiation

    Authors: Sahar Abdelnabi, Amr Gomaa, Sarath Sivaprasad, Lea Schönherr, Mario Fritz

    Abstract: There is a growing interest in using Large Language Models (LLMs) in multi-agent systems to tackle interactive real-world tasks that require effective collaboration and assessing complex situations. Yet, we still have a limited understanding of LLMs' communication and decision-making abilities in multi-agent setups. The fundamental task of negotiation spans many key features of communication, suc…

    Submitted 10 June, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: Updated version with major additions (new experiments, evaluation, and attacks)

  33. arXiv:2309.16235

    physics.chem-ph cs.AI cs.CL cs.LG q-bio.BM

    Language models in molecular discovery

    Authors: Nikita Janakarajan, Tim Erdmann, Sarath Swaminathan, Teodoro Laino, Jannis Born

    Abstract: The success of language models, especially transformer-based architectures, has trickled into other domains giving rise to "scientific language models" that operate on small molecules, proteins or polymers. In chemistry, language models contribute to accelerating the molecule discovery cycle as evidenced by promising recent findings in early-stage drug discovery. Here, we review the role of langua…

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: Under review

  34. arXiv:2308.10284

    cs.LG cs.AI cs.MA

    Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi

    Authors: Hadi Nekoei, Xutong Zhao, Janarthanan Rajendran, Miao Liu, Sarath Chandar

    Abstract: Cooperative Multi-agent Reinforcement Learning (MARL) algorithms with Zero-Shot Coordination (ZSC) have gained significant attention in recent years. ZSC refers to the ability of agents to coordinate zero-shot (without additional interaction experience) with independently trained agents. While ZSC is crucial for cooperative MARL agents, it might not be possible for complex tasks and changing envir…

    Submitted 20 August, 2023; originally announced August 2023.

  35. arXiv:2307.16704

    cs.LG cs.AI

    Lookbehind-SAM: k steps back, 1 step forward

    Authors: Gonçalo Mordido, Pranshu Malviya, Aristide Baratin, Sarath Chandar

    Abstract: Sharpness-aware minimization (SAM) methods have gained increasing popularity by formulating the problem of minimizing both loss value and loss sharpness as a minimax objective. In this work, we increase the efficiency of the maximization and minimization parts of SAM's objective to achieve a better loss-sharpness trade-off. By taking inspiration from the Lookahead optimizer, which uses multiple de…

    Submitted 16 May, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: ICML 2024

  36. arXiv:2307.09638

    cs.LG cs.AI

    Promoting Exploration in Memory-Augmented Adam using Critical Momenta

    Authors: Pranshu Malviya, Gonçalo Mordido, Aristide Baratin, Reza Babanezhad Harikandeh, Jerry Huang, Simon Lacoste-Julien, Razvan Pascanu, Sarath Chandar

    Abstract: Adaptive gradient-based optimizers, notably Adam, have left their mark in training large-scale deep learning models, offering fast convergence and robustness to hyperparameter settings. However, they often struggle with generalization, attributed to their tendency to converge to sharp minima in the loss landscape. To address this, we propose a new memory-augmented version of Adam that encourages e…

    Submitted 17 June, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: Published in Transactions on Machine Learning Research

  37. arXiv:2306.17693

    cs.LG

    Thompson sampling for improved exploration in GFlowNets

    Authors: Jarrid Rector-Brooks, Kanika Madan, Moksh Jain, Maksym Korablyov, Cheng-Hao Liu, Sarath Chandar, Nikolay Malkin, Yoshua Bengio

    Abstract: Generative flow networks (GFlowNets) are amortized variational inference algorithms that treat sampling from a distribution over compositional objects as a sequential decision-making problem with a learnable action policy. Unlike other algorithms for hierarchical sampling that optimize a variational bound, GFlowNet algorithms can stably run off-policy, which can be advantageous for discovering mod…

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: Structured Probabilistic Inference and Generative Modeling (SPIGM) workshop @ ICML 2023

  38. arXiv:2306.11800

    cs.LG

    DynaQuant: Compressing Deep Learning Training Checkpoints via Dynamic Quantization

    Authors: Amey Agrawal, Sameer Reddy, Satwik Bhattamishra, Venkata Prabhakara Sarath Nookala, Vidushi Vashishth, Kexin Rong, Alexey Tumanov

    Abstract: With the increase in the scale of Deep Learning (DL) training workloads in terms of compute resources and time consumption, the likelihood of encountering in-training failures rises substantially, leading to lost work and resource wastage. Such failures are typically offset by a checkpointing mechanism, which comes at the cost of storage and network bandwidth overhead. State-of-the-art approaches…

    Submitted 2 September, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

  39. arXiv:2306.11066

    cs.CL cs.LG

    Adversarial Robustness of Prompt-based Few-Shot Learning for Natural Language Understanding

    Authors: Venkata Prabhakara Sarath Nookala, Gaurav Verma, Subhabrata Mukherjee, Srijan Kumar

    Abstract: State-of-the-art few-shot learning (FSL) methods leverage prompt-based fine-tuning to obtain remarkable results for natural language understanding (NLU) tasks. While much of the prior FSL methods focus on improving downstream task performance, there is a limited understanding of the adversarial robustness of such methods. In this work, we conduct an extensive study of several state-of-the-art FSL…

    Submitted 20 June, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted full paper at Findings of ACL 2023; Code available at https://github.com/claws-lab/few-shot-adversarial-robustness

  40. arXiv:2306.10051

    cs.DL cs.HC cs.IR

    TOBY: A Tool for Exploring Data in Academic Survey Papers

    Authors: Tathagata Chakraborti, Jungkoo Kang, Christian Muise, Sarath Sreedharan, Michael Walker, Daniel Szafir, Tom Williams

    Abstract: This paper describes TOBY, a visualization tool that helps a user explore the contents of an academic survey paper. The visualization consists of four components: a hierarchical view of taxonomic data in the survey, a document similarity view in the space of taxonomic classes, a network view of citations, and a new paper recommendation tool. In this paper, we will discuss these features in the con…

    Submitted 12 June, 2023; originally announced June 2023.

  41. arXiv:2305.15771

    cs.AI

    On the Planning Abilities of Large Language Models : A Critical Investigation

    Authors: Karthik Valmeekam, Matthew Marquez, Sarath Sreedharan, Subbarao Kambhampati

    Abstract: Intrigued by the claims of emergent reasoning capabilities in LLMs trained on general web corpora, in this paper, we set out to investigate their planning capabilities. We aim to evaluate (1) the effectiveness of LLMs in generating plans autonomously in commonsense planning tasks and (2) the potential of LLMs in LLM-Modulo settings where they act as a source of heuristic guidance for external plan…

    Submitted 6 November, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 Spotlight. arXiv admin note: substantial text overlap with arXiv:2206.10498

  42. arXiv:2305.14909

    cs.AI

    Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning

    Authors: Lin Guan, Karthik Valmeekam, Sarath Sreedharan, Subbarao Kambhampati

    Abstract: There is a growing interest in applying pre-trained large language models (LLMs) to planning problems. However, methods that use LLMs directly as planners are currently impractical due to several factors, including limited correctness of plans, strong reliance on feedback from interactions with simulators or even the actual environment, and the inefficiency in utilizing human feedback. In this wor…

    Submitted 1 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  43. arXiv:2305.14775

    cs.CL cs.AI cs.LG

    Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models

    Authors: Amirhossein Kazemnejad, Mehdi Rezagholizadeh, Prasanna Parthasarathi, Sarath Chandar

    Abstract: While pre-trained language models (PLMs) have shown evidence of acquiring vast amounts of knowledge, it remains unclear how much of this parametric knowledge is actually usable in performing downstream tasks. We propose a systematic framework to measure parametric knowledge utilization in PLMs. Our framework first extracts knowledge from a PLM's parameters and subsequently constructs a downstream…

    Submitted 24 May, 2023; originally announced May 2023.

  44. arXiv:2305.13088  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    Should We Attend More or Less? Modulating Attention for Fairness

    Authors: Abdelrahman Zayed, Goncalo Mordido, Samira Shabanian, Sarath Chandar

    Abstract: The abundance of annotated data in natural language processing (NLP) poses both opportunities and challenges. While it enables the development of high-performing models for a variety of tasks, it also poses the risk of models learning harmful biases from the data, such as gender stereotypes. In this work, we investigate the role of attention, a widely-used technique in current state-of-the-art NLP…

    Submitted 22 May, 2023; originally announced May 2023.

  45. On the Costs and Benefits of Adopting Lifelong Learning for Software Analytics -- Empirical Study on Brown Build and Risk Prediction

    Authors: Doriane Olewicki, Sarra Habchi, Mathieu Nayrolles, Mojtaba Faramarzi, Sarath Chandar, Bram Adams

    Abstract: Nowadays, software analytics tools using machine learning (ML) models to, for example, predict the risk of a code change are well established. However, as the goals of a project shift over time, and developers and their habits change, the performance of said models tends to degrade (drift) over time. Current retraining practices typically require retraining a new model from scratch on a large upda…

    Submitted 12 February, 2024; v1 submitted 16 May, 2023; originally announced May 2023.

    Journal ref: 46th International Conference on Software Engineering: Software Engineering in Practice 2024

  46. arXiv:2305.00474  [pdf, other]

    cs.SI econ.TH

    Learning, Diversity and Adaptation in Changing Environments: The Role of Weak Links

    Authors: Daron Acemoglu, Asuman Ozdaglar, Sarath Pattathil

    Abstract: Adaptation to dynamic conditions requires a certain degree of diversity. If all agents take the best current action, learning that the underlying state has changed and behavior should adapt will be slower. Diversity is harder to maintain when there is fast communication between agents, because they tend to find out and pursue the best action rapidly. We explore these issues using a model of (Bayes…

    Submitted 30 April, 2023; originally announced May 2023.

  47. arXiv:2303.13798  [pdf, other]

    cs.CV cs.RO

    2D Floor Plan Segmentation Based on Down-sampling

    Authors: Mohammadreza Sharif, Kiran Mohan, Sarath Suvarna

    Abstract: In recent years, floor plan segmentation has gained significant attention due to its wide range of applications in floor plan reconstruction and robotics. In this paper, we propose a novel 2D floor plan segmentation technique based on a down-sampling approach. Our method employs continuous down-sampling on a floor plan to maintain its structural information while reducing its complexity. We demons…

    Submitted 24 March, 2023; originally announced March 2023.

  48. arXiv:2303.09032  [pdf, other]

    cs.LG cs.MA

    Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning

    Authors: Xutong Zhao, Yangchen Pan, Chenjun Xiao, Sarath Chandar, Janarthanan Rajendran

    Abstract: Efficient exploration is critical in cooperative deep Multi-Agent Reinforcement Learning (MARL). In this work, we propose an exploration method that effectively encourages cooperative exploration based on the idea of sequential action-computation scheme. The high-level intuition is that to perform optimism-based exploration, agents would explore cooperative strategies if each agent's optimism esti…

    Submitted 13 July, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: Accepted at UAI 2023

  49. arXiv:2303.08690  [pdf, other]

    cs.LG cs.AI

    Replay Buffer with Local Forgetting for Adapting to Local Environment Changes in Deep Model-Based Reinforcement Learning

    Authors: Ali Rahimi-Kalahroudi, Janarthanan Rajendran, Ida Momennejad, Harm van Seijen, Sarath Chandar

    Abstract: One of the key behavioral characteristics used in neuroscience to determine whether the subject of study -- be it a rodent or a human -- exhibits model-based learning is effective adaptation to local changes in the environment, a particular form of adaptivity that is the focus of this work. In reinforcement learning, however, recent work has shown that modern deep model-based reinforcement-learnin…

    Submitted 27 September, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

  50. arXiv:2303.00822  [pdf, other]

    cs.AI

    Planning for Attacker Entrapment in Adversarial Settings

    Authors: Brittany Cates, Anagha Kulkarni, Sarath Sreedharan

    Abstract: In this paper, we propose a planning framework to generate a defense strategy against an attacker who is working in an environment where a defender can operate without the attacker's knowledge. The objective of the defender is to covertly guide the attacker to a trap state from which the attacker cannot achieve their goal. Further, the defender is constrained to achieve its goal within K number of…

    Submitted 5 April, 2023; v1 submitted 1 March, 2023; originally announced March 2023.