Skip to main content

Showing 1–4 of 4 results for author: Velu, A

.
  1. arXiv:2308.13957  [pdf, other

    cs.CV cs.AI cs.LG

    Differentiable Weight Masks for Domain Transfer

    Authors: Samar Khanna, Skanda Vaidyanath, Akash Velu

    Abstract: One of the major drawbacks of deep learning models for computer vision has been their inability to retain multiple sources of information in a modular fashion. For instance, given a network that has been trained on a source task, we would like to re-train this network on a similar, yet different, target task while maintaining its performance on the source task. Simultaneously, researchers have ext… ▽ More

    Submitted 7 October, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: Published in Out of Distribution Generalization in Computer Vision (OOD-CV) workshop at ICCV 2023

  2. arXiv:2307.11897  [pdf, other

    cs.LG cs.AI

    Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning

    Authors: Akash Velu, Skanda Vaidyanath, Dilip Arumugam

    Abstract: Oftentimes, environments for sequential decision-making problems can be quite sparse in the provision of evaluative feedback to guide reinforcement-learning agents. In the extreme case, long trajectories of behavior are merely punctuated with a single terminal feedback signal, leading to a significant temporal delay between the observation of a non-trivial reward and the individual steps of behavi… ▽ More

    Submitted 18 August, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

  3. arXiv:2201.00042  [pdf, other

    cs.NE cs.AI cs.LG q-bio.NC

    Avoiding Catastrophe: Active Dendrites Enable Multi-Task Learning in Dynamic Environments

    Authors: Abhiram Iyer, Karan Grewal, Akash Velu, Lucas Oliveira Souza, Jeremy Forest, Subutai Ahmad

    Abstract: A key challenge for AI is to build embodied systems that operate in dynamically changing environments. Such systems must adapt to changing task contexts and learn continuously. Although standard deep learning systems achieve state of the art results on static benchmarks, they often struggle in dynamic scenarios. In these settings, error signals from multiple contexts can interfere with one another… ▽ More

    Submitted 25 April, 2022; v1 submitted 31 December, 2021; originally announced January 2022.

    Comments: 31 pages, 17 figures

    Journal ref: Frontiers in Neurorobotics 16 2022 (1-23)

  4. arXiv:2103.01955  [pdf, other

    cs.LG cs.AI cs.MA

    The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games

    Authors: Chao Yu, Akash Velu, Eugene Vinitsky, Jiaxuan Gao, Yu Wang, Alexandre Bayen, Yi Wu

    Abstract: Proximal Policy Optimization (PPO) is a ubiquitous on-policy reinforcement learning algorithm but is significantly less utilized than off-policy learning algorithms in multi-agent settings. This is often due to the belief that PPO is significantly less sample efficient than off-policy methods in multi-agent systems. In this work, we carefully study the performance of PPO in cooperative multi-agent… ▽ More

    Submitted 4 November, 2022; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: This paper has been accepted by NeurIPS 2022 Datasets and Benchmarks