Skip to main content

Showing 1–5 of 5 results for author: Vaidyanath, S

.
  1. arXiv:2308.13957  [pdf, other

    cs.CV cs.AI cs.LG

    Differentiable Weight Masks for Domain Transfer

    Authors: Samar Khanna, Skanda Vaidyanath, Akash Velu

    Abstract: One of the major drawbacks of deep learning models for computer vision has been their inability to retain multiple sources of information in a modular fashion. For instance, given a network that has been trained on a source task, we would like to re-train this network on a similar, yet different, target task while maintaining its performance on the source task. Simultaneously, researchers have ext… ▽ More

    Submitted 7 October, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: Published in Out of Distribution Generalization in Computer Vision (OOD-CV) workshop at ICCV 2023

  2. arXiv:2307.11897  [pdf, other

    cs.LG cs.AI

    Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning

    Authors: Akash Velu, Skanda Vaidyanath, Dilip Arumugam

    Abstract: Oftentimes, environments for sequential decision-making problems can be quite sparse in the provision of evaluative feedback to guide reinforcement-learning agents. In the extreme case, long trajectories of behavior are merely punctuated with a single terminal feedback signal, leading to a significant temporal delay between the observation of a non-trivial reward and the individual steps of behavi… ▽ More

    Submitted 18 August, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

  3. arXiv:2301.10289  [pdf, other

    cs.AI

    PushWorld: A benchmark for manipulation planning with tools and movable obstacles

    Authors: Ken Kansky, Skanda Vaidyanath, Scott Swingle, Xinghua Lou, Miguel Lazaro-Gredilla, Dileep George

    Abstract: While recent advances in artificial intelligence have achieved human-level performance in environments like Starcraft and Go, many physical reasoning tasks remain challenging for modern algorithms. To date, few algorithms have been evaluated on physical tasks that involve manipulating objects when movable obstacles are present and when tools must be used to perform the manipulation. To promote res… ▽ More

    Submitted 1 February, 2023; v1 submitted 24 January, 2023; originally announced January 2023.

  4. arXiv:2203.00054  [pdf, other

    cs.LG cs.AI cs.CL cs.RO

    LISA: Learning Interpretable Skill Abstractions from Language

    Authors: Divyansh Garg, Skanda Vaidyanath, Kuno Kim, Jiaming Song, Stefano Ermon

    Abstract: Learning policies that effectively utilize language instructions in complex, multi-task environments is an important problem in sequential decision-making. While it is possible to condition on the entire language instruction directly, such an approach could suffer from generalization issues. In our work, we propose \emph{Learning Interpretable Skill Abstractions (LISA)}, a hierarchical imitation l… ▽ More

    Submitted 6 December, 2022; v1 submitted 28 February, 2022; originally announced March 2022.

    Comments: NeurIPS 2022. Website: https://div99.github.io/LISA/

  5. arXiv:2112.02969  [pdf, other

    cs.SE cs.PL

    Jigsaw: Large Language Models meet Program Synthesis

    Authors: Naman Jain, Skanda Vaidyanath, Arun Iyer, Nagarajan Natarajan, Suresh Parthasarathy, Sriram Rajamani, Rahul Sharma

    Abstract: Large pre-trained language models such as GPT-3, Codex, and Google's language model are now capable of generating code from natural language specifications of programmer intent. We view these developments with a mixture of optimism and caution. On the optimistic side, such large language models have the potential to improve productivity by providing an automated AI pair programmer for every progra… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Accepted to ICSE'22