Skip to main content

Showing 1–3 of 3 results for author: Ghugare, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.11237  [pdf, other

    cs.LG

    Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of View

    Authors: Raj Ghugare, Matthieu Geist, Glen Berseth, Benjamin Eysenbach

    Abstract: Some reinforcement learning (RL) algorithms can stitch pieces of experience to solve a task never seen before during training. This oft-sought property is one of the few ways in which RL methods based on dynamic-programming differ from RL methods based on supervised-learning (SL). Yet, certain RL methods based on off-the-shelf SL algorithms achieve excellent results without an explicit mechanism f… ▽ More

    Submitted 11 March, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

    Comments: ICLR 2024, Project code: https://github.com/RajGhugare19/stitching-is-combinatorial-generalisation

  2. arXiv:2310.02902  [pdf, other

    cs.LG cond-mat.mtrl-sci cs.AI

    Searching for High-Value Molecules Using Reinforcement Learning and Transformers

    Authors: Raj Ghugare, Santiago Miret, Adriana Hugessen, Mariano Phielipp, Glen Berseth

    Abstract: Reinforcement learning (RL) over text representations can be effective for finding high-value policies that can search over graphs. However, RL requires careful structuring of the search space and algorithm design to be effective in this challenge. Through extensive experiments, we explore how different design choices for text grammar and algorithmic choices for training can affect an RL policy's… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  3. arXiv:2209.08466  [pdf, other

    cs.LG cs.AI cs.RO

    Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective

    Authors: Raj Ghugare, Homanga Bharadhwaj, Benjamin Eysenbach, Sergey Levine, Ruslan Salakhutdinov

    Abstract: While reinforcement learning (RL) methods that learn an internal model of the environment have the potential to be more sample efficient than their model-free counterparts, learning to model raw observations from high dimensional sensors can be challenging. Prior work has addressed this challenge by learning low-dimensional representation of observations through auxiliary objectives, such as recon… ▽ More

    Submitted 24 June, 2023; v1 submitted 17 September, 2022; originally announced September 2022.

    Comments: ICLR 2023, Project website with code: https://alignedlatentmodels.github.io/