Skip to main content

Showing 1–3 of 3 results for author: Subbaraj, G

.
  1. arXiv:2304.13892  [pdf, other

    cs.LG cs.AI

    Discovering Object-Centric Generalized Value Functions From Pixels

    Authors: Somjit Nath, Gopeshh Raaj Subbaraj, Khimya Khetarpal, Samira Ebrahimi Kahou

    Abstract: Deep Reinforcement Learning has shown significant progress in extracting useful representations from high-dimensional inputs albeit using hand-crafted auxiliary tasks and pseudo rewards. Automatically learning such representations in an object-centric manner geared towards control and fast adaptation remains an open research problem. In this paper, we introduce a method that tries to discover mean… ▽ More

    Submitted 27 June, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted at ICML 2023

  2. arXiv:2304.00122  [pdf, other

    cs.RO

    Trajectory Control for Differential Drive Mobile Manipulators

    Authors: Harish Karunakaran, Gopeshh Raaj Subbaraj

    Abstract: Mobile manipulator systems are comprised of a mobile platform with one or more manipulators and are of great interest in a number of applications such as indoor warehouses, mining, construction, forestry etc. We present an approach for computing actuator commands for such systems so that they can follow desired end-effector and platform trajectories without the violation of the nonholonomic constr… ▽ More

    Submitted 31 March, 2023; originally announced April 2023.

    Comments: 9 pages

  3. arXiv:2112.07066  [pdf, other

    cs.LG

    Continual Learning In Environments With Polynomial Mixing Times

    Authors: Matthew Riemer, Sharath Chandra Raparthy, Ignacio Cases, Gopeshh Subbaraj, Maximilian Puelma Touzel, Irina Rish

    Abstract: The mixing time of the Markov chain induced by a policy limits performance in real-world continual learning scenarios. Yet, the effect of mixing times on learning in continual reinforcement learning (RL) remains underexplored. In this paper, we characterize problems that are of long-term interest to the development of continual RL, which we call scalable MDPs, through the lens of mixing times. In… ▽ More

    Submitted 13 October, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: Accepted at NeurIPS 2022