Showing 1–5 of 5 results for author: Semage, B L

  1. arXiv:2309.04085  [pdf, other]

    cs.RO cs.LG

    Sample-Efficient Co-Design of Robotic Agents Using Multi-fidelity Training on Universal Policy Network

    Authors: Kishan R. Nagiredla, Buddhika L. Semage, Thommen G. Karimpanal, Arun Kumar A. V, Santu Rana

    Abstract: Co-design involves simultaneously optimizing the controller and the agent's physical design. Its inherent bi-level optimization formulation necessitates an outer loop design optimization driven by an inner loop control optimization. This can be challenging when the design space is large and each design evaluation involves a data-intensive reinforcement learning process for control optimization. To improv…

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: 17 pages, 10 figures

  2. arXiv:2302.04013  [pdf, other]

    cs.LG cs.AI cs.RO

    Zero-shot Sim2Real Adaptation Across Environments

    Authors: Buddhika Laknath Semage, Thommen George Karimpanal, Santu Rana, Svetha Venkatesh

    Abstract: Simulation based learning often provides a cost-efficient recourse to reinforcement learning applications in robotics. However, simulators are generally incapable of accurately replicating real-world dynamics, and thus bridging the sim2real gap is an important problem in simulation based learning. Current solutions to bridge the sim2real gap involve hybrid simulators that are augmented with neural…

    Submitted 8 February, 2023; originally announced February 2023.

  3. arXiv:2202.05844  [pdf, other]

    cs.LG cs.AI eess.SY

    Uncertainty Aware System Identification with Universal Policies

    Authors: Buddhika Laknath Semage, Thommen George Karimpanal, Santu Rana, Svetha Venkatesh

    Abstract: Sim2real transfer is primarily concerned with transferring policies trained in simulation to potentially noisy real world environments. A common problem associated with sim2real transfer is estimating the real-world environmental parameters to ground the simulated environment to. Although existing methods such as Domain Randomisation (DR) can produce robust policies by sampling from a distribution…

    Submitted 11 February, 2022; originally announced February 2022.

  4. arXiv:2202.05843  [pdf, other]

    cs.LG cs.AI eess.SY

    Fast Model-based Policy Search for Universal Policy Networks

    Authors: Buddhika Laknath Semage, Thommen George Karimpanal, Santu Rana, Svetha Venkatesh

    Abstract: Adapting an agent's behaviour to new environments has been one of the primary focus areas of physics based reinforcement learning. Although recent approaches such as universal policy networks partially address this issue by enabling the storage of multiple policies trained in simulation on a wide range of dynamic/latent factors, efficiently identifying the most appropriate policy for a given envir…

    Submitted 11 February, 2022; originally announced February 2022.

  5. arXiv:2104.08795  [pdf, other]

    cs.LG cs.RO eess.SY

    Intuitive Physics Guided Exploration for Sample Efficient Sim2real Transfer

    Authors: Buddhika Laknath Semage, Thommen George Karimpanal, Santu Rana, Svetha Venkatesh

    Abstract: Physics-based reinforcement learning tasks can benefit from simplified physics simulators as they potentially allow near-optimal policies to be learned in simulation. However, such simulators require the latent factors (e.g. mass, friction coefficient etc.) of the associated objects and other environment-specific factors (e.g. wind speed, air density etc.) to be accurately specified, without which…

    Submitted 11 February, 2022; v1 submitted 18 April, 2021; originally announced April 2021.