Showing 1–5 of 5 results for author: Zentner, K R

  1. arXiv:2312.05405  [pdf, other]

    cs.LG

    Guaranteed Trust Region Optimization via Two-Phase KL Penalization

    Authors: K. R. Zentner, Ujjwal Puri, Zhehui Huang, Gaurav S. Sukhatme

    Abstract: On-policy reinforcement learning (RL) has become a popular framework for solving sequential decision problems due to its computational efficiency and theoretical simplicity. Some on-policy methods guarantee every policy update is constrained to a trust region relative to the prior policy to ensure training stability. These methods often require computationally intensive non-linear optimization or…

    Submitted 8 December, 2023; originally announced December 2023.

  2. arXiv:2310.17019  [pdf, other]

    cs.LG cs.CL cs.RO

    Conditionally Combining Robot Skills using Large Language Models

    Authors: K. R. Zentner, Ryan Julian, Brian Ichter, Gaurav S. Sukhatme

    Abstract: This paper combines two contributions. First, we introduce an extension of the Meta-World benchmark, which we call "Language-World," which allows a large language model to operate in a simulated robotic environment using semi-structured natural language queries and scripted skills described using natural language. By using the same set of tasks as Meta-World, Language-World results can be easily c…

    Submitted 25 October, 2023; originally announced October 2023.

  3. arXiv:2305.18738  [pdf, other]

    cs.LG cs.AI cs.RO

    Generating Behaviorally Diverse Policies with Latent Diffusion Models

    Authors: Shashank Hegde, Sumeet Batra, K. R. Zentner, Gaurav S. Sukhatme

    Abstract: Recent progress in Quality Diversity Reinforcement Learning (QD-RL) has enabled learning a collection of behaviorally diverse, high performing policies. However, these methods typically involve storing thousands of policies, which results in high space-complexity and poor scaling to additional behaviors. Condensing the archive into a single model while retaining the performance and coverage of the…

    Submitted 23 June, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  4. arXiv:2110.10255  [pdf, other]

    cs.LG cs.RO

    A Simple Approach to Continual Learning by Transferring Skill Parameters

    Authors: K. R. Zentner, Ryan Julian, Ujjwal Puri, Yulun Zhang, Gaurav S. Sukhatme

    Abstract: In order to be effective general purpose machines in real world environments, robots not only will need to adapt their existing manipulation skills to new circumstances, they will need to acquire entirely new skills on-the-fly. A great promise of continual learning is to endow robots with this ability, by using their accumulated knowledge and experience from prior skills. We take a fresh look at t…

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: Submitted to ICRA 2022

  5. arXiv:2106.13237  [pdf, other]

    cs.RO cs.AI

    Towards Exploiting Geometry and Time for Fast Off-Distribution Adaptation in Multi-Task Robot Learning

    Authors: K. R. Zentner, Ryan Julian, Ujjwal Puri, Yulun Zhang, Gaurav Sukhatme

    Abstract: We explore possible methods for multi-task transfer learning which seek to exploit the shared physical structure of robotics tasks. Specifically, we train policies for a base set of pre-training tasks, then experiment with adapting to new off-distribution tasks, using simple architectural approaches for re-using these policies as black-box priors. These approaches include learning an alignment of…

    Submitted 29 June, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

    Comments: Accepted to Challenges of Real World Reinforcement Learning, Virtual Workshop at NeurIPS 2020