Skip to main content

Showing 1–1 of 1 results for author: Freeman, D C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2110.04686  [pdf, other

    cs.LG cs.AI

    Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization

    Authors: Shixiang Shane Gu, Manfred Diaz, Daniel C. Freeman, Hiroki Furuta, Seyed Kamyar Seyed Ghasemipour, Anton Raichuk, Byron David, Erik Frey, Erwin Coumans, Olivier Bachem

    Abstract: The goal of continuous control is to synthesize desired behaviors. In reinforcement learning (RL)-driven approaches, this is often accomplished through careful task reward engineering for efficient exploration and running an off-the-shelf RL algorithm. While reward maximization is at the core of RL, reward engineering is not the only -- sometimes nor the easiest -- way for specifying complex behav… ▽ More

    Submitted 9 October, 2021; originally announced October 2021.