Skip to main content

Showing 1–2 of 2 results for author: Roice, K

.
  1. arXiv:2406.01562  [pdf, other

    cs.LG cs.AI

    A New View on Planning in Online Reinforcement Learning

    Authors: Kevin Roice, Parham Mohammad Panahi, Scott M. Jordan, Adam White, Martha White

    Abstract: This paper investigates a new approach to model-based reinforcement learning using background planning: mixing (approximate) dynamic programming updates and model-free updates, similar to the Dyna architecture. Background planning with learned models is often worse than model-free alternatives, such as Double DQN, even though the former uses significantly more memory and computation. The fundament… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Published in the Planning and Reinforcement Learning Workshop at ICAPS 2024. arXiv admin note: text overlap with arXiv:2206.02902

  2. arXiv:2206.02902  [pdf, other

    cs.LG cs.AI

    Goal-Space Planning with Subgoal Models

    Authors: Chunlok Lo, Kevin Roice, Parham Mohammad Panahi, Scott Jordan, Adam White, Gabor Mihucz, Farzane Aminmansour, Martha White

    Abstract: This paper investigates a new approach to model-based reinforcement learning using background planning: mixing (approximate) dynamic programming updates and model-free updates, similar to the Dyna architecture. Background planning with learned models is often worse than model-free alternatives, such as Double DQN, even though the former uses significantly more memory and computation. The fundament… ▽ More

    Submitted 27 February, 2024; v1 submitted 6 June, 2022; originally announced June 2022.