Skip to main content

Showing 1–1 of 1 results for author: Kapeluck, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19561  [pdf, other

    cs.LG cs.AI

    Meta-Gradient Search Control: A Method for Improving the Efficiency of Dyna-style Planning

    Authors: Bradley Burega, John D. Martin, Luke Kapeluck, Michael Bowling

    Abstract: We study how a Reinforcement Learning (RL) system can remain sample-efficient when learning from an imperfect model of the environment. This is particularly challenging when the learning system is resource-constrained and in continual settings, where the environment dynamics change. To address these challenges, our paper introduces an online, meta-gradient algorithm that tunes a probability with w… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.