Skip to main content

Showing 1–1 of 1 results for author: Sabirov, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19768  [pdf, other

    cs.LG

    Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors

    Authors: Emma Cramer, Bernd Frauenknecht, Ramil Sabirov, Sebastian Trimpe

    Abstract: Combining Reinforcement Learning (RL) with a prior controller can yield the best out of two worlds: RL can solve complex nonlinear problems, while the control prior ensures safer exploration and speeds up training. Prior work largely blends both components with a fixed weight, neglecting that the RL agent's performance varies with the training progress and across regions in the state space. Theref… ▽ More

    Submitted 1 July, 2024; v1 submitted 28 June, 2024; originally announced June 2024.