Skip to main content

Showing 1–1 of 1 results for author: Subhasish, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.19014  [pdf, other

    cs.LG

    Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption

    Authors: Bernd Frauenknecht, Artur Eisele, Devdutt Subhasish, Friedrich Solowjow, Sebastian Trimpe

    Abstract: Dyna-style model-based reinforcement learning (MBRL) combines model-free agents with predictive transition models through model-based rollouts. This combination raises a critical question: 'When to trust your model?'; i.e., which rollout length results in the model providing useful data? Janner et al. (2019) address this question by gradually increasing rollout lengths throughout the training. Whi… ▽ More

    Submitted 21 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.