Showing 1–1 of 1 results for author: Watahiki, H

  1. arXiv:1909.09540  [pdf, other]

    cs.LG stat.ML

    Reconnaissance and Planning algorithm for constrained MDP

    Authors: Shin-ichi Maeda, Hayato Watahiki, Shintarou Okada, Masanori Koyama

    Abstract: Practical reinforcement learning problems are often formulated as constrained Markov decision process (CMDP) problems, in which the agent has to maximize the expected return while satisfying a set of prescribed safety constraints. In this study, we propose a novel simulator-based method to approximately solve a CMDP problem without making any compromise on the safety constraints. We achieve this b…

    Submitted 20 September, 2019; originally announced September 2019.