Showing 1–1 of 1 results for author: Mitta, R

  1. arXiv:2312.11314  [pdf, other]

    cs.LG cs.LO eess.SY

    Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis

    Authors: Rohan Mitta, Hosein Hasanbeig, Jun Wang, Daniel Kroening, Yiannis Kantaros, Alessandro Abate

    Abstract: This paper addresses the problem of maintaining safety during training in Reinforcement Learning (RL), such that the safety constraint violations are bounded at any point during learning. In a variety of RL applications the safety of the agent is particularly important, e.g. autonomous platforms or robots that work in proximity of humans. As enforcing safety during training might severely limit th…

    Submitted 18 December, 2023; originally announced December 2023.