Skip to main content

Showing 1–3 of 3 results for author: Goodall, A W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.00816  [pdf, other

    cs.LG cs.AI

    Leveraging Approximate Model-based Shielding for Probabilistic Safety Guarantees in Continuous Environments

    Authors: Alexander W. Goodall, Francesco Belardinelli

    Abstract: Shielding is a popular technique for achieving safe reinforcement learning (RL). However, classical shielding approaches come with quite restrictive assumptions making them difficult to deploy in complex environments, particularly those with continuous state or action spaces. In this paper we extend the more versatile approximate model-based shielding (AMBS) framework to the continuous setting. In… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted as an Extended Abstract at AAMAS 2024

  2. arXiv:2308.00707  [pdf, other

    cs.LG cs.AI eess.SY

    Approximate Model-Based Shielding for Safe Reinforcement Learning

    Authors: Alexander W. Goodall, Francesco Belardinelli

    Abstract: Reinforcement learning (RL) has shown great potential for solving complex tasks in a variety of domains. However, applying RL to safety-critical systems in the real-world is not easy as many algorithms are sample-inefficient and maximising the standard RL objective comes with no guarantees on worst-case performance. In this paper we propose approximate model-based shielding (AMBS), a principled lo… ▽ More

    Submitted 27 July, 2023; originally announced August 2023.

    Comments: Accepted at ECAI 2023 (main technical track)

  3. arXiv:2304.11104  [pdf, other

    cs.AI

    Approximate Shielding of Atari Agents for Safe Exploration

    Authors: Alexander W. Goodall, Francesco Belardinelli

    Abstract: Balancing exploration and conservatism in the constrained setting is an important problem if we are to use reinforcement learning for meaningful tasks in the real world. In this paper, we propose a principled algorithm for safe exploration based on the concept of shielding. Previous approaches to shielding assume access to a safety-relevant abstraction of the environment or a high-fidelity simulat… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: Accepted for presentation at the ALA workshop as part of AAMAS 2023