Skip to main content

Showing 1–5 of 5 results for author: Thumm, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03704  [pdf, other

    cs.LG eess.SY

    Excluding the Irrelevant: Focusing Reinforcement Learning through Continuous Action Masking

    Authors: Roland Stolz, Hanna Krasowski, Jakob Thumm, Michael Eichelbeck, Philipp Gassert, Matthias Althoff

    Abstract: Continuous action spaces in reinforcement learning (RL) are commonly defined as interval sets. While intervals usually reflect the action boundaries for tasks well, they can be challenging for learning because the typically large global action space leads to frequent exploration of irrelevant actions. Yet, little task knowledge can be sufficient to identify significantly smaller state-specific set… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2310.06208  [pdf, other

    cs.RO

    Human-Robot Gym: Benchmarking Reinforcement Learning in Human-Robot Collaboration

    Authors: Jakob Thumm, Felix Trost, Matthias Althoff

    Abstract: Deep reinforcement learning (RL) has shown promising results in robot motion planning with first attempts in human-robot collaboration (HRC). However, a fair comparison of RL approaches in HRC under the constraint of guaranteed safety is yet to be made. We, therefore, present human-robot gym, a benchmark suite for safe RL in HRC. Our benchmark suite provides eight challenging, realistic HRC tasks… ▽ More

    Submitted 25 June, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

  3. arXiv:2303.03339  [pdf, other

    cs.RO

    Reducing Safety Interventions in Provably Safe Reinforcement Learning

    Authors: Jakob Thumm, Guillaume Pelat, Matthias Althoff

    Abstract: Deep Reinforcement Learning (RL) has shown promise in addressing complex robotic challenges. In real-world applications, RL is often accompanied by failsafe controllers as a last resort to avoid catastrophic events. While necessary for safety, these interventions can result in undesirable behaviors, such as abrupt braking or aggressive steering. This paper proposes two safety intervention reductio… ▽ More

    Submitted 25 September, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: 8 pages, 6 figures

  4. arXiv:2205.06750  [pdf, other

    cs.LG

    Provably Safe Reinforcement Learning: Conceptual Analysis, Survey, and Benchmarking

    Authors: Hanna Krasowski, Jakob Thumm, Marlon Müller, Lukas Schäfer, Xiao Wang, Matthias Althoff

    Abstract: Ensuring the safety of reinforcement learning (RL) algorithms is crucial to unlock their potential for many real-world tasks. However, vanilla RL and most safe RL approaches do not guarantee safety. In recent years, several methods have been proposed to provide hard safety guarantees for RL, which is essential for applications where unsafe actions could have disastrous consequences. Nevertheless,… ▽ More

    Submitted 18 November, 2023; v1 submitted 13 May, 2022; originally announced May 2022.

    Comments: The published paper is available at https://openreview.net/forum?id=mcN0ezbnzO

    Journal ref: Transactions on Machine Learning Research, 2023

  5. arXiv:2205.06311  [pdf, other

    cs.RO cs.AI

    Provably Safe Deep Reinforcement Learning for Robotic Manipulation in Human Environments

    Authors: Jakob Thumm, Matthias Althoff

    Abstract: Deep reinforcement learning (RL) has shown promising results in the motion planning of manipulators. However, no method guarantees the safety of highly dynamic obstacles, such as humans, in RL-based manipulator control. This lack of formal safety assurances prevents the application of RL for manipulators in real-world human environments. Therefore, we propose a shielding mechanism that ensures ISO… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: Accepted for ICRA 2022