Search | arXiv e-print repository

How to Learn from Risk: Explicit Risk-Utility Reinforcement Learning for Efficient and Safe Driving Strategies

Authors: Lukas M. Schmidt, Sebastian Rietsch, Axel Plinge, Bjoern M. Eskofier, Christopher Mutschler

Abstract: Autonomous driving has the potential to revolutionize mobility and is hence an active area of research. In practice, the behavior of autonomous vehicles must be acceptable, i.e., efficient, safe, and interpretable. While vanilla reinforcement learning (RL) finds performant behavioral strategies, they are often unsafe and uninterpretable. Safety is introduced through Safe RL approaches, but they st… ▽ More Autonomous driving has the potential to revolutionize mobility and is hence an active area of research. In practice, the behavior of autonomous vehicles must be acceptable, i.e., efficient, safe, and interpretable. While vanilla reinforcement learning (RL) finds performant behavioral strategies, they are often unsafe and uninterpretable. Safety is introduced through Safe RL approaches, but they still mostly remain uninterpretable as the learned behaviour is jointly optimized for safety and performance without modeling them separately. Interpretable machine learning is rarely applied to RL. This paper proposes SafeDQN, which allows to make the behavior of autonomous vehicles safe and interpretable while still being efficient. SafeDQN offers an understandable, semantic trade-off between the expected risk and the utility of actions while being algorithmically transparent. We show that SafeDQN finds interpretable and safe driving policies for a variety of scenarios and demonstrate how state-of-the-art saliency techniques can help to assess both risk and utility. △ Less

Submitted 2 August, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

Comments: 8 pages, 5 figures

arXiv:2203.07676 [pdf, other]

An Introduction to Multi-Agent Reinforcement Learning and Review of its Application to Autonomous Mobility

Authors: Lukas M. Schmidt, Johanna Brosig, Axel Plinge, Bjoern M. Eskofier, Christopher Mutschler

Abstract: Many scenarios in mobility and traffic involve multiple different agents that need to cooperate to find a joint solution. Recent advances in behavioral planning use Reinforcement Learning to find effective and performant behavior strategies. However, as autonomous vehicles and vehicle-to-X communications become more mature, solutions that only utilize single, independent agents leave potential per… ▽ More Many scenarios in mobility and traffic involve multiple different agents that need to cooperate to find a joint solution. Recent advances in behavioral planning use Reinforcement Learning to find effective and performant behavior strategies. However, as autonomous vehicles and vehicle-to-X communications become more mature, solutions that only utilize single, independent agents leave potential performance gains on the road. Multi-Agent Reinforcement Learning (MARL) is a research field that aims to find optimal solutions for multiple agents that interact with each other. This work aims to give an overview of the field to researchers in autonomous mobility. We first explain MARL and introduce important concepts. Then, we discuss the central paradigms that underlie MARL algorithms, and give an overview of state-of-the-art methods and ideas in each paradigm. With this background, we survey applications of MARL in autonomous mobility scenarios and give an overview of existing scenarios and implementations. △ Less

Submitted 2 August, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

Comments: 8 pages, 2 figures

arXiv:2012.00795 [pdf, other]

The Exoplanet Transmission Spectroscopy Imager (ETSI)

Authors: Mary Anne Limbach, Luke M. Schmidt, D. L. DePoy, Jeffrey C. Mason, Mike Scobey, Pat Brown, Chelsea Taylor, Jennifer L. Marshall

Abstract: We present the design of a novel instrument tuned to detect transiting exoplanet atmospheres. The instrument, which we call the exoplanet transmission spectroscopy imager (ETSI), makes use of a new technique called common-path multi-band imaging (CMI). ETSI uses a prism and multi-band filter to simultaneously image 15 spectral bandpasses on two detectors from $430-975nm$ (with a average spectral r… ▽ More We present the design of a novel instrument tuned to detect transiting exoplanet atmospheres. The instrument, which we call the exoplanet transmission spectroscopy imager (ETSI), makes use of a new technique called common-path multi-band imaging (CMI). ETSI uses a prism and multi-band filter to simultaneously image 15 spectral bandpasses on two detectors from $430-975nm$ (with a average spectral resolution of $R = λ/Δλ= 23$) during exoplanet transits of a bright star. A prototype of the instrument achieved photon-noise limited results which were below the atmospheric amplitude scintillation noise limit. ETSI can detect the presence and composition of an exoplanet atmosphere in a relatively short time on a modest-size telescope. We show the optical design of the instrument. Further, we discuss design trades of the prism and multi-band filter which are driven by the science of the ETSI instrument. We describe the upcoming survey with ETSI that will measure dozens of exoplanet atmosphere spectra in $\sim2$ years on a two meter telescope. Finally, we will discuss how ETSI will be a powerful means for follow up on all gas giant exoplanets that transit bright stars, including a multitude of recently identified TESS (NASA's Transiting Exoplanet Survey Satellite) exoplanets. △ Less

Submitted 1 December, 2020; originally announced December 2020.

Comments: 14 pages, 9 figures, Proc. of SPIE 11447-100

Showing 1–3 of 3 results for author: Schmidt, L M