Showing 1–7 of 7 results for author: Waltz, M

  1. arXiv:2404.03710  [pdf, other]

    cs.LG cs.AI

    Self-organized arrival system for urban air mobility

    Authors: Martin Waltz, Ostap Okhrin, Michael Schultz

    Abstract: Urban air mobility is an innovative mode of transportation in which electric vertical takeoff and landing (eVTOL) vehicles operate between nodes called vertiports. We outline a self-organized vertiport arrival system based on deep reinforcement learning. The airspace around the vertiport is assumed to be circular, and the vehicles can freely operate inside. Each aircraft is considered an individua…

    Submitted 4 April, 2024; originally announced April 2024.

  2. arXiv:2311.16841  [pdf, other]

    cs.RO cs.AI

    Two-step dynamic obstacle avoidance

    Authors: Fabian Hart, Martin Waltz, Ostap Okhrin

    Abstract: Dynamic obstacle avoidance (DOA) is a fundamental challenge for any autonomous vehicle, independent of whether it operates in sea, air, or land. This paper proposes a two-step architecture for handling DOA tasks by combining supervised and reinforcement learning (RL). In the first step, we introduce a data-driven approach to estimate the collision risk of an obstacle using a recurrent neural netwo…

    Submitted 28 November, 2023; originally announced November 2023.

  3. arXiv:2307.16769  [pdf, other]

    eess.SY cs.AI

    2-Level Reinforcement Learning for Ships on Inland Waterways

    Authors: Martin Waltz, Niklas Paulig, Ostap Okhrin

    Abstract: This paper proposes a realistic modularized framework for controlling autonomous surface vehicles (ASVs) on inland waterways (IWs) based on deep reinforcement learning (DRL). The framework comprises two levels: a high-level local path planning (LPP) unit and a low-level path following (PF) unit, each consisting of a DRL agent. The LPP agent is responsible for planning a path under consideration of…

    Submitted 28 November, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

  4. arXiv:2211.01004  [pdf, other]

    cs.LG cs.RO

    Spatial-temporal recurrent reinforcement learning for autonomous ships

    Authors: Martin Waltz, Ostap Okhrin

    Abstract: This paper proposes a spatial-temporal recurrent neural network architecture for deep $Q$-networks that can be used to steer an autonomous ship. The network design makes it possible to handle an arbitrary number of surrounding target ships while offering robustness to partial observability. Furthermore, a state-of-the-art collision risk metric is proposed to enable an easier assessment of differen…

    Submitted 15 May, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

  5. arXiv:2203.10777  [pdf, other]

    q-fin.GN

    Vulnerability-CoVaR: Investigating the Crypto-market

    Authors: Martin Waltz, Abhay Kumar Singh, Ostap Okhrin

    Abstract: This paper proposes an important extension to Conditional Value-at-Risk (CoVaR), the popular systemic risk measure, and investigates its properties on the cryptocurrency market. The proposed Vulnerability-CoVaR (VCoVaR) is defined as the Value-at-Risk (VaR) of a financial system or institution, given that at least one other institution is equal or below its VaR. The VCoVaR relaxes normality assump…

    Submitted 21 March, 2022; originally announced March 2022.

  6. arXiv:2201.08078  [pdf, other]

    cs.LG

    Addressing Maximization Bias in Reinforcement Learning with Two-Sample Testing

    Authors: Martin Waltz, Ostap Okhrin

    Abstract: Value-based reinforcement-learning algorithms have shown strong results in games, robotics, and other real-world applications. Overestimation bias is a known threat to those algorithms and can lead to dramatic performance decreases or even complete algorithmic failure. We frame the bias problem statistically and consider it an instance of estimating the maximum expected value (MEV) of a set of ran…

    Submitted 18 October, 2023; v1 submitted 20 January, 2022; originally announced January 2022.

  7. arXiv:2112.12465  [pdf, other]

    cs.LG

    Missing Velocity in Dynamic Obstacle Avoidance based on Deep Reinforcement Learning

    Authors: Fabian Hart, Martin Waltz, Ostap Okhrin

    Abstract: We introduce a novel approach to dynamic obstacle avoidance based on Deep Reinforcement Learning by defining a traffic type independent environment with variable complexity. Filling a gap in the current literature, we thoroughly investigate the effect of missing velocity information on an agent's performance in obstacle avoidance tasks. This is a crucial issue in practice since several sensors yie…

    Submitted 28 December, 2021; v1 submitted 23 December, 2021; originally announced December 2021.