Skip to main content

Showing 1–8 of 8 results for author: Huegle, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2012.03234  [pdf, other

    cs.LG cs.RO

    Amortized Q-learning with Model-based Action Proposals for Autonomous Driving on Highways

    Authors: Branka Mirchevska, Maria Hügle, Gabriel Kalweit, Moritz Werling, Joschka Boedecker

    Abstract: Well-established optimization-based methods can guarantee an optimal trajectory for a short optimization horizon, typically no longer than a few seconds. As a result, choosing the optimal trajectory for this short horizon may still result in a sub-optimal long-term solution. At the same time, the resulting short-term trajectories allow for effective, comfortable and provable safe maneuvers in a dy… ▽ More

    Submitted 6 December, 2020; originally announced December 2020.

  2. A Dynamic Deep Neural Network For Multimodal Clinical Data Analysis

    Authors: Maria Hügle, Gabriel Kalweit, Thomas Huegle, Joschka Boedecker

    Abstract: Clinical data from electronic medical records, registries or trials provide a large source of information to apply machine learning methods in order to foster precision medicine, e.g. by finding new disease phenotypes or performing individual disease prediction. However, to take full advantage of deep learning methods on clinical data, architectures are necessary that 1) are robust with respect to… ▽ More

    Submitted 14 August, 2020; originally announced August 2020.

    Comments: Accepted at the AAAI 2020 International Workshop on Health Intelligence

  3. arXiv:2008.01712  [pdf, other

    cs.LG cs.RO stat.ML

    Deep Inverse Q-learning with Constraints

    Authors: Gabriel Kalweit, Maria Huegle, Moritz Werling, Joschka Boedecker

    Abstract: Popular Maximum Entropy Inverse Reinforcement Learning approaches require the computation of expected state visitation frequencies for the optimal policy under an estimate of the reward function. This usually requires intermediate value estimation in the inner loop of the algorithm, slowing down convergence considerably. In this work, we introduce a novel class of algorithms that only needs to sol… ▽ More

    Submitted 4 August, 2020; originally announced August 2020.

  4. arXiv:2003.09398  [pdf, other

    cs.LG cs.RO stat.ML

    Deep Constrained Q-learning

    Authors: Gabriel Kalweit, Maria Huegle, Moritz Werling, Joschka Boedecker

    Abstract: In many real world applications, reinforcement learning agents have to optimize multiple objectives while following certain rules or satisfying a list of constraints. Classical methods based on reward sha**, i.e. a weighted combination of different objectives in the reward signal, or Lagrangian methods, including constraints in the loss function, have no guarantees that the agent satisfies the c… ▽ More

    Submitted 14 September, 2020; v1 submitted 20 March, 2020; originally announced March 2020.

  5. arXiv:1909.13582  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Dynamic Interaction-Aware Scene Understanding for Reinforcement Learning in Autonomous Driving

    Authors: Maria Huegle, Gabriel Kalweit, Moritz Werling, Joschka Boedecker

    Abstract: The common pipeline in autonomous driving systems is highly modular and includes a perception component which extracts lists of surrounding objects and passes these lists to a high-level decision component. In this case, leveraging the benefits of deep reinforcement learning for high-level decision making requires special architectures to deal with multiple variable-length sequences of different o… ▽ More

    Submitted 30 September, 2019; originally announced September 2019.

  6. arXiv:1909.13518  [pdf, other

    cs.LG cs.AI stat.ML

    Composite Q-learning: Multi-scale Q-function Decomposition and Separable Optimization

    Authors: Gabriel Kalweit, Maria Huegle, Joschka Boedecker

    Abstract: In the past few years, off-policy reinforcement learning methods have shown promising results in their application for robot control. Deep Q-learning, however, still suffers from poor data-efficiency and is susceptible to stochasticity in the environment or reward functions which is limiting with regard to real-world applications. We alleviate these problems by proposing two novel off-policy Tempo… ▽ More

    Submitted 14 August, 2020; v1 submitted 30 September, 2019; originally announced September 2019.

  7. Dynamic Input for Deep Reinforcement Learning in Autonomous Driving

    Authors: Maria Hügle, Gabriel Kalweit, Branka Mirchevska, Moritz Werling, Joschka Boedecker

    Abstract: In many real-world decision making problems, reaching an optimal decision requires taking into account a variable number of objects around the agent. Autonomous driving is a domain in which this is especially relevant, since the number of cars surrounding the agent varies considerably over time and affects the optimal action to be taken. Classical methods that process object lists can deal with th… ▽ More

    Submitted 25 July, 2019; originally announced July 2019.

    Comments: Accepted at IROS 2019

  8. arXiv:1806.04549  [pdf, other

    stat.ML cs.LG stat.AP

    Early Seizure Detection with an Energy-Efficient Convolutional Neural Network on an Implantable Microcontroller

    Authors: Maria Hügle, Simon Heller, Manuel Watter, Manuel Blum, Farrokh Manzouri, Matthias Dümpelmann, Andreas Schulze-Bonhage, Peter Woias, Joschka Boedecker

    Abstract: Implantable, closed-loop devices for automated early detection and stimulation of epileptic seizures are promising treatment options for patients with severe epilepsy that cannot be treated with traditional means. Most approaches for early seizure detection in the literature are, however, not optimized for implementation on ultra-low power microcontrollers required for long-term implantation. In t… ▽ More

    Submitted 12 June, 2018; originally announced June 2018.

    Comments: Accepted at IJCNN 2018