Skip to main content

Showing 1–7 of 7 results for author: Dorka, N

.
  1. arXiv:2404.08755  [pdf, other

    cs.LG cs.AI cs.CV cs.HC

    Training a Vision Language Model as Smartphone Assistant

    Authors: Nicolai Dorka, Janusz Marecki, Ammar Anwar

    Abstract: Addressing the challenge of a digital assistant capable of executing a wide array of user tasks, our research focuses on the realm of instruction-based mobile device control. We leverage recent advancements in large language models (LLMs) and present a visual language model (VLM) that can fulfill diverse tasks on mobile devices. Our model functions by interacting solely with the user interface (UI… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: ICLR 2024 workshop on Generative Models for Decision Making

  2. arXiv:2303.11756  [pdf, other

    cs.RO cs.LG

    Improving Deep Dynamics Models for Autonomous Vehicles with Multimodal Latent Map** of Surfaces

    Authors: Johan Vertens, Nicolai Dorka, Tim Welschehold, Michael Thompson, Wolfram Burgard

    Abstract: The safe deployment of autonomous vehicles relies on their ability to effectively react to environmental changes. This can require maneuvering on varying surfaces which is still a difficult problem, especially for slippery terrains. To address this issue we propose a new approach that learns a surface-aware dynamics model by conditioning it on a latent variable vector storing surface information a… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

  3. arXiv:2303.10144  [pdf, other

    cs.LG stat.ML

    Dynamic Update-to-Data Ratio: Minimizing World Model Overfitting

    Authors: Nicolai Dorka, Tim Welschehold, Wolfram Burgard

    Abstract: Early stop** based on the validation set performance is a popular approach to find the right balance between under- and overfitting in the context of supervised learning. However, in reinforcement learning, even for supervised sub-problems such as world model learning, early stop** is not applicable as the dataset is continually evolving. As a solution, we propose a new general method that dyn… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

    Comments: ICLR 2023

  4. arXiv:2111.12673  [pdf, other

    cs.LG cs.AI cs.RO

    Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning

    Authors: Nicolai Dorka, Tim Welschehold, Joschka Boedecker, Wolfram Burgard

    Abstract: Accurate value estimates are important for off-policy reinforcement learning. Algorithms based on temporal difference learning typically are prone to an over- or underestimation bias building up over time. In this paper, we propose a general method called Adaptively Calibrated Critics (ACC) that uses the most recent high variance but unbiased on-policy rollouts to alleviate the bias of the low var… ▽ More

    Submitted 21 October, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: Submitted to RA-L

  5. arXiv:2011.08726  [pdf, other

    cs.LG cs.CV cs.RO

    Modality-Buffet for Real-Time Object Detection

    Authors: Nicolai Dorka, Johannes Meyer, Wolfram Burgard

    Abstract: Real-time object detection in videos using lightweight hardware is a crucial component of many robotic tasks. Detectors using different modalities and with varying computational complexities offer different trade-offs. One option is to have a very lightweight model that can predict from all modalities at once for each frame. However, in some situations (e.g., in static scenes) it might be better t… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: Accepted at the 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  6. arXiv:2007.02701  [pdf, other

    cs.LG cs.AI stat.ML

    Scaling Imitation Learning in Minecraft

    Authors: Artemij Amiranashvili, Nicolai Dorka, Wolfram Burgard, Vladlen Koltun, Thomas Brox

    Abstract: Imitation learning is a powerful family of techniques for learning sensorimotor coordination in immersive environments. We apply imitation learning to attain state-of-the-art performance on hard exploration problems in the Minecraft environment. We report experiments that highlight the influence of network architecture, loss function, and data augmentation. An early version of our approach reached… ▽ More

    Submitted 6 July, 2020; originally announced July 2020.

  7. arXiv:1903.07400  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Scheduled Intrinsic Drive: A Hierarchical Take on Intrinsically Motivated Exploration

    Authors: **gwei Zhang, Niklas Wetzel, Nicolai Dorka, Joschka Boedecker, Wolfram Burgard

    Abstract: Exploration in sparse reward reinforcement learning remains an open challenge. Many state-of-the-art methods use intrinsic motivation to complement the sparse extrinsic reward signal, giving the agent more opportunities to receive feedback during exploration. Commonly these signals are added as bonus rewards, which results in a mixture policy that neither conducts exploration nor task fulfillment… ▽ More

    Submitted 21 June, 2019; v1 submitted 18 March, 2019; originally announced March 2019.

    Comments: A video of our experimental results can be found at https://youtu.be/b0MbY3lUlEI