Showing 1–9 of 9 results for author: Seyde, T

  1. arXiv:2404.04253  [pdf, other]

    cs.LG cs.AI cs.RO

    Growing Q-Networks: Solving Continuous Control Tasks with Adaptive Control Resolution

    Authors: Tim Seyde, Peter Werner, Wilko Schwarting, Markus Wulfmeier, Daniela Rus

    Abstract: Recent reinforcement learning approaches have shown surprisingly strong capabilities of bang-bang policies for solving continuous control benchmarks. The underlying coarse action space discretizations often yield favourable exploration characteristics while final performance does not visibly suffer in the absence of action penalization in line with optimal control theory. In robotics applications,…

    Submitted 5 April, 2024; originally announced April 2024.

  2. arXiv:2212.11084  [pdf, other]

    cs.RO cs.AI

    Towards Cooperative Flight Control Using Visual-Attention

    Authors: Lianhao Yin, Makram Chahine, Tsun-Hsuan Wang, Tim Seyde, Chao Liu, Mathias Lechner, Ramin Hasani, Daniela Rus

    Abstract: The cooperation of a human pilot with an autonomous agent during flight control realizes parallel autonomy. We propose an air-guardian system that facilitates cooperation between a pilot with eye tracking and a parallel end-to-end neural control system. Our vision-based air-guardian system combines a causal continuous-depth neural network model with a cooperation layer to enable parallel autonomy…

    Submitted 20 September, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

  3. arXiv:2210.12566  [pdf, other]

    cs.LG cs.AI cs.RO

    Solving Continuous Control via Q-learning

    Authors: Tim Seyde, Peter Werner, Wilko Schwarting, Igor Gilitschenski, Martin Riedmiller, Daniela Rus, Markus Wulfmeier

    Abstract: While there has been substantial success for solving continuous control with actor-critic methods, simpler critic-only methods such as Q-learning find limited application in the associated high-dimensional action spaces. However, most actor-critic methods come at the cost of added complexity: heuristics for stabilisation, compute requirements and wider hyperparameter search spaces. We show that a…

    Submitted 25 September, 2023; v1 submitted 22 October, 2022; originally announced October 2022.

  4. arXiv:2210.06650  [pdf, other]

    cs.LG cs.AI cs.NE cs.RO stat.ML

    Interpreting Neural Policies with Disentangled Tree Representations

    Authors: Tsun-Hsuan Wang, Wei Xiao, Tim Seyde, Ramin Hasani, Daniela Rus

    Abstract: The advancement of robots, particularly those functioning in complex human-centric environments, relies on control solutions that are driven by machine learning. Understanding how learning-based controllers make decisions is crucial since robots are often safety-critical systems. This urges a formal and quantitative understanding of the explanatory factors in the interpretability of robot learning…

    Submitted 12 November, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

  5. arXiv:2205.09117  [pdf, other]

    cs.LG cs.RO eess.SY

    Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks

    Authors: Ryan Sander, Wilko Schwarting, Tim Seyde, Igor Gilitschenski, Sertac Karaman, Daniela Rus

    Abstract: Experience replay plays a crucial role in improving the sample efficiency of deep reinforcement learning agents. Recent advances in experience replay propose using Mixup (Zhang et al., 2018) to further improve sample efficiency via synthetic sample generation. We build upon this technique with Neighborhood Mixup Experience Replay (NMER), a geometrically-grounded replay buffer that interpolates tra…

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: Accepted to L4DC 2022

  6. arXiv:2111.02552  [pdf, other]

    cs.LG cs.AI cs.RO

    Is Bang-Bang Control All You Need? Solving Continuous Control with Bernoulli Policies

    Authors: Tim Seyde, Igor Gilitschenski, Wilko Schwarting, Bartolomeo Stellato, Martin Riedmiller, Markus Wulfmeier, Daniela Rus

    Abstract: Reinforcement learning (RL) for continuous control typically employs distributions whose support covers the entire action space. In this work, we investigate the colloquially known phenomenon that trained agents often prefer actions at the boundaries of that space. We draw theoretical connections to the emergence of bang-bang behavior in optimal control, and provide extensive empirical evaluation…

    Submitted 3 November, 2021; originally announced November 2021.

  7. arXiv:2102.09812  [pdf, other]

    cs.LG cs.AI cs.RO

    Deep Latent Competition: Learning to Race Using Visual Control Policies in Latent Space

    Authors: Wilko Schwarting, Tim Seyde, Igor Gilitschenski, Lucas Liebenwein, Ryan Sander, Sertac Karaman, Daniela Rus

    Abstract: Learning competitive behaviors in multi-agent settings such as racing requires long-term reasoning about potential adversarial interactions. This paper presents Deep Latent Competition (DLC), a novel reinforcement learning algorithm that learns competitive visual control policies through self-play in imagination. The DLC agent imagines multi-agent interaction sequences in the compact latent space…

    Submitted 19 February, 2021; originally announced February 2021.

    Comments: Wilko, Tim, and Igor contributed equally to this work; published in Conference on Robot Learning 2020

  8. arXiv:2010.14641  [pdf, other]

    cs.LG cs.AI cs.RO

    Learning to Plan Optimistically: Uncertainty-Guided Deep Exploration via Latent Model Ensembles

    Authors: Tim Seyde, Wilko Schwarting, Sertac Karaman, Daniela Rus

    Abstract: Learning complex robot behaviors through interaction requires structured exploration. Planning should target interactions with the potential to optimize long-term performance, while only reducing uncertainty where conducive to this objective. This paper presents Latent Optimistic Value Exploration (LOVE), a strategy that enables deep exploration through optimism in the face of uncertain long-term…

    Submitted 11 December, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

  9. arXiv:1903.03823  [pdf, other]

    cs.RO

    Locomotion Planning through a Hybrid Bayesian Trajectory Optimization

    Authors: Tim Seyde, Jan Carius, Ruben Grandia, Farbod Farshidian, Marco Hutter

    Abstract: Locomotion planning for legged systems requires reasoning about suitable contact schedules. The contact sequence and timings constitute a hybrid dynamical system and prescribe a subset of achievable motions. State-of-the-art approaches cast motion planning as an optimal control problem. In order to decrease computational complexity, one common strategy separates footstep planning from motion optim…

    Submitted 9 March, 2019; originally announced March 2019.

    Comments: Accepted for publication at the IEEE International Conference on Robotics and Automation (ICRA) 2019