Showing 1–50 of 72 results for author: Trimpe, S

  1. arXiv:2406.19768  [pdf, other]

    cs.LG

    Contextualized Hybrid Ensemble Q-learning: Learning Fast with Control Priors

    Authors: Emma Cramer, Bernd Frauenknecht, Ramil Sabirov, Sebastian Trimpe

    Abstract: Combining Reinforcement Learning (RL) with a prior controller can yield the best out of two worlds: RL can solve complex nonlinear problems, while the control prior ensures safer exploration and speeds up training. Prior work largely blends both components with a fixed weight, neglecting that the RL agent's performance varies with the training progress and across regions in the state space. Theref… ▽ More

    Submitted 1 July, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2406.18293  [pdf, other]

    cs.LG cs.AI

    Combining Automated Optimisation of Hyperparameters and Reward Shape

    Authors: Julian Dierkes, Emma Cramer, Holger H. Hoos, Sebastian Trimpe

    Abstract: There has been significant progress in deep reinforcement learning (RL) in recent years. Nevertheless, finding suitable hyperparameter configurations and reward functions remains challenging even for experts, and performance heavily relies on these design choices. Also, most RL research is conducted on known benchmarks where knowledge about these choices already exists. However, novel practical ap… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Published in the Reinforcement Learning Journal 2024

  3. arXiv:2406.06101  [pdf, ps, other]

    cs.LG stat.ML

    On the Consistency of Kernel Methods with Dependent Observations

    Authors: Pierre-François Massiani, Sebastian Trimpe, Friedrich Solowjow

    Abstract: The consistency of a learning method is usually established under the assumption that the observations are a realization of an independent and identically distributed (i.i.d.) or mixing process. Yet, kernel methods such as support vector machines (SVMs), Gaussian processes, or conditional kernel mean embeddings (CKMEs) all give excellent performance under sampling schemes that are obviously non-i.… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 26 pages, 1 figure

  4. arXiv:2405.19014  [pdf, other]

    cs.LG

    Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption

    Authors: Bernd Frauenknecht, Artur Eisele, Devdutt Subhasish, Friedrich Solowjow, Sebastian Trimpe

    Abstract: Dyna-style model-based reinforcement learning (MBRL) combines model-free agents with predictive transition models through model-based rollouts. This combination raises a critical question: 'When to trust your model?'; i.e., which rollout length results in the model providing useful data? Janner et al. (2019) address this question by gradually increasing rollout lengths throughout the training. Whi… ▽ More

    Submitted 21 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  5. arXiv:2405.10618  [pdf, other]

    cs.LG math.OC stat.ML

    Distributed Event-Based Learning via ADMM

    Authors: Guner Dilsad Er, Sebastian Trimpe, Michael Muehlebach

    Abstract: We consider a distributed learning problem, where agents minimize a global objective function by exchanging information over a network. Our approach has two distinct features: (i) It substantially reduces communication by triggering communication only when necessary, and (ii) it is agnostic to the data-distribution among the different agents. We can therefore guarantee convergence even if the loca… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 29 pages, 12 figures

  6. arXiv:2404.05835  [pdf, other]

    eess.SY cs.LG math.OC

    Parameter-Adaptive Approximate MPC: Tuning Neural-Network Controllers without Retraining

    Authors: Henrik Hose, Alexander Gräfe, Sebastian Trimpe

    Abstract: Model Predictive Control (MPC) is a method to control nonlinear systems with guaranteed stability and constraint satisfaction but suffers from high computation times. Approximate MPC (AMPC) with neural networks (NNs) has emerged to address this limitation, enabling deployment on resource-constrained embedded systems. However, when tuning AMPCs for real-world systems, large datasets need to be rege… ▽ More

    Submitted 6 June, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted to L4DC 2024

  7. arXiv:2403.12948  [pdf, other]

    cs.LG stat.ML

    On Safety in Safe Bayesian Optimization

    Authors: Christian Fiedler, Johanna Menn, Lukas Kreisköther, Sebastian Trimpe

    Abstract: Optimizing an unknown function under safety constraints is a central task in robotics, biomedical engineering, and many other disciplines, and increasingly safe Bayesian Optimization (BO) is used for this. Due to the safety critical nature of these applications, it is of utmost importance that theoretical safety guarantees for these algorithms translate into the real world. In this work, we invest… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  8. arXiv:2312.10199  [pdf, other]

    eess.SY cs.LG math.OC

    Automatic nonlinear MPC approximation with closed-loop guarantees

    Authors: Abdullah Tokmak, Christian Fiedler, Melanie N. Zeilinger, Sebastian Trimpe, Johannes Köhler

    Abstract: Safety guarantees are vital in many control applications, such as robotics. Model predictive control (MPC) provides a constructive framework for controlling safety-critical systems, but is limited by its computational complexity. We address this problem by presenting a novel algorithm that automatically computes an explicit approximation to nonlinear MPC schemes while retaining closed-loop guarant… ▽ More

    Submitted 11 April, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Submitted to IEEE Transactions on Automatic Control. Compared to the previously uploaded version, this version contains an additional numerical example

  9. arXiv:2312.00592  [pdf, other]

    cs.LG cs.CV cs.RO

    Tracking Object Positions in Reinforcement Learning: A Metric for Keypoint Detection (extended version)

    Authors: Emma Cramer, Jonas Reiher, Sebastian Trimpe

    Abstract: Reinforcement learning (RL) for robot control typically requires a detailed representation of the environment state, including information about task-relevant objects not directly measurable. Keypoint detectors, such as spatial autoencoders (SAEs), are a common approach to extracting a low-dimensional representation from high-dimensional image data. SAEs aim at spatial features such as object posi… ▽ More

    Submitted 2 July, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: 19 pages, 12 figures

  10. arXiv:2311.18393  [pdf, other]

    cs.LG cs.RO

    Data-efficient Deep Reinforcement Learning for Vehicle Trajectory Control

    Authors: Bernd Frauenknecht, Tobias Ehlgen, Sebastian Trimpe

    Abstract: Advanced vehicle control is a fundamental building block in the development of autonomous driving systems. Reinforcement learning (RL) promises to achieve control performance superior to classical approaches while keeping computational demands low during deployment. However, standard RL approaches like soft actor-critic (SAC) require extensive amounts of training data to be collected and are thus… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  11. arXiv:2310.18074  [pdf, ps, other]

    cs.LG math.ST stat.ML

    On kernel-based statistical learning in the mean field limit

    Authors: Christian Fiedler, Michael Herty, Sebastian Trimpe

    Abstract: In many applications of machine learning, a large number of variables are considered. Motivated by machine learning of interacting particle systems, we consider the situation when the number of input variables goes to infinity. First, we continue the recent investigation of the mean field limit of kernels and their reproducing kernel Hilbert spaces, completing the existing theory. Next, we provide… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  12. arXiv:2309.02873  [pdf, other]

    cs.LG

    Learning Hybrid Dynamics Models With Simulator-Informed Latent States

    Authors: Katharina Ensinger, Sebastian Ziesche, Sebastian Trimpe

    Abstract: Dynamics model learning deals with the task of inferring unknown dynamics from measurement data and predicting the future behavior of the system. A typical approach to address this problem is to train recurrent models. However, predictions with these models are often not physically meaningful. Further, they suffer from deteriorated behavior over time due to accumulating errors. Often, simulators b… ▽ More

    Submitted 29 January, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: Accepted at The 38th Annual AAAI Conference on Artificial Intelligence, 2024

  13. arXiv:2309.02351  [pdf, other]

    cs.LG stat.ML

    Exact Inference for Continuous-Time Gaussian Process Dynamics

    Authors: Katharina Ensinger, Nicholas Tagliapietra, Sebastian Ziesche, Sebastian Trimpe

    Abstract: Physical systems can often be described via a continuous-time dynamical system. In practice, the true system is often unknown and has to be learned from measurement data. Since data is typically collected in discrete time, e.g. by sensors, most methods in Gaussian process (GP) dynamics model learning are trained on one-step ahead predictions. This can become problematic in several scenarios, e.g.… ▽ More

    Submitted 29 January, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted at The 38th Annual AAAI Conference on Artificial Intelligence, 2024

  14. arXiv:2308.06022  [pdf, other]

    cs.CV

    Scale-Preserving Automatic Concept Extraction (SPACE)

    Authors: Andrés Felipe Posada-Moreno, Lukas Kreisköther, Tassilo Glander, Sebastian Trimpe

    Abstract: Convolutional Neural Networks (CNN) have become a common choice for industrial quality control, as well as other critical applications in the Industry 4.0. When these CNNs behave in ways unexpected to human users or developers, severe consequences can arise, such as economic losses or an increased risk to human life. Concept extraction techniques can be applied to increase the reliability and tran… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 22 pages, 7 figures

  15. arXiv:2307.07975  [pdf, other]

    cs.RO cs.LG

    Pseudo-rigid body networks: learning interpretable deformable object dynamics from partial observations

    Authors: Shamil Mamedov, A. René Geist, Jan Swevers, Sebastian Trimpe

    Abstract: Accurate prediction of deformable linear object (DLO) dynamics is challenging if the task at hand requires a human-interpretable yet computationally fast model. In this work, we draw inspiration from the pseudo-rigid body method (PRB) and model a DLO as a serial chain of rigid bodies whose internal state is unrolled through time by a dynamics network. This dynamics network is trained jointly with… ▽ More

    Submitted 20 March, 2024; v1 submitted 16 July, 2023; originally announced July 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  16. arXiv:2306.16973  [pdf, other]

    cs.RO eess.SY

    Robust Direct Data-Driven Control for Probabilistic Systems

    Authors: Alexander von Rohr, Dmitrii Likhachev, Sebastian Trimpe

    Abstract: We propose a data-driven control method for systems with aleatoric uncertainty, for example, robot fleets with variations between agents. Our method leverages shared trajectory data to increase the robustness of the designed controller and thus facilitate transfer to new variations without the need for prior parameter and uncertainty estimations. In contrast to existing work on experience transfer… ▽ More

    Submitted 22 March, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

  17. arXiv:2306.03551  [pdf, other]

    cs.AI cs.CV cs.LG

    Scalable Concept Extraction in Industry 4.0

    Authors: Andrés Felipe Posada-Moreno, Kai Müller, Florian Brillowski, Friedrich Solowjow, Thomas Gries, Sebastian Trimpe

    Abstract: The industry 4.0 is leveraging digital technologies and machine learning techniques to connect and optimize manufacturing processes. Central to this idea is the ability to transform raw data into human understandable knowledge for reliable data-driven decision-making. Convolutional Neural Networks (CNNs) have been instrumental in processing image data, yet, their ``black box'' nature complicates t… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  18. arXiv:2304.09575  [pdf, other]

    eess.SY cs.LG math.OC

    Approximate non-linear model predictive control with safety-augmented neural networks

    Authors: Henrik Hose, Johannes Köhler, Melanie N. Zeilinger, Sebastian Trimpe

    Abstract: Model predictive control (MPC) achieves stability and constraint satisfaction for general nonlinear systems, but requires computationally expensive online optimization. This paper studies approximations of such MPC controllers via neural networks (NNs) to achieve fast online evaluation. We propose safety augmentation that yields deterministic guarantees for convergence and constraint satisfaction… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

  19. Multimodal Multi-User Surface Recognition with the Kernel Two-Sample Test

    Authors: Behnam Khojasteh, Friedrich Solowjow, Sebastian Trimpe, Katherine J. Kuchenbecker

    Abstract: Machine learning and deep learning have been used extensively to classify physical surfaces through images and time-series contact data. However, these methods rely on human expertise and entail the time-consuming processes of data and parameter tuning. To overcome these challenges, we propose an easily implemented framework that can directly handle heterogeneous data sources for classification ta… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

  20. arXiv:2302.14446  [pdf, ps, other]

    stat.ML cs.LG math.FA math.NA

    Reproducing kernel Hilbert spaces in the mean field limit

    Authors: Christian Fiedler, Michael Herty, Michael Rom, Chiara Segala, Sebastian Trimpe

    Abstract: Kernel methods, being supported by a well-developed theory and coming with efficient algorithms, are among the most popular and successful machine learning techniques. From a mathematical point of view, these methods rest on the concept of kernels and function spaces generated by kernels, so called reproducing kernel Hilbert spaces. Motivated by recent developments of learning approaches in the co… ▽ More

    Submitted 17 March, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: Updated author email addresses

    MSC Class: 46E22; 82B40; 74A25; 82C40

  21. arXiv:2302.13754  [pdf, other]

    cs.LG

    Combining Slow and Fast: Complementary Filtering for Dynamics Learning

    Authors: Katharina Ensinger, Sebastian Ziesche, Barbara Rakitsch, Michael Tiemann, Sebastian Trimpe

    Abstract: Modeling an unknown dynamical system is crucial in order to predict the future behavior of the system. A standard approach is training recurrent models on measurement data. While these models typically provide exact short-term predictions, accumulating errors yield deteriorated long-term behavior. In contrast, models with reliable long-term predictions can often be obtained, either by training a r… ▽ More

    Submitted 1 March, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  22. Data-Driven Observability Analysis for Nonlinear Stochastic Systems

    Authors: Pierre-François Massiani, Mona Buisson-Fenet, Friedrich Solowjow, Florent Di Meglio, Sebastian Trimpe

    Abstract: Distinguishability and, by extension, observability are key properties of dynamical systems. Establishing these properties is challenging, especially when no analytical model is available and they are to be inferred directly from measurement data. The presence of noise further complicates this analysis, as standard notions of distinguishability are tailored to deterministic systems. We build on di… ▽ More

    Submitted 7 June, 2024; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: 9 pages, 3 figures

    Journal ref: IEEE Transactions on Automatic Control 69 (2023) 4042 -- 4049

  23. arXiv:2208.10790  [pdf, other]

    cs.LG stat.ML

    Event-Triggered Time-Varying Bayesian Optimization

    Authors: Paul Brunzema, Alexander von Rohr, Friedrich Solowjow, Sebastian Trimpe

    Abstract: We consider the problem of sequentially optimizing a time-varying objective function using time-varying Bayesian optimization (TVBO). To cope with stale data arising from time variations, current approaches to TVBO require prior knowledge of a constant rate of change. However, in practice, the rate of change is usually unknown. We propose an event-triggered algorithm, ET-GP-UCB, that treats the op… ▽ More

    Submitted 6 June, 2024; v1 submitted 23 August, 2022; originally announced August 2022.

  24. Learning Fast and Precise Pixel-to-Torque Control

    Authors: Steffen Bleher, Steve Heim, Sebastian Trimpe

    Abstract: In the field, robots often need to operate in unknown and unstructured environments, where accurate sensing and state estimation (SE) becomes a major challenge. Cameras have been used to great success in mapping and planning in such environments, as well as complex but quasi-static tasks such as grasping, but are rarely integrated into the control loop for unstable systems. Learning pixel-to-torqu… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: video: https://www.youtube.com/watch?v=S2llScfG-8E 9 pages. Published in Robotics and Automation Magazine

  25. Improving the Performance of Robust Control through Event-Triggered Learning

    Authors: Alexander von Rohr, Friedrich Solowjow, Sebastian Trimpe

    Abstract: Robust controllers ensure stability in feedback loops designed under uncertainty but at the cost of performance. Model uncertainty in time-invariant systems can be reduced by recently proposed learning-based methods, which improve the performance of robust controllers using data. However, in practice, many systems also exhibit uncertainty in the form of changes over time, e.g., due to weight shift… ▽ More

    Submitted 21 September, 2022; v1 submitted 28 July, 2022; originally announced July 2022.

    Comments: To appear in the proceedings of the 61st IEEE Conference on Decision and Control

  26. On Controller Tuning with Time-Varying Bayesian Optimization

    Authors: Paul Brunzema, Alexander von Rohr, Sebastian Trimpe

    Abstract: Changing conditions or environments can cause system dynamics to vary over time. To ensure optimal control performance, controllers should adapt to these changes. When the underlying cause and time of change is unknown, we need to rely on online data for this adaptation. In this paper, we will use time-varying Bayesian optimization (TVBO) to tune controllers online in changing environments using a… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: To appear in the proceedings of the 61st IEEE Conference on Decision and Control

    Journal ref: IEEE 61st Conference on Decision and Control (2022), p. 4046-4052

  27. arXiv:2207.06988  [pdf, other]

    cs.RO

    The Wheelbot: A Jumping Reaction Wheel Unicycle

    Authors: A. René Geist, Jonathan Fiene, Naomi Tashiro, Zheng Jia, Sebastian Trimpe

    Abstract: Combining off-the-shelf components with 3D-printing, the Wheelbot is a symmetric reaction wheel unicycle that can jump onto its wheels from any initial position. With non-holonomic and under-actuated dynamics, as well as two coupled unstable degrees of freedom, the Wheelbot provides a challenging platform for nonlinear and data-driven control research. This paper presents the Wheelbot's mechanical… ▽ More

    Submitted 23 July, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: Erratum: In the initial publication, Equation (3) was wrong and has been corrected in this version. Equation (3) relates to the transform from averaged body rates ${}^{\text{B}}ω_i$ to Euler rates. Importantly, the results in this paper are not affected by the wrong transform. More details are found in the project's GitHub repo: https://github.com/AndReGeist/wheelbot-v2.5

  28. arXiv:2206.04531  [pdf, other]

    cs.CV cs.AI cs.LG

    ECLAD: Extracting Concepts with Local Aggregated Descriptors

    Authors: Andres Felipe Posada-Moreno, Nikita Surya, Sebastian Trimpe

    Abstract: Convolutional neural networks (CNNs) are increasingly being used in critical systems, where robustness and alignment are crucial. In this context, the field of explainable artificial intelligence has proposed the generation of high-level explanations of the prediction process of CNNs through concept extraction. While these methods can detect whether or not a concept is present in an image, they ar… ▽ More

    Submitted 11 August, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 34 pages, under review

    MSC Class: 68T01 ACM Class: I.2.10; I.2.m

  29. arXiv:2205.12550  [pdf, other]

    eess.SY cs.LG

    Recognition Models to Learn Dynamics from Partial Observations with Neural ODEs

    Authors: Mona Buisson-Fenet, Valery Morgenthaler, Sebastian Trimpe, Florent Di Meglio

    Abstract: Identifying dynamical systems from experimental data is a notably difficult task. Prior knowledge generally helps, but the extent of this knowledge varies with the application, and customized models are often needed. Neural ordinary differential equations can be written as a flexible framework for system identification and can incorporate a broad spectrum of physical insight, giving physical inter… ▽ More

    Submitted 12 January, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

  30. arXiv:2202.06052  [pdf, other]

    cs.LG cs.RO eess.SY stat.ME stat.ML

    Learning by Doing: Controlling a Dynamical System using Causality, Control, and Reinforcement Learning

    Authors: Sebastian Weichwald, Søren Wengel Mogensen, Tabitha Edith Lee, Dominik Baumann, Oliver Kroemer, Isabelle Guyon, Sebastian Trimpe, Jonas Peters, Niklas Pfister

    Abstract: Questions in causality, control, and reinforcement learning go beyond the classical machine learning task of prediction under i.i.d. observations. Instead, these fields consider the problem of learning how to actively perturb a system to achieve a certain effect on a response variable. Arguably, they have complementary views on the problem: In control, one usually aims to first identify the system… ▽ More

    Submitted 12 February, 2022; originally announced February 2022.

    Comments: https://learningbydoingcompetition.github.io/

  31. GoSafeOpt: Scalable Safe Exploration for Global Optimization of Dynamical Systems

    Authors: Bhavya Sukhija, Matteo Turchetta, David Lindner, Andreas Krause, Sebastian Trimpe, Dominik Baumann

    Abstract: Learning optimal control policies directly on physical systems is challenging since even a single failure can lead to costly hardware damage. Most existing model-free learning methods that guarantee safety, i.e., no failures, during exploration are limited to local optima. A notable exception is the GoSafe algorithm, which, unfortunately, cannot handle high-dimensional systems and hence cannot be… ▽ More

    Submitted 12 June, 2023; v1 submitted 24 January, 2022; originally announced January 2022.

    Journal ref: Artificial Intelligence, Volume 320, Year 2023

  32. arXiv:2106.11899  [pdf, other]

    cs.LG stat.ML

    Local policy search with Bayesian optimization

    Authors: Sarah Müller, Alexander von Rohr, Sebastian Trimpe

    Abstract: Reinforcement learning (RL) aims to find an optimal policy by interaction with an environment. Consequently, learning complex behavior requires a vast number of samples, which can be prohibitive in practice. Nevertheless, instead of systematically reasoning and actively choosing informative samples, policy gradients for local search are often obtained from random perturbations. These random sample… ▽ More

    Submitted 22 November, 2021; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: Presented at the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS), 2021

  33. arXiv:2105.13281  [pdf, other]

    cs.RO cs.LG eess.SY

    GoSafe: Globally Optimal Safe Robot Learning

    Authors: Dominik Baumann, Alonso Marco, Matteo Turchetta, Sebastian Trimpe

    Abstract: When learning policies for robotic systems from data, safety is a major concern, as violation of safety constraints may cause hardware damage. SafeOpt is an efficient Bayesian optimization (BO) algorithm that can learn policies while guaranteeing safety with high probability. However, its search space is limited to an initially given safe region. We extend this method by exploring outside the init… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

  34. arXiv:2105.12204  [pdf, other]

    eess.SY cs.LG cs.RO

    Safe Value Functions

    Authors: Pierre-François Massiani, Steve Heim, Friedrich Solowjow, Sebastian Trimpe

    Abstract: Safety constraints and optimality are important, but sometimes conflicting criteria for controllers. Although these criteria are often solved separately with different tools to maintain formal guarantees, it is also common practice in reinforcement learning to simply modify reward functions by penalizing failures, with the penalty treated as a mere heuristic. We rigorously examine the relationship… ▽ More

    Submitted 1 December, 2022; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: 16 pages, 6 figures. Accepted for publication in: Transactions on Automatic Control, special issue on Learning and Control

    Journal ref: IEEE Transactions on Automatic Control 68, Issue 5 (2023) 2743 -- 2757

  35. arXiv:2105.07668  [pdf, other]

    eess.SY cs.LG

    Probabilistic Robust Linear Quadratic Regulators with Gaussian Processes

    Authors: Alexander von Rohr, Matthias Neumann-Brosig, Sebastian Trimpe

    Abstract: Probabilistic models such as Gaussian processes (GPs) are powerful tools to learn unknown dynamical systems from data for subsequent use in control design. While learning-based control has the potential to yield superior performance in demanding applications, robustness to uncertainty remains an important challenge. Since Bayesian methods quantify uncertainty of the learning results, it is natural… ▽ More

    Submitted 21 September, 2022; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: Published in the proceedings of the 3rd Conference on Learning for Dynamics and Control, this version fixes a typo in Algorithm 1

  36. arXiv:2105.03397  [pdf, other]

    eess.SY cs.LG stat.ML

    Learning-enhanced robust controller synthesis with rigorous statistical and control-theoretic guarantees

    Authors: Christian Fiedler, Carsten W. Scherer, Sebastian Trimpe

    Abstract: The combination of machine learning with control offers many opportunities, in particular for robust control. However, due to strong safety and reliability requirements in many real-world applications, providing rigorous statistical and control-theoretic guarantees is of utmost importance, yet difficult to achieve for learning-based control schemes. We present a general framework for learning-enha… ▽ More

    Submitted 7 May, 2021; originally announced May 2021.

  37. arXiv:2105.02796  [pdf, other]

    cs.LG eess.SY stat.ML

    Practical and Rigorous Uncertainty Bounds for Gaussian Process Regression

    Authors: Christian Fiedler, Carsten W. Scherer, Sebastian Trimpe

    Abstract: Gaussian Process Regression is a popular nonparametric regression method based on Bayesian principles that provides uncertainty estimates for its predictions. However, these estimates are of a Bayesian nature, whereas for some important applications, like learning-based control with safety guarantees, frequentist uncertainty bounds are required. Although such rigorous bounds are available for Gaus… ▽ More

    Submitted 8 August, 2023; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: Contains supplementary material and corrections to the original version

  38. arXiv:2104.07989  [pdf, other]

    eess.SY cs.MA cs.NI

    Scaling Beyond Bandwidth Limitations: Wireless Control With Stability Guarantees Under Overload

    Authors: Fabian Mager, Dominik Baumann, Carsten Herrmann, Sebastian Trimpe, Marco Zimmerling

    Abstract: An important class of cyber-physical systems relies on multiple agents that jointly perform a task by coordinating their actions over a wireless network. Examples include self-driving cars in intelligent transportation and production robots in smart manufacturing. However, the scalability of existing control-over-wireless solutions is limited as they cannot resolve overload situations in which the… ▽ More

    Submitted 25 January, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: Accepted article to appear in ACM Transactions on Cyber-Physical Systems

  39. arXiv:2102.01606  [pdf, other]

    cs.LG

    Structure-preserving Gaussian Process Dynamics

    Authors: Katharina Ensinger, Friedrich Solowjow, Sebastian Ziesche, Michael Tiemann, Sebastian Trimpe

    Abstract: Most physical processes possess structural properties such as constant energies, volumes, and other invariants over time. When learning models of such dynamical systems, it is critical to respect these invariants to ensure accurate predictions and physically meaningful behavior. Strikingly, state-of-the-art methods in Gaussian process (GP) dynamics model learning are not addressing this issue. On t… ▽ More

    Submitted 9 January, 2022; v1 submitted 2 February, 2021; originally announced February 2021.

  40. arXiv:2012.06250  [pdf, other]

    cs.LG cs.RO

    Structured learning of rigid-body dynamics: A survey and unified view from a robotics perspective

    Authors: A. René Geist, Sebastian Trimpe

    Abstract: Accurate models of mechanical system dynamics are often critical for model-based control and reinforcement learning. Fully data-driven dynamics models promise to ease the process of modeling and analysis, but require considerable amounts of data for training and often do not generalize well to unseen parts of the state space. Combining data-driven modelling with prior analytical knowledge is an at… ▽ More

    Submitted 16 April, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Comments: - Added to title "... from a robotics perspective" - Added to Section 1.4 summary of used notation - Significantly extended the intro to mechanics in Section 2 - Adjusted Figure 4 to show the connection between model errors - Corrected typos

    ACM Class: A.1; I.2.6; I.2.9; J.2

  41. arXiv:2010.09087  [pdf, other]

    eess.SY cs.MA cs.NI

    Wireless Control for Smart Manufacturing: Recent Approaches and Open Challenges

    Authors: Dominik Baumann, Fabian Mager, Ulf Wetzker, Lothar Thiele, Marco Zimmerling, Sebastian Trimpe

    Abstract: Smart manufacturing aims to overcome the limitations of today's rigid assembly lines by making the material flow and manufacturing process more flexible, versatile, and scalable. The main economic drivers are higher resource and cost efficiency as the manufacturers can more quickly adapt to changing market needs and also increase the lifespan of their production sites. The ability to close feedbac… ▽ More

    Submitted 18 October, 2020; originally announced October 2020.

    Comments: Accepted article to appear in Proceedings of the IEEE

  42. Robot Learning with Crash Constraints

    Authors: Alonso Marco, Dominik Baumann, Majid Khadiv, Philipp Hennig, Ludovic Righetti, Sebastian Trimpe

    Abstract: In the past decade, numerous machine learning algorithms have been shown to successfully learn optimal policies to control real robotic systems. However, it is common to encounter failing behaviors as the learning loop progresses. Specifically, in robot applications where failing is undesired but not catastrophic, many algorithms struggle with leveraging data obtained from failures. This is usuall… ▽ More

    Submitted 27 January, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

    Comments: 8 pages, 4 figures, 1 table, 1 algorithm. Accepted for publication in IEEE Robotics and Automation Letters (RA-L). Video demonstration of the experiments available at https://youtu.be/RAiIo0l6_rE . Algorithm implementation available at https://github.com/alonrot/classified_regression.git

    Journal ref: IEEE Robotics and Automation Letters, Vol 6(2), pp. 1439-1446, 2021

  43. Learning Event-triggered Control from Data through Joint Optimization

    Authors: Niklas Funk, Dominik Baumann, Vincent Berenz, Sebastian Trimpe

    Abstract: We present a framework for model-free learning of event-triggered control strategies. Event-triggered methods aim to achieve high control performance while only closing the feedback loop when needed. This enables resource savings, e.g., network bandwidth if control commands are sent via communication networks, as in networked control systems. Event-triggered controllers consist of a communication… ▽ More

    Submitted 23 April, 2021; v1 submitted 11 August, 2020; originally announced August 2020.

  44. arXiv:2006.03906  [pdf, other]

    cs.LG eess.SY stat.ML

    Identifying Causal Structure in Dynamical Systems

    Authors: Dominik Baumann, Friedrich Solowjow, Karl H. Johansson, Sebastian Trimpe

    Abstract: Mathematical models are fundamental building blocks in the design of dynamical control systems. As control systems are becoming increasingly complex and networked, approaches for obtaining such models based on first principles reach their limits. Data-driven methods provide an alternative. However, without structural knowledge, these methods are prone to finding spurious correlations in the traini… ▽ More

    Submitted 18 July, 2022; v1 submitted 6 June, 2020; originally announced June 2020.

    Comments: Accepted final versions to appear in the Transactions on Machine Learning Research

  45. arXiv:2005.07443  [pdf, other]

    cs.LG cs.RO eess.SY stat.ML

    Excursion Search for Constrained Bayesian Optimization under a Limited Budget of Failures

    Authors: Alonso Marco, Alexander von Rohr, Dominik Baumann, José Miguel Hernández-Lobato, Sebastian Trimpe

    Abstract: When learning to ride a bike, a child falls down a number of times before achieving the first success. As falling down usually has only mild consequences, it can be seen as a tolerable failure in exchange for a faster learning process, as it provides rich information about an undesired behavior. In the context of Bayesian optimization under unknown constraints (BOC), typical strategies for safe le… ▽ More

    Submitted 15 May, 2020; originally announced May 2020.

    Comments: 14 pages, 4 figures, submitted

  46. arXiv:2004.11238  [pdf, other]

    cs.LG cs.RO eess.SY stat.ML

    Learning Constrained Dynamics with Gauss Principle adhering Gaussian Processes

    Authors: A. René Geist, Sebastian Trimpe

    Abstract: The identification of the constrained dynamics of mechanical systems is often challenging. Learning methods promise to ease an analytical analysis, but require considerable amounts of data for training. We propose to combine insights from analytical mechanics with Gaussian process regression to improve the model's data efficiency and constraint integrity. The result is a Gaussian process model tha… ▽ More

    Submitted 23 April, 2020; originally announced April 2020.

    Comments: To be published in 2nd Annual Conference on Learning for Dynamics and Control (L4DC), Proceedings of Machine Learning Research 2020

    Journal ref: Proceedings of the 2nd Conference on Learning for Dynamics and Control, PMLR 120:225-234, 2020

  47. arXiv:2004.11098  [pdf, other]

    stat.ML cs.LG

    A Kernel Two-sample Test for Dynamical Systems

    Authors: Friedrich Solowjow, Dominik Baumann, Christian Fiedler, Andreas Jocham, Thomas Seel, Sebastian Trimpe

    Abstract: Evaluating whether data streams are drawn from the same distribution is at the heart of various machine learning problems. This is particularly relevant for data generated by dynamical systems since such systems are essential for many real-world processes in biomedical, economic, or engineering systems. While kernel two-sample tests are powerful for comparing independent and identically distribute… ▽ More

    Submitted 4 September, 2022; v1 submitted 23 April, 2020; originally announced April 2020.

  48. arXiv:1912.10360  [pdf, other]

    cs.RO cs.LG eess.SY

    Safe and Fast Tracking on a Robot Manipulator: Robust MPC and Neural Network Control

    Authors: Julian Nubert, Johannes Köhler, Vincent Berenz, Frank Allgöwer, Sebastian Trimpe

    Abstract: Fast feedback control and safety guarantees are essential in modern robotics. We present an approach that achieves both by combining novel robust model predictive control (MPC) with function approximation via (deep) neural networks (NNs). The result is a new approach for complex tasks with nonlinear, uncertain, and constrained dynamics as are common in robotics. Specifically, we leverage recent re… ▽ More

    Submitted 2 March, 2020; v1 submitted 21 December, 2019; originally announced December 2019.

    Comments: 8 pages, 4 figures

    Journal ref: Robotics and Automation Letters, 2020

  49. arXiv:1911.09946  [pdf, other]

    cs.LG cs.RO stat.ML

    Actively Learning Gaussian Process Dynamics

    Authors: Mona Buisson-Fenet, Friedrich Solowjow, Sebastian Trimpe

    Abstract: Despite the availability of ever more data enabled through modern sensor and computer technology, it still remains an open problem to learn dynamical systems in a sample-efficient way. We propose active learning strategies that leverage information-theoretical properties arising naturally during Gaussian process regression, while respecting constraints on the sampling process imposed by the system… ▽ More

    Submitted 26 April, 2020; v1 submitted 22 November, 2019; originally announced November 2019.

    Journal ref: Actively Learning Gaussian Process Dynamics, Proceedings of the 2nd Conference on Learning for Dynamics and Control, Proceedings of Machine Learning Research vol 120, pp. 5-15, 2020

  50. arXiv:1910.13399  [pdf, other]

    cs.RO cs.AI cs.LG

    Robust Model-free Reinforcement Learning with Multi-objective Bayesian Optimization

    Authors: Matteo Turchetta, Andreas Krause, Sebastian Trimpe

    Abstract: In reinforcement learning (RL), an autonomous agent learns to perform complex tasks by maximizing an exogenous reward signal while interacting with its environment. In real-world applications, test conditions may differ substantially from the training scenario and, therefore, focusing on pure reward maximization during training may lead to poor results at test time. In these cases, it is important… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: Submitted to IEEE Conference on Robotics and Automation 2020 (ICRA)