Skip to main content

Showing 1–30 of 30 results for author: Lederer, A

.
  1. arXiv:2405.08756  [pdf, other

    eess.SY cs.LG

    Stable Inverse Reinforcement Learning: Policies from Control Lyapunov Landscapes

    Authors: Samuel Tesfazgi, Leonhard Sprandl, Armin Lederer, Sandra Hirche

    Abstract: Learning from expert demonstrations to flexibly program an autonomous system with complex behaviors or to predict an agent's behavior is a powerful tool, especially in collaborative control settings. A common method to solve this problem is inverse reinforcement learning (IRL), where the observed agent, e.g., a human demonstrator, is assumed to behave according to the optimization of an intrinsic… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  2. arXiv:2405.08711  [pdf, other

    cs.RO cs.LG eess.SY

    Data-driven Force Observer for Human-Robot Interaction with Series Elastic Actuators using Gaussian Processes

    Authors: Samuel Tesfazgi, Markus Keßler, Emilio Trigili, Armin Lederer, Sandra Hirche

    Abstract: Ensuring safety and adapting to the user's behavior are of paramount importance in physical human-robot interaction. Thus, incorporating elastic actuators in the robot's mechanical design has become popular, since it offers intrinsic compliance and additionally provide a coarse estimate for the interaction force by measuring the deformation of the elastic components. While observer-based methods h… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  3. arXiv:2402.03048  [pdf, other

    cs.MA cs.LG eess.SY

    Cooperative Learning with Gaussian Processes for Euler-Lagrange Systems Tracking Control under Switching Topologies

    Authors: Zewen Yang, Songbo Dong, Armin Lederer, Xiaobing Dai, Siyu Chen, Stefan Sosnowski, Georges Hattab, Sandra Hirche

    Abstract: This work presents an innovative learning-based approach to tackle the tracking control problem of Euler-Lagrange multi-agent systems with partially unknown dynamics operating under switching communication topologies. The approach leverages a correlation-aware cooperative algorithm framework built upon Gaussian process regression, which adeptly captures inter-agent correlations for uncertainty pre… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 8 pages

  4. arXiv:2310.01538  [pdf, ps, other

    eess.SY

    Risk-Sensitive Inhibitory Control for Safe Reinforcement Learning

    Authors: Armin Lederer, Erfaun Noorani, John S. Baras, Sandra Hirche

    Abstract: Humans have the ability to deviate from their natural behavior when necessary, which is a cognitive process called response inhibition. Similar approaches have independently received increasing attention in recent years for ensuring the safety of control. Realized using control barrier functions or predictive safety filters, these approaches can effectively ensure the satisfaction of state constra… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: The 62nd IEEE Conference on Decision and Control, Dec. 13-15, 2023, Singapore

  5. arXiv:2307.04415  [pdf, other

    eess.SY cs.LG stat.ML

    Episodic Gaussian Process-Based Learning Control with Vanishing Tracking Errors

    Authors: Armin Lederer, Jonas Umlauft, Sandra Hirche

    Abstract: Due to the increasing complexity of technical systems, accurate first principle models can often not be obtained. Supervised machine learning can mitigate this issue by inferring models from measurement data. Gaussian process regression is particularly well suited for this purpose due to its high data-efficiency and its explicit uncertainty representation, which allows the derivation of prediction… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

  6. arXiv:2305.16215  [pdf, other

    cs.LG eess.SY math.DS stat.ML

    Koopman Kernel Regression

    Authors: Petar Bevanda, Max Beier, Armin Lederer, Stefan Sosnowski, Eyke Hüllermeier, Sandra Hirche

    Abstract: Many machine learning approaches for decision making, such as reinforcement learning, rely on simulators or predictive models to forecast the time-evolution of quantities of interest, e.g., the state of an agent or the reward of a policy. Forecasts of such complex phenomena are commonly described by highly nonlinear dynamical systems, making their use in optimization-based decision-making challeng… ▽ More

    Submitted 16 January, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted to the thirty-seventh Conference on Neural Information Processing Systems (NeurIPS 2023)

  7. arXiv:2305.08169  [pdf, ps, other

    eess.SY cs.LG

    Can Learning Deteriorate Control? Analyzing Computational Delays in Gaussian Process-Based Event-Triggered Online Learning

    Authors: Xiaobing Dai, Armin Lederer, Zewen Yang, Sandra Hirche

    Abstract: When the dynamics of systems are unknown, supervised machine learning techniques are commonly employed to infer models from data. Gaussian process (GP) regression is a particularly popular learning method for this purpose due to the existence of prediction error bounds. Moreover, GP models can be efficiently updated online, such that event-triggered online learning strategies can be pursued to ens… ▽ More

    Submitted 14 May, 2023; originally announced May 2023.

  8. arXiv:2303.17963  [pdf, other

    eess.SY cs.LG math.OC stat.ML

    Learning-Based Optimal Control with Performance Guarantees for Unknown Systems with Latent States

    Authors: Robert Lefringhausen, Supitsana Srithasan, Armin Lederer, Sandra Hirche

    Abstract: As control engineering methods are applied to increasingly complex systems, data-driven approaches for system identification appear as a promising alternative to physics-based modeling. While the Bayesian approaches prevalent for safety-critical applications usually rely on the availability of state measurements, the states of a complex system are often not directly measurable. It may then be nece… ▽ More

    Submitted 16 April, 2024; v1 submitted 31 March, 2023; originally announced March 2023.

    Comments: Accepted version submitted to the 22nd European Control Conference

  9. arXiv:2212.00478  [pdf, ps, other

    eess.SY cs.RO

    Safe Learning-Based Control of Elastic Joint Robots via Control Barrier Functions

    Authors: Armin Lederer, Azra Begzadić, Neha Das, Sandra Hirche

    Abstract: Ensuring safety is of paramount importance in physical human-robot interaction applications. This requires both adherence to safety constraints defined on the system state, as well as guaranteeing compliant behavior of the robot. If the underlying dynamical system is known exactly, the former can be addressed with the help of control barrier functions. The incorporation of elastic actuators in the… ▽ More

    Submitted 14 April, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

  10. Vision-Based Uncertainty-Aware Motion Planning based on Probabilistic Semantic Segmentation

    Authors: Ralf Römer, Armin Lederer, Samuel Tesfazgi, Sandra Hirche

    Abstract: For safe operation, a robot must be able to avoid collisions in uncertain environments. Existing approaches for motion planning under uncertainties often assume parametric obstacle representations and Gaussian uncertainty, which can be inaccurate. While visual perception can deliver a more accurate representation of the environment, its use for safe motion planning is limited by the inherent misca… ▽ More

    Submitted 1 December, 2023; v1 submitted 14 September, 2022; originally announced September 2022.

    Journal ref: IEEE Robotics and Automation Letters, vol. 8, no. 11, pp. 7825-7832, 2023

  11. arXiv:2207.01337  [pdf, other

    cs.LG cs.AI eess.SY

    Safe Reinforcement Learning via Confidence-Based Filters

    Authors: Sebastian Curi, Armin Lederer, Sandra Hirche, Andreas Krause

    Abstract: Ensuring safety is a crucial challenge when deploying reinforcement learning (RL) to real-world systems. We develop confidence-based safety filters, a control-theoretic approach for certifying state safety constraints for nominal policies learned via standard RL techniques, based on probabilistic dynamics models. Our approach is based on a reformulation of state constraints in terms of cost functi… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

  12. arXiv:2202.11491  [pdf, other

    eess.SY cs.LG

    Networked Online Learning for Control of Safety-Critical Resource-Constrained Systems based on Gaussian Processes

    Authors: Armin Lederer, Mingmin Zhang, Samuel Tesfazgi, Sandra Hirche

    Abstract: Safety-critical technical systems operating in unknown environments require the ability to quickly adapt their behavior, which can be achieved in control by inferring a model online from the data stream generated during operation. Gaussian process-based learning is particularly well suited for safety-critical applications as it ensures bounded prediction errors. While there exist computationally e… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

  13. arXiv:2112.04085  [pdf, other

    cs.LG eess.SY

    Diffeomorphically Learning Stable Koopman Operators

    Authors: Petar Bevanda, Max Beier, Sebastian Kerz, Armin Lederer, Stefan Sosnowski, Sandra Hirche

    Abstract: System representations inspired by the infinite-dimensional Koopman operator (generator) are increasingly considered for predictive modeling. Due to the operator's linearity, a range of nonlinear systems admit linear predictor representations - allowing for simplified prediction, analysis and control. However, finding meaningful finite-dimensional representations for prediction is difficult as it… ▽ More

    Submitted 30 May, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: Revised version submitted to IEEE Control Systems Letters (L-CSS) with substantially revised exposition, evaluation and proof of Lemma 2 (previously Lemma 8)

  14. arXiv:2111.03617  [pdf, ps, other

    eess.SP cs.LG eess.SY

    Adaptive Low-Pass Filtering using Sliding Window Gaussian Processes

    Authors: Alejandro J. Ordóñez-Conejo, Armin Lederer, Sandra Hirche

    Abstract: When signals are measured through physical sensors, they are perturbed by noise. To reduce noise, low-pass filters are commonly employed in order to attenuate high frequency components in the incoming signal, regardless if they come from noise or the actual signal. Therefore, low-pass filters must be carefully tuned in order to avoid significant deterioration of the signal. This tuning requires pr… ▽ More

    Submitted 5 November, 2021; originally announced November 2021.

  15. arXiv:2110.00481  [pdf, other

    cs.LG cs.RO

    Personalized Rehabilitation Robotics based on Online Learning Control

    Authors: Samuel Tesfazgi, Armin Lederer, Johannes F. Kunz, Alejandro J. Ordóñez-Conejo, Sandra Hirche

    Abstract: The use of rehabilitation robotics in clinical applications gains increasing importance, due to therapeutic benefits and the ability to alleviate labor-intensive works. However, their practical utility is dependent on the deployment of appropriate control algorithms, which adapt the level of task-assistance according to each individual patient's need. Generally, the required personalization is ach… ▽ More

    Submitted 15 September, 2022; v1 submitted 1 October, 2021; originally announced October 2021.

  16. arXiv:2109.02606  [pdf, other

    cs.LG cs.RO eess.SY

    Gaussian Process Uniform Error Bounds with Unknown Hyperparameters for Safety-Critical Applications

    Authors: Alexandre Capone, Armin Lederer, Sandra Hirche

    Abstract: Gaussian processes have become a promising tool for various safety-critical settings, since the posterior variance can be used to directly estimate the model error and quantify risk. However, state-of-the-art techniques for safety-critical settings hinge on the assumption that the kernel hyperparameters are known, which does not apply in general. To mitigate this, we introduce robust Gaussian proc… ▽ More

    Submitted 20 July, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

  17. arXiv:2104.04483  [pdf, other

    cs.LG eess.SY

    Inverse Reinforcement Learning: A Control Lyapunov Approach

    Authors: Samuel Tesfazgi, Armin Lederer, Sandra Hirche

    Abstract: Inferring the intent of an intelligent agent from demonstrations and subsequently predicting its behavior, is a critical task in many collaborative settings. A common approach to solve this problem is the framework of inverse reinforcement learning (IRL), where the observed agent, e.g., a human demonstrator, is assumed to behave according to an intrinsic cost function that reflects its intent and… ▽ More

    Submitted 4 October, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: This work has been accepted for presentation at, and publication in the proceedings of, the 2021 IEEE Conference on Decision and Control (CDC)

  18. Distributed Bayesian Online Learning for Cooperative Manipulation

    Authors: Pablo Budde gen. Dohmann, Armin Lederer, Marcel Dißemond, Sandra Hirche

    Abstract: For tasks where the dynamics of multiple agents are physically coupled, e.g., in cooperative manipulation, the coordination between the individual agents becomes crucial, which requires exact knowledge of the interaction dynamics. This problem is typically addressed using centralized estimators, which can negatively impact the flexibility and robustness of the overall system. To overcome this shor… ▽ More

    Submitted 28 June, 2022; v1 submitted 9 April, 2021; originally announced April 2021.

  19. arXiv:2103.15929  [pdf, other

    eess.SY

    Distributed Learning Consensus Control for Unknown Nonlinear Multi-Agent Systems based on Gaussian Processes

    Authors: Zewen Yang, Stefan Sosnowski, Qingchen Liu, Junjie Jiao, Armin Lederer, Sandra Hirche

    Abstract: In this paper, a distributed learning leader-follower consensus protocol based on Gaussian process regression for a class of nonlinear multi-agent systems with unknown dynamics is designed. We propose a distributed learning approach to predict the residual dynamics for each agent. The stability of the consensus protocol using the data-driven model of the dynamics is shown via Lyapunov analysis. Th… ▽ More

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: This paper was submitted to IEEE CDC2021

  20. arXiv:2101.05328  [pdf, other

    cs.LG eess.SY stat.ML

    Uniform Error and Posterior Variance Bounds for Gaussian Process Regression with Application to Safe Control

    Authors: Armin Lederer, Jonas Umlauft, Sandra Hirche

    Abstract: In application areas where data generation is expensive, Gaussian processes are a preferred supervised learning model due to their high data-efficiency. Particularly in model-based control, Gaussian processes allow the derivation of performance guarantees using probabilistic model error bounds. To make these approaches applicable in practice, two open challenges must be solved i) Existing error bo… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

  21. arXiv:2011.10596  [pdf, ps, other

    eess.SY cs.LG

    The Impact of Data on the Stability of Learning-Based Control- Extended Version

    Authors: Armin Lederer, Alexandre Capone, Thomas Beckers, Jonas Umlauft, Sandra Hirche

    Abstract: Despite the existence of formal guarantees for learning-based control approaches, the relationship between data and control performance is still poorly understood. In this paper, we propose a Lyapunov-based measure for quantifying the impact of data on the certifiable control performance. By modeling unknown system dynamics through Gaussian processes, we can determine the interrelation between mod… ▽ More

    Submitted 30 July, 2021; v1 submitted 20 November, 2020; originally announced November 2020.

  22. arXiv:2010.02613  [pdf, other

    cs.LG cs.AI cs.RO eess.SY

    Deep Learning based Uncertainty Decomposition for Real-time Control

    Authors: Neha Das, Jonas Umlauft, Armin Lederer, Thomas Beckers, Sandra Hirche

    Abstract: Data-driven control in unknown environments requires a clear understanding of the involved uncertainties for ensuring safety and efficient exploration. While aleatoric uncertainty that arises from measurement noise can often be explicitly modeled given a parametric description, it can be harder to model epistemic uncertainty, which describes the presence or absence of training data. The latter can… ▽ More

    Submitted 12 July, 2023; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: Accepted at IFAC World Congress 2023

  23. arXiv:2006.09446  [pdf, ps, other

    cs.LG cs.RO stat.ML

    Real-Time Regression with Dividing Local Gaussian Processes

    Authors: Armin Lederer, Alejandro Jose Ordonez Conejo, Korbinian Maier, Wenxin Xiao, Jonas Umlauft, Sandra Hirche

    Abstract: The increased demand for online prediction and the growing availability of large data sets drives the need for computationally efficient models. While exact Gaussian process regression shows various favorable theoretical properties (uncertainty estimate, unlimited expressive power), the poor scaling with respect to the training set size prohibits its application in big data regimes in real-time. T… ▽ More

    Submitted 30 July, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

  24. arXiv:2006.07871  [pdf, other

    cs.LG eess.SY stat.ML

    GP3: A Sampling-based Analysis Framework for Gaussian Processes

    Authors: Armin Lederer, Markus Kessler, Sandra Hirche

    Abstract: Although machine learning is increasingly applied in control approaches, only few methods guarantee certifiable safety, which is necessary for real world applications. These approaches typically rely on well-understood learning algorithms, which allow formal theoretical analysis. Gaussian process regression is a prominent example among those methods, which attracts growing attention due to its str… ▽ More

    Submitted 14 June, 2020; originally announced June 2020.

  25. arXiv:2006.07868  [pdf, other

    eess.SY cs.LG

    Learning Stable Nonparametric Dynamical Systems with Gaussian Process Regression

    Authors: Wenxin Xiao, Armin Lederer, Sandra Hirche

    Abstract: Modelling real world systems involving humans such as biological processes for disease treatment or human behavior for robotic rehabilitation is a challenging problem because labeled training data is sparse and expensive, while high prediction accuracy is required from models of these dynamical systems. Due to the high nonlinearity of problems in this area, data-driven approaches gain increasing a… ▽ More

    Submitted 14 June, 2020; originally announced June 2020.

  26. arXiv:2005.12062  [pdf, other

    eess.SY cs.LG

    How Training Data Impacts Performance in Learning-based Control

    Authors: Armin Lederer, Alexandre Capone, Jonas Umlauft, Sandra Hirche

    Abstract: When first principle models cannot be derived due to the complexity of the real system, data-driven methods allow us to build models from system observations. As these models are employed in learning-based control, the quality of the data plays a crucial role for the performance of the resulting control law. Nevertheless, there hardly exist measures for assessing training data sets, and the impact… ▽ More

    Submitted 25 May, 2020; originally announced May 2020.

  27. arXiv:2005.03270  [pdf, ps, other

    eess.SY

    Data selection for multi-task learning under dynamic constraints

    Authors: Alexandre Capone, Armin Lederer, Jonas Umlauft, Sandra Hirche

    Abstract: Learning-based techniques are increasingly effective at controlling complex systems using data-driven models. However, most work done so far has focused on learning individual tasks or control laws. Hence, it is still a largely unaddressed research question how multiple tasks can be learned efficiently and simultaneously on the same system. In particular, no efficient state space exploration schem… ▽ More

    Submitted 7 May, 2020; originally announced May 2020.

  28. arXiv:2005.02191  [pdf, ps, other

    cs.LG eess.SY stat.ML

    Localized active learning of Gaussian process state space models

    Authors: Alexandre Capone, Jonas Umlauft, Thomas Beckers, Armin Lederer, Sandra Hirche

    Abstract: The performance of learning-based control techniques crucially depends on how effectively the system is explored. While most exploration techniques aim to achieve a globally accurate model, such approaches are generally unsuited for systems with unbounded state spaces. Furthermore, a globally accurate model is not required to achieve good performance in many common control applications, e.g., loca… ▽ More

    Submitted 9 June, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: Submitted to Learning for Dynamics and Control (L4DC)

  29. arXiv:1906.01404  [pdf, ps, other

    cs.LG stat.ML

    Posterior Variance Analysis of Gaussian Processes with Application to Average Learning Curves

    Authors: Armin Lederer, Jonas Umlauft, Sandra Hirche

    Abstract: The posterior variance of Gaussian processes is a valuable measure of the learning error which is exploited in various applications such as safe reinforcement learning and control design. However, suitable analysis of the posterior variance which captures its behavior for finite and infinite number of training data is missing. This paper derives a novel bound for the posterior variance function wh… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

  30. arXiv:1906.01376  [pdf, other

    cs.LG eess.SY stat.ML

    Uniform Error Bounds for Gaussian Process Regression with Application to Safe Control

    Authors: Armin Lederer, Jonas Umlauft, Sandra Hirche

    Abstract: Data-driven models are subject to model errors due to limited and noisy training data. Key to the application of such models in safety-critical domains is the quantification of their model error. Gaussian processes provide such a measure and uniform error bounds have been derived, which allow safe control based on these models. However, existing error bounds require restrictive assumptions. In thi… ▽ More

    Submitted 19 December, 2019; v1 submitted 4 June, 2019; originally announced June 2019.