
Showing 51–93 of 93 results for author: Trimpe, S

  1. Learning Event-triggered Control from Data through Joint Optimization

    Authors: Niklas Funk, Dominik Baumann, Vincent Berenz, Sebastian Trimpe

    Abstract: We present a framework for model-free learning of event-triggered control strategies. Event-triggered methods aim to achieve high control performance while only closing the feedback loop when needed. This enables resource savings, e.g., network bandwidth if control commands are sent via communication networks, as in networked control systems. Event-triggered controllers consist of a communication…

    Submitted 23 April, 2021; v1 submitted 11 August, 2020; originally announced August 2020.

  2. arXiv:2006.03906  [pdf, other]

    cs.LG eess.SY stat.ML

    Identifying Causal Structure in Dynamical Systems

    Authors: Dominik Baumann, Friedrich Solowjow, Karl H. Johansson, Sebastian Trimpe

    Abstract: Mathematical models are fundamental building blocks in the design of dynamical control systems. As control systems are becoming increasingly complex and networked, approaches for obtaining such models based on first principles reach their limits. Data-driven methods provide an alternative. However, without structural knowledge, these methods are prone to finding spurious correlations in the traini…

    Submitted 18 July, 2022; v1 submitted 6 June, 2020; originally announced June 2020.

    Comments: Accepted final versions to appear in the Transactions on Machine Learning Research

  3. arXiv:2005.07443  [pdf, other]

    cs.LG cs.RO eess.SY stat.ML

    Excursion Search for Constrained Bayesian Optimization under a Limited Budget of Failures

    Authors: Alonso Marco, Alexander von Rohr, Dominik Baumann, José Miguel Hernández-Lobato, Sebastian Trimpe

    Abstract: When learning to ride a bike, a child falls down a number of times before achieving the first success. As falling down usually has only mild consequences, it can be seen as a tolerable failure in exchange for a faster learning process, as it provides rich information about an undesired behavior. In the context of Bayesian optimization under unknown constraints (BOC), typical strategies for safe le…

    Submitted 15 May, 2020; originally announced May 2020.

    Comments: 14 pages, 4 figures, submitted

  4. arXiv:2004.11238  [pdf, other]

    cs.LG cs.RO eess.SY stat.ML

    Learning Constrained Dynamics with Gauss Principle adhering Gaussian Processes

    Authors: A. Rene Geist, Sebastian Trimpe

    Abstract: The identification of the constrained dynamics of mechanical systems is often challenging. Learning methods promise to ease an analytical analysis, but require considerable amounts of data for training. We propose to combine insights from analytical mechanics with Gaussian process regression to improve the model's data efficiency and constraint integrity. The result is a Gaussian process model tha…

    Submitted 23 April, 2020; originally announced April 2020.

    Comments: To be published in 2nd Annual Conference on Learning for Dynamics and Control (L4DC), Proceedings of Machine Learning Research 2020

    Journal ref: Proceedings of the 2nd Conference on Learning for Dynamics and Control, PMLR 120:225-234, 2020

  5. arXiv:2004.11098  [pdf, other]

    stat.ML cs.LG

    A Kernel Two-sample Test for Dynamical Systems

    Authors: Friedrich Solowjow, Dominik Baumann, Christian Fiedler, Andreas Jocham, Thomas Seel, Sebastian Trimpe

    Abstract: Evaluating whether data streams are drawn from the same distribution is at the heart of various machine learning problems. This is particularly relevant for data generated by dynamical systems since such systems are essential for many real-world processes in biomedical, economic, or engineering systems. While kernel two-sample tests are powerful for comparing independent and identically distribute…

    Submitted 4 September, 2022; v1 submitted 23 April, 2020; originally announced April 2020.

  6. Controller Design via Experimental Exploration with Robustness Guarantees

    Authors: Tobias Holicki, Carsten W. Scherer, Sebastian Trimpe

    Abstract: For a partially unknown linear system, we present a systematic control design approach based on generated data from measurements of closed-loop experiments with suitable test controllers. These experiments are used to improve the achieved performance and to reduce the uncertainty about the unknown parts of the system. This is achieved through a parametrization of auspicious controllers with conve…

    Submitted 26 June, 2020; v1 submitted 19 March, 2020; originally announced March 2020.

    Journal ref: IEEE Control Systems Letters, Volume 5, Issue 2, Pages 641-646, 2020

  7. arXiv:1912.10360  [pdf, other]

    cs.RO cs.LG eess.SY

    Safe and Fast Tracking on a Robot Manipulator: Robust MPC and Neural Network Control

    Authors: Julian Nubert, Johannes Köhler, Vincent Berenz, Frank Allgöwer, Sebastian Trimpe

    Abstract: Fast feedback control and safety guarantees are essential in modern robotics. We present an approach that achieves both by combining novel robust model predictive control (MPC) with function approximation via (deep) neural networks (NNs). The result is a new approach for complex tasks with nonlinear, uncertain, and constrained dynamics as are common in robotics. Specifically, we leverage recent re…

    Submitted 2 March, 2020; v1 submitted 21 December, 2019; originally announced December 2019.

    Comments: 8 pages, 4 figures

    Journal ref: Robotics and Automation Letters, 2020

  8. arXiv:1911.09946  [pdf, other]

    cs.LG cs.RO stat.ML

    Actively Learning Gaussian Process Dynamics

    Authors: Mona Buisson-Fenet, Friedrich Solowjow, Sebastian Trimpe

    Abstract: Despite the availability of ever more data enabled through modern sensor and computer technology, it still remains an open problem to learn dynamical systems in a sample-efficient way. We propose active learning strategies that leverage information-theoretical properties arising naturally during Gaussian process regression, while respecting constraints on the sampling process imposed by the system…

    Submitted 26 April, 2020; v1 submitted 22 November, 2019; originally announced November 2019.

    Journal ref: Actively Learning Gaussian Process Dynamics, Proceedings of the 2nd Conference on Learning for Dynamics and Control, Proceedings of Machine Learning Research vol 120, pp. 5-15, 2020

  9. arXiv:1910.13399  [pdf, other]

    cs.RO cs.AI cs.LG

    Robust Model-free Reinforcement Learning with Multi-objective Bayesian Optimization

    Authors: Matteo Turchetta, Andreas Krause, Sebastian Trimpe

    Abstract: In reinforcement learning (RL), an autonomous agent learns to perform complex tasks by maximizing an exogenous reward signal while interacting with its environment. In real-world applications, test conditions may differ substantially from the training scenario and, therefore, focusing on pure reward maximization during training may lead to poor results at test time. In these cases, it is important…

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: Submitted to IEEE Conference on Robotics and Automation 2020 (ICRA)

  10. Event-triggered Learning for Linear Quadratic Control

    Authors: Henning Schlüter, Friedrich Solowjow, Sebastian Trimpe

    Abstract: When models are inaccurate, the performance of model-based control will degrade. For linear quadratic control, an event-triggered learning framework is proposed that automatically detects inaccurate models and triggers the learning of a new process model when needed. This is achieved by analyzing the probability distribution of the linear quadratic cost and designing a learning trigger that levera…

    Submitted 5 October, 2020; v1 submitted 17 October, 2019; originally announced October 2019.

    Comments: 13 pages, 8 figures, accepted for publication in IEEE Transactions on Automatic Control

    Journal ref: IEEE Transactions on Automatic Control, vol. 66, no. 10, pp. 4485-4498, Oct. 2021

  11. arXiv:1910.02835  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    A Learnable Safety Measure

    Authors: Steve Heim, Alexander von Rohr, Sebastian Trimpe, Alexander Badri-Spröwitz

    Abstract: Failures are challenging for learning to control physical systems since they risk damage, time-consuming resets, and often provide little gradient information. Adding safety constraints to exploration typically requires a lot of prior knowledge and domain expertise. We present a safety measure which implicitly captures how the system dynamics relate to a set of failure states. Not only can this me…

    Submitted 7 October, 2019; originally announced October 2019.

    Comments: 10 pages, Conference on Robot Learning CoRL 2019, 3 figures

  12. arXiv:1909.10873  [pdf, other]

    eess.SY cs.MA

    Fast Feedback Control over Multi-hop Wireless Networks with Mode Changes and Stability Guarantees

    Authors: Dominik Baumann, Fabian Mager, Romain Jacob, Lothar Thiele, Marco Zimmerling, Sebastian Trimpe

    Abstract: Closing feedback loops fast and over long distances is key to emerging cyber-physical applications; for example, robot motion control and swarm coordination require update intervals of tens of milliseconds. Low-power wireless communication technology is preferred for its low cost, small form factor, and flexibility, especially if the devices support multi-hop communication. Thus far, however, feed…

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: Accepted for publication in ACM Transactions on Cyber-Physical Systems. arXiv admin note: text overlap with arXiv:1804.08986

  13. arXiv:1907.12300  [pdf, other]

    eess.SY

    Predictive Triggering for Distributed Control of Resource Constrained Multi-agent Systems

    Authors: José Mario Mastrangelo, Dominik Baumann, Sebastian Trimpe

    Abstract: A predictive triggering (PT) framework for the distributed control of resource constrained multi-agent systems is proposed. By predicting future communication demands and deriving a probabilistic priority measure, the PT framework is able to allocate limited communication resources in advance. The framework is evaluated through simulations of a cooperative adaptive cruise control system and experi…

    Submitted 29 July, 2019; originally announced July 2019.

    Comments: 6 pages, 3 figures, to appear in Proc. of the 8th IFAC Workshop on Distributed Estimation and Control in Networked Systems, 2019

  14. arXiv:1907.10383  [pdf, other]

    cs.LG cs.RO eess.SY stat.ML

    Classified Regression for Bayesian Optimization: Robot Learning with Unknown Penalties

    Authors: Alonso Marco, Dominik Baumann, Philipp Hennig, Sebastian Trimpe

    Abstract: Learning robot controllers by minimizing a black-box objective cost using Bayesian optimization (BO) can be time-consuming and challenging. It is very often the case that some roll-outs result in failure behaviors, causing premature experiment detention. In such cases, the designer is forced to decide on heuristic cost penalties because the acquired data is often scarce, or not comparable with tha…

    Submitted 9 November, 2020; v1 submitted 24 July, 2019; originally announced July 2019.

    Comments: This paper was submitted to JMLR in 2018 and rejected. Currently, it is not published, nor under review in any conference or journal venue

  15. Demo Abstract: Fast Feedback Control and Coordination with Mode Changes for Wireless Cyber-Physical Systems

    Authors: Fabian Mager, Dominik Baumann, Romain Jacob, Lothar Thiele, Sebastian Trimpe, Marco Zimmerling

    Abstract: This abstract describes the first public demonstration of feedback control and coordination of multiple physical systems over a dynamic multi-hop low-power wireless network with update intervals of tens of milliseconds. Our running system can dynamically change between different sets of application tasks (e.g., sensing, actuation, control) executing on the spatially distributed embedded devices, w…

    Submitted 13 June, 2019; originally announced June 2019.

    Comments: Proceedings of the 18th International Conference on Information Processing in Sensor Networks (IPSN'19), April 16–18, 2019, Montreal, QC, Canada

  16. arXiv:1906.03458  [pdf, other]

    eess.SY cs.MA cs.NI

    Control-guided Communication: Efficient Resource Arbitration and Allocation in Multi-hop Wireless Control Systems

    Authors: Dominik Baumann, Fabian Mager, Marco Zimmerling, Sebastian Trimpe

    Abstract: In future autonomous systems, wireless multi-hop communication is key to enable collaboration among distributed agents at low cost and high flexibility. When many agents need to transmit information over the same wireless network, communication becomes a shared and contested resource. Event-triggered and self-triggered control account for this by transmitting data only when needed, enabling signif…

    Submitted 8 June, 2019; originally announced June 2019.

    Comments: Accepted final version to appear in: IEEE Control Systems Letters

  17. Hierarchical Event-triggered Learning for Cyclically Excited Systems with Application to Wireless Sensor Networks

    Authors: Jonas Beuchert, Friedrich Solowjow, Jörg Raisch, Sebastian Trimpe, Thomas Seel

    Abstract: Communication load is a limiting factor in many real-time systems. Event-triggered state estimation and event-triggered learning methods reduce network communication by sending information only when it cannot be adequately predicted based on previously transmitted data. This paper proposes an event-triggered learning approach for nonlinear discrete-time systems with cyclic excitation. The method a…

    Submitted 7 June, 2019; originally announced June 2019.

    Comments: 6 pages and 6 figures; to appear in IEEE Control Systems Letters

    Journal ref: IEEE Control Systems Letters, vol. 4, no. 1, pp. 103-108, Jan. 2020

  18. arXiv:1905.05710  [pdf, other]

    cs.LG cs.AI stat.ML

    Trajectory-Based Off-Policy Deep Reinforcement Learning

    Authors: Andreas Doerr, Michael Volpp, Marc Toussaint, Sebastian Trimpe, Christian Daniel

    Abstract: Policy gradient methods are powerful reinforcement learning algorithms and have been demonstrated to solve many complex tasks. However, these methods are also data-inefficient, afflicted with high variance gradient estimates, and frequently get stuck in local optima. This work addresses these weaknesses by combining recent improvements in the reuse of off-policy data and exploration in parameter s…

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: Includes appendix. Accepted for ICML 2019

  19. Event-triggered Learning

    Authors: Friedrich Solowjow, Sebastian Trimpe

    Abstract: The efficient exchange of information is an essential aspect of intelligent collective behavior. Event-triggered control and estimation achieve some efficiency by replacing continuous data exchange between agents with intermittent, or event-triggered communication. Typically, model-based predictions are used at times of no data transmission, and updates are sent only when the prediction error grow…

    Submitted 23 March, 2020; v1 submitted 5 April, 2019; originally announced April 2019.

  20. arXiv:1903.08046  [pdf, ps, other]

    eess.SY

    Event-triggered Pulse Control with Model Learning (if Necessary)

    Authors: Dominik Baumann, Friedrich Solowjow, Karl Henrik Johansson, Sebastian Trimpe

    Abstract: In networked control systems, communication is a shared and therefore scarce resource. Event-triggered control (ETC) can achieve high performance control with a significantly reduced amount of samples compared to classical, periodic control schemes. However, ETC methods usually rely on the availability of an accurate dynamics model, which is oftentimes not readily available. In this paper, we prop…

    Submitted 19 March, 2019; originally announced March 2019.

    Comments: Accepted final version to appear in: Proc. of the American Control Conference, 2019

  21. arXiv:1901.07531  [pdf, other]

    eess.SY cs.MA cs.NI

    Resource-aware IoT Control: Saving Communication through Predictive Triggering

    Authors: Sebastian Trimpe, Dominik Baumann

    Abstract: The Internet of Things (IoT) interconnects multiple physical devices in large-scale networks. When the 'things' coordinate decisions and act collectively on shared information, feedback is introduced between them. Multiple feedback loops are thus closed over a shared, general-purpose network. Traditional feedback control is unsuitable for design of IoT control because it relies on high-rate period…

    Submitted 19 January, 2019; originally announced January 2019.

    Comments: 16 pages, 15 figures, accepted article to appear in IEEE Internet of Things Journal. arXiv admin note: text overlap with arXiv:1609.07534

  22. Online learning with stability guarantees: A memory-based real-time model predictive controller

    Authors: Lukas Schwenkel, Meriem Gharbi, Sebastian Trimpe, Christian Ebenbauer

    Abstract: We propose and analyze a real-time model predictive control (MPC) scheme that utilizes stored data to improve its performance by learning the value function online with stability guarantees. For linear and nonlinear systems, a learning method is presented that makes use of basic analytic properties of the cost function and is proven to learn the MPC control law and the value function on the limit…

    Submitted 22 September, 2020; v1 submitted 22 December, 2018; originally announced December 2018.

    Comments: This article is an extended version of the paper "Online learning with stability guarantees: A memory-based warm starting for real-time MPC" published in Automatica, Volume 122, 109247, 2020, including all proofs, an application example, and a detailed description of the used algorithm

    Journal ref: Automatica, Volume 122, 109247, 2020

  23. arXiv:1812.06325  [pdf, other]

    eess.SY cs.LG cs.RO

    Data-efficient Auto-tuning with Bayesian Optimization: An Industrial Control Study

    Authors: Matthias Neumann-Brosig, Alonso Marco, Dieter Schwarzmann, Sebastian Trimpe

    Abstract: Bayesian optimization is proposed for automatic learning of optimal controller parameters from experimental data. A probabilistic description (a Gaussian process) is used to model the unknown function from controller parameters to a user-defined cost. The probabilistic model is updated with data, which is obtained by testing a set of parameters on the physical system and evaluating the cost. In or…

    Submitted 17 December, 2018; v1 submitted 15 December, 2018; originally announced December 2018.

    Comments: 11 pages, 7 figures and 4 tables. To appear in IEEE Transactions on Control Systems Technology

  24. arXiv:1809.05152  [pdf, other]

    eess.SY cs.LG

    Deep Reinforcement Learning for Event-Triggered Control

    Authors: Dominik Baumann, Jia-Jie Zhu, Georg Martius, Sebastian Trimpe

    Abstract: Event-triggered control (ETC) methods can achieve high-performance control with a significantly lower number of samples compared to usual, time-triggered methods. These frameworks are often based on a mathematical model of the system and specific designs of controller and event trigger. In this paper, we show how deep reinforcement learning (DRL) algorithms can be leveraged to simultaneously learn…

    Submitted 13 September, 2018; originally announced September 2018.

  25. arXiv:1809.03225  [pdf, other]

    cs.RO cs.LG eess.SY

    Gait learning for soft microrobots controlled by light fields

    Authors: Alexander von Rohr, Sebastian Trimpe, Alonso Marco, Peer Fischer, Stefano Palagi

    Abstract: Soft microrobots based on photoresponsive materials and controlled by light fields can generate a variety of different gaits. This inherent flexibility can be exploited to maximize their locomotion performance in a given environment and used to adapt them to changing conditions. Albeit, because of the lack of accurate locomotion models, and given the intrinsic variability among microrobots, analyt…

    Submitted 10 September, 2018; originally announced September 2018.

    Comments: 8 pages, 7 figures, to appear in the proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems 2018

  26. Learning an Approximate Model Predictive Controller with Guarantees

    Authors: Michael Hertneck, Johannes Köhler, Sebastian Trimpe, Frank Allgöwer

    Abstract: A supervised learning framework is proposed to approximate a model predictive controller (MPC) with reduced computational complexity and guarantees on stability and constraint satisfaction. The framework can be used for a wide class of nonlinear systems. Any standard supervised learning technique (e.g. neural networks) can be employed to approximate the MPC from samples. In order to obtain closed-…

    Submitted 11 June, 2018; originally announced June 2018.

    Comments: 6 pages, 3 figures, to appear in IEEE Control Systems Letters

  27. arXiv:1805.10615  [pdf, other]

    stat.ML cs.LG math.DS

    A Local Information Criterion for Dynamical Systems

    Authors: Arash Mehrjou, Friedrich Solowjow, Sebastian Trimpe, Bernhard Schölkopf

    Abstract: Encoding a sequence of observations is an essential task with many applications. The encoding can become highly efficient when the observations are generated by a dynamical system. A dynamical system imposes regularities on the observations that can be leveraged to achieve a more efficient code. We propose a method to encode a given or learned dynamical system. Apart from its application for encod…

    Submitted 27 May, 2018; originally announced May 2018.

  28. arXiv:1805.09714  [pdf, other]

    eess.SY

    Efficient Encoding of Dynamical Systems through Local Approximations

    Authors: Friedrich Solowjow, Arash Mehrjou, Bernhard Schölkopf, Sebastian Trimpe

    Abstract: An efficient representation of observed data has many benefits in various domains of engineering and science. Representing static data sets, such as images, is a living branch in machine learning and eases downstream tasks, such as classification, regression, or decision making. However, the representation of dynamical systems has received less attention. In this work, we develop a method to repre…

    Submitted 27 September, 2018; v1 submitted 24 May, 2018; originally announced May 2018.

    Comments: 7 pages, 5 figures, to appear in 57th IEEE Conference on Decision and Control (CDC 2018)

  29. Evaluating Low-Power Wireless Cyber-Physical Systems

    Authors: Dominik Baumann, Fabian Mager, Harsoveet Singh, Marco Zimmerling, Sebastian Trimpe

    Abstract: Simulation tools and testbeds have been proposed to assess the performance of control designs and wireless protocols in isolation. A cyber-physical system (CPS), however, integrates control with network elements, which must be evaluated together under real-world conditions to assess control performance, stability, and associated costs. We present an approach to evaluate CPS relying on embedded dev…

    Submitted 25 April, 2018; originally announced April 2018.

    Comments: Accepted final version to appear in: Proceedings of the 1st Workshop on Benchmarking Cyber-Physical Networks and Systems

  30. arXiv:1804.08986  [pdf, other]

    cs.NI cs.MA eess.SY

    Feedback Control Goes Wireless: Guaranteed Stability over Low-power Multi-hop Networks

    Authors: Fabian Mager, Dominik Baumann, Romain Jacob, Lothar Thiele, Sebastian Trimpe, Marco Zimmerling

    Abstract: Closing feedback loops fast and over long distances is key to emerging applications; for example, robot motion control and swarm coordination require update intervals of tens of milliseconds. Low-power wireless technology is preferred for its low cost, small form factor, and flexibility, especially if the devices support multi-hop communication. So far, however, feedback control over wireless mult…

    Submitted 19 February, 2019; v1 submitted 24 April, 2018; originally announced April 2018.

    Comments: Accepted final version to appear in: 10th ACM/IEEE International Conference on Cyber-Physical Systems (with CPS-IoT Week 2019) (ICCPS '19), April 16–18, 2019, Montreal, QC, Canada

  31. Event-triggered Learning for Resource-efficient Networked Control

    Authors: Friedrich Solowjow, Dominik Baumann, Jochen Garcke, Sebastian Trimpe

    Abstract: Common event-triggered state estimation (ETSE) algorithms save communication in networked control systems by predicting agents' behavior, and transmitting updates only when the predictions deviate significantly. The effectiveness in reducing communication thus heavily depends on the quality of the dynamics models used to predict the agents' states or measurements. Event-triggered learning is propo…

    Submitted 27 September, 2018; v1 submitted 5 March, 2018; originally announced March 2018.

    Comments: 7 pages, 4 figures, to appear in the 2018 American Control Conference (ACC)

  32. arXiv:1801.10395  [pdf, other]

    stat.ML

    Probabilistic Recurrent State-Space Models

    Authors: Andreas Doerr, Christian Daniel, Martin Schiegg, Duy Nguyen-Tuong, Stefan Schaal, Marc Toussaint, Sebastian Trimpe

    Abstract: State-space models (SSMs) are a highly expressive model class for learning patterns in time series data and for system identification. Deterministic versions of SSMs (e.g. LSTMs) proved extremely successful in modeling complex time series data. Fully probabilistic SSMs, however, are often found hard to train, even for smaller problems. To overcome this limitation, we propose a novel model formulat…

    Submitted 10 February, 2018; v1 submitted 31 January, 2018; originally announced January 2018.

  33. arXiv:1709.07089  [pdf, other]

    eess.SY cs.LG stat.ML

    On the Design of LQR Kernels for Efficient Controller Learning

    Authors: Alonso Marco, Philipp Hennig, Stefan Schaal, Sebastian Trimpe

    Abstract: Finding optimal feedback controllers for nonlinear dynamic systems from data is hard. Recently, Bayesian optimization (BO) has been proposed as a powerful framework for direct controller tuning from experimental trials. For selecting the next query point and finding the global optimum, BO relies on a probabilistic description of the latent objective function, typically a Gaussian process (GP). As…

    Submitted 20 September, 2017; originally announced September 2017.

    Comments: 8 pages, 5 figures, to appear in 56th IEEE Conference on Decision and Control (CDC 2017)

  34. Distributed Event-Based State Estimation for Networked Systems: An LMI-Approach

    Authors: Michael Muehlebach, Sebastian Trimpe

    Abstract: In this work, a dynamic system is controlled by multiple sensor-actuator agents, each of them commanding and observing parts of the system's input and output. The different agents sporadically exchange data with each other via a common bus network according to local event-triggering protocols. From these data, each agent estimates the complete dynamic state of the system and uses its estimate for…

    Submitted 6 July, 2017; originally announced July 2017.

    Comments: This is an extended version of an article to appear in the IEEE Transactions on Automatic Control (additional parts in the Appendix)

  35. Event-based State Estimation: An Emulation-based Approach

    Authors: Sebastian Trimpe

    Abstract: An event-based state estimation approach for reducing communication in a networked control system is proposed. Multiple distributed sensor agents observe a dynamic process and sporadically transmit their measurements to estimator agents over a shared bus network. Local event-triggering protocols ensure that data is transmitted only when necessary to meet a desired estimation accuracy. The event-ba…

    Submitted 24 March, 2017; originally announced March 2017.

    Comments: 21 pages, 8 figures, this article is based on the technical report arXiv:1511.05223 and is accepted for publication in IET Control Theory & Applications

  36. arXiv:1703.02899  [pdf, other]

    cs.LG cs.RO eess.SY stat.ML

    Model-Based Policy Search for Automatic Tuning of Multivariate PID Controllers

    Authors: Andreas Doerr, Duy Nguyen-Tuong, Alonso Marco, Stefan Schaal, Sebastian Trimpe

    Abstract: PID control architectures are widely used in industrial applications. Despite their low number of open parameters, tuning multiple, coupled PID controllers can become tedious in practice. In this paper, we extend PILCO, a model-based policy search framework, to automatically tune multivariate PID controllers purely based on data observed on an otherwise unknown system. The system's state is extend…

    Submitted 8 March, 2017; originally announced March 2017.

    Comments: Accepted final version to appear in 2017 IEEE International Conference on Robotics and Automation (ICRA)

  37. arXiv:1703.01250  [pdf, other]

    cs.RO cs.LG eess.SY

    Virtual vs. Real: Trading Off Simulations and Physical Experiments in Reinforcement Learning with Bayesian Optimization

    Authors: Alonso Marco, Felix Berkenkamp, Philipp Hennig, Angela P. Schoellig, Andreas Krause, Stefan Schaal, Sebastian Trimpe

    Abstract: In practice, the parameters of control policies are often tuned manually. This is time-consuming and frustrating. Reinforcement learning is a promising alternative that aims to automate this process, yet often requires too many experiments to be practical. In this paper, we propose a solution to this problem by exploiting prior knowledge from simulations, which are readily available for most robot…

    Submitted 3 March, 2017; originally announced March 2017.

    Comments: 7 pages, 6 figures, to appear in IEEE 2017 International Conference on Robotics and Automation (ICRA)

  38. arXiv:1609.07534  [pdf, other]

    eess.SY math.OC

    Predictive and Self Triggering for Event-based State Estimation

    Authors: Sebastian Trimpe

    Abstract: Event-based state estimation can achieve estimation quality comparable to traditional time-triggered methods, but with a significantly lower number of samples. In networked estimation problems, this reduction in sampling instants does, however, not necessarily translate into better usage of the shared communication resource. Because typical event-based approaches decide instantaneously whether com…

    Submitted 23 September, 2016; originally announced September 2016.

    Comments: 8 pages, 6 figures, accepted at 55th IEEE Conference on Decision and Control 2016

  39. arXiv:1605.01950  [pdf, ps, other]

    cs.RO cs.LG eess.SY

    Automatic LQR Tuning Based on Gaussian Process Global Optimization

    Authors: Alonso Marco, Philipp Hennig, Jeannette Bohg, Stefan Schaal, Sebastian Trimpe

    Abstract: This paper proposes an automatic controller tuning framework based on linear optimal control combined with Bayesian optimization. With this framework, an initial set of controller gains is automatically improved according to a pre-defined performance objective evaluated from experimental data. The underlying Bayesian optimization algorithm is Entropy Search, which represents the latent objective a…

    Submitted 6 May, 2016; originally announced May 2016.

    Comments: 8 pages, 5 figures, to appear in IEEE 2016 International Conference on Robotics and Automation. Video demonstration of the experiments available at https://am.is.tuebingen.mpg.de/publications/marco_icra_2016

  40. Depth-Based Object Tracking Using a Robust Gaussian Filter

    Authors: Jan Issac, Manuel Wüthrich, Cristina Garcia Cifuentes, Jeannette Bohg, Sebastian Trimpe, Stefan Schaal

    Abstract: We consider the problem of model-based 3D-tracking of objects given dense depth images as input. Two difficulties preclude the application of a standard Gaussian filter to this problem. First of all, depth sensors are characterized by fat-tailed measurement noise. To address this issue, we show how a recently published robustification method for Gaussian filters can be applied to the problem at ha…

    Submitted 19 February, 2016; originally announced February 2016.

  41. arXiv:1511.05223  [pdf, other]

    eess.SY

    Distributed Event-based State Estimation

    Authors: Sebastian Trimpe

    Abstract: An event-based state estimation approach for reducing communication in a networked control system is proposed. Multiple distributed sensor-actuator-agents observe a dynamic process and sporadically exchange their measurements and inputs over a bus network. Based on these data, each agent estimates the full state of the dynamic system, which may exhibit arbitrary inter-agent couplings. Local event-…

    Submitted 26 January, 2017; v1 submitted 16 November, 2015; originally announced November 2015.

    Comments: Technical report, 16 pages, 10 figures, minor updates

  42. arXiv:1509.04072  [pdf, other]

    stat.ML eess.SY

    Robust Gaussian Filtering using a Pseudo Measurement

    Authors: Manuel Wüthrich, Cristina Garcia Cifuentes, Sebastian Trimpe, Franziska Meier, Jeannette Bohg, Jan Issac, Stefan Schaal

    Abstract: Many sensors, such as range, sonar, radar, GPS and visual devices, produce measurements which are contaminated by outliers. This problem can be addressed by using fat-tailed sensor models, which account for the possibility of outliers. Unfortunately, all estimation algorithms belonging to the family of Gaussian filters (such as the widely-used extended Kalman filter and unscented Kalman filter) ar…

    Submitted 30 May, 2016; v1 submitted 14 September, 2015; originally announced September 2015.

  43. arXiv:1504.07941  [pdf, other]

    cs.RO

    A New Perspective and Extension of the Gaussian Filter

    Authors: Manuel Wüthrich, Sebastian Trimpe, Daniel Kappler, Stefan Schaal

    Abstract: The Gaussian Filter (GF) is one of the most widely used filtering algorithms; instances are the Extended Kalman Filter, the Unscented Kalman Filter and the Divided Difference Filter. GFs represent the belief of the current state by a Gaussian with the mean being an affine function of the measurement. We show that this representation can be too restrictive to accurately capture the dependences in s…

    Submitted 5 June, 2015; v1 submitted 29 April, 2015; originally announced April 2015.

    Comments: Will appear in Robotics: Science and Systems (R:SS) 2015