Skip to main content

Showing 1–27 of 27 results for author: Berkenkamp, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.15347  [pdf, other

    cs.LG cs.AI stat.ML

    Information-Theoretic Safe Bayesian Optimization

    Authors: Alessandro G. Bottero, Carlos E. Luis, Julia Vinogradska, Felix Berkenkamp, Jan Peters

    Abstract: We consider a sequential decision making task, where the goal is to optimize an unknown function without evaluating parameters that violate an a~priori unknown (safety) constraint. A common approach is to place a Gaussian process prior on the unknown functions and allow evaluations only in regions that are safe with high probability. Most current methods rely on a discretization of the domain and… ▽ More

    Submitted 10 May, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2212.04914

  2. arXiv:2312.17411  [pdf, other

    cs.LG stat.ML

    Generative Posterior Networks for Approximately Bayesian Epistemic Uncertainty Estimation

    Authors: Melrose Roderick, Felix Berkenkamp, Fatemeh Sheikholeslami, Zico Kolter

    Abstract: In many real-world problems, there is a limited set of training data, but an abundance of unlabeled data. We propose a new method, Generative Posterior Networks (GPNs), that uses unlabeled data to estimate epistemic uncertainty in high-dimensional problems. A GPN is a generative model that, given a prior distribution over functions, approximates the posterior distribution directly by regularizing… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 10 pages, 3 figures, 2 tables

  3. arXiv:2312.04386  [pdf, other

    cs.LG cs.AI

    Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization

    Authors: Carlos E. Luis, Alessandro G. Bottero, Julia Vinogradska, Felix Berkenkamp, Jan Peters

    Abstract: We consider the problem of quantifying uncertainty over expected cumulative rewards in model-based reinforcement learning. In particular, we focus on characterizing the variance over values induced by a distribution over MDPs. Previous work upper bounds the posterior variance over values by solving a so-called uncertainty Bellman equation (UBE), but the over-approximation may result in inefficient… ▽ More

    Submitted 13 December, 2023; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2302.12526

  4. arXiv:2312.00742  [pdf, other

    stat.ML cs.AI cs.LG

    Scalable Meta-Learning with Gaussian Processes

    Authors: Petru Tighineanu, Lukas Grossberger, Paul Baireuther, Kathrin Skubch, Stefan Falkner, Julia Vinogradska, Felix Berkenkamp

    Abstract: Meta-learning is a powerful approach that exploits historical data to quickly solve new tasks from the same distribution. In the low-data regime, methods based on the closed-form posterior of Gaussian processes (GP) together with Bayesian optimization have achieved high performance. However, these methods are either computationally expensive or introduce assumptions that hinder a principled propag… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  5. arXiv:2311.14885  [pdf, other

    cs.LG

    Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning

    Authors: Melrose Roderick, Gaurav Manek, Felix Berkenkamp, J. Zico Kolter

    Abstract: A key problem in off-policy Reinforcement Learning (RL) is the mismatch, or distribution shift, between the dataset and the distribution over states and actions visited by the learned policy. This problem is exacerbated in the fully offline setting. The main approach to correct this shift has been through importance sampling, which leads to high-variance gradients. Other approaches, such as conser… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

    Comments: 10 pages

  6. arXiv:2308.06590  [pdf, other

    cs.LG cs.AI

    Value-Distributional Model-Based Reinforcement Learning

    Authors: Carlos E. Luis, Alessandro G. Bottero, Julia Vinogradska, Felix Berkenkamp, Jan Peters

    Abstract: Quantifying uncertainty about a policy's long-term performance is important to solve sequential decision-making tasks. We study the problem from a model-based Bayesian reinforcement learning perspective, where the goal is to learn the posterior distribution over value functions induced by parameter (epistemic) uncertainty of the Markov decision process. Previous work restricts the analysis to a fe… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

  7. arXiv:2307.03565  [pdf, other

    cs.LG stat.ML

    MALIBO: Meta-learning for Likelihood-free Bayesian Optimization

    Authors: Jiarong Pan, Stefan Falkner, Felix Berkenkamp, Joaquin Vanschoren

    Abstract: Bayesian optimization (BO) is a popular method to optimize costly black-box functions. While traditional BO optimizes each new target task from scratch, meta-learning has emerged as a way to leverage knowledge from related tasks to optimize new tasks faster. However, existing meta-learning BO methods rely on surrogate models that suffer from scalability issues and are sensitive to observations wit… ▽ More

    Submitted 28 June, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

  8. arXiv:2302.12526  [pdf, other

    cs.LG cs.AI stat.ML

    Model-Based Uncertainty in Value Functions

    Authors: Carlos E. Luis, Alessandro G. Bottero, Julia Vinogradska, Felix Berkenkamp, Jan Peters

    Abstract: We consider the problem of quantifying uncertainty over expected cumulative rewards in model-based reinforcement learning. In particular, we focus on characterizing the variance over values induced by a distribution over MDPs. Previous work upper bounds the posterior variance over values by solving a so-called uncertainty Bellman equation, but the over-approximation may result in inefficient explo… ▽ More

    Submitted 7 March, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: AISTATS 2023

  9. arXiv:2212.04914  [pdf, other

    cs.LG

    Information-Theoretic Safe Exploration with Gaussian Processes

    Authors: Alessandro G. Bottero, Carlos E. Luis, Julia Vinogradska, Felix Berkenkamp, Jan Peters

    Abstract: We consider a sequential decision making task where we are not allowed to evaluate parameters that violate an a priori unknown (safety) constraint. A common approach is to place a Gaussian process prior on the unknown constraint and allow evaluations only in regions that are safe with high probability. Most current methods rely on a discretization of the domain and cannot be directly extended to t… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: Submitted to NeurIPS 2022

  10. arXiv:2111.11223  [pdf, other

    stat.ML cs.AI cs.LG

    Transfer Learning with Gaussian Processes for Bayesian Optimization

    Authors: Petru Tighineanu, Kathrin Skubch, Paul Baireuther, Attila Reiss, Felix Berkenkamp, Julia Vinogradska

    Abstract: Bayesian optimization is a powerful paradigm to optimize black-box functions based on scarce and noisy data. Its data efficiency can be further improved by transfer learning from related tasks. While recent transfer models meta-learn a prior based on large amount of data, in the low-data regime methods that exploit the closed-form posterior of Gaussian processes (GPs) have an advantage. In this se… ▽ More

    Submitted 15 March, 2022; v1 submitted 22 November, 2021; originally announced November 2021.

  11. arXiv:2110.07985  [pdf, other

    cs.LG cs.RO eess.SY

    On-Policy Model Errors in Reinforcement Learning

    Authors: Lukas P. Fröhlich, Maksym Lefarov, Melanie N. Zeilinger, Felix Berkenkamp

    Abstract: Model-free reinforcement learning algorithms can compute policy gradients given sampled environment transitions, but require large amounts of data. In contrast, model-based methods can use the learned model to generate new data, but model errors and bias can render learning unstable or suboptimal. In this paper, we present a novel method that combines real-world data and a learned model in order t… ▽ More

    Submitted 3 March, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Published at The Tenth International Conference on Learning Representations (ICLR 2022)

  12. arXiv:2006.08684  [pdf, other

    cs.LG cs.RO eess.SY stat.ML

    Efficient Model-Based Reinforcement Learning through Optimistic Policy Search and Planning

    Authors: Sebastian Curi, Felix Berkenkamp, Andreas Krause

    Abstract: Model-based reinforcement learning algorithms with probabilistic dynamical models are amongst the most data-efficient learning methods. This is often attributed to their ability to distinguish between epistemic and aleatoric uncertainty. However, while most algorithms distinguish these two uncertainties for learning the model, they ignore it when optimizing the policy, which leads to greedy and in… ▽ More

    Submitted 1 December, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

  13. arXiv:2002.08396  [pdf, other

    cs.LG cs.RO stat.ML

    Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning

    Authors: Noah Y. Siegel, Jost Tobias Springenberg, Felix Berkenkamp, Abbas Abdolmaleki, Michael Neunert, Thomas Lampe, Roland Hafner, Nicolas Heess, Martin Riedmiller

    Abstract: Off-policy reinforcement learning algorithms promise to be applicable in settings where only a fixed data-set (batch) of environment interactions is available and no new experience can be acquired. This property makes these algorithms appealing for real world problems such as robot control. In practice, however, standard off-policy algorithms fail in the batch setting for continuous control. In th… ▽ More

    Submitted 17 June, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    ACM Class: I.2.6; I.2.9

    Journal ref: ICLR 2020

  14. arXiv:1910.13726  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Safe Exploration for Interactive Machine Learning

    Authors: Matteo Turchetta, Felix Berkenkamp, Andreas Krause

    Abstract: In Interactive Machine Learning (IML), we iteratively make decisions and obtain noisy observations of an unknown function. While IML methods, e.g., Bayesian optimization and active learning, have been successful in applications, on real-world systems they must provably avoid unsafe decisions. To this end, safe IML algorithms must carefully learn about a priori unknown constraints without making un… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: Accepted at NeurIPS 2019

  15. arXiv:1907.07035  [pdf, other

    cs.LG eess.SY stat.ML

    Structured Variational Inference in Unstable Gaussian Process State Space Models

    Authors: Silvan Melchior, Sebastian Curi, Felix Berkenkamp, Andreas Krause

    Abstract: We propose a new variational inference algorithm for learning in Gaussian Process State-Space Models (GPSSMs). Our algorithm enables learning of unstable and partially observable systems, where previous algorithms fail. Our main algorithmic contribution is a novel approximate posterior that can be calculated efficiently using a single forward and backward pass along the training trajectories. The… ▽ More

    Submitted 10 June, 2020; v1 submitted 16 July, 2019; originally announced July 2019.

  16. arXiv:1906.12189  [pdf, other

    eess.SY cs.AI cs.LG

    Learning-based Model Predictive Control for Safe Exploration and Reinforcement Learning

    Authors: Torsten Koller, Felix Berkenkamp, Matteo Turchetta, Joschka Boedecker, Andreas Krause

    Abstract: Reinforcement learning has been successfully used to solve difficult tasks in complex unknown environments. However, these methods typically do not provide any safety guarantees during the learning process. This is particularly problematic, since reinforcement learning agent actively explore their environment. This prevents their use in safety-critical, real-world applications. In this paper, we p… ▽ More

    Submitted 27 June, 2019; originally announced June 2019.

    Comments: 14 pages, 7 figures. arXiv admin note: substantial text overlap with arXiv:1803.08287

  17. arXiv:1901.03357  [pdf, other

    stat.ML cs.LG

    No-Regret Bayesian Optimization with Unknown Hyperparameters

    Authors: Felix Berkenkamp, Angela P. Schoellig, Andreas Krause

    Abstract: Bayesian optimization (BO) based on Gaussian process models is a powerful paradigm to optimize black-box functions that are expensive to evaluate. While several BO algorithms provably converge to the global optimum of the unknown function, they assume that the hyperparameters of the kernel are known in advance. This is not the case in practice and misspecification often causes these algorithms to… ▽ More

    Submitted 1 April, 2019; v1 submitted 10 January, 2019; originally announced January 2019.

  18. arXiv:1812.07544  [pdf, other

    cs.LG cs.AI stat.ML

    Information-Directed Exploration for Deep Reinforcement Learning

    Authors: Nikolay Nikolov, Johannes Kirschner, Felix Berkenkamp, Andreas Krause

    Abstract: Efficient exploration remains a major challenge for reinforcement learning. One reason is that the variability of the returns often depends on the current state and action, and is therefore heteroscedastic. Classical exploration strategies such as upper confidence bound algorithms and Thompson sampling fail to appropriately account for heteroscedasticity, even in the bandit setting. Motivated by r… ▽ More

    Submitted 24 March, 2019; v1 submitted 18 December, 2018; originally announced December 2018.

  19. arXiv:1811.05788  [pdf, other

    cs.LG cs.AI eess.SY

    Learning to Compensate Photovoltaic Power Fluctuations from Images of the Sky by Imitating an Optimal Policy

    Authors: Robin Spiess, Felix Berkenkamp, Jan Poland, Andreas Krause

    Abstract: The energy output of photovoltaic (PV) power plants depends on the environment and thus fluctuates over time. As a result, PV power can cause instability in the power grid, in particular when increasingly used. Limiting the rate of change of the power output is a common way to mitigate these fluctuations, often with the help of large batteries. A reactive controller that uses these batteries to co… ▽ More

    Submitted 13 November, 2018; originally announced November 2018.

    Comments: 7 pages, 7 figures

  20. arXiv:1808.00924  [pdf, other

    eess.SY cs.LG cs.RO

    The Lyapunov Neural Network: Adaptive Stability Certification for Safe Learning of Dynamical Systems

    Authors: Spencer M. Richards, Felix Berkenkamp, Andreas Krause

    Abstract: Learning algorithms have shown considerable prowess in simulation by allowing robots to adapt to uncertain environments and improve their performance. However, such algorithms are rarely used in practice on safety-critical systems, since the learned policy typically does not yield any safety guarantees. That is, the required exploration may cause physical harm to the robot or its environment. In t… ▽ More

    Submitted 1 October, 2018; v1 submitted 2 August, 2018; originally announced August 2018.

    Comments: Proc. of the 2nd Conference on Robot Learning (CoRL 2018)

  21. arXiv:1803.08287  [pdf, other

    eess.SY cs.AI cs.LG cs.RO

    Learning-based Model Predictive Control for Safe Exploration

    Authors: Torsten Koller, Felix Berkenkamp, Matteo Turchetta, Andreas Krause

    Abstract: Learning-based methods have been successful in solving complex control tasks without significant prior knowledge about the system. However, these methods typically do not provide any safety guarantees, which prevents their use in safety-critical, real-world applications. In this paper, we present a learning-based model predictive control scheme that can provide provable high-probability safety gua… ▽ More

    Submitted 7 November, 2018; v1 submitted 22 March, 2018; originally announced March 2018.

    Comments: Proc. of the Conference on Decision and Control, 2018

  22. arXiv:1802.08678  [pdf, other

    eess.SY cs.LG cs.RO stat.ML

    Verifying Controllers Against Adversarial Examples with Bayesian Optimization

    Authors: Shromona Ghosh, Felix Berkenkamp, Gireeja Ranade, Shaz Qadeer, Ashish Kapoor

    Abstract: Recent successes in reinforcement learning have lead to the development of complex controllers for real-world robots. As these robots are deployed in safety-critical applications and interact with humans, it becomes critical to ensure safety in order to avoid causing harm. A first step in this direction is to test the controllers in simulation. To be able to do this, we need to capture what we mea… ▽ More

    Submitted 26 February, 2018; v1 submitted 23 February, 2018; originally announced February 2018.

    Comments: Proc. of the IEEE International Conference on Robotics and Automation, 2018

  23. arXiv:1705.08551  [pdf, other

    stat.ML cs.AI cs.LG eess.SY

    Safe Model-based Reinforcement Learning with Stability Guarantees

    Authors: Felix Berkenkamp, Matteo Turchetta, Angela P. Schoellig, Andreas Krause

    Abstract: Reinforcement learning is a powerful paradigm for learning optimal policies from experimental data. However, to find optimal policies, most reinforcement learning algorithms explore all possible actions, which may be harmful for real-world systems. As a consequence, learning algorithms are rarely applied on safety-critical systems in the real world. In this paper, we present a learning algorithm t… ▽ More

    Submitted 13 November, 2017; v1 submitted 23 May, 2017; originally announced May 2017.

    Comments: Proc. of Neural Information Processing Systems (NIPS), 2017

  24. arXiv:1703.01250  [pdf, other

    cs.RO cs.LG eess.SY

    Virtual vs. Real: Trading Off Simulations and Physical Experiments in Reinforcement Learning with Bayesian Optimization

    Authors: Alonso Marco, Felix Berkenkamp, Philipp Hennig, Angela P. Schoellig, Andreas Krause, Stefan Schaal, Sebastian Trimpe

    Abstract: In practice, the parameters of control policies are often tuned manually. This is time-consuming and frustrating. Reinforcement learning is a promising alternative that aims to automate this process, yet often requires too many experiments to be practical. In this paper, we propose a solution to this problem by exploiting prior knowledge from simulations, which are readily available for most robot… ▽ More

    Submitted 3 March, 2017; originally announced March 2017.

    Comments: 7 pages, 6 figures, to appear in IEEE 2017 International Conference on Robotics and Automation (ICRA)

  25. arXiv:1606.04753  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Safe Exploration in Finite Markov Decision Processes with Gaussian Processes

    Authors: Matteo Turchetta, Felix Berkenkamp, Andreas Krause

    Abstract: In classical reinforcement learning, when exploring an environment, agents accept arbitrary short term loss for long term gain. This is infeasible for safety critical applications, such as robotics, where even a single unsafe action may cause system failure. In this paper, we address the problem of safely exploring finite Markov decision processes (MDP). We define safety in terms of an, a priori u… ▽ More

    Submitted 15 November, 2016; v1 submitted 15 June, 2016; originally announced June 2016.

    Comments: 15 pages, extended version with proofs

    Journal ref: Proc. of Advances in Neural Information Processing Systems (NIPS), 2016, pp. 4305-4313

  26. arXiv:1602.04450  [pdf, other

    cs.RO cs.LG eess.SY

    Bayesian Optimization with Safety Constraints: Safe and Automatic Parameter Tuning in Robotics

    Authors: Felix Berkenkamp, Andreas Krause, Angela P. Schoellig

    Abstract: Robotic algorithms typically depend on various parameters, the choice of which significantly affects the robot's performance. While an initial guess for the parameters may be obtained from dynamic models of the robot, parameters are usually tuned manually on the real system to achieve the best performance. Optimization algorithms, such as Bayesian optimization, have been used to automate this proc… ▽ More

    Submitted 7 April, 2020; v1 submitted 14 February, 2016; originally announced February 2016.

  27. Safe Controller Optimization for Quadrotors with Gaussian Processes

    Authors: Felix Berkenkamp, Angela P. Schoellig, Andreas Krause

    Abstract: One of the most fundamental problems when designing controllers for dynamic systems is the tuning of the controller parameters. Typically, a model of the system is used to obtain an initial controller, but ultimately the controller parameters must be tuned manually on the real system to achieve the best performance. To avoid this manual tuning step, methods from machine learning, such as Bayesian… ▽ More

    Submitted 16 August, 2017; v1 submitted 3 September, 2015; originally announced September 2015.

    Comments: IEEE International Conference on Robotics and Automation, 2016. 6 pages, 4 figures. A video of the experiments can be found at http://tiny.cc/icra16_video . A Python implementation of the algorithm is available at https://github.com/befelix/SafeOpt

    Journal ref: Proc. of the IEEE International Conference on Robotics and Automation (ICRA), 2016, pp. 491-496