Skip to main content

Showing 1–8 of 8 results for author: Leibfried, F

Searching in archive stat. Search in all archives.
.
  1. arXiv:2104.05674  [pdf, ps, other

    stat.ML cs.LG

    GPflux: A Library for Deep Gaussian Processes

    Authors: Vincent Dutordoir, Hugh Salimbeni, Eric Hambro, John McLeod, Felix Leibfried, Artem Artemev, Mark van der Wilk, James Hensman, Marc P. Deisenroth, ST John

    Abstract: We introduce GPflux, a Python library for Bayesian deep learning with a strong emphasis on deep Gaussian processes (DGPs). Implementing DGPs is a challenging endeavour due to the various mathematical subtleties that arise when dealing with multivariate Gaussian distributions and the complex bookkee** of indices. To date, there are no actively maintained, open-sourced and extendable libraries ava… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

  2. arXiv:2012.13962  [pdf, other

    cs.LG stat.ML

    A Tutorial on Sparse Gaussian Processes and Variational Inference

    Authors: Felix Leibfried, Vincent Dutordoir, ST John, Nicolas Durrande

    Abstract: Gaussian processes (GPs) provide a framework for Bayesian inference that can offer principled uncertainty estimates for a large range of problems. For example, if we consider regression problems with Gaussian likelihoods, a GP model enjoys a posterior in closed form. However, identifying the posterior GP scales cubically with the number of training examples and requires to store all examples in me… ▽ More

    Submitted 18 December, 2022; v1 submitted 27 December, 2020; originally announced December 2020.

  3. arXiv:1909.05950  [pdf, other

    cs.LG cs.AI stat.ML

    Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning

    Authors: Felix Leibfried, Jordi Grau-Moya

    Abstract: Cumulative entropy regularization introduces a regulatory signal to the reinforcement learning (RL) problem that encourages policies with high-entropy actions, which is equivalent to enforcing small deviations from a uniform reference marginal policy. This has been shown to improve exploration and robustness, and it tackles the value overestimation problem. It also leads to a significant performan… ▽ More

    Submitted 11 September, 2019; originally announced September 2019.

    Comments: Proceedings of the 3rd Conference on Robot Learning (CoRL), Osaka, Japan, 2019

  4. arXiv:1907.12392  [pdf, other

    cs.LG cs.AI stat.ML

    A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment

    Authors: Felix Leibfried, Sergio Pascual-Diaz, Jordi Grau-Moya

    Abstract: Empowerment is an information-theoretic method that can be used to intrinsically motivate learning agents. It attempts to maximize an agent's control over the environment by encouraging visiting states with a large number of reachable next states. Empowered learning has been shown to lead to complex behaviors, without requiring an explicit reward signal. In this paper, we investigate the use of em… ▽ More

    Submitted 8 January, 2020; v1 submitted 26 July, 2019; originally announced July 2019.

    Comments: Proceedings of the 33rd Conference on Neural Information Processing Systems (NeurIPS), Vancouver, Canada, 2019

  5. arXiv:1810.05546  [pdf, other

    stat.ML cs.LG

    Uncertainty in Neural Networks: Approximately Bayesian Ensembling

    Authors: Tim Pearce, Felix Leibfried, Alexandra Brintrup, Mohamed Zaki, Andy Neely

    Abstract: Understanding the uncertainty of a neural network's (NN) predictions is essential for many purposes. The Bayesian framework provides a principled approach to this, however applying it to NNs is challenging due to large numbers of parameters and data. Ensembling NNs provides an easily implementable, scalable method for uncertainty quantification, however, it has been criticised for not being Bayesi… ▽ More

    Submitted 26 February, 2020; v1 submitted 12 October, 2018; originally announced October 2018.

    Comments: Please cite as published in AISTATS 2020

    Journal ref: The 23rd International Conference on Artificial Intelligence and Statistics, AISTATS 2020

  6. arXiv:1809.01906  [pdf, other

    cs.LG stat.ML

    Model-Based Regularization for Deep Reinforcement Learning with Transcoder Networks

    Authors: Felix Leibfried, Peter Vrancx

    Abstract: This paper proposes a new optimization objective for value-based deep reinforcement learning. We extend conventional Deep Q-Networks (DQNs) by adding a model-learning component yielding a transcoder network. The prediction errors for the model are included in the basic DQN loss as additional regularizers. This augmented objective leads to a richer training signal that provides feedback at every ti… ▽ More

    Submitted 20 November, 2018; v1 submitted 6 September, 2018; originally announced September 2018.

    Comments: Presented at the NIPS Deep Reinforcement Learning Workshop, Montreal, Canada, 2018

  7. arXiv:1708.01867  [pdf, other

    cs.AI cs.LG stat.ML

    An Information-Theoretic Optimality Principle for Deep Reinforcement Learning

    Authors: Felix Leibfried, Jordi Grau-Moya, Haitham Bou-Ammar

    Abstract: We methodologically address the problem of Q-value overestimation in deep reinforcement learning to handle high-dimensional state spaces efficiently. By adapting concepts from information theory, we introduce an intrinsic penalty signal encouraging reduced Q-value estimates. The resultant algorithm encompasses a wide range of learning outcomes containing deep Q-networks as a special case. Differen… ▽ More

    Submitted 20 November, 2018; v1 submitted 6 August, 2017; originally announced August 2017.

    Comments: Presented at the NIPS Deep Reinforcement Learning Workshop, Montreal, Canada, 2018

  8. arXiv:1611.07078  [pdf, other

    cs.AI cs.LG stat.ML

    A Deep Learning Approach for Joint Video Frame and Reward Prediction in Atari Games

    Authors: Felix Leibfried, Nate Kushman, Katja Hofmann

    Abstract: Reinforcement learning is concerned with identifying reward-maximizing behaviour policies in environments that are initially unknown. State-of-the-art reinforcement learning approaches, such as deep Q-networks, are model-free and learn to act effectively across a wide range of environments such as Atari games, but require huge amounts of data. Model-based techniques are more data-efficient, but ne… ▽ More

    Submitted 17 August, 2017; v1 submitted 21 November, 2016; originally announced November 2016.

    Comments: Presented at the ICML 2017 Workshop on Principled Approaches to Deep Learning, Sydney, Australia, 2017