Showing 1–7 of 7 results for author: Henning, C

  1. arXiv:2111.11763  [pdf, other]

    cs.LG stat.ML

    Uncertainty estimation under model misspecification in neural network regression

    Authors: Maria R. Cervera, Rafael Dätwyler, Francesco D'Angelo, Hamza Keurti, Benjamin F. Grewe, Christian Henning

    Abstract: Although neural networks are powerful function approximators, the underlying modelling assumptions ultimately define the likelihood and thus the hypothesis class they are parameterizing. In classification, these assumptions are minimal as the commonly employed softmax is capable of representing any categorical distribution. In regression, however, restrictive assumptions on the type of continuous…

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: Published at the NeurIPS 2021 workshop "Your Model Is Wrong: Robustness and Misspecification in Probabilistic Modeling"

  2. arXiv:2110.06020  [pdf, other]

    cs.LG cs.AI stat.ML

    On out-of-distribution detection with Bayesian neural networks

    Authors: Francesco D'Angelo, Christian Henning

    Abstract: The question whether inputs are valid for the problem a neural network is trying to solve has sparked interest in out-of-distribution (OOD) detection. It is widely assumed that Bayesian neural networks (BNNs) are well suited for this task, as the endowed epistemic uncertainty should lead to disagreement in predictions on outliers. In this paper, we question this assumption and show that proper Bay…

    Submitted 21 February, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: This work is an extension of our previous workshop contribution: arXiv:2107.12248

  3. arXiv:2107.12248  [pdf, other]

    cs.LG stat.ML

    Are Bayesian neural networks intrinsically good at out-of-distribution detection?

    Authors: Christian Henning, Francesco D'Angelo, Benjamin F. Grewe

    Abstract: The need to avoid confident predictions on unfamiliar data has sparked interest in out-of-distribution (OOD) detection. It is widely assumed that Bayesian neural networks (BNN) are well suited for this task, as the endowed epistemic uncertainty should lead to disagreement in predictions on outliers. In this paper, we question this assumption and provide empirical evidence that proper Bayesian infe…

    Submitted 26 July, 2021; originally announced July 2021.

    Comments: Published at UDL Workshop, ICML 2021

  4. arXiv:2103.01133  [pdf, other]

    cs.LG cs.AI

    Posterior Meta-Replay for Continual Learning

    Authors: Christian Henning, Maria R. Cervera, Francesco D'Angelo, Johannes von Oswald, Regina Traber, Benjamin Ehret, Seijin Kobayashi, Benjamin F. Grewe, João Sacramento

    Abstract: Learning a sequence of tasks without access to i.i.d. observations is a widely studied form of continual learning (CL) that remains challenging. In principle, Bayesian learning directly applies to this setting, since recursive and one-off Bayesian updates yield the same result. In practice, however, recursive updating often leads to poor trade-off solutions across tasks because approximate inferen…

    Submitted 21 October, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: Published at NeurIPS 2021

  5. arXiv:2007.12927  [pdf, other]

    cs.LG cs.CV stat.ML

    Neural networks with late-phase weights

    Authors: Johannes von Oswald, Seijin Kobayashi, Alexander Meulemans, Christian Henning, Benjamin F. Grewe, João Sacramento

    Abstract: The largely successful method of training neural networks is to learn their weights using some variant of stochastic gradient descent (SGD). Here, we show that the solutions found by SGD can be further improved by ensembling a subset of the weights in late stages of learning. At the end of learning, we obtain back a single model by taking a spatial average in weight space. To avoid incurring incre…

    Submitted 11 April, 2022; v1 submitted 25 July, 2020; originally announced July 2020.

    Comments: 25 pages, 6 figures

    Journal ref: Published as a conference paper at ICLR 2021

  6. arXiv:2006.12109  [pdf, other]

    cs.LG stat.ML

    Continual Learning in Recurrent Neural Networks

    Authors: Benjamin Ehret, Christian Henning, Maria R. Cervera, Alexander Meulemans, Johannes von Oswald, Benjamin F. Grewe

    Abstract: While a diverse collection of continual learning (CL) methods has been proposed to prevent catastrophic forgetting, a thorough investigation of their effectiveness for processing sequential data with recurrent neural networks (RNNs) is lacking. Here, we provide the first comprehensive evaluation of established CL methods on a variety of sequential data benchmarks. Specifically, we shed light on th…

    Submitted 10 March, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: Published at ICLR 2021

  7. arXiv:1906.00695  [pdf, other]

    cs.LG cs.AI stat.ML

    Continual learning with hypernetworks

    Authors: Johannes von Oswald, Christian Henning, Benjamin F. Grewe, João Sacramento

    Abstract: Artificial neural networks suffer from catastrophic forgetting when they are sequentially trained on multiple tasks. To overcome this problem, we present a novel approach based on task-conditioned hypernetworks, i.e., networks that generate the weights of a target model based on task identity. Continual learning (CL) is less difficult for this class of models thanks to a simple key feature: instea…

    Submitted 11 April, 2022; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: Published at ICLR 2020

    MSC Class: 68T99