
Showing 1–17 of 17 results for author: Grewe, B F

Searching in archive cs.
  1. arXiv:2310.01165  [pdf, other]

    cs.LG cs.AI

    Towards guarantees for parameter isolation in continual learning

    Authors: Giulia Lanzillotta, Sidak Pal Singh, Benjamin F. Grewe, Thomas Hofmann

    Abstract: Deep learning has proved to be a successful paradigm for solving many challenges in machine learning. However, deep neural networks fail when trained sequentially on multiple tasks, a shortcoming known as catastrophic forgetting in the continual learning literature. Despite a recent flourish of learning algorithms successfully addressing this problem, we find that provable guarantees against catas…

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: 10 pages, 3 figures

  2. A Comprehensive Survey of Deep Transfer Learning for Anomaly Detection in Industrial Time Series: Methods, Applications, and Directions

    Authors: Peng Yan, Ahmed Abdulkadir, Paul-Philipp Luley, Matthias Rosenthal, Gerrit A. Schatte, Benjamin F. Grewe, Thilo Stadelmann

    Abstract: Automating the monitoring of industrial processes has the potential to enhance efficiency and optimize quality by promptly detecting abnormal events and thus facilitating timely interventions. Deep learning, with its capacity to discern non-trivial patterns within large datasets, plays a pivotal role in this process. Standard deep learning methods are suitable to solve a specific task given a spec…

    Submitted 10 January, 2024; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: 27 pages, 8 figures, 2 tables, published in IEEE Access

    ACM Class: I.2.0; I.2.4

    Journal ref: IEEE Access 12 (2024) 3768-3789

  3. arXiv:2305.05354  [pdf, other]

    cs.RO cs.AI

    Safe Deep RL for Intraoperative Planning of Pedicle Screw Placement

    Authors: Yunke Ao, Hooman Esfandiari, Fabio Carrillo, Yarden As, Mazda Farshad, Benjamin F. Grewe, Andreas Krause, Philipp Fuernstahl

    Abstract: Spinal fusion surgery requires highly accurate implantation of pedicle screw implants, which must be conducted in critical proximity to vital structures with a limited view of anatomy. Robotic surgery systems have been proposed to improve placement accuracy; however, state-of-the-art systems suffer from the limitations of open-loop approaches, as they follow traditional concepts of preoperative pl…

    Submitted 10 May, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: 10 pages, 4 figures

  4. arXiv:2212.04316  [pdf, other]

    cs.NE cs.CV q-bio.NC

    Bio-Inspired, Task-Free Continual Learning through Activity Regularization

    Authors: Francesco Lässig, Pau Vilimelis Aceituno, Martino Sorbaro, Benjamin F. Grewe

    Abstract: The ability to sequentially learn multiple tasks without forgetting is a key skill of biological brains, whereas it represents a major challenge to the field of deep learning. To avoid catastrophic forgetting, various continual learning (CL) approaches have been devised. However, these usually require discrete task boundaries. This requirement seems biologically implausible and often limits the ap…

    Submitted 8 December, 2022; originally announced December 2022.

  5. arXiv:2210.08942  [pdf, other]

    cs.LG

    Meta-Learning via Classifier(-free) Diffusion Guidance

    Authors: Elvis Nava, Seijin Kobayashi, Yifei Yin, Robert K. Katzschmann, Benjamin F. Grewe

    Abstract: We introduce meta-learning algorithms that perform zero-shot weight-space adaptation of neural network models to unseen tasks. Our methods repurpose the popular generative image synthesis techniques of natural language guidance and diffusion models to generate neural network weights adapted for tasks. We first train an unconditional generative hypernetwork model to produce neural network weights;…

    Submitted 31 January, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

  6. arXiv:2207.12067  [pdf, other]

    cs.LG math.GR stat.ML

    Homomorphism Autoencoder -- Learning Group Structured Representations from Observed Transitions

    Authors: Hamza Keurti, Hsiao-Ru Pan, Michel Besserve, Benjamin F. Grewe, Bernhard Schölkopf

    Abstract: How agents can learn internal models that veridically represent interactions with the real world is a largely open question. As machine learning is moving towards representations containing not just observational but also interventional knowledge, we study this problem using tools from representation learning and group theory. We propose methods enabling an agent acting upon the world to learn int…

    Submitted 2 July, 2024; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted at ICML 2023; presented at the Symmetry and Geometry in Neural Representations Workshop (NeurReps) @ NeurIPS 2022; 26 pages, 17 figures

  7. arXiv:2205.00002  [pdf, other]

    cs.AI q-bio.NC

    A Theory of Natural Intelligence

    Authors: Christoph von der Malsburg, Thilo Stadelmann, Benjamin F. Grewe

    Abstract: Introduction: In contrast to current AI technology, natural intelligence -- the kind of autonomous intelligence that is realized in the brains of animals and humans to attain in their natural environment goals defined by a repertoire of innate behavioral schemata -- is far superior in terms of learning speed, generalization capabilities, autonomy and creativity. How are these strengths, by what me…

    Submitted 22 April, 2022; originally announced May 2022.

    ACM Class: I.2

  8. arXiv:2204.12584  [pdf, other]

    cs.RO cs.LG physics.flu-dyn

    Fast Aquatic Swimmer Optimization with Differentiable Projective Dynamics and Neural Network Hydrodynamic Models

    Authors: Elvis Nava, John Z. Zhang, Mike Y. Michelis, Tao Du, Pingchuan Ma, Benjamin F. Grewe, Wojciech Matusik, Robert K. Katzschmann

    Abstract: Aquatic locomotion is a classic fluid-structure interaction (FSI) problem of interest to biologists and engineers. Solving the fully coupled FSI equations for incompressible Navier-Stokes and finite elasticity is computationally expensive. Optimizing robotic swimmer design within such a system generally involves cumbersome, gradient-free procedures on top of the already costly simulation. To addre…

    Submitted 22 June, 2022; v1 submitted 30 March, 2022; originally announced April 2022.

    Comments: ICML 2022

  9. arXiv:2204.07249  [pdf, other]

    cs.NE cs.LG

    Minimizing Control for Credit Assignment with Strong Feedback

    Authors: Alexander Meulemans, Matilde Tristany Farinha, Maria R. Cervera, João Sacramento, Benjamin F. Grewe

    Abstract: The success of deep learning ignited interest in whether the brain learns hierarchical representations using gradient-based learning. However, current biologically plausible methods for gradient-based credit assignment in deep neural networks need infinitesimally small feedback signals, which is problematic in biologically realistic noisy environments and at odds with experimental evidence in neur…

    Submitted 22 June, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: 26 pages, 4 figures

    MSC Class: 68T07; ACM Class: I.2.6

  10. arXiv:2111.11763  [pdf, other]

    cs.LG stat.ML

    Uncertainty estimation under model misspecification in neural network regression

    Authors: Maria R. Cervera, Rafael Dätwyler, Francesco D'Angelo, Hamza Keurti, Benjamin F. Grewe, Christian Henning

    Abstract: Although neural networks are powerful function approximators, the underlying modelling assumptions ultimately define the likelihood and thus the hypothesis class they are parameterizing. In classification, these assumptions are minimal as the commonly employed softmax is capable of representing any categorical distribution. In regression, however, restrictive assumptions on the type of continuous…

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: Published at the NeurIPS 2021 workshop "Your Model Is Wrong: Robustness and Misspecification in Probabilistic Modeling"

  11. arXiv:2107.12248  [pdf, other]

    cs.LG stat.ML

    Are Bayesian neural networks intrinsically good at out-of-distribution detection?

    Authors: Christian Henning, Francesco D'Angelo, Benjamin F. Grewe

    Abstract: The need to avoid confident predictions on unfamiliar data has sparked interest in out-of-distribution (OOD) detection. It is widely assumed that Bayesian neural networks (BNN) are well suited for this task, as the endowed epistemic uncertainty should lead to disagreement in predictions on outliers. In this paper, we question this assumption and provide empirical evidence that proper Bayesian infe…

    Submitted 26 July, 2021; originally announced July 2021.

    Comments: Published at UDL Workshop, ICML 2021

  12. arXiv:2106.07887  [pdf, other]

    cs.LG

    Credit Assignment in Neural Networks through Deep Feedback Control

    Authors: Alexander Meulemans, Matilde Tristany Farinha, Javier García Ordóñez, Pau Vilimelis Aceituno, João Sacramento, Benjamin F. Grewe

    Abstract: The success of deep learning sparked interest in whether the brain learns by using similar techniques for assigning credit to each synaptic weight for its contribution to the network output. However, the majority of current attempts at biologically-plausible learning methods are either non-local in time, require highly specific connectivity motifs, or have no clear link to any known mathematical…

    Submitted 17 January, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: 14 pages and 4 figures in the main manuscript; 49 pages and 15 figures in the supplementary materials

    MSC Class: 68T07; ACM Class: I.2.6

  13. arXiv:2103.01133  [pdf, other]

    cs.LG cs.AI

    Posterior Meta-Replay for Continual Learning

    Authors: Christian Henning, Maria R. Cervera, Francesco D'Angelo, Johannes von Oswald, Regina Traber, Benjamin Ehret, Seijin Kobayashi, Benjamin F. Grewe, João Sacramento

    Abstract: Learning a sequence of tasks without access to i.i.d. observations is a widely studied form of continual learning (CL) that remains challenging. In principle, Bayesian learning directly applies to this setting, since recursive and one-off Bayesian updates yield the same result. In practice, however, recursive updating often leads to poor trade-off solutions across tasks because approximate inferen…

    Submitted 21 October, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: Published at NeurIPS 2021

  14. arXiv:2007.12927  [pdf, other]

    cs.LG cs.CV stat.ML

    Neural networks with late-phase weights

    Authors: Johannes von Oswald, Seijin Kobayashi, Alexander Meulemans, Christian Henning, Benjamin F. Grewe, João Sacramento

    Abstract: The largely successful method of training neural networks is to learn their weights using some variant of stochastic gradient descent (SGD). Here, we show that the solutions found by SGD can be further improved by ensembling a subset of the weights in late stages of learning. At the end of learning, we obtain back a single model by taking a spatial average in weight space. To avoid incurring incre…

    Submitted 11 April, 2022; v1 submitted 25 July, 2020; originally announced July 2020.

    Comments: 25 pages, 6 figures

    Journal ref: Published as a conference paper at ICLR 2021

  15. arXiv:2006.14331  [pdf, other]

    cs.LG stat.ML

    A Theoretical Framework for Target Propagation

    Authors: Alexander Meulemans, Francesco S. Carzaniga, Johan A. K. Suykens, João Sacramento, Benjamin F. Grewe

    Abstract: The success of deep learning, a brain-inspired form of AI, has sparked interest in understanding how the brain could similarly learn across multiple layers of neurons. However, the majority of biologically-plausible learning algorithms have not yet reached the performance of backpropagation (BP), nor are they built on strong theoretical foundations. Here, we analyze target propagation (TP), a popu…

    Submitted 16 December, 2020; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: 13 pages and 4 figures in main manuscript; 41 pages and 8 figures in supplementary material

    MSC Class: 68T07

  16. arXiv:2006.12109  [pdf, other]

    cs.LG stat.ML

    Continual Learning in Recurrent Neural Networks

    Authors: Benjamin Ehret, Christian Henning, Maria R. Cervera, Alexander Meulemans, Johannes von Oswald, Benjamin F. Grewe

    Abstract: While a diverse collection of continual learning (CL) methods has been proposed to prevent catastrophic forgetting, a thorough investigation of their effectiveness for processing sequential data with recurrent neural networks (RNNs) is lacking. Here, we provide the first comprehensive evaluation of established CL methods on a variety of sequential data benchmarks. Specifically, we shed light on th…

    Submitted 10 March, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: Published at ICLR 2021

  17. arXiv:1906.00695  [pdf, other]

    cs.LG cs.AI stat.ML

    Continual learning with hypernetworks

    Authors: Johannes von Oswald, Christian Henning, Benjamin F. Grewe, João Sacramento

    Abstract: Artificial neural networks suffer from catastrophic forgetting when they are sequentially trained on multiple tasks. To overcome this problem, we present a novel approach based on task-conditioned hypernetworks, i.e., networks that generate the weights of a target model based on task identity. Continual learning (CL) is less difficult for this class of models thanks to a simple key feature: instea…

    Submitted 11 April, 2022; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: Published at ICLR 2020

    MSC Class: 68T99