Showing 1–9 of 9 results for author: Meulemans, A

  1. arXiv:2402.04437  [pdf, other]

    cs.CL cs.LG

    Learning to Extract Structured Entities Using Language Models

    Authors: Haolun Wu, Ye Yuan, Liana Mikaelyan, Alexander Meulemans, Xue Liu, James Hensman, Bhaskar Mitra

    Abstract: Recent advances in machine learning have significantly impacted the field of information extraction, with Language Models (LMs) playing a pivotal role in extracting structured information from unstructured text. Prior works typically represent information extraction as triplet-centric and use classical metrics such as precision and recall for evaluation. We reformulate the task to be entity-centri…

    Submitted 18 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  2. arXiv:2306.16803  [pdf, other]

    cs.LG stat.ML

    Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis

    Authors: Alexander Meulemans, Simon Schug, Seijin Kobayashi, Nathaniel Daw, Gregory Wayne

    Abstract: To make reinforcement learning more sample efficient, we need better credit assignment methods that measure an action's influence on future rewards. Building upon Hindsight Credit Assignment (HCA), we introduce Counterfactual Contribution Analysis (COCOA), a new family of model-based credit assignment algorithms. Our algorithms achieve precise credit assignment by measuring the contribution of act…

    Submitted 31 October, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 spotlight

  3. arXiv:2207.01332  [pdf, other]

    cs.LG cs.NE

    The least-control principle for local learning at equilibrium

    Authors: Alexander Meulemans, Nicolas Zucchet, Seijin Kobayashi, Johannes von Oswald, João Sacramento

    Abstract: Equilibrium systems are a powerful way to express neural computations. As special cases, they include models of great current interest in both neuroscience and machine learning, such as deep neural networks, equilibrium recurrent neural networks, deep equilibrium models, or meta-learning. Here, we present a new principle for learning such systems with a temporally- and spatially-local rule. Our pr…

    Submitted 31 October, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Published at NeurIPS 2022. 56 pages

    MSC Class: 68T07; ACM Class: I.2.6

  4. arXiv:2204.07249  [pdf, other]

    cs.NE cs.LG

    Minimizing Control for Credit Assignment with Strong Feedback

    Authors: Alexander Meulemans, Matilde Tristany Farinha, Maria R. Cervera, João Sacramento, Benjamin F. Grewe

    Abstract: The success of deep learning ignited interest in whether the brain learns hierarchical representations using gradient-based learning. However, current biologically plausible methods for gradient-based credit assignment in deep neural networks need infinitesimally small feedback signals, which is problematic in biologically realistic noisy environments and at odds with experimental evidence in neur…

    Submitted 22 June, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: 26 pages, 4 figures

    MSC Class: 68T07; ACM Class: I.2.6

  5. arXiv:2106.07887  [pdf, other]

    cs.LG

    Credit Assignment in Neural Networks through Deep Feedback Control

    Authors: Alexander Meulemans, Matilde Tristany Farinha, Javier García Ordóñez, Pau Vilimelis Aceituno, João Sacramento, Benjamin F. Grewe

    Abstract: The success of deep learning sparked interest in whether the brain learns by using similar techniques for assigning credit to each synaptic weight for its contribution to the network output. However, the majority of current attempts at biologically-plausible learning methods are either non-local in time, require highly specific connectivity motives, or have no clear link to any known mathematical…

    Submitted 17 January, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: 14 pages and 4 figures in the main manuscript; 49 pages and 15 figures in the supplementary materials

    MSC Class: 68T07; ACM Class: I.2.6

  6. arXiv:2101.12509  [pdf, ps, other]

    cs.LG cs.AI

    Challenges for Using Impact Regularizers to Avoid Negative Side Effects

    Authors: David Lindner, Kyle Matoba, Alexander Meulemans

    Abstract: Designing reward functions for reinforcement learning is difficult: besides specifying which behavior is rewarded for a task, the reward also has to discourage undesired outcomes. Misspecified reward functions can lead to unintended negative side effects, and overall unsafe behavior. To overcome this problem, recent work proposed to augment the specified reward function with an impact regularizer…

    Submitted 23 February, 2021; v1 submitted 29 January, 2021; originally announced January 2021.

    Comments: Presented at the SafeAI workshop at AAAI 2021

  7. arXiv:2007.12927  [pdf, other]

    cs.LG cs.CV stat.ML

    Neural networks with late-phase weights

    Authors: Johannes von Oswald, Seijin Kobayashi, Alexander Meulemans, Christian Henning, Benjamin F. Grewe, João Sacramento

    Abstract: The largely successful method of training neural networks is to learn their weights using some variant of stochastic gradient descent (SGD). Here, we show that the solutions found by SGD can be further improved by ensembling a subset of the weights in late stages of learning. At the end of learning, we obtain back a single model by taking a spatial average in weight space. To avoid incurring incre…

    Submitted 11 April, 2022; v1 submitted 25 July, 2020; originally announced July 2020.

    Comments: 25 pages, 6 figures

    Journal ref: Published as a conference paper at ICLR 2021

  8. arXiv:2006.14331  [pdf, other]

    cs.LG stat.ML

    A Theoretical Framework for Target Propagation

    Authors: Alexander Meulemans, Francesco S. Carzaniga, Johan A. K. Suykens, João Sacramento, Benjamin F. Grewe

    Abstract: The success of deep learning, a brain-inspired form of AI, has sparked interest in understanding how the brain could similarly learn across multiple layers of neurons. However, the majority of biologically-plausible learning algorithms have not yet reached the performance of backpropagation (BP), nor are they built on strong theoretical foundations. Here, we analyze target propagation (TP), a popu…

    Submitted 16 December, 2020; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: 13 pages and 4 figures in main manuscript; 41 pages and 8 figures in supplementary material

    MSC Class: 68T07

  9. arXiv:2006.12109  [pdf, other]

    cs.LG stat.ML

    Continual Learning in Recurrent Neural Networks

    Authors: Benjamin Ehret, Christian Henning, Maria R. Cervera, Alexander Meulemans, Johannes von Oswald, Benjamin F. Grewe

    Abstract: While a diverse collection of continual learning (CL) methods has been proposed to prevent catastrophic forgetting, a thorough investigation of their effectiveness for processing sequential data with recurrent neural networks (RNNs) is lacking. Here, we provide the first comprehensive evaluation of established CL methods on a variety of sequential data benchmarks. Specifically, we shed light on th…

    Submitted 10 March, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: Published at ICLR 2021