Search | arXiv e-print repository

Towards Scaling Difference Target Propagation by Learning Backprop Targets

Authors: Maxence Ernoult, Fabrice Normandin, Abhinav Moudgil, Sean Spinney, Eugene Belilovsky, Irina Rish, Blake Richards, Yoshua Bengio

Abstract: The development of biologically-plausible learning algorithms is important for understanding learning in the brain, but most of them fail to scale-up to real-world tasks, limiting their potential as explanations for learning by real brains. As such, it is important to explore learning algorithms that come with strong theoretical guarantees and can match the performance of backpropagation (BP) on c… ▽ More The development of biologically-plausible learning algorithms is important for understanding learning in the brain, but most of them fail to scale-up to real-world tasks, limiting their potential as explanations for learning by real brains. As such, it is important to explore learning algorithms that come with strong theoretical guarantees and can match the performance of backpropagation (BP) on complex tasks. One such algorithm is Difference Target Propagation (DTP), a biologically-plausible learning algorithm whose close relation with Gauss-Newton (GN) optimization has been recently established. However, the conditions under which this connection rigorously holds preclude layer-wise training of the feedback pathway synaptic weights (which is more biologically plausible). Moreover, good alignment between DTP weight updates and loss gradients is only loosely guaranteed and under very specific conditions for the architecture being trained. In this paper, we propose a novel feedback weight training scheme that ensures both that DTP approximates BP and that layer-wise feedback weight training can be restored without sacrificing any theoretical guarantees. Our theory is corroborated by experimental results and we report the best performance ever achieved by DTP on CIFAR-10 and ImageNet 32$\times$32 △ Less

Submitted 31 January, 2022; originally announced January 2022.

arXiv:2108.01005 [pdf, other]

Sequoia: A Software Framework to Unify Continual Learning Research

Authors: Fabrice Normandin, Florian Golemo, Oleksiy Ostapenko, Pau Rodriguez, Matthew D Riemer, Julio Hurtado, Khimya Khetarpal, Ryan Lindeborg, Lucas Cecchi, Timothée Lesort, Laurent Charlin, Irina Rish, Massimo Caccia

Abstract: The field of Continual Learning (CL) seeks to develop algorithms that accumulate knowledge and skills over time through interaction with non-stationary environments. In practice, a plethora of evaluation procedures (settings) and algorithmic solutions (methods) exist, each with their own potentially disjoint set of assumptions. This variety makes measuring progress in CL difficult. We propose a ta… ▽ More The field of Continual Learning (CL) seeks to develop algorithms that accumulate knowledge and skills over time through interaction with non-stationary environments. In practice, a plethora of evaluation procedures (settings) and algorithmic solutions (methods) exist, each with their own potentially disjoint set of assumptions. This variety makes measuring progress in CL difficult. We propose a taxonomy of settings, where each setting is described as a set of assumptions. A tree-shaped hierarchy emerges from this view, where more general settings become the parents of those with more restrictive assumptions. This makes it possible to use inheritance to share and reuse research, as develo** a method for a given setting also makes it directly applicable onto any of its children. We instantiate this idea as a publicly available software framework called Sequoia, which features a wide variety of settings from both the Continual Supervised Learning (CSL) and Continual Reinforcement Learning (CRL) domains. Sequoia also includes a growing suite of methods which are easy to extend and customize, in addition to more specialized methods from external libraries. We hope that this new paradigm and its first implementation can help unify and accelerate research in CL. You can help us grow the tree by visiting www.github.com/lebrice/Sequoia. △ Less

Submitted 5 June, 2023; v1 submitted 2 August, 2021; originally announced August 2021.

arXiv:2003.05856 [pdf, other]

Online Fast Adaptation and Knowledge Accumulation: a New Approach to Continual Learning

Authors: Massimo Caccia, Pau Rodriguez, Oleksiy Ostapenko, Fabrice Normandin, Min Lin, Lucas Caccia, Issam Laradji, Irina Rish, Alexandre Lacoste, David Vazquez, Laurent Charlin

Abstract: Continual learning studies agents that learn from streams of tasks without forgetting previous ones while adapting to new ones. Two recent continual-learning scenarios have opened new avenues of research. In meta-continual learning, the model is pre-trained to minimize catastrophic forgetting of previous tasks. In continual-meta learning, the aim is to train agents for faster remembering of previo… ▽ More Continual learning studies agents that learn from streams of tasks without forgetting previous ones while adapting to new ones. Two recent continual-learning scenarios have opened new avenues of research. In meta-continual learning, the model is pre-trained to minimize catastrophic forgetting of previous tasks. In continual-meta learning, the aim is to train agents for faster remembering of previous tasks through adaptation. In their original formulations, both methods have limitations. We stand on their shoulders to propose a more general scenario, OSAKA, where an agent must quickly solve new (out-of-distribution) tasks, while also requiring fast remembering. We show that current continual learning, meta-learning, meta-continual learning, and continual-meta learning techniques fail in this new scenario. We propose Continual-MAML, an online extension of the popular MAML algorithm as a strong baseline for this scenario. We empirically show that Continual-MAML is better suited to the new scenario than the aforementioned methodologies, as well as standard continual learning and meta-learning approaches. △ Less

Submitted 20 January, 2021; v1 submitted 12 March, 2020; originally announced March 2020.

Journal ref: NeurIPS 2020

arXiv:cond-mat/0608178 [pdf]

doi 10.1063/1.2346258

Growth, Structure and Properties of Epitaxial Thin Films of First Principles Predicted Multiferroic Bi2FeCrO6

Authors: Riad Nechache, Catalin Harnagea, Alain Pignolet, Francois Normandin, Teodor Veres, Louis-Philippe Carignan, David Menard

Abstract: We report the structural and physical properties of epitaxial Bi2FeCrO6 thin films on epitaxial SrRuO3 grown on (100)-oriented SrTiO3 substrates by pulsed laser ablation. The 300 nm thick films exhibit both ferroelectricity and magnetism at room temperature with a maximum dielectric polarization of 2.8 microC/cm2 at Emax = 82 kV/cm and a saturated magnetization of 20 emu/cc (corresponding to ~ 0… ▽ More We report the structural and physical properties of epitaxial Bi2FeCrO6 thin films on epitaxial SrRuO3 grown on (100)-oriented SrTiO3 substrates by pulsed laser ablation. The 300 nm thick films exhibit both ferroelectricity and magnetism at room temperature with a maximum dielectric polarization of 2.8 microC/cm2 at Emax = 82 kV/cm and a saturated magnetization of 20 emu/cc (corresponding to ~ 0.26 Bohr magneton per rhombohedral unit cell), with coercive fields below 100 Oe. Our results confirm the predictions made using ab-initio calculations about the existence of multiferroic properties in Bi2FeCrO6. △ Less

Submitted 7 August, 2006; originally announced August 2006.

Comments: Manuscript accepted for publication in Applied Physics Letters (in press). The paper consists of 1619 words, 13 references and 3 figures

Journal ref: APPLIED PHYSICS LETTERS 89, 102902 (2006)

Showing 1–4 of 4 results for author: Normandin, F