-
Towards Scaling Difference Target Propagation by Learning Backprop Targets
Authors:
Maxence Ernoult,
Fabrice Normandin,
Abhinav Moudgil,
Sean Spinney,
Eugene Belilovsky,
Irina Rish,
Blake Richards,
Yoshua Bengio
Abstract:
The development of biologically-plausible learning algorithms is important for understanding learning in the brain, but most of them fail to scale-up to real-world tasks, limiting their potential as explanations for learning by real brains. As such, it is important to explore learning algorithms that come with strong theoretical guarantees and can match the performance of backpropagation (BP) on c…
▽ More
The development of biologically-plausible learning algorithms is important for understanding learning in the brain, but most of them fail to scale-up to real-world tasks, limiting their potential as explanations for learning by real brains. As such, it is important to explore learning algorithms that come with strong theoretical guarantees and can match the performance of backpropagation (BP) on complex tasks. One such algorithm is Difference Target Propagation (DTP), a biologically-plausible learning algorithm whose close relation with Gauss-Newton (GN) optimization has been recently established. However, the conditions under which this connection rigorously holds preclude layer-wise training of the feedback pathway synaptic weights (which is more biologically plausible). Moreover, good alignment between DTP weight updates and loss gradients is only loosely guaranteed and under very specific conditions for the architecture being trained. In this paper, we propose a novel feedback weight training scheme that ensures both that DTP approximates BP and that layer-wise feedback weight training can be restored without sacrificing any theoretical guarantees. Our theory is corroborated by experimental results and we report the best performance ever achieved by DTP on CIFAR-10 and ImageNet 32$\times$32
△ Less
Submitted 31 January, 2022;
originally announced January 2022.
-
Sequoia: A Software Framework to Unify Continual Learning Research
Authors:
Fabrice Normandin,
Florian Golemo,
Oleksiy Ostapenko,
Pau Rodriguez,
Matthew D Riemer,
Julio Hurtado,
Khimya Khetarpal,
Ryan Lindeborg,
Lucas Cecchi,
Timothée Lesort,
Laurent Charlin,
Irina Rish,
Massimo Caccia
Abstract:
The field of Continual Learning (CL) seeks to develop algorithms that accumulate knowledge and skills over time through interaction with non-stationary environments. In practice, a plethora of evaluation procedures (settings) and algorithmic solutions (methods) exist, each with their own potentially disjoint set of assumptions. This variety makes measuring progress in CL difficult. We propose a ta…
▽ More
The field of Continual Learning (CL) seeks to develop algorithms that accumulate knowledge and skills over time through interaction with non-stationary environments. In practice, a plethora of evaluation procedures (settings) and algorithmic solutions (methods) exist, each with their own potentially disjoint set of assumptions. This variety makes measuring progress in CL difficult. We propose a taxonomy of settings, where each setting is described as a set of assumptions. A tree-shaped hierarchy emerges from this view, where more general settings become the parents of those with more restrictive assumptions. This makes it possible to use inheritance to share and reuse research, as develo** a method for a given setting also makes it directly applicable onto any of its children. We instantiate this idea as a publicly available software framework called Sequoia, which features a wide variety of settings from both the Continual Supervised Learning (CSL) and Continual Reinforcement Learning (CRL) domains. Sequoia also includes a growing suite of methods which are easy to extend and customize, in addition to more specialized methods from external libraries. We hope that this new paradigm and its first implementation can help unify and accelerate research in CL. You can help us grow the tree by visiting www.github.com/lebrice/Sequoia.
△ Less
Submitted 5 June, 2023; v1 submitted 2 August, 2021;
originally announced August 2021.
-
Online Fast Adaptation and Knowledge Accumulation: a New Approach to Continual Learning
Authors:
Massimo Caccia,
Pau Rodriguez,
Oleksiy Ostapenko,
Fabrice Normandin,
Min Lin,
Lucas Caccia,
Issam Laradji,
Irina Rish,
Alexandre Lacoste,
David Vazquez,
Laurent Charlin
Abstract:
Continual learning studies agents that learn from streams of tasks without forgetting previous ones while adapting to new ones. Two recent continual-learning scenarios have opened new avenues of research. In meta-continual learning, the model is pre-trained to minimize catastrophic forgetting of previous tasks. In continual-meta learning, the aim is to train agents for faster remembering of previo…
▽ More
Continual learning studies agents that learn from streams of tasks without forgetting previous ones while adapting to new ones. Two recent continual-learning scenarios have opened new avenues of research. In meta-continual learning, the model is pre-trained to minimize catastrophic forgetting of previous tasks. In continual-meta learning, the aim is to train agents for faster remembering of previous tasks through adaptation. In their original formulations, both methods have limitations. We stand on their shoulders to propose a more general scenario, OSAKA, where an agent must quickly solve new (out-of-distribution) tasks, while also requiring fast remembering. We show that current continual learning, meta-learning, meta-continual learning, and continual-meta learning techniques fail in this new scenario. We propose Continual-MAML, an online extension of the popular MAML algorithm as a strong baseline for this scenario. We empirically show that Continual-MAML is better suited to the new scenario than the aforementioned methodologies, as well as standard continual learning and meta-learning approaches.
△ Less
Submitted 20 January, 2021; v1 submitted 12 March, 2020;
originally announced March 2020.
-
Growth, Structure and Properties of Epitaxial Thin Films of First Principles Predicted Multiferroic Bi2FeCrO6
Authors:
Riad Nechache,
Catalin Harnagea,
Alain Pignolet,
Francois Normandin,
Teodor Veres,
Louis-Philippe Carignan,
David Menard
Abstract:
We report the structural and physical properties of epitaxial Bi2FeCrO6 thin films on epitaxial SrRuO3 grown on (100)-oriented SrTiO3 substrates by pulsed laser ablation. The 300 nm thick films exhibit both ferroelectricity and magnetism at room temperature with a maximum dielectric polarization of 2.8 microC/cm2 at Emax = 82 kV/cm and a saturated magnetization of 20 emu/cc (corresponding to ~ 0…
▽ More
We report the structural and physical properties of epitaxial Bi2FeCrO6 thin films on epitaxial SrRuO3 grown on (100)-oriented SrTiO3 substrates by pulsed laser ablation. The 300 nm thick films exhibit both ferroelectricity and magnetism at room temperature with a maximum dielectric polarization of 2.8 microC/cm2 at Emax = 82 kV/cm and a saturated magnetization of 20 emu/cc (corresponding to ~ 0.26 Bohr magneton per rhombohedral unit cell), with coercive fields below 100 Oe. Our results confirm the predictions made using ab-initio calculations about the existence of multiferroic properties in Bi2FeCrO6.
△ Less
Submitted 7 August, 2006;
originally announced August 2006.