Showing 1–5 of 5 results for author: Lewandowski, A

  1. arXiv:2406.06811

    cs.LG

    Learning Continually by Spectral Regularization

    Authors: Alex Lewandowski, Saurabh Kumar, Dale Schuurmans, András György, Marlos C. Machado

    Abstract: Loss of plasticity is a phenomenon where neural networks become more difficult to train during the course of learning. Continual learning algorithms seek to mitigate this effect by sustaining good predictive performance while maintaining network trainability. We develop new techniques for improving continual learning by first reconsidering how initialization can ensure trainability during early ph…

    Submitted 10 June, 2024; originally announced June 2024.

  2. arXiv:2312.00246

    cs.LG

    Directions of Curvature as an Explanation for Loss of Plasticity

    Authors: Alex Lewandowski, Haruto Tanaka, Dale Schuurmans, Marlos C. Machado

    Abstract: Loss of plasticity is a phenomenon in which neural networks lose their ability to learn from new experience. Despite being empirically observed in several problem settings, little is understood about the mechanisms that lead to loss of plasticity. In this paper, we offer a consistent explanation for loss of plasticity: Neural networks lose directions of curvature during training and that loss of p…

    Submitted 27 June, 2024; v1 submitted 30 November, 2023; originally announced December 2023.

  3. Sharing Experience Around Component Compositions

    Authors: Grégory Bourguin, Arnaud Lewandowski, Myriam Lewkowicz

    Abstract: Society currently lives in a world of tailorable systems in which end-users are able to transform their working environment while achieving their tasks, day to day and over time. Tailorability is most of the time achieved through dynamic component integration thanks to a huge number of components available over the Internet. In this context, the main problem for users is not anymore the integr…

    Submitted 30 November, 2023; originally announced November 2023.

    Journal ref: International Journal of Distributed Systems and Technologies, 2013, 4 (4), pp.15-28

  4. arXiv:2204.11897

    cs.LG

    Reinforcement Teaching

    Authors: Alex Lewandowski, Calarina Muslimani, Dale Schuurmans, Matthew E. Taylor, Jun Luo

    Abstract: Meta-learning strives to learn about and improve a student's machine learning algorithm. However, existing meta-learning methods either only work with differentiable algorithms or are hand-crafted to improve one specific component of an algorithm. We develop a unifying meta-learning framework, called Reinforcement Teaching, to improve the learning process of any algorithm. Under Reinforcement Teac…

    Submitted 22 May, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: First two authors contributed equally

  5. arXiv:2011.08895

    cs.LG cs.NE stat.ML

    ZORB: A Derivative-Free Backpropagation Algorithm for Neural Networks

    Authors: Varun Ranganathan, Alex Lewandowski

    Abstract: Gradient descent and backpropagation have enabled neural networks to achieve remarkable results in many real-world applications. Despite ongoing success, training a neural network with gradient descent can be a slow and strenuous affair. We present a simple yet faster training algorithm called Zeroth-Order Relaxed Backpropagation (ZORB). Instead of calculating gradients, ZORB uses the pseudoinvers…

    Submitted 17 November, 2020; originally announced November 2020.

    Comments: To appear in "Beyond Backpropagation - Novel Ideas for Training Neural Architectures" Workshop at NeurIPS 2020