Skip to main content

Showing 1–25 of 25 results for author: Lesort, T

.
  1. arXiv:2403.08763  [pdf, other

    cs.LG cs.AI cs.CL

    Simple and Scalable Strategies to Continually Pre-train Large Language Models

    Authors: Adam Ibrahim, Benjamin Thérien, Kshitij Gupta, Mats L. Richter, Quentin Anthony, Timothée Lesort, Eugene Belilovsky, Irina Rish

    Abstract: Large language models (LLMs) are routinely pre-trained on billions of tokens, only to start the process over again once new data becomes available. A much more efficient solution is to continually pre-train these models, saving significant compute compared to re-training. However, the distribution shift induced by new data typically results in degraded performance on previous data or poor adaptati… ▽ More

    Submitted 26 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  2. arXiv:2311.01200  [pdf, other

    cs.CL cs.LG

    Continual Learning Under Language Shift

    Authors: Evangelia Gogoulou, Timothée Lesort, Magnus Boman, Joakim Nivre

    Abstract: The recent increase in data and model scale for language model pre-training has led to huge training costs. In scenarios where new data become available over time, updating a model instead of fully retraining it would therefore provide significant gains. We study the pros and cons of updating a language model when new data comes from new languages -- the case of continual learning under language s… ▽ More

    Submitted 27 June, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted to TSD 2024

  3. Amplifying Pathological Detection in EEG Signaling Pathways through Cross-Dataset Transfer Learning

    Authors: Mohammad-Javad Darvishi-Bayazi, Mohammad Sajjad Ghaemi, Timothee Lesort, Md Rifat Arefin, Jocelyn Faubert, Irina Rish

    Abstract: Pathology diagnosis based on EEG signals and decoding brain activity holds immense importance in understanding neurological disorders. With the advancement of artificial intelligence methods and machine learning techniques, the potential for accurate data-driven diagnoses and effective treatments has grown significantly. However, applying machine learning algorithms to real-world datasets presents… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  4. arXiv:2308.04014  [pdf, other

    cs.CL cs.LG

    Continual Pre-Training of Large Language Models: How to (re)warm your model?

    Authors: Kshitij Gupta, Benjamin Thérien, Adam Ibrahim, Mats L. Richter, Quentin Anthony, Eugene Belilovsky, Irina Rish, Timothée Lesort

    Abstract: Large language models (LLMs) are routinely pre-trained on billions of tokens, only to restart the process over again once new data becomes available. A much cheaper and more efficient solution would be to enable the continual pre-training of these models, i.e. updating pre-trained models with new data instead of re-training them from scratch. However, the distribution shift induced by novel data t… ▽ More

    Submitted 6 September, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  5. arXiv:2208.14307  [pdf, other

    cs.LG

    Beyond Supervised Continual Learning: a Review

    Authors: Benedikt Bagus, Alexander Gepperth, Timothée Lesort

    Abstract: Continual Learning (CL, sometimes also termed incremental learning) is a flavor of machine learning where the usual assumption of stationary data distribution is relaxed or omitted. When naively applying, e.g., DNNs in CL problems, changes in the data distribution can cause the so-called catastrophic forgetting (CF) effect: an abrupt loss of previous knowledge. Although many significant contributi… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: Accepted at the ESANN2022, 19 pages, 1 figure

  6. arXiv:2207.04543  [pdf, other

    cs.LG cs.AI

    Challenging Common Assumptions about Catastrophic Forgetting

    Authors: Timothée Lesort, Oleksiy Ostapenko, Diganta Misra, Md Rifat Arefin, Pau Rodríguez, Laurent Charlin, Irina Rish

    Abstract: Building learning agents that can progressively learn and accumulate knowledge is the core goal of the continual learning (CL) research field. Unfortunately, training a model on new data usually compromises the performance on past data. In the CL literature, this effect is referred to as catastrophic forgetting (CF). CF has been largely studied, and a plethora of methods have been proposed to addr… ▽ More

    Submitted 15 May, 2023; v1 submitted 10 July, 2022; originally announced July 2022.

  7. arXiv:2205.00329  [pdf, other

    cs.LG cs.AI

    Continual Learning with Foundation Models: An Empirical Study of Latent Replay

    Authors: Oleksiy Ostapenko, Timothee Lesort, Pau Rodríguez, Md Rifat Arefin, Arthur Douillard, Irina Rish, Laurent Charlin

    Abstract: Rapid development of large-scale pre-training has resulted in foundation models that can act as effective feature extractors on a variety of downstream tasks and domains. Motivated by this, we study the efficacy of pre-trained vision models as a foundation for downstream continual learning (CL) scenarios. Our goal is twofold. First, we want to understand the compute-accuracy trade-off between CL i… ▽ More

    Submitted 2 July, 2022; v1 submitted 30 April, 2022; originally announced May 2022.

  8. arXiv:2203.01012  [pdf, other

    cs.LG cs.AI

    Continual Feature Selection: Spurious Features in Continual Learning

    Authors: Timothée Lesort

    Abstract: Continual Learning (CL) is the research field addressing learning without forgetting when the data distribution is not static. This paper studies spurious features' influence on continual learning algorithms. We show that continual learning algorithms solve tasks by selecting features that are not generalizable. Our experiments highlight that continual learning algorithms face two related problems… ▽ More

    Submitted 25 May, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  9. arXiv:2108.01005  [pdf, other

    cs.LG

    Sequoia: A Software Framework to Unify Continual Learning Research

    Authors: Fabrice Normandin, Florian Golemo, Oleksiy Ostapenko, Pau Rodriguez, Matthew D Riemer, Julio Hurtado, Khimya Khetarpal, Ryan Lindeborg, Lucas Cecchi, Timothée Lesort, Laurent Charlin, Irina Rish, Massimo Caccia

    Abstract: The field of Continual Learning (CL) seeks to develop algorithms that accumulate knowledge and skills over time through interaction with non-stationary environments. In practice, a plethora of evaluation procedures (settings) and algorithmic solutions (methods) exist, each with their own potentially disjoint set of assumptions. This variety makes measuring progress in CL difficult. We propose a ta… ▽ More

    Submitted 5 June, 2023; v1 submitted 2 August, 2021; originally announced August 2021.

  10. arXiv:2106.01834  [pdf, other

    cs.LG cs.AI

    Continual Learning in Deep Networks: an Analysis of the Last Layer

    Authors: Timothée Lesort, Thomas George, Irina Rish

    Abstract: We study how different output layer parameterizations of a deep neural network affects learning and forgetting in continual learning settings. The following three effects can cause catastrophic forgetting in the output layer: (1) weights modifications, (2) interference, and (3) projection drift. In this paper, our goal is to provide more insights into how changing the output layer parameterization… ▽ More

    Submitted 17 August, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

  11. arXiv:2104.01678  [pdf, other

    cs.LG cs.AI

    Understanding Continual Learning Settings with Data Distribution Drift Analysis

    Authors: Timothée Lesort, Massimo Caccia, Irina Rish

    Abstract: Classical machine learning algorithms often assume that the data are drawn i.i.d. from a stationary probability distribution. Recently, continual learning emerged as a rapidly growing area of machine learning where this assumption is relaxed, i.e. where the data distribution is non-stationary and changes over time. This paper represents the state of data distribution by a context variable $c$. A d… ▽ More

    Submitted 10 July, 2022; v1 submitted 4 April, 2021; originally announced April 2021.

  12. arXiv:2102.06253  [pdf, other

    cs.LG

    Continuum: Simple Management of Complex Continual Learning Scenarios

    Authors: Arthur Douillard, Timothée Lesort

    Abstract: Continual learning is a machine learning sub-field specialized in settings with non-iid data. Hence, the training data distribution is not static and drifts through time. Those drifts might cause interferences in the trained model and knowledge learned on previous states of the data distribution might be forgotten. Continual learning's challenge is to create algorithms able to learn an ever-growin… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

    Comments: Code: https://github.com/Continvvm/continuum

  13. arXiv:2007.00487  [pdf, other

    cs.LG cs.AI cs.NE

    Continual Learning: Tackling Catastrophic Forgetting in Deep Neural Networks with Replay Processes

    Authors: Timothée Lesort

    Abstract: Humans learn all their life long. They accumulate knowledge from a sequence of learning experiences and remember the essential concepts without forgetting what they have learned previously. Artificial neural networks struggle to learn similarly. They often rely on data rigorously preprocessed to learn solutions to specific problems such as classification or regression. In particular, they forget t… ▽ More

    Submitted 8 December, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: Doctoral Thesis Manuscript, Institut Polytechnique de Paris (2020)

  14. arXiv:1912.03049  [pdf, other

    cs.LG stat.ML

    Regularization Shortcomings for Continual Learning

    Authors: Timothée Lesort, Andrei Stoian, David Filliat

    Abstract: In most machine learning algorithms, training data is assumed to be independent and identically distributed (iid). When it is not the case, the algorithm's performances are challenged, leading to the famous phenomenon of catastrophic forgetting. Algorithms dealing with it are gathered in the Continual Learning research field. In this paper, we study the regularization based approaches to continual… ▽ More

    Submitted 3 April, 2021; v1 submitted 6 December, 2019; originally announced December 2019.

  15. arXiv:1907.05855  [pdf, other

    cs.LG cs.AI stat.ML

    DisCoRL: Continual Reinforcement Learning via Policy Distillation

    Authors: René Traoré, Hugo Caselles-Dupré, Timothée Lesort, Te Sun, Guanghang Cai, Natalia Díaz-Rodríguez, David Filliat

    Abstract: In multi-task reinforcement learning there are two main challenges: at training time, the ability to learn different policies with a single model; at test time, inferring which of those policies applying without an external signal. In the case of continual reinforcement learning a third challenge arises: learning tasks sequentially without forgetting the previous ones. In this paper, we tackle the… ▽ More

    Submitted 11 July, 2019; originally announced July 2019.

    Comments: arXiv admin note: text overlap with arXiv:1906.04452

  16. arXiv:1907.00182  [pdf, other

    cs.LG cs.RO

    Continual Learning for Robotics: Definition, Framework, Learning Strategies, Opportunities and Challenges

    Authors: Timothée Lesort, Vincenzo Lomonaco, Andrei Stoian, Davide Maltoni, David Filliat, Natalia Díaz-Rodríguez

    Abstract: Continual learning (CL) is a particular machine learning paradigm where the data distribution and learning objective changes through time, or where all the training data and objective criteria are never available at once. The evolution of the learning process is modeled by a sequence of learning experiences where the goal is to be able to learn new skills all along the sequence without forgetting… ▽ More

    Submitted 22 November, 2019; v1 submitted 29 June, 2019; originally announced July 2019.

  17. arXiv:1906.04452  [pdf, other

    cs.LG cs.RO stat.ML

    Continual Reinforcement Learning deployed in Real-life using Policy Distillation and Sim2Real Transfer

    Authors: René Traoré, Hugo Caselles-Dupré, Timothée Lesort, Te Sun, Natalia Díaz-Rodríguez, David Filliat

    Abstract: We focus on the problem of teaching a robot to solve tasks presented sequentially, i.e., in a continual learning scenario. The robot should be able to solve all tasks it has encountered, without forgetting past tasks. We provide preliminary work on applying Reinforcement Learning to such setting, on 2D navigation tasks for a 3 wheel omni-directional robot. Our approach takes advantage of state rep… ▽ More

    Submitted 11 June, 2019; originally announced June 2019.

    Comments: accepted to the Workshop on Multi-Task and Lifelong Reinforcement Learning, ICML 2019

  18. arXiv:1901.08651  [pdf, other

    cs.LG cs.RO stat.ML

    Decoupling feature extraction from policy learning: assessing benefits of state representation learning in goal based robotics

    Authors: Antonin Raffin, Ashley Hill, René Traoré, Timothée Lesort, Natalia Díaz-Rodríguez, David Filliat

    Abstract: Scaling end-to-end reinforcement learning to control real robots from vision presents a series of challenges, in particular in terms of sample efficiency. Against end-to-end learning, state representation learning can help learn a compact, efficient and relevant representation of states that speeds up policy learning, reducing the number of samples needed, and that is easier to interpret. We evalu… ▽ More

    Submitted 23 June, 2019; v1 submitted 24 January, 2019; originally announced January 2019.

    Comments: Github repo: https://github.com/araffin/srl-zoo Documentation: https://srl-zoo.readthedocs.io/en/latest/, As part of SRL-Toolbox: https://s-rl-toolbox.readthedocs.io/en/latest/. Accepted to the Workshop on Structure & Priors in Reinforcement Learning at ICLR 2019

  19. arXiv:1812.09111  [pdf, other

    cs.LG cs.AI cs.CV

    Generative Models from the perspective of Continual Learning

    Authors: Timothée Lesort, Hugo Caselles-Dupré, Michael Garcia-Ortiz, Andrei Stoian, David Filliat

    Abstract: Which generative model is the most suitable for Continual Learning? This paper aims at evaluating and comparing generative models on disjoint sequential image generation tasks. We investigate how several models learn and forget, considering various strategies: rehearsal, regularization, generative replay and fine-tuning. We used two quantitative metrics to estimate the generation quality and memor… ▽ More

    Submitted 21 December, 2018; originally announced December 2018.

  20. arXiv:1810.12069  [pdf, other

    cs.LG cs.AI stat.ML

    Marginal Replay vs Conditional Replay for Continual Learning

    Authors: Timothée Lesort, Alexander Gepperth, Andrei Stoian, David Filliat

    Abstract: We present a new replay-based method of continual classification learning that we term "conditional replay" which generates samples and labels together by sampling from a distribution conditioned on the class. We compare conditional replay to another replay-based continual learning paradigm (which we term "marginal replay") that generates samples independently of their class and assigns labels i… ▽ More

    Submitted 1 July, 2019; v1 submitted 29 October, 2018; originally announced October 2018.

  21. arXiv:1809.09369  [pdf, other

    cs.LG stat.ML

    S-RL Toolbox: Environments, Datasets and Evaluation Metrics for State Representation Learning

    Authors: Antonin Raffin, Ashley Hill, René Traoré, Timothée Lesort, Natalia Díaz-Rodríguez, David Filliat

    Abstract: State representation learning aims at learning compact representations from raw observations in robotics and control applications. Approaches used for this objective are auto-encoders, learning forward models, inverse dynamics or learning using generic priors on the state characteristics. However, the diversity in applications and methods makes the field lack standard evaluation datasets, metrics… ▽ More

    Submitted 10 October, 2018; v1 submitted 25 September, 2018; originally announced September 2018.

    Comments: Github repo: https://github.com/araffin/robotics-rl-srl Documentation: https://s-rl-toolbox.readthedocs.io/en/latest/

  22. Training Discriminative Models to Evaluate Generative Ones

    Authors: Timothée Lesort, Andrei Stoain, Jean-François Goudou, David Filliat

    Abstract: Generative models are known to be difficult to assess. Recent works, especially on generative adversarial networks (GANs), produce good visual samples of varied categories of images. However, the validation of their quality is still difficult to define and there is no existing agreement on the best evaluation process. This paper aims at making a step toward an objective evaluation process for gene… ▽ More

    Submitted 1 July, 2019; v1 submitted 28 June, 2018; originally announced June 2018.

  23. arXiv:1804.00435  [pdf, other

    cs.CV cs.RO

    Exploring to learn visual saliency: The RL-IAC approach

    Authors: Celine Craye, Timothee Lesort, David Filliat, Jean-Francois Goudou

    Abstract: The problem of object localization and recognition on autonomous mobile robots is still an active topic. In this context, we tackle the problem of learning a model of visual saliency directly on a robot. This model, learned and improved on-the-fly during the robot's exploration provides an efficient tool for localizing relevant objects within their environment. The proposed approach includes two i… ▽ More

    Submitted 2 April, 2018; originally announced April 2018.

  24. arXiv:1802.04181  [pdf, ps, other

    cs.AI cs.LG stat.ML

    State Representation Learning for Control: An Overview

    Authors: Timothée Lesort, Natalia Díaz-Rodríguez, Jean-François Goudou, David Filliat

    Abstract: Representation learning algorithms are designed to learn abstract features that characterize data. State representation learning (SRL) focuses on a particular kind of representation learning where learned features are in low dimension, evolve through time, and are influenced by actions of an agent. The representation is learned to capture the variation in the environment generated by the agent's a… ▽ More

    Submitted 5 June, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

  25. arXiv:1709.05185  [pdf, other

    cs.AI cs.CV cs.RO

    Unsupervised state representation learning with robotic priors: a robustness benchmark

    Authors: Timothée Lesort, Mathieu Seurin, Xinrui Li, Natalia Díaz-Rodríguez, David Filliat

    Abstract: Our understanding of the world depends highly on our capacity to produce intuitive and simplified representations which can be easily used to solve problems. We reproduce this simplification process using a neural network to build a low dimensional state representation of the world from images acquired by a robot. As in Jonschkowski et al. 2015, we learn in an unsupervised way using prior knowledg… ▽ More

    Submitted 15 September, 2017; originally announced September 2017.

    Comments: ICRA 2018 submission