Skip to main content

Showing 1–19 of 19 results for author: Francois-Lavet, V

.
  1. arXiv:2406.09079  [pdf, other

    cs.LG

    Latent Assistance Networks: Rediscovering Hyperbolic Tangents in RL

    Authors: Jacob E. Kooi, Mark Hoogendoorn, Vincent François-Lavet

    Abstract: Activation functions are one of the key components of a neural network. The most commonly used activation functions can be classed into the category of continuously differentiable (e.g. tanh) and linear-unit functions (e.g. ReLU), both having their own strengths and drawbacks with respect to downstream performance and representation capacity through learning (e.g. measured by the number of dead ne… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 22 pages, 17 figures, 4 tables

  2. arXiv:2212.06451  [pdf, other

    cs.AI cs.LG

    Improving generalization in reinforcement learning through forked agents

    Authors: Olivier Moulin, Vincent Francois-Lavet, Mark Hoogendoorn

    Abstract: An eco-system of agents each having their own policy with some, but limited, generalizability has proven to be a reliable approach to increase generalization across procedurally generated environments. In such an approach, new agents are regularly added to the eco-system when encountering a new environment that is outside of the scope of the eco-system. The speed of adaptation and general effectiv… ▽ More

    Submitted 29 December, 2022; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: 12 pages

  3. A Machine with Short-Term, Episodic, and Semantic Memory Systems

    Authors: Taewoon Kim, Michael Cochez, Vincent François-Lavet, Mark Neerincx, Piek Vossen

    Abstract: Inspired by the cognitive science theory of the explicit human memory systems, we have modeled an agent with short-term, episodic, and semantic memory systems, each of which is modeled with a knowledge graph. To evaluate this system and analyze the behavior of this agent, we designed and released our own reinforcement learning agent environment, "the Room", where an agent has to learn how to encod… ▽ More

    Submitted 8 July, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence (2023), 37(1), 48-56

  4. Disentangled (Un)Controllable Features

    Authors: Jacob E. Kooi, Mark Hoogendoorn, Vincent François-Lavet

    Abstract: In the context of MDPs with high-dimensional states, downstream tasks are predominantly applied on a compressed, low-dimensional representation of the original input space. A variety of learning objectives have therefore been used to attain useful representations. However, these representations usually lack interpretability of the different features. We present a novel approach that is able to dis… ▽ More

    Submitted 3 January, 2024; v1 submitted 31 October, 2022; originally announced November 2022.

    Comments: 14 pages (8 main paper pages), 15 figures

  5. arXiv:2207.08457  [pdf, other

    cs.LG cs.AI stat.ME

    A Meta-Reinforcement Learning Algorithm for Causal Discovery

    Authors: Andreas Sauter, Erman Acar, Vincent François-Lavet

    Abstract: Causal discovery is a major task with the utmost importance for machine learning since causal structures can enable models to go beyond pure correlation-based inference and significantly boost their performance. However, finding causal structures from data poses a significant challenge both in computational effort and accuracy, let alone its impossibility without interventions in general. In this… ▽ More

    Submitted 21 February, 2023; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: Camera-ready version for CLEAR23

  6. arXiv:2204.06550  [pdf, other

    cs.AI cs.MA

    Improving generalization to new environments and removing catastrophic forgetting in Reinforcement Learning by using an eco-system of agents

    Authors: Olivier Moulin, Vincent Francois-Lavet, Paul Elbers, Mark Hoogendoorn

    Abstract: Adapting a Reinforcement Learning (RL) agent to an unseen environment is a difficult task due to typical over-fitting on the training environment. RL agents are often capable of solving environments very close to the trained environment, but when environments become substantially different, their performance quickly drops. When agents are retrained on new environments, a second issue arises: there… ▽ More

    Submitted 1 July, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: 9 pages, 12 figures

    MSC Class: I.2.11

  7. arXiv:2204.01611  [pdf, other

    cs.AI

    A Machine With Human-Like Memory Systems

    Authors: Taewoon Kim, Michael Cochez, Vincent Francois-Lavet, Mark Neerincx, Piek Vossen

    Abstract: Inspired by the cognitive science theory, we explicitly model an agent with both semantic and episodic memory systems, and show that it is better than having just one of the two memory systems. In order to show this, we have designed and released our own challenging environment, "the Room", compatible with OpenAI Gym, where an agent has to properly learn how to encode, store, and retrieve memories… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: Submitted to Human-Centered Design of Symbiotic Hybrid Intelligence 2022 (https://ii.tudelft.nl/humancenteredsymbioticHI/)

  8. arXiv:2111.11525  [pdf, other

    cs.LG cs.AI

    Component Transfer Learning for Deep RL Based on Abstract Representations

    Authors: Geoffrey van Driessel, Vincent Francois-Lavet

    Abstract: In this work we investigate a specific transfer learning approach for deep reinforcement learning in the context where the internal dynamics between two tasks are the same but the visual representations differ. We learn a low-dimensional encoding of the environment, meant to capture summarizing abstractions, from which the internal dynamics and value functions are learned. Transfer is then obtaine… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: Workshop paper NeurIPS 2021

  9. arXiv:2110.01411  [pdf, other

    cs.LG cs.AI

    Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey

    Authors: Amjad Yousef Majid, Serge Saaybi, Tomas van Rietbergen, Vincent Francois-Lavet, R Venkatesha Prasad, Chris Verhoeven

    Abstract: Deep Reinforcement Learning (DRL) and Evolution Strategies (ESs) have surpassed human-level control in many sequential decision-making problems, yet many open challenges still exist. To get insights into the strengths and weaknesses of DRL versus ESs, an analysis of their respective capabilities and limitations is provided. After presenting their fundamental concepts and algorithms, a comparison i… ▽ More

    Submitted 28 September, 2021; originally announced October 2021.

  10. arXiv:2102.07097  [pdf, other

    cs.LG cs.AI

    Domain Adversarial Reinforcement Learning

    Authors: Bonnie Li, Vincent François-Lavet, Thang Doan, Joelle Pineau

    Abstract: We consider the problem of generalization in reinforcement learning where visual aspects of the observations might differ, e.g. when there are different backgrounds or change in contrast, brightness, etc. We assume that our agent has access to only a few of the MDPs from the MDP distribution during training. The performance of the agent is then reported on new unknown test domains drawn from the d… ▽ More

    Submitted 14 February, 2021; originally announced February 2021.

  11. arXiv:2009.13579  [pdf, other

    cs.LG stat.ML

    Novelty Search in Representational Space for Sample Efficient Exploration

    Authors: Ruo Yu Tao, Vincent François-Lavet, Joelle Pineau

    Abstract: We present a new approach for efficient exploration which leverages a low-dimensional encoding of the environment learned with a combination of model-based and model-free objectives. Our approach uses intrinsic rewards that are based on the distance of nearest neighbors in the low dimensional representational space to gauge novelty. We then leverage these intrinsic rewards for sample-efficient exp… ▽ More

    Submitted 15 April, 2022; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: 10 pages + references + appendix. Oral presentation at NeurIPS 2020

  12. arXiv:2003.01181  [pdf, other

    cs.LG cs.CV stat.ML

    RandomNet: Towards Fully Automatic Neural Architecture Design for Multimodal Learning

    Authors: Stefano Alletto, Shenyang Huang, Vincent Francois-Lavet, Yohei Nakata, Guillaume Rabusseau

    Abstract: Almost all neural architecture search methods are evaluated in terms of performance (i.e. test accuracy) of the model structures that it finds. Should it be the only metric for a good autoML approach? To examine aspects beyond performance, we propose a set of criteria aimed at evaluating the core of autoML problem: the amount of human intervention required to deploy these methods into real world s… ▽ More

    Submitted 2 March, 2020; originally announced March 2020.

    Comments: 6 pages, 1 figures

  13. arXiv:1909.06686  [pdf, other

    cs.LG stat.ML

    Neural Architecture Search for Class-incremental Learning

    Authors: Shenyang Huang, Vincent François-Lavet, Guillaume Rabusseau

    Abstract: In class-incremental learning, a model learns continuously from a sequential data stream in which new classes occur. Existing methods often rely on static architectures that are manually crafted. These methods can be prone to capacity saturation because a neural network's ability to generalize to new concepts is limited by its fixed capacity. To understand how to expand a continual learner, we foc… ▽ More

    Submitted 14 September, 2019; originally announced September 2019.

    Comments: 8 pages, 10 Figures

  14. arXiv:1811.12560  [pdf, other

    cs.LG cs.AI stat.ML

    An Introduction to Deep Reinforcement Learning

    Authors: Vincent Francois-Lavet, Peter Henderson, Riashat Islam, Marc G. Bellemare, Joelle Pineau

    Abstract: Deep reinforcement learning is the combination of reinforcement learning (RL) and deep learning. This field of research has been able to solve a wide range of complex decision-making tasks that were previously out of reach for a machine. Thus, deep RL opens up many new applications in domains such as healthcare, robotics, smart grids, finance, and many more. This manuscript provides an introductio… ▽ More

    Submitted 3 December, 2018; v1 submitted 29 November, 2018; originally announced November 2018.

    Journal ref: Foundations and Trends in Machine Learning: Vol. 11, No. 3-4, 2018

  15. arXiv:1809.04506  [pdf, other

    cs.LG cs.AI stat.ML

    Combined Reinforcement Learning via Abstract Representations

    Authors: Vincent François-Lavet, Yoshua Bengio, Doina Precup, Joelle Pineau

    Abstract: In the quest for efficient and robust reinforcement learning methods, both model-free and model-based approaches offer advantages. In this paper we propose a new way of explicitly bridging both approaches via a shared low-dimensional learned encoding of the environment, meant to capture summarizing abstractions. We show that the modularity brought by this approach leads to good generalization whil… ▽ More

    Submitted 18 November, 2018; v1 submitted 12 September, 2018; originally announced September 2018.

    Comments: Accepted to the Thirty-Third AAAI Conference On Artificial Intelligence, 2019

  16. arXiv:1805.03359  [pdf, other

    cs.LG cs.AI stat.ML

    Reward Estimation for Variance Reduction in Deep Reinforcement Learning

    Authors: Joshua Romoff, Peter Henderson, Alexandre Piché, Vincent Francois-Lavet, Joelle Pineau

    Abstract: Reinforcement Learning (RL) agents require the specification of a reward signal for learning behaviours. However, introduction of corrupt or stochastic rewards can yield high variance in learning. Such corruption may be a direct result of goal misspecification, randomness in the reward signal, or correlation of the reward with external factors that are not known to the agent. Corruption or stochas… ▽ More

    Submitted 7 November, 2018; v1 submitted 8 May, 2018; originally announced May 2018.

    Comments: Version 1 as appears in the International Conference on Learning Representations (ICLR) 2018 Workshop Track; Version 2 as appears in the Proceedings of The 2nd Conference on Robot Learning

  17. arXiv:1709.07796  [pdf, other

    stat.ML cs.AI cs.LG

    On overfitting and asymptotic bias in batch reinforcement learning with partial observability

    Authors: Vincent Francois-Lavet, Guillaume Rabusseau, Joelle Pineau, Damien Ernst, Raphael Fonteneau

    Abstract: This paper provides an analysis of the tradeoff between asymptotic bias (suboptimality with unlimited data) and overfitting (additional suboptimality due to limited data) in the context of reinforcement learning with partial observability. Our theoretical analysis formally characterizes that while potentially increasing the asymptotic bias, a smaller state representation decreases the risk of over… ▽ More

    Submitted 6 February, 2019; v1 submitted 22 September, 2017; originally announced September 2017.

    Comments: Accepted at the Journal of Artificial Intelligence Research (JAIR) - 31 pages

  18. arXiv:1512.02011  [pdf, other

    cs.LG cs.AI

    How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies

    Authors: Vincent François-Lavet, Raphael Fonteneau, Damien Ernst

    Abstract: Using deep neural nets as function approximator for reinforcement learning tasks have recently been shown to be very powerful for solving problems approaching real-world complexity. Using these results as a benchmark, we discuss the role that the discount factor may play in the quality of the learning process of a deep Q-network (DQN). When the discount factor progressively increases up to its fin… ▽ More

    Submitted 20 January, 2016; v1 submitted 7 December, 2015; originally announced December 2015.

    Comments: NIPS 2015 Deep Reinforcement Learning Workshop

  19. arXiv:1406.7865  [pdf, other

    stat.ML cs.CE cs.LG

    Simple connectome inference from partial correlation statistics in calcium imaging

    Authors: Antonio Sutera, Arnaud Joly, Vincent François-Lavet, Zixiao Aaron Qiu, Gilles Louppe, Damien Ernst, Pierre Geurts

    Abstract: In this work, we propose a simple yet effective solution to the problem of connectome inference in calcium imaging data. The proposed algorithm consists of two steps. First, processing the raw signals to detect neural peak activities. Second, inferring the degree of association between neurons from partial correlation statistics. This paper summarises the methodology that led us to win the Connect… ▽ More

    Submitted 18 November, 2014; v1 submitted 30 June, 2014; originally announced June 2014.

    Journal ref: JMLR: Workshop and Conference Proceedings 46:23-35, 2015