Skip to main content

Showing 1–6 of 6 results for author: Bertoin, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.08406  [pdf, other

    cs.LG

    RRLS : Robust Reinforcement Learning Suite

    Authors: Adil Zouitine, David Bertoin, Pierre Clavier, Matthieu Geist, Emmanuel Rachelson

    Abstract: Robust reinforcement learning is the problem of learning control policies that provide optimal worst-case performance against a span of adversarial environments. It is a crucial ingredient for deploying algorithms in real-world scenarios with prevalent environmental uncertainties and has been a long-standing object of attention in the community, without a standardized set of benchmarks. This contr… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  2. arXiv:2406.08395  [pdf, other

    cs.LG

    Time-Constrained Robust MDPs

    Authors: Adil Zouitine, David Bertoin, Pierre Clavier, Matthieu Geist, Emmanuel Rachelson

    Abstract: Robust reinforcement learning is essential for deploying reinforcement learning algorithms in real-world scenarios where environmental uncertainty predominates. Traditional robust reinforcement learning often depends on rectangularity assumptions, where adverse probability measures of outcome states are assumed to be independent across different states and actions. This assumption, rarely fulfille… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2209.09203  [pdf, other

    cs.LG cs.AI cs.CV

    Look where you look! Saliency-guided Q-networks for generalization in visual Reinforcement Learning

    Authors: David Bertoin, Adil Zouitine, Mehdi Zouitine, Emmanuel Rachelson

    Abstract: Deep reinforcement learning policies, despite their outstanding efficiency in simulated visual control tasks, have shown disappointing ability to generalize across disturbances in the input training images. Changes in image statistics or distracting background elements are pitfalls that prevent generalization and real-world applicability of such control policies. We elaborate on the intuition that… ▽ More

    Submitted 8 February, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022), Nov 2022, New Orleans, United States

  4. arXiv:2204.06355  [pdf, other

    cs.AI

    Local Feature Swap** for Generalization in Reinforcement Learning

    Authors: David Bertoin, Emmanuel Rachelson

    Abstract: Over the past few years, the acceleration of computing resources and research in deep learning has led to significant practical successes in a range of tasks, including in particular in computer vision. Building on these advances, reinforcement learning has also seen a leap forward with the emergence of agents capable of making decisions directly from visual observations. Despite these successes,… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

  5. arXiv:2112.12980  [pdf, other

    cs.LG

    Disentanglement by Cyclic Reconstruction

    Authors: David Bertoin, Emmanuel Rachelson

    Abstract: Deep neural networks have demonstrated their ability to automatically extract meaningful features from data. However, in supervised learning, information specific to the dataset used for training, but irrelevant to the task at hand, may remain encoded in the extracted representations. This remaining information introduces a domain-specific bias, weakening the generalization performance. In this wo… ▽ More

    Submitted 22 November, 2022; v1 submitted 24 December, 2021; originally announced December 2021.

  6. arXiv:2106.12915  [pdf, other

    cs.LG cs.AI

    Numerical influence of ReLU'(0) on backpropagation

    Authors: David Bertoin, Jérôme Bolte, Sébastien Gerchinovitz, Edouard Pauwels

    Abstract: In theory, the choice of ReLU(0) in [0, 1] for a neural network has a negligible influence both on backpropagation and training. Yet, in the real world, 32 bits default precision combined with the size of deep learning problems makes it a hyperparameter of training methods. We investigate the importance of the value of ReLU'(0) for several precision levels (16, 32, 64 bits), on various networks (f… ▽ More

    Submitted 3 November, 2023; v1 submitted 23 June, 2021; originally announced June 2021.

    Journal ref: Advances in Neural Information Processing Systems, Dec 2021, Paris, France