Skip to main content

Showing 1–5 of 5 results for author: Gallouédec, Q

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.09844  [pdf, other

    cs.AI

    Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

    Authors: Quentin Gallouédec, Edward Beeching, Clément Romac, Emmanuel Dellandréa

    Abstract: The search for a general model that can operate seamlessly across multiple domains remains a key goal in machine learning research. The prevailing methodology in Reinforcement Learning (RL) typically limits models to a single task within a unimodal framework, a limitation that contrasts with the broader vision of a versatile, multi-domain model. In this paper, we present Jack of All Trades (JAT),… ▽ More

    Submitted 22 April, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Under review

  2. arXiv:2402.03046  [pdf, other

    cs.LG

    Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning

    Authors: Shengyi Huang, Quentin Gallouédec, Florian Felten, Antonin Raffin, Rousslan Fernand Julien Dossa, Yanxiao Zhao, Ryan Sullivan, Viktor Makoviychuk, Denys Makoviichuk, Mohamad H. Danesh, Cyril Roumégous, Jiayi Weng, Chufan Chen, Md Masudur Rahman, João G. M. Araújo, Guorui Quan, Daniel Tan, Timo Klein, Rujikorn Charakorn, Mark Towers, Yann Berthelot, Kinal Mehta, Dipam Chakraborty, Arjun KG, Valentin Charraut , et al. (8 additional authors not shown)

    Abstract: In many Reinforcement Learning (RL) papers, learning curves are useful indicators to measure the effectiveness of RL algorithms. However, the complete raw data of the learning curves are rarely available. As a result, it is usually necessary to reproduce the experiments from scratch, which can be time-consuming and error-prone. We present Open RL Benchmark, a set of fully tracked RL experiments, i… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Under review

  3. arXiv:2208.14928  [pdf, other

    cs.LG cs.AI

    Cell-Free Latent Go-Explore

    Authors: Quentin Gallouédec, Emmanuel Dellandréa

    Abstract: In this paper, we introduce Latent Go-Explore (LGE), a simple and general approach based on the Go-Explore paradigm for exploration in reinforcement learning (RL). Go-Explore was initially introduced with a strong domain knowledge constraint for partitioning the state space into cells. However, in most real-world scenarios, drawing domain knowledge from raw observations is complex and tedious. If… ▽ More

    Submitted 27 April, 2023; v1 submitted 31 August, 2022; originally announced August 2022.

    Comments: Proceedings of the International Conference on Machine Learning, 2023

  4. arXiv:2110.12794  [pdf, other

    cs.AR

    Mixed precision in Graphics Processing Unit

    Authors: Quentin Gallouédec

    Abstract: Modern graphics computing units (GPUs) are designed and optimized to perform highly parallel numerical calculations. This parallelism has enabled (and promises) significant advantages, both in terms of energy performance and calculation. In this document, we take stock of the different applications of mixed precision. We recall the standards currently used in the overwhelming majority of systems i… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: M.S dissertation

  5. arXiv:2106.13687  [pdf, other

    cs.LG

    panda-gym: Open-source goal-conditioned environments for robotic learning

    Authors: Quentin Gallouédec, Nicolas Cazin, Emmanuel Dellandréa, Liming Chen

    Abstract: This paper presents panda-gym, a set of Reinforcement Learning (RL) environments for the Franka Emika Panda robot integrated with OpenAI Gym. Five tasks are included: reach, push, slide, pick & place and stack. They all follow a Multi-Goal RL framework, allowing to use goal-oriented RL algorithms. To foster open-research, we chose to use the open-source physics engine PyBullet. The implementation… ▽ More

    Submitted 19 December, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 Workshop on Robot Learning: Self-Supervised and Lifelong Learning