Skip to main content

Showing 1–2 of 2 results for author: Bondoux, N

.
  1. arXiv:2110.00535  [pdf, other

    stat.ML cs.LG

    A Cramér Distance perspective on Quantile Regression based Distributional Reinforcement Learning

    Authors: Alix Lhéritier, Nicolas Bondoux

    Abstract: Distributional reinforcement learning (DRL) extends the value-based approach by approximating the full distribution over future returns instead of the mean only, providing a richer signal that leads to improved performances. Quantile Regression (QR) based methods like QR-DQN project arbitrary distributions into a parametric subset of staircase distributions by minimizing the 1-Wasserstein distance… ▽ More

    Submitted 22 February, 2022; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: Substantial changes in the experimental part, in particular in the architectures used for our results. Improvements in the presentation of the proof of Lemma 2. Added a Section to show soundness of TD-learning. To be published in AISTATS 2022

  2. From Research to Proof-of-Concept: Analysis of a Deployment of FPGAs on a Commercial Search Engine

    Authors: Fabio Maschi, Gustavo Alonso, Anthony Hock-Koon, Nicolas Bondoux, Teddy Roy, Mourad Boudia, Matteo Casalino

    Abstract: FPGAs are quickly becoming available in the cloud as a one more heterogeneous processing element complementing CPUs and GPUs. There are many reports in the literature showing the potential for FPGAs to accelerate a wide variety of algorithms, which combined with their growing availability, would seem to also indicate a widespread use in many applications. Unfortunately, there is not much published… ▽ More

    Submitted 20 August, 2021; originally announced August 2021.