Showing 1–3 of 3 results for author: Urvoy, T

Searching in archive stat.
  1. arXiv:2406.12945  [pdf, other]

    cs.LG stat.ML

    Under the Hood of Tabular Data Generation Models: the Strong Impact of Hyperparameter Tuning

    Authors: G. Charbel N. Kindji, Lina Maria Rojas-Barahona, Elisa Fromont, Tanguy Urvoy

    Abstract: We investigate the impact of dataset-specific hyperparameter, feature encoding, and architecture tuning on five recent model families for tabular data generation through an extensive benchmark on 16 datasets. This study addresses the practical need for a unified evaluation of models that fully considers hyperparameter optimization. Additionally, we propose a reduced search space for each model tha…

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:1903.01004  [pdf, other]

    cs.LG cs.AI stat.ML

    Budgeted Reinforcement Learning in Continuous State Space

    Authors: Nicolas Carrara, Edouard Leurent, Romain Laroche, Tanguy Urvoy, Odalric-Ambrym Maillard, Olivier Pietquin

    Abstract: A Budgeted Markov Decision Process (BMDP) is an extension of a Markov Decision Process to critical applications requiring safety constraints. It relies on a notion of risk implemented in the shape of a cost signal constrained to lie below an - adjustable - threshold. So far, BMDPs could only be solved in the case of finite state spaces with known dynamics. This work extends the state-of-the-art to…

    Submitted 27 May, 2019; v1 submitted 3 March, 2019; originally announced March 2019.

    Comments: N. Carrara and E. Leurent have equally contributed

  3. arXiv:1708.05033  [pdf, other]

    cs.LG stat.ML

    Corrupt Bandits for Preserving Local Privacy

    Authors: Pratik Gajane, Tanguy Urvoy, Emilie Kaufmann

    Abstract: We study a variant of the stochastic multi-armed bandit (MAB) problem in which the rewards are corrupted. In this framework, motivated by privacy preservation in online recommender systems, the goal is to maximize the sum of the (unobserved) rewards, based on the observation of transformation of these rewards through a stochastic corruption process with known parameters. We provide a lower bound o…

    Submitted 2 November, 2017; v1 submitted 16 August, 2017; originally announced August 2017.