Skip to main content

Showing 1–2 of 2 results for author: Blier, L

Searching in archive stat. Search in all archives.
.
  1. arXiv:1901.09732  [pdf, other

    cs.LG stat.ML

    Making Deep Q-learning methods robust to time discretization

    Authors: Corentin Tallec, Léonard Blier, Yann Ollivier

    Abstract: Despite remarkable successes, Deep Reinforcement Learning (DRL) is not robust to hyperparameterization, implementation details, or small environment changes (Henderson et al. 2017, Zhang et al. 2018). Overcoming such sensitivity is key to making DRL applicable to real world problems. In this paper, we identify sensitivity to time discretization in near continuous-time environments as a critical fa… ▽ More

    Submitted 29 January, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

  2. arXiv:1810.01322  [pdf, other

    cs.LG cs.NE stat.ML

    Learning with Random Learning Rates

    Authors: Léonard Blier, Pierre Wolinski, Yann Ollivier

    Abstract: Hyperparameter tuning is a bothersome step in the training of deep learning models. One of the most sensitive hyperparameters is the learning rate of the gradient descent. We present the 'All Learning Rates At Once' (Alrao) optimization method for neural networks: each unit or feature in the network gets its own learning rate sampled from a random distribution spanning several orders of magnitude.… ▽ More

    Submitted 29 January, 2019; v1 submitted 2 October, 2018; originally announced October 2018.

    Comments: 20 pages, 8 figures, code available on GitHub