Showing 1–2 of 2 results for author: Marceau-Caron, G

  1. arXiv:1712.01076  [pdf, ps, other]

    stat.ML cs.NE

    Natural Langevin Dynamics for Neural Networks

    Authors: Gaétan Marceau-Caron, Yann Ollivier

    Abstract: One way to avoid overfitting in machine learning is to use model parameters distributed according to a Bayesian posterior given the data, rather than the maximum likelihood estimator. Stochastic gradient Langevin dynamics (SGLD) is one algorithm to approximate such Bayesian posteriors for large models and datasets. SGLD is a standard stochastic gradient descent to which is added a controlled amoun…

    Submitted 4 December, 2017; originally announced December 2017.

  2. arXiv:1602.08007  [pdf, other]

    cs.NE cs.LG stat.ML

    Practical Riemannian Neural Networks

    Authors: Gaétan Marceau-Caron, Yann Ollivier

    Abstract: We provide the first experimental results on non-synthetic datasets for the quasi-diagonal Riemannian gradient descents for neural networks introduced in [Ollivier, 2015]. These include the MNIST, SVHN, and FACE datasets as well as a previously unpublished electroencephalogram dataset. The quasi-diagonal Riemannian algorithms consistently beat simple stochastic gradient descents by a vary…

    Submitted 25 February, 2016; originally announced February 2016.