Showing 1–4 of 4 results for author: Laurent, C

  1. arXiv:2008.00938  [pdf, other]

    cs.LG stat.ML

    Implicit Regularization via Neural Feature Alignment

    Authors: Aristide Baratin, Thomas George, César Laurent, R Devon Hjelm, Guillaume Lajoie, Pascal Vincent, Simon Lacoste-Julien

    Abstract: We approach the problem of implicit regularization in deep learning from a geometrical viewpoint. We highlight a regularization effect induced by a dynamical alignment of the neural tangent features introduced by Jacot et al., along a small number of task-relevant directions. This can be interpreted as a combined mechanism of feature selection and compression. By extrapolating a new analysis of Rad…

    Submitted 16 March, 2021; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: AISTATS 2021

  2. arXiv:2006.12279  [pdf, other]

    cs.LG stat.ML

    Revisiting Loss Modelling for Unstructured Pruning

    Authors: César Laurent, Camille Ballas, Thomas George, Nicolas Ballas, Pascal Vincent

    Abstract: By removing parameters from deep neural networks, unstructured pruning methods aim at cutting down memory footprint and computational cost, while maintaining prediction accuracy. In order to tackle this otherwise intractable problem, many of these methods model the loss landscape using first or second order Taylor expansions to identify which parameters can be discarded. We revisit loss modelling…

    Submitted 22 June, 2020; originally announced June 2020.

  3. arXiv:1806.03884  [pdf, other]

    cs.LG stat.ML

    Fast Approximate Natural Gradient Descent in a Kronecker-factored Eigenbasis

    Authors: Thomas George, César Laurent, Xavier Bouthillier, Nicolas Ballas, Pascal Vincent

    Abstract: Optimization algorithms that leverage gradient covariance information, such as variants of natural gradient descent (Amari, 1998), offer the prospect of yielding more effective descent directions. For models with many parameters, the covariance matrix they are based on becomes gigantic, making them inapplicable in their original form. This has motivated research into both simple diagonal approxima…

    Submitted 26 July, 2021; v1 submitted 11 June, 2018; originally announced June 2018.

    Journal ref: Advances in Neural Information Processing Systems 2018

  4. arXiv:1510.01378  [pdf, other]

    stat.ML cs.LG cs.NE

    Batch Normalized Recurrent Neural Networks

    Authors: César Laurent, Gabriel Pereyra, Philémon Brakel, Ying Zhang, Yoshua Bengio

    Abstract: Recurrent Neural Networks (RNNs) are powerful models for sequential data that have the potential to learn long-term dependencies. However, they are computationally expensive to train and difficult to parallelize. Recent work has shown that normalizing intermediate representations of neural networks can significantly improve convergence rates in feedforward neural networks. In particular, batch no…

    Submitted 5 October, 2015; originally announced October 2015.