Nonlinear Acceleration of Deep Neural Networks

Scieur, Damien; Oyallon, Edouard; d'Aspremont, Alexandre; Bach, Francis

Mathematics > Optimization and Control

arXiv:1805.09639v1 (math)

[Submitted on 24 May 2018 (this version), latest version 21 Jun 2019 (v2)]

Title:Nonlinear Acceleration of Deep Neural Networks

Authors:Damien Scieur, Edouard Oyallon, Alexandre d'Aspremont, Francis Bach

View PDF

Abstract:Regularized nonlinear acceleration (RNA) is a generic extrapolation scheme for optimization methods, with marginal computational overhead. It aims to improve convergence using only the iterates of simple iterative algorithms. However, so far its application to optimization was theoretically limited to gradient descent and other single-step algorithms. Here, we adapt RNA to a much broader setting including stochastic gradient with momentum and Nesterov's fast gradient. We use it to train deep neural networks, and empirically observe that extrapolated networks are more accurate, especially in the early iterations. A straightforward application of our algorithm when training ResNet-152 on ImageNet produces a top-1 test error of 20.88%, improving by 0.8% the reference classification pipeline. Furthermore, the code runs offline in this case, so it never negatively affects performance.

Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1805.09639 [math.OC]
	(or arXiv:1805.09639v1 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.1805.09639

Submission history

From: Damien Scieur [view email]
[v1] Thu, 24 May 2018 12:49:13 UTC (558 KB)
[v2] Fri, 21 Jun 2019 18:56:01 UTC (1,836 KB)

Mathematics > Optimization and Control

Title:Nonlinear Acceleration of Deep Neural Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Nonlinear Acceleration of Deep Neural Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators