Singularity of the Hessian in Deep Learning

Sagun, Levent; Bottou, Leon; LeCun, Yann

arXiv:1611.07476v1 (cs)

[Submitted on 22 Nov 2016 (this version), latest version 5 Oct 2017 (v2)]

Abstract: We look at the eigenvalues of the Hessian of a loss function before and after training. The eigenvalue distribution is seen to be composed of two parts, the bulk which is concentrated around zero, and the edges which are scattered away from zero. We present empirical evidence for the bulk indicating how over-parametrized the system is, and for the edges indicating the complexity of the input data.
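
The bulk/edge split described in the abstract can be illustrated with a toy model. The sketch below is not the paper's experimental setup: it assumes a linear model with mean squared error, random Gaussian inputs, and hypothetical names (`hessian_spectrum`, `n_samples`, `n_params`) chosen for illustration only. For this loss the Hessian is X^T X / n, so with more parameters than samples its rank is at most the number of samples, and the remaining eigenvalues are zero.

```python
import numpy as np

def hessian_spectrum(n_samples, n_params, seed=0):
    """Eigenvalues of the Hessian of a toy least-squares loss.

    Assumption (illustrative, not from the paper): the model is linear,
    loss(w) = mean((X @ w - y) ** 2) / 2, so the Hessian is
    X.T @ X / n_samples and does not depend on w or y.
    """
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(n_samples, n_params))  # synthetic input data
    hessian = X.T @ X / n_samples               # shape (n_params, n_params)
    return np.linalg.eigvalsh(hessian)          # real eigenvalues, ascending

# Over-parametrized case: 50 parameters, only 10 samples.
eigs = hessian_spectrum(n_samples=10, n_params=50)

# Treat eigenvalues below a relative tolerance as numerically zero.
tol = 1e-8 * eigs.max()
n_bulk = int(np.sum(np.abs(eigs) <= tol))  # eigenvalues concentrated at zero
n_edge = int(np.sum(eigs > tol))           # eigenvalues scattered away from zero

print("bulk (near zero):", n_bulk)
print("edges (away from zero):", n_edge)
```

In this toy setting the number of near-zero eigenvalues grows with the excess of parameters over samples, while the non-zero eigenvalues are determined entirely by the input matrix X. A neural network's Hessian also depends on the weights and the loss, so this only mirrors the qualitative picture.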

Comments:	ICLR 2017 Submission on Nov 4, 2016
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1611.07476 [cs.LG]
	(or arXiv:1611.07476v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1611.07476

Submission history

From: Levent Sagun
[v1] Tue, 22 Nov 2016 19:24:49 UTC (4,520 KB)
[v2] Thu, 5 Oct 2017 13:28:50 UTC (2,514 KB)
