A Pitfall of Unsupervised Pre-Training

Alberti, Michele; Seuret, Mathias; Ingold, Rolf; Liwicki, Marcus

Computer Science > Computer Vision and Pattern Recognition

arXiv:1712.01655 (cs)

This paper has been withdrawn by Michele Alberti

[Submitted on 23 Nov 2017 (v1), last revised 17 Dec 2017 (this version, v3)]

Abstract: The point of this paper is to question typical assumptions in deep learning and suggest alternatives. A particular contribution is to prove that even if a Stacked Convolutional Auto-Encoder is good at reconstructing pictures, it is not necessarily good at discriminating their classes. When using Auto-Encoders, one intuitively assumes that features which are good for reconstruction will also lead to high classification accuracy. Indeed, this has become common research practice and is a strategy suggested by introductory books. However, we prove that this is not always the case. We thoroughly investigate the quality of the features produced by Stacked Convolutional Auto-Encoders when they are trained to reconstruct their input. In particular, we analyze the relation between the reconstruction and classification capabilities of the network when the same features are used for both tasks. Experimental results suggest that, in fact, there is no correlation between the reconstruction score and the quality of the features for a classification task. More formally, this means that the lower-dimensional representation space learned by the Stacked Convolutional Auto-Encoder (while being trained for input reconstruction) is not necessarily more separable than the initial input space. Furthermore, we show that the reconstruction error is not a good metric for assessing the quality of features, because it is biased by the quality of the decoder. We do not question the usefulness of pre-training, but we conclude that aiming for the lowest reconstruction error is not necessarily a good idea if a classification task is to be performed afterwards.
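
The abstract's central claim, that good reconstruction does not imply class-separable features, can be illustrated with a toy sketch. This is not the authors' experiment: it replaces the Stacked Convolutional Auto-Encoder with PCA, which gives the best rank-1 linear reconstruction in the mean-squared-error sense, and it uses synthetic 2-D data. The data distribution, the nearest-class-mean classifier, and all names below are assumptions made for illustration only.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 2000

# Synthetic data (assumption): the class label only affects axis 1,
# which carries very little of the total variance.
labels = rng.integers(0, 2, size=n)
x0 = rng.normal(0.0, 10.0, size=n)                          # high variance, class-independent
x1 = rng.normal(0.0, 0.1, size=n) + (2 * labels - 1) * 0.5  # low variance, class-dependent
X = np.column_stack([x0, x1])

# One-component PCA used as a linear auto-encoder with a 1-D code.
mean = X.mean(axis=0)
Xc = X - mean
_, _, vt = np.linalg.svd(Xc, full_matrices=False)
w = vt[0]                          # first principal direction
code = Xc @ w                      # "encoder": project onto w
X_hat = np.outer(code, w) + mean   # "decoder": map the code back to 2-D

# Fraction of the centered data's energy lost by the reconstruction.
rel_err = float(np.sum((X - X_hat) ** 2) / np.sum(Xc ** 2))


def nearest_mean_accuracy(feature, labels):
    """In-sample accuracy of a nearest-class-mean classifier on a 1-D feature."""
    m0 = feature[labels == 0].mean()
    m1 = feature[labels == 1].mean()
    pred = (np.abs(feature - m1) < np.abs(feature - m0)).astype(int)
    return float((pred == labels).mean())


acc_code = nearest_mean_accuracy(code, labels)  # learned 1-D code
acc_raw = nearest_mean_accuracy(x1, labels)     # one raw input coordinate

print(f"relative reconstruction error: {rel_err:.4f}")
print(f"accuracy on learned code:      {acc_code:.3f}")
print(f"accuracy on raw axis 1:        {acc_raw:.3f}")
```

In this construction the code reconstructs the input almost perfectly yet classifies at close to chance, because the reconstruction objective favors the high-variance direction and discards the one that separates the classes.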

Comments:	This submission has been withdrawn by the author; it is a duplicate of arXiv:1703.04332
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1712.01655 [cs.CV]
	(or arXiv:1712.01655v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1712.01655
Journal reference:	Conference on Neural Information Processing Systems, Deep Learning: Bridging Theory and Practice, December 2017

Submission history

From: Michele Alberti
[v1] Thu, 23 Nov 2017 14:54:18 UTC (2,684 KB)
[v2] Thu, 14 Dec 2017 11:51:44 UTC (1 KB) (withdrawn)
[v3] Sun, 17 Dec 2017 20:23:24 UTC (1 KB) (withdrawn)
