Playing to distraction: towards a robust training of CNN classifiers through visual explanation techniques

Morales, David; Talavera, Estefania; Remeseiro, Beatriz

doi:10.1007/s00521-021-06282-2

Computer Science > Computer Vision and Pattern Recognition

arXiv:2012.14173 (cs)

[Submitted on 28 Dec 2020 (v1), last revised 29 Jul 2021 (this version, v3)]

Title:Playing to distraction: towards a robust training of CNN classifiers through visual explanation techniques

Authors:David Morales, Estefania Talavera, Beatriz Remeseiro

View PDF

Abstract:The field of deep learning is evolving in different directions, with still the need for more efficient training strategies. In this work, we present a novel and robust training scheme that integrates visual explanation techniques in the learning process. Unlike the attention mechanisms that focus on the relevant parts of images, we aim to improve the robustness of the model by making it pay attention to other regions as well. Broadly speaking, the idea is to distract the classifier in the learning process to force it to focus not only on relevant regions but also on those that, a priori, are not so informative for the discrimination of the class. We tested the proposed approach by embedding it into the learning process of a convolutional neural network for the analysis and classification of two well-known datasets, namely Stanford cars and FGVC-Aircraft. Furthermore, we evaluated our model on a real-case scenario for the classification of egocentric images, allowing us to obtain relevant information about peoples' lifestyles. In particular, we work on the challenging EgoFoodPlaces dataset, achieving state-of-the-art results with a lower level of complexity. The obtained results indicate the suitability of our proposed training scheme for image classification, improving the robustness of the final model.

Comments:	20 pages,3 figures, 4 tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2012.14173 [cs.CV]
	(or arXiv:2012.14173v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2012.14173
Journal reference:	Neural Comput & Applic (2021)
Related DOI:	https://doi.org/10.1007/s00521-021-06282-2

Submission history

From: David Morales [view email]
[v1] Mon, 28 Dec 2020 10:24:32 UTC (5,390 KB)
[v2] Wed, 28 Jul 2021 15:28:14 UTC (5,394 KB)
[v3] Thu, 29 Jul 2021 08:49:37 UTC (5,394 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Playing to distraction: towards a robust training of CNN classifiers through visual explanation techniques

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Playing to distraction: towards a robust training of CNN classifiers through visual explanation techniques

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators