Hybrid multi-layer Deep CNN/Aggregator feature for image classification

Kulkarni, Praveen; Zepeda, Joaquin; Jurie, Frederic; Perez, Patrick; Chevallier, Louis


arXiv:1503.04065 (cs)

[Submitted on 13 Mar 2015]



Abstract: Deep Convolutional Neural Networks (DCNNs) have established a remarkable performance benchmark in the field of image classification, displacing classical approaches based on hand-tailored aggregations of local descriptors. Yet DCNNs impose high computational burdens at both training and testing time, and training them requires collecting and annotating large amounts of training data. Supervised adaptation methods have been proposed in the literature that partially re-learn a transferred DCNN structure from a new target dataset, but these require expensive bounding-box annotations and are still computationally expensive to learn. In this paper, we address these shortcomings of DCNN adaptation schemes by proposing a hybrid approach that combines conventional, unsupervised aggregators such as Bag-of-Words (BoW) with the DCNN pipeline by treating the output of intermediate layers as densely extracted local descriptors.
We test a variant of our approach that uses only intermediate DCNN layers on the standard PASCAL VOC 2007 dataset and show performance significantly higher than the standard BoW model and comparable to Fisher vector aggregation, but with a feature that is 150 times smaller. A second variant of our approach that includes the fully connected DCNN layers significantly outperforms Fisher vector schemes and performs comparably to DCNN approaches adapted to PASCAL VOC 2007, yet at only a small fraction of the training and testing cost.
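
The core idea in the abstract, treating an intermediate layer's output as a dense grid of local descriptors and aggregating them with BoW, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the 256 x 13 x 13 feature-map shape, the codebook size of 100, and the use of random arrays in place of real DCNN activations and a learned k-means codebook are all assumptions made for illustration.

```python
import numpy as np


def feature_map_to_descriptors(fmap):
    """Turn a (C, H, W) activation map into H*W local descriptors of dimension C.

    Each spatial position of the map is read as one local descriptor.
    """
    c, h, w = fmap.shape
    return fmap.reshape(c, h * w).T


def bow_encode(descriptors, codebook):
    """Hard-assign each descriptor to its nearest codeword.

    Returns an L1-normalized histogram of codeword counts (length K).
    """
    # Squared Euclidean distances between descriptors and codewords, shape (N, K).
    d2 = (
        (descriptors ** 2).sum(axis=1, keepdims=True)
        - 2.0 * descriptors @ codebook.T
        + (codebook ** 2).sum(axis=1)
    )
    assignments = d2.argmin(axis=1)
    hist = np.bincount(assignments, minlength=codebook.shape[0]).astype(np.float64)
    return hist / hist.sum()


# Assumed stand-ins: random values replace real layer activations and a
# codebook that would normally be learned (e.g. by k-means) on training descriptors.
rng = np.random.default_rng(0)
fmap = rng.standard_normal((256, 13, 13))
codebook = rng.standard_normal((100, 256))

descriptors = feature_map_to_descriptors(fmap)
feature = bow_encode(descriptors, codebook)
```

In this sketch the image-level feature has one entry per codeword, so its size depends only on the codebook size and not on the spatial resolution of the layer. Such a histogram would then typically be passed to a linear classifier.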

Comments:	Accepted at ICASSP 2015, 5 pages including references, 4 figures and 2 tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1503.04065 [cs.CV]
	(or arXiv:1503.04065v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1503.04065

Submission history

From: Praveen Kulkarni
[v1] Fri, 13 Mar 2015 13:49:26 UTC (350 KB)
