Showing 1–2 of 2 results for author: Gormish, M

Search v0.5.6 released 2020-02-24

arXiv:1912.05295 [pdf, other]

cs.CV

Video Person Re-ID: Fantastic Techniques and Where to Find Them

Authors: Priyank Pathak, Amir Erfan Eshratifar, Michael Gormish

Abstract: The ability to identify the same person from multiple camera views without the explicit use of facial recognition is receiving commercial and academic interest. The current status-quo solutions are based on attention neural models. In this paper, we propose Attention and CL loss, which is a hybrid of center and Online Soft Mining (OSM) loss added to the attention loss on top of a temporal attention-based neural network. The proposed loss function applied with bag-of-tricks for training surpasses the state of the art on the common person Re-ID datasets, MARS and PRID 2011. Our source code is publicly available on github.

Submitted 20 November, 2019; originally announced December 2019.

Comments: 2 Page (Student Abstract) accepted in AAAI-20

Report number: AAAI-20 SA-572

arXiv:1909.02680 [pdf, other]

cs.CV

Coarse2Fine: A Two-stage Training Method for Fine-grained Visual Classification

Authors: Amir Erfan Eshratifar, David Eigen, Michael Gormish, Massoud Pedram

Abstract: Small inter-class and large intra-class variations are the main challenges in fine-grained visual classification. Objects from different classes share visually similar structures and objects in the same class can have different poses and viewpoints. Therefore, the proper extraction of discriminative local features (e.g. bird's beak or car's headlight) is crucial. Most of the recent successes on this problem are based upon the attention models which can localize and attend the local discriminative objects parts. In this work, we propose a training method for visual attention networks, Coarse2Fine, which creates a differentiable path from the input space to the attended feature maps. Coarse2Fine learns an inverse mapping function from the attended feature maps to the informative regions in the raw image, which will guide the attention maps to better attend the fine-grained features. We show Coarse2Fine and orthogonal initialization of the attention weights can surpass the state-of-the-art accuracies on common fine-grained classification tasks.

Submitted 5 September, 2019; originally announced September 2019.
