Showing 1–2 of 2 results for author: Kantorov, V

Search v0.5.6 released 2020-02-24

arXiv:2106.04550 [pdf, other]

cs.CV

DETReg: Unsupervised Pretraining with Region Priors for Object Detection

Authors: Amir Bar, Xin Wang, Vadim Kantorov, Colorado J Reed, Roei Herzig, Gal Chechik, Anna Rohrbach, Trevor Darrell, Amir Globerson

Abstract: Recent self-supervised pretraining methods for object detection largely focus on pretraining the backbone of the object detector, neglecting key parts of detection architecture. Instead, we introduce DETReg, a new self-supervised method that pretrains the entire object detection network, including the object localization and embedding components. During pretraining, DETReg predicts object localiza… ▽ More Recent self-supervised pretraining methods for object detection largely focus on pretraining the backbone of the object detector, neglecting key parts of detection architecture. Instead, we introduce DETReg, a new self-supervised method that pretrains the entire object detection network, including the object localization and embedding components. During pretraining, DETReg predicts object localizations to match the localizations from an unsupervised region proposal generator and simultaneously aligns the corresponding feature embeddings with embeddings from a self-supervised image encoder. We implement DETReg using the DETR family of detectors and show that it improves over competitive baselines when finetuned on COCO, PASCAL VOC, and Airbus Ship benchmarks. In low-data regimes DETReg achieves improved performance, e.g., when training with only 1% of the labels and in the few-shot learning settings. △ Less

Submitted 19 July, 2023; v1 submitted 8 June, 2021; originally announced June 2021.

Comments: Project page: https://www.amirbar.net/detreg/
arXiv:1609.04331 [pdf, other]

cs.CV

ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization

Authors: Vadim Kantorov, Maxime Oquab, Minsu Cho, Ivan Laptev

Abstract: We aim to localize objects in images using image-level supervision only. Previous approaches to this problem mainly focus on discriminative object regions and often fail to locate precise object boundaries. We address this problem by introducing two types of context-aware guidance models, additive and contrastive models, that leverage their surrounding context regions to improve localization. The… ▽ More We aim to localize objects in images using image-level supervision only. Previous approaches to this problem mainly focus on discriminative object regions and often fail to locate precise object boundaries. We address this problem by introducing two types of context-aware guidance models, additive and contrastive models, that leverage their surrounding context regions to improve localization. The additive model encourages the predicted object region to be supported by its surrounding context region. The contrastive model encourages the predicted object region to be outstanding from its surrounding context region. Our approach benefits from the recent success of convolutional neural networks for object recognition and extends Fast R-CNN to weakly supervised object localization. Extensive experimental evaluation on the PASCAL VOC 2007 and 2012 benchmarks shows hat our context-aware approach significantly improves weakly supervised localization and detection. △ Less

Submitted 14 September, 2016; originally announced September 2016.

Comments: Accepted paper at ECCV2016. The website and code is at http://www.di.ens.fr/willow/research/contextlocnet

Search v0.5.6 released 2020-02-24