Showing 1–6 of 6 results for author: Picron, C

  1. arXiv:2402.12536  [pdf, other]

    cs.CV

    Designing High-Performing Networks for Multi-Scale Computer Vision

    Authors: Cédric Picron

    Abstract: Since the emergence of deep learning, the computer vision field has flourished with models improving at a rapid pace on more and more complex tasks. We distinguish three main ways to improve a computer vision model: (1) improving the data aspect by for example training on a large, more diverse dataset, (2) improving the training aspect by for example designing a better optimizer, and (3) improving…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: PhD thesis

  2. arXiv:2307.01545  [pdf, other]

    cs.CV

    EffSeg: Efficient Fine-Grained Instance Segmentation using Structure-Preserving Sparsity

    Authors: Cédric Picron, Tinne Tuytelaars

    Abstract: Many two-stage instance segmentation heads predict a coarse 28x28 mask per instance, which is insufficient to capture the fine-grained details of many objects. To address this issue, PointRend and RefineMask predict a 112x112 segmentation mask resulting in higher quality segmentations. Both methods however have limitations by either not having access to neighboring features (PointRend) or by perfo…

    Submitted 4 July, 2023; originally announced July 2023.

  3. arXiv:2210.02318  [pdf, other]

    cs.CV

    FQDet: Fast-converging Query-based Detector

    Authors: Cédric Picron, Punarjay Chakravarty, Tinne Tuytelaars

    Abstract: Recently, two-stage Deformable DETR introduced the query-based two-stage head, a new type of two-stage head different from the region-based two-stage heads of classical detectors as Faster R-CNN. In query-based two-stage heads, the second stage selects one feature per detection processed by a transformer, called the query, as opposed to pooling a rectangular grid of features processed by CNNs as i…

    Submitted 28 October, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: Accepted at NeurIPS VTTA workshop 2022

  4. arXiv:2208.06195  [pdf, other]

    cs.CV

    Category-Level Pose Retrieval with Contrastive Features Learnt with Occlusion Augmentation

    Authors: Georgios Kouros, Shubham Shrivastava, Cédric Picron, Sushruth Nagesh, Punarjay Chakravarty, Tinne Tuytelaars

    Abstract: Pose estimation is usually tackled as either a bin classification or a regression problem. In both cases, the idea is to directly predict the pose of an object. This is a non-trivial task due to appearance variations between similar poses and similarities between dissimilar poses. Instead, we follow the key idea that comparing two poses is easier than directly predicting one. Render-and-compare ap…

    Submitted 12 October, 2022; v1 submitted 12 August, 2022; originally announced August 2022.

    Comments: 29 pages, 16 Figures, 14 tables, BMVC 2022

  5. arXiv:2110.04004  [pdf, other]

    cs.CV

    Trident Pyramid Networks: The importance of processing at the feature pyramid level for better object detection

    Authors: Cédric Picron, Tinne Tuytelaars

    Abstract: Feature pyramids have become ubiquitous in multi-scale computer vision tasks such as object detection. Given their importance, a computer vision network can be divided into three parts: a backbone (generating a feature pyramid), a neck (refining the feature pyramid) and a head (generating the final output). Many existing networks operating on feature pyramids, named necks, are shallow and mostly f…

    Submitted 28 October, 2022; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: Accepted at BMVC 2022

  6. arXiv:2007.14812  [pdf, other]

    cs.CV

    What My Motion tells me about Your Pose: A Self-Supervised Monocular 3D Vehicle Detector

    Authors: Cédric Picron, Punarjay Chakravarty, Tom Roussel, Tinne Tuytelaars

    Abstract: The estimation of the orientation of an observed vehicle relative to an Autonomous Vehicle (AV) from monocular camera data is an important building block in estimating its 6 DoF pose. Current Deep Learning based solutions for placing a 3D bounding box around this observed vehicle are data hungry and do not generalize well. In this paper, we demonstrate the use of monocular visual odometry for the…

    Submitted 24 March, 2021; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: ICRA 2021 (presentation)