Showing 1–10 of 10 results for author: Brasó, G

  1. arXiv:2404.11426

    cs.CV

    SPAMming Labels: Efficient Annotations for the Trackers of Tomorrow

    Authors: Orcun Cetintas, Tim Meinhardt, Guillem Brasó, Laura Leal-Taixé

    Abstract: Increasing the annotation efficiency of trajectory annotations from videos has the potential to enable the next generation of data-hungry tracking algorithms to thrive on large-scale datasets. Despite the importance of this task, there are currently very few works exploring how to efficiently label tracking datasets comprehensively. In this work, we introduce SPAM, a tracking data engine that prov…

    Submitted 17 April, 2024; originally announced April 2024.

  2. arXiv:2212.03038

    cs.CV

    Unifying Short and Long-Term Tracking with Graph Hierarchies

    Authors: Orcun Cetintas, Guillem Brasó, Laura Leal-Taixé

    Abstract: Tracking objects over long videos effectively means solving a spectrum of problems, from short-term association for un-occluded objects to long-term association for objects that are occluded and then reappear in the scene. Methods tackling these two tasks are often disjoint and crafted for specific scenarios, and top-performing approaches are often a mix of techniques, which yields engineering-hea…

    Submitted 30 March, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: CVPR 2023

  3. arXiv:2210.05657

    cs.CV cs.AI

    The Unreasonable Effectiveness of Fully-Connected Layers for Low-Data Regimes

    Authors: Peter Kocsis, Peter Súkeník, Guillem Brasó, Matthias Nießner, Laura Leal-Taixé, Ismail Elezi

    Abstract: Convolutional neural networks were the standard for solving many computer vision tasks until recently, when Transformers or MLP-based architectures have started to show competitive performance. These architectures typically have a vast number of weights and need to be trained on massive datasets; hence, they are not suitable for their use in low-data regimes. In this work, we propose a simple yet…

    Submitted 13 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS 2022. Homepage: https://peter-kocsis.github.io/LowDataGeneralization/ (24 pages, 14 figures, 12 tables)

    ACM Class: I.2.10; I.5.1; I.4.8

  4. arXiv:2208.01957

    cs.CV cs.LG

    PolarMOT: How Far Can Geometric Relations Take Us in 3D Multi-Object Tracking?

    Authors: Aleksandr Kim, Guillem Brasó, Aljoša Ošep, Laura Leal-Taixé

    Abstract: Most (3D) multi-object tracking methods rely on appearance-based cues for data association. By contrast, we investigate how far we can get by only encoding geometric relationships between objects in 3D space as cues for data-driven data association. We encode 3D detections as nodes in a graph, where spatial and temporal pairwise relations among objects are encoded via localized polar coordinates o…

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: ECCV 2022, 17 pages, 5 pages of supplementary, 3 figures

  5. arXiv:2207.11103

    cs.CV cs.LG cs.RO

    DeVIS: Making Deformable Transformers Work for Video Instance Segmentation

    Authors: Adrià Caelles, Tim Meinhardt, Guillem Brasó, Laura Leal-Taixé

    Abstract: Video Instance Segmentation (VIS) jointly tackles multi-object detection, tracking, and segmentation in video sequences. In the past, VIS methods mirrored the fragmentation of these subtasks in their architectural design, hence missing out on a joint solution. Transformers recently allowed to cast the entire VIS task as a single set-prediction problem. Nevertheless, the quadratic complexity of exi…

    Submitted 22 July, 2022; originally announced July 2022.

  6. arXiv:2207.07454

    cs.CV

    Multi-Object Tracking and Segmentation via Neural Message Passing

    Authors: Guillem Braso, Orcun Cetintas, Laura Leal-Taixe

    Abstract: Graphs offer a natural way to formulate Multiple Object Tracking (MOT) and Multiple Object Tracking and Segmentation (MOTS) within the tracking-by-detection paradigm. However, they also introduce a major challenge for learning methods, as defining a model that can operate on such a structured domain is not trivial. In this work, we exploit the classical network flow formulation of MOT to define a fu…

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:1912.07515

  7. arXiv:2206.04656

    cs.CV

    Simple Cues Lead to a Strong Multi-Object Tracker

    Authors: Jenny Seidenschwarz, Guillem Brasó, Victor Castro Serrano, Ismail Elezi, Laura Leal-Taixé

    Abstract: For a long time, the most common paradigm in Multi-Object Tracking was tracking-by-detection (TbD), where objects are first detected and then associated over video frames. For association, most models resorted to motion and appearance cues, e.g., re-identification networks. Recent approaches based on attention propose to learn the cues in a data-driven manner, showing impressive results. In this…

    Submitted 26 April, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: Accepted to CVPR2023!

  8. arXiv:2110.05132

    cs.CV

    The Center of Attention: Center-Keypoint Grouping via Attention for Multi-Person Pose Estimation

    Authors: Guillem Brasó, Nikita Kister, Laura Leal-Taixé

    Abstract: We introduce CenterGroup, an attention-based framework to estimate human poses from a set of identity-agnostic keypoints and person center predictions in an image. Our approach uses a transformer to obtain context-aware embeddings for all detected keypoints and centers and then applies multi-head attention to directly group joints into their corresponding person centers. While most bottom-up metho…

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: Accepted to ICCV 2021; reports improved multi-scale results

  9. arXiv:2108.09518

    cs.CV

    MOTSynth: How Can Synthetic Data Help Pedestrian Detection and Tracking?

    Authors: Matteo Fabbri, Guillem Braso, Gianluca Maugeri, Orcun Cetintas, Riccardo Gasparini, Aljosa Osep, Simone Calderara, Laura Leal-Taixe, Rita Cucchiara

    Abstract: Deep learning-based methods for video pedestrian detection and tracking require large volumes of training data to achieve good performance. However, data acquisition in crowded public environments raises data privacy concerns -- we are not allowed to simply record and store data without the explicit consent of all participants. Furthermore, the annotation of such data for computer vision applicati…

    Submitted 21 August, 2021; originally announced August 2021.

    Comments: ICCV 2021 camera-ready version

  10. arXiv:1912.07515

    cs.CV

    Learning a Neural Solver for Multiple Object Tracking

    Authors: Guillem Brasó, Laura Leal-Taixé

    Abstract: Graphs offer a natural way to formulate Multiple Object Tracking (MOT) within the tracking-by-detection paradigm. However, they also introduce a major challenge for learning methods, as defining a model that can operate on such a structured domain is not trivial. As a consequence, most learning-based work has been devoted to learning better features for MOT, and then using these with well-e…

    Submitted 18 April, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: Accepted to CVPR 2020 (oral)