Skip to main content

Showing 1–13 of 13 results for author: Segu, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04221  [pdf, other

    cs.CV

    Matching Anything by Segmenting Anything

    Authors: Siyuan Li, Lei Ke, Martin Danelljan, Luigi Piccinelli, Mattia Segu, Luc Van Gool, Fisher Yu

    Abstract: The robust association of the same objects across video frames in complex scenes is crucial for many applications, especially Multiple Object Tracking (MOT). Current methods predominantly rely on labeled domain-specific video datasets, which limits the cross-domain generalization of learned similarity embeddings. We propose MASA, a novel method for robust instance association learning, capable of… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 Highlight. code at: https://github.com/siyuanliii/masa

  2. arXiv:2404.03658  [pdf, other

    cs.CV

    Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning

    Authors: Rui Li, Tobias Fischer, Mattia Segu, Marc Pollefeys, Luc Van Gool, Federico Tombari

    Abstract: Recovering the 3D scene geometry from a single view is a fundamental yet ill-posed problem in computer vision. While classical depth estimation methods infer only a 2.5D scene representation limited to the image plane, recent approaches based on radiance fields reconstruct a full 3D representation. However, these methods still struggle with occluded regions since inferring geometry without visual… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: CVPR 2024. Project page: https://ruili3.github.io/kyn

  3. arXiv:2403.18913  [pdf, other

    cs.CV

    UniDepth: Universal Monocular Metric Depth Estimation

    Authors: Luigi Piccinelli, Yung-Hsu Yang, Christos Sakaridis, Mattia Segu, Siyuan Li, Luc Van Gool, Fisher Yu

    Abstract: Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks in 3D perception and modeling. However, the remarkable accuracy of recent MMDE methods is confined to their training domains. These methods fail to generalize to unseen domains even in the presence of moderate domain gaps, which hinders their practical applicability. We propose a new model, UniDepth, capable o… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  4. arXiv:2310.03006  [pdf, other

    cs.CV

    COOLer: Class-Incremental Learning for Appearance-Based Multiple Object Tracking

    Authors: Zhizheng Liu, Mattia Segu, Fisher Yu

    Abstract: Continual learning allows a model to learn multiple tasks sequentially while retaining the old knowledge without the training data of the preceding tasks. This paper extends the scope of continual learning research to class-incremental learning for multiple object tracking (MOT), which is desirable to accommodate the continuously evolving needs of autonomous systems. Previous solutions for continu… ▽ More

    Submitted 5 October, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: GCPR 2023 Oral

  5. arXiv:2310.01926  [pdf, other

    cs.CV cs.AI

    DARTH: Holistic Test-time Adaptation for Multiple Object Tracking

    Authors: Mattia Segu, Bernt Schiele, Fisher Yu

    Abstract: Multiple object tracking (MOT) is a fundamental component of perception systems for autonomous driving, and its robustness to unseen conditions is a requirement to avoid life-critical failures. Despite the urge of safety in driving systems, no solution to the MOT adaptation problem to domain shift in test-time conditions has ever been proposed. However, the nature of a MOT system is manifold - req… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: Proceedings of the IEEE/CVF International Conference on Computer Vision

  6. arXiv:2211.04393  [pdf, other

    cs.CV

    Normalization Perturbation: A Simple Domain Generalization Method for Real-World Domain Shifts

    Authors: Qi Fan, Mattia Segu, Yu-Wing Tai, Fisher Yu, Chi-Keung Tang, Bernt Schiele, Dengxin Dai

    Abstract: Improving model's generalizability against domain shifts is crucial, especially for safety-critical applications such as autonomous driving. Real-world domain styles can vary substantially due to environment changes and sensor noises, but deep models only know the training domain style. Such domain style gap impedes model generalization on diverse real-world domains. Our proposed Normalization Per… ▽ More

    Submitted 8 November, 2022; v1 submitted 8 November, 2022; originally announced November 2022.

  7. arXiv:2206.08367  [pdf, other

    cs.CV cs.LG

    SHIFT: A Synthetic Driving Dataset for Continuous Multi-Task Domain Adaptation

    Authors: Tao Sun, Mattia Segu, Janis Postels, Yuxuan Wang, Luc Van Gool, Bernt Schiele, Federico Tombari, Fisher Yu

    Abstract: Adapting to a continuously evolving environment is a safety-critical challenge inevitably faced by all autonomous driving systems. Existing image and video driving datasets, however, fall short of capturing the mutable nature of the real world. In this paper, we introduce the largest multi-task synthetic dataset for autonomous driving, SHIFT. It presents discrete and continuous shifts in cloudines… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: Published at IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2022

  8. arXiv:2203.03962  [pdf, other

    cs.CV

    Generative Cooperative Learning for Unsupervised Video Anomaly Detection

    Authors: Muhammad Zaigham Zaheer, Arif Mahmood, Muhammad Haris Khan, Mattia Segu, Fisher Yu, Seung-Ik Lee

    Abstract: Video anomaly detection is well investigated in weakly-supervised and one-class classification (OCC) settings. However, unsupervised video anomaly detection methods are quite sparse, likely because anomalies are less frequent in occurrence and usually not well-defined, which when coupled with the absence of ground truth supervision, could adversely affect the performance of the learning algorithms… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    Comments: Accepted to the Conference on Computer Vision and Pattern Recognition CVPR 2022

  9. arXiv:2107.00649  [pdf, other

    cs.CV

    On the Practicality of Deterministic Epistemic Uncertainty

    Authors: Janis Postels, Mattia Segu, Tao Sun, Luca Sieber, Luc Van Gool, Fisher Yu, Federico Tombari

    Abstract: A set of novel approaches for estimating epistemic uncertainty in deep neural networks with a single forward pass has recently emerged as a valid alternative to Bayesian Neural Networks. On the premise of informative representations, these deterministic uncertainty methods (DUMs) achieve strong performance on detecting out-of-distribution (OOD) data while adding negligible computational costs at i… ▽ More

    Submitted 5 July, 2022; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: International Conference on Machine Learning 2022

  10. arXiv:2011.13399  [pdf, other

    cs.CV cs.LG

    Depth-Aware Action Recognition: Pose-Motion Encoding through Temporal Heatmaps

    Authors: Mattia Segu, Federico Pirovano, Gianmario Fumagalli, Amedeo Fabris

    Abstract: Most state-of-the-art methods for action recognition rely only on 2D spatial features encoding appearance, motion or pose. However, 2D data lacks the depth information, which is crucial for recognizing fine-grained actions. In this paper, we propose a depth-aware volumetric descriptor that encodes pose and motion information in a unified representation for action classification in-the-wild. Our fr… ▽ More

    Submitted 26 November, 2020; originally announced November 2020.

  11. arXiv:2011.13388  [pdf, other

    cs.CV cs.LG

    3DSNet: Unsupervised Shape-to-Shape 3D Style Transfer

    Authors: Mattia Segu, Margarita Grinvald, Roland Siegwart, Federico Tombari

    Abstract: Transferring the style from one image onto another is a popular and widely studied task in computer vision. Yet, style transfer in the 3D setting remains a largely unexplored problem. To our knowledge, we propose the first learning-based approach for style transfer between 3D objects based on disentangled content and style representations. The proposed method can synthesize new 3D shapes both in t… ▽ More

    Submitted 18 May, 2021; v1 submitted 26 November, 2020; originally announced November 2020.

  12. arXiv:2011.12672  [pdf, other

    cs.LG cs.CV

    Batch Normalization Embeddings for Deep Domain Generalization

    Authors: Mattia Segu, Alessio Tonioni, Federico Tombari

    Abstract: Domain generalization aims at training machine learning models to perform robustly across different and unseen domains. Several recent methods use multiple datasets to train models to extract domain-invariant features, ho** to generalize to unseen domains. Instead, first we explicitly train domain-dependant representations by using ad-hoc batch normalization layers to collect independent domain'… ▽ More

    Submitted 18 May, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

  13. A General Framework for Uncertainty Estimation in Deep Learning

    Authors: Antonio Loquercio, Mattia Segù, Davide Scaramuzza

    Abstract: Neural networks predictions are unreliable when the input sample is out of the training distribution or corrupted by noise. Being able to detect such failures automatically is fundamental to integrate deep learning algorithms into robotics. Current approaches for uncertainty estimation of neural networks require changes to the network and optimization process, typically ignore prior knowledge abou… ▽ More

    Submitted 7 February, 2020; v1 submitted 16 July, 2019; originally announced July 2019.

    Comments: Accepted for publication in the Robotics and Automation Letters 2020, and for presentation at the International Conference on Robotics and Automation (ICRA) 2020

    Journal ref: IEEE Robotics and Automation Letters 2020