Skip to main content

Showing 1–12 of 12 results for author: Gasperini, S

.
  1. arXiv:2403.14279  [pdf, other

    cs.CV

    Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation

    Authors: Francesco Di Felice, Alberto Remus, Stefano Gasperini, Benjamin Busam, Lionel Ott, Federico Tombari, Roland Siegwart, Carlo Alberto Avizzano

    Abstract: Estimating the pose of objects through vision is essential to make robotic platforms interact with the environment. Yet, it presents many challenges, often related to the lack of flexibility and generalizability of state-of-the-art solutions. Diffusion models are a cutting-edge neural architecture transforming 2D and 3D computer vision, outlining remarkable performances in zero-shot novel-view syn… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 6 pages, 2 reference pages, 4 figures

  2. arXiv:2312.02255  [pdf, other

    cs.CV cs.GR cs.LG

    Re-Nerfing: Improving Novel Views Synthesis through Novel Views Synthesis

    Authors: Felix Tristram, Stefano Gasperini, Nassir Navab, Federico Tombari

    Abstract: Neural Radiance Fields (NeRFs) have shown remarkable novel view synthesis capabilities even in large-scale, unbounded scenes, albeit requiring hundreds of views or introducing artifacts in sparser settings. Their optimization suffers from shape-radiance ambiguities wherever only a small visual overlap is available. This leads to erroneous scene geometry and artifacts. In this paper, we propose Re-… ▽ More

    Submitted 17 April, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Code will be released upon acceptance

  3. arXiv:2311.05289  [pdf, other

    cs.CV cs.RO

    VoxNeRF: Bridging Voxel Representation and Neural Radiance Fields for Enhanced Indoor View Synthesis

    Authors: Sen Wang, Wei Zhang, Stefano Gasperini, Shun-Cheng Wu, Nassir Navab

    Abstract: Creating high-quality view synthesis is essential for immersive applications but continues to be problematic, particularly in indoor environments and for real-time deployment. Current techniques frequently require extensive computational time for both training and rendering, and often produce less-than-ideal 3D representations due to inadequate geometric structuring. To overcome this, we introduce… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 8 pages, 4 figures

  4. 3D Adversarial Augmentations for Robust Out-of-Domain Predictions

    Authors: Alexander Lehner, Stefano Gasperini, Alvaro Marcos-Ramiro, Michael Schmidt, Nassir Navab, Benjamin Busam, Federico Tombari

    Abstract: Since real-world training datasets cannot properly sample the long tail of the underlying data distribution, corner cases and rare out-of-domain samples can severely hinder the performance of state-of-the-art models. This problem becomes even more severe for dense tasks, such as 3D semantic segmentation, where points of non-standard objects can be confidently associated to the wrong class. In this… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: 37 pages, 12 figures

  5. Robust Monocular Depth Estimation under Challenging Conditions

    Authors: Stefano Gasperini, Nils Morbitzer, HyunJun Jung, Nassir Navab, Federico Tombari

    Abstract: While state-of-the-art monocular depth estimation approaches achieve impressive results in ideal settings, they are highly unreliable under challenging illumination and weather conditions, such as at nighttime or in the presence of rain. In this paper, we uncover these safety-critical issues and tackle them with md4all: a simple and effective solution that works reliably under both adverse and ide… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: ICCV 2023. Source code and data: https://md4all.github.io

  6. Segmenting Known Objects and Unseen Unknowns without Prior Knowledge

    Authors: Stefano Gasperini, Alvaro Marcos-Ramiro, Michael Schmidt, Nassir Navab, Benjamin Busam, Federico Tombari

    Abstract: Panoptic segmentation methods assign a known class to each pixel given in input. Even for state-of-the-art approaches, this inevitably enforces decisions that systematically lead to wrong predictions for objects outside the training categories. However, robustness against out-of-distribution samples and corner cases is crucial in safety-critical settings to avoid dangerous consequences. Since real… ▽ More

    Submitted 18 August, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: ICCV 2023. Project page: https://holisticseg.github.io

  7. 3D-VField: Adversarial Augmentation of Point Clouds for Domain Generalization in 3D Object Detection

    Authors: Alexander Lehner, Stefano Gasperini, Alvaro Marcos-Ramiro, Michael Schmidt, Mohammad-Ali Nikouei Mahani, Nassir Navab, Benjamin Busam, Federico Tombari

    Abstract: As 3D object detection on point clouds relies on the geometrical relationships between the points, non-standard object shapes can hinder a method's detection capability. However, in safety-critical settings, robustness to out-of-domain and long-tail samples is fundamental to circumvent dangerous issues, such as the misdetection of damaged or rare cars. In this work, we substantially improve the ge… ▽ More

    Submitted 3 May, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: CVPR 2022. Project page: https://3d-vfield.github.io

  8. arXiv:2110.01604  [pdf, other

    cs.CV cs.LG cs.RO

    CertainNet: Sampling-free Uncertainty Estimation for Object Detection

    Authors: Stefano Gasperini, Jan Haug, Mohammad-Ali Nikouei Mahani, Alvaro Marcos-Ramiro, Nassir Navab, Benjamin Busam, Federico Tombari

    Abstract: Estimating the uncertainty of a neural network plays a fundamental role in safety-critical settings. In perception for autonomous driving, measuring the uncertainty means providing additional calibrated information to downstream tasks, such as path planning, that can use it towards safe navigation. In this work, we propose a novel sampling-free uncertainty estimation method for object detection. W… ▽ More

    Submitted 28 December, 2021; v1 submitted 4 October, 2021; originally announced October 2021.

    Comments: Published at IEEE Robotics and Automation Letters (RA-L)

  9. R4Dyn: Exploring Radar for Self-Supervised Monocular Depth Estimation of Dynamic Scenes

    Authors: Stefano Gasperini, Patrick Koch, Vinzenz Dallabetta, Nassir Navab, Benjamin Busam, Federico Tombari

    Abstract: While self-supervised monocular depth estimation in driving scenarios has achieved comparable performance to supervised approaches, violations of the static world assumption can still lead to erroneous depth predictions of traffic participants, posing a potential safety issue. In this paper, we present R4Dyn, a novel set of techniques to use cost-efficient radar data on top of a self-supervised de… ▽ More

    Submitted 29 November, 2021; v1 submitted 10 August, 2021; originally announced August 2021.

    Comments: Accepted at the International Conference on 3D Vision (3DV) 2021

  10. arXiv:2010.15157  [pdf, other

    cs.CV cs.LG cs.RO

    Panoster: End-to-end Panoptic Segmentation of LiDAR Point Clouds

    Authors: Stefano Gasperini, Mohammad-Ali Nikouei Mahani, Alvaro Marcos-Ramiro, Nassir Navab, Federico Tombari

    Abstract: Panoptic segmentation has recently unified semantic and instance segmentation, previously addressed separately, thus taking a step further towards creating more comprehensive and efficient perception systems. In this paper, we present Panoster, a novel proposal-free panoptic segmentation method for LiDAR point clouds. Unlike previous approaches relying on several steps to group pixels or points in… ▽ More

    Submitted 12 February, 2021; v1 submitted 28 October, 2020; originally announced October 2020.

    Comments: Preprint of IEEE RA-L article

  11. Signal Clustering with Class-independent Segmentation

    Authors: Stefano Gasperini, Magdalini Paschali, Carsten Hopke, David Wittmann, Nassir Navab

    Abstract: Radar signals have been dramatically increasing in complexity, limiting the source separation ability of traditional approaches. In this paper we propose a Deep Learning-based clustering method, which encodes concurrent signals into images, and, for the first time, tackles clustering with image segmentation. Novel loss functions are introduced to optimize a Neural Network to separate the input pul… ▽ More

    Submitted 18 November, 2019; originally announced November 2019.

    Comments: Under Review for IEEE ICASSP 2020

  12. 3DQ: Compact Quantized Neural Networks for Volumetric Whole Brain Segmentation

    Authors: Magdalini Paschali, Stefano Gasperini, Abhijit Guha Roy, Michael Y. -S. Fang, Nassir Navab

    Abstract: Model architectures have been dramatically increasing in size, improving performance at the cost of resource requirements. In this paper we propose 3DQ, a ternary quantization method, applied for the first time to 3D Fully Convolutional Neural Networks (F-CNNs), enabling 16x model compression while maintaining performance on par with full precision models. We extensively evaluate 3DQ on two datase… ▽ More

    Submitted 1 July, 2019; v1 submitted 5 April, 2019; originally announced April 2019.

    Comments: Accepted to MICCAI 2019