Showing 1–15 of 15 results for author: Pilzer, A

  1. arXiv:2312.15297

    cs.LG cs.CV stat.ML

    Make Me a BNN: A Simple Strategy for Estimating Bayesian Uncertainty from Pre-trained Models

    Authors: Gianni Franchi, Olivier Laurent, Maxence Leguéry, Andrei Bursuc, Andrea Pilzer, Angela Yao

    Abstract: Deep Neural Networks (DNNs) are powerful tools for various computer vision tasks, yet they often struggle with reliable uncertainty quantification - a critical requirement for real-world applications. Bayesian Neural Networks (BNN) are equipped for uncertainty estimation but cannot scale to large DNNs that are highly unstable to train. To address this challenge, we introduce the Adaptable Bayesian…

    Submitted 23 December, 2023; originally announced December 2023.

  2. arXiv:2309.15478

    cs.CV cs.LG

    The Robust Semantic Segmentation UNCV2023 Challenge Results

    Authors: Xuanlong Yu, Yi Zuo, Zitao Wang, Xiaowen Zhang, Jiaxuan Zhao, Yuting Yang, Licheng Jiao, Rui Peng, Xinyi Wang, Junpei Zhang, Kexin Zhang, Fang Liu, Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Hanlin Tian, Kenta Matsui, Tianhao Wang, Fahmy Adan, Zhitong Gao, Xuming He, Quentin Bouniot, Hossein Moghaddam, Shyam Nandan Rai, Fabio Cermelli, et al. (12 additional authors not shown)

    Abstract: This paper outlines the winning solutions employed in addressing the MUAD uncertainty quantification challenge held at ICCV 2023. The challenge was centered around semantic segmentation in urban environments, with a particular focus on natural adversarial scenarios. The report presents the results of 19 submitted entries, with numerous techniques drawing inspiration from cutting-edge uncertainty q…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 11 pages, 4 figures, accepted at ICCV 2023 UNCV workshop

  3. arXiv:2307.02392

    cs.CV

    RADiff: Controllable Diffusion Models for Radio Astronomical Maps Generation

    Authors: Renato Sortino, Thomas Cecconello, Andrea DeMarco, Giuseppe Fiameni, Andrea Pilzer, Andrew M. Hopkins, Daniel Magro, Simone Riggi, Eva Sciacca, Adriano Ingallinera, Cristobal Bordiu, Filomena Bufano, Concetto Spampinato

    Abstract: Along with the nearing completion of the Square Kilometre Array (SKA), comes an increasing demand for accurate and reliable automated solutions to extract valuable information from the vast amount of data it will allow acquiring. Automated source finding is a particularly important task in this context, as it enables the detection and classification of astronomical objects. Deep-learning-based obj…

    Submitted 5 July, 2023; originally announced July 2023.

  4. arXiv:2306.07346

    cs.CV cs.AI cs.MM

    Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training

    Authors: Lorenzo Baraldi, Roberto Amoroso, Marcella Cornia, Lorenzo Baraldi, Andrea Pilzer, Rita Cucchiara

    Abstract: The use of self-supervised pre-training has emerged as a promising approach to enhance the performance of visual tasks such as image classification. In this context, recent approaches have employed the Masked Image Modeling paradigm, which pre-trains a backbone by reconstructing visual tokens associated with randomly masked image patches. This masking approach, however, introduces noise into the i…

    Submitted 12 June, 2023; originally announced June 2023.

  5. arXiv:2303.16166

    cs.CL cs.AI

    When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP

    Authors: Sara Papi, Marco Gaido, Andrea Pilzer, Matteo Negri

    Abstract: Despite its crucial role in research experiments, code correctness is often presumed only on the basis of the perceived quality of results. This assumption comes with the risk of erroneous outcomes and potentially misleading findings. To address this issue, we posit that the current focus on reproducibility should go hand in hand with the emphasis on software quality. We present a case study in wh…

    Submitted 15 August, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

  6. arXiv:2302.06359

    cs.LG

    Fixing Overconfidence in Dynamic Neural Networks

    Authors: Lassi Meronen, Martin Trapp, Andrea Pilzer, Le Yang, Arno Solin

    Abstract: Dynamic neural networks are a recent technique that promises a remedy for the increasing size of modern deep learning models by dynamically adapting their computational cost to the difficulty of the inputs. In this way, the model can adjust to a limited computational budget. However, the poor quality of uncertainty estimates in deep learning models makes it difficult to distinguish between hard an…

    Submitted 8 December, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: In IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024

  7. arXiv:2211.00392

    cs.CV

    Expansion of Visual Hints for Improved Generalization in Stereo Matching

    Authors: Andrea Pilzer, Yuxin Hou, Niki Loppi, Arno Solin, Juho Kannala

    Abstract: We introduce visual hints expansion for guiding stereo matching to improve generalization. Our work is motivated by the robustness of Visual Inertial Odometry (VIO) in computer vision and robotics, where a sparse and unevenly distributed set of feature points characterizes a scene. To improve stereo matching, we propose to elevate 2D hints to 3D points. These sparse and unevenly distributed 3D vis…

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 2023 IEEE Winter Conference on Applications of Computer Vision (WACV)

  8. arXiv:2208.07591

    cs.CV cs.LG

    Uncertainty-guided Source-free Domain Adaptation

    Authors: Subhankar Roy, Martin Trapp, Andrea Pilzer, Juho Kannala, Nicu Sebe, Elisa Ricci, Arno Solin

    Abstract: Source-free domain adaptation (SFDA) aims to adapt a classifier to an unlabelled target data set by only using a pre-trained source model. However, the absence of the source data and the domain shift makes the predictions on the target data unreliable. We propose quantifying the uncertainty in the source model predictions and utilizing it to guide the target adaptation. For this, we construct a pr…

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: ECCV 2022

  9. arXiv:2205.13821

    cs.RO cs.CV

    A Look at Improving Robustness in Visual-inertial SLAM by Moment Matching

    Authors: Arno Solin, Rui Li, Andrea Pilzer

    Abstract: The fusion of camera sensor and inertial data is a leading method for ego-motion tracking in autonomous and smart devices. State estimation techniques that rely on non-linear filtering are a strong paradigm for solving the associated information fusion task. The de facto inference method in this space is the celebrated extended Kalman filter (EKF), which relies on first-order linearizations of bot…

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: 8 pages, to appear in Proceedings of FUSION 2022

  10. arXiv:1909.07667

    cs.CV

    Progressive Fusion for Unsupervised Binocular Depth Estimation using Cycled Networks

    Authors: Andrea Pilzer, Stéphane Lathuilière, Dan Xu, Mihai Marian Puscas, Elisa Ricci, Nicu Sebe

    Abstract: Recent deep monocular depth estimation approaches based on supervised regression have achieved remarkable performance. However, they require costly ground truth annotations during training. To cope with this issue, in this paper we present a novel unsupervised deep learning approach for predicting depth maps. We introduce a new network architecture, named Progressive Fusion Network (PFN), that is…

    Submitted 17 September, 2019; originally announced September 2019.

    Comments: Accepted to TPAMI (SI RGB-D Vision), code https://github.com/andrea-pilzer/PFN-depth

  11. arXiv:1908.05794

    cs.CV eess.IV

    Structured Coupled Generative Adversarial Networks for Unsupervised Monocular Depth Estimation

    Authors: Mihai Marian Puscas, Dan Xu, Andrea Pilzer, Nicu Sebe

    Abstract: Inspired by the success of adversarial learning, we propose a new end-to-end unsupervised deep learning framework for monocular depth estimation consisting of two Generative Adversarial Networks (GAN), deeply coupled with a structured Conditional Random Field (CRF) model. The two GANs aim at generating distinct and complementary disparity maps and at improving the generation quality via exploiting…

    Submitted 15 August, 2019; originally announced August 2019.

    Comments: Accepted at 3DV 2019 as ORAL

  12. arXiv:1904.08462

    cs.CV

    Online Adaptation through Meta-Learning for Stereo Depth Estimation

    Authors: Zhenyu Zhang, Stéphane Lathuilière, Andrea Pilzer, Nicu Sebe, Elisa Ricci, Jian Yang

    Abstract: In this work, we tackle the problem of online adaptation for stereo depth estimation, that consists in continuously adapting a deep network to a target video recorded in an environment different from that of the source training set. To address this problem, we propose a novel Online Meta-Learning model with Adaption (OMLA). Our proposal is based on two main contributions. First, to reduce the domain…

    Submitted 17 April, 2019; originally announced April 2019.

    Comments: 12 pages

  13. arXiv:1903.04202

    cs.CV

    Refine and Distill: Exploiting Cycle-Inconsistency and Knowledge Distillation for Unsupervised Monocular Depth Estimation

    Authors: Andrea Pilzer, Stéphane Lathuilière, Nicu Sebe, Elisa Ricci

    Abstract: Nowadays, the majority of state of the art monocular depth estimation techniques are based on supervised deep learning models. However, collecting RGB images with associated depth maps is a very time consuming procedure. Therefore, recent works have proposed deep architectures for addressing the monocular depth prediction task as a reconstruction problem, thus avoiding the need of collecting groun…

    Submitted 20 April, 2019; v1 submitted 11 March, 2019; originally announced March 2019.

    Comments: Accepted at CVPR2019

  14. arXiv:1807.10915

    cs.CV

    Unsupervised Adversarial Depth Estimation using Cycled Generative Networks

    Authors: Andrea Pilzer, Dan Xu, Mihai Marian Puscas, Elisa Ricci, Nicu Sebe

    Abstract: While recent deep monocular depth estimation approaches based on supervised regression have achieved remarkable performance, costly ground truth annotations are required during training. To cope with this issue, in this paper we present a novel unsupervised deep learning approach for predicting depth maps and show that the depth estimation task can be effectively tackled within an adversarial lear…

    Submitted 28 July, 2018; originally announced July 2018.

    Comments: To appear in 3DV 2018. Code is available on GitHub

  15. arXiv:1703.03937

    cs.CV

    Viraliency: Pooling Local Virality

    Authors: Xavier Alameda-Pineda, Andrea Pilzer, Dan Xu, Nicu Sebe, Elisa Ricci

    Abstract: In our overly-connected world, the automatic recognition of virality - the quality of an image or video to be rapidly and widely spread in social networks - is of crucial importance, and has recently awakened the interest of the computer vision community. Concurrently, recent progress in deep learning architectures showed that global pooling strategies allow the extraction of activation maps, which…

    Submitted 15 March, 2017; v1 submitted 11 March, 2017; originally announced March 2017.

    Comments: Accepted at IEEE CVPR 2017