Skip to main content

Showing 1–14 of 14 results for author: Cannici, M

.
  1. arXiv:2404.01112   

    cs.CV cs.CG

    Few-shot point cloud reconstruction and denoising via learned Guassian splats renderings and fine-tuned diffusion features

    Authors: Pietro Bonazzi, Marie-Julie Rakatosaona, Marco Cannici, Federico Tombari, Davide Scaramuzza

    Abstract: Existing deep learning methods for the reconstruction and denoising of point clouds rely on small datasets of 3D shapes. We circumvent the problem by leveraging deep learning methods trained on billions of images. We propose a method to reconstruct point clouds from few images and to denoise point clouds from their rendering by exploiting prior knowledge distilled from image-based deep learning mo… ▽ More

    Submitted 23 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: An author was not timely informed before the released submission

  2. arXiv:2403.19780  [pdf, other

    cs.CV

    Mitigating Motion Blur in Neural Radiance Fields with Events and Frames

    Authors: Marco Cannici, Davide Scaramuzza

    Abstract: Neural Radiance Fields (NeRFs) have shown great potential in novel view synthesis. However, they struggle to render sharp images when the data used for training is affected by motion blur. On the other hand, event cameras excel in dynamic scenes as they measure brightness changes with microsecond resolution and are thus only marginally affected by blur. Recent methods attempt to enhance NeRF recon… ▽ More

    Submitted 3 June, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024

  3. arXiv:2312.14261  [pdf, other

    cs.NE

    Low-power event-based face detection with asynchronous neuromorphic hardware

    Authors: Caterina Caccavella, Federico Paredes-Vallés, Marco Cannici, Lyes Khacef

    Abstract: The rise of mobility, IoT and wearables has shifted processing to the edge of the sensors, driven by the need to reduce latency, communication costs and overall energy consumption. While deep learning models have achieved remarkable results in various domains, their deployment at the edge for real-time applications remains computationally expensive. Neuromorphic computing emerges as a promising pa… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  4. A 5-Point Minimal Solver for Event Camera Relative Motion Estimation

    Authors: Ling Gao, Hang Su, Daniel Gehrig, Marco Cannici, Davide Scaramuzza, Laurent Kneip

    Abstract: Event-based cameras are ideal for line-based motion estimation, since they predominantly respond to edges in the scene. However, accurately determining the camera displacement based on events continues to be an open problem. This is because line feature extraction and dynamics estimation are tightly coupled when using event cameras, and no precise model is currently available for describing the co… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

    Journal ref: IEEE/CVF International Conference on Computer Vision (ICCV), 2023

  5. arXiv:2309.09947  [pdf, other

    cs.CV

    End-to-end Learned Visual Odometry with Events and Frames

    Authors: Roberto Pellerito, Marco Cannici, Daniel Gehrig, Joris Belhadj, Olivier Dubois-Matra, Massimo Casasco, Davide Scaramuzza

    Abstract: Visual Odometry (VO) is crucial for autonomous robotic navigation, especially in GPS-denied environments like planetary terrains. To improve robustness, recent model-based VO systems have begun combining standard and event-based cameras. Event cameras excel in low-light and high-speed motion, while standard cameras provide dense and easier-to-track features, even in low-textured areas. However, th… ▽ More

    Submitted 20 March, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 8 pages, 5 figures, 4 tables

  6. arXiv:2306.07050  [pdf, other

    cs.CV

    Revisiting Token Pruning for Object Detection and Instance Segmentation

    Authors: Yifei Liu, Mathias Gehrig, Nico Messikommer, Marco Cannici, Davide Scaramuzza

    Abstract: Vision Transformers (ViTs) have shown impressive performance in computer vision, but their high computational cost, quadratic in the number of tokens, limits their adoption in computation-constrained applications. However, this large number of tokens may not be necessary, as not all tokens are equally important. In this paper, we investigate token pruning to accelerate inference for object detecti… ▽ More

    Submitted 12 December, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

    Journal ref: IEEE Winter Conference on Applications of Computer Vision (WACV 2024)

  7. arXiv:2112.03596  [pdf, other

    cs.CV

    E$^2$(GO)MOTION: Motion Augmented Event Stream for Egocentric Action Recognition

    Authors: Chiara Plizzari, Mirco Planamente, Gabriele Goletto, Marco Cannici, Emanuele Gusso, Matteo Matteucci, Barbara Caputo

    Abstract: Event cameras are novel bio-inspired sensors, which asynchronously capture pixel-level intensity changes in the form of "events". Due to their sensing mechanism, event cameras have little to no motion blur, a very high temporal resolution and require significantly less power and memory than traditional frame-based cameras. These characteristics make them a perfect fit to several real-world applica… ▽ More

    Submitted 3 April, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: To be presented at CVPR2022

  8. Neural Weighted A*: Learning Graph Costs and Heuristics with Differentiable Anytime A*

    Authors: Alberto Archetti, Marco Cannici, Matteo Matteucci

    Abstract: Recently, the trend of incorporating differentiable algorithms into deep learning architectures arose in machine learning research, as the fusion of neural layers and algorithmic layers has been beneficial for handling combinatorial data, such as shortest paths on graphs. Recent works related to data-driven planning aim at learning either cost functions or heuristic functions, but not both. We pro… ▽ More

    Submitted 2 August, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

  9. arXiv:2103.12768  [pdf, other

    cs.CV

    DA4Event: towards bridging the Sim-to-Real Gap for Event Cameras using Domain Adaptation

    Authors: Mirco Planamente, Chiara Plizzari, Marco Cannici, Marco Ciccone, Francesco Strada, Andrea Bottino, Matteo Matteucci, Barbara Caputo

    Abstract: Event cameras are novel bio-inspired sensors, which asynchronously capture pixel-level intensity changes in the form of "events". The innovative way they acquire data presents several advantages over standard devices, especially in poor lighting and high-speed motion conditions. However, the novelty of these sensors results in the lack of a large amount of training data capable of fully unlocking… ▽ More

    Submitted 29 October, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: Accepted at IROS21

  10. Spatial Temporal Transformer Network for Skeleton-based Action Recognition

    Authors: Chiara Plizzari, Marco Cannici, Matteo Matteucci

    Abstract: Skeleton-based human action recognition has achieved a great interest in recent years, as skeleton data has been demonstrated to be robust to illumination changes, body scales, dynamic camera views, and complex background. Nevertheless, an effective encoding of the latent information underlying the 3D skeleton is still an open problem. In this work, we propose a novel Spatial-Temporal Transformer… ▽ More

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: Accepted as ICPRW2020 (FBE2020, Workshop on Facial and Body Expressions, micro-expressions and behavior recognition) 8 pages, 2 figures. arXiv admin note: substantial text overlap with arXiv:2008.07404

    Journal ref: Pattern Recognition. ICPR International Workshops and Challenges. ICPR 2021. Lecture Notes in Computer Science, Springer, vol 12663, 694-701, ISBN: 978-3-030-68796-0

  11. Skeleton-based Action Recognition via Spatial and Temporal Transformer Networks

    Authors: Chiara Plizzari, Marco Cannici, Matteo Matteucci

    Abstract: Skeleton-based Human Activity Recognition has achieved great interest in recent years as skeleton data has demonstrated being robust to illumination changes, body scales, dynamic camera views, and complex background. In particular, Spatial-Temporal Graph Convolutional Networks (ST-GCN) demonstrated to be effective in learning both spatial and temporal dependencies on non-Euclidean data such as ske… ▽ More

    Submitted 22 June, 2021; v1 submitted 17 August, 2020; originally announced August 2020.

    Comments: Accepted at Computer Vision and Image Understanding (CVIU) 12 pages, 8 figures

    Journal ref: Computer Vision and Image Understanding, Volumes 208-209 (2021), 103219, ISSN 1077-3142

  12. arXiv:2001.03455  [pdf, other

    cs.CV

    A Differentiable Recurrent Surface for Asynchronous Event-Based Data

    Authors: Marco Cannici, Marco Ciccone, Andrea Romanoni, Matteo Matteucci

    Abstract: Dynamic Vision Sensors (DVSs) asynchronously stream events in correspondence of pixels subject to brightness changes. Differently from classic vision devices, they produce a sparse representation of the scene. Therefore, to apply standard computer vision algorithms, events need to be integrated into a frame or event-surface. This is usually attained through hand-crafted grids that reconstruct the… ▽ More

    Submitted 31 July, 2020; v1 submitted 10 January, 2020; originally announced January 2020.

    Comments: 23 pages, 6 figures

  13. arXiv:1807.09480  [pdf, other

    cs.CV

    Attention Mechanisms for Object Recognition with Event-Based Cameras

    Authors: Marco Cannici, Marco Ciccone, Andrea Romanoni, Matteo Matteucci

    Abstract: Event-based cameras are neuromorphic sensors capable of efficiently encoding visual information in the form of sparse sequences of events. Being biologically inspired, they are commonly used to exploit some of the computational and power consumption benefits of biological vision. In this paper we focus on a specific feature of vision: visual attention. We propose two attentive models for event bas… ▽ More

    Submitted 18 November, 2018; v1 submitted 25 July, 2018; originally announced July 2018.

    Comments: WACV2019 camera-ready submission

  14. arXiv:1805.07931  [pdf, other

    cs.CV

    Asynchronous Convolutional Networks for Object Detection in Neuromorphic Cameras

    Authors: Marco Cannici, Marco Ciccone, Andrea Romanoni, Matteo Matteucci

    Abstract: Event-based cameras, also known as neuromorphic cameras, are bioinspired sensors able to perceive changes in the scene at high frequency with low power consumption. Becoming available only very recently, a limited amount of work addresses object detection on these devices. In this paper we propose two neural networks architectures for object detection: YOLE, which integrates the events into surfac… ▽ More

    Submitted 13 June, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: accepted at CVPR2019 Event-based Vision Workshop

    MSC Class: 68T45