Skip to main content

Showing 1–15 of 15 results for author: Vezzani, R

.
  1. arXiv:2308.12914  [pdf, other

    cs.CV

    3D Pose Nowcasting: Forecast the Future to Improve the Present

    Authors: Alessandro Simoni, Francesco Marchetti, Guido Borghi, Federico Becattini, Lorenzo Seidenari, Roberto Vezzani, Alberto Del Bimbo

    Abstract: Technologies to enable safe and effective collaboration and coexistence between humans and robots have gained significant importance in the last few years. A critical component useful for realizing this collaborative paradigm is the understanding of human and robot 3D poses using non-invasive systems. Therefore, in this paper, we propose a novel vision-based system leveraging depth data to accurat… ▽ More

    Submitted 18 November, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

  2. arXiv:2307.12718  [pdf, other

    cs.CV

    CarPatch: A Synthetic Benchmark for Radiance Field Evaluation on Vehicle Components

    Authors: Davide Di Nucci, Alessandro Simoni, Matteo Tomei, Luca Ciuffreda, Roberto Vezzani, Rita Cucchiara

    Abstract: Neural Radiance Fields (NeRFs) have gained widespread recognition as a highly effective technique for representing 3D reconstructions of objects and scenes derived from sets of images. Despite their efficiency, NeRF models can pose challenges in certain scenarios such as vehicle inspection, where the lack of sufficient data or the presence of challenging elements (e.g. reflections) strongly impact… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: Accepted at ICIAP2023

  3. arXiv:2207.02519  [pdf, other

    cs.CV cs.RO

    Semi-Perspective Decoupled Heatmaps for 3D Robot Pose Estimation from Depth Maps

    Authors: Alessandro Simoni, Stefano Pini, Guido Borghi, Roberto Vezzani

    Abstract: Knowing the exact 3D location of workers and robots in a collaborative environment enables several real applications, such as the detection of unsafe situations or the study of mutual interactions for statistical and social purposes. In this paper, we propose a non-invasive and light-invariant framework based on depth devices and deep neural networks to estimate the 3D pose of robots from an exter… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: IROS2022 and IEEE Robotics and Automation Letters (RA-L). Accepted June, 2022

  4. arXiv:2110.11256  [pdf, other

    cs.CV

    Multi-Category Mesh Reconstruction From Image Collections

    Authors: Alessandro Simoni, Stefano Pini, Roberto Vezzani, Rita Cucchiara

    Abstract: Recently, learning frameworks have shown the capability of inferring the accurate shape, pose, and texture of an object from a single RGB image. However, current methods are trained on image collections of a single category in order to exploit specific priors, and they often make use of category-specific 3D templates. In this paper, we present an alternative approach that infers the textured mesh… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: Accepted at 3DV 2021

  5. arXiv:2106.10980  [pdf, other

    cs.CV cs.LG

    SHREC 2021: Track on Skeleton-based Hand Gesture Recognition in the Wild

    Authors: Ariel Caputo, Andrea Giachetti, Simone Soso, Deborah Pintani, Andrea D'Eusanio, Stefano Pini, Guido Borghi, Alessandro Simoni, Roberto Vezzani, Rita Cucchiara, Andrea Ranieri, Franca Giannini, Katia Lupinetti, Marina Monti, Mehran Maghoumi, Joseph J. LaViola Jr, Minh-Quan Le, Hai-Dang Nguyen, Minh-Triet Tran

    Abstract: Gesture recognition is a fundamental tool to enable novel interaction paradigms in a variety of application scenarios like Mixed Reality environments, touchless public kiosks, entertainment systems, and more. Recognition of hand gestures can be nowadays performed directly from the stream of hand skeletons estimated by software provided by low-cost trackers (Ultraleap) and MR headsets (Hololens, Oc… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

    Comments: 12 pages, to be published on Computers & Graphics

  6. arXiv:1901.08101  [pdf, other

    cs.CV

    Domain Translation with Conditional GANs: from Depth to RGB Face-to-Face

    Authors: Matteo Fabbri, Guido Borghi, Fabio Lanzi, Roberto Vezzani, Simone Calderara, Rita Cucchiara

    Abstract: Can faces acquired by low-cost depth sensors be useful to catch some characteristic details of the face? Typically the answer is no. However, new deep architectures can generate RGB images from data acquired in a different modality, such as depth data. In this paper, we propose a new \textit{Deterministic Conditional GAN}, trained on annotated RGB-D face datasets, effective for a face-to-face tran… ▽ More

    Submitted 23 January, 2019; originally announced January 2019.

    Comments: Accepted at ICPR 2018

  7. arXiv:1812.02041  [pdf, other

    cs.CV

    Learn to See by Events: Color Frame Synthesis from Event and RGB Cameras

    Authors: Stefano Pini, Guido Borghi, Roberto Vezzani

    Abstract: Event cameras are biologically-inspired sensors that gather the temporal evolution of the scene. They capture pixel-wise brightness variations and output a corresponding stream of asynchronous events. Despite having multiple advantages with respect to traditional cameras, their use is partially prevented by the limited applicability of traditional data processing and vision algorithms. To this aim… ▽ More

    Submitted 10 December, 2019; v1 submitted 5 December, 2018; originally announced December 2018.

    Comments: Accepted as full oral at the 15th International Conference on Computer Vision Theory and Applications (VISAPP) 2020

  8. arXiv:1805.11927  [pdf, other

    cs.CV

    Learning to Generate Facial Depth Maps

    Authors: Stefano Pini, Filippo Grazioli, Guido Borghi, Roberto Vezzani, Rita Cucchiara

    Abstract: In this paper, an adversarial architecture for facial depth map estimation from monocular intensity images is presented. By following an image-to-image approach, we combine the advantages of supervised learning and adversarial training, proposing a conditional Generative Adversarial Network that effectively learns to translate intensity face images into the corresponding depth maps. Two public dat… ▽ More

    Submitted 30 May, 2018; originally announced May 2018.

  9. arXiv:1803.08319  [pdf, other

    cs.CV

    Learning to Detect and Track Visible and Occluded Body Joints in a Virtual World

    Authors: Matteo Fabbri, Fabio Lanzi, Simone Calderara, Andrea Palazzi, Roberto Vezzani, Rita Cucchiara

    Abstract: Multi-People Tracking in an open-world setting requires a special effort in precise detection. Moreover, temporal continuity in the detection phase gains more importance when scene cluttering introduces the challenging problems of occluded targets. For the purpose, we propose a deep network architecture that jointly extracts people body parts and associates them across short temporal spans. Our mo… ▽ More

    Submitted 18 September, 2018; v1 submitted 22 March, 2018; originally announced March 2018.

    Comments: Accepted at ECCV 2018

  10. arXiv:1712.05277  [pdf, other

    cs.CV

    Face-from-Depth for Head Pose Estimation on Depth Images

    Authors: Guido Borghi, Matteo Fabbri, Roberto Vezzani, Simone Calderara, Rita Cucchiara

    Abstract: Depth cameras allow to set up reliable solutions for people monitoring and behavior understanding, especially when unstable or poor illumination conditions make unusable common RGB sensors. Therefore, we propose a complete framework for the estimation of the head and shoulder pose based on depth images only. A head detection and localization module is also included, in order to develop a complete… ▽ More

    Submitted 30 August, 2018; v1 submitted 12 December, 2017; originally announced December 2017.

    Comments: Submitted to IEEE Transactions on PAMI, updated version (second round). arXiv admin note: substantial text overlap with arXiv:1611.10195

  11. arXiv:1707.06786  [pdf, other

    cs.CV

    Head Detection with Depth Images in the Wild

    Authors: Diego Ballotta, Guido Borghi, Roberto Vezzani, Rita Cucchiara

    Abstract: Head detection and localization is a demanding task and a key element for many computer vision applications, like video surveillance, Human Computer Interaction and face analysis. The stunning amount of work done for detecting faces on RGB images, together with the availability of huge face datasets, allowed to setup very effective systems on that domain. However, due to illumination issues, infra… ▽ More

    Submitted 8 November, 2017; v1 submitted 21 July, 2017; originally announced July 2017.

    Comments: Accepted as full paper (oral) at VISAPP 2018

  12. arXiv:1703.03624  [pdf, other

    cs.CV

    From Depth Data to Head Pose Estimation: a Siamese approach

    Authors: Marco Venturelli, Guido Borghi, Roberto Vezzani, Rita Cucchiara

    Abstract: The correct estimation of the head pose is a problem of the great importance for many applications. For instance, it is an enabling technology in automotive for driver attention monitoring. In this paper, we tackle the pose estimation problem through a deep learning network working in regression manner. Traditional methods usually rely on visual facial features, such as facial landmarks or nose ti… ▽ More

    Submitted 10 March, 2017; originally announced March 2017.

    Comments: VISAPP 2017. arXiv admin note: text overlap with arXiv:1703.01883

  13. arXiv:1703.02931  [pdf, other

    cs.CV

    Fast Gesture Recognition with Multiple Stream Discrete HMMs on 3D Skeletons

    Authors: Guido Borghi, Roberto Vezzani, Rita Cucchiara

    Abstract: HMMs are widely used in action and gesture recognition due to their implementation simplicity, low computational requirement, scalability and high parallelism. They have worth performance even with a limited training set. All these characteristics are hard to find together in other even more accurate methods. In this paper, we propose a novel double-stage classification approach, based on Multiple… ▽ More

    Submitted 8 March, 2017; originally announced March 2017.

    Comments: Accepted in ICPR 2016

  14. arXiv:1703.01883  [pdf, other

    cs.CV

    Deep Head Pose Estimation from Depth Data for In-car Automotive Applications

    Authors: Marco Venturelli, Guido Borghi, Roberto Vezzani, Rita Cucchiara

    Abstract: Recently, deep learning approaches have achieved promising results in various fields of computer vision. In this paper, we tackle the problem of head pose estimation through a Convolutional Neural Network (CNN). Differently from other proposals in the literature, the described system is able to work directly and based only on raw depth data. Moreover, the head pose estimation is solved as a regres… ▽ More

    Submitted 6 March, 2017; originally announced March 2017.

    Comments: 2nd International Workshop on Understanding Human Activities through 3D Sensors (ICPR 2016)

  15. arXiv:1611.10195  [pdf, other

    cs.CV

    POSEidon: Face-from-Depth for Driver Pose Estimation

    Authors: Guido Borghi, Marco Venturelli, Roberto Vezzani, Rita Cucchiara

    Abstract: Fast and accurate upper-body and head pose estimation is a key task for automatic monitoring of driver attention, a challenging context characterized by severe illumination changes, occlusions and extreme poses. In this work, we present a new deep learning framework for head localization and pose estimation on depth images. The core of the proposal is a regression neural network, called POSEidon,… ▽ More

    Submitted 12 December, 2017; v1 submitted 30 November, 2016; originally announced November 2016.

    Comments: Accepted in Computer Vision and Pattern Recognition (CVPR 2017)