Skip to main content

Showing 1–8 of 8 results for author: Tomè, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19726  [pdf, other

    cs.CV cs.GR cs.LG

    EPOCH: Jointly Estimating the 3D Pose of Cameras and Humans

    Authors: Nicola Garau, Giulia Martinelli, Niccolò Bisagno, Denis Tomè, Carsten Stoll

    Abstract: Monocular Human Pose Estimation (HPE) aims at determining the 3D positions of human joints from a single 2D image captured by a camera. However, a single 2D point in the image may correspond to multiple points in 3D space. Typically, the uniqueness of the 2D-3D relationship is approximated using an orthographic or weak-perspective camera model. In this study, instead of relying on approximations,… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 17 pages, 7 figures

  2. arXiv:2404.10880  [pdf, other

    cs.CV cs.AI

    HumMUSS: Human Motion Understanding using State Space Models

    Authors: Arnab Kumar Mondal, Stefano Alletto, Denis Tome

    Abstract: Understanding human motion from video is essential for a range of applications, including pose estimation, mesh recovery and action recognition. While state-of-the-art methods predominantly rely on transformer-based architectures, these approaches have limitations in practical scenarios. Transformers are slower when sequentially predicting on a continuous stream of frames in real-time, and do not… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: CVPR 24

  3. SelfPose: 3D Egocentric Pose Estimation from a Headset Mounted Camera

    Authors: Denis Tome, Thiemo Alldieck, Patrick Peluse, Gerard Pons-Moll, Lourdes Agapito, Hernan Badino, Fernando De la Torre

    Abstract: We present a solution to egocentric 3D body pose estimation from monocular images captured from downward looking fish-eye cameras installed on the rim of a head mounted VR device. This unusual viewpoint leads to images with unique visual appearance, with severe self-occlusions and perspective distortions that result in drastic differences in resolution between lower and upper body. We propose an e… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: 14 pages. arXiv admin note: substantial text overlap with arXiv:1907.10045

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020

  4. arXiv:1907.10045  [pdf, other

    cs.CV

    xR-EgoPose: Egocentric 3D Human Pose from an HMD Camera

    Authors: Denis Tome, Patrick Peluse, Lourdes Agapito, Hernan Badino

    Abstract: We present a new solution to egocentric 3D body pose estimation from monocular images captured from a downward looking fish-eye camera installed on the rim of a head mounted virtual reality device. This unusual viewpoint, just 2 cm. away from the user's face, leads to images with unique visual appearance, characterized by severe self-occlusions and strong perspective distortions that result in a d… ▽ More

    Submitted 23 July, 2019; originally announced July 2019.

    Comments: ICCV 2019

  5. arXiv:1808.01525  [pdf, other

    cs.CV

    Rethinking Pose in 3D: Multi-stage Refinement and Recovery for Markerless Motion Capture

    Authors: Denis Tome, Matteo Toso, Lourdes Agapito, Chris Russell

    Abstract: We propose a CNN-based approach for multi-camera markerless motion capture of the human body. Unlike existing methods that first perform pose estimation on individual cameras and generate 3D models as post-processing, our approach makes use of 3D reasoning throughout a multi-stage approach. This novelty allows us to use provisional 3D models of human pose to rethink where the joints should be loca… ▽ More

    Submitted 4 August, 2018; originally announced August 2018.

    Comments: International Conference on 3DVision (3dv)

  6. Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image

    Authors: Denis Tome, Chris Russell, Lourdes Agapito

    Abstract: We propose a unified formulation for the problem of 3D human pose estimation from a single raw RGB image that reasons jointly about 2D joint estimation and 3D pose reconstruction to improve both tasks. We take an integrated approach that fuses probabilistic knowledge of 3D human pose with a multi-stage CNN architecture and uses the knowledge of plausible 3D landmark locations to refine the search… ▽ More

    Submitted 11 October, 2017; v1 submitted 1 January, 2017; originally announced January 2017.

    Comments: Paper presented at CVPR 17

  7. Deep convolutional neural networks for pedestrian detection

    Authors: Denis Tomè, Federico Monti, Luca Baroffio, Luca Bondi, Marco Tagliasacchi, Stefano Tubaro

    Abstract: Pedestrian detection is a popular research topic due to its paramount importance for a number of applications, especially in the fields of automotive, surveillance and robotics. Despite the significant improvements, pedestrian detection is still an open challenge that calls for more and more accurate algorithms. In the last few years, deep learning and in particular convolutional neural networks e… ▽ More

    Submitted 7 March, 2016; v1 submitted 13 October, 2015; originally announced October 2015.

    Comments: submitted to Elsevier Signal Processing: Image Communication special Issue on Deep Learning

  8. Maximizing the Link Throughput between Smart-meters and Aggregators as Secondary Users under Power and Outage Constraints

    Authors: Pedro H. J. Nardelli, Mauricio de Castro Tomé, Hirley Alves, Carlos H. M. de Lima, Matti Latva-aho

    Abstract: This paper assesses the communication link from smart meters to aggregators as (unlicensed) secondary users that transmit their data over the (licensed) primary uplink channel. The proposed scenario assumes: (i) meters' and aggregators' positions are fixed so highly directional antennas are employed, (ii) secondary users transmit with limited power in relation to the primary, (iii) meters' transmi… ▽ More

    Submitted 16 February, 2016; v1 submitted 16 June, 2015; originally announced June 2015.