Skip to main content

Showing 1–8 of 8 results for author: Alletto, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.10880  [pdf, other

    cs.CV cs.AI

    HumMUSS: Human Motion Understanding using State Space Models

    Authors: Arnab Kumar Mondal, Stefano Alletto, Denis Tome

    Abstract: Understanding human motion from video is essential for a range of applications, including pose estimation, mesh recovery and action recognition. While state-of-the-art methods predominantly rely on transformer-based architectures, these approaches have limitations in practical scenarios. Transformers are slower when sequentially predicting on a continuous stream of frames in real-time, and do not… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: CVPR 24

  2. arXiv:2004.00329  [pdf, other

    cs.CV

    Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation

    Authors: Matteo Fabbri, Fabio Lanzi, Simone Calderara, Stefano Alletto, Rita Cucchiara

    Abstract: In this paper we present a novel approach for bottom-up multi-person 3D human pose estimation from monocular RGB images. We propose to use high resolution volumetric heatmaps to model joint locations, devising a simple and effective compression method to drastically reduce the size of this representation. At the core of the proposed method lies our Volumetric Heatmap Autoencoder, a fully-convoluti… ▽ More

    Submitted 1 April, 2020; originally announced April 2020.

    Comments: CVPR 2020

  3. arXiv:2003.01181  [pdf, other

    cs.LG cs.CV stat.ML

    RandomNet: Towards Fully Automatic Neural Architecture Design for Multimodal Learning

    Authors: Stefano Alletto, Shenyang Huang, Vincent Francois-Lavet, Yohei Nakata, Guillaume Rabusseau

    Abstract: Almost all neural architecture search methods are evaluated in terms of performance (i.e. test accuracy) of the model structures that it finds. Should it be the only metric for a good autoML approach? To examine aspects beyond performance, we propose a set of criteria aimed at evaluating the core of autoML problem: the amount of human intervention required to deploy these methods into real world s… ▽ More

    Submitted 2 March, 2020; originally announced March 2020.

    Comments: 6 pages, 1 figures

  4. arXiv:1901.08097  [pdf, other

    cs.CV

    Can Adversarial Networks Hallucinate Occluded People With a Plausible Aspect?

    Authors: Federico Fulgeri, Matteo Fabbri, Stefano Alletto, Simone Calderara, Rita Cucchiara

    Abstract: When you see a person in a crowd, occluded by other persons, you miss visual information that can be used to recognize, re-identify or simply classify him or her. You can imagine its appearance given your experience, nothing more. Similarly, AI solutions can try to hallucinate missing information with specific deep learning architectures, suitably trained with people with and without occlusions. T… ▽ More

    Submitted 23 January, 2019; originally announced January 2019.

    Comments: Under review at CVIU

  5. arXiv:1706.00322   

    cs.CV

    TransFlow: Unsupervised Motion Flow by Joint Geometric and Pixel-level Estimation

    Authors: Stefano Alletto, Davide Abati, Simone Calderara, Rita Cucchiara, Luca Rigazio

    Abstract: We address unsupervised optical flow estimation for ego-centric motion. We argue that optical flow can be cast as a geometrical war** between two successive video frames and devise a deep architecture to estimate such transformation in two stages. First, a dense pixel-level flow is computed with a geometric prior imposing strong spatial constraints. Such prior is typical of driving scenes, where… ▽ More

    Submitted 30 October, 2017; v1 submitted 1 June, 2017; originally announced June 2017.

    Comments: We have found a bug in the flow evaluation code compromising the experimental evaluation and the results provided in the paper are no longer correct. We are currently working on a new experimental campaign but we estimate that results will be available in a few weeks and will drastically change the paper, hence the withdraw request

  6. arXiv:1611.08215  [pdf, other

    cs.CV cs.HC

    Learning Where to Attend Like a Human Driver

    Authors: Andrea Palazzi, Francesco Solera, Simone Calderara, Stefano Alletto, Rita Cucchiara

    Abstract: Despite the advent of autonomous cars, it's likely - at least in the near future - that human attention will still maintain a central role as a guarantee in terms of legal responsibility during the driving task. In this paper we study the dynamics of the driver's gaze and use it as a proxy to understand related attentional mechanisms. First, we build our analysis upon two questions: where and what… ▽ More

    Submitted 9 May, 2017; v1 submitted 24 November, 2016; originally announced November 2016.

    Comments: To appear in IEEE Intelligent Vehicles Symposium 2017

  7. arXiv:1609.09156  [pdf, other

    cs.CV cs.LG

    Similarity Map** with Enhanced Siamese Network for Multi-Object Tracking

    Authors: Minyoung Kim, Stefano Alletto, Luca Rigazio

    Abstract: Multi-object tracking has recently become an important area of computer vision, especially for Advanced Driver Assistance Systems (ADAS). Despite growing attention, achieving high performance tracking is still challenging, with state-of-the- art systems resulting in high complexity with a large number of hyper parameters. In this paper, we focus on reducing overall system complexity and the number… ▽ More

    Submitted 23 January, 2017; v1 submitted 28 September, 2016; originally announced September 2016.

    Comments: 1) accepted as a poster presentation at WiML (Women in Machine Learning) workshop 2016, colocated with NIPS 2016 in Barcelona, Spain, 2) accepted as a poster presentation at MLITS (Machine Learning for Intelligent Transportation Systems) Workshop held in conjunction with the NIPS 2016 in Barcelona, Spain

  8. arXiv:1607.08434  [pdf, other

    cs.CV

    Video Registration in Egocentric Vision under Day and Night Illumination Changes

    Authors: Stefano Alletto, Giuseppe Serra, Rita Cucchiara

    Abstract: With the spread of wearable devices and head mounted cameras, a wide range of application requiring precise user localization is now possible. In this paper we propose to treat the problem of obtaining the user position with respect to a known environment as a video registration problem. Video registration, i.e. the task of aligning an input video sequence to a pre-built 3D model, relies on a matc… ▽ More

    Submitted 28 July, 2016; originally announced July 2016.