Showing 1–11 of 11 results for author: Planamente, M

  1. Bringing Online Egocentric Action Recognition into the wild

    Authors: Gabriele Goletto, Mirco Planamente, Barbara Caputo, Giuseppe Averta

    Abstract: To enable a safe and effective human-robot cooperation, it is crucial to develop models for the identification of human activities. Egocentric vision seems to be a viable solution to solve this problem, and therefore many works provide deep learning solutions to infer human actions from first person videos. However, although very promising, most of these do not consider the major challenges that c…

    Submitted 9 March, 2023; v1 submitted 5 November, 2022; originally announced November 2022.

    Comments: Accepted to RA-L, for associated video, see https://www.youtube.com/watch?v=7rtynmoYnuw&t=9s

  2. arXiv:2209.04525  [pdf, other]

    cs.CV

    PoliTO-IIT-CINI Submission to the EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition

    Authors: Mirco Planamente, Gabriele Goletto, Gabriele Trivigno, Giuseppe Averta, Barbara Caputo

    Abstract: In this report, we describe the technical details of our submission to the EPIC-Kitchens-100 Unsupervised Domain Adaptation (UDA) Challenge in Action Recognition. To tackle the domain-shift which exists under the UDA setting, we first exploited a recent Domain Generalization (DG) technique, called Relative Norm Alignment (RNA). Secondly, we extended this approach to work on unlabelled target data,…

    Submitted 9 September, 2022; originally announced September 2022.

    Comments: 3rd place in the 2022 EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition. arXiv admin note: substantial text overlap with arXiv:2107.00337

  3. arXiv:2112.03596  [pdf, other]

    cs.CV

    E$^2$(GO)MOTION: Motion Augmented Event Stream for Egocentric Action Recognition

    Authors: Chiara Plizzari, Mirco Planamente, Gabriele Goletto, Marco Cannici, Emanuele Gusso, Matteo Matteucci, Barbara Caputo

    Abstract: Event cameras are novel bio-inspired sensors, which asynchronously capture pixel-level intensity changes in the form of "events". Due to their sensing mechanism, event cameras have little to no motion blur, a very high temporal resolution and require significantly less power and memory than traditional frame-based cameras. These characteristics make them a perfect fit to several real-world applica…

    Submitted 3 April, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: To be presented at CVPR2022

  4. arXiv:2110.10101  [pdf, other]

    cs.CV

    Domain Generalization through Audio-Visual Relative Norm Alignment in First Person Action Recognition

    Authors: Mirco Planamente, Chiara Plizzari, Emanuele Alberti, Barbara Caputo

    Abstract: First person action recognition is becoming an increasingly researched area thanks to the rising popularity of wearable cameras. This is bringing to light cross-domain issues that are yet to be addressed in this context. Indeed, the information extracted from learned representations suffers from an intrinsic "environmental bias". This strongly affects the ability to generalize to unseen scenarios,…

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: Accepted at WACV 2022. arXiv admin note: substantial text overlap with arXiv:2106.01689

  5. arXiv:2107.00337  [pdf, other]

    cs.CV

    PoliTO-IIT Submission to the EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition

    Authors: Chiara Plizzari, Mirco Planamente, Emanuele Alberti, Barbara Caputo

    Abstract: In this report, we describe the technical details of our submission to the EPIC-Kitchens-100 Unsupervised Domain Adaptation (UDA) Challenge in Action Recognition. To tackle the domain-shift which exists under the UDA setting, we first exploited a recent Domain Generalization (DG) technique, called Relative Norm Alignment (RNA). It consists in designing a model able to generalize well to any unseen…

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: 3rd place in the 2021 EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition

  6. arXiv:2106.01689  [pdf, other]

    cs.CV

    Cross-Domain First Person Audio-Visual Action Recognition through Relative Norm Alignment

    Authors: Mirco Planamente, Chiara Plizzari, Emanuele Alberti, Barbara Caputo

    Abstract: First person action recognition is an increasingly researched topic because of the growing popularity of wearable cameras. This is bringing to light cross-domain issues that are yet to be addressed in this context. Indeed, the information extracted from learned representations suffers from an intrinsic environmental bias. This strongly affects the ability to generalize to unseen scenarios, limitin…

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: 11 pages, 7 figures

  7. arXiv:2103.12768  [pdf, other]

    cs.CV

    DA4Event: towards bridging the Sim-to-Real Gap for Event Cameras using Domain Adaptation

    Authors: Mirco Planamente, Chiara Plizzari, Marco Cannici, Marco Ciccone, Francesco Strada, Andrea Bottino, Matteo Matteucci, Barbara Caputo

    Abstract: Event cameras are novel bio-inspired sensors, which asynchronously capture pixel-level intensity changes in the form of "events". The innovative way they acquire data presents several advantages over standard devices, especially in poor lighting and high-speed motion conditions. However, the novelty of these sensors results in the lack of a large amount of training data capable of fully unlocking…

    Submitted 29 October, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: Accepted at IROS21

  8. arXiv:2004.10016  [pdf, other]

    cs.CV cs.RO

    Unsupervised Domain Adaptation through Inter-modal Rotation for RGB-D Object Recognition

    Authors: Mohammad Reza Loghmani, Luca Robbiano, Mirco Planamente, Kiru Park, Barbara Caputo, Markus Vincze

    Abstract: Unsupervised Domain Adaptation (DA) exploits the supervision of a label-rich source dataset to make predictions on an unlabeled target dataset by aligning the two data distributions. In robotics, DA is used to take advantage of automatically generated synthetic data, that come with "free" annotation, to make effective predictions on real data. However, existing DA methods are not designed to cope…

    Submitted 21 April, 2020; originally announced April 2020.

  9. arXiv:2002.03982  [pdf, other]

    cs.CV

    Self-Supervised Joint Encoding of Motion and Appearance for First Person Action Recognition

    Authors: Mirco Planamente, Andrea Bottino, Barbara Caputo

    Abstract: Wearable cameras are becoming more and more popular in several applications, increasing the interest of the research community in developing approaches for recognizing actions from the first-person point of view. An open challenge in egocentric action recognition is that videos lack detailed information about the main actor's pose and thus tend to record only parts of the movement when focusing on…

    Submitted 7 December, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

    Comments: Accepted to ICPR 2020

  10. arXiv:1808.01357  [pdf, other]

    cs.CV cs.LG stat.ML

    A recurrent multi-scale approach to RBG-D Object Recognition

    Authors: Mirco Planamente, Mohammad Reza Loghmani, Barbara Caputo

    Abstract: Technological development aims to produce generations of increasingly efficient robots able to perform complex tasks. This requires considerable efforts, from the scientific community, to find new algorithms that solve computer vision problems, such as object recognition. The diffusion of RGB-D cameras directed the study towards the research of new architectures able to exploit the RGB and Depth i…

    Submitted 5 September, 2018; v1 submitted 31 July, 2018; originally announced August 2018.

    Comments: Master thesis extracted from the paper arXiv:1806.01673 submitted to ACCV 2018

  11. arXiv:1806.01673  [pdf, other]

    cs.CV

    Recurrent Convolutional Fusion for RGB-D Object Recognition

    Authors: Mohammad Reza Loghmani, Mirco Planamente, Barbara Caputo, Markus Vincze

    Abstract: Providing machines with the ability to recognize objects like humans has always been one of the primary goals of machine vision. The introduction of RGB-D cameras has paved the way for a significant leap forward in this direction thanks to the rich information provided by these sensors. However, the machine vision community still lacks an effective method to synergically use the RGB and depth data…

    Submitted 24 February, 2019; v1 submitted 5 June, 2018; originally announced June 2018.

    Comments: Under review at RA-L