Showing 1–8 of 8 results for author: Coscia, P

  1. arXiv:2204.11561  [pdf, other]

    cs.CV cs.AI cs.LG

    Goal-driven Self-Attentive Recurrent Networks for Trajectory Prediction

    Authors: Luigi Filippo Chiara, Pasquale Coscia, Sourav Das, Simone Calderara, Rita Cucchiara, Lamberto Ballan

    Abstract: Human trajectory forecasting is a key component of autonomous vehicles, social-aware robots and advanced video-surveillance applications. This challenging task typically requires knowledge about past motion, the environment and likely destination areas. In this context, multi-modality is a fundamental aspect and its effective modeling can be beneficial to any architecture. Inferring accurate traje…

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted to CVPR 2022 Precognition Workshop

  2. arXiv:2203.04781  [pdf, other]

    cs.CV cs.AI

    How many Observations are Enough? Knowledge Distillation for Trajectory Forecasting

    Authors: Alessio Monti, Angelo Porrello, Simone Calderara, Pasquale Coscia, Lamberto Ballan, Rita Cucchiara

    Abstract: Accurate prediction of future human positions is an essential task for modern video-surveillance systems. Current state-of-the-art models usually rely on a "history" of past tracked locations (e.g., 3 to 5 seconds) to predict a plausible sequence of future locations (e.g., up to the next 5 seconds). We feel that this common schema neglects critical traits of realistic applications: as the collecti…

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: Accepted by CVPR 2022

  3. arXiv:2109.00829  [pdf, other]

    cs.CV cs.AI cs.LG

    SlowFast Rolling-Unrolling LSTMs for Action Anticipation in Egocentric Videos

    Authors: Nada Osman, Guglielmo Camporese, Pasquale Coscia, Lamberto Ballan

    Abstract: Action anticipation in egocentric videos is a difficult task due to the inherently multi-modal nature of human actions. Additionally, some actions happen faster or slower than others depending on the actor or surrounding context which could vary each time and lead to different predictions. Based on this idea, we build upon RULSTM architecture, which is specifically designed for anticipating human…

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: Accepted to EPIC@ICCV 2021

  4. AC-VRNN: Attentive Conditional-VRNN for Multi-Future Trajectory Prediction

    Authors: Alessia Bertugli, Simone Calderara, Pasquale Coscia, Lamberto Ballan, Rita Cucchiara

    Abstract: Anticipating human motion in crowded scenarios is essential for developing intelligent transportation systems, social-aware robots and advanced video surveillance applications. A key component of this task is represented by the inherently multi-modal nature of human paths which makes socially acceptable multiple futures when human interactions are involved. To this end, we propose a generative arc…

    Submitted 8 July, 2021; v1 submitted 17 May, 2020; originally announced May 2020.

    Comments: Accepted at Computer Vision and Image Understanding (CVIU)

  5. arXiv:2004.07711  [pdf, other]

    cs.CV cs.LG

    Knowledge Distillation for Action Anticipation via Label Smoothing

    Authors: Guglielmo Camporese, Pasquale Coscia, Antonino Furnari, Giovanni Maria Farinella, Lamberto Ballan

    Abstract: Human capability to anticipate near future from visual observations and non-verbal cues is essential for developing intelligent systems that need to interact with people. Several research areas, such as human-robot interaction (HRI), assisted living or autonomous driving need to foresee future events to avoid crashes or help people. Egocentric scenarios are classic examples where action anticipati…

    Submitted 18 December, 2020; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: Accepted to ICPR 2020

  6. arXiv:1910.05770  [pdf, other]

    cs.CV cs.LG cs.MM

    A CNN-RNN Framework for Image Annotation from Visual Cues and Social Network Metadata

    Authors: Tobia Tesan, Pasquale Coscia, Lamberto Ballan

    Abstract: Images represent a commonly used form of visual communication among people. Nevertheless, image classification may be a challenging task when dealing with unclear or non-common images needing more context to be correctly annotated. Metadata accompanying images on social-media represent an ideal source of additional information for retrieving proper neighborhoods easing image annotation task. To th…

    Submitted 30 March, 2020; v1 submitted 13 October, 2019; originally announced October 2019.

  7. arXiv:1909.08840  [pdf, other]

    cs.CV

    Social and Scene-Aware Trajectory Prediction in Crowded Spaces

    Authors: Matteo Lisotto, Pasquale Coscia, Lamberto Ballan

    Abstract: Mimicking human ability to forecast future positions or interpret complex interactions in urban scenarios, such as streets, shopping malls or squares, is essential to develop socially compliant robots or self-driving cars. Autonomous systems may gain advantage on anticipating human motion to avoid collisions or to naturally behave alongside people. To foresee plausible trajectories, we construct a…

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: Accepted to ICCV 2019 Workshop on Assistive Computer Vision and Robotics (ACVR)

  8. arXiv:1604.02032  [pdf, other]

    cs.CV

    3-D Hand Pose Estimation from Kinect's Point Cloud Using Appearance Matching

    Authors: Pasquale Coscia, Francesco A. N. Palmieri, Francesco Castaldo, Alberto Cavallo

    Abstract: We present a novel appearance-based approach for pose estimation of a human hand using the point clouds provided by the low-cost Microsoft Kinect sensor. Both the free-hand case, in which the hand is isolated from the surrounding environment, and the hand-object case, in which the different types of interactions are classified, have been considered. The hand-object case is clearly the most challen…

    Submitted 7 April, 2016; originally announced April 2016.