Showing 1–15 of 15 results for author: Francesca, G

  1. arXiv:2405.04079  [pdf, other]

    cs.RO

    Leveraging swarm capabilities to assist other systems

    Authors: Miquel Kegeleirs, David Garzón Ramos, Guillermo Legarda Herranz, Ilyes Gharbi, Jeanne Szpirer, Ken Hasselmann, Lorenzo Garattoni, Gianpiero Francesca, Mauro Birattari

    Abstract: Most studies in swarm robotics treat the swarm as an isolated system of interest. We argue that the prevailing view of swarms as self-sufficient, independent systems limits the scope of potential applications for swarm robotics. A robot swarm could act as a support in an heterogeneous system comprising other robots and/or human operators, in particular by quickly providing access to a large amount…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Presented at the "Breaking swarm stereotypes" workshop at ICRA 2024

  2. arXiv:2308.14500  [pdf, other]

    cs.CV

    LAC: Latent Action Composition for Skeleton-based Action Segmentation

    Authors: Di Yang, Yaohui Wang, Antitza Dantcheva, Quan Kong, Lorenzo Garattoni, Gianpiero Francesca, Francois Bremond

    Abstract: Skeleton-based action segmentation requires recognizing composable actions in untrimmed videos. Current approaches decouple this problem by first extracting local visual features from skeleton sequences and then processing them by a temporal model to classify frame-wise actions. However, their performances remain limited as the visual features cannot sufficiently express composable actions. In thi…

    Submitted 21 February, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: ICCV 2023

  3. arXiv:2308.11358  [pdf, other]

    cs.CV cs.AI cs.LG

    How Much Temporal Long-Term Context is Needed for Action Segmentation?

    Authors: Emad Bahrami, Gianpiero Francesca, Juergen Gall

    Abstract: Modeling long-term context in videos is crucial for many fine-grained tasks including temporal action segmentation. An interesting question that is still open is how much long-term temporal context is needed for optimal performance. While transformers can model the long-term context of a video, this becomes computationally prohibitive for long videos. Recent works on temporal action segmentation t…

    Submitted 25 September, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: ICCV 2023

  4. arXiv:2305.16126  [pdf, other]

    cs.RO cs.MA

    Automatic off-line design of robot swarms: exploring the transferability of control software and design methods across different platforms

    Authors: Miquel Kegeleirs, David Garzón Ramos, Lorenzo Garattoni, Gianpiero Francesca, Mauro Birattari

    Abstract: Automatic off-line design is an attractive approach to implementing robot swarms. In this approach, a designer specifies a mission for the swarm, and an optimization process generates suitable control software for the individual robots through computer-based simulations. Most relevant literature has focused on effectively transferring control software from simulation to physical robots. For the fi…

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: ICRA 2023 Transferability in Robotics Workshop

  5. arXiv:2305.06437  [pdf, other]

    cs.CV cs.AI

    Self-Supervised Video Representation Learning via Latent Time Navigation

    Authors: Di Yang, Yaohui Wang, Quan Kong, Antitza Dantcheva, Lorenzo Garattoni, Gianpiero Francesca, Francois Bremond

    Abstract: Self-supervised video representation learning aimed at maximizing similarity between different temporal segments of one video, in order to enforce feature persistence over time. This leads to loss of pertinent information related to temporal relationships, rendering actions such as 'enter' and 'leave' to be indistinguishable. To mitigate this limitation, we propose Latent Time Navigation (LTN), a…

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: AAAI 2023

  6. arXiv:2301.07923  [pdf]

    cs.CV

    Human-Scene Network: A Novel Baseline with Self-rectifying Loss for Weakly supervised Video Anomaly Detection

    Authors: Snehashis Majhi, Rui Dai, Quan Kong, Lorenzo Garattoni, Gianpiero Francesca, Francois Bremond

    Abstract: Video anomaly detection in surveillance systems with only video-level labels (i.e. weakly-supervised) is challenging. This is due to, (i) the complex integration of human and scene based anomalies comprising of subtle and sharp spatio-temporal cues in real-world scenarios, (ii) non-optimal optimization between normal and anomaly instances under weak supervision. In this paper, we propose a Human-S…

    Submitted 19 January, 2023; originally announced January 2023.

  7. arXiv:2210.06501  [pdf, other]

    cs.CV

    Robust Action Segmentation from Timestamp Supervision

    Authors: Yaser Souri, Yazan Abu Farha, Emad Bahrami, Gianpiero Francesca, Juergen Gall

    Abstract: Action segmentation is the task of predicting an action label for each frame of an untrimmed video. As obtaining annotations to train an approach for action segmentation in a fully supervised way is expensive, various approaches have been proposed to train action segmentation models using different forms of weak supervision, e.g., action transcripts, action sets, or more recently timestamps. Times…

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: BMVC 2022

  8. arXiv:2209.00065  [pdf, other]

    cs.CV

    ViA: View-invariant Skeleton Action Representation Learning via Motion Retargeting

    Authors: Di Yang, Yaohui Wang, Antitza Dantcheva, Lorenzo Garattoni, Gianpiero Francesca, Francois Bremond

    Abstract: Current self-supervised approaches for skeleton action representation learning often focus on constrained scenarios, where videos and skeleton data are recorded in laboratory settings. When dealing with estimated skeleton data in real-world videos, such methods perform poorly due to the large variations across subjects and camera viewpoints. To address this issue, we introduce ViA, a novel View-In…

    Submitted 31 August, 2022; originally announced September 2022.

    Comments: project website: https://walker-a11y.github.io/ViA-project

  9. arXiv:2110.14392  [pdf, other]

    cs.CV

    TaylorSwiftNet: Taylor Driven Temporal Modeling for Swift Future Frame Prediction

    Authors: Saber Pourheydari, Emad Bahrami, Mohsen Fayyaz, Gianpiero Francesca, Mehdi Noroozi, Juergen Gall

    Abstract: While recurrent neural networks (RNNs) demonstrate outstanding capabilities for future video frame prediction, they model dynamics in a discrete time space, i.e., they predict the frames sequentially with a fixed temporal step. RNNs are therefore prone to accumulate the error as the number of future frames increases. In contrast, partial differential equations (PDEs) model physical phenomena like…

    Submitted 12 October, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: BMVC 2022

  10. arXiv:2108.03894  [pdf, other]

    cs.CV cs.LG

    FIFA: Fast Inference Approximation for Action Segmentation

    Authors: Yaser Souri, Yazan Abu Farha, Fabien Despinoy, Gianpiero Francesca, Juergen Gall

    Abstract: We introduce FIFA, a fast approximate inference method for action segmentation and alignment. Unlike previous approaches, FIFA does not rely on expensive dynamic programming for inference. Instead, it uses an approximate differentiable energy function that can be minimized using gradient-descent. FIFA is a general approach that can replace exact inference improving its speed by more than 5 times w…

    Submitted 9 August, 2021; originally announced August 2021.

  11. arXiv:2107.08580  [pdf, other]

    cs.CV

    UNIK: A Unified Framework for Real-world Skeleton-based Action Recognition

    Authors: Di Yang, Yaohui Wang, Antitza Dantcheva, Lorenzo Garattoni, Gianpiero Francesca, Francois Bremond

    Abstract: Action recognition based on skeleton data has recently witnessed increasing attention and progress. State-of-the-art approaches adopting Graph Convolutional networks (GCNs) can effectively extract features on human skeletons relying on the pre-defined human topology. Despite associated progress, GCN-based methods have difficulties to generalize across domains, especially with different human topol…

    Submitted 18 July, 2021; originally announced July 2021.

    Comments: Code is available at: https://github.com/YangDi666/UNIK

  12. arXiv:2011.05358  [pdf, other]

    cs.CV

    Selective Spatio-Temporal Aggregation Based Pose Refinement System: Towards Understanding Human Activities in Real-World Videos

    Authors: Di Yang, Rui Dai, Yaohui Wang, Rupayan Mallick, Luca Minciullo, Gianpiero Francesca, Francois Bremond

    Abstract: Taking advantage of human pose data for understanding human activities has attracted much attention these days. However, state-of-the-art pose estimators struggle in obtaining high-quality 2D or 3D pose data due to occlusion, truncation and low-resolution in real-world un-annotated videos. Hence, in this work, we propose 1) a Selective Spatio-Temporal Aggregation mechanism, named SST-A, that refin…

    Submitted 10 November, 2020; originally announced November 2020.

    Comments: WACV 2021

  13. arXiv:2010.14982  [pdf]

    cs.CV

    Toyota Smarthome Untrimmed: Real-World Untrimmed Videos for Activity Detection

    Authors: Rui Dai, Srijan Das, Saurav Sharma, Luca Minciullo, Lorenzo Garattoni, Francois Bremond, Gianpiero Francesca

    Abstract: Designing activity detection systems that can be successfully deployed in daily-living environments requires datasets that pose the challenges typical of real-world scenarios. In this paper, we introduce a new untrimmed daily-living dataset that features several real-world challenges: Toyota Smarthome Untrimmed (TSU). TSU contains a wide variety of activities performed in a spontaneous manner. The…

    Submitted 10 June, 2022; v1 submitted 28 October, 2020; originally announced October 2020.

    Comments: Toyota Smarthome Untrimmed dataset, project page: https://project.inria.fr/toyotasmarthome

  14. arXiv:1904.03116  [pdf, other]

    cs.CV cs.LG

    Fast Weakly Supervised Action Segmentation Using Mutual Consistency

    Authors: Yaser Souri, Mohsen Fayyaz, Luca Minciullo, Gianpiero Francesca, Juergen Gall

    Abstract: Action segmentation is the task of predicting the actions for each frame of a video. As obtaining the full annotation of videos for action segmentation is expensive, weakly supervised approaches that can learn only from transcripts are appealing. In this paper, we propose a novel end-to-end approach for weakly supervised action segmentation based on a two-branch neural network. The two branches of…

    Submitted 10 June, 2021; v1 submitted 5 April, 2019; originally announced April 2019.

    Comments: Accepted for publication at TPAMI (IEEE Transactions on Pattern Analysis and Machine Intelligence) in 2021. First two authors contributed equally

  15. arXiv:1802.00421  [pdf, other]

    cs.CV

    Deep-Temporal LSTM for Daily Living Action Recognition

    Authors: Srijan Das, Michal Koperski, Francois Bremond, Gianpiero Francesca

    Abstract: In this paper, we propose to improve the traditional use of RNNs by employing a many to many model for video classification. We analyze the importance of modeling spatial layout and temporal encoding for daily living action recognition. Many RGB methods focus only on short term temporal information obtained from optical flow. Skeleton based methods on the other hand show that modeling long term sk…

    Submitted 15 June, 2018; v1 submitted 1 February, 2018; originally announced February 2018.

    Comments: Submitted in conference