Skip to main content

Showing 1–18 of 18 results for author: Seidenari, L

.
  1. arXiv:2310.20650  [pdf, other

    cs.CV cs.RO

    Addressing Limitations of State-Aware Imitation Learning for Autonomous Driving

    Authors: Luca Cultrera, Federico Becattini, Lorenzo Seidenari, Pietro Pala, Alberto Del Bimbo

    Abstract: Conditional Imitation learning is a common and effective approach to train autonomous driving agents. However, two issues limit the full potential of this approach: (i) the inertia problem, a special case of causal confusion where the agent mistakenly correlates low speed with no acceleration, and (ii) low correlation between offline and online performance due to the accumulation of small errors t… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: Submitted to IEEE Transactions on Intelligent Vehicles

  2. arXiv:2310.20621  [pdf, other

    cs.CV

    Deepfake detection by exploiting surface anomalies: the SurFake approach

    Authors: Andrea Ciamarra, Roberto Caldelli, Federico Becattini, Lorenzo Seidenari, Alberto Del Bimbo

    Abstract: The ever-increasing use of synthetically generated content in different sectors of our everyday life, one for all media information, poses a strong need for deepfake detection tools in order to avoid the proliferation of altered messages. The process to identify manipulated content, in particular images and videos, is basically performed by looking for the presence of some inconsistencies and/or a… ▽ More

    Submitted 17 April, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

  3. arXiv:2310.20593  [pdf, other

    cs.CV

    FLODCAST: Flow and Depth Forecasting via Multimodal Recurrent Architectures

    Authors: Andrea Ciamarra, Federico Becattini, Lorenzo Seidenari, Alberto Del Bimbo

    Abstract: Forecasting motion and spatial positions of objects is of fundamental importance, especially in safety-critical settings such as autonomous driving. In this work, we address the issue by forecasting two different modalities that carry complementary information, namely optical flow and depth. To this end we propose FLODCAST a flow and depth forecasting model that leverages a multitask recurrent arc… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: Submitted to Pattern Recognition

  4. DiffDefense: Defending against Adversarial Attacks via Diffusion Models

    Authors: Hondamunige Prasanna Silva, Lorenzo Seidenari, Alberto Del Bimbo

    Abstract: This paper presents a novel reconstruction method that leverages Diffusion Models to protect machine learning classifiers against adversarial attacks, all without requiring any modifications to the classifiers themselves. The susceptibility of machine learning models to minor input perturbations renders them vulnerable to adversarial attacks. While diffusion-based methods are typically disregarded… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: Paper published at ICIAP23

    Journal ref: ICIAP 2023

  5. arXiv:2308.12914  [pdf, other

    cs.CV

    3D Pose Nowcasting: Forecast the Future to Improve the Present

    Authors: Alessandro Simoni, Francesco Marchetti, Guido Borghi, Federico Becattini, Lorenzo Seidenari, Roberto Vezzani, Alberto Del Bimbo

    Abstract: Technologies to enable safe and effective collaboration and coexistence between humans and robots have gained significant importance in the last few years. A critical component useful for realizing this collaborative paradigm is the understanding of human and robot 3D poses using non-invasive systems. Therefore, in this paper, we propose a novel vision-based system leveraging depth data to accurat… ▽ More

    Submitted 18 November, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

  6. Forecasting Future Instance Segmentation with Learned Optical Flow and War**

    Authors: Andrea Ciamarra, Federico Becattini, Lorenzo Seidenari, Alberto Del Bimbo

    Abstract: For an autonomous vehicle it is essential to observe the ongoing dynamics of a scene and consequently predict imminent future scenarios to ensure safety to itself and others. This can be done using different sensors and modalities. In this paper we investigate the usage of optical flow for predicting future semantic segmentations. To do so we propose a model that forecasts flow fields autoregressi… ▽ More

    Submitted 6 September, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: Paper published as Poster at ICIAP21

    Journal ref: ICIAP 2022

  7. arXiv:2206.03086  [pdf, other

    cs.CV

    Online Deep Clustering with Video Track Consistency

    Authors: Alessandra Alfani, Federico Becattini, Lorenzo Seidenari, Alberto Del Bimbo

    Abstract: Several unsupervised and self-supervised approaches have been developed in recent years to learn visual features from large-scale unlabeled datasets. Their main drawback however is that these methods are hardly able to recognize visual features of the same object if it is simply rotated or the perspective of the camera changes. To overcome this limitation and at the same time exploit a useful sour… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

    Comments: Accepted at ICPR2022 as oral

  8. arXiv:2203.12446  [pdf, other

    cs.CV

    SMEMO: Social Memory for Trajectory Forecasting

    Authors: Francesco Marchetti, Federico Becattini, Lorenzo Seidenari, Alberto Del Bimbo

    Abstract: Effective modeling of human interactions is of utmost importance when forecasting behaviors such as future trajectories. Each individual, with its motion, influences surrounding agents since everyone obeys to social non-written rules such as collision avoidance or group following. In this paper we model such interactions, which constantly evolve through time, by looking at the problem from an algo… ▽ More

    Submitted 18 February, 2024; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: Accepted for publication in IEEE Transaction on Pattern Analysis and Machine Intelligence (PAMI)

  9. Learning Group Activities from Skeletons without Individual Action Labels

    Authors: Fabio Zappardino, Tiberio Uricchio, Lorenzo Seidenari, Alberto Del Bimbo

    Abstract: To understand human behavior we must not just recognize individual actions but model possibly complex group activity and interactions. Hierarchical models obtain the best results in group activity recognition but require fine grained individual action annotations at the actor level. In this paper we show that using only skeletal data we can train a state-of-the art end-to-end system using only gro… ▽ More

    Submitted 14 May, 2021; originally announced May 2021.

    Comments: ICPR 2020

  10. arXiv:2010.08948  [pdf, other

    cs.CV cs.RO

    Multiple Future Prediction Leveraging Synthetic Trajectories

    Authors: Lorenzo Berlincioni, Federico Becattini, Lorenzo Seidenari, Alberto Del Bimbo

    Abstract: Trajectory prediction is an important task, especially in autonomous driving. The ability to forecast the position of other moving agents can yield to an effective planning, ensuring safety for the autonomous vehicle as well for the observed entities. In this work we propose a data driven approach based on Markov Chains to generate synthetic trajectories, which are useful for training a multiple f… ▽ More

    Submitted 18 October, 2020; originally announced October 2020.

    Comments: Accepted at ICPR2020

  11. arXiv:2006.03347  [pdf, other

    cs.CV

    Explaining Autonomous Driving by Learning End-to-End Visual Attention

    Authors: Luca Cultrera, Lorenzo Seidenari, Federico Becattini, Pietro Pala, Alberto Del Bimbo

    Abstract: Current deep learning based autonomous driving approaches yield impressive results also leading to in-production deployment in certain controlled scenarios. One of the most popular and fascinating approaches relies on learning vehicle controls directly from data perceived by sensors. This end-to-end learning paradigm can be applied both in classical supervised settings and using reinforcement lear… ▽ More

    Submitted 5 June, 2020; originally announced June 2020.

    Comments: accepted at CVPR20 Workshop on Safe Artificial Intelligence for Automated Driving (SAIAD20)

  12. arXiv:2006.03340  [pdf, other

    cs.CV

    MANTRA: Memory Augmented Networks for Multiple Trajectory Prediction

    Authors: Francesco Marchetti, Federico Becattini, Lorenzo Seidenari, Alberto Del Bimbo

    Abstract: Autonomous vehicles are expected to drive in complex scenarios with several independent non cooperating agents. Path planning for safely navigating in such environments can not just rely on perceiving present location and motion of other agents. It requires instead to predict such variables in a far enough future. In this paper we address the problem of multimodal trajectory prediction exploiting… ▽ More

    Submitted 3 June, 2021; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: Accepted at CVPR20

  13. arXiv:1910.04056  [pdf, other

    cs.LG cs.CL stat.ML

    Text-to-Image Synthesis Based on Machine Generated Captions

    Authors: Marco Menardi, Alex Falcon, Saida S. Mohamed, Lorenzo Seidenari, Giuseppe Serra, Alberto Del Bimbo, Carlo Tasso

    Abstract: Text to Image Synthesis refers to the process of automatic generation of a photo-realistic image starting from a given text and is revolutionizing many real-world applications. In order to perform such process it is necessary to exploit datasets containing captioned images, meaning that each image is associated with one (or more) captions describing it. Despite the abundance of uncaptioned images… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

  14. arXiv:1805.11746  [pdf, other

    cs.CV

    Semantic Road Layout Understanding by Generative Adversarial Inpainting

    Authors: Lorenzo Berlincioni, Federico Becattini, Leonardo Galteri, Lorenzo Seidenari, Alberto Del Bimbo

    Abstract: Autonomous driving is becoming a reality, yet vehicles still need to rely on complex sensor fusion to understand the scene they act in. The ability to discern static environment and dynamic entities provides a comprehension of the road layout that poses constraints to the reasoning process about moving objects. We pursue this through a GAN-based semantic segmentation inpainting model to remove all… ▽ More

    Submitted 20 November, 2018; v1 submitted 29 May, 2018; originally announced May 2018.

  15. arXiv:1705.01781  [pdf, other

    cs.CV

    Am I Done? Predicting Action Progress in Videos

    Authors: Federico Becattini, Tiberio Uricchio, Lorenzo Seidenari, Lamberto Ballan, Alberto Del Bimbo

    Abstract: In this paper we deal with the problem of predicting action progress in videos. We argue that this is an extremely important task since it can be valuable for a wide range of interaction applications. To this end we introduce a novel approach, named ProgressNet, capable of predicting when an action takes place in a video, where it is located within the frames, and how far it has progressed during… ▽ More

    Submitted 9 March, 2020; v1 submitted 4 May, 2017; originally announced May 2017.

  16. arXiv:1704.02518  [pdf, other

    cs.CV

    Deep Generative Adversarial Compression Artifact Removal

    Authors: Leonardo Galteri, Lorenzo Seidenari, Marco Bertini, Alberto Del Bimbo

    Abstract: Compression artifacts arise in images whenever a lossy compression algorithm is applied. These artifacts eliminate details present in the original image, or add noise and small structures; because of these effects they make images less pleasant for the human eye, and may also lead to decreased performance of computer vision algorithms such as object detectors. To eliminate such artifacts, when dec… ▽ More

    Submitted 6 December, 2017; v1 submitted 8 April, 2017; originally announced April 2017.

    Comments: ICCV 2017 Camera Ready + Acknowledgements

  17. arXiv:1609.00221  [pdf, other

    cs.CV

    Segmentation Free Object Discovery in Video

    Authors: Giovanni Cuffaro, Federico Becattini, Claudio Baecchi, Lorenzo Seidenari, Alberto Del Bimbo

    Abstract: In this paper we present a simple yet effective approach to extend without supervision any object proposal from static images to videos. Unlike previous methods, these spatio-temporal proposals, to which we refer as tracks, are generated relying on little or no visual content by only exploiting bounding boxes spatial correlations through time. The tracks that we obtain are likely to represent obje… ▽ More

    Submitted 1 September, 2016; originally announced September 2016.

  18. Automatic Image Annotation via Label Transfer in the Semantic Space

    Authors: Tiberio Uricchio, Lamberto Ballan, Lorenzo Seidenari, Alberto Del Bimbo

    Abstract: Automatic image annotation is among the fundamental problems in computer vision and pattern recognition, and it is becoming increasingly important in order to develop algorithms that are able to search and browse large-scale image collections. In this paper, we propose a label propagation framework based on Kernel Canonical Correlation Analysis (KCCA), which builds a latent semantic space where co… ▽ More

    Submitted 1 June, 2017; v1 submitted 16 May, 2016; originally announced May 2016.

    Comments: To appear in Pattern Recognition