
Showing 1–10 of 10 results for author: Landi, F

  1. Embodied Navigation at the Art Gallery

    Authors: Roberto Bigazzi, Federico Landi, Silvia Cascianelli, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara

    Abstract: Embodied agents, trained to explore and navigate indoor photorealistic environments, have achieved impressive results on standard datasets and benchmarks. So far, experiments and evaluations have involved domestic and working scenes like offices, flats, and houses. In this paper, we build and release a new 3D space with unique characteristics: the one of a complete art museum. We name this environ…

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: Accepted by 21st International Conference on Image Analysis and Processing (ICIAP 2021)

  2. arXiv:2204.08532  [pdf, other]

    cs.CV cs.AI cs.GR cs.MM

    Dress Code: High-Resolution Multi-Category Virtual Try-On

    Authors: Davide Morelli, Matteo Fincato, Marcella Cornia, Federico Landi, Fabio Cesari, Rita Cucchiara

    Abstract: Image-based virtual try-on strives to transfer the appearance of a clothing item onto the image of a target person. Prior work focuses mainly on upper-body clothes (e.g. t-shirts, shirts, and tops) and neglects full-body or lower-body items. This shortcoming arises from a main factor: current publicly available datasets for image-based virtual try-on do not account for this variety, thus limiting…

    Submitted 13 July, 2022; v1 submitted 18 April, 2022; originally announced April 2022.

    Comments: ECCV 2022 - Video Demo: https://www.youtube.com/watch?v=qr6TW3uTHG4

  3. Spot the Difference: A Novel Task for Embodied Agents in Changing Environments

    Authors: Federico Landi, Roberto Bigazzi, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara

    Abstract: Embodied AI is a recent research area that aims at creating intelligent agents that can move and operate inside an environment. Existing approaches in this field demand the agents to act in completely new and unexplored scenes. However, this setting is far from realistic use cases that instead require executing multiple tasks in the same environment. Even if the environment changes over time, the…

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: Accepted by 26th International Conference on Pattern Recognition (ICPR 2022)

  4. arXiv:2109.08521  [pdf, other]

    cs.RO cs.AI cs.CV

    Focus on Impact: Indoor Exploration with Intrinsic Motivation

    Authors: Roberto Bigazzi, Federico Landi, Silvia Cascianelli, Lorenzo Baraldi, Marcella Cornia, Rita Cucchiara

    Abstract: Exploration of indoor environments has recently experienced a significant interest, also thanks to the introduction of deep neural agents built in a hierarchical fashion and trained with Deep Reinforcement Learning (DRL) on simulated environments. Current state-of-the-art methods employ a dense extrinsic reward that requires the complete a priori knowledge of the layout of the training environment…

    Submitted 4 February, 2022; v1 submitted 14 September, 2021; originally announced September 2021.

    Comments: Published in IEEE Robotics and Automation Letters. To appear in ICRA 2022

    Journal ref: IEEE Robotics and Automation Letters (Volume: 7, Issue: 2, April 2022)

  5. arXiv:2109.00020  [pdf, other]

    cs.LG cs.CL cs.CV cs.NE

    Working Memory Connections for LSTM

    Authors: Federico Landi, Lorenzo Baraldi, Marcella Cornia, Rita Cucchiara

    Abstract: Recurrent Neural Networks with Long Short-Term Memory (LSTM) make use of gating mechanisms to mitigate exploding and vanishing gradients when learning long-term dependencies. For this reason, LSTMs and other gated RNNs are widely adopted, being the standard de facto for many sequence modeling tasks. Although the memory cell inside the LSTM contains essential information, it is not allowed to influ…

    Submitted 31 August, 2021; originally announced September 2021.

    Comments: Accepted for publication in Neural Networks

  6. Out of the Box: Embodied Navigation in the Real World

    Authors: Roberto Bigazzi, Federico Landi, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara

    Abstract: The research field of Embodied AI has witnessed substantial progress in visual navigation and exploration thanks to powerful simulating platforms and the availability of 3D data of indoor and photorealistic environments. These two factors have opened the doors to a new generation of intelligent agents capable of achieving nearly perfect PointGoal Navigation. However, such architectures are commonl…

    Submitted 12 May, 2021; originally announced May 2021.

  7. arXiv:2007.07268  [pdf, other]

    cs.CV cs.AI cs.CL cs.RO

    Explore and Explain: Self-supervised Navigation and Recounting

    Authors: Roberto Bigazzi, Federico Landi, Marcella Cornia, Silvia Cascianelli, Lorenzo Baraldi, Rita Cucchiara

    Abstract: Embodied AI has been recently gaining attention as it aims to foster the development of autonomous and intelligent agents. In this paper, we devise a novel embodied setting in which an agent needs to explore a previously unknown environment while recounting what it sees during the path. In this context, the agent needs to navigate the environment driven by an exploration goal, select proper moment…

    Submitted 14 July, 2020; originally announced July 2020.

    Comments: ICPR 2020

  8. arXiv:1911.12377  [pdf, other]

    cs.CV cs.CL cs.LG

    Multimodal Attention Networks for Low-Level Vision-and-Language Navigation

    Authors: Federico Landi, Lorenzo Baraldi, Marcella Cornia, Massimiliano Corsini, Rita Cucchiara

    Abstract: Vision-and-Language Navigation (VLN) is a challenging task in which an agent needs to follow a language-specified path to reach a target destination. The goal gets even harder as the actions available to the agent get simpler and move towards low-level, atomic interactions with the environment. This setting takes the name of low-level VLN. In this paper, we strive for the creation of an agent able…

    Submitted 30 July, 2021; v1 submitted 27 November, 2019; originally announced November 2019.

    Comments: Computer Vision and Image Understanding (CVIU)

  9. arXiv:1907.02985  [pdf, other]

    cs.CV

    Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters

    Authors: Federico Landi, Lorenzo Baraldi, Massimiliano Corsini, Rita Cucchiara

    Abstract: In Vision-and-Language Navigation (VLN), an embodied agent needs to reach a target destination with the only guidance of a natural language instruction. To explore the environment and progress towards the target location, the agent must perform a series of low-level actions, such as rotate, before stepping ahead. In this paper, we propose to exploit dynamic convolutional filters to encode the visu…

    Submitted 25 September, 2019; v1 submitted 5 July, 2019; originally announced July 2019.

    Comments: BMVC 2019 (Oral). Code is available at https://github.com/aimagelab/DynamicConv-agent

  10. arXiv:1901.10364  [pdf, other]

    cs.CV

    Anomaly Locality in Video Surveillance

    Authors: Federico Landi, Cees G. M. Snoek, Rita Cucchiara

    Abstract: This paper strives for the detection of real-world anomalies such as burglaries and assaults in surveillance videos. Although anomalies are generally local, as they happen in a limited portion of the frame, none of the previous works on the subject has ever studied the contribution of locality. In this work, we explore the impact of considering spatiotemporal tubes instead of whole-frame video seg…

    Submitted 29 January, 2019; originally announced January 2019.

    Comments: Submitted to International Conference on Image Processing, 2019