Skip to main content

Showing 1–6 of 6 results for author: Marza, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.07739  [pdf, other

    cs.CV cs.LG cs.RO

    Task-conditioned adaptation of visual features in multi-task policy learning

    Authors: Pierre Marza, Laetitia Matignon, Olivier Simonin, Christian Wolf

    Abstract: Successfully addressing a wide variety of tasks is a core ability of autonomous agents, requiring flexibly adapting the underlying decision-making strategies and, as we argue in this work, also adapting the perception modules. An analogical argument would be the human visual system, which uses top-down signals to focus attention determined by the current task. Similarly, we adapt pre-trained large… ▽ More

    Submitted 6 May, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  2. arXiv:2304.11241  [pdf, other

    cs.CV cs.LG cs.RO

    AutoNeRF: Training Implicit Scene Representations with Autonomous Agents

    Authors: Pierre Marza, Laetitia Matignon, Olivier Simonin, Dhruv Batra, Christian Wolf, Devendra Singh Chaplot

    Abstract: Implicit representations such as Neural Radiance Fields (NeRF) have been shown to be very effective at novel view synthesis. However, these models typically require manual and careful human data collection for training. In this paper, we present AutoNeRF, a method to collect data required to train NeRFs using autonomous embodied agents. Our method allows an agent to explore an unseen environment e… ▽ More

    Submitted 22 December, 2023; v1 submitted 21 April, 2023; originally announced April 2023.

  3. arXiv:2210.05129  [pdf, other

    cs.CV cs.LG cs.RO

    Multi-Object Navigation with dynamically learned neural implicit representations

    Authors: Pierre Marza, Laetitia Matignon, Olivier Simonin, Christian Wolf

    Abstract: Understanding and map** a new environment are core abilities of any autonomously navigating agent. While classical robotics usually estimates maps in a stand-alone manner with SLAM variants, which maintain a topological or metric representation, end-to-end learning of navigation keeps some form of memory in a neural network. Networks are typically imbued with inductive biases, which can range fr… ▽ More

    Submitted 27 September, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

  4. arXiv:2202.06858  [pdf, other

    cs.CV

    An experimental study of the vision-bottleneck in VQA

    Authors: Pierre Marza, Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf

    Abstract: As in many tasks combining vision and language, both modalities play a crucial role in Visual Question Answering (VQA). To properly solve the task, a given model should both understand the content of the proposed image and the nature of the question. While the fusion between modalities, which is another obviously important part of the problem, has been highly studied, the vision part has received… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  5. arXiv:2107.06011  [pdf, other

    cs.CV cs.LG cs.RO

    Teaching Agents how to Map: Spatial Reasoning for Multi-Object Navigation

    Authors: Pierre Marza, Laetitia Matignon, Olivier Simonin, Christian Wolf

    Abstract: In the context of visual navigation, the capacity to map a novel environment is necessary for an agent to exploit its observation history in the considered place and efficiently reach known goals. This ability can be associated with spatial reasoning, where an agent is able to perceive spatial relationships and regularities, and discover object characteristics. Recent work introduces learnable pol… ▽ More

    Submitted 25 April, 2023; v1 submitted 13 July, 2021; originally announced July 2021.

  6. arXiv:2003.13985  [pdf, other

    cs.CV

    DeepLPF: Deep Local Parametric Filters for Image Enhancement

    Authors: Sean Moran, Pierre Marza, Steven McDonagh, Sarah Parisot, Gregory Slabaugh

    Abstract: Digital artists often improve the aesthetic quality of digital photographs through manual retouching. Beyond global adjustments, professional image editing programs provide local adjustment tools operating on specific parts of an image. Options include parametric (graduated, radial filters) and unconstrained brush tools. These highly expressive tools enable a diverse set of local image enhancement… ▽ More

    Submitted 31 March, 2020; originally announced March 2020.

    Comments: Accepted for publication at CVPR2020