
Showing 1–15 of 15 results for author: Bertozzi, M

  1. arXiv:2407.04287  [pdf, other]

    cs.CV cs.AI

    MARS: Paying more attention to visual attributes for text-based person search

    Authors: Alex Ergasti, Tomaso Fontanini, Claudio Ferrari, Massimo Bertozzi, Andrea Prati

    Abstract: Text-based person search (TBPS) is a problem that has gained significant interest within the research community. The task is to retrieve one or more images of a specific individual based on a textual description. The multi-modal nature of the task requires learning representations that bridge text and image data within a shared latent space. Existing TBPS systems face two major challenges. One…

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2404.18924  [pdf, other]

    cs.CV eess.IV

    Swin2-MoSE: A New Single Image Super-Resolution Model for Remote Sensing

    Authors: Leonardo Rossi, Vittorio Bernuzzi, Tomaso Fontanini, Massimo Bertozzi, Andrea Prati

    Abstract: Due to the limitations of current optical and sensor technologies and the high cost of updating them, the spectral and spatial resolution of satellites may not always meet desired requirements. For these reasons, Remote-Sensing Single-Image Super-Resolution (RS-SISR) techniques have gained significant interest. In this paper, we propose the Swin2-MoSE model, an enhanced version of Swin2SR. Our model i…

    Submitted 29 April, 2024; originally announced April 2024.

  3. arXiv:2403.12743  [pdf, other]

    cs.CV

    Towards Controllable Face Generation with Semantic Latent Diffusion Models

    Authors: Alex Ergasti, Claudio Ferrari, Tomaso Fontanini, Massimo Bertozzi, Andrea Prati

    Abstract: Semantic Image Synthesis (SIS) is among the most popular and effective techniques in the field of face generation and editing, thanks to its good generation quality and the versatility it brings along. Recent works attempted to go beyond the standard GAN-based framework and started to explore Diffusion Models (DMs) for this task, as these stand out with respect to GANs in terms of both quality and…

    Submitted 19 March, 2024; originally announced March 2024.

  4. arXiv:2312.17561  [pdf, other]

    cs.CV cs.AI cs.LG

    Informative Rays Selection for Few-Shot Neural Radiance Fields

    Authors: Marco Orsingher, Anthony Dell'Eva, Paolo Zani, Paolo Medici, Massimo Bertozzi

    Abstract: Neural Radiance Fields (NeRF) have recently emerged as a powerful method for image-based 3D reconstruction, but the lengthy per-scene optimization limits their practical usage, especially in resource-constrained settings. Existing approaches solve this issue by reducing the number of input views and regularizing the learned volumetric representation with either complex losses or additional inputs…

    Submitted 29 December, 2023; originally announced December 2023.

    Comments: To appear at VISAPP 2024

  5. arXiv:2309.16009  [pdf, ps, other]

    math.SG math.RT

    Floer potentials, cluster algebras and quiver representations

    Authors: Peter Albers, Maria Bertozzi, Markus Reineke

    Abstract: We use cluster algebras to interpret Floer potentials of monotone Lagrangian tori in toric del Pezzo surfaces as cluster characters of quiver representations.

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 14 pages

    MSC Class: 53D12 (Primary) 13F60; 16G20 (Secondary)

  6. arXiv:2308.16071  [pdf, other]

    cs.CV cs.AI

    Semantic Image Synthesis via Class-Adaptive Cross-Attention

    Authors: Tomaso Fontanini, Claudio Ferrari, Giuseppe Lisanti, Massimo Bertozzi, Andrea Prati

    Abstract: In semantic image synthesis the state of the art is dominated by methods that use customized variants of the SPatially-Adaptive DE-normalization (SPADE) layers, which allow for good visual generation quality and editing versatility. By design, such layers learn pixel-wise modulation parameters to de-normalize the generator activations based on the semantic class each pixel belongs to. Thus, they t…

    Submitted 20 February, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: Code and models available at https://github.com/TFonta/CA2SIS

  7. arXiv:2307.05317  [pdf, other]

    cs.CV cs.AI

    Automatic Generation of Semantic Parts for Face Image Synthesis

    Authors: Tomaso Fontanini, Claudio Ferrari, Massimo Bertozzi, Andrea Prati

    Abstract: Semantic image synthesis (SIS) refers to the problem of generating realistic imagery given a semantic segmentation mask that defines the spatial layout of object classes. Beyond the quality of the generated images, most approaches in the literature put effort into finding solutions that increase generation diversity in terms of style, i.e., texture. However, they all neglect a different…

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: Preprint, accepted for publication at ICIAP 2023

  8. arXiv:2302.10719  [pdf, other]

    cs.CV cs.AI

    Memory-augmented Online Video Anomaly Detection

    Authors: Leonardo Rossi, Vittorio Bernuzzi, Tomaso Fontanini, Massimo Bertozzi, Andrea Prati

    Abstract: The ability to understand the surrounding scene is of paramount importance for Autonomous Vehicles (AVs). This paper presents a system capable of working in an online fashion, responding immediately to anomalies arising around the AV by exploiting only the videos captured by a dash-mounted camera. Our architecture, called MOVAD, relies on two main modules: a Short-Term Memory Module to…

    Submitted 27 September, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    MSC Class: 68-02; 68-04; 68-06; 68T07; 68T10; 68T45

    ACM Class: F.1.1

  9. arXiv:2210.13041  [pdf, other]

    cs.CV

    Learning Neural Radiance Fields from Multi-View Geometry

    Authors: Marco Orsingher, Paolo Zani, Paolo Medici, Massimo Bertozzi

    Abstract: We present a framework, called MVG-NeRF, that combines classical Multi-View Geometry algorithms and Neural Radiance Fields (NeRF) for image-based 3D reconstruction. NeRF has revolutionized the field of implicit 3D representations, mainly due to a differentiable volumetric rendering formulation that enables high-quality and geometry-aware novel view synthesis. However, the underlying geometry of th…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: ECCV 2022 Workshop on "Learning to Generate 3D Shapes and Scenes"

  10. arXiv:2208.05274  [pdf, other]

    cs.CV

    Arbitrary Point Cloud Upsampling with Spherical Mixture of Gaussians

    Authors: Anthony Dell'Eva, Marco Orsingher, Massimo Bertozzi

    Abstract: Generating dense point clouds from sparse raw data benefits downstream 3D understanding tasks, but existing models are limited to a fixed upsampling ratio or to a short range of integer values. In this paper, we present APU-SMOG, a Transformer-based model for Arbitrary Point cloud Upsampling (APU). The sparse input is first mapped to a Spherical Mixture of Gaussians (SMOG) distribution, from whi…

    Submitted 10 January, 2023; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: Accepted to 3DV 2022 (Oral)

  11. arXiv:2207.08439  [pdf, other]

    cs.CV

    Revisiting PatchMatch Multi-View Stereo for Urban 3D Reconstruction

    Authors: Marco Orsingher, Paolo Zani, Paolo Medici, Massimo Bertozzi

    Abstract: In this paper, a complete pipeline for image-based 3D reconstruction of urban scenarios is proposed, based on PatchMatch Multi-View Stereo (MVS). Input images are first fed into an off-the-shelf visual SLAM system to extract camera poses and sparse keypoints, which are used to initialize PatchMatch optimization. Then, pixelwise depths and normals are iteratively computed in a multi-scale framewo…

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: Poster presentation at IEEE Intelligent Vehicles Symposium (IV 2022, https://iv2022.com/)

  12. Efficient View Clustering and Selection for City-Scale 3D Reconstruction

    Authors: Marco Orsingher, Paolo Zani, Paolo Medici, Massimo Bertozzi

    Abstract: Image datasets have been steadily growing in size, harming the feasibility and efficiency of large-scale 3D reconstruction methods. In this paper, a novel approach for scaling Multi-View Stereo (MVS) algorithms up to arbitrarily large collections of images is proposed. Specifically, the problem of reconstructing the 3D model of an entire city is targeted, starting from a set of videos acquired by…

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: Oral presentation at ICIAP 2021 (https://www.iciap2021.org/)

  13. arXiv:2109.04468  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    Leveraging Local Domains for Image-to-Image Translation

    Authors: Anthony Dell'Eva, Fabio Pizzati, Massimo Bertozzi, Raoul de Charette

    Abstract: Image-to-image (i2i) networks struggle to capture local changes because they do not affect the global scene structure. For example, when translating from highway scenes to offroad, i2i networks easily focus on global color features but ignore traits that are obvious to humans, such as the absence of lane markings. In this paper, we leverage human knowledge about spatial domain characteristics which we refer to as…

    Submitted 14 February, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: VISAPP 2022 Best Paper Award

  14. arXiv:2010.08567  [pdf, other]

    math.SG

    Infinite staircases for Hirzebruch surfaces

    Authors: Maria Bertozzi, Tara S. Holm, Emily Maw, Dusa McDuff, Grace T. Mwakyoma, Ana Rita Pires, Morgan Weiler

    Abstract: We consider the embedding capacity functions $c_{H_b}(z)$ for symplectic embeddings of ellipsoids of eccentricity $z$ into the family of nontrivial rational Hirzebruch surfaces $H_b$ with symplectic form parametrized by $b \in [0,1)$. This function was known to have an infinite staircase in the monotone cases ($b = 0$ and $b = 1/3$). It is also known that for each $b$ there is at most one value of…

    Submitted 19 April, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

    Comments: 90 pages, 12 figures. Version 2 has several typos fixed and numbering changed to match style in to-be-published version

    MSC Class: Primary: 53D05. Secondary: 53D35; 11A55; 53D42; 53-04

  15. arXiv:2001.08071  [pdf, ps, other]

    math.SG math.RT

    Momentum map images of representation spaces of quivers

    Authors: Maria Bertozzi, Markus Reineke

    Abstract: We consider the base change action on real or complex representation spaces of quivers and the associated momentum map for a maximal compact subgroup of the base change group, as introduced by A. King. We give an explicit description of the momentum map image in terms of recursively defined inequalities on eigenvalues of Hermitian operators. Moreover, we characterize when the momentum map image is…

    Submitted 14 May, 2020; v1 submitted 22 January, 2020; originally announced January 2020.

    Comments: 14 pages; corrected definition of height function, corrected examples, clarified relation to work of Baldoni-Vergne-Walter