Skip to main content

Showing 1–2 of 2 results for author: Ergasti, A

.
  1. arXiv:2407.04287  [pdf, other

    cs.CV cs.AI

    MARS: Paying more attention to visual attributes for text-based person search

    Authors: Alex Ergasti, Tomaso Fontanini, Claudio Ferrari, Massimo Bertozzi, Andrea Prati

    Abstract: Text-based person search (TBPS) is a problem that gained significant interest within the research community. The task is that of retrieving one or more images of a specific individual based on a textual description. The multi-modal nature of the task requires learning representations that bridge text and image data within a shared latent space. Existing TBPS systems face two major challenges. One… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2403.12743  [pdf, other

    cs.CV

    Towards Controllable Face Generation with Semantic Latent Diffusion Models

    Authors: Alex Ergasti, Claudio Ferrari, Tomaso Fontanini, Massimo Bertozzi, Andrea Prati

    Abstract: Semantic Image Synthesis (SIS) is among the most popular and effective techniques in the field of face generation and editing, thanks to its good generation quality and the versatility is brings along. Recent works attempted to go beyond the standard GAN-based framework, and started to explore Diffusion Models (DMs) for this task as these stand out with respect to GANs in terms of both quality and… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.