Skip to main content

Showing 1–3 of 3 results for author: Amoroso, R

.
  1. arXiv:2404.06542  [pdf, other

    cs.CV

    Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation

    Authors: Luca Barsellotti, Roberto Amoroso, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara

    Abstract: Open-vocabulary semantic segmentation aims at segmenting arbitrary categories expressed in textual form. Previous works have trained over large amounts of image-caption pairs to enforce pixel-level multimodal alignments. However, captions provide global information about the semantics of a given image but lack direct localization of individual concepts. Further, training on large-scale datasets in… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: CVPR 2024. Project page: https://aimagelab.github.io/freeda/

  2. arXiv:2306.07346  [pdf, other

    cs.CV cs.AI cs.MM

    Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training

    Authors: Lorenzo Baraldi, Roberto Amoroso, Marcella Cornia, Lorenzo Baraldi, Andrea Pilzer, Rita Cucchiara

    Abstract: The use of self-supervised pre-training has emerged as a promising approach to enhance the performance of visual tasks such as image classification. In this context, recent approaches have employed the Masked Image Modeling paradigm, which pre-trains a backbone by reconstructing visual tokens associated with randomly masked image patches. This masking approach, however, introduces noise into the i… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  3. arXiv:2304.00500  [pdf, other

    cs.CV cs.AI cs.MM

    Parents and Children: Distinguishing Multimodal DeepFakes from Natural Images

    Authors: Roberto Amoroso, Davide Morelli, Marcella Cornia, Lorenzo Baraldi, Alberto Del Bimbo, Rita Cucchiara

    Abstract: Recent advancements in diffusion models have enabled the generation of realistic deepfakes from textual prompts in natural language. While these models have numerous benefits across various sectors, they have also raised concerns about the potential misuse of fake images and cast new pressures on fake image detection. In this work, we pioneer a systematic study on deepfake detection generated by s… ▽ More

    Submitted 21 May, 2024; v1 submitted 2 April, 2023; originally announced April 2023.

    Comments: ACM Transactions on Multimedia Computing, Communications and Applications (2024)