Skip to main content

Showing 1–27 of 27 results for author: Prati, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.18924  [pdf, other

    cs.CV eess.IV

    Swin2-MoSE: A New Single Image Super-Resolution Model for Remote Sensing

    Authors: Leonardo Rossi, Vittorio Bernuzzi, Tomaso Fontanini, Massimo Bertozzi, Andrea Prati

    Abstract: Due to the limitations of current optical and sensor technologies and the high cost of updating them, the spectral and spatial resolution of satellites may not always meet desired requirements. For these reasons, Remote-Sensing Single-Image Super-Resolution (RS-SISR) techniques have gained significant interest. In this paper, we propose Swin2-MoSE model, an enhanced version of Swin2SR. Our model i… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  2. Self-Balanced R-CNN for Instance Segmentation

    Authors: Leonardo Rossi, Akbar Karimi, Andrea Prati

    Abstract: Current state-of-the-art two-stage models on instance segmentation task suffer from several types of imbalances. In this paper, we address the Intersection over the Union (IoU) distribution imbalance of positive input Regions of Interest (RoIs) during the training of the second stage. Our Self-Balanced R-CNN (SBR-CNN), an evolved version of the Hybrid Task Cascade (HTC) model, brings brand new loo… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  3. arXiv:2404.10408  [pdf, other

    cs.CV

    Adversarial Identity Injection for Semantic Face Image Synthesis

    Authors: Giuseppe Tarollo, Tomaso Fontanini, Claudio Ferrari, Guido Borghi, Andrea Prati

    Abstract: Nowadays, deep learning models have reached incredible performance in the task of image generation. Plenty of literature works address the task of face generation and editing, with human and automatic systems that struggle to distinguish what's real from generated. Whereas most systems reached excellent visual generation quality, they still face difficulties in preserving the identity of the start… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Paper accepted at CVPR 2024 Biometrics Workshop

  4. arXiv:2403.12743  [pdf, other

    cs.CV

    Towards Controllable Face Generation with Semantic Latent Diffusion Models

    Authors: Alex Ergasti, Claudio Ferrari, Tomaso Fontanini, Massimo Bertozzi, Andrea Prati

    Abstract: Semantic Image Synthesis (SIS) is among the most popular and effective techniques in the field of face generation and editing, thanks to its good generation quality and the versatility is brings along. Recent works attempted to go beyond the standard GAN-based framework, and started to explore Diffusion Models (DMs) for this task as these stand out with respect to GANs in terms of both quality and… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  5. arXiv:2308.16071  [pdf, other

    cs.CV cs.AI

    Semantic Image Synthesis via Class-Adaptive Cross-Attention

    Authors: Tomaso Fontanini, Claudio Ferrari, Giuseppe Lisanti, Massimo Bertozzi, Andrea Prati

    Abstract: In semantic image synthesis the state of the art is dominated by methods that use customized variants of the SPatially-Adaptive DE-normalization (SPADE) layers, which allow for good visual generation quality and editing versatility. By design, such layers learn pixel-wise modulation parameters to de-normalize the generator activations based on the semantic class each pixel belongs to. Thus, they t… ▽ More

    Submitted 20 February, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: Code and models available at https://github.com/TFonta/CA2SIS

  6. arXiv:2307.05317  [pdf, other

    cs.CV cs.AI

    Automatic Generation of Semantic Parts for Face Image Synthesis

    Authors: Tomaso Fontanini, Claudio Ferrari, Massimo Bertozzi, Andrea Prati

    Abstract: Semantic image synthesis (SIS) refers to the problem of generating realistic imagery given a semantic segmentation mask that defines the spatial layout of object classes. Most of the approaches in the literature, other than the quality of the generated images, put effort in finding solutions to increase the generation diversity in terms of style i.e. texture. However, they all neglect a different… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: Preprint, accepted for publication at ICIAP 2023

  7. arXiv:2302.10719  [pdf, other

    cs.CV cs.AI

    Memory-augmented Online Video Anomaly Detection

    Authors: Leonardo Rossi, Vittorio Bernuzzi, Tomaso Fontanini, Massimo Bertozzi, Andrea Prati

    Abstract: The ability to understand the surrounding scene is of paramount importance for Autonomous Vehicles (AVs). This paper presents a system capable to work in an online fashion, giving an immediate response to the arise of anomalies surrounding the AV, exploiting only the videos captured by a dash-mounted camera. Our architecture, called MOVAD, relies on two main modules: a Short-Term Memory Module to… ▽ More

    Submitted 27 September, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    MSC Class: 68-02; 68-04; 68-06; 68T07; 68T10; 68T45 ACM Class: F.1.1

  8. LDD: A Dataset for Grape Diseases Object Detection and Instance Segmentation

    Authors: Leonardo Rossi, Marco Valenti, Sara Elisabetta Legler, Andrea Prati

    Abstract: The Instance Segmentation task, an extension of the well-known Object Detection task, is of great help in many areas, such as precision agriculture: being able to automatically identify plant organs and the possible diseases associated with them, allows to effectively scale and automate crop monitoring and its diseases control. To address the problem related to early disease detection and diagnosi… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

    Journal ref: International Conference on Image Analysis and Processing. Springer, Cham, 2022

  9. Improving Localization for Semi-Supervised Object Detection

    Authors: Leonardo Rossi, Akbar Karimi, Andrea Prati

    Abstract: Nowadays, Semi-Supervised Object Detection (SSOD) is a hot topic, since, while it is rather easy to collect images for creating a new dataset, labeling them is still an expensive and time-consuming task. One of the successful methods to take advantage of raw images on a Semi-Supervised Learning (SSL) setting is the Mean Teacher technique, where the operations of pseudo-labeling by the Teacher and… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

    Journal ref: International Conference on Image Analysis and Processing. Springer, Cham, 2022

  10. arXiv:2108.13230  [pdf, other

    cs.CL

    AEDA: An Easier Data Augmentation Technique for Text Classification

    Authors: Akbar Karimi, Leonardo Rossi, Andrea Prati

    Abstract: This paper proposes AEDA (An Easier Data Augmentation) technique to help improve the performance on text classification tasks. AEDA includes only random insertion of punctuation marks into the original text. This is an easier technique to implement for data augmentation than EDA method (Wei and Zou, 2019) with which we compare our results. In addition, it keeps the order of the words while changin… ▽ More

    Submitted 30 August, 2021; originally announced August 2021.

    Comments: Accepted at EMNLP 2021 Findings

  11. arXiv:2108.07466  [pdf, other

    cs.CV eess.IV

    Transferring Knowledge with Attention Distillation for Multi-Domain Image-to-Image Translation

    Authors: Runze Li, Tomaso Fontanini, Luca Donati, Andrea Prati, Bir Bhanu

    Abstract: Gradient-based attention modeling has been used widely as a way to visualize and understand convolutional neural networks. However, exploiting these visual explanations during the training of generative adversarial networks (GANs) is an unexplored area in computer vision research. Indeed, we argue that this kind of information can be used to influence GANs training in a positive way. For this reas… ▽ More

    Submitted 17 August, 2021; originally announced August 2021.

    Comments: Preprint

  12. Recursively Refined R-CNN: Instance Segmentation with Self-RoI Rebalancing

    Authors: Leonardo Rossi, Akbar Karimi, Andrea Prati

    Abstract: Within the field of instance segmentation, most of the state-of-the-art deep learning networks rely nowadays on cascade architectures, where multiple object detectors are trained sequentially, re-sampling the ground truth at each step. This offers a solution to the problem of exponentially vanishing positive samples. However, it also translates into an increase in network complexity in terms of th… ▽ More

    Submitted 2 August, 2021; v1 submitted 3 April, 2021; originally announced April 2021.

    ACM Class: I.4.6; I.5.1

    Journal ref: International Conference on Computer Analysis of Images and Patterns. Springer, Cham, 2021

  13. arXiv:2103.09645  [pdf, other

    cs.CL

    UniParma at SemEval-2021 Task 5: Toxic Spans Detection Using CharacterBERT and Bag-of-Words Model

    Authors: Akbar Karimi, Leonardo Rossi, Andrea Prati

    Abstract: With the ever-increasing availability of digital information, toxic content is also on the rise. Therefore, the detection of this type of language is of paramount importance. We tackle this problem utilizing a combination of a state-of-the-art pre-trained language model (CharacterBERT) and a traditional bag-of-words technique. Since the content is full of toxic words that have not been written acc… ▽ More

    Submitted 9 April, 2021; v1 submitted 17 March, 2021; originally announced March 2021.

  14. arXiv:2010.11731  [pdf, other

    cs.CL cs.LG

    Improving BERT Performance for Aspect-Based Sentiment Analysis

    Authors: Akbar Karimi, Leonardo Rossi, Andrea Prati

    Abstract: Aspect-Based Sentiment Analysis (ABSA) studies the consumer opinion on the market products. It involves examining the type of sentiments as well as sentiment targets expressed in product reviews. Analyzing the language used in a review is a difficult task that requires a deep understanding of the language. In recent years, deep language models, such as BERT \cite{devlin2019bert}, have shown great… ▽ More

    Submitted 1 March, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

  15. A novel Region of Interest Extraction Layer for Instance Segmentation

    Authors: Leonardo Rossi, Akbar Karimi, Andrea Prati

    Abstract: Given the wide diffusion of deep neural network architectures for computer vision tasks, several new applications are nowadays more and more feasible. Among them, a particular attention has been recently given to instance segmentation, by exploiting the results achievable by two-stage networks (such as Mask R-CNN or Faster R-CNN), derived from R-CNN. In these complex architectures, a crucial role… ▽ More

    Submitted 1 October, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

    ACM Class: I.4.6; I.5.1

    Journal ref: International Conference on Pattern Recognition (ICPR). IEEE, 2021

  16. arXiv:2001.11316  [pdf, other

    cs.LG cs.CL stat.ML

    Adversarial Training for Aspect-Based Sentiment Analysis with BERT

    Authors: Akbar Karimi, Leonardo Rossi, Andrea Prati

    Abstract: Aspect-Based Sentiment Analysis (ABSA) deals with the extraction of sentiments and their targets. Collecting labeled data for this task in order to help neural networks generalize better can be laborious and time-consuming. As an alternative, similar data to the real-world examples can be produced artificially through an adversarial process which is carried out in the embedding space. Although the… ▽ More

    Submitted 23 October, 2020; v1 submitted 30 January, 2020; originally announced January 2020.

  17. arXiv:1912.02494  [pdf, other

    cs.LG cs.CV stat.ML

    MetalGAN: Multi-Domain Label-Less Image Synthesis Using cGANs and Meta-Learning

    Authors: Tomaso Fontanini, Eleonora Iotti, Luca Donati, Andrea Prati

    Abstract: Image synthesis is currently one of the most addressed image processing topic in computer vision and deep learning fields of study. Researchers have tackled this problem focusing their efforts on its several challenging problems, e.g. image quality and size, domain and pose changing, architecture of the networks, and so on. Above all, producing images belonging to different domains by using a sing… ▽ More

    Submitted 25 June, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

  18. arXiv:1909.07654  [pdf, other

    cs.LG eess.IV stat.ML

    MetalGAN: a Cluster-based Adaptive Training for Few-Shot Adversarial Colorization

    Authors: Tomaso Fontanini, Eleonora Iotti, Andrea Prati

    Abstract: In recent years, the majority of works on deep-learning-based image colorization have focused on how to make a good use of the enormous datasets currently available. What about when the data at disposal are scarce? The main objective of this work is to prove that a network can be trained and can provide excellent colorization results even without a large quantity of data. The adopted approach is a… ▽ More

    Submitted 17 September, 2019; originally announced September 2019.

  19. arXiv:1908.06896  [pdf, other

    cs.CV

    Genetic Algorithms for the Optimization of Diffusion Parameters in Content-Based Image Retrieval

    Authors: Federico Magliani, Laura Sani, Stefano Cagnoni, Andrea Prati

    Abstract: Several computer vision and artificial intelligence projects are nowadays exploiting the manifold data distribution using, e.g., the diffusion process. This approach has produced dramatic improvements on the final performance thanks to the application of such algorithms to the kNN graph. Unfortunately, this recent technique needs a manual configuration of several parameters, thus it is not straigh… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

  20. arXiv:1904.08668  [pdf, other

    cs.CV

    An Efficient Approximate kNN Graph Method for Diffusion on Image Retrieval

    Authors: Federico Magliani, Kevin McGuinness, Eva Mohedano, Andrea Prati

    Abstract: The application of the diffusion in many computer vision and artificial intelligence projects has been shown to give excellent improvements in performance. One of the main bottlenecks of this technique is the quadratic growth of the kNN graph size due to the high-quantity of new connections between nodes in the graph, resulting in long computation times. Several strategies have been proposed to ad… ▽ More

    Submitted 18 April, 2019; originally announced April 2019.

  21. arXiv:1808.05022  [pdf, other

    cs.CV

    A Dense-Depth Representation for VLAD descriptors in Content-Based Image Retrieval

    Authors: Federico Magliani, Tomaso Fontanini, Andrea Prati

    Abstract: The recent advances brought by deep learning allowed to improve the performance in image retrieval tasks. Through the many convolutional layers, available in a Convolutional Neural Network (CNN), it is possible to obtain a hierarchy of features from the evaluated image. At every step, the patches extracted are smaller than the previous levels and more representative. Following this idea, this pape… ▽ More

    Submitted 15 August, 2018; originally announced August 2018.

  22. arXiv:1806.08565  [pdf, other

    cs.CV

    An accurate retrieval through R-MAC+ descriptors for landmark recognition

    Authors: Federico Magliani, Andrea Prati

    Abstract: The landmark recognition problem is far from being solved, but with the use of features extracted from intermediate layers of Convolutional Neural Networks (CNNs), excellent results have been obtained. In this work, we propose some improvements on the creation of R-MAC descriptors in order to make the newly-proposed R-MAC+ descriptors more representative than the previous ones. However, the main c… ▽ More

    Submitted 22 June, 2018; originally announced June 2018.

  23. arXiv:1806.05946  [pdf, other

    cs.CV

    Efficient Nearest Neighbors Search for Large-Scale Landmark Recognition

    Authors: Federico Magliani, Tomaso Fontanini, Andrea Prati

    Abstract: The problem of landmark recognition has achieved excellent results in small-scale datasets. When dealing with large-scale retrieval, issues that were irrelevant with small amount of data, quickly become fundamental for an efficient retrieval phase. In particular, computational time needs to be kept as low as possible, whilst the retrieval accuracy has to be preserved as much as possible. In this p… ▽ More

    Submitted 15 June, 2018; originally announced June 2018.

  24. arXiv:1802.05902  [pdf, other

    cs.CV

    A complete hand-drawn sketch vectorization framework

    Authors: Luca Donati, Simone Cesano, Andrea Prati

    Abstract: Vectorizing hand-drawn sketches is a challenging task, which is of paramount importance for creating CAD vectorized versions for the fashion and creative workflows. This paper proposes a complete framework that automatically transforms noisy and complex hand-drawn sketches with different stroke types in a precise, reliable and highly-simplified vectorized model. The proposed framework includes a n… ▽ More

    Submitted 16 February, 2018; originally announced February 2018.

  25. arXiv:1706.06196  [pdf, other

    cs.CV

    Multi-Target Tracking in Multiple Non-Overlap** Cameras using Constrained Dominant Sets

    Authors: Yonatan Tariku Tesfaye, Eyasu Zemene, Andrea Prati, Marcello Pelillo, Mubarak Shah

    Abstract: In this paper, a unified three-layer hierarchical approach for solving tracking problems in multiple non-overlap** cameras is proposed. Given a video and a set of detections (obtained by any person detector), we first solve within-camera tracking employing the first two layers of our framework and, then, in the third layer, we solve across-camera tracking by merging tracks of the same person in… ▽ More

    Submitted 19 June, 2017; originally announced June 2017.

  26. arXiv:1704.05754  [pdf, other

    cs.CV

    A location-aware embedding technique for accurate landmark recognition

    Authors: Federico Magliani, Navid Mahmoudian Bidgoli, Andrea Prati

    Abstract: The current state of the research in landmark recognition highlights the good accuracy which can be achieved by embedding techniques, such as Fisher vector and VLAD. All these techniques do not exploit spatial information, i.e. consider all the features and the corresponding descriptors without embedding their location in the image. This paper presents a new variant of the well-known VLAD (Vector… ▽ More

    Submitted 19 April, 2017; originally announced April 2017.

    Comments: 6 pages, 5 figures, ICDSC 2017

  27. arXiv:1702.01238  [pdf, other

    cs.CV

    Large-scale Image Geo-Localization Using Dominant Sets

    Authors: Eyasu Zemene, Yonatan Tariku, Haroon Idrees, Andrea Prati, Marcello Pelillo, Mubarak Shah

    Abstract: This paper presents a new approach for the challenging problem of geo-locating an image using image matching in a structured database of city-wide reference images with known GPS coordinates. We cast the geo-localization as a clustering problem on local image features. Akin to existing approaches on the problem, our framework builds on low-level features which allow partial matching between images… ▽ More

    Submitted 14 September, 2017; v1 submitted 4 February, 2017; originally announced February 2017.