Skip to main content

Showing 1–9 of 9 results for author: Uricchio, T

Searching in archive cs. Search in all archives.
.
  1. Exploiting CLIP-based Multi-modal Approach for Artwork Classification and Retrieval

    Authors: Alberto Baldrati, Marco Bertini, Tiberio Uricchio, Alberto Del Bimbo

    Abstract: Given the recent advances in multimodal image pretraining where visual models trained with semantically dense textual supervision tend to have better generalization capabilities than those trained using categorical attributes or through unsupervised techniques, in this work we investigate how recent CLIP model can be applied in several tasks in artwork domain. We perform exhaustive experiments on… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: Proc. of Florence Heri-Tech 2022: The Future of Heritage Science and Technologies: ICT and Digital Heritage, 2022

  2. arXiv:2308.11485  [pdf, other

    cs.CV

    Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features

    Authors: Alberto Baldrati, Marco Bertini, Tiberio Uricchio, Alberto del Bimbo

    Abstract: Given a query composed of a reference image and a relative caption, the Composed Image Retrieval goal is to retrieve images visually similar to the reference one that integrates the modifications expressed by the caption. Given that recent research has demonstrated the efficacy of large-scale vision and language pre-trained (VLP) models in various tasks, we rely on features from the OpenAI CLIP mo… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted in ACM Transactions on Multimedia Computing Communications and Applications (TOMM)

  3. Learning advisor networks for noisy image classification

    Authors: Simone Ricci, Tiberio Uricchio, Alberto Del Bimbo

    Abstract: In this paper, we introduced the novel concept of advisor network to address the problem of noisy labels in image classification. Deep neural networks (DNN) are prone to performance reduction and overfitting problems on training data with noisy annotations. Weighting loss methods aim to mitigate the influence of noisy labels during the training, completely removing their contribution. This discard… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: Paper published as Poster at ICIAP21

    Journal ref: ICIAP 2022

  4. Learning Group Activities from Skeletons without Individual Action Labels

    Authors: Fabio Zappardino, Tiberio Uricchio, Lorenzo Seidenari, Alberto Del Bimbo

    Abstract: To understand human behavior we must not just recognize individual actions but model possibly complex group activity and interactions. Hierarchical models obtain the best results in group activity recognition but require fine grained individual action annotations at the actor level. In this paper we show that using only skeletal data we can train a state-of-the art end-to-end system using only gro… ▽ More

    Submitted 14 May, 2021; originally announced May 2021.

    Comments: ICPR 2020

  5. arXiv:2004.09695  [pdf, other

    cs.CV

    Image Retrieval using Multi-scale CNN Features Pooling

    Authors: Federico Vaccaro, Marco Bertini, Tiberio Uricchio, Alberto Del Bimbo

    Abstract: In this paper, we address the problem of image retrieval by learning images representation based on the activations of a Convolutional Neural Network. We present an end-to-end trainable network architecture that exploits a novel multi-scale local pooling based on NetVLAD and a triplet mining procedure based on samples difficulty to obtain an effective image representation. Extensive experiments sh… ▽ More

    Submitted 24 April, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

    Comments: Accepted at ICMR 2020

  6. arXiv:1706.01788  [pdf, other

    cs.MM cs.CR

    Localization of JPEG double compression through multi-domain convolutional neural networks

    Authors: Irene Amerini, Tiberio Uricchio, Lamberto Ballan, Roberto Caldelli

    Abstract: When an attacker wants to falsify an image, in most of cases she/he will perform a JPEG recompression. Different techniques have been developed based on diverse theoretical assumptions but very effective solutions have not been developed yet. Recently, machine learning based approaches have been started to appear in the field of image forensics to solve diverse tasks such as acquisition source ide… ▽ More

    Submitted 6 June, 2017; originally announced June 2017.

    Comments: Accepted to CVPRW 2017, Workshop on Media Forensics

  7. arXiv:1705.01781  [pdf, other

    cs.CV

    Am I Done? Predicting Action Progress in Videos

    Authors: Federico Becattini, Tiberio Uricchio, Lorenzo Seidenari, Lamberto Ballan, Alberto Del Bimbo

    Abstract: In this paper we deal with the problem of predicting action progress in videos. We argue that this is an extremely important task since it can be valuable for a wide range of interaction applications. To this end we introduce a novel approach, named ProgressNet, capable of predicting when an action takes place in a video, where it is located within the frames, and how far it has progressed during… ▽ More

    Submitted 9 March, 2020; v1 submitted 4 May, 2017; originally announced May 2017.

  8. Automatic Image Annotation via Label Transfer in the Semantic Space

    Authors: Tiberio Uricchio, Lamberto Ballan, Lorenzo Seidenari, Alberto Del Bimbo

    Abstract: Automatic image annotation is among the fundamental problems in computer vision and pattern recognition, and it is becoming increasingly important in order to develop algorithms that are able to search and browse large-scale image collections. In this paper, we propose a label propagation framework based on Kernel Canonical Correlation Analysis (KCCA), which builds a latent semantic space where co… ▽ More

    Submitted 1 June, 2017; v1 submitted 16 May, 2016; originally announced May 2016.

    Comments: To appear in Pattern Recognition

  9. arXiv:1503.08248  [pdf, other

    cs.IR cs.CV cs.MM cs.SI

    Socializing the Semantic Gap: A Comparative Survey on Image Tag Assignment, Refinement and Retrieval

    Authors: Xirong Li, Tiberio Uricchio, Lamberto Ballan, Marco Bertini, Cees G. M. Snoek, Alberto Del Bimbo

    Abstract: Where previous reviews on content-based image retrieval emphasize on what can be seen in an image to bridge the semantic gap, this survey considers what people tag about an image. A comprehensive treatise of three closely linked problems, i.e., image tag assignment, refinement, and tag-based image retrieval is presented. While existing works vary in terms of their targeted tasks and methodology, t… ▽ More

    Submitted 23 March, 2016; v1 submitted 27 March, 2015; originally announced March 2015.

    Comments: to appear in ACM Computing Surveys

    ACM Class: H.3.1; H.3.3

    Journal ref: ACM Computing Surveys, Volume 49 Issue 1, 14:1-14:39, June 2016