Skip to main content

Showing 1–11 of 11 results for author: SanMiguel, J C

.
  1. arXiv:2407.01327  [pdf, other

    cs.CV cs.LG

    Gradient-based Class Weighting for Unsupervised Domain Adaptation in Dense Prediction Visual Tasks

    Authors: Roberto Alcover-Couso, Marcos Escudero-Viñolo, Juan C. SanMiguel, Jesus Bescós

    Abstract: In unsupervised domain adaptation (UDA), where models are trained on source data (e.g., synthetic) and adapted to target data (e.g., real-world) without target annotations, addressing the challenge of significant class imbalance remains an open issue. Despite considerable progress in bridging the domain gap, existing methods often experience performance degradation when confronted with highly imba… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2403.14291  [pdf, other

    cs.CV

    Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models

    Authors: Pablo Marcos-Manchón, Roberto Alcover-Couso, Juan C. SanMiguel, Jose M. Martínez

    Abstract: Diffusion models represent a new paradigm in text-to-image generation. Beyond generating high-quality images from text prompts, models such as Stable Diffusion have been successfully extended to the joint generation of semantic segmentation pseudo-masks. However, current extensions primarily rely on extracting attentions linked to prompt words used for image synthesis. This approach limits the gen… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Journal ref: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)

  3. arXiv:2309.15478  [pdf, other

    cs.CV cs.LG

    The Robust Semantic Segmentation UNCV2023 Challenge Results

    Authors: Xuanlong Yu, Yi Zuo, Zitao Wang, Xiaowen Zhang, Jiaxuan Zhao, Yuting Yang, Licheng Jiao, Rui Peng, Xinyi Wang, Junpei Zhang, Kexin Zhang, Fang Liu, Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Hanlin Tian, Kenta Matsui, Tianhao Wang, Fahmy Adan, Zhitong Gao, Xuming He, Quentin Bouniot, Hossein Moghaddam, Shyam Nandan Rai, Fabio Cermelli , et al. (12 additional authors not shown)

    Abstract: This paper outlines the winning solutions employed in addressing the MUAD uncertainty quantification challenge held at ICCV 2023. The challenge was centered around semantic segmentation in urban environments, with a particular focus on natural adversarial scenarios. The report presents the results of 19 submitted entries, with numerous techniques drawing inspiration from cutting-edge uncertainty q… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: 11 pages, 4 figures, accepted at ICCV 2023 UNCV workshop

  4. arXiv:2302.13961  [pdf, other

    cs.CV

    Soft labelling for semantic segmentation: Bringing coherence to label down-sampling

    Authors: Roberto Alcover-Couso, Marcos Escudero-Vinolo, Juan C. SanMiguel, Jose M. Martinez

    Abstract: In semantic segmentation, training data down-sampling is commonly performed due to limited resources, the need to adapt image size to the model input, or improve data augmentation. This down-sampling typically employs different strategies for the image data and the annotated labels. Such discrepancy leads to mismatches between the down-sampled color and label images. Hence, the training performanc… ▽ More

    Submitted 19 February, 2024; v1 submitted 27 February, 2023; originally announced February 2023.

  5. Detection-aware multi-object tracking evaluation

    Authors: Juan C. SanMiguel, Jorge Muñoz, Fabio Poiesi

    Abstract: How would you fairly evaluate two multi-object tracking algorithms (i.e. trackers), each one employing a different object detector? Detectors keep improving, thus trackers can make less effort to estimate object states over time. Is it then fair to compare a new tracker employing a new detector with another tracker using an old detector? In this paper, we propose a novel performance measure, named… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

    Comments: This paper was accepted at IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)

  6. arXiv:2205.01997  [pdf, other

    cs.CV

    Attention-based Knowledge Distillation in Multi-attention Tasks: The Impact of a DCT-driven Loss

    Authors: Alejandro López-Cifuentes, Marcos Escudero-Viñolo, Jesús Bescós, Juan C. SanMiguel

    Abstract: Knowledge Distillation (KD) is a strategy for the definition of a set of transferability gangways to improve the efficiency of Convolutional Neural Networks. Feature-based Knowledge Distillation is a subfield of KD that relies on intermediate network representations, either unaltered or depth-reduced via maximum activation maps, as the source knowledge. In this paper, we propose and analyse the us… ▽ More

    Submitted 6 June, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: Preprint under review in TCSVT Journal

  7. arXiv:2201.06311  [pdf, other

    cs.CV

    Graph Neural Networks for Cross-Camera Data Association

    Authors: Elena Luna, Juan C. SanMiguel, José M. Martínez, Pablo Carballeira

    Abstract: Cross-camera image data association is essential for many multi-camera computer vision tasks, such as multi-camera pedestrian detection, multi-camera multi-target tracking, 3D pose estimation, etc. This association task is typically stated as a bipartite graph matching problem and often solved by applying minimum-cost flow techniques, which may be computationally inefficient with large data. Furth… ▽ More

    Submitted 17 January, 2022; originally announced January 2022.

  8. arXiv:2112.12086  [pdf, other

    cs.CV

    Improved skin lesion recognition by a Self-Supervised Curricular Deep Learning approach

    Authors: Kirill Sirotkin, Marcos Escudero-Viñolo, Pablo Carballeira, Juan Carlos SanMiguel

    Abstract: State-of-the-art deep learning approaches for skin lesion recognition often require pretraining on larger and more varied datasets, to overcome the generalization limitations derived from the reduced size of the skin lesion imaging datasets. ImageNet is often used as the pretraining dataset, but its transferring potential is hindered by the domain gap between the source dataset and the target derm… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: 11 pages, 8 figures, submitted to the Journal of Biomedical and Health Informatics (Special Issue on Skin Image Analysis in the Age of Deep Learning)

  9. arXiv:2109.09702  [pdf, other

    eess.IV cs.CV

    Deep Anomaly Generation: An Image Translation Approach of Synthesizing Abnormal Banded Chromosome Images

    Authors: Lukas Uzolas, Javier Rico, Pierrick Coupé, Juan C. SanMiguel, György Cserey

    Abstract: Advances in deep-learning-based pipelines have led to breakthroughs in a variety of microscopy image diagnostics. However, a sufficiently big training data set is usually difficult to obtain due to high annotation costs. In the case of banded chromosome images, the creation of big enough libraries is difficult for multiple pathologies due to the rarity of certain genetic disorders. Generative Adve… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: 8 pages, 4 figures, 2 tables

    MSC Class: I.2.1 Artificial Intelligence; Applications and Expert Systems; Medicine and Science

  10. arXiv:2102.04091  [pdf, other

    cs.CV

    Online Clustering-based Multi-Camera Vehicle Tracking in Scenarios with overlap** FOVs

    Authors: Elena Luna, Juan C. SanMiguel, Jose M. Martínez, Marcos Escudero-Viñolo

    Abstract: Multi-Target Multi-Camera (MTMC) vehicle tracking is an essential task of visual traffic monitoring, one of the main research fields of Intelligent Transportation Systems. Several offline approaches have been proposed to address this task; however, they are not compatible with real-world applications due to their high latency and post-processing requirements. In this paper, we present a new low-la… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

    Comments: 10 pages

  11. arXiv:1904.11256  [pdf, other

    cs.CV

    On guiding video object segmentation

    Authors: Diego Ortego, Kevin McGuinness, Juan C. SanMiguel, Eric Arazo, José M. Martínez, Noel E. O'Connor

    Abstract: This paper presents a novel approach for segmenting moving objects in unconstrained environments using guided convolutional neural networks. This guiding process relies on foreground masks from independent algorithms (i.e. state-of-the-art algorithms) to implement an attention mechanism that incorporates the spatial location of foreground and background to compute their separated representations.… ▽ More

    Submitted 25 April, 2019; originally announced April 2019.