Skip to main content

Showing 1–5 of 5 results for author: Villalonga, G

.
  1. arXiv:2405.00242  [pdf, other

    cs.CV cs.AI

    Guiding Attention in End-to-End Driving Models

    Authors: Diego Porres, Yi Xiao, Gabriel Villalonga, Alexandre Levy, Antonio M. López

    Abstract: Vision-based end-to-end driving models trained by imitation learning can lead to affordable solutions for autonomous driving. However, training these well-performing models usually requires a huge amount of data, while still lacking explicit and intuitive activation maps to reveal the inner workings of these models while driving. In this paper, we study how to guide the attention of these models t… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: Accepted for publication at the 35th IEEE Intelligent Vehicles Symposium (IV 2024)

  2. Co-Training for Unsupervised Domain Adaptation of Semantic Segmentation Models

    Authors: Jose L. Gómez, Gabriel Villalonga, Antonio M. López

    Abstract: Semantic image segmentation is a central and challenging task in autonomous driving, addressed by training deep models. Since this training draws to a curse of human-based image labeling, using synthetic images with automatically generated labels together with unlabeled real-world images is a promising alternative. This implies to address an unsupervised domain adaptation (UDA) problem. In this pa… ▽ More

    Submitted 30 January, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: Code available at https://github.com/JoseLGomez/Co-training_SemSeg_UDA. Paper accepted on Sensors at https://www.mdpi.com/1424-8220/23/2/621

    Journal ref: Sensors, Special Issue Machine Learning for Autonomous Driving Perception and Prediction (2023)

  3. Co-training for Deep Object Detection: Comparing Single-modal and Multi-modal Approaches

    Authors: Jose L. Gómez, Gabriel Villalonga, Antonio M. López

    Abstract: Top-performing computer vision models are powered by convolutional neural networks (CNNs). Training an accurate CNN highly depends on both the raw sensor data and their associated ground truth (GT). Collecting such GT is usually done through human labeling, which is time-consuming and does not scale as we wish. This data labeling bottleneck may be intensified due to domain shifts among image senso… ▽ More

    Submitted 23 April, 2021; originally announced April 2021.

    Report number: sensors-1185064

    Journal ref: special issue of Sensors (ISSN 1424-8220) "Feature Papers in Physical Sensors Section 2020"

  4. Co-training for On-board Deep Object Detection

    Authors: Gabriel Villalonga, Antonio M. Lopez

    Abstract: Providing ground truth supervision to train visual models has been a bottleneck over the years, exacerbated by domain shifts which degenerate the performance of such models. This was the case when visual tasks relied on handcrafted features and shallow machine learning and, despite its unprecedented performance gains, the problem remains open within the deep learning paradigm due to its data-hungr… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Journal ref: IEEE Access 8 (2020), 194441-194456

  5. arXiv:1908.11757  [pdf, other

    cs.CV cs.LG

    Temporal Coherence for Active Learning in Videos

    Authors: Javad Zolfaghari Bengar, Abel Gonzalez-Garcia, Gabriel Villalonga, Bogdan Raducanu, Hamed H. Aghdam, Mikhail Mozerov, Antonio M. Lopez, Joost van de Weijer

    Abstract: Autonomous driving systems require huge amounts of data to train. Manual annotation of this data is time-consuming and prohibitively expensive since it involves human resources. Therefore, active learning emerged as an alternative to ease this effort and to make data annotation more manageable. In this paper, we introduce a novel active learning approach for object detection in videos by exploitin… ▽ More

    Submitted 30 August, 2019; originally announced August 2019.

    Comments: Accepted at ICCVW 2019 (CVRSUAD-Road Scene Understanding and Autonomous Driving)