Skip to main content

Showing 1–50 of 55 results for author: Belagiannis, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.17704  [pdf, other

    cs.CV

    Consistency Regularisation for Unsupervised Domain Adaptation in Monocular Depth Estimation

    Authors: Amir El-Ghoussani, Julia Hornauer, Gustavo Carneiro, Vasileios Belagiannis

    Abstract: In monocular depth estimation, unsupervised domain adaptation has recently been explored to relax the dependence on large annotated image-based depth datasets. However, this comes at the cost of training multiple models or requiring complex training protocols. We formulate unsupervised domain adaptation for monocular depth estimation as a consistency-based semi-supervised learning problem by assum… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted to Conference on Lifelong Learning Agents (CoLLAs) 2024

  2. arXiv:2403.06020  [pdf, other

    cs.LG cs.CV

    Multi-conditioned Graph Diffusion for Neural Architecture Search

    Authors: Rohan Asthana, Joschua Conrad, Youssef Dawoud, Maurits Ortmanns, Vasileios Belagiannis

    Abstract: Neural architecture search automates the design of neural network architectures usually by exploring a large and thus complex architecture search space. To advance the architecture search, we present a graph diffusion-based NAS approach that uses discrete conditional graph diffusion processes to generate high-performing neural network architectures. We then propose a multi-conditioned classifier-f… ▽ More

    Submitted 22 March, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

    Comments: Accepted at Transactions on Machine Learning Research (TMLR)

  3. arXiv:2308.09080  [pdf, other

    cs.CV cs.RO

    Pedestrian Environment Model for Automated Driving

    Authors: Adrian Holzbock, Alexander Tsaregorodtsev, Vasileios Belagiannis

    Abstract: Besides interacting correctly with other vehicles, automated vehicles should also be able to react in a safe manner to vulnerable road users like pedestrians or cyclists. For a safe interaction between pedestrians and automated vehicles, the vehicle must be able to interpret the pedestrian's behavior. Common environment models do not contain information like body poses used to understand the pedes… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: Accepted for presentation at the 26th IEEE International Conference on Intelligent Transportation Systems (ITSC 2023), 24-28 September 2023, Bilbao, Bizkaia, Spain

  4. arXiv:2308.06072  [pdf, other

    cs.CV

    Out-of-Distribution Detection for Monocular Depth Estimation

    Authors: Julia Hornauer, Adrian Holzbock, Vasileios Belagiannis

    Abstract: In monocular depth estimation, uncertainty estimation approaches mainly target the data uncertainty introduced by image noise. In contrast to prior work, we address the uncertainty due to lack of knowledge, which is relevant for the detection of data not represented by the training distribution, the so-called out-of-distribution (OOD) data. Motivated by anomaly detection, we propose to detect OOD… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023

  5. arXiv:2308.04946  [pdf, other

    cs.CV

    SelectNAdapt: Support Set Selection for Few-Shot Domain Adaptation

    Authors: Youssef Dawoud, Gustavo Carneiro, Vasileios Belagiannis

    Abstract: Generalisation of deep neural networks becomes vulnerable when distribution shifts are encountered between train (source) and test (target) domain data. Few-shot domain adaptation mitigates this issue by adapting deep neural networks pre-trained on the source domain to the target domain using a randomly selected and annotated support set from the target domain. This paper argues that randomly sele… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV Workshop

  6. arXiv:2308.01707  [pdf, other

    cs.RO

    Joint Out-of-Distribution Detection and Uncertainty Estimation for Trajectory Prediction

    Authors: Julian Wiederer, Julian Schmidt, Ulrich Kressel, Klaus Dietmayer, Vasileios Belagiannis

    Abstract: Despite the significant research efforts on trajectory prediction for automated driving, limited work exists on assessing the prediction reliability. To address this limitation we propose an approach that covers two sources of error, namely novel situations with out-of-distribution (OOD) detection and the complexity in in-distribution (ID) situations with uncertainty estimation. We introduce two m… ▽ More

    Submitted 4 August, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: Accepted to the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2023)

  7. arXiv:2306.13323  [pdf, other

    cs.RO cs.CV

    Automated Automotive Radar Calibration With Intelligent Vehicles

    Authors: Alexander Tsaregorodtsev, Michael Buchholz, Vasileios Belagiannis

    Abstract: While automotive radar sensors are widely adopted and have been used for automatic cruise control and collision avoidance tasks, their application outside of vehicles is still limited. As they have the ability to resolve multiple targets in 3D space, radars can also be used for improving environment perception. This application, however, requires a precise calibration, which is usually a time-cons… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: 5 pages, 4 figures, accepted for presentation at the 31st European Signal Processing Conference (EUSIPCO), September 4 - September 8, 2023, Helsinki, Finland

  8. arXiv:2306.12881  [pdf, other

    cs.CV

    Data-Free Backbone Fine-Tuning for Pruned Neural Networks

    Authors: Adrian Holzbock, Achyut Hegde, Klaus Dietmayer, Vasileios Belagiannis

    Abstract: Model compression techniques reduce the computational load and memory consumption of deep neural networks. After the compression operation, e.g. parameter pruning, the model is normally fine-tuned on the original training dataset to recover from the performance drop caused by compression. However, the training data is not always available due to privacy issues or other factors. In this work, we pr… ▽ More

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: Accpeted for presentation at the 31st European Signal Processing Conference (EUSIPCO) 2023, September 4-8, 2023, Helsinki, Finland

  9. arXiv:2304.10814  [pdf, other

    cs.CV

    Automated Static Camera Calibration with Intelligent Vehicles

    Authors: Alexander Tsaregorodtsev, Adrian Holzbock, Jan Strohbeck, Michael Buchholz, Vasileios Belagiannis

    Abstract: Connected and cooperative driving requires precise calibration of the roadside infrastructure for having a reliable perception system. To solve this requirement in an automated manner, we present a robust extrinsic calibration method for automated geo-referenced camera calibration. Our method requires a calibration vehicle equipped with a combined GNSS/RTK receiver and an inertial measurement unit… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.

    Comments: 7 pages, 3 figures, accepted for presentation at the 34th IEEE Intelligent Vehicles Symposium (IV 2023), June 4 - June 7, 2023, Anchorage, Alaska, United States of America

  10. arXiv:2304.05856  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    RESET: Revisiting Trajectory Sets for Conditional Behavior Prediction

    Authors: Julian Schmidt, Pascal Huissel, Julian Wiederer, Julian Jordan, Vasileios Belagiannis, Klaus Dietmayer

    Abstract: It is desirable to predict the behavior of traffic participants conditioned on different planned trajectories of the autonomous vehicle. This allows the downstream planner to estimate the impact of its decisions. Recent approaches for conditional behavior prediction rely on a regression decoder, meaning that coordinates or polynomial coefficients are regressed. In this work we revisit set-based tr… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: Accepted to the 2023 Intelligent Vehicles Symposium (IV 2023)

  11. arXiv:2303.08052  [pdf, other

    eess.AS cs.SD

    Localizing Spatial Information in Neural Spatiospectral Filters

    Authors: Annika Briegleb, Thomas Haubner, Vasileios Belagiannis, Walter Kellermann

    Abstract: Beamforming for multichannel speech enhancement relies on the estimation of spatial characteristics of the acoustic scene. In its simplest form, the delay-and-sum beamformer (DSB) introduces a time delay to all channels to align the desired signal components for constructive superposition. Recent investigations of neural spatiospectral filtering revealed that these filters can be characterized by… ▽ More

    Submitted 3 July, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: Accepted to the 31st European Signal Processing Conference (EUSIPCO 2023), Helsinki, Finland. 5 pages, 3 figures

  12. Gesture Recognition with Keypoint and Radar Stream Fusion for Automated Vehicles

    Authors: Adrian Holzbock, Nicolai Kern, Christian Waldschmidt, Klaus Dietmayer, Vasileios Belagiannis

    Abstract: We present a joint camera and radar approach to enable autonomous vehicles to understand and react to human gestures in everyday traffic. Initially, we process the radar data with a PointNet followed by a spatio-temporal multilayer perceptron (stMLP). Independently, the human body pose is extracted from the camera frame and processed with a separate stMLP network. We propose a fusion neural networ… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: Accepted for presentation at the 3rd AVVision Workshop at ECCV 2022, October 23, 2022, Tel Aviv, Israel

    Journal ref: In Computer Vision-ECCV 2022 Workshops: Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part I (pp. 570-584). Cham: Springer Nature Switzerland

  13. arXiv:2212.02875  [pdf, other

    cs.CV

    Multi-Task Edge Prediction in Temporally-Dynamic Video Graphs

    Authors: Osman Ülger, Julian Wiederer, Mohsen Ghafoorian, Vasileios Belagiannis, Pascal Mettes

    Abstract: Graph neural networks have shown to learn effective node representations, enabling node-, link-, and graph-level inference. Conventional graph networks assume static relations between nodes, while relations between entities in a video often evolve over time, with nodes entering and exiting dynamically. In such temporally-dynamic graphs, a core problem is inferring the future state of spatio-tempor… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: BMVC2022

  14. arXiv:2211.14512  [pdf, other

    cs.CV

    Residual Pattern Learning for Pixel-wise Out-of-Distribution Detection in Semantic Segmentation

    Authors: Yuyuan Liu, Choubo Ding, Yu Tian, Guansong Pang, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

    Abstract: Semantic segmentation models classify pixels into a set of known (``in-distribution'') visual classes. When deployed in an open world, the reliability of these models depends on their ability not only to classify in-distribution pixels but also to detect out-of-distribution (OoD) pixels. Historically, the poor OoD detection performance of these models has motivated the design of methods based on m… ▽ More

    Submitted 21 August, 2023; v1 submitted 26 November, 2022; originally announced November 2022.

    Comments: The paper contains 16 pages and it is accepted by ICCV'23

  15. arXiv:2211.10244  [pdf, other

    cs.CV

    Knowing What to Label for Few Shot Microscopy Image Cell Segmentation

    Authors: Youssef Dawoud, Arij Bouazizi, Katharina Ernst, Gustavo Carneiro, Vasileios Belagiannis

    Abstract: In microscopy image cell segmentation, it is common to train a deep neural network on source data, containing different types of microscopy images, and then fine-tune it using a support set comprising a few randomly selected and annotated training target images. In this paper, we argue that the random selection of unlabelled training target images to be annotated and included in the support set ma… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: Accepted to WACV 2023

  16. Heatmap-based Out-of-Distribution Detection

    Authors: Julia Hornauer, Vasileios Belagiannis

    Abstract: Our work investigates out-of-distribution (OOD) detection as a neural network output explanation problem. We learn a heatmap representation for detecting OOD images while visualizing in- and out-of-distribution image regions at the same time. Given a trained and fixed classifier, we train a decoder neural network to produce heatmaps with zero response for in-distribution samples and high response… ▽ More

    Submitted 11 August, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: Accepted to WACV 2023

  17. arXiv:2209.01838  [pdf, other

    cs.RO

    A Benchmark for Unsupervised Anomaly Detection in Multi-Agent Trajectories

    Authors: Julian Wiederer, Julian Schmidt, Ulrich Kressel, Klaus Dietmayer, Vasileios Belagiannis

    Abstract: Human intuition allows to detect abnormal driving scenarios in situations they never experienced before. Like humans detect those abnormal situations and take countermeasures to prevent collisions, self-driving cars need anomaly detection mechanisms. However, the literature lacks a standard benchmark for the comparison of anomaly detection algorithms. We fill the gap and propose the R-U-MAAD bench… ▽ More

    Submitted 5 September, 2022; originally announced September 2022.

    Comments: 8 pages, 4 figures, 2 tables, accepted by IEEE ITSC 2022

  18. arXiv:2208.03949  [pdf, other

    cs.CV

    Extrinsic Camera Calibration with Semantic Segmentation

    Authors: Alexander Tsaregorodtsev, Johannes Müller, Jan Strohbeck, Martin Herrmann, Michael Buchholz, Vasileios Belagiannis

    Abstract: Monocular camera sensors are vital to intelligent vehicle operation and automated driving assistance and are also heavily employed in traffic control infrastructure. Calibrating the monocular camera, though, is time-consuming and often requires significant manual intervention. In this work, we present an extrinsic camera calibration approach that automatizes the parameter estimation by utilizing s… ▽ More

    Submitted 8 August, 2022; originally announced August 2022.

    Comments: 7 pages, 3 figures, accepted at the 25th International Conference on Intelligent Transportation Systems (ITSC) 2022

  19. arXiv:2208.02105  [pdf, other

    cs.CV cs.LG

    Edge-Based Self-Supervision for Semi-Supervised Few-Shot Microscopy Image Cell Segmentation

    Authors: Youssef Dawoud, Katharina Ernst, Gustavo Carneiro, Vasileios Belagiannis

    Abstract: Deep neural networks currently deliver promising results for microscopy image cell segmentation, but they require large-scale labelled databases, which is a costly and time-consuming process. In this work, we relax the labelling requirement by combining self-supervised with semi-supervised learning. We propose the prediction of edge-based maps for self-supervising the training of the unlabelled im… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: Accepted by MOVI 2022

  20. Gradient-based Uncertainty for Monocular Depth Estimation

    Authors: Julia Hornauer, Vasileios Belagiannis

    Abstract: In monocular depth estimation, disturbances in the image context, like moving objects or reflecting materials, can easily lead to erroneous predictions. For that reason, uncertainty estimates for each pixel are necessary, in particular for safety-critical applications such as automated driving. We propose a post hoc uncertainty estimation approach for an already trained and thus fixed depth estima… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: Accepted to ECCV 2022

  21. arXiv:2207.00499  [pdf, other

    cs.CV cs.LG

    MotionMixer: MLP-based 3D Human Body Pose Forecasting

    Authors: Arij Bouazizi, Adrian Holzbock, Ulrich Kressel, Klaus Dietmayer, Vasileios Belagiannis

    Abstract: In this work, we present MotionMixer, an efficient 3D human body pose forecasting model based solely on multi-layer perceptrons (MLPs). MotionMixer learns the spatial-temporal 3D body pose dependencies by sequentially mixing both modalities. Given a stacked sequence of 3D body poses, a spatial-MLP extracts fine grained spatial dependencies of the body joints. The interaction of the body joints ove… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted by IJCAI-ECAI'22 (Oral-Long presentation)

  22. A Spatio-Temporal Multilayer Perceptron for Gesture Recognition

    Authors: Adrian Holzbock, Alexander Tsaregorodtsev, Youssef Dawoud, Klaus Dietmayer, Vasileios Belagiannis

    Abstract: Gesture recognition is essential for the interaction of autonomous vehicles with humans. While the current approaches focus on combining several modalities like image features, keypoints and bone vectors, we present neural network architecture that delivers state-of-the-art results only with body skeleton input data. We propose the spatio-temporal multilayer perceptron for gesture recognition in t… ▽ More

    Submitted 18 August, 2022; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted for presentation at the 33rd IEEE Intelligent Vehicles Symposium (IV 2022), June 5 - June 9, 2022, Aachen, Germany

    Journal ref: 2022 IEEE Intelligent Vehicles Symposium (IV), June 5th - 9th, 2022, Aachen, Germany, pp. 1099-1106

  23. arXiv:2203.14523  [pdf, other

    cs.CV cs.AI

    Translation Consistent Semi-supervised Segmentation for 3D Medical Images

    Authors: Yuyuan Liu, Yu Tian, Chong Wang, Yuanhong Chen, Fengbei Liu, Vasileios Belagiannis, Gustavo Carneiro

    Abstract: 3D medical image segmentation methods have been successful, but their dependence on large amounts of voxel-level annotated data is a disadvantage that needs to be addressed given the high cost to obtain such annotation. Semi-supervised learning (SSL) solve this issue by training models with a large unlabelled and a small labelled dataset. The most successful SSL approaches are based on consistency… ▽ More

    Submitted 21 April, 2023; v1 submitted 28 March, 2022; originally announced March 2022.

  24. arXiv:2203.04206  [pdf, other

    cs.CV cs.RO

    Lightweight Monocular Depth Estimation through Guided Decoding

    Authors: Michael Rudolph, Youssef Dawoud, Ronja Güldenring, Lazaros Nalpantidis, Vasileios Belagiannis

    Abstract: We present a lightweight encoder-decoder architecture for monocular depth estimation, specifically designed for embedded platforms. Our main contribution is the Guided Upsampling Block (GUB) for building the decoder of our model. Motivated by the concept of guided image filtering, GUB relies on the image to guide the decoder on upsampling the feature representation and the depth map reconstruction… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    Comments: Accepted to ICRA 2022

  25. arXiv:2202.04461  [pdf, other

    cs.RO

    A Multi-Task Recurrent Neural Network for End-to-End Dynamic Occupancy Grid Map**

    Authors: Marcel Schreiber, Vasileios Belagiannis, Claudius Gläser, Klaus Dietmayer

    Abstract: A common approach for modeling the environment of an autonomous vehicle are dynamic occupancy grid maps, in which the surrounding is divided into cells, each containing the occupancy and velocity state of its location. Despite the advantage of modeling arbitrary shaped objects, the used algorithms rely on hand-designed inverse sensor models and semantic information is missing. Therefore, we introd… ▽ More

    Submitted 5 May, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: Accepted for presentation at the 2022 33rd IEEE Intelligent Vehicles Symposium (IV) (IV 2022), June 5-9, 2022, in Aachen, Germany

  26. arXiv:2111.12918  [pdf, other

    cs.CV

    ACPL: Anti-curriculum Pseudo-labelling for Semi-supervised Medical Image Classification

    Authors: Fengbei Liu, Yu Tian, Yuanhong Chen, Yuyuan Liu, Vasileios Belagiannis, Gustavo Carneiro

    Abstract: Effective semi-supervised learning (SSL) in medical image analysis (MIA) must address two challenges: 1) work effectively on both multi-class (e.g., lesion classification) and multi-label (e.g., multiple-disease diagnosis) problems, and 2) handle imbalanced learning (because of the high variance in disease prevalence). One strategy to explore in SSL MIA is based on the pseudo labelling strategy, b… ▽ More

    Submitted 21 March, 2022; v1 submitted 25 November, 2021; originally announced November 2021.

    Comments: CVPR 2022

  27. arXiv:2111.12903  [pdf, other

    cs.CV

    Perturbed and Strict Mean Teachers for Semi-supervised Semantic Segmentation

    Authors: Yuyuan Liu, Yu Tian, Yuanhong Chen, Fengbei Liu, Vasileios Belagiannis, Gustavo Carneiro

    Abstract: Consistency learning using input image, feature, or network perturbations has shown remarkable results in semi-supervised semantic segmentation, but this approach can be seriously affected by inaccurate predictions of unlabelled training images. There are two consequences of these inaccurate predictions: 1) the training based on the "strict" cross-entropy (CE) loss can easily overfit prediction mi… ▽ More

    Submitted 26 March, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: CVPR 2022 camera-ready

  28. arXiv:2110.11809  [pdf, other

    cs.CV

    PropMix: Hard Sample Filtering and Proportional MixUp for Learning with Noisy Labels

    Authors: Filipe R. Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

    Abstract: The most competitive noisy label learning methods rely on an unsupervised classification of clean and noisy samples, where samples classified as noisy are re-labelled and "MixMatched" with the clean samples. These methods have two issues in large noise rate problems: 1) the noisy set is more likely to contain hard samples that are in-correctly re-labelled, and 2) the number of samples produced by… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

    Comments: Paper accepted at BMVC'21: The 32nd British Machine Vision Conference

  29. arXiv:2110.07922  [pdf, other

    cs.RO cs.LG cs.MA

    Anomaly Detection in Multi-Agent Trajectories for Automated Driving

    Authors: Julian Wiederer, Arij Bouazizi, Marco Troina, Ulrich Kressel, Vasileios Belagiannis

    Abstract: Human drivers can recognise fast abnormal driving situations to avoid accidents. Similar to humans, automated vehicles are supposed to perform anomaly detection. In this work, we propose the spatio-temporal graph auto-encoder for learning normal driving behaviours. Our innovation is the ability to jointly learn multiple trajectories of a dynamic number of agents. To perform anomaly detection, we f… ▽ More

    Submitted 28 October, 2021; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: 15 pages incl. supplementary material, 8 figures, 4 tables (accepted by CoRL 2021)

  30. arXiv:2110.07578  [pdf, other

    cs.CV

    Learning Temporal 3D Human Pose Estimation with Pseudo-Labels

    Authors: Arij Bouazizi, Ulrich Kressel, Vasileios Belagiannis

    Abstract: We present a simple, yet effective, approach for self-supervised 3D human pose estimation. Unlike the prior work, we explore the temporal information next to the multi-view self-supervision. During training, we rely on triangulating 2D body pose estimates of a multiple-view camera system. A temporal convolutional neural network is trained with the generated 3D ground-truth and the geometric multi-… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: Accepted for publication at AVSS 2021. Project page:https://github.com/vru2020/TM_HPE/

  31. arXiv:2109.09734  [pdf, other

    eess.IV cs.CV cs.LG

    MetaMedSeg: Volumetric Meta-learning for Few-Shot Organ Segmentation

    Authors: Anastasia Makarevich, Azade Farshad, Vasileios Belagiannis, Nassir Navab

    Abstract: The lack of sufficient annotated image data is a common issue in medical image segmentation. For some organs and densities, the annotation may be scarce, leading to poor model training convergence, while other organs have plenty of annotated data. In this work, we present MetaMedSeg, a gradient-based meta-learning algorithm that redefines the meta-learning task for the volumetric medical data with… ▽ More

    Submitted 18 September, 2021; originally announced September 2021.

  32. arXiv:2108.07777  [pdf, other

    cs.CV

    Self-Supervised 3D Human Pose Estimation with Multiple-View Geometry

    Authors: Arij Bouazizi, Julian Wiederer, Ulrich Kressel, Vasileios Belagiannis

    Abstract: We present a self-supervised learning algorithm for 3D human pose estimation of a single person based on a multiple-view camera system and 2D body pose estimates for each view. To train our model, represented by a deep neural network, we propose a four-loss function learning algorithm, which does not require any 2D or 3D body pose ground-truth. The proposed loss functions make use of the multiple-… ▽ More

    Submitted 17 August, 2021; originally announced August 2021.

    Comments: Accepted for publication at FG 2021

  33. Visual Domain Adaptation for Monocular Depth Estimation on Resource-Constrained Hardware

    Authors: Julia Hornauer, Lazaros Nalpantidis, Vasileios Belagiannis

    Abstract: Real-world perception systems in many cases build on hardware with limited resources to adhere to cost and power limitations of their carrying system. Deploying deep neural networks on resource-constrained hardware became possible with model compression techniques, as well as efficient and hardware-aware architecture design. However, model adaptation is additionally required due to the diverse ope… ▽ More

    Submitted 5 May, 2022; v1 submitted 5 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021 Workshop on Embedded and Real-World Computer Vision in Autonomous Driving

    Journal ref: 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2021, pp. 954-962

  34. arXiv:2107.07787  [pdf, other

    cs.CV cs.RO

    Attention-based Vehicle Self-Localization with HD Feature Maps

    Authors: Nico Engel, Vasileios Belagiannis, Klaus Dietmayer

    Abstract: We present a vehicle self-localization method using point-based deep neural networks. Our approach processes measurements and point features, i.e. landmarks, from a high-definition digital map to infer the vehicle's pose. To learn the best association and incorporate local information between the point sets, we propose an attention mechanism that matches the measurements to the corresponding landm… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: Accepted for publication at 24th IEEE International Conference on Intelligent Transportation Systems (ITSC 2021)

  35. arXiv:2106.08693  [pdf, ps, other

    cs.LG cs.CV

    ParticleAugment: Sampling-Based Data Augmentation

    Authors: Alexander Tsaregorodtsev, Vasileios Belagiannis

    Abstract: We present an automated data augmentation approach for image classification. We formulate the problem as Monte Carlo sampling where our goal is to approximate the optimal augmentation policies. We propose a particle filtering scheme for the policy search where the probability of applying a set of augmentation operations forms the state of the filter. We measure the policy performance based on the… ▽ More

    Submitted 15 October, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: 8 pages

  36. arXiv:2103.11395  [pdf, other

    cs.CV cs.LG

    ScanMix: Learning from Severe Label Noise via Semantic Clustering and Semi-Supervised Learning

    Authors: Ragav Sachdeva, Filipe R Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

    Abstract: We propose a new training algorithm, ScanMix, that explores semantic clustering and semi-supervised learning (SSL) to allow superior robustness to severe label noise and competitive robustness to non-severe label noise problems, in comparison to the state of the art (SOTA) methods. ScanMix is based on the expectation maximisation framework, where the E-step estimates the latent variable to cluster… ▽ More

    Submitted 16 October, 2022; v1 submitted 21 March, 2021; originally announced March 2021.

    Comments: Paper accepted at Pattern Recognition

  37. LongReMix: Robust Learning with High Confidence Samples in a Noisy Label Environment

    Authors: Filipe R. Cordeiro, Ragav Sachdeva, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

    Abstract: Deep neural network models are robust to a limited amount of label noise, but their ability to memorise noisy labels in high noise rate problems is still an open issue. The most competitive noisy-label learning algorithms rely on a 2-stage process comprising an unsupervised learning to classify training samples as clean or noisy, followed by a semi-supervised learning that minimises the empirical… ▽ More

    Submitted 4 September, 2022; v1 submitted 6 March, 2021; originally announced March 2021.

    Comments: Published at Pattern Recognition 2022

  38. arXiv:2103.04053  [pdf, other

    cs.CV

    NVUM: Non-Volatile Unbiased Memory for Robust Medical Image Classification

    Authors: Fengbei Liu, Yuanhong Chen, Yu Tian, Yuyuan Liu, Chong Wang, Vasileios Belagiannis, Gustavo Carneiro

    Abstract: Real-world large-scale medical image analysis (MIA) datasets have three challenges: 1) they contain noisy-labelled samples that affect training convergence and generalisation, 2) they usually have an imbalanced distribution of samples per class, and 3) they normally comprise a multi-label problem, where samples can have multiple diagnoses. Current approaches are commonly trained to solve a subset… ▽ More

    Submitted 21 August, 2022; v1 submitted 6 March, 2021; originally announced March 2021.

    Comments: MICCAI 2022 Early Accept

  39. arXiv:2103.03629  [pdf, other

    cs.CV

    Self-supervised Mean Teacher for Semi-supervised Chest X-ray Classification

    Authors: Fengbei Liu, Yu Tian, Filipe R. Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

    Abstract: The training of deep learning models generally requires a large amount of annotated data for effective convergence and generalisation. However, obtaining high-quality annotations is a laboursome and expensive process due to the need of expert radiologists for the labelling task. The study of semi-supervised learning in medical image analysis is then of crucial importance given that it is much less… ▽ More

    Submitted 4 November, 2021; v1 submitted 5 March, 2021; originally announced March 2021.

    Comments: MLMI-MICCAI 2021

  40. Dynamic Occupancy Grid Map** with Recurrent Neural Networks

    Authors: Marcel Schreiber, Vasileios Belagiannis, Claudius Gläser, Klaus Dietmayer

    Abstract: Modeling and understanding the environment is an essential task for autonomous driving. In addition to the detection of objects, in complex traffic scenarios the motion of other road participants is of special interest. Therefore, we propose to use a recurrent neural network to predict a dynamic occupancy grid map, which divides the vehicle surrounding in cells, each containing the occupancy proba… ▽ More

    Submitted 5 May, 2022; v1 submitted 17 November, 2020; originally announced November 2020.

    Journal ref: 2021 IEEE International Conference on Robotics and Automation (ICRA), May 30 - June 5, 2021, Xi'an, China, pp. 6717-6724

  41. arXiv:2011.05704  [pdf, other

    cs.LG cs.CV

    EvidentialMix: Learning with Combined Open-set and Closed-set Noisy Labels

    Authors: Ragav Sachdeva, Filipe R. Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

    Abstract: The efficacy of deep learning depends on large-scale data sets that have been carefully curated with reliable data acquisition and annotation processes. However, acquiring such large-scale data sets with precise annotations is very expensive and time-consuming, and the cheap alternatives often yield data sets that have noisy labels. The field has addressed this problem by focusing on training mode… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: Paper accepted at WACV'21: Winter Conference on Applications of Computer Vision

  42. Point Transformer

    Authors: Nico Engel, Vasileios Belagiannis, Klaus Dietmayer

    Abstract: In this work, we present Point Transformer, a deep neural network that operates directly on unordered and unstructured point sets. We design Point Transformer to extract local and global features and relate both representations by introducing the local-global attention mechanism, which aims to capture spatial point relations and shape information. For that purpose, we propose SortNet, as part of t… ▽ More

    Submitted 14 October, 2021; v1 submitted 2 November, 2020; originally announced November 2020.

  43. arXiv:2007.16072  [pdf, other

    cs.CV

    Traffic Control Gesture Recognition for Autonomous Vehicles

    Authors: Julian Wiederer, Arij Bouazizi, Ulrich Kressel, Vasileios Belagiannis

    Abstract: A car driver knows how to react on the gestures of the traffic officers. Clearly, this is not the case for the autonomous vehicle, unless it has road traffic control gesture recognition functionalities. In this work, we address the limitation of the existing autonomous driving datasets to provide learning data for traffic control gesture recognition. We introduce a dataset that is based on 3D body… ▽ More

    Submitted 31 July, 2020; originally announced July 2020.

    Comments: 8 pages, 8 figures, 3 tables, accepted by IROS 2020

  44. DeepCLR: Correspondence-Less Architecture for Deep End-to-End Point Cloud Registration

    Authors: Markus Horn, Nico Engel, Vasileios Belagiannis, Michael Buchholz, Klaus Dietmayer

    Abstract: This work addresses the problem of point cloud registration using deep neural networks. We propose an approach to predict the alignment between two point clouds with overlap** data content, but displaced origins. Such point clouds originate, for example, from consecutive measurements of a LiDAR mounted on a moving platform. The main difficulty in deep registration of raw point clouds is the fusi… ▽ More

    Submitted 13 January, 2021; v1 submitted 22 July, 2020; originally announced July 2020.

    Comments: 7 pages, 5 figures, 4 tables

    Journal ref: 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)

  45. arXiv:2007.01671  [pdf, other

    cs.CV cs.LG stat.ML

    Few-Shot Microscopy Image Cell Segmentation

    Authors: Youssef Dawoud, Julia Hornauer, Gustavo Carneiro, Vasileios Belagiannis

    Abstract: Automatic cell segmentation in microscopy images works well with the support of deep neural networks trained with full supervision. Collecting and annotating images, though, is not a sustainable solution for every new microscopy database and cell type. Instead, we assume that we can access a plethora of annotated image data sets from different domains (sources) and a limited number of annotated im… ▽ More

    Submitted 29 June, 2020; originally announced July 2020.

    Comments: 16 pages, 4 figures, Accepted by ECML-PKDD 2020 conference

  46. arXiv:1911.01711  [pdf, other

    eess.SP cs.CV

    LACI: Low-effort Automatic Calibration of Infrastructure Sensors

    Authors: Johannes Müller, Martin Herrmann, Jan Strohbeck, Vasileios Belagiannis, Michael Buchholz

    Abstract: Sensor calibration usually is a time consuming yet important task. While classical approaches are sensor-specific and often need calibration targets as well as a widely overlap** field of view (FOV), within this work, a cooperative intelligent vehicle is used as callibration target. The vehicleis detected in the sensor frame and then matched with the information received from the cooperative awa… ▽ More

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: 6 pages, published at ITSC 2019

  47. Motion Estimation in Occupancy Grid Maps in Stationary Settings Using Recurrent Neural Networks

    Authors: Marcel Schreiber, Vasileios Belagiannis, Claudius Glaeser, Klaus Dietmayer

    Abstract: In this work, we tackle the problem of modeling the vehicle environment as dynamic occupancy grid map in complex urban scenarios using recurrent neural networks. Dynamic occupancy grid maps represent the scene in a bird's eye view, where each grid cell contains the occupancy probability and the two dimensional velocity. As input data, our approach relies on measurement grid maps, which contain occ… ▽ More

    Submitted 5 May, 2022; v1 submitted 25 September, 2019; originally announced September 2019.

    Journal ref: 2020 IEEE International Conference on Robotics and Automation (ICRA), May 31 - June 4, 2020, Paris, France, pp. 8587-8593

  48. arXiv:1908.00111  [pdf, other

    cs.CV cs.LG

    Few-Shot Meta-Denoising

    Authors: Leslie Casas, Attila Klimmek, Gustavo Carneiro, Nassir Navab, Vasileios Belagiannis

    Abstract: We study the problem of few-shot learning-based denoising where the training set contains just a handful of clean and noisy samples. A solution to mitigate the small training set issue is to pre-train a denoising model with small training sets containing pairs of clean and synthesized noisy signals, produced from empirical noise priors, and fine-tune on the available small training set. While such… ▽ More

    Submitted 25 November, 2019; v1 submitted 31 July, 2019; originally announced August 2019.

  49. arXiv:1904.09007  [pdf, other

    cs.RO cs.CV cs.LG stat.ML

    DeepLocalization: Landmark-based Self-Localization with Deep Neural Networks

    Authors: Nico Engel, Stefan Hoermann, Markus Horn, Vasileios Belagiannis, Klaus Dietmayer

    Abstract: We address the problem of vehicle self-localization from multi-modal sensor information and a reference map. The map is generated off-line by extracting landmarks from the vehicle's field of view, while the measurements are collected similarly on the fly. Our goal is to determine the autonomous vehicle's pose from the landmark measurements and map landmarks. To learn this map**, we propose DeepL… ▽ More

    Submitted 19 July, 2019; v1 submitted 18 April, 2019; originally announced April 2019.

    Comments: Accepted for publication by the IEEE Intelligent Transportation Systems Conference (ITSC 2019), Auckland, New Zealand

  50. arXiv:1901.02000  [pdf, other

    cs.CV

    Forecasting People Trajectories and Head Poses by Jointly Reasoning on Tracklets and Vislets

    Authors: Irtiza Hasan, Francesco Setti, Theodore Tsesmelis, Vasileios Belagiannis, Sikandar Amin, Alessio Del Bue, Marco Cristani, Fabio Galasso

    Abstract: In this work, we explore the correlation between people trajectories and their head orientations. We argue that people trajectory and head pose forecasting can be modelled as a joint problem. Recent approaches on trajectory forecasting leverage short-term trajectories (aka tracklets) of pedestrians to predict their future paths. In addition, sociological cues, such as expected destination or pedes… ▽ More

    Submitted 15 October, 2019; v1 submitted 7 January, 2019; originally announced January 2019.

    Comments: Accepted at IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2019. arXiv admin note: text overlap with arXiv:1805.00652