-
UADA3D: Unsupervised Adversarial Domain Adaptation for 3D Object Detection with Sparse LiDAR and Large Domain Gaps
Authors:
Maciej K Wozniak,
Mattias Hansson,
Marko Thiel,
Patric Jensfelt
Abstract:
In this study, we address a gap in existing unsupervised domain adaptation approaches on LiDAR-based 3D object detection, which have predominantly concentrated on adapting between established, high-density autonomous driving datasets. We focus on sparser point clouds, capturing scenarios from different perspectives: not just from vehicles on the road but also from mobile robots on sidewalks, which…
▽ More
In this study, we address a gap in existing unsupervised domain adaptation approaches on LiDAR-based 3D object detection, which have predominantly concentrated on adapting between established, high-density autonomous driving datasets. We focus on sparser point clouds, capturing scenarios from different perspectives: not just from vehicles on the road but also from mobile robots on sidewalks, which encounter significantly different environmental conditions and sensor configurations. We introduce Unsupervised Adversarial Domain Adaptation for 3D Object Detection (UADA3D). UADA3D does not depend on pre-trained source models or teacher-student architectures. Instead, it uses an adversarial approach to directly learn domain-invariant features. We demonstrate its efficacy in various adaptation scenarios, showing significant improvements in both self-driving car and mobile robot domains. Our code is open-source and will be available soon.
△ Less
Submitted 12 June, 2024; v1 submitted 26 March, 2024;
originally announced March 2024.
-
MCD: Diverse Large-Scale Multi-Campus Dataset for Robot Perception
Authors:
Thien-Minh Nguyen,
Shenghai Yuan,
Thien Hoang Nguyen,
Pengyu Yin,
Haozhi Cao,
Lihua Xie,
Maciej Wozniak,
Patric Jensfelt,
Marko Thiel,
Justin Ziegenbein,
Noel Blunder
Abstract:
Perception plays a crucial role in various robot applications. However, existing well-annotated datasets are biased towards autonomous driving scenarios, while unlabelled SLAM datasets are quickly over-fitted, and often lack environment and domain variations. To expand the frontier of these fields, we introduce a comprehensive dataset named MCD (Multi-Campus Dataset), featuring a wide range of sen…
▽ More
Perception plays a crucial role in various robot applications. However, existing well-annotated datasets are biased towards autonomous driving scenarios, while unlabelled SLAM datasets are quickly over-fitted, and often lack environment and domain variations. To expand the frontier of these fields, we introduce a comprehensive dataset named MCD (Multi-Campus Dataset), featuring a wide range of sensing modalities, high-accuracy ground truth, and diverse challenging environments across three Eurasian university campuses. MCD comprises both CCS (Classical Cylindrical Spinning) and NRE (Non-Repetitive Epicyclic) lidars, high-quality IMUs (Inertial Measurement Units), cameras, and UWB (Ultra-WideBand) sensors. Furthermore, in a pioneering effort, we introduce semantic annotations of 29 classes over 59k sparse NRE lidar scans across three domains, thus providing a novel challenge to existing semantic segmentation research upon this largely unexplored lidar modality. Finally, we propose, for the first time to the best of our knowledge, continuous-time ground truth based on optimization-based registration of lidar-inertial data on large survey-grade prior maps, which are also publicly released, each several times the size of existing ones. We conduct a rigorous evaluation of numerous state-of-the-art algorithms on MCD, report their performance, and highlight the challenges awaiting solutions from the research community.
△ Less
Submitted 18 March, 2024;
originally announced March 2024.
-
Towards a Robust Sensor Fusion Step for 3D Object Detection on Corrupted Data
Authors:
Maciej K. Wozniak,
Viktor Karefjards,
Marko Thiel,
Patric Jensfelt
Abstract:
Multimodal sensor fusion methods for 3D object detection have been revolutionizing the autonomous driving research field. Nevertheless, most of these methods heavily rely on dense LiDAR data and accurately calibrated sensors which is often not the case in real-world scenarios. Data from LiDAR and cameras often come misaligned due to the miscalibration, decalibration, or different frequencies of th…
▽ More
Multimodal sensor fusion methods for 3D object detection have been revolutionizing the autonomous driving research field. Nevertheless, most of these methods heavily rely on dense LiDAR data and accurately calibrated sensors which is often not the case in real-world scenarios. Data from LiDAR and cameras often come misaligned due to the miscalibration, decalibration, or different frequencies of the sensors. Additionally, some parts of the LiDAR data may be occluded and parts of the data may be missing due to hardware malfunction or weather conditions. This work presents a novel fusion step that addresses data corruptions and makes sensor fusion for 3D object detection more robust. Through extensive experiments, we demonstrate that our method performs on par with state-of-the-art approaches on normal data and outperforms them on misaligned data.
△ Less
Submitted 12 June, 2023;
originally announced June 2023.
-
Comparison of Varied 2D Map** Approaches by Using Practice-Oriented Evaluation Criteria
Authors:
Justin Ziegenbein,
Manuel Schrick,
Marko Thiel,
Johannes Hinckeldeyn,
Jochen Kreutzfeldt
Abstract:
A key aspect of the precision of a mobile robots localization is the quality and aptness of the map it is using. A variety of map** approaches are available that can be employed to create such maps with varying degrees of effort, hardware requirements and quality of the resulting maps. To create a better understanding of the applicability of these different approaches to specific applications, t…
▽ More
A key aspect of the precision of a mobile robots localization is the quality and aptness of the map it is using. A variety of map** approaches are available that can be employed to create such maps with varying degrees of effort, hardware requirements and quality of the resulting maps. To create a better understanding of the applicability of these different approaches to specific applications, this paper evaluates and compares three different map** approaches based on simultaneous localization and map**, terrestrial laser scanning as well as publicly accessible building contours.
△ Less
Submitted 19 October, 2022;
originally announced October 2022.
-
Three dimensional waveguide-interconnects for scalable integration of photonic neural networks
Authors:
Johnny Moughames,
Xavier Porte,
Michael Thiel,
Gwenn Ulliac,
Maxime Jacquot,
Laurent Larger,
Muamer Kadic,
Daniel Brunner
Abstract:
Photonic waveguides are prime candidates for integrated and parallel photonic interconnects. Such interconnects correspond to large-scale vector matrix products, which are at the heart of neural network computation. However, parallel interconnect circuits realized in two dimensions, for example by lithography, are strongly limited in size due to disadvantageous scaling. We use three dimensional (3…
▽ More
Photonic waveguides are prime candidates for integrated and parallel photonic interconnects. Such interconnects correspond to large-scale vector matrix products, which are at the heart of neural network computation. However, parallel interconnect circuits realized in two dimensions, for example by lithography, are strongly limited in size due to disadvantageous scaling. We use three dimensional (3D) printed photonic waveguides to overcome this limitation. 3D optical-couplers with fractal topology efficiently connect large numbers of input and output channels, and we show that the substrate's footprint area scales linearly. Going beyond simple couplers, we introduce functional circuits for discrete spatial filters identical to those used in deep convolutional neural networks.
△ Less
Submitted 9 January, 2020; v1 submitted 17 December, 2019;
originally announced December 2019.
-
Leveraging Implicit Expert Knowledge for Non-Circular Machine Learning in Sepsis Prediction
Authors:
Shigehiko Schamoni,
Holger A. Lindner,
Verena Schneider-Lindner,
Manfred Thiel,
Stefan Riezler
Abstract:
Sepsis is the leading cause of death in non-coronary intensive care units. Moreover, a delay of antibiotic treatment of patients with severe sepsis by only few hours is associated with increased mortality. This insight makes accurate models for early prediction of sepsis a key task in machine learning for healthcare. Previous approaches have achieved high AUROC by learning from electronic health r…
▽ More
Sepsis is the leading cause of death in non-coronary intensive care units. Moreover, a delay of antibiotic treatment of patients with severe sepsis by only few hours is associated with increased mortality. This insight makes accurate models for early prediction of sepsis a key task in machine learning for healthcare. Previous approaches have achieved high AUROC by learning from electronic health records where sepsis labels were defined automatically following established clinical criteria. We argue that the practice of incorporating the clinical criteria that are used to automatically define ground truth sepsis labels as features of severity scoring models is inherently circular and compromises the validity of the proposed approaches. We propose to create an independent ground truth for sepsis research by exploiting implicit knowledge of clinical practitioners via an electronic questionnaire which records attending physicians' daily judgements of patients' sepsis status. We show that despite its small size, our dataset allows to achieve state-of-the-art AUROC scores. An inspection of learned weights for standardized features of the linear model lets us infer potentially surprising feature contributions and allows to interpret seemingly counterintuitive findings.
△ Less
Submitted 20 September, 2019;
originally announced September 2019.