Skip to main content

Showing 1–13 of 13 results for author: Gavrila, D M

.
  1. arXiv:2406.04723  [pdf, other

    eess.SP eess.IV

    A Deep Automotive Radar Detector using the RaDelft Dataset

    Authors: Ignacio Roldan, Andras Palffy, Julian F. P. Kooij, Dariu M. Gavrila, Francesco Fioranelli, Alexander Yarovoy

    Abstract: The detection of multiple extended targets in complex environments using high-resolution automotive radar is considered. A data-driven approach is proposed where unlabeled synchronized lidar data is used as ground truth to train a neural network with only radar data as input. To this end, the novel, large-scale, real-life, and multi-sensor RaDelft dataset has been recorded using a demonstrator veh… ▽ More

    Submitted 27 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: Under review at IEEE Transaction on Radar Systems

  2. arXiv:2405.15688  [pdf, other

    cs.CV

    UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes

    Authors: Ted Lentsch, Holger Caesar, Dariu M. Gavrila

    Abstract: Unsupervised 3D object detection methods have emerged to leverage vast amounts of data efficiently without requiring manual labels for training. Recent approaches rely on dynamic objects for learning to detect objects but penalize the detections of static instances during training. Multiple rounds of (self) training are used in which detected static instances are added to the set of training targe… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Under review

    MSC Class: 68T10; 62H35; 68T05; 68U10 ACM Class: I.2.10; I.4.8; I.5.1; I.5.4

  3. See Further Than CFAR: a Data-Driven Radar Detector Trained by Lidar

    Authors: Ignacio Roldan, Andras Palffy, Julian F. P. Kooij, Dariu M. Gavrila, Francesco Fioranelli, Alexander Yarovoy

    Abstract: In this paper, we address the limitations of traditional constant false alarm rate (CFAR) target detectors in automotive radars, particularly in complex urban environments with multiple objects that appear as extended targets. We propose a data-driven radar target detector exploiting a highly efficient 2D CNN backbone inspired by the computer vision domain. Our approach is distinguished by a uniqu… ▽ More

    Submitted 27 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted for lecture presentation at IEEE RadarConf'24, Denver, USA

    Journal ref: 2024 IEEE Radar Conference (RadarConf24)

  4. arXiv:2310.10353  [pdf, other

    cs.CV cs.LG

    Multimodal Object Query Initialization for 3D Object Detection

    Authors: Mathijs R. van Geerenstein, Felicia Ruppel, Klaus Dietmayer, Dariu M. Gavrila

    Abstract: 3D object detection models that exploit both LiDAR and camera sensor features are top performers in large-scale autonomous driving benchmarks. A transformer is a popular network architecture used for this task, in which so-called object queries act as candidate objects. Initializing these object queries based on current sensor inputs is a common practice. For this, existing methods strongly rely o… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  5. arXiv:2303.00462  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Hidden Gems: 4D Radar Scene Flow Learning Using Cross-Modal Supervision

    Authors: Fangqiang Ding, Andras Palffy, Dariu M. Gavrila, Chris Xiaoxuan Lu

    Abstract: This work proposes a novel approach to 4D radar-based scene flow estimation via cross-modal learning. Our approach is motivated by the co-located sensing redundancy in modern autonomous vehicles. Such redundancy implicitly provides various forms of supervision cues to the radar scene flow estimation. Specifically, we introduce a multi-task model architecture for the identified cross-modal learning… ▽ More

    Submitted 17 March, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: 10 pages, 7 figures. Accepted by CVPR 2023. See our code at https://github.com/Toytiny/CMFlow. Supplementary materials can be found at https://drive.google.com/file/d/1Iewcqnjzecge2ePBM8k2tg-85LX5xs3N/view

  6. arXiv:2211.13309  [pdf, other

    cs.CV cs.LG

    How do Cross-View and Cross-Modal Alignment Affect Representations in Contrastive Learning?

    Authors: Thomas M. Hehn, Julian F. P. Kooij, Dariu M. Gavrila

    Abstract: Various state-of-the-art self-supervised visual representation learning approaches take advantage of data from multiple sensors by aligning the feature representations across views and/or modalities. In this work, we investigate how aligning representations affects the visual features obtained from cross-view and cross-modal contrastive learning on images and point clouds. On five real-world datas… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

  7. arXiv:2211.13133  [pdf, other

    cs.CV cs.AI

    Structural Knowledge Distillation for Object Detection

    Authors: Philip de Rijk, Lukas Schneider, Marius Cordts, Dariu M. Gavrila

    Abstract: Knowledge Distillation (KD) is a well-known training paradigm in deep neural networks where knowledge acquired by a large teacher model is transferred to a small student. KD has proven to be an effective technique to significantly improve the student's performance for various tasks including object detection. As such, KD techniques mostly rely on guidance at the intermediate feature level, which i… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

  8. arXiv:2011.09141  [pdf, other

    cs.CV cs.LG

    Semantic Scene Completion using Local Deep Implicit Functions on LiDAR Data

    Authors: Christoph B. Rist, David Emmerichs, Markus Enzweiler, Dariu M. Gavrila

    Abstract: Semantic scene completion is the task of jointly estimating 3D geometry and semantics of objects and surfaces within a given extent. This is a particularly challenging task on real-world data that is sparse and occluded. We propose a scene segmentation network based on local Deep Implicit Functions as a novel learning-based method for scene completion. Unlike previous work on scene completion, our… ▽ More

    Submitted 12 April, 2021; v1 submitted 18 November, 2020; originally announced November 2020.

    ACM Class: I.4.8

  9. arXiv:2004.12678  [pdf, other

    cs.LG stat.ML

    Diversity in Action: General-Sum Multi-Agent Continuous Inverse Optimal Control

    Authors: Christian Muench, Frans A. Oliehoek, Dariu M. Gavrila

    Abstract: Traffic scenarios are inherently interactive. Multiple decision-makers predict the actions of others and choose strategies that maximize their rewards. We view these interactions from the perspective of game theory which introduces various challenges. Humans are not entirely rational, their rewards need to be inferred from real-world data, and any prediction algorithm needs to be real-time capable… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: 16 pages, 6 figures

  10. CNN based Road User Detection using the 3D Radar Cube

    Authors: Andras Palffy, Jiaao Dong, Julian F. P. Kooij, Dariu M. Gavrila

    Abstract: This letter presents a novel radar based, single-frame, multi-class detection method for moving road users (pedestrian, cyclist, car), which utilizes low-level radar cube data. The method provides class information both on the radar target- and object-level. Radar targets are classified individually after extending the target features with a cropped block of the 3D radar cube around their position… ▽ More

    Submitted 16 July, 2020; v1 submitted 25 April, 2020; originally announced April 2020.

    Journal ref: IEEE Robotics and Automation Letters (RAL), vol. 5, nr. 2, pp. 1263-1270, 2020

  11. arXiv:1905.06113  [pdf, other

    cs.RO cs.CV cs.LG

    Human Motion Trajectory Prediction: A Survey

    Authors: Andrey Rudenko, Luigi Palmieri, Michael Herman, Kris M. Kitani, Dariu M. Gavrila, Kai O. Arras

    Abstract: With growing numbers of intelligent autonomous systems in human environments, the ability of such systems to perceive, understand and anticipate human behavior becomes increasingly important. Specifically, predicting future positions of dynamic agents and planning considering such predictions are key tasks for self-driving vehicles, service robots and advanced surveillance systems. This paper prov… ▽ More

    Submitted 17 December, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

    Comments: Submitted to the International Journal of Robotics Research (IJRR), 37 pages

  12. arXiv:1903.11532  [pdf, other

    cs.CV

    Privacy Protection in Street-View Panoramas using Depth and Multi-View Imagery

    Authors: Ries Uittenbogaard, Clint Sebastian, Julien Vijverberg, Bas Boom, Dariu M. Gavrila, Peter H. N. de With

    Abstract: The current paradigm in privacy protection in street-view images is to detect and blur sensitive information. In this paper, we propose a framework that is an alternative to blurring, which automatically removes and inpaints moving objects (e.g. pedestrians, vehicles) in street-view imagery. We propose a novel moving object segmentation algorithm exploiting consistencies in depth across multiple s… ▽ More

    Submitted 27 March, 2019; originally announced March 2019.

    Comments: Accepted to CVPR 2019. Dataset (and provided link) will be made available before the CVPR

  13. arXiv:1805.07193  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    The EuroCity Persons Dataset: A Novel Benchmark for Object Detection

    Authors: Markus Braun, Sebastian Krebs, Fabian Flohr, Dariu M. Gavrila

    Abstract: Big data has had a great share in the success of deep learning in computer vision. Recent works suggest that there is significant further potential to increase object detection performance by utilizing even bigger datasets. In this paper, we introduce the EuroCity Persons dataset, which provides a large number of highly diverse, accurate and detailed annotations of pedestrians, cyclists and other… ▽ More

    Submitted 5 June, 2018; v1 submitted 18 May, 2018; originally announced May 2018.

    Comments: Submitted to IEEE Trans. on Pattern Analysis and Machine Intelligence

    Journal ref: Published in IEEE Trans. on Pattern Analysis and Machine Intelligence, 2019