Skip to main content

Showing 1–50 of 79 results for author: Yogamani, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.18852  [pdf, other

    cs.CV cs.AI cs.RO

    LetsMap: Unsupervised Representation Learning for Semantic BEV Map**

    Authors: Nikhil Gosala, Kürsat Petek, B Ravi Kiran, Senthil Yogamani, Paulo Drews-Jr, Wolfram Burgard, Abhinav Valada

    Abstract: Semantic Bird's Eye View (BEV) maps offer a rich representation with strong occlusion reasoning for various decision making tasks in autonomous driving. However, most BEV map** approaches employ a fully supervised learning paradigm that relies on large amounts of human-annotated BEV ground truth data. In this work, we address this limitation by proposing the first unsupervised representation lea… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 23 pages, 5 figures

  2. arXiv:2404.13443  [pdf, other

    cs.CV cs.RO

    FisheyeDetNet: 360° Surround view Fisheye Camera based Object Detection System for Autonomous Driving

    Authors: Ganesh Sistu, Senthil Yogamani

    Abstract: Object detection is a mature problem in autonomous driving with pedestrian detection being one of the first deployed algorithms. It has been comprehensively studied in the literature. However, object detection is relatively less explored for fisheye cameras used for surround-view near field sensing. The standard bounding box representation fails in fisheye cameras due to heavy radial distortion, p… ▽ More

    Submitted 27 April, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: text overlap with arXiv:2206.05542 by other authors

  3. arXiv:2404.06352  [pdf, other

    cs.CV cs.RO

    DaF-BEVSeg: Distortion-aware Fisheye Camera based Bird's Eye View Segmentation with Occlusion Reasoning

    Authors: Senthil Yogamani, David Unger, Venkatraman Narayanan, Varun Ravi Kumar

    Abstract: Semantic segmentation is an effective way to perform scene understanding. Recently, segmentation in 3D Bird's Eye View (BEV) space has become popular as its directly used by drive policy. However, there is limited work on BEV segmentation for surround-view fisheye cameras, commonly used in commercial vehicles. As this task has no real-world public dataset and existing synthetic datasets do not han… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  4. arXiv:2403.16338  [pdf, other

    cs.CV cs.AI

    Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks

    Authors: Madhumitha Sakthi, Louis Kerofsky, Varun Ravi Kumar, Senthil Yogamani

    Abstract: Autonomous driving systems require extensive data collection schemes to cover the diverse scenarios needed for building a robust and safe system. The data volumes are in the order of Exabytes and have to be stored for a long period of time (i.e., more than 10 years of the vehicle's life cycle). Lossless compression doesn't provide sufficient compression ratios, hence, lossy video compression has b… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  5. arXiv:2403.11761  [pdf, other

    cs.RO cs.CV

    BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation

    Authors: Jonas Schramm, Niclas Vödisch, Kürsat Petek, B Ravi Kiran, Senthil Yogamani, Wolfram Burgard, Abhinav Valada

    Abstract: Semantic scene segmentation from a bird's-eye-view (BEV) perspective plays a crucial role in facilitating planning and decision-making for mobile robots. Although recent vision-only methods have demonstrated notable advancements in performance, they often struggle under adverse illumination conditions such as rain or nighttime. While active sensors offer a solution to this challenge, the prohibiti… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  6. arXiv:2402.06826  [pdf, other

    cs.CV cs.RO

    Neural Rendering based Urban Scene Reconstruction for Autonomous Driving

    Authors: Shihao Shen, Louis Kerofsky, Varun Ravi Kumar, Senthil Yogamani

    Abstract: Dense 3D reconstruction has many applications in automated driving including automated annotation validation, multimodal data augmentation, providing ground truth annotations for systems lacking LiDAR, as well as enhancing auto-labeling accuracy. LiDAR provides highly accurate but sparse depth, whereas camera images enable estimation of dense depth but noisy particularly at long ranges. In this pa… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: Accepted for publication in Electronic Imaging, Autonomous Vehicles and Machines 2024. Qualitative results are shared in https://youtu.be/EK47fYJiY3M

  7. arXiv:2309.09080  [pdf, other

    cs.RO cs.CV

    Multi-camera Bird's Eye View Perception for Autonomous Driving

    Authors: David Unger, Nikhil Gosala, Varun Ravi Kumar, Shubhankar Borse, Abhinav Valada, Senthil Yogamani

    Abstract: Most automated driving systems comprise a diverse sensor set, including several cameras, Radars, and LiDARs, ensuring a complete 360°coverage in near and far regions. Unlike Radar and LiDAR, which measure directly in 3D, cameras capture a 2D perspective projection with inherent depth ambiguity. However, it is essential to produce perception outputs in 3D to enable the spatial reasoning of other ag… ▽ More

    Submitted 19 September, 2023; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: Taylor & Francis (CRC Press) book chapter. Book title: Computer Vision: Challenges, Trends, and Opportunities

  8. arXiv:2307.08850  [pdf, other

    cs.CV cs.RO

    LiDAR-BEVMTN: Real-Time LiDAR Bird's-Eye View Multi-Task Perception Network for Autonomous Driving

    Authors: Sambit Mohapatra, Senthil Yogamani, Varun Ravi Kumar, Stefan Milz, Heinrich Gotzig, Patrick Mäder

    Abstract: LiDAR is crucial for robust 3D scene perception in autonomous driving. LiDAR perception has the largest body of literature after camera perception. However, multi-task learning across tasks like detection, segmentation, and motion estimation using LiDAR remains relatively unexplored, especially on automotive-grade embedded platforms. We present a real-time multi-task convolutional neural network f… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

  9. arXiv:2306.03810  [pdf, other

    cs.CV cs.RO

    X-Align++: cross-modal cross-view alignment for Bird's-eye-view segmentation

    Authors: Shubhankar Borse, Senthil Yogamani, Marvin Klingner, Varun Ravi, Hong Cai, Abdulaziz Almuzairee, Fatih Porikli

    Abstract: Bird's-eye-view (BEV) grid is a typical representation of the perception of road components, e.g., drivable area, in autonomous driving. Most existing approaches rely on cameras only to perform segmentation in BEV space, which is fundamentally constrained by the absence of reliable depth information. The latest works leverage both camera and LiDAR modalities but suboptimally fuse their features us… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: Accepted for publication at Springer Machine Vision and Applications Journal. The Version of Record of this article is published in Machine Vision and Applications Journal, and is available online at https://doi.org/10.1007/s00138-023-01400-7. arXiv admin note: substantial text overlap with arXiv:2210.06778

  10. arXiv:2303.02203  [pdf, other

    cs.CV cs.RO

    X$^3$KD: Knowledge Distillation Across Modalities, Tasks and Stages for Multi-Camera 3D Object Detection

    Authors: Marvin Klingner, Shubhankar Borse, Varun Ravi Kumar, Behnaz Rezaei, Venkatraman Narayanan, Senthil Yogamani, Fatih Porikli

    Abstract: Recent advances in 3D object detection (3DOD) have obtained remarkably strong results for LiDAR-based models. In contrast, surround-view 3DOD models based on multiple camera images underperform due to the necessary view transformation of features from perspective view (PV) to a 3D world representation which is ambiguous due to missing depth information. This paper introduces X$^3$KD, a comprehensi… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR 2023

  11. arXiv:2301.04422  [pdf, other

    cs.CV cs.RO

    Optical Flow for Autonomous Driving: Applications, Challenges and Improvements

    Authors: Shihao Shen, Louis Kerofsky, Senthil Yogamani

    Abstract: Optical flow estimation is a well-studied topic for automated driving applications. Many outstanding optical flow estimation methods have been proposed, but they become erroneous when tested in challenging scenarios that are commonly encountered. Despite the increasing use of fisheye cameras for near-field sensing in automated driving, there is very limited literature on optical flow estimation wi… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: Accepted for publication in Electronic Imaging, Autonomous Vehicles and Machines 2023

  12. arXiv:2210.06778  [pdf, other

    cs.CV

    X-Align: Cross-Modal Cross-View Alignment for Bird's-Eye-View Segmentation

    Authors: Shubhankar Borse, Marvin Klingner, Varun Ravi Kumar, Hong Cai, Abdulaziz Almuzairee, Senthil Yogamani, Fatih Porikli

    Abstract: Bird's-eye-view (BEV) grid is a common representation for the perception of road components, e.g., drivable area, in autonomous driving. Most existing approaches rely on cameras only to perform segmentation in BEV space, which is fundamentally constrained by the absence of reliable depth information. Latest works leverage both camera and LiDAR modalities, but sub-optimally fuse their features usin… ▽ More

    Submitted 31 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted to WACV 2023

  13. arXiv:2206.12912  [pdf, other

    cs.CV

    Woodscape Fisheye Object Detection for Autonomous Driving -- CVPR 2022 OmniCV Workshop Challenge

    Authors: Saravanabalagi Ramachandran, Ganesh Sistu, Varun Ravi Kumar, John McDonald, Senthil Yogamani

    Abstract: Object detection is a comprehensively studied problem in autonomous driving. However, it has been relatively less explored in the case of fisheye cameras. The strong radial distortion breaks the translation invariance inductive bias of Convolutional Neural Networks. Thus, we present the WoodScape fisheye object detection challenge for autonomous driving which was held as part of the CVPR 2022 Work… ▽ More

    Submitted 26 June, 2022; originally announced June 2022.

    Comments: Workshop on Omnidirectional Computer Vision (OmniCV) at Conference on Computer Vision and Pattern Recognition (CVPR) 2021

  14. arXiv:2206.12738  [pdf, other

    cs.CV cs.LG

    Self-Supervised 3D Monocular Object Detection by Recycling Bounding Boxes

    Authors: Sugirtha T, Sridevi M, Khailash Santhakumar, Hao Liu, B Ravi Kiran, Thomas Gauthier, Senthil Yogamani

    Abstract: Modern object detection architectures are moving towards employing self-supervised learning (SSL) to improve performance detection with related pretext tasks. Pretext tasks for monocular 3D object detection have not yet been explored yet in literature. The paper studies the application of established self-supervised bounding box recycling by labeling random windows as the pretext task. The classif… ▽ More

    Submitted 25 June, 2022; originally announced June 2022.

    Comments: Published at ICCVW-SSLAD 2021. arXiv admin note: substantial text overlap with arXiv:2104.10786

  15. arXiv:2206.02876  [pdf, other

    cs.CV cs.RO

    SpikiLi: A Spiking Simulation of LiDAR based Real-time Object Detection for Autonomous Driving

    Authors: Sambit Mohapatra, Thomas Mesquida, Mona Hodaei, Senthil Yogamani, Heinrich Gotzig, Patrick Mader

    Abstract: Spiking Neural Networks are a recent and new neural network design approach that promises tremendous improvements in power efficiency, computation efficiency, and processing latency. They do so by using asynchronous spike-based data flow, event-based signal generation, processing, and modifying the neuron model to resemble biological neurons closely. While some initial works have shown significant… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: Accepted at Workshop on Event Sensing and Neuromorphic Engineering - 8th International Conference on Event-based Control, Communication, and Signal Processing

  16. arXiv:2205.15667  [pdf, other

    cs.CV

    ViT-BEVSeg: A Hierarchical Transformer Network for Monocular Birds-Eye-View Segmentation

    Authors: Pramit Dutta, Ganesh Sistu, Senthil Yogamani, Edgar Galván, John McDonald

    Abstract: Generating a detailed near-field perceptual model of the environment is an important and challenging problem in both self-driving vehicles and autonomous mobile robotics. A Bird Eye View (BEV) map, providing a panoptic representation, is a commonly used approach that provides a simplified 2D representation of the vehicle surroundings with accurate semantic level segmentation for many downstream ta… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: Accepted for 2022 IEEE World Congress on Computational Intelligence (Track: IJCNN)

  17. arXiv:2205.13281  [pdf, other

    cs.CV

    Surround-view Fisheye Camera Perception for Automated Driving: Overview, Survey and Challenges

    Authors: Varun Ravi Kumar, Ciaran Eising, Christian Witt, Senthil Yogamani

    Abstract: Surround-view fisheye cameras are commonly used for near-field sensing in automated driving. Four fisheye cameras on four sides of the vehicle are sufficient to cover 360° around the vehicle capturing the entire near-field region. Some primary use cases are automated parking, traffic jam assist, and urban driving. There are limited datasets and very little work on near-field perception tasks as th… ▽ More

    Submitted 5 January, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: Accepted for publication at IEEE Transactions on Intelligent Transportation Systems

  18. arXiv:2205.02105  [pdf, ps, other

    cs.NE cs.CV

    Neuroevolutionary Multi-objective approaches to Trajectory Prediction in Autonomous Vehicles

    Authors: Fergal Stapleton, Edgar Galván, Ganesh Sistu, Senthil Yogamani

    Abstract: The incentive for using Evolutionary Algorithms (EAs) for the automated optimization and training of deep neural networks (DNNs), a process referred to as neuroevolution, has gained momentum in recent years. The configuration and training of these networks can be posed as optimization problems. Indeed, most of the recent works on neuroevolution have focused their attention on single-objective opti… ▽ More

    Submitted 6 May, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: Accepted in Genetic and Evolutionary Computation Conference Companion (GECCO '22 Companion), July 9--13, 2022, Boston, MA, USA, 4 pages, 1 figure, 6 tables

  19. UnShadowNet: Illumination Critic Guided Contrastive Learning For Shadow Removal

    Authors: Subhrajyoti Dasgupta, Arindam Das, Senthil Yogamani, Sudip Das, Ciaran Eising, Andrei Bursuc, Ujjwal Bhattacharya

    Abstract: Shadows are frequently encountered natural phenomena that significantly hinder the performance of computer vision perception systems in practical settings, e.g., autonomous driving. A solution to this would be to eliminate shadow regions from the images before the processing of the perception system. Yet, training such a solution requires pairs of aligned shadowed and non-shadowed images which are… ▽ More

    Submitted 24 August, 2023; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: Accepted for publication at IEEE Access, vol. 11, pp. 87760-87774, 2023

  20. SynWoodScape: Synthetic Surround-view Fisheye Camera Dataset for Autonomous Driving

    Authors: Ahmed Rida Sekkat, Yohan Dupuis, Varun Ravi Kumar, Hazem Rashed, Senthil Yogamani, Pascal Vasseur, Paul Honeine

    Abstract: Surround-view cameras are a primary sensor for automated driving, used for near-field perception. It is one of the most commonly used sensors in commercial vehicles primarily used for parking visualization and automated parking. Four fisheye cameras with a 190° field of view cover the 360° around the vehicle. Due to its high radial distortion, the standard algorithms do not extend easily. Previous… ▽ More

    Submitted 2 January, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: IEEE Robotics and Automation Letters (RA-L) and IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022). An initial sample of the dataset is released in https://drive.google.com/drive/folders/1N5rrySiw1uh9kLeBuOblMbXJ09YsqO7I

    Journal ref: IEEE Robotics and Automation Letters ( Volume: 7, Issue: 3, July 2022)

  21. arXiv:2203.01177  [pdf, other

    cs.CV

    Detecting Adversarial Perturbations in Multi-Task Perception

    Authors: Marvin Klingner, Varun Ravi Kumar, Senthil Yogamani, Andreas Bär, Tim Fingscheidt

    Abstract: While deep neural networks (DNNs) achieve impressive performance on environment perception tasks, their sensitivity to adversarial perturbations limits their use in practical applications. In this paper, we (i) propose a novel adversarial perturbation detection scheme based on multi-task perception of complex vision tasks (i.e., depth estimation and semantic segmentation). Specifically, adversaria… ▽ More

    Submitted 11 September, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

    Comments: Accepted at IROS 2022

  22. arXiv:2111.04875  [pdf, other

    cs.CV cs.RO

    LiMoSeg: Real-time Bird's Eye View based LiDAR Motion Segmentation

    Authors: Sambit Mohapatra, Mona Hodaei, Senthil Yogamani, Stefan Milz, Heinrich Gotzig, Martin Simon, Hazem Rashed, Patrick Maeder

    Abstract: Moving object detection and segmentation is an essential task in the Autonomous Driving pipeline. Detecting and isolating static and moving components of a vehicle's surroundings are particularly crucial in path planning and localization tasks. This paper proposes a novel real-time architecture for motion segmentation of Light Detection and Ranging (LiDAR) data. We use three successive scans of Li… ▽ More

    Submitted 22 January, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

    Comments: Accepted for Presentation at International Conference on Computer Vision Theory and Applications (VISAPP 2022)

  23. arXiv:2108.07736  [pdf, other

    cs.CV

    A Hybrid Sparse-Dense Monocular SLAM System for Autonomous Driving

    Authors: Louis Gallagher, Varun Ravi Kumar, Senthil Yogamani, John B. McDonald

    Abstract: In this paper, we present a system for incrementally reconstructing a dense 3D model of the geometry of an outdoor environment using a single monocular camera attached to a moving vehicle. Dense models provide a rich representation of the environment facilitating higher-level scene understanding, perception, and planning. Our system employs dense depth prediction with a hybrid map** architecture… ▽ More

    Submitted 17 August, 2021; originally announced August 2021.

    Comments: 8 pages, 5 figures. To be published in the proceedings of the 10th European Conference on Mobile Robotics 2021

    ACM Class: I.2.10

  24. arXiv:2107.08246  [pdf, other

    cs.CV cs.RO

    Woodscape Fisheye Semantic Segmentation for Autonomous Driving -- CVPR 2021 OmniCV Workshop Challenge

    Authors: Saravanabalagi Ramachandran, Ganesh Sistu, John McDonald, Senthil Yogamani

    Abstract: We present the WoodScape fisheye semantic segmentation challenge for autonomous driving which was held as part of the CVPR 2021 Workshop on Omnidirectional Computer Vision (OmniCV). This challenge is one of the first opportunities for the research community to evaluate the semantic segmentation techniques targeted for fisheye camera perception. Due to strong radial distortion standard models don't… ▽ More

    Submitted 17 July, 2021; originally announced July 2021.

    Comments: Workshop on Omnidirectional Computer Vision (OmniCV) at Conference on Computer Vision and Pattern Recognition (CVPR) 2021. Presentation video is available at https://youtu.be/xa7Fl2mD4CA?t=12253

  25. arXiv:2107.07449  [pdf, other

    cs.CV

    Adversarial Attacks on Multi-task Visual Perception for Autonomous Driving

    Authors: Ibrahim Sobh, Ahmed Hamed, Varun Ravi Kumar, Senthil Yogamani

    Abstract: Deep neural networks (DNNs) have accomplished impressive success in various applications, including autonomous driving perception tasks, in recent years. On the other hand, current deep neural networks are easily fooled by adversarial attacks. This vulnerability raises significant concerns, particularly in safety-critical applications. As a result, research into attacking and defending DNNs has ga… ▽ More

    Submitted 7 November, 2021; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: Accepted for publication at Journal of Imaging Science and Technology

  26. arXiv:2107.04937  [pdf, other

    cs.CV cs.RO

    BEV-MODNet: Monocular Camera based Bird's Eye View Moving Object Detection for Autonomous Driving

    Authors: Hazem Rashed, Mariam Essam, Maha Mohamed, Ahmad El Sallab, Senthil Yogamani

    Abstract: Detection of moving objects is a very important task in autonomous driving systems. After the perception phase, motion planning is typically performed in Bird's Eye View (BEV) space. This would require projection of objects detected on the image plane to top view BEV plane. Such a projection is prone to errors due to lack of depth information and noisy map** in far away areas. CNNs can leverage… ▽ More

    Submitted 10 July, 2021; originally announced July 2021.

    Comments: Accepted for Oral Presentation at IEEE Intelligent Transportation Systems Conference (ITSC) 2021

  27. arXiv:2105.12763  [pdf, other

    cs.CV

    An Online Learning System for Wireless Charging Alignment using Surround-view Fisheye Cameras

    Authors: Ashok Dahal, Varun Ravi Kumar, Senthil Yogamani, Ciaran Eising

    Abstract: Electric Vehicles are increasingly common, with inductive chargepads being considered a convenient and efficient means of charging electric vehicles. However, drivers are typically poor at aligning the vehicle to the necessary accuracy for efficient inductive charging, making the automated alignment of the two charging plates desirable. In parallel to the electrification of the vehicular fleet, au… ▽ More

    Submitted 21 December, 2022; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: Accepted for publication at IEEE Transactions on Intelligent Transportation Systems. Chargepad Dataset is shared at https://drive.google.com/drive/folders/1KeLFIqOnhU2CGsD0vbiN9UqKmBSyHERd

  28. arXiv:2105.12713  [pdf, other

    cs.CV

    Spatio-Contextual Deep Network Based Multimodal Pedestrian Detection For Autonomous Driving

    Authors: Kinjal Dasgupta, Arindam Das, Sudip Das, Ujjwal Bhattacharya, Senthil Yogamani

    Abstract: Pedestrian Detection is the most critical module of an Autonomous Driving system. Although a camera is commonly used for this purpose, its quality degrades severely in low-light night time driving scenarios. On the other hand, the quality of a thermal camera image remains unaffected in similar conditions. This paper proposes an end-to-end multimodal fusion model for pedestrian detection using RGB… ▽ More

    Submitted 24 January, 2022; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: To be published at IEEE Transactions on Intelligent Transportation Systems

  29. arXiv:2105.07930  [pdf, other

    cs.CV

    Ensemble-based Semi-supervised Learning to Improve Noisy Soiling Annotations in Autonomous Driving

    Authors: Michal Uricar, Ganesh Sistu, Lucie Yahiaoui, Senthil Yogamani

    Abstract: Manual annotation of soiling on surround view cameras is a very challenging and expensive task. The unclear boundary for various soiling categories like water drops or mud particles usually results in a large variance in the annotation quality. As a result, the models trained on such poorly annotated data are far from being optimal. In this paper, we focus on handling such noisy annotations via ps… ▽ More

    Submitted 11 July, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: Accepted for Oral Presentation at IEEE Intelligent Transportation Systems Conference (ITSC) 2021

  30. arXiv:2104.14042  [pdf, other

    cs.CV cs.RO

    Weather and Light Level Classification for Autonomous Driving: Dataset, Baseline and Active Learning

    Authors: Mahesh M Dhananjaya, Varun Ravi Kumar, Senthil Yogamani

    Abstract: Autonomous driving is rapidly advancing, and Level 2 functions are becoming a standard feature. One of the foremost outstanding hurdles is to obtain robust visual perception in harsh weather and low light conditions where accuracy degradation is severe. It is critical to have a weather classification model to decrease visual perception confidence during these scenarios. Thus, we have built a new d… ▽ More

    Submitted 29 November, 2021; v1 submitted 28 April, 2021; originally announced April 2021.

    Comments: Accepted for Oral Presentation at IEEE Intelligent Transportation Systems Conference (ITSC) 2021. Dataset is released in https://drive.google.com/drive/folders/1t3hwbPCfbokUaaWROr6PBA4WTW0GuQJi

  31. Vision-based Driver Assistance Systems: Survey, Taxonomy and Advances

    Authors: Jonathan Horgan, Ciarán Hughes, John McDonald, Senthil Yogamani

    Abstract: Vision-based driver assistance systems is one of the rapidly growing research areas of ITS, due to various factors such as the increased level of safety requirements in automotive, computational power in embedded systems, and desire to get closer to autonomous driving. It is a cross disciplinary area encompassing specialised fields like computer vision, machine learning, robotic navigation, embedd… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

    Journal ref: 2015 IEEE 18th International Conference on Intelligent Transportation Systems

  32. Computer vision in automated parking systems: Design, implementation and challenges

    Authors: Markus Heimberger, Jonathan Horgan, Ciaran Hughes, John McDonald, Senthil Yogamani

    Abstract: Automated driving is an active area of research in both industry and academia. Automated Parking, which is automated driving in a restricted scenario of parking with low speed manoeuvring, is a key enabling product for fully autonomous driving systems. It is also an important milestone from the perspective of a higher end system built from the previous generation driver assistance systems comprisi… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

    Journal ref: Image and Vision Computing, Volume 68, December 2017, Pages 88-101

  33. arXiv:2104.10985  [pdf, other

    cs.CV

    VM-MODNet: Vehicle Motion aware Moving Object Detection for Autonomous Driving

    Authors: Hazem Rashed, Ahmad El Sallab, Senthil Yogamani

    Abstract: Moving object Detection (MOD) is a critical task in autonomous driving as moving agents around the ego-vehicle need to be accurately detected for safe trajectory planning. It also enables appearance agnostic detection of objects based on motion cues. There are geometric challenges like motion-parallax ambiguity which makes it a difficult problem. In this work, we aim to leverage the vehicle motion… ▽ More

    Submitted 10 July, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

    Comments: Accepted for Oral Presentation at IEEE Intelligent Transportation Systems Conference (ITSC) 2021

  34. arXiv:2104.10786  [pdf, other

    cs.CV cs.RO

    Exploring 2D Data Augmentation for 3D Monocular Object Detection

    Authors: Sugirtha T, Sridevi M, Khailash Santhakumar, B Ravi Kiran, Thomas Gauthier, Senthil Yogamani

    Abstract: Data augmentation is a key component of CNN based image recognition tasks like object detection. However, it is relatively less explored for 3D object detection. Many standard 2D object detection data augmentation techniques do not extend to 3D box. Extension of these data augmentations for 3D object detection requires adaptation of the 3D geometry of the input scene and synthesis of new viewpoint… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

  35. arXiv:2104.10780  [pdf, other

    cs.CV

    BEVDetNet: Bird's Eye View LiDAR Point Cloud based Real-time 3D Object Detection for Autonomous Driving

    Authors: Sambit Mohapatra, Senthil Yogamani, Heinrich Gotzig, Stefan Milz, Patrick Mader

    Abstract: 3D object detection based on LiDAR point clouds is a crucial module in autonomous driving particularly for long range sensing. Most of the research is focused on achieving higher accuracy and these models are not optimized for deployment on embedded systems from the perspective of latency and power efficiency. For high speed driving scenarios, latency is a crucial parameter as it provides more tim… ▽ More

    Submitted 10 July, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: Accepted for Oral Presentation at IEEE Intelligent Transportation Systems Conference (ITSC) 2021

  36. arXiv:2104.04420  [pdf, other

    cs.CV cs.RO

    SVDistNet: Self-Supervised Near-Field Distance Estimation on Surround View Fisheye Cameras

    Authors: Varun Ravi Kumar, Marvin Klingner, Senthil Yogamani, Markus Bach, Stefan Milz, Tim Fingscheidt, Patrick Mäder

    Abstract: A 360° perception of scene geometry is essential for automated driving, notably for parking and urban driving scenarios. Typically, it is achieved using surround-view fisheye cameras, focusing on the near-field area around the vehicle. The majority of current depth estimation approaches focus on employing just a single camera, which cannot be straightforwardly generalized to multiple cameras. The… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: To be published at IEEE Transactions on Intelligent Transportation Systems

  37. arXiv:2103.17001  [pdf, other

    cs.CV cs.RO

    Near-field Perception for Low-Speed Vehicle Automation using Surround-view Fisheye Cameras

    Authors: Ciaran Eising, Jonathan Horgan, Senthil Yogamani

    Abstract: Cameras are the primary sensor in automated driving systems. They provide high information density and are optimal for detecting road infrastructure cues laid out for human vision. Surround-view camera systems typically comprise of four fisheye cameras with 190°+ field of view covering the entire 360° around the vehicle focused on near-field sensing. They are the principal sensors for low-speed, h… ▽ More

    Submitted 6 June, 2023; v1 submitted 31 March, 2021; originally announced March 2021.

    Comments: Accepted for publication at IEEE Transactions on Intelligent Transportation Systems

  38. arXiv:2103.00191  [pdf, other

    cs.CV cs.RO

    FisheyeSuperPoint: Keypoint Detection and Description Network for Fisheye Images

    Authors: Anna Konrad, Ciarán Eising, Ganesh Sistu, John McDonald, Rudi Villing, Senthil Yogamani

    Abstract: Keypoint detection and description is a commonly used building block in computer vision systems particularly for robotics and autonomous driving. However, the majority of techniques to date have focused on standard cameras with little consideration given to fisheye cameras which are commonly used in urban driving and automated parking. In this paper, we propose a novel training and evaluation pipe… ▽ More

    Submitted 29 November, 2021; v1 submitted 27 February, 2021; originally announced March 2021.

    Comments: Accepted for Presentation at International Conference on Computer Vision Theory and Applications (VISAPP 2022)

  39. arXiv:2102.07448  [pdf, other

    cs.CV cs.RO

    OmniDet: Surround View Cameras based Multi-task Visual Perception Network for Autonomous Driving

    Authors: Varun Ravi Kumar, Senthil Yogamani, Hazem Rashed, Ganesh Sistu, Christian Witt, Isabelle Leang, Stefan Milz, Patrick Mäder

    Abstract: Surround View fisheye cameras are commonly deployed in automated driving for 360° near-field sensing around the vehicle. This work presents a multi-task visual perception network on unrectified fisheye images to enable the vehicle to sense its surrounding environment. It consists of six primary tasks necessary for an autonomous driving system: depth estimation, visual odometry, semantic segmentati… ▽ More

    Submitted 6 June, 2023; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: Best Robot Vision paper award finalist (top 4). Camera ready version accepted for RA-L and ICRA 2021 publication

  40. arXiv:2012.02124  [pdf, other

    cs.CV cs.RO

    Generalized Object Detection on Fisheye Cameras for Autonomous Driving: Dataset, Representations and Baseline

    Authors: Hazem Rashed, Eslam Mohamed, Ganesh Sistu, Varun Ravi Kumar, Ciaran Eising, Ahmad El-Sallab, Senthil Yogamani

    Abstract: Object detection is a comprehensively studied problem in autonomous driving. However, it has been relatively less explored in the case of fisheye cameras. The standard bounding box fails in fisheye cameras due to the strong radial distortion, particularly in the image's periphery. We explore better representations like oriented bounding box, ellipse, and generic polygon for object detection in fis… ▽ More

    Submitted 21 December, 2022; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: Camera ready version. Accepted for presentation at Winter Conference on Applications of Computer Vision 2021. Dataset is shared at https://drive.google.com/drive/folders/1bobmY2wlIBozeU5ZgPfYPqVAnpPw4QrM

  41. arXiv:2010.11681  [pdf, other

    cs.CV cs.RO

    Learning Panoptic Segmentation from Instance Contours

    Authors: Sumanth Chennupati, Venkatraman Narayanan, Ganesh Sistu, Senthil Yogamani, Samir A Rawashdeh

    Abstract: Panoptic Segmentation aims to provide an understanding of background (stuff) and instances of objects (things) at a pixel level. It combines the separate tasks of semantic segmentation (pixel level classification) and instance segmentation to build a single unified scene understanding task. Typically, panoptic segmentation is derived by combining semantic and instance segmentation tasks that are l… ▽ More

    Submitted 5 April, 2021; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: Accepted at ICRA 2021. Overview Video: https://youtu.be/wBtcxRhG3e0

  42. arXiv:2008.07008  [pdf, other

    cs.CV cs.RO

    Monocular Instance Motion Segmentation for Autonomous Driving: KITTI InstanceMotSeg Dataset and Multi-task Baseline

    Authors: Eslam Mohamed, Mahmoud Ewaisha, Mennatullah Siam, Hazem Rashed, Senthil Yogamani, Waleed Hamdy, Muhammad Helmi, Ahmad El-Sallab

    Abstract: Moving object segmentation is a crucial task for autonomous vehicles as it can be used to segment objects in a class agnostic manner based on their motion cues. It enables the detection of unseen objects during training (e.g., moose or a construction truck) based on their motion and independent of their appearance. Although pixel-wise motion segmentation has been studied in autonomous driving lite… ▽ More

    Submitted 26 May, 2021; v1 submitted 16 August, 2020; originally announced August 2020.

    Comments: Accepted for presentation at IEEE IV 2021 (Intelligent Vehicles Symposium) conference

  43. arXiv:2008.04017  [pdf, other

    cs.CV cs.RO

    SynDistNet: Self-Supervised Monocular Fisheye Camera Distance Estimation Synergized with Semantic Segmentation for Autonomous Driving

    Authors: Varun Ravi Kumar, Marvin Klingner, Senthil Yogamani, Stefan Milz, Tim Fingscheidt, Patrick Maeder

    Abstract: State-of-the-art self-supervised learning approaches for monocular depth estimation usually suffer from scale ambiguity. They do not generalize well when applied on distance estimation for complex projection models such as in fisheye and omnidirectional cameras. This paper introduces a novel multi-task learning strategy to improve self-supervised monocular distance estimation on fisheye and pinhol… ▽ More

    Submitted 14 November, 2020; v1 submitted 10 August, 2020; originally announced August 2020.

    Comments: Camera ready version + supplementary. Accepted for presentation at Winter Conference on Applications of Computer Vision 2021

  44. arXiv:2007.09746  [pdf, other

    cs.CV cs.RO

    Beyond Single Stage Encoder-Decoder Networks: Deep Decoders for Semantic Image Segmentation

    Authors: Gabriel L. Oliveira, Senthil Yogamani, Wolfram Burgard, Thomas Brox

    Abstract: Single encoder-decoder methodologies for semantic segmentation are reaching their peak in terms of segmentation quality and efficiency per number of layers. To address these limitations, we propose a new architecture based on a decoder which uses a set of shallow networks for capturing more information content. The new decoder has a new topology of skip connections, namely backward and stacked res… ▽ More

    Submitted 19 July, 2020; originally announced July 2020.

  45. arXiv:2007.06676  [pdf, other

    cs.CV cs.LG cs.RO

    UnRectDepthNet: Self-Supervised Monocular Depth Estimation using a Generic Framework for Handling Common Camera Distortion Models

    Authors: Varun Ravi Kumar, Senthil Yogamani, Markus Bach, Christian Witt, Stefan Milz, Patrick Mader

    Abstract: In classical computer vision, rectification is an integral part of multi-view depth estimation. It typically includes epipolar rectification and lens distortion correction. This process simplifies the depth estimation significantly, and thus it has been adopted in CNN approaches. However, rectification has several side effects, including a reduced field of view (FOV), resampling distortion, and se… ▽ More

    Submitted 6 June, 2023; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: Minor fixes added after IROS 2020 Camera ready submission. IROS 2020 presentation video - https://www.youtube.com/watch?v=3Br2KSWZRrY

  46. arXiv:2007.00801  [pdf, other

    cs.CV cs.RO

    TiledSoilingNet: Tile-level Soiling Detection on Automotive Surround-view Cameras Using Coverage Metric

    Authors: Arindam Das, Pavel Krizek, Ganesh Sistu, Fabian Burger, Sankaralingam Madasamy, Michal Uricar, Varun Ravi Kumar, Senthil Yogamani

    Abstract: Automotive cameras, particularly surround-view cameras, tend to get soiled by mud, water, snow, etc. For higher levels of autonomous driving, it is necessary to have a soiling detection algorithm which will trigger an automatic cleaning system. Localized detection of soiling in an image is necessary to control the cleaning system. It is also necessary to enable partial functionality in unsoiled ar… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: Accepted for Oral Presentation at IEEE Intelligent Transportation Systems Conference (ITSC) 2020

  47. arXiv:2002.00444  [pdf, other

    cs.LG cs.AI cs.RO

    Deep Reinforcement Learning for Autonomous Driving: A Survey

    Authors: B Ravi Kiran, Ibrahim Sobh, Victor Talpaert, Patrick Mannion, Ahmad A. Al Sallab, Senthil Yogamani, Patrick Pérez

    Abstract: With the development of deep representation learning, the domain of reinforcement learning (RL) has become a powerful learning framework now capable of learning complex policies in high dimensional environments. This review summarises deep reinforcement learning (DRL) algorithms and provides a taxonomy of automated driving tasks where (D)RL methods have been employed, while addressing key computat… ▽ More

    Submitted 23 January, 2021; v1 submitted 2 February, 2020; originally announced February 2020.

    Comments: Accepted for publication at IEEE Transactions on Intelligent Transportation Systems

  48. arXiv:2001.02223  [pdf, other

    cs.CV cs.LG cs.RO

    Dynamic Task Weighting Methods for Multi-task Networks in Autonomous Driving Systems

    Authors: Isabelle Leang, Ganesh Sistu, Fabian Burger, Andrei Bursuc, Senthil Yogamani

    Abstract: Deep multi-task networks are of particular interest for autonomous driving systems. They can potentially strike an excellent trade-off between predictive performance, hardware constraints and efficient use of information from multiple types of annotations and modalities. However, training such models is non-trivial and requires balancing learning over all tasks as their respective losses display d… ▽ More

    Submitted 27 June, 2020; v1 submitted 7 January, 2020; originally announced January 2020.

    Comments: Accepted for Oral Presentation at IEEE Intelligent Transportation Systems Conference (ITSC) 2020

  49. arXiv:2001.02161  [pdf, other

    cs.CV cs.RO

    Trained Trajectory based Automated Parking System using Visual SLAM on Surround View Cameras

    Authors: Nivedita Tripathi, Senthil Yogamani

    Abstract: Automated Parking is becoming a standard feature in modern vehicles. Existing parking systems build a local map to be able to plan for maneuvering towards a detected slot. Next generation parking systems have an use case where they build a persistent map of the environment where the car is frequently parked, say for example, home parking or office parking. The pre-built map helps in re-localizing… ▽ More

    Submitted 19 May, 2021; v1 submitted 7 January, 2020; originally announced January 2020.

    Comments: Accepted for presentation at CVPR 2021 Workshop on Women in Computer Vision

  50. arXiv:1912.11066  [pdf, other

    cs.CV cs.RO

    FisheyeMultiNet: Real-time Multi-task Learning Architecture for Surround-view Automated Parking System

    Authors: Pullarao Maddu, Wayne Doherty, Ganesh Sistu, Isabelle Leang, Michal Uricar, Sumanth Chennupati, Hazem Rashed, Jonathan Horgan, Ciaran Hughes, Senthil Yogamani

    Abstract: Automated Parking is a low speed manoeuvring scenario which is quite unstructured and complex, requiring full 360° near-field sensing around the vehicle. In this paper, we discuss the design and implementation of an automated parking system from the perspective of camera based deep learning algorithms. We provide a holistic overview of an industrial system covering the embedded system, use cases a… ▽ More

    Submitted 23 December, 2019; originally announced December 2019.

    Comments: Accepted for publication at Irish Machine Vision and Image Processing (IMVIP) 2019