
Showing 1–38 of 38 results for author: Sistu, G

Searching in archive cs.
  1. arXiv:2406.09203  [pdf, other]

    cs.CV

    Optimizing Visual Question Answering Models for Driving: Bridging the Gap Between Human and Machine Attention Patterns

    Authors: Kaavya Rekanar, Martin Hayes, Ganesh Sistu, Ciaran Eising

    Abstract: Visual Question Answering (VQA) models play a critical role in enhancing the perception capabilities of autonomous driving systems by allowing vehicles to analyze visual inputs alongside textual queries, fostering natural interaction and trust between the vehicle and its occupants or other road users. This study investigates the attention patterns of humans compared to a VQA model when answering d…

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2404.13443  [pdf, other]

    cs.CV cs.RO

    FisheyeDetNet: 360° Surround view Fisheye Camera based Object Detection System for Autonomous Driving

    Authors: Ganesh Sistu, Senthil Yogamani

    Abstract: Object detection is a mature problem in autonomous driving with pedestrian detection being one of the first deployed algorithms. It has been comprehensively studied in the literature. However, object detection is relatively less explored for fisheye cameras used for surround-view near field sensing. The standard bounding box representation fails in fisheye cameras due to heavy radial distortion, p…

    Submitted 27 April, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: text overlap with arXiv:2206.05542 by other authors

  3. Depth Estimation using Weighted-loss and Transfer Learning

    Authors: Muhammad Adeel Hafeez, Michael G. Madden, Ganesh Sistu, Ihsan Ullah

    Abstract: Depth estimation from 2D images is a common computer vision task that has applications in many fields including autonomous vehicles, scene understanding and robotics. The accuracy of a supervised depth estimation method mainly relies on the chosen loss function, the model architecture, quality of data and performance metrics. In this study, we propose a simplified and adaptable approach to improve…

    Submitted 11 April, 2024; originally announced April 2024.

  4. Scalable and Efficient Hierarchical Visual Topological Mapping

    Authors: Saravanabalagi Ramachandran, Jonathan Horgan, Ganesh Sistu, John McDonald

    Abstract: Hierarchical topological representations can significantly reduce search times within mapping and localization algorithms. Although recent research has shown the potential for such approaches, limited consideration has been given to the suitability and comparative performance of different global feature representations within this context. In this work, we evaluate state-of-the-art hand-crafted an…

    Submitted 7 April, 2024; originally announced April 2024.

    Comments: Published in the 21st International Conference on Advanced Robotics (ICAR 2023)

  5. arXiv:2404.04456  [pdf]

    cs.CV cs.AI cs.LG

    Beyond the Known: Adversarial Autoencoders in Novelty Detection

    Authors: Muhammad Asad, Ihsan Ullah, Ganesh Sistu, Michael G. Madden

    Abstract: In novelty detection, the goal is to decide if a new data point should be categorized as an inlier or an outlier, given a training dataset that primarily captures the inlier distribution. Recent approaches typically use deep encoder and decoder network frameworks to derive a reconstruction error, and employ this error either to determine a novelty score, or as the basis for a one-class classifier.…

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: Accepted at VISAPP 2024

  6. arXiv:2402.00637  [pdf, other]

    cs.CV

    Fisheye Camera and Ultrasonic Sensor Fusion For Near-Field Obstacle Perception in Bird's-Eye-View

    Authors: Arindam Das, Sudarshan Paul, Niko Scholz, Akhilesh Kumar Malviya, Ganesh Sistu, Ujjwal Bhattacharya, Ciarán Eising

    Abstract: Accurate obstacle identification represents a fundamental challenge within the scope of near-field perception for autonomous driving. Conventionally, fisheye cameras are frequently employed for comprehensive surround-view perception, including rear-view obstacle localization. However, the performance of such cameras can significantly deteriorate in low-light conditions, during nighttime, or when s…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 16 pages, 12 figures, 6 tables

  7. arXiv:2401.00910  [pdf, other]

    cs.CV cs.LG

    WoodScape Motion Segmentation for Autonomous Driving -- CVPR 2023 OmniCV Workshop Challenge

    Authors: Saravanabalagi Ramachandran, Nathaniel Cibik, Ganesh Sistu, John McDonald

    Abstract: Motion segmentation is a complex yet indispensable task in autonomous driving. The challenges introduced by the ego-motion of the cameras, radial distortion in fisheye lenses, and the need for temporal consistency make the task more complicated, rendering traditional and standard Convolutional Neural Network (CNN) approaches less effective. The consequent laborious data labeling, representation of…

    Submitted 16 January, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

    Comments: CVPR 2023 OmniCV Workshop Challenge

  8. arXiv:2312.13104  [pdf, other]

    cs.CV

    Optimizing Ego Vehicle Trajectory Prediction: The Graph Enhancement Approach

    Authors: Sushil Sharma, Aryan Singh, Ganesh Sistu, Mark Halton, Ciarán Eising

    Abstract: Predicting the trajectory of an ego vehicle is a critical component of autonomous driving systems. Current state-of-the-art methods typically rely on Deep Neural Networks (DNNs) and sequential models to process front-view images for future trajectory prediction. However, these approaches often struggle with perspective issues affecting object features in the scene. To address this, we advocate for…

    Submitted 10 January, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: Accepted for publication in the Electronic Imaging Autonomous Vehicles and Machines (EI-AVM) Conference 2024

  9. arXiv:2312.13081  [pdf, other]

    cs.CV

    BEVSeg2TP: Surround View Camera Bird's-Eye-View Based Joint Vehicle Segmentation and Ego Vehicle Trajectory Prediction

    Authors: Sushil Sharma, Arindam Das, Ganesh Sistu, Mark Halton, Ciarán Eising

    Abstract: Trajectory prediction is, naturally, a key task for vehicle autonomy. While the number of traffic rules is limited, the combinations and uncertainties associated with each agent's behaviour in real-world scenarios are nearly impossible to encode. Consequently, there is a growing interest in learning-based trajectory prediction. The proposed method in this paper predicts trajectories by considering…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: Accepted for publication in the International Conference on Computer Vision Theory and Applications (VISAPP) 2024

  10. arXiv:2308.08495  [pdf]

    cs.CV cs.RO

    Self-Supervised Online Camera Calibration for Automated Driving and Parking Applications

    Authors: Ciarán Hogan, Ganesh Sistu, Ciarán Eising

    Abstract: Camera-based perception systems play a central role in modern autonomous vehicles. These camera based perception algorithms require an accurate calibration to map the real world distances to image pixels. In practice, calibration is a laborious procedure requiring specialised data collection and careful tuning. This process must be repeated whenever the parameters of the camera change, which can b…

    Submitted 16 August, 2023; originally announced August 2023.

    Journal ref: Proceedings of the Irish Machine Vision and Image Processing Conference 2023

  11. arXiv:2307.09329  [pdf, other]

    cs.CV

    Towards a performance analysis on pre-trained Visual Question Answering models for autonomous driving

    Authors: Kaavya Rekanar, Ciarán Eising, Ganesh Sistu, Martin Hayes

    Abstract: This short paper presents a preliminary analysis of three popular Visual Question Answering (VQA) models, namely ViLBERT, ViLT, and LXMERT, in the context of answering questions relating to driving scenarios. The performance of these models is evaluated by comparing the similarity of responses to reference answers provided by computer vision experts. Model selection is predicated on the analysis o…

    Submitted 28 July, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    Journal ref: Proceedings of the Irish Machine Vision and Image Processing Conference 2023

  12. arXiv:2307.05288  [pdf, other]

    cs.CV

    Navigating Uncertainty: The Role of Short-Term Trajectory Prediction in Autonomous Vehicle Safety

    Authors: Sushil Sharma, Ganesh Sistu, Lucie Yahiaoui, Arindam Das, Mark Halton, Ciarán Eising

    Abstract: Autonomous vehicles require accurate and reliable short-term trajectory predictions for safe and efficient driving. While most commercial automated vehicles currently use state machine-based algorithms for trajectory forecasting, recent efforts have focused on end-to-end data-driven systems. Often, the design of these models is limited by the availability of datasets, which are typically restricte…

    Submitted 12 July, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

    Journal ref: Proceedings of the Irish Machine Vision and Image Processing Conference 2023

  13. arXiv:2304.07047  [pdf, other]

    cs.RO cs.CV

    Near Field iToF LIDAR Depth Improvement from Limited Number of Shots

    Authors: Mena Nagiub, Thorsten Beuth, Ganesh Sistu, Heinrich Gotzig, Ciarán Eising

    Abstract: Indirect Time of Flight LiDARs can indirectly calculate the scene's depth from the phase shift angle between transmitted and received laser signals with amplitudes modulated at a predefined frequency. Unfortunately, this method generates ambiguity in calculated depth when the phase shift angle value exceeds $2π$. Current state-of-the-art methods use raw samples generated using two distinct modulat…

    Submitted 28 July, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Journal ref: Proceedings of the 2023 IEEE 97th Vehicular Technology Conference (VTC2023-Spring)

  14. arXiv:2302.12589  [pdf, other]

    cs.CV cs.AI

    Revisiting Modality Imbalance In Multimodal Pedestrian Detection

    Authors: Arindam Das, Sudip Das, Ganesh Sistu, Jonathan Horgan, Ujjwal Bhattacharya, Edward Jones, Martin Glavin, Ciarán Eising

    Abstract: Multimodal learning, particularly for pedestrian detection, has recently received emphasis due to its capability to function equally well in several critical autonomous driving scenarios such as low-light, night-time, and adverse weather conditions. However, in most cases, the training distribution largely emphasizes the contribution of one specific input that makes the network biased towards one…

    Submitted 7 July, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: 5 pages, 3 figures, 4 tables

    Journal ref: In proceedings of the IEEE 2023 International Conference on Image Processing

  15. Fast and Efficient Scene Categorization for Autonomous Driving using VAEs

    Authors: Saravanabalagi Ramachandran, Jonathan Horgan, Ganesh Sistu, John McDonald

    Abstract: Scene categorization is a useful precursor task that provides prior knowledge for many advanced computer vision tasks with a broad range of applications in content-based image indexing and retrieval systems. Despite the success of data driven approaches in the field of computer vision such as object detection, semantic segmentation, etc., their application in learning high-level features for scene…

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: Published in the 24th Irish Machine Vision and Image Processing Conference (IMVIP 2022)

    Journal ref: The 24th Irish Machine Vision and Image Processing Conference (IMVIP), 2022, 9-16

  16. arXiv:2206.12912  [pdf, other]

    cs.CV

    Woodscape Fisheye Object Detection for Autonomous Driving -- CVPR 2022 OmniCV Workshop Challenge

    Authors: Saravanabalagi Ramachandran, Ganesh Sistu, Varun Ravi Kumar, John McDonald, Senthil Yogamani

    Abstract: Object detection is a comprehensively studied problem in autonomous driving. However, it has been relatively less explored in the case of fisheye cameras. The strong radial distortion breaks the translation invariance inductive bias of Convolutional Neural Networks. Thus, we present the WoodScape fisheye object detection challenge for autonomous driving which was held as part of the CVPR 2022 Work…

    Submitted 26 June, 2022; originally announced June 2022.

    Comments: Workshop on Omnidirectional Computer Vision (OmniCV) at Conference on Computer Vision and Pattern Recognition (CVPR) 2022

  17. arXiv:2206.07510  [pdf, other]

    cs.CV cs.LG

    Deep Multi-Task Networks For Occluded Pedestrian Pose Estimation

    Authors: Arindam Das, Sudip Das, Ganesh Sistu, Jonathan Horgan, Ujjwal Bhattacharya, Edward Jones, Martin Glavin, Ciarán Eising

    Abstract: Most of the existing works on pedestrian pose estimation do not consider estimating the pose of an occluded pedestrian, as the annotations of the occluded parts are not available in relevant automotive datasets. For example, CityPersons, a well-known dataset for pedestrian detection in automotive scenes does not provide pose annotations, whereas MS-COCO, a non-automotive dataset, contains human po…

    Submitted 8 August, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: 4 pages, 5 tables, 2 figures

    Journal ref: Proceedings of the 2022 Irish Machine Vision and Image Processing Conference

  18. arXiv:2205.15667  [pdf, other]

    cs.CV

    ViT-BEVSeg: A Hierarchical Transformer Network for Monocular Birds-Eye-View Segmentation

    Authors: Pramit Dutta, Ganesh Sistu, Senthil Yogamani, Edgar Galván, John McDonald

    Abstract: Generating a detailed near-field perceptual model of the environment is an important and challenging problem in both self-driving vehicles and autonomous mobile robotics. A Bird Eye View (BEV) map, providing a panoptic representation, is a commonly used approach that provides a simplified 2D representation of the vehicle surroundings with accurate semantic level segmentation for many downstream ta…

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: Accepted for 2022 IEEE World Congress on Computational Intelligence (Track: IJCNN)

  19. arXiv:2205.02105  [pdf, ps, other]

    cs.NE cs.CV

    Neuroevolutionary Multi-objective approaches to Trajectory Prediction in Autonomous Vehicles

    Authors: Fergal Stapleton, Edgar Galván, Ganesh Sistu, Senthil Yogamani

    Abstract: The incentive for using Evolutionary Algorithms (EAs) for the automated optimization and training of deep neural networks (DNNs), a process referred to as neuroevolution, has gained momentum in recent years. The configuration and training of these networks can be posed as optimization problems. Indeed, most of the recent works on neuroevolution have focused their attention on single-objective opti…

    Submitted 6 May, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: Accepted in Genetic and Evolutionary Computation Conference Companion (GECCO '22 Companion), July 9--13, 2022, Boston, MA, USA, 4 pages, 1 figure, 6 tables

  20. arXiv:2107.08246  [pdf, other]

    cs.CV cs.RO

    Woodscape Fisheye Semantic Segmentation for Autonomous Driving -- CVPR 2021 OmniCV Workshop Challenge

    Authors: Saravanabalagi Ramachandran, Ganesh Sistu, John McDonald, Senthil Yogamani

    Abstract: We present the WoodScape fisheye semantic segmentation challenge for autonomous driving which was held as part of the CVPR 2021 Workshop on Omnidirectional Computer Vision (OmniCV). This challenge is one of the first opportunities for the research community to evaluate the semantic segmentation techniques targeted for fisheye camera perception. Due to strong radial distortion standard models don't…

    Submitted 17 July, 2021; originally announced July 2021.

    Comments: Workshop on Omnidirectional Computer Vision (OmniCV) at Conference on Computer Vision and Pattern Recognition (CVPR) 2021. Presentation video is available at https://youtu.be/xa7Fl2mD4CA?t=12253

  21. arXiv:2105.07930  [pdf, other]

    cs.CV

    Ensemble-based Semi-supervised Learning to Improve Noisy Soiling Annotations in Autonomous Driving

    Authors: Michal Uricar, Ganesh Sistu, Lucie Yahiaoui, Senthil Yogamani

    Abstract: Manual annotation of soiling on surround view cameras is a very challenging and expensive task. The unclear boundary for various soiling categories like water drops or mud particles usually results in a large variance in the annotation quality. As a result, the models trained on such poorly annotated data are far from being optimal. In this paper, we focus on handling such noisy annotations via ps…

    Submitted 11 July, 2021; v1 submitted 17 May, 2021; originally announced May 2021.

    Comments: Accepted for Oral Presentation at IEEE Intelligent Transportation Systems Conference (ITSC) 2021

  22. arXiv:2103.00191  [pdf, other]

    cs.CV cs.RO

    FisheyeSuperPoint: Keypoint Detection and Description Network for Fisheye Images

    Authors: Anna Konrad, Ciarán Eising, Ganesh Sistu, John McDonald, Rudi Villing, Senthil Yogamani

    Abstract: Keypoint detection and description is a commonly used building block in computer vision systems particularly for robotics and autonomous driving. However, the majority of techniques to date have focused on standard cameras with little consideration given to fisheye cameras which are commonly used in urban driving and automated parking. In this paper, we propose a novel training and evaluation pipe…

    Submitted 29 November, 2021; v1 submitted 27 February, 2021; originally announced March 2021.

    Comments: Accepted for Presentation at International Conference on Computer Vision Theory and Applications (VISAPP 2022)

  23. arXiv:2102.07448  [pdf, other]

    cs.CV cs.RO

    OmniDet: Surround View Cameras based Multi-task Visual Perception Network for Autonomous Driving

    Authors: Varun Ravi Kumar, Senthil Yogamani, Hazem Rashed, Ganesh Sistu, Christian Witt, Isabelle Leang, Stefan Milz, Patrick Mäder

    Abstract: Surround View fisheye cameras are commonly deployed in automated driving for 360° near-field sensing around the vehicle. This work presents a multi-task visual perception network on unrectified fisheye images to enable the vehicle to sense its surrounding environment. It consists of six primary tasks necessary for an autonomous driving system: depth estimation, visual odometry, semantic segmentati…

    Submitted 6 June, 2023; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: Best Robot Vision paper award finalist (top 4). Camera ready version accepted for RA-L and ICRA 2021 publication

  24. arXiv:2012.02124  [pdf, other]

    cs.CV cs.RO

    Generalized Object Detection on Fisheye Cameras for Autonomous Driving: Dataset, Representations and Baseline

    Authors: Hazem Rashed, Eslam Mohamed, Ganesh Sistu, Varun Ravi Kumar, Ciaran Eising, Ahmad El-Sallab, Senthil Yogamani

    Abstract: Object detection is a comprehensively studied problem in autonomous driving. However, it has been relatively less explored in the case of fisheye cameras. The standard bounding box fails in fisheye cameras due to the strong radial distortion, particularly in the image's periphery. We explore better representations like oriented bounding box, ellipse, and generic polygon for object detection in fis…

    Submitted 21 December, 2022; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: Camera ready version. Accepted for presentation at Winter Conference on Applications of Computer Vision 2021. Dataset is shared at https://drive.google.com/drive/folders/1bobmY2wlIBozeU5ZgPfYPqVAnpPw4QrM

  25. arXiv:2010.11681  [pdf, other]

    cs.CV cs.RO

    Learning Panoptic Segmentation from Instance Contours

    Authors: Sumanth Chennupati, Venkatraman Narayanan, Ganesh Sistu, Senthil Yogamani, Samir A Rawashdeh

    Abstract: Panoptic Segmentation aims to provide an understanding of background (stuff) and instances of objects (things) at a pixel level. It combines the separate tasks of semantic segmentation (pixel level classification) and instance segmentation to build a single unified scene understanding task. Typically, panoptic segmentation is derived by combining semantic and instance segmentation tasks that are l…

    Submitted 5 April, 2021; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: Accepted at ICRA 2021. Overview Video: https://youtu.be/wBtcxRhG3e0

  26. arXiv:2007.00801  [pdf, other]

    cs.CV cs.RO

    TiledSoilingNet: Tile-level Soiling Detection on Automotive Surround-view Cameras Using Coverage Metric

    Authors: Arindam Das, Pavel Krizek, Ganesh Sistu, Fabian Burger, Sankaralingam Madasamy, Michal Uricar, Varun Ravi Kumar, Senthil Yogamani

    Abstract: Automotive cameras, particularly surround-view cameras, tend to get soiled by mud, water, snow, etc. For higher levels of autonomous driving, it is necessary to have a soiling detection algorithm which will trigger an automatic cleaning system. Localized detection of soiling in an image is necessary to control the cleaning system. It is also necessary to enable partial functionality in unsoiled ar…

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: Accepted for Oral Presentation at IEEE Intelligent Transportation Systems Conference (ITSC) 2020

  27. arXiv:2001.02223  [pdf, other]

    cs.CV cs.LG cs.RO

    Dynamic Task Weighting Methods for Multi-task Networks in Autonomous Driving Systems

    Authors: Isabelle Leang, Ganesh Sistu, Fabian Burger, Andrei Bursuc, Senthil Yogamani

    Abstract: Deep multi-task networks are of particular interest for autonomous driving systems. They can potentially strike an excellent trade-off between predictive performance, hardware constraints and efficient use of information from multiple types of annotations and modalities. However, training such models is non-trivial and requires balancing learning over all tasks as their respective losses display d…

    Submitted 27 June, 2020; v1 submitted 7 January, 2020; originally announced January 2020.

    Comments: Accepted for Oral Presentation at IEEE Intelligent Transportation Systems Conference (ITSC) 2020

  28. arXiv:1912.11066  [pdf, other]

    cs.CV cs.RO

    FisheyeMultiNet: Real-time Multi-task Learning Architecture for Surround-view Automated Parking System

    Authors: Pullarao Maddu, Wayne Doherty, Ganesh Sistu, Isabelle Leang, Michal Uricar, Sumanth Chennupati, Hazem Rashed, Jonathan Horgan, Ciaran Hughes, Senthil Yogamani

    Abstract: Automated Parking is a low speed manoeuvring scenario which is quite unstructured and complex, requiring full 360° near-field sensing around the vehicle. In this paper, we discuss the design and implementation of an automated parking system from the perspective of camera based deep learning algorithms. We provide a holistic overview of an industrial system covering the embedded system, use cases a…

    Submitted 23 December, 2019; originally announced December 2019.

    Comments: Accepted for publication at Irish Machine Vision and Image Processing (IMVIP) 2019

  29. arXiv:1912.02249  [pdf, other]

    cs.CV cs.LG cs.RO

    Let's Get Dirty: GAN Based Data Augmentation for Camera Lens Soiling Detection in Autonomous Driving

    Authors: Michal Uricar, Ganesh Sistu, Hazem Rashed, Antonin Vobecky, Varun Ravi Kumar, Pavel Krizek, Fabian Burger, Senthil Yogamani

    Abstract: Wide-angle fisheye cameras are commonly used in automated driving for parking and low-speed navigation tasks. Four of such cameras form a surround-view system that provides a complete and detailed view of the vehicle. These cameras are directly exposed to harsh environmental settings and can get soiled very easily by mud, dust, water, frost. Soiling on the camera lens can severely degrade the visu…

    Submitted 14 November, 2020; v1 submitted 4 December, 2019; originally announced December 2019.

    Comments: Camera ready version + supplementary material. Accepted for presentation at Winter Conference on Applications of Computer Vision 2021

  30. arXiv:1910.05395  [pdf, other]

    cs.CV cs.RO eess.IV

    FuseMODNet: Real-Time Camera and LiDAR based Moving Object Detection for robust low-light Autonomous Driving

    Authors: Hazem Rashed, Mohamed Ramzy, Victor Vaquero, Ahmad El Sallab, Ganesh Sistu, Senthil Yogamani

    Abstract: Moving object detection is a critical task for autonomous vehicles. As dynamic objects represent higher collision risk than static ones, our own ego-trajectories have to be planned attending to the future states of the moving elements of the scene. Motion can be perceived using temporal information such as optical flow. Conventional optical flow computation is based on camera sensors only, which m…

    Submitted 20 November, 2019; v1 submitted 11 October, 2019; originally announced October 2019.

    Comments: Accepted for Oral presentation at ICCV 2019 Workshop on Autonomous Driving. https://sites.google.com/view/fusemodnet

  31. arXiv:1908.11789  [pdf, other]

    cs.CV eess.IV

    FisheyeMODNet: Moving Object detection on Surround-view Cameras for Autonomous Driving

    Authors: Marie Yahiaoui, Hazem Rashed, Letizia Mariotti, Ganesh Sistu, Ian Clancy, Lucie Yahiaoui, Varun Ravi Kumar, Senthil Yogamani

    Abstract: Moving Object Detection (MOD) is an important task for achieving robust autonomous driving. An autonomous vehicle has to estimate collision risk with other interacting objects in the environment and calculate an optional trajectory. Collision risk is typically higher for moving objects than static ones due to the need to estimate the future states and poses of the objects for decision making. This…

    Submitted 30 August, 2019; originally announced August 2019.

    Comments: Accepted for ICCV 2019 Workshop on 360° Perception and Interaction. A shorter version was presented at IMVIP 2019

  32. arXiv:1905.01492  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO stat.ML

    SoilingNet: Soiling Detection on Automotive Surround-View Cameras

    Authors: Michal Uricar, Pavel Krizek, Ganesh Sistu, Senthil Yogamani

    Abstract: Cameras are an essential part of sensor suite in autonomous driving. Surround-view cameras are directly exposed to external environment and are vulnerable to get soiled. Cameras have a much higher degradation in performance due to soiling compared to other sensors. Thus it is critical to accurately detect soiling on the cameras, particularly for higher levels of autonomous driving. We created a ne…

    Submitted 17 July, 2019; v1 submitted 4 May, 2019; originally announced May 2019.

    Comments: Accepted for Oral Presentation at IEEE Intelligent Transportation Systems Conference (ITSC) 2019

  33. arXiv:1905.01489  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO stat.ML

    WoodScape: A multi-task, multi-camera fisheye dataset for autonomous driving

    Authors: Senthil Yogamani, Ciaran Hughes, Jonathan Horgan, Ganesh Sistu, Padraig Varley, Derek O'Dea, Michal Uricar, Stefan Milz, Martin Simon, Karl Amende, Christian Witt, Hazem Rashed, Sumanth Chennupati, Sanjaya Nayak, Saquib Mansoor, Xavier Perroton, Patrick Perez

    Abstract: Fisheye cameras are commonly employed for obtaining a large field of view in surveillance, augmented reality and in particular automotive applications. In spite of their prevalence, there are few public datasets for detailed evaluation of computer vision algorithms on fisheye images. We release the first extensive fisheye automotive dataset, WoodScape, named after Robert Wood who invented the fish…

    Submitted 2 July, 2021; v1 submitted 4 May, 2019; originally announced May 2019.

    Comments: Accepted for Oral Presentation at IEEE International Conference on Computer Vision (ICCV) 2019. Please refer to our website https://woodscape.valeo.com and https://github.com/valeoai/woodscape for release status and updates

  34. arXiv:1904.08492  [pdf, other]

    cs.CV

    MultiNet++: Multi-Stream Feature Aggregation and Geometric Loss Strategy for Multi-Task Learning

    Authors: Sumanth Chennupati, Ganesh Sistu, Senthil Yogamani, Samir A Rawashdeh

    Abstract: Multi-task learning is commonly used in autonomous driving for solving various visual perception tasks. It offers significant benefits in terms of both performance and computational complexity. Current work on multi-task learning networks focus on processing a single input image and there is no known implementation of multi-task learning handling a sequence of images. In this work, we propose a mu…

    Submitted 22 April, 2019; v1 submitted 15 April, 2019; originally announced April 2019.

    Comments: Accepted for CVPR 2019 Workshop on Autonomous Driving (WAD). Demo Video can be accessed at https://youtu.be/E378PzLq7lQ

  35. arXiv:1902.03589  [pdf, other]

    cs.CV cs.LG cs.RO stat.ML

    NeurAll: Towards a Unified Visual Perception Model for Automated Driving

    Authors: Ganesh Sistu, Isabelle Leang, Sumanth Chennupati, Senthil Yogamani, Ciaran Hughes, Stefan Milz, Samir Rawashdeh

    Abstract: Convolutional Neural Networks (CNNs) are successfully used for the important automotive visual perception tasks including object recognition, motion and depth estimation, visual SLAM, etc. However, these tasks are typically independently explored and modeled. In this paper, we propose a joint multi-task network design for learning several tasks simultaneously. Our main motivation is the computatio…

    Submitted 9 March, 2024; v1 submitted 10 February, 2019; originally announced February 2019.

    Comments: Accepted for Oral Presentation at IEEE Intelligent Transportation Systems Conference (ITSC) 2019

  36. arXiv:1901.05808  [pdf, other]

    cs.CV cs.LG stat.ML

    AuxNet: Auxiliary tasks enhanced Semantic Segmentation for Automated Driving

    Authors: Sumanth Chennupati, Ganesh Sistu, Senthil Yogamani, Samir Rawashdeh

    Abstract: Decision making in automated driving is highly specific to the environment and thus semantic segmentation plays a key role in recognizing the objects in the environment around the car. Pixel level classification once considered a challenging task which is now becoming mature to be productized in a car. However, semantic annotation is time consuming and quite expensive. Synthetic datasets with doma…

    Submitted 17 January, 2019; originally announced January 2019.

    Comments: Accepted as a Short Paper for a poster presentation at VISAPP 2019

  37. arXiv:1901.03912  [pdf, other]

    cs.CV cs.LG stat.ML

    Real-time Joint Object Detection and Semantic Segmentation Network for Automated Driving

    Authors: Ganesh Sistu, Isabelle Leang, Senthil Yogamani

    Abstract: Convolutional Neural Networks (CNN) are successfully used for various visual perception tasks including bounding box object detection, semantic segmentation, optical flow, depth estimation and visual SLAM. Generally these tasks are independently explored and modeled. In this paper, we present a joint multi-task network design for learning object detection and semantic segmentation simultaneously.…

    Submitted 12 January, 2019; originally announced January 2019.

    Comments: Presented at NeurIPS 2018 Workshop on Machine Learning on the Phone and other Consumer Devices (MLPCD 2)

  38. arXiv:1901.02511  [pdf, other]

    cs.CV cs.LG stat.ML

    Multi-stream CNN based Video Semantic Segmentation for Automated Driving

    Authors: Ganesh Sistu, Sumanth Chennupati, Senthil Yogamani

    Abstract: Majority of semantic segmentation algorithms operate on a single frame even in the case of videos. In this work, the goal is to exploit temporal information within the algorithm model for leveraging motion cues and temporal consistency. We propose two simple high-level architectures based on Recurrent FCN (RFCN) and Multi-Stream FCN (MSFCN) networks. In case of RFCN, a recurrent network namely LST…

    Submitted 8 January, 2019; originally announced January 2019.

    Comments: Accepted for Oral Presentation at VISAPP 2019