Showing 1–3 of 3 results for author: Almuzairee, A

  1. arXiv:2405.17416  [pdf, other]

    cs.LG cs.CV cs.RO

    A Recipe for Unbounded Data Augmentation in Visual Reinforcement Learning

    Authors: Abdulaziz Almuzairee, Nicklas Hansen, Henrik I. Christensen

    Abstract: $Q$-learning algorithms are appealing for real-world applications due to their data-efficiency, but they are very prone to overfitting and training instabilities when trained from visual observations. Prior work, namely SVEA, finds that selective application of data augmentation can improve the visual generalization of RL agents without destabilizing training. We revisit its recipe for data augmen…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted at RLC 2024

  2. arXiv:2306.03810  [pdf, other]

    cs.CV cs.RO

    X-Align++: cross-modal cross-view alignment for Bird's-eye-view segmentation

    Authors: Shubhankar Borse, Senthil Yogamani, Marvin Klingner, Varun Ravi, Hong Cai, Abdulaziz Almuzairee, Fatih Porikli

    Abstract: Bird's-eye-view (BEV) grid is a typical representation of the perception of road components, e.g., drivable area, in autonomous driving. Most existing approaches rely on cameras only to perform segmentation in BEV space, which is fundamentally constrained by the absence of reliable depth information. The latest works leverage both camera and LiDAR modalities but suboptimally fuse their features us…

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: Accepted for publication at Springer Machine Vision and Applications Journal. The Version of Record of this article is published in Machine Vision and Applications Journal, and is available online at https://doi.org/10.1007/s00138-023-01400-7. arXiv admin note: substantial text overlap with arXiv:2210.06778

  3. arXiv:2210.06778  [pdf, other]

    cs.CV

    X-Align: Cross-Modal Cross-View Alignment for Bird's-Eye-View Segmentation

    Authors: Shubhankar Borse, Marvin Klingner, Varun Ravi Kumar, Hong Cai, Abdulaziz Almuzairee, Senthil Yogamani, Fatih Porikli

    Abstract: Bird's-eye-view (BEV) grid is a common representation for the perception of road components, e.g., drivable area, in autonomous driving. Most existing approaches rely on cameras only to perform segmentation in BEV space, which is fundamentally constrained by the absence of reliable depth information. Latest works leverage both camera and LiDAR modalities, but sub-optimally fuse their features usin…

    Submitted 31 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted to WACV 2023