Showing 1–13 of 13 results for author: Harakeh, A

  1. arXiv:2404.13148  [pdf, other]

    cs.CV

    BACS: Background Aware Continual Semantic Segmentation

    Authors: Mostafa ElAraby, Ali Harakeh, Liam Paull

    Abstract: Semantic segmentation plays a crucial role in enabling comprehensive scene understanding for robotic systems. However, generating annotations is challenging, requiring labels for every pixel in an image. In scenarios like autonomous driving, there's a need to progressively incorporate new classes as the operating environment of the deployed agent becomes more complex. For enhanced annotation effic…

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 8 pages, 4 figures, CRV 2024

  2. arXiv:2301.05709  [pdf, other]

    cs.CV

    Self-Supervised Image-to-Point Distillation via Semantically Tolerant Contrastive Loss

    Authors: Anas Mahmoud, Jordan S. K. Hu, Tianshu Kuai, Ali Harakeh, Liam Paull, Steven L. Waslander

    Abstract: An effective framework for learning 3D representations for perception tasks is distilling rich self-supervised image features via contrastive learning. However, image-to-point representation learning for autonomous driving datasets faces two main challenges: 1) the abundance of self-similarity, which results in the contrastive losses pushing away semantically similar point and image regions and th…

    Submitted 24 March, 2023; v1 submitted 12 January, 2023; originally announced January 2023.

    Comments: Accepted in CVPR 2023

  3. arXiv:2211.13724  [pdf, other]

    cs.LG cs.CV

    Estimating Regression Predictive Distributions with Sample Networks

    Authors: Ali Harakeh, Jordan Hu, Naiqing Guan, Steven L. Waslander, Liam Paull

    Abstract: Estimating the uncertainty in deep neural network predictions is crucial for many real-world applications. A common approach to model uncertainty is to choose a parametric distribution and fit the data to it using maximum likelihood estimation. The chosen parametric form can be a poor fit to the data-generating distribution, resulting in unreliable uncertainty estimates. In this work, we propose S…

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: Accepted for publication in AAAI 2023. Example code at: https://samplenet.github.io/

  4. arXiv:2107.13682  [pdf, other]

    cs.CV

    Bayesian Embeddings for Few-Shot Open World Recognition

    Authors: John Willes, James Harrison, Ali Harakeh, Chelsea Finn, Marco Pavone, Steven Waslander

    Abstract: As autonomous decision-making agents move from narrow operating environments to unstructured worlds, learning systems must move from a closed-world formulation to an open-world and few-shot setting in which agents continuously learn new classes from small amounts of information. This stands in stark contrast to modern machine learning systems that are typically designed with a known set of classes…

    Submitted 5 October, 2022; v1 submitted 28 July, 2021; originally announced July 2021.

  5. arXiv:2103.01100  [pdf, other]

    cs.CV

    Categorical Depth Distribution Network for Monocular 3D Object Detection

    Authors: Cody Reading, Ali Harakeh, Julia Chae, Steven L. Waslander

    Abstract: Monocular 3D object detection is a key problem for autonomous vehicles, as it provides a solution with simple configuration compared to typical multi-sensor systems. The main challenge in monocular 3D detection lies in accurately predicting object depth, which must be inferred from object and scene cues due to the lack of direct range measurement. Many methods attempt to directly estimate depth to…

    Submitted 23 March, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: Accepted in CVPR 2021

  6. arXiv:2101.05036  [pdf, other]

    cs.CV stat.ML

    Estimating and Evaluating Regression Predictive Uncertainty in Deep Object Detectors

    Authors: Ali Harakeh, Steven L. Waslander

    Abstract: Predictive uncertainty estimation is an essential next step for the reliable deployment of deep object detectors in safety-critical tasks. In this work, we focus on estimating predictive distributions for bounding box regression output with variance networks. We show that in the context of object detection, training variance networks with negative log likelihood (NLL) can lead to high entropy pred…

    Submitted 12 March, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

    Comments: Published as a conference paper at ICLR 2021. Link: https://openreview.net/forum?id=YLewtnvKgR7. This is the final camera-ready version

  7. A Review and Comparative Study on Probabilistic Object Detection in Autonomous Driving

    Authors: Di Feng, Ali Harakeh, Steven Waslander, Klaus Dietmayer

    Abstract: Capturing uncertainty in object detection is indispensable for safe autonomous driving. In recent years, deep learning has become the de-facto approach for object detection, and many probabilistic object detectors have been proposed. However, there is no summary on uncertainty estimation in deep object detection, and existing methods are not only built with different network architectures and unce…

    Submitted 11 July, 2021; v1 submitted 20 November, 2020; originally announced November 2020.

    Comments: Accepted in the IEEE Transactions on Intelligent Transportation Systems

  8. arXiv:1903.03838  [pdf, other]

    cs.CV

    BayesOD: A Bayesian Approach for Uncertainty Estimation in Deep Object Detectors

    Authors: Ali Harakeh, Michael Smart, Steven L. Waslander

    Abstract: When incorporating deep neural networks into robotic systems, a major challenge is the lack of uncertainty measures associated with their output predictions. Methods for uncertainty estimation in the output of deep object detectors (DNNs) have been proposed in recent works, but have had limited success due to 1) information loss at the detector's non-maximum suppression (NMS) stage, and 2) failure…

    Submitted 16 September, 2019; v1 submitted 9 March, 2019; originally announced March 2019.

  9. arXiv:1807.06072  [pdf, other]

    cs.LG cs.AI stat.ML

    Leveraging Pre-Trained 3D Object Detection Models For Fast Ground Truth Generation

    Authors: Jungwook Lee, Sean Walsh, Ali Harakeh, Steven L. Waslander

    Abstract: Training 3D object detectors for autonomous driving has been limited to small datasets due to the effort required to generate annotations. Reducing both task complexity and the amount of task switching done by annotators is key to reducing the effort and time required to generate 3D bounding box annotations. This paper introduces a novel ground truth generation method that combines human supervisi…

    Submitted 16 July, 2018; originally announced July 2018.

  10. arXiv:1807.06056  [pdf, other]

    cs.CV

    Unlimited Road-scene Synthetic Annotation (URSA) Dataset

    Authors: Matt Angus, Mohamed ElBalkini, Samin Khan, Ali Harakeh, Oles Andrienko, Cody Reading, Steven Waslander, Krzysztof Czarnecki

    Abstract: In training deep neural networks for semantic segmentation, the main limiting factor is the low amount of ground truth annotation data that is available in currently existing datasets. The limited availability of such data is due to the time cost and human effort required to accurately and consistently label real images on a pixel level. Modern sandbox video game engines provide open world environ…

    Submitted 16 July, 2018; originally announced July 2018.

    Comments: Accepted in The 21st IEEE International Conference on Intelligent Transportation Systems

  11. arXiv:1806.07987  [pdf, other]

    cs.CV

    A Hierarchical Deep Architecture and Mini-Batch Selection Method For Joint Traffic Sign and Light Detection

    Authors: Alex D. Pon, Oles Andrienko, Ali Harakeh, Steven L. Waslander

    Abstract: Traffic light and sign detectors on autonomous cars are integral for road scene perception. The literature is abundant with deep learning networks that detect either lights or signs, not both, which makes them unsuitable for real-life deployment due to the limited graphics processing unit (GPU) memory and power available on embedded systems. The root cause of this issue is that no public dataset c…

    Submitted 13 September, 2018; v1 submitted 20 June, 2018; originally announced June 2018.

    Comments: Accepted in the IEEE 15th Conference on Computer and Robot Vision

  12. arXiv:1802.00036  [pdf, other]

    cs.CV

    In Defense of Classical Image Processing: Fast Depth Completion on the CPU

    Authors: Jason Ku, Ali Harakeh, Steven L. Waslander

    Abstract: With the rise of data-driven deep neural networks as a realization of universal function approximators, most research on computer vision problems has moved away from hand-crafted classical image processing algorithms. This paper shows that with a well-designed algorithm, we are capable of outperforming neural network-based methods on the task of depth completion. The proposed algorithm is simple a…

    Submitted 31 January, 2018; originally announced February 2018.

  13. arXiv:1712.02294  [pdf, other]

    cs.CV

    Joint 3D Proposal Generation and Object Detection from View Aggregation

    Authors: Jason Ku, Melissa Mozifian, Jungwook Lee, Ali Harakeh, Steven Waslander

    Abstract: We present AVOD, an Aggregate View Object Detection network for autonomous driving scenarios. The proposed neural network architecture uses LIDAR point clouds and RGB images to generate features that are shared by two subnetworks: a region proposal network (RPN) and a second stage detector network. The proposed RPN uses a novel architecture capable of performing multimodal feature fusion on high r…

    Submitted 12 July, 2018; v1 submitted 6 December, 2017; originally announced December 2017.

    Comments: For any inquiries contact aharakeh(at)uwaterloo(dot)ca