Skip to main content

Showing 1–9 of 9 results for author: El-Sallab, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2107.05887  [pdf, other

    cs.CV

    ST-DETR: Spatio-Temporal Object Traces Attention Detection Transformer

    Authors: Eslam Mohamed, Ahmad El-Sallab

    Abstract: We propose ST-DETR, a Spatio-Temporal Transformer-based architecture for object detection from a sequence of temporal frames. We treat the temporal frames as sequences in both space and time and employ the full attention mechanisms to take advantage of the features correlations over both dimensions. This treatment enables us to deal with frames sequence as temporal object features traces over ever… ▽ More

    Submitted 24 July, 2021; v1 submitted 13 July, 2021; originally announced July 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2106.11401

  2. arXiv:2106.11422  [pdf, other

    cs.CV cs.LG

    MODETR: Moving Object Detection with Transformers

    Authors: Eslam Mohamed, Ahmad El-Sallab

    Abstract: Moving Object Detection (MOD) is a crucial task for the Autonomous Driving pipeline. MOD is usually handled via 2-stream convolutional architectures that incorporates both appearance and motion cues, without considering the inter-relations between the spatial or motion features. In this paper, we tackle this problem through multi-head attention mechanisms, both across the spatial and motion stream… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

    Journal ref: Machine Learning for Autonomous Driving Workshop at the 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada

  3. arXiv:2106.11401  [pdf, other

    cs.CV cs.LG

    Spatio-Temporal Multi-Task Learning Transformer for Joint Moving Object Detection and Segmentation

    Authors: Eslam Mohamed, Ahmed El-Sallab

    Abstract: Moving objects have special importance for Autonomous Driving tasks. Detecting moving objects can be posed as Moving Object Segmentation, by segmenting the object pixels, or Moving Object Detection, by generating a bounding box for the moving targets. In this paper, we present a Multi-Task Learning architecture, based on Transformers, to jointly perform both tasks through one network. Due to the i… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

  4. arXiv:2102.06777  [pdf, other

    cs.CV cs.LG

    INSTA-YOLO: Real-Time Instance Segmentation

    Authors: Eslam Mohamed, Abdelrahman Shaker, Ahmad El-Sallab, Mayada Hadhoud

    Abstract: Instance segmentation has gained recently huge attention in various computer vision applications. It aims at providing different IDs to different objects of the scene, even if they belong to the same class. Instance segmentation is usually performed as a two-stage pipeline. First, an object is detected, then semantic segmentation within the detected box area is performed which involves costly up-s… ▽ More

    Submitted 24 July, 2021; v1 submitted 12 February, 2021; originally announced February 2021.

  5. arXiv:2012.02124  [pdf, other

    cs.CV cs.RO

    Generalized Object Detection on Fisheye Cameras for Autonomous Driving: Dataset, Representations and Baseline

    Authors: Hazem Rashed, Eslam Mohamed, Ganesh Sistu, Varun Ravi Kumar, Ciaran Eising, Ahmad El-Sallab, Senthil Yogamani

    Abstract: Object detection is a comprehensively studied problem in autonomous driving. However, it has been relatively less explored in the case of fisheye cameras. The standard bounding box fails in fisheye cameras due to the strong radial distortion, particularly in the image's periphery. We explore better representations like oriented bounding box, ellipse, and generic polygon for object detection in fis… ▽ More

    Submitted 21 December, 2022; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: Camera ready version. Accepted for presentation at Winter Conference on Applications of Computer Vision 2021. Dataset is shared at https://drive.google.com/drive/folders/1bobmY2wlIBozeU5ZgPfYPqVAnpPw4QrM

  6. arXiv:2008.07008  [pdf, other

    cs.CV cs.RO

    Monocular Instance Motion Segmentation for Autonomous Driving: KITTI InstanceMotSeg Dataset and Multi-task Baseline

    Authors: Eslam Mohamed, Mahmoud Ewaisha, Mennatullah Siam, Hazem Rashed, Senthil Yogamani, Waleed Hamdy, Muhammad Helmi, Ahmad El-Sallab

    Abstract: Moving object segmentation is a crucial task for autonomous vehicles as it can be used to segment objects in a class agnostic manner based on their motion cues. It enables the detection of unseen objects during training (e.g., moose or a construction truck) based on their motion and independent of their appearance. Although pixel-wise motion segmentation has been studied in autonomous driving lite… ▽ More

    Submitted 26 May, 2021; v1 submitted 16 August, 2020; originally announced August 2020.

    Comments: Accepted for presentation at IEEE IV 2021 (Intelligent Vehicles Symposium) conference

  7. arXiv:1901.07355  [pdf, other

    cs.CV cs.LG stat.ML

    Optical Flow augmented Semantic Segmentation networks for Automated Driving

    Authors: Hazem Rashed, Senthil Yogamani, Ahmad El-Sallab, Pavel Krizek, Mohamed El-Helw

    Abstract: Motion is a dominant cue in automated driving systems. Optical flow is typically computed to detect moving objects and to estimate depth using triangulation. In this paper, our motivation is to leverage the existing dense optical flow to improve the performance of semantic segmentation. To provide a systematic study, we construct four different architectures which use RGB only, flow only, RGBF con… ▽ More

    Submitted 11 January, 2019; originally announced January 2019.

    Comments: Accepted for Oral Presentation at VISAPP 2019

  8. arXiv:1901.01536  [pdf, other

    cs.LG cs.RO stat.ML

    Exploring applications of deep reinforcement learning for real-world autonomous driving systems

    Authors: Victor Talpaert, Ibrahim Sobh, B Ravi Kiran, Patrick Mannion, Senthil Yogamani, Ahmad El-Sallab, Patrick Perez

    Abstract: Deep Reinforcement Learning (DRL) has become increasingly powerful in recent years, with notable achievements such as Deepmind's AlphaGo. It has been successfully deployed in commercial vehicles like Mobileye's path planning system. However, a vast majority of work on DRL is focused on toy examples in controlled synthetic car simulator environments such as TORCS and CARLA. In general, DRL is still… ▽ More

    Submitted 16 January, 2019; v1 submitted 6 January, 2019; originally announced January 2019.

    Comments: Accepted for Oral Presentation at VISAPP 2019

  9. arXiv:1709.04821  [pdf, other

    cs.CV cs.RO

    MODNet: Moving Object Detection Network with Motion and Appearance for Autonomous Driving

    Authors: Mennatullah Siam, Heba Mahgoub, Mohamed Zahran, Senthil Yogamani, Martin Jagersand, Ahmad El-Sallab

    Abstract: We propose a novel multi-task learning system that combines appearance and motion cues for a better semantic reasoning of the environment. A unified architecture for joint vehicle detection and motion segmentation is introduced. In this architecture, a two-stream encoder is shared among both tasks. In order to evaluate our method in autonomous driving setting, KITTI annotated sequences with detect… ▽ More

    Submitted 12 November, 2017; v1 submitted 14 September, 2017; originally announced September 2017.