
Showing 1–15 of 15 results for author: Eraqi, H

  1. arXiv:2312.10854  [pdf, other]

    cs.CV cs.LG

    The Right Losses for the Right Gains: Improving the Semantic Consistency of Deep Text-to-Image Generation with Distribution-Sensitive Losses

    Authors: Mahmoud Ahmed, Omer Moussa, Ismail Shaheen, Mohamed Abdelfattah, Amr Abdalla, Marwan Eid, Hesham Eraqi, Mohamed Moustafa

    Abstract: One of the major challenges in training deep neural networks for text-to-image generation is the significant linguistic discrepancy between ground-truth captions of each image in most popular datasets. The large difference in the choice of words in such captions results in synthesizing images that are semantically dissimilar to each other and to their ground-truth counterparts. Moreover, existing…

    Submitted 17 December, 2023; originally announced December 2023.

  2. arXiv:2311.09178  [pdf, other]

    cs.CV

    RBPGAN: Recurrent Back-Projection GAN for Video Super Resolution

    Authors: Marwah Sulaiman, Zahraa Shehabeldin, Israa Fahmy, Mohammed Barakat, Mohammed El-Naggar, Dareen Hussein, Moustafa Youssef, Hesham M. Eraqi

    Abstract: Recently, video super resolution (VSR) has become a very impactful task in the area of Computer Vision due to its various applications. In this paper, we propose Recurrent Back-Projection Generative Adversarial Network (RBPGAN) for VSR in an attempt to generate temporally coherent solutions while preserving spatial details. RBPGAN integrates two state-of-the-art models to get the best in both worl…

    Submitted 10 December, 2023; v1 submitted 15 November, 2023; originally announced November 2023.

  3. Dynamic Conditional Imitation Learning for Autonomous Driving

    Authors: Hesham M. Eraqi, Mohamed N. Moustafa, Jens Honer

    Abstract: Conditional imitation learning (CIL) trains deep neural networks, in an end-to-end manner, to mimic human driving. This approach has demonstrated suitable vehicle control when following roads, avoiding obstacles, or taking specific turns at intersections to reach a destination. Unfortunately, performance dramatically decreases when deployed to unseen environments and is inconsistent against varyin…

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 14 pages, 11 figures, 7 tables

    Journal ref: IEEE Transactions on Intelligent Transportation Systems

  4. arXiv:2207.05692  [pdf, other]

    cs.MM cs.CL cs.SD eess.AS

    Lip-Listening: Mixing Senses to Understand Lips using Cross Modality Knowledge Distillation for Word-Based Models

    Authors: Hadeel Mabrouk, Omar Abugabal, Nourhan Sakr, Hesham M. Eraqi

    Abstract: In this work, we propose a technique to transfer speech recognition capabilities from audio speech recognition systems to visual speech recognizers, where our goal is to utilize audio data during lipreading model training. Impressive progress in the domain of speech recognition has been exhibited by audio and audio-visual systems. Nevertheless, there is still much to be explored with regards to vi…

    Submitted 5 June, 2022; originally announced July 2022.

    Comments: arXiv admin note: text overlap with arXiv:2108.03543

  5. arXiv:2203.15522  [pdf, other]

    cs.RO cs.AI

    Collision-Free Navigation using Evolutionary Symmetrical Neural Networks

    Authors: Hesham M. Eraqi, Mena Nagiub, Peter Sidra

    Abstract: Collision avoidance systems play a vital role in reducing the number of vehicle accidents and saving human lives. This paper extends the previous work using evolutionary neural networks for reactive collision avoidance. We are proposing a new method we have called symmetric neural networks. The method improves the model's performance by enforcing constraints between the network weights which reduc…

    Submitted 11 April, 2022; v1 submitted 29 March, 2022; originally announced March 2022.

    Comments: arXiv admin note: text overlap with arXiv:1609.08414

    Journal ref: 2022 IEEE International Conference on Evolving and Adaptive Intelligent Systems (EAIS 2022), Larnaca, Cyprus

  6. arXiv:2111.01136  [pdf]

    cs.CL cs.AI cs.LG cs.NI

    ASMDD: Arabic Speech Mispronunciation Detection Dataset

    Authors: Salah A. Aly, Abdelrahman Salah, Hesham M. Eraqi

    Abstract: The largest dataset of Arabic speech mispronunciation detections in Egyptian dialogues is introduced. The dataset is composed of annotated audio files representing the top 100 words that are most frequently used in the Arabic language, pronounced by 100 Egyptian children (aged between 2 and 8 years old). The dataset is collected and annotated on segmental pronunciation error detections by expert l…

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: 3 pages, 2 tables, 2 figures, dataset link: https://drive.google.com/drive/folders/1dhlp-L0n6_RAzoosVK4bRa7hxBnzebqs

  7. arXiv:2108.07661  [pdf, other]

    cs.CV

    An Evaluation of RGB and LiDAR Fusion for Semantic Segmentation

    Authors: Amr S. Mohamed, Ali Abdelkader, Mohamed Anany, Omar El-Behady, Muhammad Faisal, Asser Hangal, Hesham M. Eraqi, Mohamed N. Moustafa

    Abstract: LiDARs and cameras are the two main sensors that are planned to be included in many announced autonomous vehicles prototypes. Each of the two provides a unique form of data from a different perspective to the surrounding environment. In this paper, we explore and attempt to answer the question: is there an added benefit by fusing those two forms of data for the purpose of semantic segmentation wit…

    Submitted 17 August, 2021; originally announced August 2021.

  8. arXiv:2108.03543  [pdf, other]

    cs.CV cs.AI

    Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip Reading

    Authors: Shahd Elashmawy, Marian Ramsis, Hesham M. Eraqi, Farah Eldeshnawy, Hadeel Mabrouk, Omar Abugabal, Nourhan Sakr

    Abstract: Despite the advancement in the domain of audio and audio-visual speech recognition, visual speech recognition systems are still quite under-explored due to the visual ambiguity of some phonemes. In this work, we propose a new lip-reading model that combines three contributions. First, the model front-end adopts a spatio-temporal attention mechanism to help extract the informative data from the inp…

    Submitted 7 August, 2021; originally announced August 2021.

  9. arXiv:2108.02148  [pdf, other]

    cs.SD cs.CV cs.HC cs.LG eess.AS

    Pervasive Hand Gesture Recognition for Smartphones using Non-audible Sound and Deep Learning

    Authors: Ahmed Ibrahim, Ayman El-Refai, Sara Ahmed, Mariam Aboul-Ela, Hesham M. Eraqi, Mohamed Moustafa

    Abstract: Due to the mass advancement in ubiquitous technologies nowadays, new pervasive methods have come into the practice to provide new innovative features and stimulate the research on new human-computer interactions. This paper presents a hand gesture recognition method that utilizes the smartphone's built-in speakers and microphones. The proposed system emits an ultrasonic sonar-based signal (inaudib…

    Submitted 4 August, 2021; originally announced August 2021.

  10. arXiv:1907.07748  [pdf, other]

    eess.IV cs.CV

    End-to-end sensor modeling for LiDAR Point Cloud

    Authors: Khaled Elmadawi, Moemen Abdelrazek, Mohamed Elsobky, Hesham M. Eraqi, Mohamed Zahran

    Abstract: Advanced sensors are a key to enable self-driving cars technology. Laser scanner sensors (LiDAR, Light Detection And Ranging) became a fundamental choice due to its long-range and robustness to low light driving conditions. The problem of designing a control software for self-driving cars is a complex task to explicitly formulate in rule-based systems, thus recent approaches rely on machine learni…

    Submitted 17 July, 2019; originally announced July 2019.

    Comments: Accepted in IEEE Intelligent Transportation Systems Conference - ITSC 2019

  11. arXiv:1901.09097  [pdf, other]

    cs.CV cs.LG stat.ML

    Driver Distraction Identification with an Ensemble of Convolutional Neural Networks

    Authors: Hesham M. Eraqi, Yehya Abouelnaga, Mohamed H. Saad, Mohamed N. Moustafa

    Abstract: The World Health Organization (WHO) reported 1.25 million deaths yearly due to road traffic accidents worldwide and the number has been continuously increasing over the last few years. Nearly fifth of these accidents are caused by distracted drivers. Existing work of distracted driver detection is concerned with a small set of distractions (mostly, cell phone usage). Unreliable ad-hoc methods are…

    Submitted 22 January, 2019; originally announced January 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1706.09498

    Journal ref: Journal of Advanced Transportation, Machine Learning in Transportation (MLT) Issue, 2019

  12. arXiv:1801.00600  [pdf]

    cs.RO

    Static Free Space Detection with Laser Scanner using Occupancy Grid Maps

    Authors: Hesham M. Eraqi, Jens Honer, Sebastian Zuther

    Abstract: Drivable free space information is vital for autonomous vehicles that have to plan evasive maneuvers in real-time. In this paper, we present a new efficient method for environmental free space detection with laser scanner based on 2D occupancy grid maps (OGM) to be used for Advanced Driving Assistance Systems (ADAS) and Collision Avoidance Systems (CAS). Firstly, we introduce an enhanced inverse s…

    Submitted 30 June, 2020; v1 submitted 2 January, 2018; originally announced January 2018.

    Journal ref: IEEE 20th International Conference on Intelligent Transportation Systems (ITSC), Yokohama, Japan, October 2017

  13. arXiv:1710.03804  [pdf, other]

    cs.LG

    End-to-End Deep Learning for Steering Autonomous Vehicles Considering Temporal Dependencies

    Authors: Hesham M. Eraqi, Mohamed N. Moustafa, Jens Honer

    Abstract: Steering a car through traffic is a complex task that is difficult to cast into algorithms. Therefore, researchers turn to training artificial neural networks from front-facing camera data stream along with the associated steering angles. Nevertheless, most existing solutions consider only the visual camera frames as input, thus ignoring the temporal relationship between frames. In this work, we p…

    Submitted 22 November, 2017; v1 submitted 10 October, 2017; originally announced October 2017.

    Comments: 31st Conference on Neural Information Processing Systems (NIPS), Machine Learning for Intelligent Transportation Systems Workshop, Long Beach, CA, USA, 2017

  14. arXiv:1706.09498  [pdf, other]

    cs.CV

    Real-time Distracted Driver Posture Classification

    Authors: Yehya Abouelnaga, Hesham M. Eraqi, Mohamed N. Moustafa

    Abstract: In this paper, we present a new dataset for "distracted driver" posture estimation. In addition, we propose a novel system that achieves 95.98% driving posture estimation classification accuracy. The system consists of a genetically-weighted ensemble of Convolutional Neural Networks (CNNs). We show that a weighted ensemble of classifiers using a genetic algorithm yields in better classification co…

    Submitted 29 November, 2018; v1 submitted 28 June, 2017; originally announced June 2017.

    Journal ref: 32nd Conference on Neural Information Processing Systems (NIPS 2018), Workshop on Machine Learning for Intelligent Transportation Systems

  15. arXiv:1609.08414  [pdf]

    cs.NE

    Reactive Collision Avoidance using Evolutionary Neural Networks

    Authors: Hesham Eraqi, Youssef EmadEldin, Mohamed Moustafa

    Abstract: Collision avoidance systems can play a vital role in reducing the number of accidents and saving human lives. In this paper, we introduce and validate a novel method for vehicles reactive collision avoidance using evolutionary neural networks (ENN). A single front-facing rangefinder sensor is the only input required by our method. The training process and the proposed method analysis and validatio…

    Submitted 27 September, 2016; originally announced September 2016.

    Comments: ECTA 2016. Final paper is at SCITEPRESS digital library