Skip to main content

Showing 1–4 of 4 results for author: Hussain, K F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.03767  [pdf, other

    cs.CV cs.AI cs.CL eess.IV

    Enhancing image captioning with depth information using a Transformer-based framework

    Authors: Aya Mahmoud Ahmed, Mohamed Yousef, Khaled F. Hussain, Yousef Bassyouni Mahdy

    Abstract: Captioning images is a challenging scene-understanding task that connects computer vision and natural language processing. While image captioning models have been successful in producing excellent descriptions, the field has primarily focused on generating a single sentence for 2D images. This paper investigates whether integrating depth information with RGB images can enhance the captioning task… ▽ More

    Submitted 24 July, 2023; originally announced August 2023.

    Comments: 19 pages, 5 figures, 13 tables

  2. arXiv:1906.08864  [pdf, other

    cs.NE cs.LG stat.ML

    Accurate and Energy-Efficient Classification with Spiking Random Neural Network: Corrected and Expanded Version

    Authors: Khaled F. Hussain, Mohamed Yousef Bassyouni, Erol Gelenbe

    Abstract: Artificial Neural Network (ANN) based techniques have dominated state-of-the-art results in most problems related to computer vision, audio recognition, and natural language processing in the past few years, resulting in strong industrial adoption from all leading technology companies worldwide. One of the major obstacles that have historically delayed large scale adoption of ANNs is the huge comp… ▽ More

    Submitted 1 June, 2019; originally announced June 2019.

    ACM Class: I.2; G.3

  3. arXiv:1812.11894  [pdf, other

    cs.CV

    Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks

    Authors: Mohamed Yousef, Khaled F. Hussain, Usama S. Mohammed

    Abstract: Unconstrained text recognition is an important computer vision task, featuring a wide variety of different sub-tasks, each with its own set of challenges. One of the biggest promises of deep neural networks has been the convergence and automation of feature extractors from input raw signals, allowing for the highest possible performance with minimum required domain knowledge. To this end, we propo… ▽ More

    Submitted 31 December, 2018; originally announced December 2018.

    Comments: Submitted for publication

  4. arXiv:1711.00972  [pdf, other

    cs.CV

    The Achievement of Higher Flexibility in Multiple Choice-based Tests Using Image Classification Techniques

    Authors: Mahmoud Afifi, Khaled F. Hussain

    Abstract: In spite of the high accuracy of the existing optical mark reading (OMR) systems and devices, a few restrictions remain existent. In this work, we aim to reduce the restrictions of multiple choice questions (MCQ) within tests. We use an image registration technique to extract the answer boxes from answer sheets. Unlike other systems that rely on simple image processing steps to recognize the extra… ▽ More

    Submitted 11 January, 2019; v1 submitted 2 November, 2017; originally announced November 2017.