Skip to main content

Showing 1–22 of 22 results for author: Hussein, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.16904  [pdf

    eess.SP

    Intelligent energy management of steam generators

    Authors: Ahmed S. Hussein, Noha H. El-Amary, Loai Saad El-din Nasrat, Ali Selim

    Abstract: This paper introduces a smart model for intelligent energy management of steam generators which are utilized for steam generator and controlling the air to fuel ratio for steam generator all over the firing curve and transient mode operation. Nowadays, the environment faces a lot of pollution and global warming phenomena. With the spread of electrical devices, electric cars with conventional elect… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2309.15686  [pdf, other

    cs.CL cs.SD eess.AS

    Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization

    Authors: Amir Hussein, Brian Yan, Antonios Anastasopoulos, Shinji Watanabe, Sanjeev Khudanpur

    Abstract: Incorporating longer context has been shown to benefit machine translation, but the inclusion of context in end-to-end speech translation (E2E-ST) remains under-studied. To bridge this gap, we introduce target language context in E2E-ST, enhancing coherence and overcoming memory constraints of extended audio segments. Additionally, we propose context dropout to ensure robustness to the absence of… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  3. arXiv:2309.15674  [pdf, other

    cs.SD cs.CL cs.LG eess.AS

    Speech collage: code-switched audio generation by collaging monolingual corpora

    Authors: Amir Hussein, Dorsa Zeinali, Ondřej Klejch, Matthew Wiesner, Brian Yan, Shammur Chowdhury, Ahmed Ali, Shinji Watanabe, Sanjeev Khudanpur

    Abstract: Designing effective automatic speech recognition (ASR) systems for Code-Switching (CS) often depends on the availability of the transcribed CS resources. To address data scarcity, this paper introduces Speech Collage, a method that synthesizes CS data from monolingual corpora by splicing audio segments. We further improve the smoothness quality of audio generation using an overlap-add approach. We… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  4. arXiv:2211.16319  [pdf, other

    eess.AS cs.CL cs.SD

    Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition

    Authors: Injy Hamed, Amir Hussein, Oumnia Chellah, Shammur Chowdhury, Hamdy Mubarak, Sunayana Sitaram, Nizar Habash, Ahmed Ali

    Abstract: Code-switching poses a number of challenges and opportunities for multilingual automatic speech recognition. In this paper, we focus on the question of robust and fair evaluation metrics. To that end, we develop a reference benchmark data set of code-switching speech recognition hypotheses with human judgments. We define clear guidelines for minimal editing of automatic hypotheses. We validate the… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: Accepted to SLT 2022

  5. arXiv:2208.07414  [pdf, other

    eess.SP

    Progress towards machine learning methodologies for laser-induced breakdown spectroscopy with an emphasis on soil analysis

    Authors: Yingchao Huang, Sivanandan S. Harilal, Abdul Bais, Amina E. Hussein

    Abstract: Optical emission spectroscopy of laser-produced plasmas, commonly known as laser-induced breakdown spectroscopy (LIBS), is an emerging analytical tool for rapid soil analysis. However, specific challenges with LIBS exist, such as matrix effects and quantification issues, that require further study in the application of LIBS, particularly for analysis of heterogeneous samples such as soils. Advance… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

  6. arXiv:2206.09790  [pdf, other

    cs.CL cs.SD eess.AS

    The Makerere Radio Speech Corpus: A Luganda Radio Corpus for Automatic Speech Recognition

    Authors: Jonathan Mukiibi, Andrew Katumba, Joyce Nakatumba-Nabende, Ali Hussein, Josh Meyer

    Abstract: Building a usable radio monitoring automatic speech recognition (ASR) system is a challenging task for under-resourced languages and yet this is paramount in societies where radio is the main medium of public communication and discussions. Initial efforts by the United Nations in Uganda have proved how understanding the perceptions of rural people who are excluded from social media is important in… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

    Comments: Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pages 1945 to 1954 Marseille, 20 to 25 June 2022

  7. arXiv:2201.02550  [pdf, other

    cs.CL cs.SD eess.AS

    Textual Data Augmentation for Arabic-English Code-Switching Speech Recognition

    Authors: Amir Hussein, Shammur Absar Chowdhury, Ahmed Abdelali, Najim Dehak, Ahmed Ali, Sanjeev Khudanpur

    Abstract: The pervasiveness of intra-utterance code-switching (CS) in spoken content requires that speech recognition (ASR) systems handle mixed language. Designing a CS-ASR system has many challenges, mainly due to data scarcity, grammatical structure complexity, and domain mismatch. The most common method for addressing CS is to train an ASR system with the available transcribed CS speech, along with mono… ▽ More

    Submitted 11 January, 2023; v1 submitted 7 January, 2022; originally announced January 2022.

  8. arXiv:2107.01573  [pdf, other

    cs.CL cs.SD eess.AS

    Arabic Code-Switching Speech Recognition using Monolingual Data

    Authors: Ahmed Ali, Shammur Chowdhury, Amir Hussein, Yasser Hifny

    Abstract: Code-switching in automatic speech recognition (ASR) is an important challenge due to globalization. Recent research in multilingual ASR shows potential improvement over monolingual systems. We study key issues related to multilingual modeling for ASR through a series of large-scale ASR experiments. Our innovative framework deploys a multi-graph approach in the weighted finite state transducers (W… ▽ More

    Submitted 4 July, 2021; originally announced July 2021.

    Comments: Accepted in Interspeech 2021, speech recognition, code-switching, ASR, transformer, WFST, graph approach

  9. arXiv:2106.13000  [pdf, other

    cs.CL cs.SD eess.AS

    QASR: QCRI Aljazeera Speech Resource -- A Large Scale Annotated Arabic Speech Corpus

    Authors: Hamdy Mubarak, Amir Hussein, Shammur Absar Chowdhury, Ahmed Ali

    Abstract: We introduce the largest transcribed Arabic speech corpus, QASR, collected from the broadcast domain. This multi-dialect speech dataset contains 2,000 hours of speech sampled at 16kHz crawled from Aljazeera news channel. The dataset is released with lightly supervised transcriptions, aligned with the audio segments. Unlike previous datasets, QASR contains linguistically motivated segmentation, pun… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: Speech Corpus, Spoken Conversation, ASR, Dialect Identification, Punctuation Restoration, Speaker Verification, NER, Named Entity, Arabic, Speaker gender, Turn-taking Accepted in ACL 2021

  10. arXiv:2106.05885  [pdf, other

    cs.CL cs.SD eess.AS

    Balanced End-to-End Monolingual pre-training for Low-Resourced Indic Languages Code-Switching Speech Recognition

    Authors: Amir Hussein, Shammur Chowdhury, Najim Dehak, Ahmed Ali

    Abstract: The success in designing Code-Switching (CS) ASR often depends on the availability of the transcribed CS resources. Such dependency harms the development of ASR in low-resourced languages such as Bengali and Hindi. In this paper, we exploit the transfer learning approach to design End-to-End (E2E) CS ASR systems for the two low-resourced language pairs using different monolingual speech data and a… ▽ More

    Submitted 15 February, 2022; v1 submitted 10 June, 2021; originally announced June 2021.

  11. arXiv:2105.14779  [pdf, other

    cs.CL cs.HC cs.SD eess.AS

    Towards One Model to Rule All: Multilingual Strategy for Dialectal Code-Switching Arabic ASR

    Authors: Shammur Absar Chowdhury, Amir Hussein, Ahmed Abdelali, Ahmed Ali

    Abstract: With the advent of globalization, there is an increasing demand for multilingual automatic speech recognition (ASR), handling language and dialectal variation of spoken content. Recent studies show its efficacy over monolingual systems. In this study, we design a large multilingual end-to-end ASR using self-attention based conformer architecture. We trained the system using Arabic (Ar), English (E… ▽ More

    Submitted 5 July, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: Accepted in INTERSPEECH 2021, Multilingual ASR, Multi-dialectal ASR, Code-Switching ASR, Arabic ASR, Conformer, Transformer, E2E ASR, Speech Recognition, ASR, Arabic, English, French

  12. arXiv:2101.08454  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Arabic Speech Recognition by End-to-End, Modular Systems and Human

    Authors: Amir Hussein, Shinji Watanabe, Ahmed Ali

    Abstract: Recent advances in automatic speech recognition (ASR) have achieved accuracy levels comparable to human transcribers, which led researchers to debate if the machine has reached human performance. Previous work focused on the English language and modular hidden Markov model-deep neural network (HMM-DNN) systems. In this paper, we perform a comprehensive benchmarking for end-to-end transformer ASR,… ▽ More

    Submitted 29 June, 2021; v1 submitted 21 January, 2021; originally announced January 2021.

  13. Vehicle Platooning Impact on Drag Coefficients and Energy/Fuel Saving Implications

    Authors: Ahmed A. Hussein, Hesham A. Rakha

    Abstract: In this paper, empirical data from the literature are used to develop general power models that capture the impact of a vehicle position, in a platoon of homogeneous vehicles, and the distance gap to its lead (and following) vehicle on its drag coefficient. These models are developed for light duty vehicles, buses, and heavy duty trucks. The models were fit using a constrained optimization framewo… ▽ More

    Submitted 2 March, 2020; v1 submitted 2 January, 2020; originally announced January 2020.

    Comments: In review in the Journal of Applied Energy. IEEE Transactions on Vehicular Technology, 2021

  14. arXiv:1912.00157  [pdf, other

    cs.CV cs.LG eess.IV

    Correction Filter for Single Image Super-Resolution: Robustifying Off-the-Shelf Deep Super-Resolvers

    Authors: Shady Abu Hussein, Tom Tirer, Raja Giryes

    Abstract: The single image super-resolution task is one of the most examined inverse problems in the past decade. In the recent years, Deep Neural Networks (DNNs) have shown superior performance over alternative methods when the acquisition process uses a fixed known downsampling kernel-typically a bicubic kernel. However, several recent works have shown that in practical scenarios, where the test data mism… ▽ More

    Submitted 24 May, 2020; v1 submitted 30 November, 2019; originally announced December 2019.

    Comments: Accepted to CVPR 2020 (Oral). Code is available at https://github.com/shadyabh/Correction-Filter

  15. arXiv:1907.09455  [pdf

    eess.SP cs.LG eess.SY stat.ML

    Latent Function Decomposition for Forecasting Li-ion Battery Cells Capacity: A Multi-Output Convolved Gaussian Process Approach

    Authors: Abdallah A. Chehade, Ala A. Hussein

    Abstract: A latent function decomposition method is proposed for forecasting the capacity of lithium-ion battery cells. The method uses the Multi-Output Gaussian Process, a generative machine learning framework for multi-task and transfer learning. The MCGP decomposes the available capacity trends from multiple battery cells into latent functions. The latent functions are then convolved over kernel smoother… ▽ More

    Submitted 19 July, 2019; originally announced July 2019.

  16. arXiv:1906.05284  [pdf, other

    eess.IV cs.CV cs.LG

    Image-Adaptive GAN based Reconstruction

    Authors: Shady Abu Hussein, Tom Tirer, Raja Giryes

    Abstract: In the recent years, there has been a significant improvement in the quality of samples produced by (deep) generative models such as variational auto-encoders and generative adversarial networks. However, the representation capabilities of these methods still do not capture the full distribution for complex classes of images, such as human faces. This deficiency has been clearly observed in previo… ▽ More

    Submitted 25 November, 2019; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: Accepted to AAAI 2020. Code available at https://github.com/shadyabh/IAGAN

  17. arXiv:1904.12687  [pdf

    eess.SP

    Artificial Neural Network for LiDAL Systems

    Authors: Aubida A. Al-Hameed, Safwan Hafeedh Younus, Ahmed Taha Hussein, Mohammed T. Alresheedi, Jaafar M. H. Elmirghani

    Abstract: In this paper, we introduce an intelligent light detection and localization (LiDAL) system that uses artificial neural networks (ANN). The LiDAL systems of interest are MIMO LiDAL and MISO IMG LiDAL systems. A trained ANN with the LiDAL system of interest is used to distinguish a human (target) from the background obstacles (furniture) in a realistic indoor environment. In the LiDAL systems, the r… ▽ More

    Submitted 26 April, 2019; originally announced April 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1903.09896

  18. arXiv:1903.09896  [pdf

    eess.SP

    LiDAL: Light Detection and Localization

    Authors: Aubida A. Al-Hameed, Safwan Hafeedh Younus, Ahmed Taha Hussein, Mohammed T. Alresheedi, Jaafar M. H. Elmirghani

    Abstract: In this paper, we present the first indoor light-based detection and localization system that builds on concepts from radio detection and ranging (radar) making use of the expected growth in the use and adoption of visible light communication (VLC), which can provide the infrastructure for our LiDAL system. Our system enables active detection, counting and localization of people, in addition to be… ▽ More

    Submitted 26 April, 2019; v1 submitted 23 March, 2019; originally announced March 2019.

  19. arXiv:1812.11544  [pdf

    eess.SP

    Optical Wireless Communication Systems, A Survey

    Authors: Osama Alsulami, Ahmed Taha Hussein, Mohammed T. Alresheedi, Jaafar M. H. Elmirghani

    Abstract: In the past few years, the demand for high data rate services has increased dramatically. The congestion in the radio frequency (RF) spectrum (3 kHz ~ 300 GHz) is expected to limit the growth of future wireless systems unless new parts of the spectrum are opened. Even with the use of advanced engineering, such as signal processing and advanced modulation schemes, it will be very challenging to mee… ▽ More

    Submitted 30 December, 2018; originally announced December 2018.

  20. arXiv:1812.06938  [pdf

    eess.SP cs.NI physics.optics

    VLC Systems with CGHs

    Authors: Safwan Hafeedh Younus, Ahmed Taha Hussein, Mohammed T. Alresheedi, Jaafar M. H. Elmirghani

    Abstract: The achievable data rate in indoor wireless systems that employ visible light communication (VLC) can be limited by multipath propagation. Here, we use computer generated holograms (CGHs) in VLC system design to improve the achievable system data rate. The CGHs are utilized to produce a fixed broad beam from the light source, selecting the light source that offers the best performance. The CGHs di… ▽ More

    Submitted 12 November, 2018; originally announced December 2018.

  21. arXiv:1811.01341  [pdf

    eess.SP

    WDM for Multi-user Indoor VLC Systems with SCM

    Authors: Safwan Hafeedh Younus, Aubida A. Al-Hameed, Ahmed Taha Hussein, Mohammed T. Alresheedi, Jaafar M. H. Elmirghani

    Abstract: A system that employs wavelength division multiplexing (WDM) in conjunction with subcarrier multiplexing (SCM) tones is proposed to realize high data rate multi-user indoor visible light communication (VLC). The SCM tones, which are unmodulated signals, are used to identify each light unit, to find the optimum light unit for each user and to calculate the level of the co-channel interference (CCI)… ▽ More

    Submitted 4 November, 2018; originally announced November 2018.

  22. arXiv:1811.01340  [pdf

    eess.SP

    Subcarrier Multiplexing for Parallel Data Transmission in Indoor Visible Light Communication Systems

    Authors: Safwan Hafeedh Younus, Aubida A. Al-Hameed, Ahmed Taha Hussein, Mohammed T. Alresheedi, Jaafar M. H. Elmirghani

    Abstract: This paper presents an indoor visible light communication (VLC) system in conjunction with an imaging receiver with parallel data transmission (spatial multiplexing) to decrease the effects of inter-symbol interference (ISI). To distinguish between light units (transmitters) and to match the light units used to convey the data with the pixels of the imaging receiver, we propose the use of subcarri… ▽ More

    Submitted 4 November, 2018; originally announced November 2018.