Skip to main content

Showing 1–5 of 5 results for author: Mousa, A

Searching in archive cs. Search in all archives.
.
  1. Towards a World-English Language Model for On-Device Virtual Assistants

    Authors: Rricha Jalota, Lyan Verwimp, Markus Nussbaum-Thom, Amr Mousa, Arturo Argueta, Youssef Oualil

    Abstract: Neural Network Language Models (NNLMs) for Virtual Assistants (VAs) are generally language-, region-, and in some cases, device-dependent, which increases the effort to scale and maintain them. Combining NNLMs for one or more of the categories is one way to improve scalability. In this work, we combine regional variants of English to build a ``World English'' NNLM for on-device VAs. In particular,… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted in ICASSP 2024

  2. arXiv:1707.00800  [pdf

    cs.CV cs.IR

    Arabic Character Segmentation Using Projection Based Approach with Profile's Amplitude Filter

    Authors: Mahmoud A. A. Mousa, Mohammed S. Sayed, Mahmoud I. Abdalla

    Abstract: Arabic is one of the languages that present special challenges to Optical character recognition (OCR). The main challenge in Arabic is that it is mostly cursive. Therefore, a segmentation process must be carried out to determine where the character begins and where it ends. This step is essential for character recognition. This paper presents Arabic character segmentation algorithm. The proposed a… ▽ More

    Submitted 3 July, 2017; originally announced July 2017.

  3. arXiv:1706.09886  [pdf, ps, other

    cs.LO eess.SY

    Optimal Control for Multi-Mode Systems with Discrete Costs

    Authors: Mahmoud A. A. Mousa, Sven Schewe, Dominik Wojtczak

    Abstract: This paper studies optimal time-bounded control in multi-mode systems with discrete costs. Multi-mode systems are an important subclass of linear hybrid systems, in which there are no guards on transitions and all invariants are global. Each state has a continuous cost attached to it, which is linear in the sojourn time, while a discrete cost is attached to each transition taken. We show that an o… ▽ More

    Submitted 29 June, 2017; originally announced June 2017.

    Comments: extended version of a FORMATS 2017 paper

  4. arXiv:1705.10874  [pdf, ps, other

    cs.SD cs.CL cs.LG

    Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments

    Authors: Zixing Zhang, Jürgen Geiger, Jouni Pohjalainen, Amr El-Desoky Mousa, Wenyu **, Björn Schuller

    Abstract: Eliminating the negative effect of non-stationary environmental noise is a long-standing research topic for automatic speech recognition that stills remains an important challenge. Data-driven supervised approaches, including ones based on deep neural networks, have recently emerged as potential alternatives to traditional unsupervised approaches and with sufficient training, can alleviate the sho… ▽ More

    Submitted 21 September, 2018; v1 submitted 30 May, 2017; originally announced May 2017.

  5. arXiv:1510.00268  [pdf, ps, other

    cs.SD

    The ICSTM+TUM+UP Approach to the 3rd CHIME Challenge: Single-Channel LSTM Speech Enhancement with Multi-Channel Correlation Sha** Dereverberation and LSTM Language Models

    Authors: Amr El-Desoky Mousa, Erik Marchi, Björn Schuller

    Abstract: This paper presents our contribution to the 3rd CHiME Speech Separation and Recognition Challenge. Our system uses Bidirectional Long Short-Term Memory (BLSTM) Recurrent Neural Networks (RNNs) for Single-channel Speech Enhancement (SSE). Networks are trained to predict clean speech as well as noise features from noisy speech features. In addition, the system applies two methods of dereverberation… ▽ More

    Submitted 1 October, 2015; originally announced October 2015.