Skip to main content

Showing 1–10 of 10 results for author: Oualil, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.06729  [pdf, other

    cs.IR cs.AI cs.CL

    Synthetic Query Generation using Large Language Models for Virtual Assistants

    Authors: Sonal Sannigrahi, Thiago Fraga-Silva, Youssef Oualil, Christophe Van Gysel

    Abstract: Virtual Assistants (VAs) are important Information Retrieval platforms that help users accomplish various tasks through spoken commands. The speech recognition system (speech-to-text) uses query priors, trained solely on text, to distinguish between phonetically confusing alternatives. Hence, the generation of synthetic queries that are similar to existing VA usage can greatly improve upon the VA'… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: SIGIR '24. The 47th International ACM SIGIR Conference on Research & Development in Information Retrieval

  2. Towards a World-English Language Model for On-Device Virtual Assistants

    Authors: Rricha Jalota, Lyan Verwimp, Markus Nussbaum-Thom, Amr Mousa, Arturo Argueta, Youssef Oualil

    Abstract: Neural Network Language Models (NNLMs) for Virtual Assistants (VAs) are generally language-, region-, and in some cases, device-dependent, which increases the effort to scale and maintain them. Combining NNLMs for one or more of the categories is one way to improve scalability. In this work, we combine regional variants of English to build a ``World English'' NNLM for on-device VAs. In particular,… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted in ICASSP 2024

  3. arXiv:2310.03424  [pdf, other

    cs.LG cs.CL

    Neural Language Model Pruning for Automatic Speech Recognition

    Authors: Leonardo Emili, Thiago Fraga-Silva, Ernest Pusateri, Markus Nußbaum-Thom, Youssef Oualil

    Abstract: We study model pruning methods applied to Transformer-based neural network language models for automatic speech recognition. We explore three aspects of the pruning frame work, namely criterion, method and scheduler, analyzing their contribution in terms of accuracy and inference speed. To the best of our knowledge, such in-depth analyses on large-scale recognition systems has not been reported in… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: 8 pages, 3 figures

  4. arXiv:2305.09764  [pdf, other

    cs.CL cs.SD eess.AS

    Application-Agnostic Language Modeling for On-Device ASR

    Authors: Markus Nußbaum-Thom, Lyan Verwimp, Youssef Oualil

    Abstract: On-device automatic speech recognition systems face several challenges compared to server-based systems. They have to meet stricter constraints in terms of speed, disk size and memory while maintaining the same accuracy. Often they have to serve several applications with different distributions at once, such as communicating with a virtual assistant and speech-to-text. The simplest solution to ser… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: accepted for ACL 2023 industry track

  5. arXiv:2206.14885  [pdf, other

    cs.CL cs.AI

    Space-Efficient Representation of Entity-centric Query Language Models

    Authors: Christophe Van Gysel, Mirko Hannemann, Ernest Pusateri, Youssef Oualil, Ilya Oparin

    Abstract: Virtual assistants make use of automatic speech recognition (ASR) to help users answer entity-centric queries. However, spoken entity recognition is a difficult problem, due to the large number of frequently-changing named entities. In addition, resources available for recognition are constrained when ASR is performed on-device. In this work, we investigate the use of probabilistic grammars as l… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: Interspeech '22

  6. arXiv:1908.09738  [pdf, ps, other

    eess.AS cs.CL cs.LG cs.SD

    Connecting and Comparing Language Model Interpolation Techniques

    Authors: Ernest Pusateri, Christophe Van Gysel, Rami Botros, Sameer Badaskar, Mirko Hannemann, Youssef Oualil, Ilya Oparin

    Abstract: In this work, we uncover a theoretical connection between two language model interpolation techniques, count merging and Bayesian interpolation. We compare these techniques as well as linear interpolation in three scenarios with abundant training data per component model. Consistent with prior work, we show that both count merging and Bayesian interpolation outperform linear interpolation. We incl… ▽ More

    Submitted 26 August, 2019; originally announced August 2019.

  7. A Neural Network Approach for Mixing Language Models

    Authors: Youssef Oualil, Dietrich Klakow

    Abstract: The performance of Neural Network (NN)-based language models is steadily improving due to the emergence of new architectures, which are able to learn different natural language characteristics. This paper presents a novel framework, which shows that a significant improvement can be achieved by combining different existing heterogeneous models in a single architecture. This is done through 1) a fea… ▽ More

    Submitted 23 August, 2017; originally announced August 2017.

    Comments: Published at IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2017. arXiv admin note: text overlap with arXiv:1703.08068

    MSC Class: 97K50 ACM Class: I.2.7

    Journal ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, 2017, pp. 5710-5714

  8. arXiv:1708.06555  [pdf, other

    cs.CL cs.LG

    Long-Short Range Context Neural Networks for Language Modeling

    Authors: Youssef Oualil, Mittul Singh, Clayton Greenberg, Dietrich Klakow

    Abstract: The goal of language modeling techniques is to capture the statistical and structural properties of natural languages from training corpora. This task typically involves the learning of short range dependencies, which generally model the syntactic properties of a language and/or long range dependencies, which are semantic in nature. We propose in this paper a new multi-span architecture, which sep… ▽ More

    Submitted 22 August, 2017; originally announced August 2017.

    Comments: Published at EMNLP'16

    MSC Class: 97K50 ACM Class: I.2.7

  9. arXiv:1708.05997  [pdf, ps, other

    cs.CL cs.AI

    A Batch Noise Contrastive Estimation Approach for Training Large Vocabulary Language Models

    Authors: Youssef Oualil, Dietrich Klakow

    Abstract: Training large vocabulary Neural Network Language Models (NNLMs) is a difficult task due to the explicit requirement of the output layer normalization, which typically involves the evaluation of the full softmax function over the complete vocabulary. This paper proposes a Batch Noise Contrastive Estimation (B-NCE) approach to alleviate this problem. This is achieved by reducing the vocabulary, at… ▽ More

    Submitted 22 August, 2017; v1 submitted 20 August, 2017; originally announced August 2017.

    Comments: Accepted for publication at INTERSPEECH'17

    MSC Class: 97K50 ACM Class: I.2.7

  10. arXiv:1703.08068  [pdf, other

    cs.CL

    Sequential Recurrent Neural Networks for Language Modeling

    Authors: Youssef Oualil, Clayton Greenberg, Mittul Singh, Dietrich Klakow

    Abstract: Feedforward Neural Network (FNN)-based language models estimate the probability of the next word based on the history of the last N words, whereas Recurrent Neural Networks (RNN) perform the same task based only on the last word and some context information that cycles in the network. This paper presents a novel approach, which bridges the gap between these two categories of networks. In particula… ▽ More

    Submitted 23 March, 2017; originally announced March 2017.

    Comments: published (INTERSPEECH 2016), 5 pages, 3 figures, 4 tables