
Showing 1–10 of 10 results for author: Verwimp, L

  1. Towards a World-English Language Model for On-Device Virtual Assistants

    Authors: Rricha Jalota, Lyan Verwimp, Markus Nussbaum-Thom, Amr Mousa, Arturo Argueta, Youssef Oualil

    Abstract: Neural Network Language Models (NNLMs) for Virtual Assistants (VAs) are generally language-, region-, and in some cases, device-dependent, which increases the effort to scale and maintain them. Combining NNLMs for one or more of the categories is one way to improve scalability. In this work, we combine regional variants of English to build a "World English" NNLM for on-device VAs. In particular,…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted in ICASSP 2024

  2. arXiv:2305.09764  [pdf, other]

    cs.CL cs.SD eess.AS

    Application-Agnostic Language Modeling for On-Device ASR

    Authors: Markus Nußbaum-Thom, Lyan Verwimp, Youssef Oualil

    Abstract: On-device automatic speech recognition systems face several challenges compared to server-based systems. They have to meet stricter constraints in terms of speed, disk size and memory while maintaining the same accuracy. Often they have to serve several applications with different distributions at once, such as communicating with a virtual assistant and speech-to-text. The simplest solution to ser…

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: accepted for ACL 2023 industry track

  3. arXiv:2210.12214  [pdf, ps, other]

    cs.SD cs.CL eess.AS

    Optimizing Bilingual Neural Transducer with Synthetic Code-switching Text Generation

    Authors: Thien Nguyen, Nathalie Tran, Liuhui Deng, Thiago Fraga da Silva, Matthew Radzihovsky, Roger Hsiao, Henry Mason, Stefan Braun, Erik McDermott, Dogan Can, Pawel Swietojanski, Lyan Verwimp, Sibel Oyman, Tresi Arvizo, Honza Silovsky, Arnab Ghoshal, Mathieu Martel, Bharat Ram Ambati, Mohamed Ali

    Abstract: Code-switching describes the practice of using more than one language in the same sentence. In this study, we investigate how to optimize a neural transducer based bilingual automatic speech recognition (ASR) model for code-switching speech. Focusing on the scenario where the ASR model is trained without supervised code-switching data, we found that semi-supervised training and synthetic code-swit…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 5 pages, 1 figure, submitted to ICASSP 2023, *: equal contributions

  4. arXiv:2106.08927  [pdf]

    cs.CL cs.LG

    On the long-term learning ability of LSTM LMs

    Authors: Wim Boes, Robbe Van Rompaey, Lyan Verwimp, Joris Pelemans, Hugo Van hamme, Patrick Wambacq

    Abstract: We inspect the long-term learning ability of Long Short-Term Memory language models (LSTM LMs) by evaluating a contextual extension based on the Continuous Bag-of-Words (CBOW) model for both sentence- and discourse-level LSTM LMs and by analyzing its performance. We evaluate on text and speech. Sentence-level models using the long-term contextual module perform comparably to vanilla discourse-leve…

    Submitted 16 June, 2021; originally announced June 2021.

    Journal ref: ESANN 2020 proceedings, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (2020) 625-630

  5. arXiv:2102.07219  [pdf, ps, other]

    cs.CL

    Error-driven Pruning of Language Models for Virtual Assistants

    Authors: Sashank Gondala, Lyan Verwimp, Ernest Pusateri, Manos Tsagkias, Christophe Van Gysel

    Abstract: Language models (LMs) for virtual assistants (VAs) are typically trained on large amounts of data, resulting in prohibitively large models which require excessive memory and/or cannot be used to serve user requests in real-time. Entropy pruning results in smaller models but with significant degradation of effectiveness in the tail of the user request distribution. We customize entropy pruning by a…

    Submitted 14 February, 2021; originally announced February 2021.

    Comments: ICASSP '21. The 46th International IEEE Conference on Acoustics, Speech, and Signal Processing

  6. arXiv:1909.04130  [pdf, other]

    cs.CL

    Reverse Transfer Learning: Can Word Embeddings Trained for Different NLP Tasks Improve Neural Language Models?

    Authors: Lyan Verwimp, Jerome R. Bellegarda

    Abstract: Natural language processing (NLP) tasks tend to suffer from a paucity of suitably annotated training data, hence the recent success of transfer learning across a wide variety of them. The typical recipe involves: (i) training a deep, possibly bidirectional, neural network with an objective related to language modeling, for which training data is plentiful; and (ii) using the trained network to der…

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: Accepted for publication at Interspeech 2019

  7. arXiv:1809.08826  [pdf, ps, other]

    cs.CL cs.LG

    Information-Weighted Neural Cache Language Models for ASR

    Authors: Lyan Verwimp, Joris Pelemans, Hugo Van hamme, Patrick Wambacq

    Abstract: Neural cache language models (LMs) extend the idea of regular cache language models by making the cache probability dependent on the similarity between the current context and the context of the words in the cache. We make an extensive comparison of 'regular' cache models with neural cache models, both in terms of perplexity and WER after rescoring first-pass ASR results. Furthermore, we propose t…

    Submitted 24 September, 2018; originally announced September 2018.

    Comments: Accepted for publication at SLT 2018

  8. arXiv:1805.04264  [pdf, ps, other]

    cs.CL cs.NE

    State Gradients for RNN Memory Analysis

    Authors: Lyan Verwimp, Hugo Van hamme, Vincent Renkens, Patrick Wambacq

    Abstract: We present a framework for analyzing what the state in RNNs remembers from its input embeddings. Our approach is inspired by backpropagation, in the sense that we compute the gradients of the states with respect to the input embeddings. The gradient matrix is decomposed with Singular Value Decomposition to analyze which directions in the embedding space are best transferred to the hidden state spa…

    Submitted 18 June, 2018; v1 submitted 11 May, 2018; originally announced May 2018.

    Comments: Accepted for Interspeech 2018

  9. arXiv:1709.03759  [pdf, ps, other]

    cs.CL

    Language Models of Spoken Dutch

    Authors: Lyan Verwimp, Joris Pelemans, Marieke Lycke, Hugo Van hamme, Patrick Wambacq

    Abstract: In Flanders, all TV shows are subtitled. However, the process of subtitling is a very time-consuming one and can be sped up by providing the output of a speech recognizer run on the audio of the TV show, prior to the subtitling. Naturally, this speech recognition will perform much better if the employed language model is adapted to the register and the topic of the program. We present several lang…

    Submitted 12 September, 2017; originally announced September 2017.

  10. arXiv:1704.02813  [pdf, other]

    cs.CL

    Character-Word LSTM Language Models

    Authors: Lyan Verwimp, Joris Pelemans, Hugo Van hamme, Patrick Wambacq

    Abstract: We present a Character-Word Long Short-Term Memory Language Model which both reduces the perplexity with respect to a baseline word-level language model and reduces the number of parameters of the model. Character information can reveal structural (dis)similarities between words and can even be used when a word is out-of-vocabulary, thus improving the modeling of infrequent and unknown words. By c…

    Submitted 10 April, 2017; originally announced April 2017.

    Journal ref: European Chapter of the Association for Computational Linguistics (EACL) 2017, Valencia, Spain, pp. 417-427