Skip to main content

Showing 1–2 of 2 results for author: Siu, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.08207  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Transformer-based Model for ASR N-Best Rescoring and Rewriting

    Authors: Iwen E. Kang, Christophe Van Gysel, Man-Hung Siu

    Abstract: Voice assistants increasingly use on-device Automatic Speech Recognition (ASR) to ensure speed and privacy. However, due to resource constraints on the device, queries pertaining to complex information domains often require further processing by a search engine. For such applications, we propose a novel Transformer based model capable of rescoring and rewriting, by exploring full context of the N-… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Interspeech '24

  2. arXiv:2310.07062  [pdf, other

    cs.SD cs.LG eess.AS

    Acoustic Model Fusion for End-to-end Speech Recognition

    Authors: Zhihong Lei, Mingbin Xu, Shiyi Han, Leo Liu, Zhen Huang, Tim Ng, Yuanyuan Zhang, Ernest Pusateri, Mirko Hannemann, Yaqiao Deng, Man-Hung Siu

    Abstract: Recent advances in deep learning and automatic speech recognition (ASR) have enabled the end-to-end (E2E) ASR system and boosted the accuracy to a new level. The E2E systems implicitly model all conventional ASR components, such as the acoustic model (AM) and the language model (LM), in a single network trained on audio-text pairs. Despite this simpler system architecture, fusing a separate LM, tr… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.