Skip to main content

Showing 1–2 of 2 results for author: Leow, C S

Searching in archive eess. Search in all archives.
.
  1. arXiv:2203.15473  [pdf, other

    eess.AS

    Frequency-Directional Attention Model for Multilingual Automatic Speech Recognition

    Authors: Akihiro Dobashi, Chee Siang Leow, Hiromitsu Nishizaki

    Abstract: This paper proposes a model for transforming speech features using the frequency-directional attention model for End-to-End (E2E) automatic speech recognition. The idea is based on the hypothesis that in the phoneme system of each language, the characteristics of the frequency bands of speech when uttering them are different. By transforming the input Mel filter bank features with an attention mod… ▽ More

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: submitted to INTERSPEECH2022

  2. arXiv:2104.01384  [pdf, other

    eess.AS cs.CL

    ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi

    Authors: Yu Wang, Chee Siang Leow, Akio Kobayashi, Takehito Utsuro, Hiromitsu Nishizaki

    Abstract: This paper describes the ExKaldi-RT online automatic speech recognition (ASR) toolkit that is implemented based on the Kaldi ASR toolkit and Python language. ExKaldi-RT provides tools for building online recognition pipelines. While similar tools are available built on Kaldi, a key feature of ExKaldi-RT that it works on Python, which has an easy-to-use interface that allows online ASR system devel… ▽ More

    Submitted 8 August, 2021; v1 submitted 3 April, 2021; originally announced April 2021.

    Comments: Accepted at the IEEE 10th Global Conference on Consumer Electronics