Showing 1–1 of 1 results for author: Dobashi, A

  1. arXiv:2203.15473  [pdf, other]

    eess.AS

    Frequency-Directional Attention Model for Multilingual Automatic Speech Recognition

    Authors: Akihiro Dobashi, Chee Siang Leow, Hiromitsu Nishizaki

    Abstract: This paper proposes a model for transforming speech features using the frequency-directional attention model for End-to-End (E2E) automatic speech recognition. The idea is based on the hypothesis that in the phoneme system of each language, the characteristics of the frequency bands of speech when uttering them are different. By transforming the input Mel filter bank features with an attention mod…

    Submitted 29 March, 2022; originally announced March 2022.

    Comments: submitted to INTERSPEECH2022