Skip to main content

Showing 1–1 of 1 results for author: Kamada, C

.
  1. arXiv:2201.09427  [pdf, other

    eess.AS cs.SD

    Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end

    Authors: Rem Hida, Masaki Hamada, Chie Kamada, Emiru Tsunoo, Toshiyuki Sekiya, Toshiyuki Kumakura

    Abstract: Although end-to-end text-to-speech (TTS) models can generate natural speech, challenges still remain when it comes to estimating sentence-level phonetic and prosodic information from raw text in Japanese TTS systems. In this paper, we propose a method for polyphone disambiguation (PD) and accent prediction (AP). The proposed method incorporates explicit features extracted from morphological analys… ▽ More

    Submitted 23 January, 2022; originally announced January 2022.

    Comments: 5 pages, 2 figures. Accepted to ICASSP2022