Showing 1–1 of 1 results for author: Tsuboi, K

  1. arXiv:2009.08474

    eess.AS cs.LG cs.SD

    Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis

    Authors: Yukiya Hono, Kazuna Tsuboi, Kei Sawada, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda

    Abstract: This paper proposes a hierarchical generative model with a multi-grained latent variable to synthesize expressive speech. In recent years, fine-grained latent variables are introduced into the text-to-speech synthesis that enable the fine control of the prosody and speaking styles of synthesized speech. However, the naturalness of speech degrades when these latent variables are obtained by samplin…

    Submitted 26 December, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: 5 pages, accepted to INTERSPEECH 2020, demo page: https://www.rinna.jp/research/interspeech2020/