Showing 1–2 of 2 results for author: Kunikoshi, A

Searching in archive cs.

  1. arXiv:2206.13817  [pdf, other]

    cs.SD cs.AI eess.AS

    Comparison of Speech Representations for the MOS Prediction System

    Authors: Aki Kunikoshi, Jaebok Kim, Wonsuk Jun, Kåre Sjölander

    Abstract: Automatic methods to predict Mean Opinion Score (MOS) of listeners have been researched to assure the quality of Text-to-Speech systems. Many previous studies focus on architectural advances (e.g. MBNet, LDNet, etc.) to capture relations between spectral features and MOS in a more effective way and achieved high accuracy. However, the optimal representation in terms of generalization capability st…

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: 5 pages, 4 figures

  2. arXiv:2204.00061  [pdf, other]

    cs.SD cs.CL eess.AS

    Data-augmented cross-lingual synthesis in a teacher-student framework

    Authors: Marcel de Korte, Jaebok Kim, Aki Kunikoshi, Adaeze Adigwe, Esther Klabbers

    Abstract: Cross-lingual synthesis can be defined as the task of letting a speaker generate fluent synthetic speech in another language. This is a challenging task, and resulting speech can suffer from reduced naturalness, accented speech, and/or loss of essential voice characteristics. Previous research shows that many models appear to have insufficient generalization capabilities to perform well on every o…

    Submitted 31 March, 2022; originally announced April 2022.

    Comments: Submitted to INTERSPEECH 2022

    ACM Class: I.2.7