Skip to main content

Showing 1–1 of 1 results for author: Naoyuki, O

Searching in archive eess. Search in all archives.
.
  1. arXiv:2305.15055  [pdf, other

    cs.SD cs.AI eess.AS

    Iteratively Improving Speech Recognition and Voice Conversion

    Authors: Mayank Kumar Singh, Naoya Takahashi, Onoe Naoyuki

    Abstract: Many existing works on voice conversion (VC) tasks use automatic speech recognition (ASR) models for ensuring linguistic consistency between source and converted samples. However, for the low-data resource domains, training a high-quality ASR remains to be a challenging task. In this work, we propose a novel iterative way of improving both the ASR and VC models. We first train an ASR model which i… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.