Skip to main content

Showing 1–1 of 1 results for author: Yoshimoto, A

Searching in archive eess. Search in all archives.
.
  1. arXiv:2404.06714  [pdf, other

    cs.CL cs.SD eess.AS

    Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness

    Authors: Xincan Feng, Akifumi Yoshimoto

    Abstract: Recent advancements in Natural Language Processing (NLP) have seen Large-scale Language Models (LLMs) excel at producing high-quality text for various purposes. Notably, in Text-To-Speech (TTS) systems, the integration of BERT for semantic token generation has underscored the importance of semantic content in producing coherent speech outputs. Despite this, the specific utility of LLMs in enhancin… ▽ More

    Submitted 17 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: 9 pages, 2 figures, 4 tables; accepted at LREC-COLING 2024