Skip to main content

Showing 1–1 of 1 results for author: Sereda, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.02839  [pdf, ps, other

    eess.AS cs.AI cs.CL

    Pheme: Efficient and Conversational Speech Generation

    Authors: Paweł Budzianowski, Taras Sereda, Tomasz Cichy, Ivan Vulić

    Abstract: In recent years, speech generation has seen remarkable progress, now achieving one-shot generation capability that is often virtually indistinguishable from real human voice. Integrating such advancements in speech generation with large language models might revolutionize a wide range of applications. However, certain applications, such as assistive conversational systems, require natural and conv… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.