Showing 1–1 of 1 results for author: Benita, R

  1. arXiv:2310.01381  [pdf, other]

    cs.SD cs.CL eess.AS

    DiffAR: Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation

    Authors: Roi Benita, Michael Elad, Joseph Keshet

    Abstract: Diffusion models have recently been shown to be relevant for high-quality speech generation. Most work has been focused on generating spectrograms, and as such, they further require a subsequent model to convert the spectrogram to a waveform (i.e., a vocoder). This work proposes a diffusion probabilistic end-to-end model for generating a raw speech waveform. The proposed model is autoregressive, g…

    Submitted 10 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.