
Showing 1–3 of 3 results for author: Walters, T C

  1. arXiv:1910.06464  [pdf, other]

    cs.LG cs.SD eess.AS stat.ML

    Low Bit-Rate Speech Coding with VQ-VAE and a WaveNet Decoder

    Authors: Cristina Gârbacea, Aäron van den Oord, Yazhe Li, Felicia S C Lim, Alejandro Luebs, Oriol Vinyals, Thomas C Walters

    Abstract: In order to efficiently transmit and store speech signals, speech codecs create a minimally redundant representation of the input signal which is then decoded at the receiver with the best possible perceptual quality. In this work we demonstrate that a neural network architecture based on VQ-VAE with a WaveNet decoder can be used to perform very low bit-rate speech coding with high reconstruction…

    Submitted 14 October, 2019; originally announced October 2019.

    Comments: ICASSP 2019

    Journal ref: ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 735-739. IEEE, 2019

  2. arXiv:1907.04927  [pdf, other]

    eess.AS cs.LG cs.SD eess.SP

    Speech bandwidth extension with WaveNet

    Authors: Archit Gupta, Brendan Shillingford, Yannis Assael, Thomas C. Walters

    Abstract: Large-scale mobile communication systems tend to contain legacy transmission channels with narrowband bottlenecks, resulting in characteristic "telephone-quality" audio. While higher quality codecs exist, due to the scale and heterogeneity of the networks, transmitting higher sample rate audio with modern high-quality audio codecs can be difficult in practice. This paper proposes an approach where…

    Submitted 5 July, 2019; originally announced July 2019.

  3. arXiv:1712.01120  [pdf, other]

    eess.AS cs.SD eess.SP

    Wavenet based low rate speech coding

    Authors: W. Bastiaan Kleijn, Felicia S. C. Lim, Alejandro Luebs, Jan Skoglund, Florian Stimberg, Quan Wang, Thomas C. Walters

    Abstract: Traditional parametric coding of speech facilitates low rate but provides poor reconstruction quality because of the inadequacy of the model used. We describe how a WaveNet generative speech model can be used to generate high quality speech from the bit stream of a standard parametric coder operating at 2.4 kb/s. We compare this parametric coder with a waveform coder based on the same generative m…

    Submitted 1 December, 2017; originally announced December 2017.

    Comments: 5 pages, 2 figures