Showing 1–2 of 2 results for author: Friedland, G

Searching in archive eess.
  1. arXiv:2309.09088

    cs.SD eess.AS

    Enhancing GAN-Based Vocoders with Contrastive Learning Under Data-limited Condition

    Authors: Haoming Guo, Seth Z. Zhao, Jiachen Lian, Gopala Anumanchipalli, Gerald Friedland

    Abstract: Vocoder models have recently achieved substantial progress in generating authentic audio comparable to human quality while significantly reducing memory requirement and inference time. However, these data-hungry generative models require large-scale audio data for learning good representations. In this paper, we apply contrastive learning methods in training the vocoder to improve the perceptual q…

    Submitted 18 December, 2023; v1 submitted 16 September, 2023; originally announced September 2023.

  2. arXiv:1710.04288

    eess.AS cs.SD

    Audio Concept Classification with Hierarchical Deep Neural Networks

    Authors: Mirco Ravanelli, Benjamin Elizalde, Karl Ni, Gerald Friedland

    Abstract: Audio-based multimedia retrieval tasks may identify semantic information in audio streams, i.e., audio concepts (such as music, laughter, or a revving engine). Conventional Gaussian-Mixture-Models have had some success in classifying a reduced set of audio concepts. However, multi-class classification can benefit from context window analysis and the discriminating power of deeper architectures. Al…

    Submitted 11 October, 2017; originally announced October 2017.

    Journal ref: EUSIPCO 2014