Skip to main content

Showing 1–3 of 3 results for author: Sarroff, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2103.08709  [pdf, other

    eess.AS cs.SD eess.SP

    Lightweight and interpretable neural modeling of an audio distortion effect using hyperconditioned differentiable biquads

    Authors: Shahan Nercessian, Andy Sarroff, Kurt James Werner

    Abstract: In this work, we propose using differentiable cascaded biquads to model an audio distortion effect. We extend trainable infinite impulse response (IIR) filters to the hyperconditioned case, in which a transformation is learned to directly map external parameters of the distortion effect to its internal filter and gain parameters, along with activations necessary to ensure filter stability. We prop… ▽ More

    Submitted 15 March, 2021; originally announced March 2021.

    Comments: 5 pages, 4 figures. To be published in IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2021

  2. arXiv:1810.01395  [pdf, other

    cs.SD cs.CL cs.LG eess.AS stat.ML

    Phasebook and Friends: Leveraging Discrete Representations for Source Separation

    Authors: Jonathan Le Roux, Gordon Wichern, Shinji Watanabe, Andy Sarroff, John R. Hershey

    Abstract: Deep learning based speech enhancement and source separation systems have recently reached unprecedented levels of quality, to the point that performance is reaching a new ceiling. Most systems rely on estimating the magnitude of a target source by estimating a real-valued mask to be applied to a time-frequency representation of the mixture signal. A limiting factor in such approaches is a lack of… ▽ More

    Submitted 7 March, 2019; v1 submitted 2 October, 2018; originally announced October 2018.

  3. arXiv:1511.06351  [pdf, other

    cs.LG cs.NE

    Learning Representations Using Complex-Valued Nets

    Authors: Andy M. Sarroff, Victor Shepardson, Michael A. Casey

    Abstract: Complex-valued neural networks (CVNNs) are an emerging field of research in neural networks due to their potential representational properties for audio, image, and physiological signals. It is common in signal processing to transform sequences of real values to the complex domain via a set of complex basis functions, such as the Fourier transform. We show how CVNNs can be used to learn complex re… ▽ More

    Submitted 19 November, 2015; originally announced November 2015.