Showing 1–3 of 3 results for author: Narayanaswamy, V S

Searching in archive cs.
  1. arXiv:2104.07161  [pdf, other]

    cs.SD cs.LG eess.AS

    On the Design of Deep Priors for Unsupervised Audio Restoration

    Authors: Vivek Sivaraman Narayanaswamy, Jayaraman J. Thiagarajan, Andreas Spanias

    Abstract: Unsupervised deep learning methods for solving audio restoration problems rely extensively on carefully tailored neural architectures that carry strong inductive biases for defining priors in the time or spectral domain. In this context, a lot of recent success has been achieved with sophisticated convolutional network constructions that recover audio signals in the spectral domain. However, in prac…

    Submitted 14 April, 2021; originally announced April 2021.

  2. arXiv:1904.04161  [pdf, other]

    cs.LG cs.SD eess.AS stat.ML

    Audio Source Separation via Multi-Scale Learning with Dilated Dense U-Nets

    Authors: Vivek Sivaraman Narayanaswamy, Sameeksha Katoch, Jayaraman J. Thiagarajan, Huan Song, Andreas Spanias

    Abstract: Modern audio source separation techniques rely on optimizing sequence model architectures, such as 1D-CNNs, on mixture recordings to generalize well to unseen mixtures. Specifically, recent focus is on time-domain architectures such as Wave-U-Net, which exploit temporal context by extracting multi-scale features. However, the optimality of the feature extraction process in these architectures…

    Submitted 8 April, 2019; originally announced April 2019.

  3. arXiv:1811.00183  [pdf, other]

    stat.ML cs.LG cs.SD eess.AS

    Designing an Effective Metric Learning Pipeline for Speaker Diarization

    Authors: Vivek Sivaraman Narayanaswamy, Jayaraman J. Thiagarajan, Huan Song, Andreas Spanias

    Abstract: State-of-the-art speaker diarization systems utilize knowledge from external data, in the form of a pre-trained distance metric, to effectively determine relative speaker identities in unseen data. However, much of the recent focus has been on choosing the appropriate feature extractor, ranging from pre-trained $i$-vectors to representations learned via different sequence modeling architectures (e.g.…

    Submitted 31 October, 2018; originally announced November 2018.