Skip to main content

Showing 1–4 of 4 results for author: Cvetkovic, Z

Searching in archive stat. Search in all archives.
.
  1. arXiv:2110.08634  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Towards Robust Waveform-Based Acoustic Models

    Authors: Dino Oglic, Zoran Cvetkovic, Peter Sollich, Steve Renals, Bin Yu

    Abstract: We study the problem of learning robust acoustic models in adverse environments, characterized by a significant mismatch between training and test conditions. This problem is of paramount importance for the deployment of speech recognition systems that need to perform well in unseen environments. First, we characterize data augmentation theoretically as an instance of vicinal risk minimization, wh… ▽ More

    Submitted 29 June, 2022; v1 submitted 16 October, 2021; originally announced October 2021.

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022

  2. arXiv:2110.06639  [pdf, other

    cs.LG stat.ML

    When saliency goes off on a tangent: Interpreting Deep Neural Networks with nonlinear saliency maps

    Authors: Jan Rosenzweig, Zoran Cvetkovic, Ivana Rosenzweig

    Abstract: A fundamental bottleneck in utilising complex machine learning systems for critical applications has been not knowing why they do and what they do, thus preventing the development of any crucial safety protocols. To date, no method exist that can provide full insight into the granularity of the neural network's decision process. In the past, saliency maps were an early attempt at resolving this pr… ▽ More

    Submitted 16 January, 2023; v1 submitted 13 October, 2021; originally announced October 2021.

  3. arXiv:2002.05059  [pdf, other

    cs.LG stat.ML

    Goldilocks Neural Networks

    Authors: Jan Rosenzweig, Zoran Cvetkovic, Ivana Rosenzweig

    Abstract: We introduce the new "Goldilocks" class of activation functions, which non-linearly deform the input signal only locally when the input signal is in the appropriate range. The small local deformation of the signal enables better understanding of how and why the signal is transformed through the layers. Numerical results on CIFAR-10 and CIFAR-100 data sets show that Goldilocks networks perform bett… ▽ More

    Submitted 26 February, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

  4. Learning Waveform-Based Acoustic Models using Deep Variational Convolutional Neural Networks

    Authors: Dino Oglic, Zoran Cvetkovic, Peter Sollich

    Abstract: We investigate the potential of stochastic neural networks for learning effective waveform-based acoustic models. The waveform-based setting, inherent to fully end-to-end speech recognition systems, is motivated by several comparative studies of automatic and human speech recognition that associate standard non-adaptive feature extraction techniques with information loss which can adversely affect… ▽ More

    Submitted 15 August, 2021; v1 submitted 22 June, 2019; originally announced June 2019.

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing 2021