Skip to main content

Showing 1–3 of 3 results for author: Sallo, R A

.
  1. arXiv:2010.05844  [pdf, other

    cs.SD cs.LG eess.AS

    Conditioning Trick for Training Stable GANs

    Authors: Mohammad Esmaeilpour, Raymel Alfonso Sallo, Olivier St-Georges, Patrick Cardinal, Alessandro Lameiras Koerich

    Abstract: In this paper we propose a conditioning trick, called difference departure from normality, applied on the generator network in response to instability issues during GAN training. We force the generator to get closer to the departure from normality function of real samples computed in the spectral domain of Schur decomposition. This binding makes the generator amenable to truncation and does not li… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

  2. arXiv:2008.11618  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Adversarially Training for Audio Classifiers

    Authors: Raymel Alfonso Sallo, Mohammad Esmaeilpour, Patrick Cardinal

    Abstract: In this paper, we investigate the potential effect of the adversarially training on the robustness of six advanced deep neural networks against a variety of targeted and non-targeted adversarial attacks. We firstly show that, the ResNet-56 model trained on the 2D representation of the discrete wavelet transform appended with the tonnetz chromagram outperforms other models in terms of recognition a… ▽ More

    Submitted 25 October, 2020; v1 submitted 26 August, 2020; originally announced August 2020.

    Comments: Paper accepted to International Conference on Pattern Recognition (ICPR) 2020

  3. arXiv:2008.05454  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Improving Stability of LS-GANs for Audio and Speech Signals

    Authors: Mohammad Esmaeilpour, Raymel Alfonso Sallo, Olivier St-Georges, Patrick Cardinal, Alessandro Lameiras Koerich

    Abstract: In this paper we address the instability issue of generative adversarial network (GAN) by proposing a new similarity metric in unitary space of Schur decomposition for 2D representations of audio and speech signals. We show that encoding departure from normality computed in this vector space into the generator optimization formulation helps to craft more comprehensive spectrograms. We demonstrate… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: 10 pages