Showing 1–16 of 16 results for author: Esmaeilpour, M

  1. arXiv:2207.06858  [pdf, ps, other]

    cs.SD cs.LG eess.AS

    RSD-GAN: Regularized Sobolev Defense GAN Against Speech-to-Text Adversarial Attacks

    Authors: Mohammad Esmaeilpour, Nourhene Chaalia, Patrick Cardinal

    Abstract: This paper introduces a new synthesis-based defense algorithm for counteracting a variety of adversarial attacks developed for challenging the performance of the cutting-edge speech-to-text transcription systems. Our algorithm implements a Sobolev-based GAN and proposes a novel regularizer for effectively controlling the functionality of the entire generative model, particularly the di…

    Submitted 24 September, 2022; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: Paper accepted for publication in IEEE Signal Processing Letters

  2. arXiv:2205.11693  [pdf, other]

    cs.LG cs.AI cs.DB

    RCC-GAN: Regularized Compound Conditional GAN for Large-Scale Tabular Data Synthesis

    Authors: Mohammad Esmaeilpour, Nourhene Chaalia, Adel Abusitta, Francois-Xavier Devailly, Wissem Maazoun, Patrick Cardinal

    Abstract: This paper introduces a novel generative adversarial network (GAN) for synthesizing large-scale tabular databases which contain various features such as continuous, discrete, and binary. Technically, our GAN belongs to the category of class-conditioned generative models with a predefined conditional vector. However, we propose a new formulation for deriving such a vector incorporating both binary…

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: Paper submitted to IEEE Transactions on Neural Networks and Learning Systems

  3. arXiv:2204.07018  [pdf, other]

    cs.SD cs.CR cs.CV cs.LG eess.AS

    From Environmental Sound Representation to Robustness of 2D CNN Models Against Adversarial Attacks

    Authors: Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

    Abstract: This paper investigates the impact of different standard environmental sound representations (spectrograms) on the recognition performance and adversarial attack robustness of a victim residual convolutional neural network, namely ResNet-18. Our main motivation for focusing on such a front-end classifier rather than other complex architectures is balancing recognition accuracy and the total number…

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: 32 pages, Preprint Submitted to Journal of Applied Acoustics. arXiv admin note: substantial text overlap with arXiv:2007.13703

  4. arXiv:2111.06549  [pdf, other]

    cs.LG

    Bi-Discriminator Class-Conditional Tabular GAN

    Authors: Mohammad Esmaeilpour, Nourhene Chaalia, Adel Abusitta, Francois-Xavier Devailly, Wissem Maazoun, Patrick Cardinal

    Abstract: This paper introduces a bi-discriminator GAN for synthesizing tabular datasets containing continuous, binary, and discrete columns. Our proposed approach employs an adapted preprocessing scheme and a novel conditional term for the generator network to more effectively capture the input sample distributions. Additionally, we implement straightforward yet effective architectures for discriminator ne…

    Submitted 2 December, 2021; v1 submitted 11 November, 2021; originally announced November 2021.

    Comments: Submitted to Elsevier Pattern Recognition Letters

  5. arXiv:2103.14717  [pdf, other]

    cs.SD cs.CR eess.AS

    Cyclic Defense GAN Against Speech Adversarial Attacks

    Authors: Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

    Abstract: This paper proposes a new defense approach for counteracting state-of-the-art white and black-box adversarial attack algorithms. Our approach fits into the implicit reactive defense algorithm category since it does not directly manipulate the potentially malicious input signals. Instead, it reconstructs a similar signal with a synthesized spectrogram using a cyclic generative adversarial network.…

    Submitted 22 August, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: 5

    Journal ref: IEEE Signal Processing Letters (2021) 1-5

  6. arXiv:2103.08095  [pdf, ps, other]

    cs.SD cs.LG eess.AS

    Towards Robust Speech-to-Text Adversarial Attack

    Authors: Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

    Abstract: This paper introduces a novel adversarial algorithm for attacking the state-of-the-art speech-to-text systems, namely DeepSpeech, Kaldi, and Lingvo. Our approach is based on developing an extension for the conventional distortion condition of the adversarial optimization formulation using the Cramér integral probability metric. Minimizing over this metric, which measures the discrepancies between…

    Submitted 14 March, 2021; originally announced March 2021.

    Comments: 5 pages

  7. arXiv:2103.08086  [pdf, other]

    cs.SD cs.LG eess.AS

    Multi-Discriminator Sobolev Defense-GAN Against Adversarial Attacks for End-to-End Speech Systems

    Authors: Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

    Abstract: This paper introduces a defense approach against end-to-end adversarial attacks developed for cutting-edge speech-to-text systems. The proposed defense algorithm has four major steps. First, we represent speech signals with 2D spectrograms using the short-time Fourier transform. Second, we iteratively find a safe vector using a spectrogram subspace projection operation. This operation minimizes th…

    Submitted 14 March, 2021; originally announced March 2021.

    Comments: 10 pages

  8. arXiv:2010.11352  [pdf, other]

    cs.SD cs.CR cs.CV cs.LG eess.AS

    Class-Conditional Defense GAN Against End-to-End Speech Attacks

    Authors: Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

    Abstract: In this paper we propose a novel defense approach against end-to-end adversarial attacks developed to fool advanced speech-to-text systems such as DeepSpeech and Lingvo. Unlike conventional defense approaches, the proposed approach does not directly employ low-level transformations, such as autoencoding a given input signal, to remove potential adversarial perturbation. Instead of that, we…

    Submitted 19 February, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: 5 pages

    Journal ref: 46th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP), 2021

  9. arXiv:2010.05844  [pdf, other]

    cs.SD cs.LG eess.AS

    Conditioning Trick for Training Stable GANs

    Authors: Mohammad Esmaeilpour, Raymel Alfonso Sallo, Olivier St-Georges, Patrick Cardinal, Alessandro Lameiras Koerich

    Abstract: In this paper we propose a conditioning trick, called difference departure from normality, applied on the generator network in response to instability issues during GAN training. We force the generator to get closer to the departure from normality function of real samples computed in the spectral domain of Schur decomposition. This binding makes the generator amenable to truncation and does not li…

    Submitted 12 October, 2020; originally announced October 2020.

  10. arXiv:2008.11618  [pdf, other]

    eess.AS cs.LG cs.SD stat.ML

    Adversarially Training for Audio Classifiers

    Authors: Raymel Alfonso Sallo, Mohammad Esmaeilpour, Patrick Cardinal

    Abstract: In this paper, we investigate the potential effect of adversarial training on the robustness of six advanced deep neural networks against a variety of targeted and non-targeted adversarial attacks. We first show that the ResNet-56 model trained on the 2D representation of the discrete wavelet transform appended with the tonnetz chromagram outperforms other models in terms of recognition a…

    Submitted 25 October, 2020; v1 submitted 26 August, 2020; originally announced August 2020.

    Comments: Paper accepted to International Conference on Pattern Recognition (ICPR) 2020

  11. arXiv:2008.05454  [pdf, other]

    cs.LG cs.SD eess.AS stat.ML

    Improving Stability of LS-GANs for Audio and Speech Signals

    Authors: Mohammad Esmaeilpour, Raymel Alfonso Sallo, Olivier St-Georges, Patrick Cardinal, Alessandro Lameiras Koerich

    Abstract: In this paper we address the instability issue of generative adversarial network (GAN) by proposing a new similarity metric in unitary space of Schur decomposition for 2D representations of audio and speech signals. We show that encoding departure from normality computed in this vector space into the generator optimization formulation helps to craft more comprehensive spectrograms. We demonstrate…

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: 10 pages

  12. arXiv:2007.13703  [pdf, other]

    eess.AS cs.LG cs.SD

    From Sound Representation to Model Robustness

    Authors: Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

    Abstract: In this paper, we investigate the impact of different standard environmental sound representations (spectrograms) on the recognition performance and adversarial attack robustness of a victim residual convolutional neural network. Averaged over various experiments on three benchmarking environmental sound datasets, we found the ResNet-18 model outperforms other deep learning architectures such as G…

    Submitted 17 January, 2021; v1 submitted 27 July, 2020; originally announced July 2020.

    Comments: 12 pages

  13. arXiv:1910.12084  [pdf, ps, other]

    cs.LG cs.CR cs.SD eess.AS stat.ML

    Detection of Adversarial Attacks and Characterization of Adversarial Subspace

    Authors: Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

    Abstract: Adversarial attacks have always been a serious threat for any data-driven model. In this paper, we explore subspaces of adversarial examples in unitary vector domain, and we propose a novel detector for defending our models trained for environmental sound classification. We measure chordal distance between legitimate and malicious representation of sounds in unitary space of generalized Schur deco…

    Submitted 26 October, 2019; originally announced October 2019.

    Comments: Submitted to ICASSP 2020

  14. arXiv:1910.10106  [pdf, other]

    cs.SD cs.LG cs.MM eess.AS stat.ML

    Cross-Representation Transferability of Adversarial Attacks: From Spectrograms to Audio Waveforms

    Authors: Karl Michel Koerich, Mohammad Esmaeilpour, Sajjad Abdoli, Alceu de Souza Britto Jr., Alessandro Lameiras Koerich

    Abstract: This paper shows the susceptibility of spectrogram-based audio classifiers to adversarial attacks and the transferability of such attacks to audio waveforms. Some commonly used adversarial attacks on images have been applied to Mel-frequency and short-time Fourier transform spectrograms, and such perturbed spectrograms are able to fool a 2D convolutional neural network (CNN). Such attacks produce…

    Submitted 29 July, 2020; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: 8 pages

    Journal ref: IEEE International Joint Conference on Neural Networks (IJCNN 2020), Glasgow, UK

  15. arXiv:1904.10990  [pdf, other]

    cs.LG cs.CR cs.SD eess.AS stat.ML

    A Robust Approach for Securing Audio Classification Against Adversarial Attacks

    Authors: Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

    Abstract: Adversarial audio attacks can be considered as a small perturbation imperceptible to human ears that is intentionally added to the audio signal and causes a machine learning model to make mistakes. This poses a security concern about the safety of machine learning models since the adversarial attacks can fool such models toward the wrong predictions. In this paper we first review some strong advers…

    Submitted 25 November, 2019; v1 submitted 24 April, 2019; originally announced April 2019.

    Comments: Paper Accepted for Publication in IEEE Transactions on Information Forensics and Security

  16. arXiv:1904.04221  [pdf, other]

    cs.LG cs.SD eess.AS stat.ML

    Unsupervised Feature Learning for Environmental Sound Classification Using Weighted Cycle-Consistent Generative Adversarial Network

    Authors: Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich

    Abstract: In this paper we propose a novel environmental sound classification approach incorporating unsupervised feature learning from codebook via spherical $K$-Means++ algorithm and a new architecture for high-level data augmentation. The audio signal is transformed into a 2D representation using a discrete wavelet transform (DWT). The DWT spectrograms are then augmented by a novel architecture for cycle…

    Submitted 25 November, 2019; v1 submitted 8 April, 2019; originally announced April 2019.

    Comments: Paper Accepted for Publication in Elsevier Applied Soft Computing