Skip to main content

Showing 1–7 of 7 results for author: Abdoli, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.18838  [pdf

    cs.DL cs.AI physics.soc-ph

    Unleashing the Power of AI. A Systematic Review of Cutting-Edge Techniques in AI-Enhanced Scientometrics, Webometrics, and Bibliometrics

    Authors: Hamid Reza Saeidnia, Elaheh Hosseini, Shadi Abdoli, Marcel Ausloos

    Abstract: Purpose: The study aims to analyze the synergy of Artificial Intelligence (AI), with scientometrics, webometrics, and bibliometrics to unlock and to emphasize the potential of the applications and benefits of AI algorithms in these fields. Design/methodology/approach: By conducting a systematic literature review, our aim is to explore the potential of AI in revolutionizing the methods used to me… ▽ More

    Submitted 22 February, 2024; originally announced March 2024.

    Comments: to be published in Library High Tech; 30 pages; 80 references; 4 figures; 3 tables

  2. arXiv:2305.01578  [pdf, other

    cs.SD cs.AI cs.CL eess.AS

    Self-supervised learning for infant cry analysis

    Authors: Arsenii Gorin, Cem Subakan, Sajjad Abdoli, Junhao Wang, Samantha Latremouille, Charles Onu

    Abstract: In this paper, we explore self-supervised learning (SSL) for analyzing a first-of-its-kind database of cry recordings containing clinical indications of more than a thousand newborns. Specifically, we target cry-based detection of neurological injury as well as identification of cry triggers such as pain, hunger, and discomfort. Annotating a large database in the medical setting is expensive and t… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE ICASSP 2023 workshop Self-supervision in Audio, Speech and Beyond

  3. arXiv:2205.06237  [pdf, other

    cs.CV

    Knowledge Distillation for Multi-Target Domain Adaptation in Real-Time Person Re-Identification

    Authors: FĂ©lix Remigereau, Djebril Mekhazni, Sajjad Abdoli, Le Thanh Nguyen-Meidine, Rafael M. O. Cruz, Eric Granger

    Abstract: Despite the recent success of deep learning architectures, person re-identification (ReID) remains a challenging problem in real-word applications. Several unsupervised single-target domain adaptation (STDA) methods have recently been proposed to limit the decline in ReID accuracy caused by the domain shift that typically occurs between source and target video data. Given the multimodal nature of… ▽ More

    Submitted 10 July, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: 4 pages, 2 figures, submitted to ICIP2022

  4. arXiv:1912.00938  [pdf

    eess.AS cs.SD

    Speaker detection in the wild: Lessons learned from JSALT 2019

    Authors: Paola Garcia, Jesus Villalba, Herve Bredin, Jun Du, Diego Castan, Alejandrina Cristia, Latane Bullock, Ling Guo, Koji Okabe, Phani Sankar Nidadavolu, Saurabh Kataria, Sizhu Chen, Leo Galmant, Marvin Lavechin, Lei Sun, Marie-Philippe Gill, Bar Ben-Yair, Sajjad Abdoli, Xin Wang, Wassim Bouaziz, Hadrien Titeux, Emmanuel Dupoux, Kong Aik Lee, Najim Dehak

    Abstract: This paper presents the problems and solutions addressed at the JSALT workshop when using a single microphone for speaker detection in adverse scenarios. The main focus was to tackle a wide range of conditions that go from meetings to wild speech. We describe the research threads we explored and a set of modules that was successful for these scenarios. The ultimate goal was to explore speaker dete… ▽ More

    Submitted 2 December, 2019; originally announced December 2019.

    Comments: Submitted to ICASSP 2020

  5. arXiv:1910.10106  [pdf, other

    cs.SD cs.LG cs.MM eess.AS stat.ML

    Cross-Representation Transferability of Adversarial Attacks: From Spectrograms to Audio Waveforms

    Authors: Karl Michel Koerich, Mohammad Esmaeilpour, Sajjad Abdoli, Alceu de Souza Britto Jr., Alessandro Lameiras Koerich

    Abstract: This paper shows the susceptibility of spectrogram-based audio classifiers to adversarial attacks and the transferability of such attacks to audio waveforms. Some commonly used adversarial attacks to images have been applied to Mel-frequency and short-time Fourier transform spectrograms, and such perturbed spectrograms are able to fool a 2D convolutional neural network (CNN). Such attacks produce… ▽ More

    Submitted 29 July, 2020; v1 submitted 22 October, 2019; originally announced October 2019.

    Comments: 8 pages

    Journal ref: IEEE International Joint Conference on Neural Networks (IJCNN 2020), Glasgow, UK

  6. arXiv:1908.03173  [pdf, other

    cs.LG cs.SD eess.AS stat.ML

    Universal Adversarial Audio Perturbations

    Authors: Sajjad Abdoli, Luiz G. Hafemann, Jerome Rony, Ismail Ben Ayed, Patrick Cardinal, Alessandro L. Koerich

    Abstract: We demonstrate the existence of universal adversarial perturbations, which can fool a family of audio classification architectures, for both targeted and untargeted attack scenarios. We propose two methods for finding such perturbations. The first method is based on an iterative, greedy approach that is well-known in computer vision: it aggregates small perturbations to the input so as to push it… ▽ More

    Submitted 16 November, 2020; v1 submitted 8 August, 2019; originally announced August 2019.

  7. arXiv:1904.08990  [pdf, other

    cs.SD cs.LG stat.ML

    End-to-End Environmental Sound Classification using a 1D Convolutional Neural Network

    Authors: Sajjad Abdoli, Patrick Cardinal, Alessandro Lameiras Koerich

    Abstract: In this paper, we present an end-to-end approach for environmental sound classification based on a 1D Convolution Neural Network (CNN) that learns a representation directly from the audio signal. Several convolutional layers are used to capture the signal's fine time structure and learn diverse filters that are relevant to the classification task. The proposed approach can deal with audio signals… ▽ More

    Submitted 18 April, 2019; originally announced April 2019.