Showing 1–4 of 4 results for author: Madhu, N

Searching in archive eess.
  1. arXiv:2406.09819

    eess.AS

    Enhanced Deep Speech Separation in Clustered Ad Hoc Distributed Microphone Environments

    Authors: Jihyun Kim, Stijn Kindt, Nilesh Madhu, Hong-Goo Kang

    Abstract: Ad-hoc distributed microphone environments, where microphone locations and numbers are unpredictable, present a challenge to traditional deep learning models, which typically require fixed architectures. To tailor deep learning models to accommodate arbitrary array configurations, the Transform-Average-Concatenate (TAC) layer was previously introduced. In this work, we integrate TAC layers with du…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  2. arXiv:2306.02344

    eess.AS

    Influence of Lossy Speech Codecs on Hearing-aid, Binaural Sound Source Localisation using DNNs

    Authors: Siyuan Song, Stijn Kindt, Jasper Maes, Alexander Bohlender, Nilesh Madhu

    Abstract: Hearing aids are typically equipped with multiple microphones to exploit spatial information for source localisation and speech enhancement. Especially for hearing aids, a good source localisation is important: it not only guides source separation methods but can also be used to enhance spatial cues, increasing user-awareness of important events in their surroundings. We use a state-of-the-art dee…

    Submitted 4 June, 2023; originally announced June 2023.

  3. arXiv:2304.03515

    eess.AS cs.SD

    Margin-Mixup: A Method for Robust Speaker Verification in Multi-Speaker Audio

    Authors: Jenthe Thienpondt, Nilesh Madhu, Kris Demuynck

    Abstract: This paper is concerned with the task of speaker verification on audio with multiple overlapping speakers. Most speaker verification systems are designed with the assumption of a single speaker being present in a given audio segment. However, in a real-world setting this assumption does not always hold. In this paper, we demonstrate that current speaker verification systems are not robust against…

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: Proceedings of ICASSP 2023

  4. arXiv:2108.00912

    eess.AS cs.SD

    Robust Acoustic Scene Classification in the Presence of Active Foreground Speech

    Authors: Siyuan Song, Brecht Desplanques, Celest De Moor, Kris Demuynck, Nilesh Madhu

    Abstract: We present an iVector based Acoustic Scene Classification (ASC) system suited for real life settings where active foreground speech can be present. In the proposed system, each recording is represented by a fixed-length iVector that models the recording's important properties. A regularized Gaussian backend classifier with class-specific covariance models is used to extract the relevant acoustic s…

    Submitted 2 August, 2021; originally announced August 2021.