Skip to main content

Showing 1–8 of 8 results for author: Dietzen, T

Searching in archive eess. Search in all archives.
.
  1. arXiv:2405.02991  [pdf, other

    cs.SD eess.AS

    Steered Response Power for Sound Source Localization: A Tutorial Review

    Authors: Eric Grinstein, Elisa Tengan, Bilgesu Çakmak, Thomas Dietzen, Leonardo Nunes, Toon van Waterschoot, Mike Brookes, Patrick A. Naylor

    Abstract: In the last three decades, the Steered Response Power (SRP) method has been widely used for the task of Sound Source Localization (SSL), due to its satisfactory localization performance on moderately reverberant and noisy scenarios. Many works have analyzed and extended the original SRP method to reduce its computational cost, to allow it to locate multiple sources, or to improve its performance i… ▽ More

    Submitted 9 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

  2. arXiv:2306.08514  [pdf, other

    eess.AS eess.SP

    Low-Complexity Steered Response Power Map** based on Low-Rank and Sparse Interpolation

    Authors: Thomas Dietzen, Enzo De Sena, Toon van Waterschoot

    Abstract: For acoustic source localization, a map of the acoustic scene as obtained by the steered response power (SRP) approach can be employed. In SRP, the frequency-weighted output power of a beamformer steered towards a set of candidate locations is obtained from generalized cross-correlations (GCCs). Due to the dense grid of candidate locations, conventional SRP exhibits a high computational complexity… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

  3. MYRiAD: A Multi-Array Room Acoustic Database

    Authors: Thomas Dietzen, Randall Ali, Maja Taseska, Toon van Waterschoot

    Abstract: In the development of acoustic signal processing algorithms, their evaluation in various acoustic environments is of utmost importance. In order to advance evaluation in realistic and reproducible scenarios, several high-quality acoustic databases have been developed over the years. In this paper, we present another complementary database of acoustic recordings, referred to as the Multi-arraY Room… ▽ More

    Submitted 12 May, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Journal ref: EURASIP J. Audio Speech Music Process., vol. 2023, no. 17, pp. 1-14, Apr. 2023

  4. arXiv:2211.02690  [pdf, other

    eess.AS cs.SD

    Speech enhancement using ego-noise references with a microphone array embedded in an unmanned aerial vehicle

    Authors: Elisa Tengan, Thomas Dietzen, Santiago Ruiz, Mansour Alkmim, João Cardenuto, Toon van Waterschoot

    Abstract: A method is proposed for performing speech enhancement using ego-noise references with a microphone array embedded in an unmanned aerial vehicle (UAV). The ego-noise reference signals are captured with microphones located near the UAV's propellers and used in the prior knowledge multichannel Wiener filter (PK-MWF) to obtain the speech correlation matrix estimate. Speech presence probability (SPP)… ▽ More

    Submitted 16 August, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Journal ref: Proceedings of the 24th International Congress on Acoustics (ICA), Gyeongju, South Korea, 24 Oct 2022-28 Oct 2022

  5. Low-Complexity Steered Response Power Map** based on Nyquist-Shannon Sampling

    Authors: Thomas Dietzen, Enzo De Sena, Toon van Waterschoot

    Abstract: The steered response power (SRP) approach to acoustic source localization computes a map of the acoustic scene from the frequency-weighted output power of a beamformer steered towards a set of candidate locations. Equivalently, SRP may be expressed in terms of time-domain generalized cross-correlations (GCCs) at lags equal to the candidate locations' time-differences of arrival (TDOAs). Due to the… ▽ More

    Submitted 22 July, 2021; v1 submitted 17 December, 2020; originally announced December 2020.

  6. Instantaneous PSD Estimation for Speech Enhancement based on Generalized Principal Components

    Authors: Thomas Dietzen, Marc Moonen, Toon van Waterschoot

    Abstract: Power spectral density (PSD) estimates of various microphone signal components are essential to many speech enhancement procedures. As speech is highly non-nonstationary, performance improvements may be gained by maintaining time-variations in PSD estimates. In this paper, we propose an instantaneous PSD estimation approach based on generalized principal components. Similarly to other eigenspace-b… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

    Journal ref: Proc. 28th European Signal Process. Conf. (EUSIPCO 2020), Amsterdam, Netherlands, Jan 2021, pp. 1-5

  7. Integrated sidelobe cancellation and linear prediction Kalman filter for joint multi-microphone speech dereverberation, interfering speech cancellation, and noise reduction

    Authors: T. Dietzen, S. Doclo, M. Moonen, T. van Waterschoot

    Abstract: In multi-microphone speech enhancement, reverberation as well as additive noise and/or interfering speech are commonly suppressed by deconvolution and spatial filtering, e.g., using multi-channel linear prediction (MCLP) on the one hand and beamforming, e.g., a generalized sidelobe canceler (GSC), on the other hand. In this paper, we consider several reverberant speech components, whereof some are… ▽ More

    Submitted 18 June, 2019; originally announced June 2019.

  8. Square root-based multi-source early PSD estimation and recursive RETF update in reverberant environments by means of the orthogonal Procrustes problem

    Authors: T. Dietzen, S. Doclo, M. Moonen, T. van Waterschoot

    Abstract: Multi-channel short-time Fourier transform (STFT) domain-based processing of reverberant microphone signals commonly relies on power-spectral-density (PSD) estimates of early source images, where early refers to reflections contained within the same STFT frame. State-of-the-art approaches to multi-source early PSD estimation, given an estimate of the associated relative early transfer functions (R… ▽ More

    Submitted 18 June, 2019; originally announced June 2019.