
Showing 1–7 of 7 results for author: Bitzer, J

  1. arXiv:2401.08486  [pdf, other]

    eess.AS

    Microphone Subset Selection for the Weighted Prediction Error Algorithm using a Group Sparsity Penalty

    Authors: Anselm Lohmann, Toon van Waterschoot, Joerg Bitzer, Simon Doclo

    Abstract: Reverberation can severely degrade the quality of speech signals recorded using microphones in an enclosure. In acoustic sensor networks with spatially distributed microphones, a similar dereverberation performance may be achieved using only a subset of all available microphones. Using the popular convex relaxation method, in this paper we propose to perform microphone subset selection for the wei…

    Submitted 16 January, 2024; originally announced January 2024.

  2. arXiv:2310.00319  [pdf, other]

    eess.SP cs.SD eess.AS

    Time-Variant Overlap-Add in Partitions

    Authors: Hagen Jaeger, Uwe Simmer, Jörg Bitzer, Matthias Blau

    Abstract: Virtual and augmented realities are increasingly popular tools in many domains such as architecture, production, training and education, (psycho)therapy, gaming, and others. For a convincing rendering of sound in virtual and augmented environments, audio signals must be convolved in real-time with impulse responses that change from one moment in time to another. Key requirements for the implementa…

    Submitted 30 September, 2023; originally announced October 2023.

  3. arXiv:2306.16071  [pdf, other]

    eess.AS cs.CL cs.SD

    Long-term Conversation Analysis: Exploring Utility and Privacy

    Authors: Francesco Nespoli, Jule Pohlhausen, Patrick A. Naylor, Joerg Bitzer

    Abstract: The analysis of conversations recorded in everyday life requires privacy protection. In this contribution, we explore a privacy-preserving feature extraction method based on input feature dimension reduction, spectral smoothing and the low-cost speaker anonymization technique based on McAdams coefficient. We assess the utility of the feature extraction methods with a voice activity detection and a…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: Submitted to ITG Conference on Speech Communication, 2023

  4. arXiv:2306.16069  [pdf, other]

    eess.AS cs.SD eess.SP

    Two-Stage Voice Anonymization for Enhanced Privacy

    Authors: Francesco Nespoli, Daniel Barreda, Joerg Bitzer, Patrick A. Naylor

    Abstract: In recent years, the need for privacy preservation when manipulating or storing personal data, including speech, has become a major issue. In this paper, we present a system addressing the speaker-level anonymization problem. We propose and evaluate a two-stage anonymization pipeline exploiting a state-of-the-art anonymization model described in the Voice Privacy Challenge 2022 in combination wit…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: Submitted to INTERSPEECH

  5. arXiv:2301.07649  [pdf, other]

    eess.AS

    Dereverberation in Acoustic Sensor Networks Using Weighted Prediction Error With Microphone-dependent Prediction Delays

    Authors: Anselm Lohmann, Toon van Waterschoot, Joerg Bitzer, Simon Doclo

    Abstract: In the last decades several multi-microphone speech dereverberation algorithms have been proposed, among which the weighted prediction error (WPE) algorithm. In the WPE algorithm, a prediction delay is required to reduce the correlation between the prediction signals and the direct component in the reference microphone signal. In compact arrays with closely-spaced microphones, the prediction delay…

    Submitted 18 January, 2023; originally announced January 2023.

  6. arXiv:2212.04788  [pdf, other]

    eess.AS eess.SP

    Geometry-aware DoA Estimation using a Deep Neural Network with mixed-data input features

    Authors: Ulrik Kowalk, Simon Doclo, Joerg Bitzer

    Abstract: Unlike model-based direction of arrival (DoA) estimation algorithms, supervised learning-based DoA estimation algorithms based on deep neural networks (DNNs) are usually trained for one specific microphone array geometry, resulting in poor performance when applied to a different array geometry. In this paper we illustrate the fundamental difference between supervised learning-based and model-based…

    Submitted 9 December, 2022; originally announced December 2022.

    Comments: Submitted to ICASSP 2023

  7. arXiv:2206.05606  [pdf, other]

    eess.AS cs.SD

    Signal-informed DNN-based DOA Estimation combining an External Microphone and GCC-PHAT Features

    Authors: Ulrik Kowalk, Simon Doclo, Joerg Bitzer

    Abstract: Aiming at estimating the direction of arrival (DOA) of a desired speaker in a multi-talker environment using a microphone array, in this paper we propose a signal-informed method exploiting the availability of an external microphone attached to the desired speaker. The proposed method applies a binary mask to the GCC-PHAT input features of a convolutional neural network, where the binary mask is c…

    Submitted 11 June, 2022; originally announced June 2022.