Showing 1–10 of 10 results for author: Tammen, M

  1. arXiv:2405.03555  [pdf, other]

    cs.NI

    A Comprehensive Overview and Survey of O-RAN: Exploring Slicing-aware Architecture, Deployment Options, and Use Cases

    Authors: Khurshid Alam, Mohammad Asif Habibi, Matthias Tammen, Dennis Krummacker, Walid Saad, Marco Di Renzo, Tommaso Melodia, Xavier Costa-Pérez, Mérouane Debbah, Ashutosh Dutta, Hans D. Schotten

    Abstract: Open-radio access network (O-RAN) seeks to establish principles of openness, programmability, automation, intelligence, and hardware-software disaggregation with interoperable interfaces. It advocates for multi-vendorism and multi-stakeholderism within a cloudified and virtualized wireless infrastructure, aimed at enhancing the deployment, operation, and maintenance of RAN architecture. This enhan…

    Submitted 8 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: 45 pages, 12 figures, 4 tables, submitted to the IEEE for possible publication

  2. arXiv:2402.03058  [pdf, other]

    eess.AS cs.SD

    Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers

    Authors: Marvin Tammen, Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Shoko Araki, Simon Doclo

    Abstract: Although mask-based beamforming is a powerful speech enhancement approach, it often requires manual parameter tuning to handle moving speakers. Recently, this approach was augmented with an attention-based spatial covariance matrix aggregator (ASA) module, enabling accurate tracking of moving speakers without manual tuning. However, the deep neural network model used in this module is limited to s…

    Submitted 17 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: accepted at Interspeech 2024

  3. arXiv:2205.13851  [pdf, other]

    eess.AS

    Speaker-conditioning Single-channel Target Speaker Extraction using Conformer-based Architectures

    Authors: Ragini Sinha, Marvin Tammen, Christian Rollwage, Simon Doclo

    Abstract: Target speaker extraction aims at extracting the target speaker from a mixture of multiple speakers exploiting auxiliary information about the target speaker. In this paper, we consider a complete time-domain target speaker extraction system consisting of a speaker embedder network and a speaker separator network which are jointly trained in an end-to-end learning process. We propose two different…

    Submitted 27 May, 2022; originally announced May 2022.

    Comments: submitted to IWAENC 2022

  4. Dictionary-Based Fusion of Contact and Acoustic Microphones for Wind Noise Reduction

    Authors: Marvin Tammen, Xilin Li, Simon Doclo, Lalin Theverapperuma

    Abstract: In mobile speech communication applications, wind noise can lead to a severe reduction of speech quality and intelligibility. Since the performance of speech enhancement algorithms using acoustic microphones tends to substantially degrade in extremely challenging scenarios, auxiliary sensors such as contact microphones can be used. Although contact microphones offer a much lower recorded wind nois…

    Submitted 14 November, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: accepted at IWAENC 22

  5. Deep Multi-Frame MVDR Filtering for Binaural Noise Reduction

    Authors: Marvin Tammen, Simon Doclo

    Abstract: To improve speech intelligibility and speech quality in noisy environments, binaural noise reduction algorithms for head-mounted assistive listening devices are of crucial importance. Several binaural noise reduction algorithms such as the well-known binaural minimum variance distortionless response (MVDR) beamformer have been proposed, which exploit spatial correlations of both the target speech…

    Submitted 14 November, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: accepted at IWAENC 2022

  6. arXiv:2106.01902  [pdf, ps, other]

    eess.AS cs.SD

    Joint Multi-Channel Dereverberation and Noise Reduction Using a Unified Convolutional Beamformer With Sparse Priors

    Authors: Henri Gode, Marvin Tammen, Simon Doclo

    Abstract: Recently, the convolutional weighted power minimization distortionless response (WPD) beamformer was proposed, which unifies multi-channel weighted prediction error dereverberation and minimum power distortionless response beamforming. To optimize the convolutional filter, the desired speech component is modeled with a time-varying Gaussian model, which promotes the sparsity of the desired speech…

    Submitted 13 March, 2023; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: ITG Conference on Speech Communication

  7. arXiv:2104.04234  [pdf, other]

    eess.AS

    Speaker-conditioned Target Speaker Extraction based on Customized LSTM Cells

    Authors: Ragini Sinha, Marvin Tammen, Christian Rollwage, Simon Doclo

    Abstract: Speaker-conditioned target speaker extraction systems rely on auxiliary information about the target speaker to extract the target speaker signal from a mixture of multiple speakers. Typically, a deep neural network is applied to isolate the relevant target speaker characteristics. In this paper, we focus on a single-channel target speaker extraction system based on a CNN-LSTM separator network an…

    Submitted 9 April, 2021; originally announced April 2021.

  8. Deep Multi-Frame MVDR Filtering for Single-Microphone Speech Enhancement

    Authors: Marvin Tammen, Simon Doclo

    Abstract: Multi-frame algorithms for single-microphone speech enhancement, e.g., the multi-frame minimum variance distortionless response (MFMVDR) filter, are able to exploit speech correlation across adjacent time frames in the short-time Fourier transform (STFT) domain. Provided that accurate estimates of the required speech interframe correlation vector and the noise correlation matrix are available, it…

    Submitted 20 November, 2020; originally announced November 2020.

    Comments: submitted to the 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, Ontario, Canada

  9. DNN-Based Speech Presence Probability Estimation for Multi-Frame Single-Microphone Speech Enhancement

    Authors: Marvin Tammen, Dörte Fischer, Bernd T. Meyer, Simon Doclo

    Abstract: Multi-frame approaches for single-microphone speech enhancement, e.g., the multi-frame minimum-power-distortionless-response (MFMPDR) filter, are able to exploit speech correlations across neighboring time frames. In contrast to single-frame approaches such as the Wiener gain, it has been shown that multi-frame approaches achieve a substantial noise reduction with hardly any speech distortion, pro…

    Submitted 14 November, 2022; v1 submitted 21 May, 2019; originally announced May 2019.

  10. arXiv:1804.06196  [pdf, other]

    cs.CR

    Demystifying Deception Technology: A Survey

    Authors: Daniel Fraunholz, Simon Duque Anton, Christoph Lipps, Daniel Reti, Daniel Krohmer, Frederic Pohl, Matthias Tammen, Hans Dieter Schotten

    Abstract: Deception boosts security for systems and components by denial, deceit, misinformation, camouflage and obfuscation. In this work an extensive overview of the deception technology environment is presented. Taxonomies, theoretical backgrounds, psychological aspects as well as concepts, implementations, legal aspects and ethics are discussed and compared.

    Submitted 17 April, 2018; originally announced April 2018.

    Comments: 25 pages, 169 references