Showing 1–13 of 13 results for author: Beck, E

Searching in archive eess.
  1. RASR2: The RWTH ASR Toolkit for Generic Sequence-to-sequence Speech Recognition

    Authors: Wei Zhou, Eugen Beck, Simon Berger, Ralf Schlüter, Hermann Ney

    Abstract: Modern public ASR tools usually provide rich support for training various sequence-to-sequence (S2S) models, but rather simple support for decoding open-vocabulary scenarios only. For closed-vocabulary scenarios, public tools supporting lexical-constrained decoding are usually only for classical ASR, or do not support all S2S models. To eliminate this restriction on research possibilities such as…

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: accepted at Interspeech 2023

  2. arXiv:2305.03571  [pdf, ps, other]

    eess.SP cs.IT cs.LG stat.ML

    Model-free Reinforcement Learning of Semantic Communication by Stochastic Policy Gradient

    Authors: Edgar Beck, Carsten Bockelmann, Armin Dekorsy

    Abstract: Following the recent success of Machine Learning tools in wireless communications, the idea of semantic communication by Weaver from 1949 has gained attention. It breaks with Shannon's classic design paradigm by aiming to transmit the meaning, i.e., semantics, of a message instead of its exact version, allowing for information rate savings. In this work, we apply the Stochastic Policy Gradient (SP…

    Submitted 14 March, 2024; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: Accepted for publication in IEEE International Conference on Machine Learning for Communication and Networking (ICMLCN 2024), Source Code: https://github.com/ant-uni-bremen/SINFONY

  3. arXiv:2304.10176  [pdf, ps, other]

    cs.LG cs.AI eess.SY

    Robust Deep Reinforcement Learning Scheduling via Weight Anchoring

    Authors: Steffen Gracla, Edgar Beck, Carsten Bockelmann, Armin Dekorsy

    Abstract: Questions remain on the robustness of data-driven learning methods when crossing the gap from simulation to reality. We utilize weight anchoring, a method known from continual learning, to cultivate and fixate desired behavior in Neural Networks. Weight anchoring may be used to find a solution to a learning problem that is nearby the solution of another learning problem. Thereby, learning can be c…

    Submitted 20 April, 2023; originally announced April 2023.

  4. arXiv:2304.09488  [pdf, ps, other]

    eess.SY cs.LG

    Learning Resource Scheduling with High Priority Users using Deep Deterministic Policy Gradients

    Authors: Steffen Gracla, Edgar Beck, Carsten Bockelmann, Armin Dekorsy

    Abstract: Advances in mobile communication capabilities open the door for closer integration of pre-hospital and in-hospital care processes. For example, medical specialists can be enabled to guide on-site paramedics and can, in turn, be supplied with live vitals or visuals. Consolidating such performance-critical applications with the highly complex workings of mobile communications requires solutions both…

    Submitted 19 April, 2023; originally announced April 2023.

  5. arXiv:2204.13366  [pdf, other]

    cs.IT cs.AI cs.LG eess.SP stat.ML

    Semantic Information Recovery in Wireless Networks

    Authors: Edgar Beck, Carsten Bockelmann, Armin Dekorsy

    Abstract: Motivated by the recent success of Machine Learning (ML) tools in wireless communications, the idea of semantic communication by Weaver from 1949 has gained attention. It breaks with Shannon's classic design paradigm by aiming to transmit the meaning of a message, i.e., semantics, rather than its exact version and thus allows for savings in information rate. In this work, we extend the fundamental…

    Submitted 12 June, 2023; v1 submitted 28 April, 2022; originally announced April 2022.

    Comments: Submitted for peer review. arXiv admin note: text overlap with arXiv:2305.03571

  6. arXiv:2201.09692  [pdf, ps, other]

    cs.SD eess.AS

    Improving Factored Hybrid HMM Acoustic Modeling without State Tying

    Authors: Tina Raissi, Eugen Beck, Ralf Schlüter, Hermann Ney

    Abstract: In this work, we show that a factored hybrid hidden Markov model (FH-HMM) which is defined without any phonetic state-tying outperforms a state-of-the-art hybrid HMM. The factored hybrid HMM provides a link to transducer models in the way it models phonetic (label) context while preserving the strict separation of acoustic and language model of the hybrid HMM approach. Furthermore, we show that th…

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: Accepted for presentation at IEEE ICASSP 2022

    MSC Class: 68T10 ACM Class: I.2.7

  7. arXiv:2201.07463  [pdf]

    eess.IV cs.LG

    Cortical lesions, central vein sign, and paramagnetic rim lesions in multiple sclerosis: emerging machine learning techniques and future avenues

    Authors: Francesco La Rosa, Maxence Wynen, Omar Al-Louzi, Erin S Beck, Till Huelnhagen, Pietro Maggi, Jean-Philippe Thiran, Tobias Kober, Russell T Shinohara, Pascal Sati, Daniel S Reich, Cristina Granziera, Martina Absinta, Meritxell Bach Cuadra

    Abstract: The current multiple sclerosis (MS) diagnostic criteria lack specificity, and this may lead to misdiagnosis, which remains an issue in present-day clinical practice. In addition, conventional biomarkers only moderately correlate with MS disease progression. Recently, advanced MS lesional imaging biomarkers such as cortical lesions (CL), the central vein sign (CVS), and paramagnetic rim lesions (PR…

    Submitted 19 January, 2022; originally announced January 2022.

  8. arXiv:2104.02387  [pdf, other]

    cs.SD eess.AS

    Towards Consistent Hybrid HMM Acoustic Modeling

    Authors: Tina Raissi, Eugen Beck, Ralf Schlüter, Hermann Ney

    Abstract: High-performance hybrid automatic speech recognition (ASR) systems are often trained with clustered triphone outputs, and thus require a complex training pipeline to generate the clustering. The same complex pipeline is often utilized in order to generate an alignment for use in frame-wise cross-entropy training. In this work, we propose a flat-start factored hybrid model trained by modeling the f…

    Submitted 12 October, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

    MSC Class: 68T10 ACM Class: I.2.7

  9. arXiv:2102.12756  [pdf, other]

    eess.SP cs.AI cs.IT cs.LG stat.ML

    CMDNet: Learning a Probabilistic Relaxation of Discrete Variables for Soft Detection with Low Complexity

    Authors: Edgar Beck, Carsten Bockelmann, Armin Dekorsy

    Abstract: Following the great success of Machine Learning (ML), especially Deep Neural Networks (DNNs), in many research domains in 2010s, several ML-based approaches were proposed for detection in large inverse linear problems, e.g., massive MIMO systems. The main motivation behind is that the complexity of Maximum A-Posteriori (MAP) detection grows exponentially with system dimensions. Instead of using DN…

    Submitted 13 August, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: Submitted for publication

  10. arXiv:2008.06780  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Automated Detection of Cortical Lesions in Multiple Sclerosis Patients with 7T MRI

    Authors: Francesco La Rosa, Erin S Beck, Ahmed Abdulkadir, Jean-Philippe Thiran, Daniel S Reich, Pascal Sati, Meritxell Bach Cuadra

    Abstract: The automated detection of cortical lesions (CLs) in patients with multiple sclerosis (MS) is a challenging task that, despite its clinical relevance, has received very little attention. Accurate detection of the small and scarce lesions requires specialized sequences and high or ultra-high field MRI. For supervised training based on multimodal structural MRI at 7T, two experts generated ground tr…

    Submitted 15 August, 2020; originally announced August 2020.

    Comments: Accepted to MICCAI 2020

  11. Context-Dependent Acoustic Modeling without Explicit Phone Clustering

    Authors: Tina Raissi, Eugen Beck, Ralf Schlüter, Hermann Ney

    Abstract: Phoneme-based acoustic modeling of large vocabulary automatic speech recognition takes advantage of phoneme context. The large number of context-dependent (CD) phonemes and their highly varying statistics require tying or smoothing to enable robust training. Usually, classification and regression trees are used for phonetic clustering, which is standard in hidden Markov model (HMM)-based systems.…

    Submitted 7 April, 2021; v1 submitted 15 May, 2020; originally announced May 2020.

    Comments: Proceedings of Interspeech 2020

    MSC Class: 68T10 ACM Class: I.2.7

  12. arXiv:1907.01030  [pdf, ps, other]

    eess.AS cs.LG cs.SD stat.ML

    LSTM Language Models for LVCSR in First-Pass Decoding and Lattice-Rescoring

    Authors: Eugen Beck, Wei Zhou, Ralf Schlüter, Hermann Ney

    Abstract: LSTM based language models are an important part of modern LVCSR systems as they significantly improve performance over traditional backoff language models. Incorporating them efficiently into decoding has been notoriously difficult. In this paper we present an approach based on a combination of one-pass decoding and lattice rescoring. We perform decoding with the LSTM-LM in the first pass but rec…

    Submitted 1 July, 2019; originally announced July 2019.

  13. RWTH ASR Systems for LibriSpeech: Hybrid vs Attention -- w/o Data Augmentation

    Authors: Christoph Lüscher, Eugen Beck, Kazuki Irie, Markus Kitza, Wilfried Michel, Albert Zeyer, Ralf Schlüter, Hermann Ney

    Abstract: We present state-of-the-art automatic speech recognition (ASR) systems employing a standard hybrid DNN/HMM architecture compared to an attention-based encoder-decoder design for the LibriSpeech task. Detailed descriptions of the system development, including model design, pretraining schemes, training schedules, and optimization approaches are provided for both system architectures. Both hybrid DN…

    Submitted 25 July, 2019; v1 submitted 8 May, 2019; originally announced May 2019.

    Comments: Proceedings of INTERSPEECH 2019