Skip to main content

Showing 1–4 of 4 results for author: Rosendahl, J

.
  1. arXiv:2110.09245  [pdf, other

    cs.CL cs.SD eess.AS

    Efficient Sequence Training of Attention Models using Approximative Recombination

    Authors: Nils-Philipp Wynands, Wilfried Michel, Jan Rosendahl, Ralf Schlüter, Hermann Ney

    Abstract: Sequence discriminative training is a great tool to improve the performance of an automatic speech recognition system. It does, however, necessitate a sum over all possible word sequences, which is intractable to compute in practice. Current state-of-the-art systems with unlimited label context circumvent this problem by limiting the summation to an n-best list of relevant competing hypotheses obt… ▽ More

    Submitted 21 April, 2022; v1 submitted 18 October, 2021; originally announced October 2021.

  2. arXiv:2109.13097  [pdf, other

    cs.CL

    Towards Reinforcement Learning for Pivot-based Neural Machine Translation with Non-autoregressive Transformer

    Authors: Evgeniia Tokarchuk, Jan Rosendahl, Weiyue Wang, Pavel Petrushkov, Tomer Lancewicki, Shahram Khadivi, Hermann Ney

    Abstract: Pivot-based neural machine translation (NMT) is commonly used in low-resource setups, especially for translation between non-English language pairs. It benefits from using high resource source-pivot and pivot-target language pairs and an individual system is trained for both sub-tasks. However, these models have no connection during training, and the source-pivot model is not optimized to produce… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: RL4RealLife Workshop 2021 camera-ready

  3. Integrated Training for Sequence-to-Sequence Models Using Non-Autoregressive Transformer

    Authors: Evgeniia Tokarchuk, Jan Rosendahl, Weiyue Wang, Pavel Petrushkov, Tomer Lancewicki, Shahram Khadivi, Hermann Ney

    Abstract: Complex natural language applications such as speech translation or pivot translation traditionally rely on cascaded models. However, cascaded models are known to be prone to error propagation and model discrepancy problems. Furthermore, there is no possibility of using end-to-end training data in conventional cascaded systems, meaning that the training data most suited for the task cannot be used… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: IWSLT 2021 camera-ready

  4. arXiv:1906.01942  [pdf, other

    cs.CL cs.LG

    Learning Bilingual Sentence Embeddings via Autoencoding and Computing Similarities with a Multilayer Perceptron

    Authors: Yunsu Kim, Hendrik Rosendahl, Nick Rossenbach, Jan Rosendahl, Shahram Khadivi, Hermann Ney

    Abstract: We propose a novel model architecture and training algorithm to learn bilingual sentence embeddings from a combination of parallel and monolingual data. Our method connects autoencoding and neural machine translation to force the source and target sentence embeddings to share the same space without the help of a pivot language or an additional transformation. We train a multilayer perceptron on to… ▽ More

    Submitted 5 June, 2019; originally announced June 2019.

    Comments: ACL 2019 Repl4NLP camera-ready