Showing 1–7 of 7 results for author: Wuebker, J

  1. arXiv:2309.08590  [pdf, other]

    cs.CL

    Neural Machine Translation Models Can Learn to be Few-shot Learners

    Authors: Raphael Reinauer, Patrick Simianer, Kaden Uhlig, Johannes E. M. Mosig, Joern Wuebker

    Abstract: The emergent ability of Large Language Models to use a small number of examples to learn to perform in novel domains and tasks, also called in-context learning (ICL). In this work, we show that a much smaller model can be trained to perform ICL by fine-tuning towards a specialized training objective, exemplified on the task of domain adaptation for neural machine translation. With this capacity fo…

    Submitted 15 September, 2023; originally announced September 2023.

  2. arXiv:2206.08593  [pdf, other]

    cs.CL cs.LG

    Automatic Correction of Human Translations

    Authors: Jessy Lin, Geza Kovacs, Aditya Shastry, Joern Wuebker, John DeNero

    Abstract: We introduce translation error correction (TEC), the task of automatically correcting human-generated translations. Imperfections in machine translations (MT) have long motivated systems for improving translations post-hoc with automatic post-editing. In contrast, little attention has been devoted to the problem of automatically correcting human translations, despite the intuition that humans make…

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: NAACL 2022. Dataset available at: https://github.com/lilt/tec

  3. arXiv:2011.05978  [pdf, other]

    cs.CL cs.HC

    The Impact of Text Presentation on Translator Performance

    Authors: Samuel Läubli, Patrick Simianer, Joern Wuebker, Geza Kovacs, Rico Sennrich, Spence Green

    Abstract: Widely used computer-aided translation (CAT) tools divide documents into segments such as sentences and arrange them in a side-by-side, spreadsheet-like view. We present the first controlled evaluation of these design choices on translator performance, measuring speed and accuracy in three experimental text processing tasks. We find significant evidence that sentence-by-sentence presentation enabl…

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: Accepted for publication in Target

  4. arXiv:2004.14675  [pdf, other]

    cs.CL

    End-to-End Neural Word Alignment Outperforms GIZA++

    Authors: Thomas Zenkel, Joern Wuebker, John DeNero

    Abstract: Word alignment was once a core unsupervised learning task in natural language processing because of its essential role in training statistical machine translation (MT) models. Although unnecessary for training neural MT models, word alignment still plays an important role in interactive applications of neural machine translation, such as annotation transfer and lexicon injection. While statistical…

    Submitted 30 April, 2020; originally announced April 2020.

    Comments: Accepted at ACL 2020

  5. arXiv:1901.11359  [pdf, other]

    cs.CL

    Adding Interpretable Attention to Neural Translation Models Improves Word Alignment

    Authors: Thomas Zenkel, Joern Wuebker, John DeNero

    Abstract: Multi-layer models with multiple attention heads per layer provide superior translation quality compared to simpler and shallower models, but determining what source context is most relevant to each target word is more challenging as a result. Therefore, deriving high-accuracy word alignments from the activations of a state-of-the-art neural machine translation model is an open challenge. We propo…

    Submitted 31 January, 2019; originally announced January 2019.

  6. arXiv:1901.01574  [pdf, other]

    cs.CL

    A Comparative Study on Vocabulary Reduction for Phrase Table Smoothing

    Authors: Yunsu Kim, Andreas Guta, Joern Wuebker, Hermann Ney

    Abstract: This work systematically analyzes the smoothing effect of vocabulary reduction for phrase translation models. We extensively compare various word-level vocabularies to show that the performance of smoothing is not significantly affected by the choice of vocabulary. This result provides empirical evidence that the standard phrase translation model is extremely sparse. Our experiments also reveal th…

    Submitted 6 January, 2019; originally announced January 2019.

    Comments: Published in WMT 2016

  7. arXiv:1811.01990  [pdf, other]

    cs.CL

    Compact Personalized Models for Neural Machine Translation

    Authors: Joern Wuebker, Patrick Simianer, John DeNero

    Abstract: We propose and compare methods for gradient-based domain adaptation of self-attentive neural machine translation models. We demonstrate that a large proportion of model parameters can be frozen during adaptation with minimal or no reduction in translation quality by encouraging structured sparsity in the set of offset tensors during learning via group lasso regularization. We evaluate this techniq…

    Submitted 5 November, 2018; originally announced November 2018.

    Comments: Published at the 2018 Conference on Empirical Methods in Natural Language Processing