Skip to main content

Showing 1–6 of 6 results for author: Hieber, F

.
  1. arXiv:2210.13281  [pdf, other

    cs.CL

    Analyzing the Use of Influence Functions for Instance-Specific Data Filtering in Neural Machine Translation

    Authors: Tsz Kin Lam, Eva Hasler, Felix Hieber

    Abstract: Customer feedback can be an important signal for improving commercial machine translation systems. One solution for fixing specific translation errors is to remove the related erroneous training instances followed by re-training of the machine translation system, which we refer to as instance-specific data filtering. Influence functions (IF) have been shown to be effective in finding such relevant… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted at WMT 2022

  2. arXiv:2207.05851  [pdf, ps, other

    cs.CL

    Sockeye 3: Fast Neural Machine Translation with PyTorch

    Authors: Felix Hieber, Michael Denkowski, Tobias Domhan, Barbara Darques Barros, Celina Dong Ye, Xing Niu, Cuong Hoang, Ke Tran, Benjamin Hsu, Maria Nadejde, Surafel Lakew, Prashant Mathur, Anna Currey, Marcello Federico

    Abstract: Sockeye 3 is the latest version of the Sockeye toolkit for Neural Machine Translation (NMT). Now based on PyTorch, Sockeye 3 provides faster model implementations and more advanced features with a further streamlined codebase. This enables broader experimentation with faster iteration, efficient training of stronger and faster models, and the flexibility to move new ideas quickly from research to… ▽ More

    Submitted 2 August, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

  3. arXiv:2205.06618  [pdf, other

    cs.CL cs.AI cs.LG

    The Devil is in the Details: On the Pitfalls of Vocabulary Selection in Neural Machine Translation

    Authors: Tobias Domhan, Eva Hasler, Ke Tran, Sony Trenous, Bill Byrne, Felix Hieber

    Abstract: Vocabulary selection, or lexical shortlisting, is a well-known technique to improve latency of Neural Machine Translation models by constraining the set of allowed output words during inference. The chosen set is typically determined by separately trained alignment model parameters, independent of the source-sentence context at inference time. While vocabulary selection appears competitive with re… ▽ More

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  4. arXiv:2008.04885  [pdf, ps, other

    cs.CL

    The Sockeye 2 Neural Machine Translation Toolkit at AMTA 2020

    Authors: Tobias Domhan, Michael Denkowski, David Vilar, Xing Niu, Felix Hieber, Kenneth Heafield

    Abstract: We present Sockeye 2, a modernized and streamlined version of the Sockeye neural machine translation (NMT) toolkit. New features include a simplified code base through the use of MXNet's Gluon API, a focus on state of the art model architectures, distributed mixed precision training, and efficient CPU decoding with 8-bit quantization. These improvements result in faster training and inference, hig… ▽ More

    Submitted 11 August, 2020; originally announced August 2020.

  5. arXiv:1810.04101  [pdf, other

    cs.CV

    Image Captioning as Neural Machine Translation Task in SOCKEYE

    Authors: Loris Bazzani, Tobias Domhan, Felix Hieber

    Abstract: Image captioning is an interdisciplinary research problem that stands between computer vision and natural language processing. The task is to generate a textual description of the content of an image. The typical model used for image captioning is an encoder-decoder deep network, where the encoder captures the essence of an image while the decoder is responsible for generating a sentence describin… ▽ More

    Submitted 15 October, 2018; v1 submitted 9 October, 2018; originally announced October 2018.

  6. arXiv:1712.05690  [pdf, other

    cs.CL cs.LG stat.ML

    Sockeye: A Toolkit for Neural Machine Translation

    Authors: Felix Hieber, Tobias Domhan, Michael Denkowski, David Vilar, Artem Sokolov, Ann Clifton, Matt Post

    Abstract: We describe Sockeye (version 1.12), an open-source sequence-to-sequence toolkit for Neural Machine Translation (NMT). Sockeye is a production-ready framework for training and applying models as well as an experimental platform for researchers. Written in Python and built on MXNet, the toolkit offers scalable training and inference for the three most prominent encoder-decoder architectures: attenti… ▽ More

    Submitted 1 June, 2018; v1 submitted 15 December, 2017; originally announced December 2017.