
Showing 1–2 of 2 results for author: Schamper, J

  1. arXiv:1906.07286  [pdf, other]

    cs.CL cs.LG

    Generalizing Back-Translation in Neural Machine Translation

    Authors: Miguel Graça, Yunsu Kim, Julian Schamper, Shahram Khadivi, Hermann Ney

    Abstract: Back-translation - data augmentation by translating target monolingual data - is a crucial component in modern neural machine translation (NMT). In this work, we reformulate back-translation in the scope of cross-entropy optimization of an NMT model, clarifying its underlying mathematical assumptions and approximations beyond its heuristic usage. Our formulation covers broader synthetic data gener…

    Submitted 17 June, 2019; originally announced June 2019.

    Comments: 4th Conference on Machine Translation (WMT 2019) camera-ready

  2. arXiv:1901.01577  [pdf, other]

    cs.CL

    Unsupervised Training for Large Vocabulary Translation Using Sparse Lexicon and Word Classes

    Authors: Yunsu Kim, Julian Schamper, Hermann Ney

    Abstract: We address for the first time unsupervised training for a translation task with hundreds of thousands of vocabulary words. We scale up the expectation-maximization (EM) algorithm to learn a large translation table without any parallel text or seed lexicon. First, we solve the memory bottleneck and enforce the sparsity with a simple thresholding scheme for the lexicon. Second, we initialize the lex…

    Submitted 6 January, 2019; originally announced January 2019.

    Comments: Published in EACL 2017