Skip to main content

Showing 1–5 of 5 results for author: Dahlmann, L

.
  1. arXiv:2108.10197  [pdf, other

    cs.CL stat.CO

    Deploying a BERT-based Query-Title Relevance Classifier in a Production System: a View from the Trenches

    Authors: Leonard Dahlmann, Tomer Lancewicki

    Abstract: The Bidirectional Encoder Representations from Transformers (BERT) model has been radically improving the performance of many Natural Language Processing (NLP) tasks such as Text Classification and Named Entity Recognition (NER) applications. However, it is challenging to scale BERT for low-latency and high-throughput industrial use cases due to its enormous size. We successfully optimize a Query-… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

  2. arXiv:2010.09482  [pdf, other

    cs.CL cs.AI

    Diving Deep into Context-Aware Neural Machine Translation

    Authors: **g**g Huo, Christian Herold, Yingbo Gao, Leonard Dahlmann, Shahram Khadivi, Hermann Ney

    Abstract: Context-aware neural machine translation (NMT) is a promising direction to improve the translation quality by making use of the additional context, e.g., document-level translation, or having meta-information. Although there exist various architectures and analyses, the effectiveness of different context-aware NMT models is not well explored yet. This paper analyzes the performance of document-lev… ▽ More

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: Accepted at 5th Conference on Machine Translation (WMT20)

  3. arXiv:1906.03129  [pdf, other

    cs.CL cs.AI

    Word-based Domain Adaptation for Neural Machine Translation

    Authors: Shen Yan, Leonard Dahlmann, Pavel Petrushkov, Sanjika Hewavitharana, Shahram Khadivi

    Abstract: In this paper, we empirically investigate applying word-level weights to adapt neural machine translation to e-commerce domains, where small e-commerce datasets and large out-of-domain datasets are available. In order to mine in-domain like words in the out-of-domain datasets, we compute word weights by using a domain-specific and a non-domain-specific language model followed by smoothing and bina… ▽ More

    Submitted 7 June, 2019; originally announced June 2019.

    Comments: Published on the proceedings of the International Workshop on Spoken Language Translation (IWSLT), 2018

    Journal ref: Proceedings of the 15th International Workshop on Spoken Language Translation, Bruges, Belgium, October 29-30, 2018

  4. arXiv:1708.03271  [pdf, other

    cs.CL

    Neural Machine Translation Leveraging Phrase-based Models in a Hybrid Search

    Authors: Leonard Dahlmann, Evgeny Matusov, Pavel Petrushkov, Shahram Khadivi

    Abstract: In this paper, we introduce a hybrid search for attention-based neural machine translation (NMT). A target phrase learned with statistical MT models extends a hypothesis in the NMT beam search when the attention of the NMT model focuses on the source words translated by this phrase. Phrases added in this way are scored with the NMT model, but also with SMT features including phrase-level translati… ▽ More

    Submitted 10 August, 2017; originally announced August 2017.

    Comments: To appear in Proceedings of EMNLP 2017

  5. arXiv:1708.03186  [pdf, other

    cs.CL

    Neural and Statistical Methods for Leveraging Meta-information in Machine Translation

    Authors: Shahram Khadivi, Patrick Wilken, Leonard Dahlmann, Evgeny Matusov

    Abstract: In this paper, we discuss different methods which use meta information and richer context that may accompany source language input to improve machine translation quality. We focus on category information of input text as meta information, but the proposed methods can be extended to all textual and non-textual meta information that might be available for the input text or automatically predicted us… ▽ More

    Submitted 10 August, 2017; originally announced August 2017.

    Comments: To appear in MT Summit 2017