Skip to main content

Showing 1–8 of 8 results for author: Variš, D

.
  1. arXiv:2308.03601  [pdf, other

    cs.CL

    Negative Lexical Constraints in Neural Machine Translation

    Authors: Josef Jon, Dušan Variš, Michal Novák, João Paulo Aires, Ondřej Bojar

    Abstract: This paper explores negative lexical constraining in English to Czech neural machine translation. Negative lexical constraining is used to prohibit certain words or expressions in the translation produced by the neural translation model. We compared various methods based on modifying either the decoding process or the training data. The comparison was performed on two tasks: paraphrasing and feedb… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  2. arXiv:2109.09354  [pdf, other

    cs.CL

    CUNI systems for WMT21: Multilingual Low-Resource Translation for Indo-European Languages Shared Task

    Authors: Josef Jon, Michal Novák, João Paulo Aires, Dušan Variš, Ondřej Bojar

    Abstract: This paper describes Charles University submission for Multilingual Low-Resource Translation for Indo-European Languages shared task at WMT21. We competed in translation from Catalan into Romanian, Italian and Occitan. Our systems are based on shared multilingual model. We show that using joint model for multiple similar language pairs improves upon translation quality in each pair. We also demons… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

  3. arXiv:2109.09350  [pdf, other

    cs.CL

    CUNI systems for WMT21: Terminology translation Shared Task

    Authors: Josef Jon, Michal Novák, João Paulo Aires, Dušan Variš, Ondřej Bojar

    Abstract: This paper describes Charles University submission for Terminology translation Shared Task at WMT21. The objective of this task is to design a system which translates certain terms based on a provided terminology database, while preserving high overall translation quality. We competed in English-French language pair. Our approach is based on providing the desired translations alongside the input s… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

  4. Sequence Length is a Domain: Length-based Overfitting in Transformer Models

    Authors: Dušan Variš, Ondřej Bojar

    Abstract: Transformer-based sequence-to-sequence architectures, while achieving state-of-the-art results on a large number of NLP tasks, can still suffer from overfitting during training. In practice, this is usually countered either by applying regularization methods (e.g. dropout, L2-regularization) or by providing huge amounts of training data. Additionally, Transformer and other architectures are known… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

  5. End-to-End Lexically Constrained Machine Translation for Morphologically Rich Languages

    Authors: Josef Jon, João Paulo Aires, Dušan Variš, Ondřej Bojar

    Abstract: Lexically constrained machine translation allows the user to manipulate the output sentence by enforcing the presence or absence of certain words and phrases. Although current approaches can enforce terms to appear in the translation, they often struggle to make the constraint word form agree with the rest of the generated output. Our manual analysis shows that 46% of the errors in the output of a… ▽ More

    Submitted 24 June, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

  6. arXiv:2010.09413  [pdf, other

    cs.CV cs.CL

    Image Captioning with Visual Object Representations Grounded in the Textual Modality

    Authors: Dušan Variš, Katsuhito Sudoh, Satoshi Nakamura

    Abstract: We present our work in progress exploring the possibilities of a shared embedding space between textual and visual modality. Leveraging the textual nature of object detection labels and the hypothetical expressiveness of extracted visual object representations, we propose an approach opposite to the current trend, grounding of the representations in the word embedding space of the captioning syste… ▽ More

    Submitted 20 October, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

  7. Unsupervised Pretraining for Neural Machine Translation Using Elastic Weight Consolidation

    Authors: Dušan Variš, Ondřej Bojar

    Abstract: This work presents our ongoing research of unsupervised pretraining in neural machine translation (NMT). In our method, we initialize the weights of the encoder and decoder with two language models that are trained with monolingual data and then fine-tune the model on parallel data using Elastic Weight Consolidation (EWC) to avoid forgetting of the original language modeling tasks. We compare the… ▽ More

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: ACL-SRW 2019 (camera-ready)

  8. arXiv:1811.04697  [pdf, ps, other

    cs.CL

    CUNI System for the WMT18 Multimodal Translation Task

    Authors: **dřich Helcl, **dřich Libovický, Dušan Variš

    Abstract: We present our submission to the WMT18 Multimodal Translation Task. The main feature of our submission is applying a self-attentive network instead of a recurrent neural network. We evaluate two methods of incorporating the visual features in the model: first, we include the image representation as another input to the network; second, we train the model to predict the visual features and use it a… ▽ More

    Submitted 12 November, 2018; originally announced November 2018.

    Comments: Published at WMT18