Skip to main content

Showing 1–7 of 7 results for author: Gwinnup, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.03075  [pdf, other

    cs.CL

    Detecting Concrete Visual Tokens for Multimodal Machine Translation

    Authors: Braeden Bowen, Vipin Vijayan, Scott Grigsby, Timothy Anderson, Jeremy Gwinnup

    Abstract: The challenge of visual grounding and masking in multimodal machine translation (MMT) systems has encouraged varying approaches to the detection and selection of visually-grounded text tokens for masking. We introduce new methods for detection of visually and contextually relevant (concrete) tokens from source sentences, including detection with natural language processing (NLP), detection with ob… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  2. arXiv:2403.03045  [pdf, other

    cs.CL

    Adding Multimodal Capabilities to a Text-only Translation Model

    Authors: Vipin Vijayan, Braeden Bowen, Scott Grigsby, Timothy Anderson, Jeremy Gwinnup

    Abstract: While most current work in multimodal machine translation (MMT) uses the Multi30k dataset for training and evaluation, we find that the resulting models overfit to the Multi30k dataset to an extreme degree. Consequently, these models perform very badly when evaluated against typical text-only testing sets such as the WMT newstest datasets. In order to perform well on both Multi30k and typical text… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  3. arXiv:2403.03014  [pdf, other

    cs.CL

    The Case for Evaluating Multimodal Translation Models on Text Datasets

    Authors: Vipin Vijayan, Braeden Bowen, Scott Grigsby, Timothy Anderson, Jeremy Gwinnup

    Abstract: A good evaluation framework should evaluate multimodal machine translation (MMT) models by measuring 1) their use of visual information to aid in the translation task and 2) their ability to translate complex sentences such as done for text-only machine translation. However, most current work in MMT is evaluated against the Multi30k testing sets, which do not measure these properties. Namely, the… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  4. arXiv:2306.07198  [pdf, other

    cs.CL

    A Survey of Vision-Language Pre-training from the Lens of Multimodal Machine Translation

    Authors: Jeremy Gwinnup, Kevin Duh

    Abstract: Large language models such as BERT and the GPT series started a paradigm shift that calls for building general-purpose models via pre-training on large datasets, followed by fine-tuning on task-specific datasets. There is now a plethora of large pre-trained models for Natural Language Processing and Computer Vision. Recently, we have seen rapid developments in the joint Vision-Language space as we… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: 10 pages

  5. arXiv:2209.04944  [pdf, other

    cs.CV cs.LG

    Learning When to Say "I Don't Know"

    Authors: Nicholas Kashani Motlagh, Jim Davis, Tim Anderson, Jeremy Gwinnup

    Abstract: We propose a new Reject Option Classification technique to identify and remove regions of uncertainty in the decision space for a given neural classifier and dataset. Such existing formulations employ a learned rejection (remove)/selection (keep) function and require either a known cost for rejecting examples or strong constraints on the accuracy or coverage of the selected examples. We consider a… ▽ More

    Submitted 15 February, 2023; v1 submitted 11 September, 2022; originally announced September 2022.

    Comments: International Symposium on Visual Computing, October 2022

  6. arXiv:1811.00739  [pdf, other

    cs.CL cs.LG

    An Empirical Exploration of Curriculum Learning for Neural Machine Translation

    Authors: Xuan Zhang, Gaurav Kumar, Huda Khayrallah, Kenton Murray, Jeremy Gwinnup, Marianna J Martindale, Paul McNamee, Kevin Duh, Marine Carpuat

    Abstract: Machine translation systems based on deep neural networks are expensive to train. Curriculum learning aims to address this issue by choosing the order in which samples are presented during training to help train better models faster. We adopt a probabilistic view of curriculum learning, which lets us flexibly evaluate the impact of curricula design, and perform an extensive exploration on a German… ▽ More

    Submitted 2 November, 2018; originally announced November 2018.

  7. Freezing Subnetworks to Analyze Domain Adaptation in Neural Machine Translation

    Authors: Brian Thompson, Huda Khayrallah, Antonios Anastasopoulos, Arya D. McCarthy, Kevin Duh, Rebecca Marvin, Paul McNamee, Jeremy Gwinnup, Tim Anderson, Philipp Koehn

    Abstract: To better understand the effectiveness of continued training, we analyze the major components of a neural machine translation system (the encoder, decoder, and each embedding space) and consider each component's contribution to, and capacity for, domain adaptation. We find that freezing any single component during continued training has minimal impact on performance, and that performance is surpri… ▽ More

    Submitted 15 January, 2019; v1 submitted 13 September, 2018; originally announced September 2018.

    Comments: presented at WMT 2018. Please cite using the bib entry from here: http://www.statmt.org/wmt18/bib/WMT013.bib

    Journal ref: Proceedings of the Third Conference on Machine Translation: Research Papers (2018) 124-132