Skip to main content

Showing 1–50 of 62 results for author: Specia, L

.
  1. arXiv:2407.00248  [pdf, other

    cs.CL

    DiffuseDef: Improved Robustness to Adversarial Attacks

    Authors: Zhenhao Li, Marek Rei, Lucia Specia

    Abstract: Pretrained language models have significantly advanced performance across various natural language processing tasks. However, adversarial attacks continue to pose a critical challenge to system built using these models, as they can be exploited with carefully crafted adversarial texts. Inspired by the ability of diffusion models to predict and reduce noise in computer vision, we propose a novel an… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  2. arXiv:2401.12874  [pdf, other

    cs.CL cs.AI

    From Understanding to Utilization: A Survey on Explainability for Large Language Models

    Authors: Haoyan Luo, Lucia Specia

    Abstract: Explainability for Large Language Models (LLMs) is a critical yet challenging aspect of natural language processing. As LLMs are increasingly integral to diverse applications, their "black-box" nature sparks significant concerns regarding transparency and ethical use. This survey underscores the imperative for increased explainability in LLMs, delving into both the research on explainability and t… ▽ More

    Submitted 21 February, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  3. arXiv:2211.09878  [pdf, other

    cs.CL

    Reducing Hallucinations in Neural Machine Translation with Feature Attribution

    Authors: Joël Tang, Marina Fomicheva, Lucia Specia

    Abstract: Neural conditional language generation models achieve the state-of-the-art in Neural Machine Translation (NMT) but are highly dependent on the quality of parallel training dataset. When trained on low-quality datasets, these models are prone to various error types, including hallucinations, i.e. outputs that are fluent, but unrelated to the source sentences. These errors are particularly dangerous… ▽ More

    Submitted 14 June, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

  4. arXiv:2210.10836  [pdf, other

    cs.CV cs.LG

    Scene Text Recognition with Semantics

    Authors: Joshua Cesare Placidi, Yishu Miao, Zixu Wang, Lucia Specia

    Abstract: Scene Text Recognition (STR) models have achieved high performance in recent years on benchmark datasets where text images are presented with minimal noise. Traditional STR recognition pipelines take a cropped image as sole input and attempt to identify the characters present. This infrastructure can fail in instances where the input image is noisy or the text is partially obscured. This paper pro… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 11 pages, 7 figures

  5. arXiv:2210.05039  [pdf, other

    cs.LG cs.CV

    Contrastive Video-Language Learning with Fine-grained Frame Sampling

    Authors: Zixu Wang, Yujie Zhong, Yishu Miao, Lin Ma, Lucia Specia

    Abstract: Despite recent progress in video and language representation learning, the weak or sparse correspondence between the two modalities remains a bottleneck in the area. Most video-language models are trained via pair-level loss to predict whether a pair of video and text is aligned. However, even in paired video-text segments, only a subset of the frames are semantically relevant to the corresponding… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: AACL-IJCNLP 2022

  6. arXiv:2206.12469  [pdf, other

    cs.SD cs.CL eess.AS

    Burst2Vec: An Adversarial Multi-Task Approach for Predicting Emotion, Age, and Origin from Vocal Bursts

    Authors: Atijit Anuchitanukul, Lucia Specia

    Abstract: We present Burst2Vec, our multi-task learning approach to predict emotion, age, and origin (i.e., native country/language) from vocal bursts. Burst2Vec utilises pre-trained speech representations to capture acoustic information from raw waveforms and incorporates the concept of model debiasing via adversarial training. Our models achieve a relative 30 % performance gain over baselines using pre-ex… ▽ More

    Submitted 18 October, 2022; v1 submitted 24 June, 2022; originally announced June 2022.

  7. arXiv:2205.00047  [pdf, other

    cs.LG cs.CL cs.CR

    Logically Consistent Adversarial Attacks for Soft Theorem Provers

    Authors: Alexander Gaskell, Yishu Miao, Lucia Specia, Francesca Toni

    Abstract: Recent efforts within the AI community have yielded impressive results towards "soft theorem proving" over natural language sentences using language models. We propose a novel, generative adversarial framework for probing and improving these models' reasoning capabilities. Adversarial attacks in this domain suffer from the logical inconsistency problem, whereby perturbations to the input may alter… ▽ More

    Submitted 29 April, 2022; originally announced May 2022.

    Comments: IJCAI-ECAI 2022

  8. Supervised Visual Attention for Simultaneous Multimodal Machine Translation

    Authors: Veneta Haralampieva, Ozan Caglayan, Lucia Specia

    Abstract: Recently, there has been a surge in research in multimodal machine translation (MMT), where additional modalities such as images are used to improve translation quality of textual systems. A particular use for such multimodal systems is the task of simultaneous machine translation, where visual context has been shown to complement the partial information provided by the source sentence, especially… ▽ More

    Submitted 29 June, 2022; v1 submitted 23 January, 2022; originally announced January 2022.

    Comments: Accepted to Journal of Artificial Intelligence Research (JAIR)

    Journal ref: Journal of Artificial Intelligence Research 74 (2022) 1059-1089

  9. arXiv:2111.12447  [pdf, other

    cs.CL

    Revisiting Contextual Toxicity Detection in Conversations

    Authors: Atijit Anuchitanukul, Julia Ive, Lucia Specia

    Abstract: Understanding toxicity in user conversations is undoubtedly an important problem. Addressing "covert" or implicit cases of toxicity is particularly hard and requires context. Very few previous studies have analysed the influence of conversational context in human perception or in automated detection models. We dive deeper into both these directions. We start by analysing existing contextual datase… ▽ More

    Submitted 18 October, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

  10. arXiv:2110.08226  [pdf, other

    cs.LG cs.CL cs.CV

    Guiding Visual Question Generation

    Authors: Nihir Vedd, Zixu Wang, Marek Rei, Yishu Miao, Lucia Specia

    Abstract: In traditional Visual Question Generation (VQG), most images have multiple concepts (e.g. objects and categories) for which a question could be generated, but models are trained to mimic an arbitrary choice of concept as given in their training data. This makes training difficult and also poses issues for evaluation -- multiple valid questions exist for most images but only one or a few are captur… ▽ More

    Submitted 26 July, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: 14 pages including references and Appendix. 3 figures and 4 tables

  11. arXiv:2109.10859  [pdf, other

    cs.CL cs.AI

    Pushing the Right Buttons: Adversarial Evaluation of Quality Estimation

    Authors: Diptesh Kanojia, Marina Fomicheva, Tharindu Ranasinghe, Frédéric Blain, Constantin Orăsan, Lucia Specia

    Abstract: Current Machine Translation (MT) systems achieve very good results on a growing variety of language pairs and datasets. However, they are known to produce fluent translation outputs that can contain important meaning errors, thus undermining their reliability in practice. Quality Estimation (QE) is the task of automatically assessing the performance of MT systems at test time. Thus, in order to be… ▽ More

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: Accepted to WMT 2021 Conference co-located with EMNLP 2021. 14 pages with a 4 page appendix

  12. arXiv:2109.08627  [pdf, other

    cs.CL

    Classification-based Quality Estimation: Small and Efficient Models for Real-world Applications

    Authors: Shuo Sun, Ahmed El-Kishky, Vishrav Chaudhary, James Cross, Francisco Guzmán, Lucia Specia

    Abstract: Sentence-level Quality estimation (QE) of machine translation is traditionally formulated as a regression task, and the performance of QE models is typically measured by Pearson correlation with human labels. Recent QE models have achieved previously-unseen levels of correlation with human judgments, but they rely on large multilingual contextualized language models that are computationally expens… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  13. arXiv:2109.08120  [pdf, other

    cs.CL

    A Survey of Online Hate Speech through the Causal Lens

    Authors: Antigoni-Maria Founta, Lucia Specia

    Abstract: The societal issue of digital hostility has previously attracted a lot of attention. The topic counts an ample body of literature, yet remains prominent and challenging as ever due to its subjective nature. We posit that a better understanding of this problem will require the use of causal inference frameworks. This survey summarises the relevant research that revolves around estimations of causal… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: Accepted to CI+NLP: First Workshop on Causal Inference and NLP, part of EMNLP 2021

  14. arXiv:2108.12197  [pdf, other

    cs.CL

    Translation Error Detection as Rationale Extraction

    Authors: Marina Fomicheva, Lucia Specia, Nikolaos Aletras

    Abstract: Recent Quality Estimation (QE) models based on multilingual pre-trained representations have achieved very competitive results when predicting the overall quality of translated sentences. Predicting translation errors, i.e. detecting specifically which words are incorrect, is a more challenging task, especially with limited amounts of training data. We hypothesize that, not unlike humans, successf… ▽ More

    Submitted 27 August, 2021; originally announced August 2021.

  15. arXiv:2107.00411  [pdf, other

    cs.CL

    Knowledge Distillation for Quality Estimation

    Authors: Amit Gajbhiye, Marina Fomicheva, Fernando Alva-Manchego, Frédéric Blain, Abiola Obamuyide, Nikolaos Aletras, Lucia Specia

    Abstract: Quality Estimation (QE) is the task of automatically predicting Machine Translation quality in the absence of reference translations, making it applicable in real-time settings, such as translating online social media conversations. Recent success in QE stems from the use of multilingual pre-trained representations, where very large models lead to impressive results. However, the inference time, d… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: ACL Findings 2021

  16. arXiv:2106.03484  [pdf, other

    cs.CL

    BERTGEN: Multi-task Generation through BERT

    Authors: Faidon Mitzalis, Ozan Caglayan, Pranava Madhyastha, Lucia Specia

    Abstract: We present BERTGEN, a novel generative, decoder-only model which extends BERT by fusing multimodal and multilingual pretrained models VL-BERT and M-BERT, respectively. BERTGEN is auto-regressively trained for language generation tasks, namely image captioning, machine translation and multimodal machine translation, under a multitask setting. With a comprehensive set of evaluations, we show that BE… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: Accepted to ACL 2021 Main Conference

  17. arXiv:2105.04780  [pdf, other

    cs.CV cs.CL

    Cross-Modal Generative Augmentation for Visual Question Answering

    Authors: Zixu Wang, Yishu Miao, Lucia Specia

    Abstract: Data augmentation has been shown to effectively improve the performance of multimodal machine learning models. This paper introduces a generative model for data augmentation by leveraging the correlations among multiple modalities. Different from conventional data augmentation approaches that apply low-level operations with deterministic heuristics, our method learns a generator that generates sam… ▽ More

    Submitted 22 October, 2021; v1 submitted 11 May, 2021; originally announced May 2021.

    Comments: BMVC 2021

  18. arXiv:2104.07112  [pdf, other

    cs.CL

    What Makes a Scientific Paper be Accepted for Publication?

    Authors: Panagiotis Fytas, Georgios Rizos, Lucia Specia

    Abstract: Despite peer-reviewing being an essential component of academia since the 1600s, it has repeatedly received criticisms for lack of transparency and consistency. We posit that recent work in machine learning and explainable AI provide tools that enable insights into the decisions from a given peer review process. We start by extracting global explanations in the form of linguistic features that aff… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    MSC Class: 68T50 ACM Class: I.2.7

  19. arXiv:2104.05688  [pdf, other

    cs.CL cs.HC

    Backtranslation Feedback Improves User Confidence in MT, Not Quality

    Authors: Vilém Zouhar, Michal Novák, Matúš Žilinec, Ondřej Bojar, Mateo Obregón, Robin L. Hill, Frédéric Blain, Marina Fomicheva, Lucia Specia, Lisa Yankovskaya

    Abstract: Translating text into a language unknown to the text's author, dubbed outbound translation, is a modern need for which the user experience has significant room for improvement, beyond the basic machine translation facility. We demonstrate this by showing three ways in which user confidence in the outbound translation, as well as its overall final quality, can be affected: backward translation, qua… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 9 pages (excluding references); to appear at NAACL-HWT 2021

  20. Visual Cues and Error Correction for Translation Robustness

    Authors: Zhenhao Li, Marek Rei, Lucia Specia

    Abstract: Neural Machine Translation models are sensitive to noise in the input texts, such as misspelled words and ungrammatical constructions. Existing robustness techniques generally fail when faced with unseen types of noise and their performance degrades on clean texts. In this paper, we focus on three types of realistic noise that are commonly generated by humans and introduce the idea of visual conte… ▽ More

    Submitted 2 May, 2022; v1 submitted 12 March, 2021; originally announced March 2021.

    Comments: Accepted at Findings of EMNLP 2021; add acknowledgements

  21. arXiv:2103.01910  [pdf, other

    cs.CL

    MultiSubs: A Large-scale Multimodal and Multilingual Dataset

    Authors: Josiah Wang, Pranava Madhyastha, Josiel Figueiredo, Chiraag Lala, Lucia Specia

    Abstract: This paper introduces a large-scale multimodal and multilingual dataset that aims to facilitate research on grounding words to images in their contextual usage in language. The dataset consists of images selected to unambiguously illustrate concepts expressed in sentences from movie subtitles. The dataset is a valuable resource as (i) the images are aligned to text fragments rather than whole sent… ▽ More

    Submitted 16 June, 2022; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: Added an n-gram with back-off baseline model to the lexical translation task (Section 7.2.4). Also synchronised the paper structure to the LREC2022 version of this work. This arxiv version is a longer version of the LREC2022 version including more experiments and an additional lexical translation task

  22. arXiv:2102.11403  [pdf, other

    cs.CL

    Exploring Supervised and Unsupervised Rewards in Machine Translation

    Authors: Julia Ive, Zixu Wang, Marina Fomicheva, Lucia Specia

    Abstract: Reinforcement Learning (RL) is a powerful framework to address the discrepancy between loss functions used during training and the final evaluation metrics to be used at test time. When applied to neural Machine Translation (MT), it minimises the mismatch between the cross-entropy loss and non-differentiable evaluation metrics like BLEU. However, the suitability of these metrics as reward function… ▽ More

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: Long paper accepted to EACL 2021, Camera-ready version

  23. arXiv:2102.11387  [pdf, other

    cs.CL

    Exploiting Multimodal Reinforcement Learning for Simultaneous Machine Translation

    Authors: Julia Ive, Andy Mingren Li, Yishu Miao, Ozan Caglayan, Pranava Madhyastha, Lucia Specia

    Abstract: This paper addresses the problem of simultaneous machine translation (SiMT) by exploring two main concepts: (a) adaptive policies to learn a good trade-off between high translation quality and low latency; and (b) visual information to support this process by providing additional (visual) contextual information which may be available before the textual input is produced. For that, we propose a mul… ▽ More

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: Long paper accepted to EACL 2021, Camera-ready version

  24. arXiv:2102.04020  [pdf, other

    cs.CL

    Quality Estimation without Human-labeled Data

    Authors: Yi-Lin Tuan, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Francisco Guzmán, Lucia Specia

    Abstract: Quality estimation aims to measure the quality of translated content without access to a reference translation. This is crucial for machine translation systems in real-world scenarios where high-quality translation is needed. While many approaches exist for quality estimation, they are based on supervised machine learning requiring costly human labelled data. As an alternative, we propose a techni… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

    Comments: Accepted by EACL2021

  25. arXiv:2101.10044  [pdf, other

    cs.CL cs.CV

    Cross-lingual Visual Pre-training for Multimodal Machine Translation

    Authors: Ozan Caglayan, Menekse Kuyu, Mustafa Sercan Amac, Pranava Madhyastha, Erkut Erdem, Aykut Erdem, Lucia Specia

    Abstract: Pre-trained language models have been shown to improve performance in many natural language tasks substantially. Although the early focus of such models was single language pre-training, recent advances have resulted in cross-lingual and visual pre-training methods. In this paper, we combine these two approaches to learn visually-grounded cross-lingual representations. Specifically, we extend the… ▽ More

    Submitted 20 April, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

    Comments: Accepted to EACL 2021 (Camera-ready version)

  26. arXiv:2101.06399  [pdf, other

    cs.CV cs.AI cs.CL

    Latent Variable Models for Visual Question Answering

    Authors: Zixu Wang, Yishu Miao, Lucia Specia

    Abstract: Current work on Visual Question Answering (VQA) explore deterministic approaches conditioned on various types of image and question features. We posit that, in addition to image and question pairs, other modalities are useful for teaching machine to carry out question answering. Hence in this paper, we propose latent variable models for VQA where extra information (e.g. captions and answer categor… ▽ More

    Submitted 26 September, 2021; v1 submitted 16 January, 2021; originally announced January 2021.

    Comments: ICCV21 CLVL: 4th Workshop on Closing the Loop Between Vision and Language

  27. arXiv:2012.07098  [pdf, other

    cs.CV

    MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish

    Authors: Begum Citamak, Ozan Caglayan, Menekse Kuyu, Erkut Erdem, Aykut Erdem, Pranava Madhyastha, Lucia Specia

    Abstract: Automatic generation of video descriptions in natural language, also called video captioning, aims to understand the visual content of the video and produce a natural language sentence depicting the objects and actions in the scene. This challenging integrated vision and language problem, however, has been predominantly addressed for English. The lack of data and the linguistic properties of other… ▽ More

    Submitted 13 December, 2020; originally announced December 2020.

  28. arXiv:2011.09634  [pdf, other

    cs.CV

    Watch and Learn: Map** Language and Noisy Real-world Videos with Self-supervision

    Authors: Yujie Zhong, Linhai Xie, Sen Wang, Lucia Specia, Yishu Miao

    Abstract: In this paper, we teach machines to understand visuals and natural language by learning the map** between sentences and noisy video snippets without explicit annotations. Firstly, we define a self-supervised learning framework that captures the cross-modal information. A novel adversarial learning module is then introduced to explicitly handle the noises in the natural videos, where the subtitle… ▽ More

    Submitted 11 January, 2021; v1 submitted 18 November, 2020; originally announced November 2020.

    Comments: NeurIPS 2020 Self-Supervised Learning Workshop

  29. arXiv:2010.13588  [pdf, ps, other

    cs.CL

    Curious Case of Language Generation Evaluation Metrics: A Cautionary Tale

    Authors: Ozan Caglayan, Pranava Madhyastha, Lucia Specia

    Abstract: Automatic evaluation of language generation systems is a well-studied problem in Natural Language Processing. While novel metrics are proposed every year, a few popular metrics remain as the de facto metrics to evaluate tasks such as image captioning and machine translation, despite their known limitations. This is partly due to ease of use, and partly because researchers expect to see them and kn… ▽ More

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: 7 pages, accepted to COLING 2020

  30. arXiv:2010.04987  [pdf, other

    cs.CL cs.HC cs.LG

    FIND: Human-in-the-Loop Debugging Deep Text Classifiers

    Authors: Piyawat Lertvittayakumjorn, Lucia Specia, Francesca Toni

    Abstract: Since obtaining a perfect training dataset (i.e., a dataset which is considerably large, unbiased, and well-representative of unseen cases) is hardly possible, many real-world text classifiers are trained on the available, yet imperfect, datasets. These classifiers are thus likely to have undesirable properties. For instance, they may have biases against some sub-populations or may not work effect… ▽ More

    Submitted 10 October, 2020; originally announced October 2020.

    Comments: 17 pages including appendices; To appear at EMNLP 2020

  31. arXiv:2010.04480  [pdf, other

    cs.CL

    MLQE-PE: A Multilingual Quality Estimation and Post-Editing Dataset

    Authors: Marina Fomicheva, Shuo Sun, Erick Fonseca, Chrysoula Zerva, Frédéric Blain, Vishrav Chaudhary, Francisco Guzmán, Nina Lopatina, Lucia Specia, André F. T. Martins

    Abstract: We present MLQE-PE, a new dataset for Machine Translation (MT) Quality Estimation (QE) and Automatic Post-Editing (APE). The dataset contains eleven language pairs, with human labels for up to 10,000 translations per language pair in the following formats: sentence-level direct assessments and post-editing effort, and word-level good/bad labels. It also contains the post-edited sentences, as well… ▽ More

    Submitted 11 October, 2021; v1 submitted 9 October, 2020; originally announced October 2020.

  32. arXiv:2009.07310  [pdf, other

    cs.CL

    Simultaneous Machine Translation with Visual Context

    Authors: Ozan Caglayan, Julia Ive, Veneta Haralampieva, Pranava Madhyastha, Loïc Barrault, Lucia Specia

    Abstract: Simultaneous machine translation (SiMT) aims to translate a continuous input text stream into another language with the lowest latency and highest quality possible. The translation thus has to start with an incomplete source text, which is read progressively, creating the need for anticipation. In this paper, we seek to understand whether the addition of visual information can compensate for the m… ▽ More

    Submitted 13 October, 2020; v1 submitted 15 September, 2020; originally announced September 2020.

    Comments: Long paper accepted to EMNLP 2020, Camera-ready version

  33. arXiv:2005.10608  [pdf, other

    cs.CL

    Unsupervised Quality Estimation for Neural Machine Translation

    Authors: Marina Fomicheva, Shuo Sun, Lisa Yankovskaya, Frédéric Blain, Francisco Guzmán, Mark Fishel, Nikolaos Aletras, Vishrav Chaudhary, Lucia Specia

    Abstract: Quality Estimation (QE) is an important component in making Machine Translation (MT) useful in real-world applications, as it is aimed to inform the user on the quality of the MT output at test time. Existing approaches require large amounts of expert annotated data, computation and time for training. As an alternative, we devise an unsupervised approach to QE where no training or access to additi… ▽ More

    Submitted 20 July, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

    Comments: Accepted for publication in TACL. Authors' final version

  34. arXiv:2005.00481  [pdf, other

    cs.CL

    ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations

    Authors: Fernando Alva-Manchego, Louis Martin, Antoine Bordes, Carolina Scarton, Benoît Sagot, Lucia Specia

    Abstract: In order to simplify a sentence, human editors perform multiple rewriting transformations: they split it into several shorter sentences, paraphrase words (i.e. replacing complex words or phrases by simpler synonyms), reorder components, and/or delete information deemed unnecessary. Despite these varied range of possible text alterations, current models for automatic sentence simplification are eva… ▽ More

    Submitted 1 May, 2020; originally announced May 2020.

    Comments: Accepted to ACL 2020 (camera-ready version)

  35. arXiv:1911.12798  [pdf, other

    cs.CL

    Multimodal Machine Translation through Visuals and Speech

    Authors: Umut Sulubacak, Ozan Caglayan, Stig-Arne Grönroos, Aku Rouhe, Desmond Elliott, Lucia Specia, Jörg Tiedemann

    Abstract: Multimodal machine translation involves drawing information from more than one modality, based on the assumption that the additional modalities will contain useful alternative views of the input data. The most prominent tasks in this area are spoken language translation, image-guided translation, and video-guided translation, which exploit audio and visual modalities, respectively. These tasks are… ▽ More

    Submitted 28 November, 2019; originally announced November 2019.

    Comments: 34 pages, 4 tables, 8 figures. Submitted (Nov 2019) to the Machine Translation journal (Springer)

  36. arXiv:1910.13215  [pdf, other

    cs.CL

    Transformer-based Cascaded Multimodal Speech Translation

    Authors: Zixiu Wu, Ozan Caglayan, Julia Ive, Josiah Wang, Lucia Specia

    Abstract: This paper describes the cascaded multimodal speech translation systems developed by Imperial College London for the IWSLT 2019 evaluation campaign. The architecture consists of an automatic speech recognition (ASR) system followed by a Transformer-based multimodal machine translation (MMT) system. While the ASR component is identical across the experiments, the MMT model varies in terms of the wa… ▽ More

    Submitted 8 November, 2019; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: Accepted to IWSLT 2019

  37. arXiv:1910.07482  [pdf, other

    cs.CL cs.NE

    Imperial College London Submission to VATEX Video Captioning Task

    Authors: Ozan Caglayan, Zixiu Wu, Pranava Madhyastha, Josiah Wang, Lucia Specia

    Abstract: This paper describes the Imperial College London team's submission to the 2019' VATEX video captioning challenge, where we first explore two sequence-to-sequence models, namely a recurrent (GRU) model and a transformer model, which generate captions from the I3D action features. We then investigate the effect of drop** the encoder and the attention mechanism and instead conditioning the GRU deco… ▽ More

    Submitted 16 October, 2019; originally announced October 2019.

  38. arXiv:1910.06204  [pdf, other

    cs.CL

    Estimating post-editing effort: a study on human judgements, task-based and reference-based metrics of MT quality

    Authors: Carolina Scarton, Mikel L. Forcada, Miquel Esplà-Gomis, Lucia Specia

    Abstract: Devising metrics to assess translation quality has always been at the core of machine translation (MT) research. Traditional automatic reference-based metrics, such as BLEU, have shown correlations with human judgements of adequacy and fluency and have been paramount for the advancement of MT system development. Crowd-sourcing has popularised and enabled the scalability of metrics based on human j… ▽ More

    Submitted 14 October, 2019; originally announced October 2019.

    Comments: IWSLT 2019, Hong Kong, November 2 and 3, 2019

  39. Improving Neural Machine Translation Robustness via Data Augmentation: Beyond Back Translation

    Authors: Zhenhao Li, Lucia Specia

    Abstract: Neural Machine Translation (NMT) models have been proved strong when translating clean texts, but they are very sensitive to noise in the input. Improving NMT models robustness can be seen as a form of "domain" adaption to noise. The recently created Machine Translation on Noisy Text task corpus provides noisy-clean parallel data for a few language pairs, but this data is very limited in size and… ▽ More

    Submitted 14 October, 2019; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: add missing content & references, fix url line break in footnotes

  40. arXiv:1908.07553  [pdf, other

    cs.CV cs.CL cs.LG

    Phrase Localization Without Paired Training Examples

    Authors: Josiah Wang, Lucia Specia

    Abstract: Localizing phrases in images is an important part of image understanding and can be useful in many applications that require map**s between textual and visual information. Existing work attempts to learn these map**s from examples of phrase-image region correspondences (strong supervision) or from phrase-image pairs (weak supervision). We postulate that such paired annotations are unnecessary,… ▽ More

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: Accepted for oral presentation at the IEEE/CVF International Conference on Computer Vision (ICCV) 2019

  41. arXiv:1908.04567  [pdf, other

    cs.CL

    EASSE: Easier Automatic Sentence Simplification Evaluation

    Authors: Fernando Alva-Manchego, Louis Martin, Carolina Scarton, Lucia Specia

    Abstract: We introduce EASSE, a Python package aiming to facilitate and standardise automatic evaluation and comparison of Sentence Simplification (SS) systems. EASSE provides a single access point to a broad range of evaluation resources: standard automatic metrics for assessing SS outputs (e.g. SARI), word-level accuracy scores for certain simplification transformations, reference-independent quality esti… ▽ More

    Submitted 13 September, 2019; v1 submitted 13 August, 2019; originally announced August 2019.

    Comments: EMNLP-IJCNLP 2019 Demo (Camera-ready Version)

  42. arXiv:1908.01665  [pdf, other

    cs.CL

    Predicting Actions to Help Predict Translations

    Authors: Zixiu Wu, Julia Ive, Josiah Wang, Pranava Madhyastha, Lucia Specia

    Abstract: We address the task of text translation on the How2 dataset using a state of the art transformer-based multimodal approach. The question we ask ourselves is whether visual features can support the translation process, in particular, given that this is a dataset extracted from videos, we focus on the translation of actions, which we believe are poorly captured in current static image-text datasets… ▽ More

    Submitted 18 August, 2019; v1 submitted 5 August, 2019; originally announced August 2019.

    Comments: Accepted to workshop "The How2 Challenge: New Tasks for Vision & Language" of International Conference on Machine Learning 2019

  43. arXiv:1907.09340  [pdf, other

    cs.CL cs.CV cs.LG

    VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions

    Authors: Pranava Madhyastha, Josiah Wang, Lucia Specia

    Abstract: We address the task of evaluating image description generation systems. We propose a novel image-aware metric for this task: VIFIDEL. It estimates the faithfulness of a generated caption with respect to the content of the actual image, based on the semantic similarity between labels of objects depicted in images and words in the description. The metric is also able to take into account the relativ… ▽ More

    Submitted 22 July, 2019; originally announced July 2019.

    Comments: Accepted for publication at ACL 2019

  44. arXiv:1907.01055  [pdf, other

    cs.CL cs.LG

    Is artificial data useful for biomedical Natural Language Processing algorithms?

    Authors: Zixu Wang, Julia Ive, Sumithra Velupillai, Lucia Specia

    Abstract: A major obstacle to the development of Natural Language Processing (NLP) methods in the biomedical domain is data accessibility. This problem can be addressed by generating medical data artificially. Most previous studies have focused on the generation of short clinical text, and evaluation of the data utility has been limited. We propose a generic methodology to guide the generation of clinical t… ▽ More

    Submitted 7 August, 2019; v1 submitted 1 July, 2019; originally announced July 2019.

    Comments: BioNLP 2019

  45. arXiv:1906.07701  [pdf, other

    cs.CL

    Distilling Translations with Visual Awareness

    Authors: Julia Ive, Pranava Madhyastha, Lucia Specia

    Abstract: Previous work on multimodal machine translation has shown that visual information is only needed in very specific cases, for example in the presence of ambiguous words where the textual context is not sufficient. As a consequence, models tend to learn to ignore this information. We propose a translate-and-refine approach to this problem where images are only used by a second stage decoder. This ap… ▽ More

    Submitted 18 June, 2019; originally announced June 2019.

    Comments: accepted to ACL 2019

  46. arXiv:1903.08678  [pdf, other

    cs.CL

    Probing the Need for Visual Context in Multimodal Machine Translation

    Authors: Ozan Caglayan, Pranava Madhyastha, Lucia Specia, Loïc Barrault

    Abstract: Current work on multimodal machine translation (MMT) has suggested that the visual modality is either unnecessary or only marginally beneficial. We posit that this is a consequence of the very simple, short and repetitive sentences used in the only available dataset for the task (Multi30K), rendering the source text sufficient as context. In the general case, however, we believe that it is possibl… ▽ More

    Submitted 2 June, 2019; v1 submitted 20 March, 2019; originally announced March 2019.

    Comments: Accepted to NAACL-HLT 2019, reviewer comments addressed, camera-ready

  47. arXiv:1811.00347  [pdf, other

    cs.CL

    How2: A Large-scale Dataset for Multimodal Language Understanding

    Authors: Ramon Sanabria, Ozan Caglayan, Shruti Palaskar, Desmond Elliott, Loïc Barrault, Lucia Specia, Florian Metze

    Abstract: In this paper, we introduce How2, a multimodal collection of instructional videos with English subtitles and crowdsourced Portuguese translations. We also present integrated sequence-to-sequence baselines for machine translation, automatic speech recognition, spoken language translation, and multimodal summarization. By making available data and code for several multimodal natural language tasks,… ▽ More

    Submitted 7 December, 2018; v1 submitted 1 November, 2018; originally announced November 2018.

  48. arXiv:1810.03148  [pdf, other

    cs.CL

    Assessing Crosslingual Discourse Relations in Machine Translation

    Authors: Karin Sim Smith, Lucia Specia

    Abstract: In an attempt to improve overall translation quality, there has been an increasing focus on integrating more linguistic elements into Machine Translation (MT). While significant progress has been achieved, especially recently with neural models, automatically evaluating the output of such systems is still an open problem. Current practice in MT evaluation relies on a single reference translation,… ▽ More

    Submitted 7 October, 2018; originally announced October 2018.

  49. arXiv:1809.04144  [pdf, other

    cs.CV

    End-to-end Image Captioning Exploits Multimodal Distributional Similarity

    Authors: Pranava Madhyastha, Josiah Wang, Lucia Specia

    Abstract: We hypothesize that end-to-end neural image captioning systems work seemingly well because they exploit and learn `distributional similarity' in a multimodal feature space by map** a test image to similar training images in this space and generating a caption from the same space. To validate our hypothesis, we focus on the `image' side of image captioning, and vary the input image representation… ▽ More

    Submitted 11 September, 2018; originally announced September 2018.

    Comments: Published in BMVC 2018

  50. arXiv:1809.00315  [pdf, other

    cs.CL

    Exploring Gap Filling as a Cheaper Alternative to Reading Comprehension Questionnaires when Evaluating Machine Translation for Gisting

    Authors: Mikel L. Forcada, Carolina Scarton, Lucia Specia, Barry Haddow, Alexandra Birch

    Abstract: A popular application of machine translation (MT) is gisting: MT is consumed as is to make sense of text in a foreign language. Evaluation of the usefulness of MT for gisting is surprisingly uncommon. The classical method uses reading comprehension questionnaires (RCQ), in which informants are asked to answer professionally-written questions in their language about a foreign text that has been mac… ▽ More

    Submitted 2 September, 2018; originally announced September 2018.

    Comments: 12 pages, 3 figures, 2 tables, Proceedings of the Third Conference on Machine Translation (WMT18), 2018

    MSC Class: 68T50 ACM Class: I.2.7