Skip to main content

Showing 1–18 of 18 results for author: Treviso, M

.
  1. arXiv:2407.05489  [pdf, other

    cs.CL

    How Effective are State Space Models for Machine Translation?

    Authors: Hugo Pitorro, Pavlo Vasylenko, Marcos Treviso, André F. T. Martins

    Abstract: Transformers are the current architecture of choice for NLP, but their attention layers do not scale well to long contexts. Recent works propose to replace attention with linear recurrent layers -- this is the case for state space models, which enjoy efficient training and inference. However, it remains unclear whether these models are competitive with transformers in machine translation (MT). In… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  2. arXiv:2406.19482  [pdf, other

    cs.CL

    xTower: A Multilingual LLM for Explaining and Correcting Translation Errors

    Authors: Marcos Treviso, Nuno M. Guerreiro, Sweta Agrawal, Ricardo Rei, José Pombal, Tania Vaz, Helena Wu, Beatriz Silva, Daan van Stigt, André F. T. Martins

    Abstract: While machine translation (MT) systems are achieving increasingly strong performance on benchmarks, they often produce translations with errors and anomalies. Understanding these errors can potentially help improve the translation quality and user experience. This paper introduces xTower, an open large language model (LLM) built on top of TowerBase designed to provide free-text explanations for tr… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  3. arXiv:2309.11925  [pdf, other

    cs.CL

    Scaling up COMETKIWI: Unbabel-IST 2023 Submission for the Quality Estimation Shared Task

    Authors: Ricardo Rei, Nuno M. Guerreiro, José Pombal, Daan van Stigt, Marcos Treviso, Luisa Coheur, José G. C. de Souza, André F. T. Martins

    Abstract: We present the joint contribution of Unbabel and Instituto Superior Técnico to the WMT 2023 Shared Task on Quality Estimation (QE). Our team participated on all tasks: sentence- and word-level quality prediction (task 1) and fine-grained error span detection (task 2). For all tasks, we build on the COMETKIWI-22 model (Rei et al., 2022b). Our multilingual approaches are ranked first for all tasks,… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  4. arXiv:2305.17075  [pdf, other

    cs.CL

    CREST: A Joint Framework for Rationalization and Counterfactual Text Generation

    Authors: Marcos Treviso, Alexis Ross, Nuno M. Guerreiro, André F. T. Martins

    Abstract: Selective rationales and counterfactual examples have emerged as two effective, complementary classes of interpretability methods for analyzing and training NLP models. However, prior work has not explored how these methods can be integrated to combine their complementary advantages. We overcome this limitation by introducing CREST (ContRastive Edits with Sparse raTionalization), a joint framework… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 (main)

  5. arXiv:2305.11806  [pdf, other

    cs.CL

    The Inside Story: Towards Better Understanding of Machine Translation Neural Evaluation Metrics

    Authors: Ricardo Rei, Nuno M. Guerreiro, Marcos Treviso, Luisa Coheur, Alon Lavie, André F. T. Martins

    Abstract: Neural metrics for machine translation evaluation, such as COMET, exhibit significant improvements in their correlation with human judgments, as compared to traditional metrics based on lexical overlap, such as BLEU. Yet, neural metrics are, to a great extent, "black boxes" returning a single sentence-level score without transparency about the decision-making process. In this work, we develop and… ▽ More

    Submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023

  6. arXiv:2209.06243  [pdf, other

    cs.CL cs.LG

    CometKiwi: IST-Unbabel 2022 Submission for the Quality Estimation Shared Task

    Authors: Ricardo Rei, Marcos Treviso, Nuno M. Guerreiro, Chrysoula Zerva, Ana C. Farinha, Christine Maroti, José G. C. de Souza, Taisiya Glushkova, Duarte M. Alves, Alon Lavie, Luisa Coheur, André F. T. Martins

    Abstract: We present the joint contribution of IST and Unbabel to the WMT 2022 Shared Task on Quality Estimation (QE). Our team participated on all three subtasks: (i) Sentence and Word-level Quality Prediction; (ii) Explainable QE; and (iii) Critical Error Detection. For all tasks we build on top of the COMET framework, connecting it with the predictor-estimator architecture of OpenKiwi, and equip** it w… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: WMT 2022 Quality Estimation shared task

  7. arXiv:2209.00099  [pdf, other

    cs.CL

    Efficient Methods for Natural Language Processing: A Survey

    Authors: Marcos Treviso, Ji-Ung Lee, Tianchu Ji, Betty van Aken, Qingqing Cao, Manuel R. Ciosici, Michael Hassid, Kenneth Heafield, Sara Hooker, Colin Raffel, Pedro H. Martins, André F. T. Martins, Jessica Zosa Forde, Peter Milder, Edwin Simpson, Noam Slonim, Jesse Dodge, Emma Strubell, Niranjan Balasubramanian, Leon Derczynski, Iryna Gurevych, Roy Schwartz

    Abstract: Recent work in natural language processing (NLP) has yielded appealing results from scaling model parameters and training data; however, using only scale to improve performance means that resource consumption also grows. Such resources include data, time, storage, or energy, all of which are naturally limited and unevenly distributed. This motivates research into efficient methods that require few… ▽ More

    Submitted 24 March, 2023; v1 submitted 31 August, 2022; originally announced September 2022.

    Comments: Accepted at TACL, pre publication version

  8. arXiv:2204.10810  [pdf, other

    cs.LG cs.CL cs.CV

    Learning to Scaffold: Optimizing Model Explanations for Teaching

    Authors: Patrick Fernandes, Marcos Treviso, Danish Pruthi, André F. T. Martins, Graham Neubig

    Abstract: Modern machine learning models are opaque, and as a result there is a burgeoning academic subfield on methods that explain these models' behavior. However, what is the precise goal of providing such explanations, and how can we demonstrate that explanations achieve this goal? Some research argues that explanations should help teach a student (either human or machine) to simulate the model being ex… ▽ More

    Submitted 29 November, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: 10 pages. NeurIPS 2022

  9. arXiv:2109.12188  [pdf, other

    cs.CL

    Predicting Attention Sparsity in Transformers

    Authors: Marcos Treviso, António Góis, Patrick Fernandes, Erick Fonseca, André F. T. Martins

    Abstract: Transformers' quadratic complexity with respect to the input sequence length has motivated a body of work on efficient sparse approximations to softmax. An alternative path, used by entmax transformers, consists of having built-in exact sparse attention; however this approach still requires quadratic computation. In this paper, we propose Sparsefinder, a simple model trained to identify the sparsi… ▽ More

    Submitted 21 April, 2022; v1 submitted 24 September, 2021; originally announced September 2021.

    Comments: SPNLP22

  10. arXiv:2108.01988  [pdf, other

    cs.LG cs.AI stat.ML

    Sparse Continuous Distributions and Fenchel-Young Losses

    Authors: André F. T. Martins, Marcos Treviso, António Farinhas, Pedro M. Q. Aguiar, Mário A. T. Figueiredo, Mathieu Blondel, Vlad Niculae

    Abstract: Exponential families are widely used in machine learning, including many distributions in continuous and discrete domains (e.g., Gaussian, Dirichlet, Poisson, and categorical distributions via the softmax transformation). Distributions in each of these families have fixed support. In contrast, for finite domains, recent work on sparse alternatives to softmax (e.g., sparsemax, $α$-entmax, and fused… ▽ More

    Submitted 4 August, 2022; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: JMLR 2022 camera ready version. arXiv admin note: text overlap with arXiv:2006.07214

  11. arXiv:2006.07214  [pdf, other

    cs.LG cs.CL cs.CV stat.ML

    Sparse and Continuous Attention Mechanisms

    Authors: André F. T. Martins, António Farinhas, Marcos Treviso, Vlad Niculae, Pedro M. Q. Aguiar, Mário A. T. Figueiredo

    Abstract: Exponential families are widely used in machine learning; they include many distributions in continuous and discrete domains (e.g., Gaussian, Dirichlet, Poisson, and categorical distributions via the softmax transformation). Distributions in each of these families have fixed support. In contrast, for finite domains, there has been recent work on sparse alternatives to softmax (e.g. sparsemax and a… ▽ More

    Submitted 29 October, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: Accepted for spotlight presentation at NeurIPS 2020

  12. arXiv:2004.13876  [pdf, other

    cs.CL

    The Explanation Game: Towards Prediction Explainability through Sparse Communication

    Authors: Marcos V. Treviso, André F. T. Martins

    Abstract: Explainability is a topic of growing importance in NLP. In this work, we provide a unified perspective of explainability as a communication problem between an explainer and a layperson about a classifier's decision. We use this framework to compare several prior approaches for extracting explanations, including gradient methods, representation erasure, and attention mechanisms, in terms of their c… ▽ More

    Submitted 12 October, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: BlackBoxNLP 2020

  13. arXiv:1907.10352  [pdf, other

    cs.CL

    Unbabel's Participation in the WMT19 Translation Quality Estimation Shared Task

    Authors: Fabio Kepler, Jonay Trénous, Marcos Treviso, Miguel Vera, António Góis, M. Amin Farajian, António V. Lopes, André F. T. Martins

    Abstract: We present the contribution of the Unbabel team to the WMT 2019 Shared Task on Quality Estimation. We participated on the word, sentence, and document-level tracks, encompassing 3 language pairs: English-German, English-Russian, and English-French. Our submissions build upon the recent OpenKiwi framework: we combine linear, neural, and predictor-estimator systems with new transfer learning approac… ▽ More

    Submitted 11 September, 2019; v1 submitted 24 July, 2019; originally announced July 2019.

    Comments: In Proceedings of the Fourth Conference on Machine Translation (WMT) 2019: https://www.aclweb.org/anthology/W19-5406/

  14. arXiv:1902.08646  [pdf, other

    cs.CL cs.LG

    OpenKiwi: An Open Source Framework for Quality Estimation

    Authors: Fábio Kepler, Jonay Trénous, Marcos Treviso, Miguel Vera, André F. T. Martins

    Abstract: We introduce OpenKiwi, a PyTorch-based open source framework for translation quality estimation. OpenKiwi supports training and testing of word-level and sentence-level quality estimation systems, implementing the winning systems of the WMT 2015-18 quality estimation campaigns. We benchmark OpenKiwi on two datasets from WMT 2018 (English-German SMT and NMT), yielding state-of-the-art performance o… ▽ More

    Submitted 26 August, 2019; v1 submitted 22 February, 2019; originally announced February 2019.

    Comments: Published at the Annual Meeting of the Association for Computational Linguistics (ACL) 2019: System Demonstrations (https://aclweb.org/anthology/papers/P/P19/P19-3020/)

  15. arXiv:1708.06025  [pdf, ps, other

    cs.CL

    Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks

    Authors: Nathan Hartmann, Erick Fonseca, Christopher Shulby, Marcos Treviso, Jessica Rodrigues, Sandra Aluisio

    Abstract: Word embeddings have been found to provide meaningful representations for words in an efficient way; therefore, they have become common in Natural Language Processing sys- tems. In this paper, we evaluated different word embedding models trained on a large Portuguese corpus, including both Brazilian and European variants. We trained 31 word embedding models using FastText, GloVe, Wang2Vec and Word… ▽ More

    Submitted 20 August, 2017; originally announced August 2017.

    Comments: 7 pages, STIL 2017 Full paper

  16. arXiv:1708.04704  [pdf, other

    cs.CL

    Evaluating Word Embeddings for Sentence Boundary Detection in Speech Transcripts

    Authors: Marcos V. Treviso, Christopher D. Shulby, Sandra M. Aluisio

    Abstract: This paper is motivated by the automation of neuropsychological tests involving discourse analysis in the retellings of narratives by patients with potential cognitive impairment. In this scenario the task of sentence boundary detection in speech transcripts is important as discourse analysis involves the application of Natural Language Processing tools, such as taggers and parsers, which depend o… ▽ More

    Submitted 15 August, 2017; originally announced August 2017.

    Comments: Accepted on STIL 2017

  17. arXiv:1707.02657  [pdf, other

    cs.CL cs.AI cs.LG

    PELESent: Cross-domain polarity classification using distant supervision

    Authors: Edilson A. Corrêa Jr, Vanessa Q. Marinho, Leandro B. dos Santos, Thales F. C. Bertaglia, Marcos V. Treviso, Henrico B. Brum

    Abstract: The enormous amount of texts published daily by Internet users has fostered the development of methods to analyze this content in several natural language processing areas, such as sentiment analysis. The main goal of this task is to classify the polarity of a message. Even though many approaches have been proposed for sentiment analysis, some of the most successful ones rely on the availability o… ▽ More

    Submitted 9 July, 2017; originally announced July 2017.

    Comments: Accepted for publication in BRACIS 2017

  18. arXiv:1610.00211  [pdf, other

    cs.CL

    Sentence Segmentation in Narrative Transcripts from Neuropsychological Tests using Recurrent Convolutional Neural Networks

    Authors: Marcos Vinícius Treviso, Christopher Shulby, Sandra Maria Aluísio

    Abstract: Automated discourse analysis tools based on Natural Language Processing (NLP) aiming at the diagnosis of language-impairing dementias generally extract several textual metrics of narrative transcripts. However, the absence of sentence boundary segmentation in the transcripts prevents the direct application of NLP methods which rely on these marks to function properly, such as taggers and parsers.… ▽ More

    Submitted 15 August, 2017; v1 submitted 1 October, 2016; originally announced October 2016.

    Comments: EACL 2017

    MSC Class: 68T50