Skip to main content

Showing 1–12 of 12 results for author: Farinhas, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.00049  [pdf, other

    cs.CL cs.LG

    QUEST: Quality-Aware Metropolis-Hastings Sampling for Machine Translation

    Authors: Gonçalo R. A. Faria, Sweta Agrawal, António Farinhas, Ricardo Rei, José G. C. de Souza, André F. T. Martins

    Abstract: An important challenge in machine translation (MT) is to generate high-quality and diverse translations. Prior work has shown that the estimated likelihood from the MT model correlates poorly with translation quality. In contrast, quality evaluation metrics (such as COMET or BLEURT) exhibit high correlations with human judgments, which has motivated their use as rerankers (such as quality-aware an… ▽ More

    Submitted 28 May, 2024; originally announced June 2024.

  2. arXiv:2405.18348  [pdf, other

    cs.CL

    Can Automatic Metrics Assess High-Quality Translations?

    Authors: Sweta Agrawal, António Farinhas, Ricardo Rei, André F. T. Martins

    Abstract: Automatic metrics for evaluating translation quality are typically validated by measuring how well they correlate with human assessments. However, correlation methods tend to capture only the ability of metrics to differentiate between good and bad source-translation pairs, overlooking their reliability in distinguishing alternative translations for the same source. In this paper, we confirm that… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: work in progress

  3. arXiv:2405.01976  [pdf, other

    cs.CL cs.LG

    Conformal Prediction for Natural Language Processing: A Survey

    Authors: Margarida M. Campos, António Farinhas, Chrysoula Zerva, Mário A. T. Figueiredo, André F. T. Martins

    Abstract: The rapid proliferation of large language models and natural language processing (NLP) applications creates a crucial need for uncertainty quantification to mitigate risks such as hallucinations and to enhance decision-making reliability in critical applications. Conformal prediction is emerging as a theoretically sound and practically useful framework, combining flexibility with strong statistica… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  4. arXiv:2311.09132  [pdf, other

    cs.CL

    Aligning Neural Machine Translation Models: Human Feedback in Training and Inference

    Authors: Miguel Moura Ramos, Patrick Fernandes, António Farinhas, André F. T. Martins

    Abstract: Reinforcement learning from human feedback (RLHF) is a recent technique to improve the quality of the text generated by a language model, making it closer to what humans would generate. A core ingredient in RLHF's success in aligning and improving large language models (LLMs) is its reward model, trained using human feedback on model outputs. In machine translation (MT), where metrics trained from… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 14 pages, work-in-progress

  5. arXiv:2310.11430  [pdf, other

    cs.CL

    An Empirical Study of Translation Hypothesis Ensembling with Large Language Models

    Authors: António Farinhas, José G. C. de Souza, André F. T. Martins

    Abstract: Large language models (LLMs) are becoming a one-fits-many solution, but they sometimes hallucinate or produce unreliable output. In this paper, we investigate how hypothesis ensembling can improve the quality of the generated text for the specific problem of LLM-based machine translation. We experiment with several techniques for ensembling hypotheses produced by LLMs such as ChatGPT, LLaMA, and A… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 (main conference)

  6. arXiv:2310.01262  [pdf, other

    cs.LG stat.ML

    Non-Exchangeable Conformal Risk Control

    Authors: António Farinhas, Chrysoula Zerva, Dennis Ulmer, André F. T. Martins

    Abstract: Split conformal prediction has recently sparked great interest due to its ability to provide formally guaranteed uncertainty sets or intervals for predictions made by black-box neural models, ensuring a predefined probability of containing the actual ground truth. While the original formulation assumes data exchangeability, some extensions handle non-exchangeable data, which is often the case in m… ▽ More

    Submitted 26 January, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  7. arXiv:2305.00955  [pdf, other

    cs.CL cs.AI cs.LG

    Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation

    Authors: Patrick Fernandes, Aman Madaan, Emmy Liu, António Farinhas, Pedro Henrique Martins, Amanda Bertsch, José G. C. de Souza, Shuyan Zhou, Tongshuang Wu, Graham Neubig, André F. T. Martins

    Abstract: Many recent advances in natural language generation have been fueled by training large language models on internet-scale data. However, this paradigm can lead to models that generate toxic, inaccurate, and unhelpful content, and automatic evaluation metrics often fail to identify these behaviors. As models become more capable, human feedback is an invaluable signal for evaluating and improving mod… ▽ More

    Submitted 31 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: Work in Progress

  8. arXiv:2205.00978  [pdf, other

    cs.CL

    Quality-Aware Decoding for Neural Machine Translation

    Authors: Patrick Fernandes, António Farinhas, Ricardo Rei, José G. C. de Souza, Perez Ogayo, Graham Neubig, André F. T. Martins

    Abstract: Despite the progress in machine translation quality estimation and evaluation in the last years, decoding in neural machine translation (NMT) is mostly oblivious to this and centers around finding the most probable translation according to the model (MAP decoding), approximated with beam search. In this paper, we bring together these two lines of research and propose quality-aware decoding for NMT… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

    Comments: NAACL2022

  9. arXiv:2108.02658  [pdf, other

    cs.LG

    Sparse Communication via Mixed Distributions

    Authors: António Farinhas, Wilker Aziz, Vlad Niculae, André F. T. Martins

    Abstract: Neural networks and other machine learning models compute continuous representations, while humans communicate mostly through discrete symbols. Reconciling these two forms of communication is desirable for generating human-readable interpretations or learning discrete latent variable models, while maintaining end-to-end differentiability. Some existing approaches (such as the Gumbel-Softmax transf… ▽ More

    Submitted 11 February, 2022; v1 submitted 5 August, 2021; originally announced August 2021.

    Comments: Accepted for oral presentation at ICLR 2022

  10. arXiv:2108.01988  [pdf, other

    cs.LG cs.AI stat.ML

    Sparse Continuous Distributions and Fenchel-Young Losses

    Authors: André F. T. Martins, Marcos Treviso, António Farinhas, Pedro M. Q. Aguiar, Mário A. T. Figueiredo, Mathieu Blondel, Vlad Niculae

    Abstract: Exponential families are widely used in machine learning, including many distributions in continuous and discrete domains (e.g., Gaussian, Dirichlet, Poisson, and categorical distributions via the softmax transformation). Distributions in each of these families have fixed support. In contrast, for finite domains, recent work on sparse alternatives to softmax (e.g., sparsemax, $α$-entmax, and fused… ▽ More

    Submitted 4 August, 2022; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: JMLR 2022 camera ready version. arXiv admin note: text overlap with arXiv:2006.07214

  11. arXiv:2104.03046  [pdf, other

    cs.CV cs.LG

    Multimodal Continuous Visual Attention Mechanisms

    Authors: António Farinhas, André F. T. Martins, Pedro M. Q. Aguiar

    Abstract: Visual attention mechanisms are a key component of neural network models for computer vision. By focusing on a discrete set of objects or image regions, these mechanisms identify the most relevant features and use them to build more powerful representations. Recently, continuous-domain alternatives to discrete attention models have been proposed, which exploit the continuity of images. These appro… ▽ More

    Submitted 7 April, 2021; originally announced April 2021.

  12. arXiv:2006.07214  [pdf, other

    cs.LG cs.CL cs.CV stat.ML

    Sparse and Continuous Attention Mechanisms

    Authors: André F. T. Martins, António Farinhas, Marcos Treviso, Vlad Niculae, Pedro M. Q. Aguiar, Mário A. T. Figueiredo

    Abstract: Exponential families are widely used in machine learning; they include many distributions in continuous and discrete domains (e.g., Gaussian, Dirichlet, Poisson, and categorical distributions via the softmax transformation). Distributions in each of these families have fixed support. In contrast, for finite domains, there has been recent work on sparse alternatives to softmax (e.g. sparsemax and a… ▽ More

    Submitted 29 October, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: Accepted for spotlight presentation at NeurIPS 2020