Skip to main content

Showing 1–8 of 8 results for author: Correia, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.17771  [pdf, other

    cs.CL

    Supervising the Centroid Baseline for Extractive Multi-Document Summarization

    Authors: Simão Gonçalves, Gonçalo Correia, Diogo Pernes, Afonso Mendes

    Abstract: The centroid method is a simple approach for extractive multi-document summarization and many improvements to its pipeline have been proposed. We further refine it by adding a beam search process to the sentence selection and also a centroid estimation attention model that leads to improved results. We demonstrate this in several multi-document summarization datasets, including in a multilingual s… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Accepted at "The 4th New Frontiers in Summarization (with LLMs) Workshop"

  2. arXiv:2311.14225  [pdf

    cs.MA

    An Agent-Based Discrete Event Simulation of Teleoperated Driving in Freight Transport Operations: The Fleet Sizing Problem

    Authors: Bahman Madadi, Ali Nadi, Gonçalo Homem de Almeida Correia, Thierry Verduijn, Lóránt Tavasszy

    Abstract: Teleoperated or remote-controlled driving complements automated driving and acts as transitional technology toward full automation. An economic advantage of teleoperated driving in logistics operations lies in managing fleets with fewer teleoperators compared to vehicles with in-vehicle drivers. This alleviates growing truck driver shortage problems in the logistics industry and saves costs. Howev… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

  3. arXiv:2310.00100  [pdf, other

    cs.CL cs.AI

    Multilingual Natural Language Processing Model for Radiology Reports -- The Summary is all you need!

    Authors: Mariana Lindo, Ana Sofia Santos, André Ferreira, Jianning Li, Gijs Luijten, Gustavo Correia, Moon Kim, Benedikt Michael Schaarschmidt, Cornelius Deuschl, Johannes Haubold, Jens Kleesiek, Jan Egger, Victor Alves

    Abstract: The impression section of a radiology report summarizes important radiology findings and plays a critical role in communicating these findings to physicians. However, the preparation of these summaries is time-consuming and error-prone for radiologists. Recently, numerous models for radiology report summarization have been developed. Nevertheless, there is currently no model that can summarize the… ▽ More

    Submitted 13 January, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: 10 pages, 1 figure, 3 tables

  4. arXiv:2303.06024  [pdf

    cs.NE cs.GT cs.LG math.OC

    A hybrid deep-learning-metaheuristic framework for bi-level network design problems

    Authors: Bahman Madadi, Goncalo Homem de Almeida Correia

    Abstract: This study proposes a hybrid deep-learning-metaheuristic framework with a bi-level architecture for road network design problems (NDPs). We train a graph neural network (GNN) to approximate the solution of the user equilibrium (UE) traffic assignment problem and use inferences made by the trained model to calculate fitness function evaluations of a genetic algorithm (GA) to approximate solutions f… ▽ More

    Submitted 10 August, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: Two case studies added, intro, discussion and conclusion extended, details added to method and experiments, typos fixed, title revised, references added

  5. arXiv:2007.01919  [pdf, other

    cs.LG stat.ML

    Efficient Marginalization of Discrete and Structured Latent Variables via Sparsity

    Authors: Gonçalo M. Correia, Vlad Niculae, Wilker Aziz, André F. T. Martins

    Abstract: Training neural network models with discrete (categorical or structured) latent variables can be computationally challenging, due to the need for marginalization over large or combinatorial sets. To circumvent this issue, one typically resorts to sampling-based approximations of the true marginal, requiring noisy gradient estimators (e.g., score function estimator) or continuous relaxations with l… ▽ More

    Submitted 28 December, 2020; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: Accepted for spotlight presentation at NeurIPS 2020

  6. arXiv:1909.00015  [pdf, other

    cs.CL stat.ML

    Adaptively Sparse Transformers

    Authors: Gonçalo M. Correia, Vlad Niculae, André F. T. Martins

    Abstract: Attention mechanisms have become ubiquitous in NLP. Recent architectures, notably the Transformer, learn powerful context-aware word representations through layered, multi-headed attention. The multiple heads learn diverse types of word relationships. However, with standard softmax attention, all attention heads are dense, assigning a non-zero weight to all context words. In this work, we introduc… ▽ More

    Submitted 6 September, 2019; v1 submitted 30 August, 2019; originally announced September 2019.

    Comments: Conference on Empirical Methods in Natural Language Processing (EMNLP), 2019, Hong Kong, China

  7. arXiv:1906.06253  [pdf, other

    cs.CL cs.LG

    A Simple and Effective Approach to Automatic Post-Editing with Transfer Learning

    Authors: Gonçalo M. Correia, André F. T. Martins

    Abstract: Automatic post-editing (APE) seeks to automatically refine the output of a black-box machine translation (MT) system through human post-edits. APE systems are usually trained by complementing human post-edited data with large, artificial data generated through back-translations, a time-consuming process often no easier than training an MT system from scratch. In this paper, we propose an alternati… ▽ More

    Submitted 14 June, 2019; originally announced June 2019.

    Comments: In proceedings of ACL 2019

  8. arXiv:1905.13068  [pdf, other

    cs.CL

    Unbabel's Submission to the WMT2019 APE Shared Task: BERT-based Encoder-Decoder for Automatic Post-Editing

    Authors: António V. Lopes, M. Amin Farajian, Gonçalo M. Correia, Jonay Trenous, André F. T. Martins

    Abstract: This paper describes Unbabel's submission to the WMT2019 APE Shared Task for the English-German language pair. Following the recent rise of large, powerful, pre-trained models, we adapt the BERT pretrained model to perform Automatic Post-Editing in an encoder-decoder framework. Analogously to dual-encoder architectures we develop a BERT-based encoder-decoder (BED) model in which a single pretraine… ▽ More

    Submitted 29 June, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: Updated sections 2.2 and 4