Skip to main content

Showing 1–13 of 13 results for author: Branco, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.03955  [pdf, ps, other

    cs.CL

    Meta-prompting Optimized Retrieval-augmented Generation

    Authors: João Rodrigues, António Branco

    Abstract: Retrieval-augmented generation resorts to content retrieved from external sources in order to leverage the performance of large language models in downstream tasks. The excessive volume of retrieved content, the possible dispersion of its parts, or their out of focus range may happen nevertheless to eventually have a detrimental rather than an incremental effect. To mitigate this issue and improve… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  2. arXiv:2404.05333  [pdf, ps, other

    cs.CL

    PORTULAN ExtraGLUE Datasets and Models: Kick-starting a Benchmark for the Neural Processing of Portuguese

    Authors: Tomás Osório, Bernardo Leite, Henrique Lopes Cardoso, Luís Gomes, João Rodrigues, Rodrigo Santos, António Branco

    Abstract: Leveraging research on the neural modelling of Portuguese, we contribute a collection of datasets for an array of language processing tasks and a corresponding collection of fine-tuned neural language models on these downstream tasks. To align with mainstream benchmarks in the literature, originally developed in English, and to kick start their Portuguese counterparts, the datasets were machine-tr… ▽ More

    Submitted 8 May, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Preprint - Paper accepted for BUCC 2024

  3. arXiv:2403.08004  [pdf, other

    cs.CL cs.AI cs.CV

    Pix2Pix-OnTheFly: Leveraging LLMs for Instruction-Guided Image Editing

    Authors: Rodrigo Santos, João Silva, António Branco

    Abstract: The combination of language processing and image processing keeps attracting increased interest given recent impressive advances that leverage the combined strengths of both domains of research. Among these advances, the task of editing an image on the basis solely of a natural language instruction stands out as a most challenging endeavour. While recent approaches for this task resort, in one way… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  4. arXiv:2403.01897  [pdf, other

    cs.CL

    Fostering the Ecosystem of Open Neural Encoders for Portuguese with Albertina PT* Family

    Authors: Rodrigo Santos, João Rodrigues, Luís Gomes, João Silva, António Branco, Henrique Lopes Cardoso, Tomás Freitas Osório, Bernardo Leite

    Abstract: To foster the neural encoding of Portuguese, this paper contributes foundation encoder models that represent an expansion of the still very scarce ecosystem of large language models specifically developed for this language that are fully open, in the sense that they are open source and openly distributed for free under an open license for any purpose, thus including research and commercial usages.… ▽ More

    Submitted 5 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  5. arXiv:2402.18766  [pdf, other

    cs.CL

    Advancing Generative AI for Portuguese with Open Decoder Gervásio PT*

    Authors: Rodrigo Santos, João Silva, Luís Gomes, João Rodrigues, António Branco

    Abstract: To advance the neural decoding of Portuguese, in this paper we present a fully open Transformer-based, instruction-tuned decoder model that sets a new state of the art in this respect. To develop this decoder, which we named Gervásio PT*, a strong LLaMA~2 7B model was used as a starting point, and its further improvement through additional training was done over language resources that include new… ▽ More

    Submitted 5 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  6. arXiv:2312.14029  [pdf, other

    cs.DS

    Phylogenetic tree distance computation over succinct representations

    Authors: António Pedro Branco, Cátia Vaz, Alexandre P. Francisco

    Abstract: There are several tools available to infer phylogenetic trees, which depict the evolutionary relationships among biological entities such as viral and bacterial strains in infectious outbreaks, or cancerous cells in tumor progression trees. These tools rely on several inference methods available to produce phylogenetic trees, with resulting trees not being unique. Thus, methods for comparing phylo… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  7. Advancing Neural Encoding of Portuguese with Transformer Albertina PT-*

    Authors: João Rodrigues, Luís Gomes, João Silva, António Branco, Rodrigo Santos, Henrique Lopes Cardoso, Tomás Osório

    Abstract: To advance the neural encoding of Portuguese (PT), and a fortiori the technological preparation of this language for the digital age, we developed a Transformer-based foundation model that sets a new state of the art in this respect for two of its variants, namely European Portuguese from Portugal (PT-PT) and American Portuguese from Brazil (PT-BR). To develop this encoder, which we named Albert… ▽ More

    Submitted 20 June, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  8. arXiv:2209.02495  [pdf, other

    cs.CL

    Transfer Learning of Lexical Semantic Families for Argumentative Discourse Units Identification

    Authors: João Rodrigues, Ruben Branco, António Branco

    Abstract: Argument mining tasks require an informed range of low to high complexity linguistic phenomena and commonsense knowledge. Previous work has shown that pre-trained language models are highly effective at encoding syntactic and semantic linguistic phenomena when applied with transfer learning techniques and built on different pre-training objectives. It remains an issue of how much the existing pre-… ▽ More

    Submitted 24 October, 2022; v1 submitted 6 September, 2022; originally announced September 2022.

  9. arXiv:2103.06924  [pdf, other

    cs.CL

    Anaphoric Binding: an integrated overview

    Authors: António Branco

    Abstract: The interpretation of anaphors depends on their antecedents as the semantic value that an anaphor eventually conveys is co-specified by the value of its antecedent. Interestingly, when occurring in a given syntactic position, different anaphors may have different sets of admissible antecedents. Such differences are the basis for the categorization of anaphoric expressions according to their anapho… ▽ More

    Submitted 11 March, 2021; originally announced March 2021.

  10. arXiv:2011.07997  [pdf, other

    cs.CL

    Comparative Probing of Lexical Semantics Theories for Cognitive Plausibility and Technological Usefulness

    Authors: António Branco, João Rodrigues, Małgorzata Salawa, Ruben Branco, Chakaveh Saedi

    Abstract: Lexical semantics theories differ in advocating that the meaning of words is represented as an inference graph, a feature map** or a vector space, thus raising the question: is it the case that one of these approaches is superior to the others in representing lexical semantics appropriately? Or in its non antagonistic counterpart: could there be a unified account of lexical semantics where these… ▽ More

    Submitted 16 November, 2020; originally announced November 2020.

  11. arXiv:2003.13833  [pdf

    cs.CL cs.AI cs.DL

    The European Language Technology Landscape in 2020: Language-Centric and Human-Centric AI for Cross-Cultural Communication in Multilingual Europe

    Authors: Georg Rehm, Katrin Marheinecke, Stefanie Hegele, Stelios Piperidis, Kalina Bontcheva, Jan Hajič, Khalid Choukri, Andrejs Vasiļjevs, Gerhard Backfried, Christoph Prinz, José Manuel Gómez Pérez, Luc Meertens, Paul Lukowicz, Josef van Genabith, Andrea Lösch, Philipp Slusallek, Morten Irgens, Patrick Gatellier, Joachim Köhler, Laure Le Bars, Dimitra Anastasiou, Albina Auksoriūtė, Núria Bel, António Branco, Gerhard Budin , et al. (22 additional authors not shown)

    Abstract: Multilingualism is a cultural cornerstone of Europe and firmly anchored in the European treaties including full language equality. However, language barriers impacting business, cross-lingual and cross-cultural communication are still omnipresent. Language Technologies (LTs) are a powerful means to break down these barriers. While the last decade has seen various initiatives that created a multitu… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

    Comments: Proceedings of the 12th Language Resources and Evaluation Conference (LREC 2020). To appear

  12. arXiv:1912.00567  [pdf, ps, other

    cs.CL

    Merging External Bilingual Pairs into Neural Machine Translation

    Authors: Tao Wang, Shaohui Kuang, Deyi Xiong, António Branco

    Abstract: As neural machine translation (NMT) is not easily amenable to explicit correction of errors, incorporating pre-specified translations into NMT is widely regarded as a non-trivial challenge. In this paper, we propose and explore three methods to endow NMT with pre-specified bilingual pairs. Instead, for instance, of modifying the beam search algorithm during decoding or making complex modifications… ▽ More

    Submitted 1 December, 2019; originally announced December 2019.

    Comments: 7 pages, 3 figures, 5 tables

    MSC Class: 68T50 ACM Class: I.2.7

  13. arXiv:1711.05380  [pdf, other

    cs.CL

    Attention Focusing for Neural Machine Translation by Bridging Source and Target Embeddings

    Authors: Shaohui Kuang, Junhui Li, António Branco, Weihua Luo, Deyi Xiong

    Abstract: In neural machine translation, a source sequence of words is encoded into a vector from which a target sequence is generated in the decoding phase. Differently from statistical machine translation, the associations between source words and their possible target counterparts are not explicitly stored. Source and target words are at the two ends of a long information processing procedure, mediated b… ▽ More

    Submitted 10 May, 2018; v1 submitted 14 November, 2017; originally announced November 2017.

    Comments: 9 pages, 6 figures. Accepted by ACL2018