Showing 1–7 of 7 results for author: Almeida, T S

Searching in archive cs.
  1. arXiv:2404.08191  [pdf, other]

    cs.CL

    Measuring Cross-lingual Transfer in Bytes

    Authors: Leandro Rodrigues de Souza, Thales Sales Almeida, Roberto Lotufo, Rodrigo Nogueira

    Abstract: Multilingual pretraining has been a successful solution to the challenges posed by the lack of resources for languages. These models can transfer knowledge to target languages with minimal or no examples. Recent research suggests that monolingual models also have a similar capability, but the mechanisms behind this transfer remain unclear. Some studies have explored factors like language contamina…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: NAACL 2024

  2. arXiv:2403.09887  [pdf, other]

    cs.CL cs.AI

    Sabiá-2: A New Generation of Portuguese Large Language Models

    Authors: Thales Sales Almeida, Hugo Abonizio, Rodrigo Nogueira, Ramon Pires

    Abstract: We introduce Sabiá-2, a family of large language models trained on Portuguese texts. The models are evaluated on a diverse range of exams, including entry-level tests for Brazilian universities, professional certification exams, and graduate-level exams for various disciplines such as accounting, economics, engineering, law and medicine. Our results reveal that our best model so far, Sabiá-2 Mediu…

    Submitted 26 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  3. arXiv:2311.14169  [pdf, other]

    cs.CL cs.AI cs.LG

    Evaluating GPT-4's Vision Capabilities on Brazilian University Admission Exams

    Authors: Ramon Pires, Thales Sales Almeida, Hugo Abonizio, Rodrigo Nogueira

    Abstract: Recent advancements in language models have showcased human-comparable performance in academic entrance exams. However, existing studies often overlook questions that require the integration of visual comprehension, thus compromising the full spectrum and complexity inherent in real-world scenarios. To address this gap, we present a comprehensive framework to evaluate language models on entrance e…

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2303.17003

  4. arXiv:2307.05410  [pdf, other]

    cs.CL

    BLUEX: A benchmark based on Brazilian Leading Universities Entrance eXams

    Authors: Thales Sales Almeida, Thiago Laitz, Giovana K. Bonás, Rodrigo Nogueira

    Abstract: One common trend in recent studies of language models (LMs) is the use of standardized tests for evaluation. However, despite being the fifth most spoken language worldwide, few such evaluations have been conducted in Portuguese. This is mainly due to the lack of high-quality datasets available to the community for carrying out evaluations in Portuguese. To address this gap, we introduce the Brazi…

    Submitted 11 July, 2023; originally announced July 2023.

  5. Sabiá: Portuguese Large Language Models

    Authors: Ramon Pires, Hugo Abonizio, Thales Sales Almeida, Rodrigo Nogueira

    Abstract: As the capabilities of language models continue to advance, it is conceivable that a "one-size-fits-all" model will remain as the main paradigm. For instance, given the vast number of languages worldwide, many of which are low-resource, the prevalent practice is to pretrain a single model on multiple languages. In this paper, we add to the growing body of evidence that challenges this practice, demo…

    Submitted 9 November, 2023; v1 submitted 16 April, 2023; originally announced April 2023.

  6. arXiv:2210.14837  [pdf, other]

    cs.IR cs.LG

    NeuralSearchX: Serving a Multi-billion-parameter Reranker for Multilingual Metasearch at a Low Cost

    Authors: Thales Sales Almeida, Thiago Laitz, João Seródio, Luiz Henrique Bonifacio, Roberto Lotufo, Rodrigo Nogueira

    Abstract: The widespread availability of search APIs (both free and commercial) brings the promise of increased coverage and quality of search results for metasearch engines, while decreasing the maintenance costs of the crawling and indexing infrastructures. However, merging strategies frequently comprise complex pipelines that require careful tuning, which is often overlooked in the literature. In this w…

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: Published as a full paper at the DESIRES 2022 Conference. 13 pages

    Journal ref: DESIRES 2022 - 3rd International Conference on Design of Experimental Search and Information REtrieval Systems, 30-31 August 2022, San Jose, CA, USA

  7. arXiv:2103.07573  [pdf, other]

    eess.IV cond-mat.mtrl-sci cs.CV q-bio.QM

    Mining Artifacts in Mycelium SEM Micrographs

    Authors: Thaicia Stona de Almeida

    Abstract: Mycelium is a promising biomaterial based on fungal mycelium, a highly porous, nanofibrous structure. Scanning electron micrographs are used to characterize its network, but the currently available tools for nanofibrous microstructures do not contemplate the particularities of biomaterials. The adoption of a software for artificial nanofibrous in mycelium characterization adds the uncertainty of i…

    Submitted 12 March, 2021; originally announced March 2021.

    Comments: 7 pages, 9 figures

    MSC Class: 74N15; 62H35; 68U10

    ACM Class: I.4