Skip to main content

Showing 1–41 of 41 results for author: Lotufo, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10806  [pdf, ps, other

    cs.CL cs.AI cs.IR

    ptt5-v2: A Closer Look at Continued Pretraining of T5 Models for the Portuguese Language

    Authors: Marcos Piau, Roberto Lotufo, Rodrigo Nogueira

    Abstract: Despite advancements in Natural Language Processing (NLP) and the growing availability of pretrained models, the English language remains the primary focus of model development. Continued pretraining on language-specific corpora provides a practical solution for adapting models to other languages. However, the impact of different pretraining settings on downstream tasks remains underexplored. This… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2404.08191  [pdf, other

    cs.CL

    Measuring Cross-lingual Transfer in Bytes

    Authors: Leandro Rodrigues de Souza, Thales Sales Almeida, Roberto Lotufo, Rodrigo Nogueira

    Abstract: Multilingual pretraining has been a successful solution to the challenges posed by the lack of resources for languages. These models can transfer knowledge to target languages with minimal or no examples. Recent research suggests that monolingual models also have a similar capability, but the mechanisms behind this transfer remain unclear. Some studies have explored factors like language contamina… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: NAACL 2024

  3. arXiv:2404.06976  [pdf, other

    cs.IR

    Quati: A Brazilian Portuguese Information Retrieval Dataset from Native Speakers

    Authors: Mirelle Bueno, Eduardo Seiti de Oliveira, Rodrigo Nogueira, Roberto A. Lotufo, Jayr Alencar Pereira

    Abstract: Despite Portuguese being one of the most spoken languages in the world, there is a lack of high-quality information retrieval datasets in that language. We present Quati, a dataset specifically designed for the Brazilian Portuguese language. It comprises a collection of queries formulated by native speakers and a curated set of documents sourced from a selection of high-quality Brazilian Portugues… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 22 pages

  4. arXiv:2402.07859  [pdf, other

    cs.CL cs.AI

    Lissard: Long and Simple Sequential Reasoning Datasets

    Authors: Mirelle Bueno, Roberto Lotufo, Rodrigo Nogueira

    Abstract: Language models are now capable of solving tasks that require dealing with long sequences consisting of hundreds of thousands of tokens. However, they often fail on tasks that require repetitive use of simple rules, even on sequences that are much shorter than those seen during training. For example, state-of-the-art LLMs can find common items in two lists with up to 20 items but fail when lists h… ▽ More

    Submitted 20 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  5. arXiv:2402.06334  [pdf, other

    cs.IR cs.AI cs.CL

    ExaRanker-Open: Synthetic Explanation for IR using Open-Source LLMs

    Authors: Fernando Ferraretto, Thiago Laitz, Roberto Lotufo, Rodrigo Nogueira

    Abstract: ExaRanker recently introduced an approach to training information retrieval (IR) models, incorporating natural language explanations as additional labels. The method addresses the challenge of limited labeled examples, leading to improvements in the effectiveness of IR models. However, the initial results were based on proprietary language models such as GPT-3.5, which posed constraints on dataset… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  6. arXiv:2401.06910  [pdf, other

    cs.IR

    InRanker: Distilled Rankers for Zero-shot Information Retrieval

    Authors: Thiago Laitz, Konstantinos Papakostas, Roberto Lotufo, Rodrigo Nogueira

    Abstract: Despite multi-billion parameter neural rankers being common components of state-of-the-art information retrieval pipelines, they are rarely used in production due to the enormous amount of compute required for inference. In this work, we propose a new method for distilling large rankers into their smaller versions focusing on out-of-domain effectiveness. We introduce InRanker, a version of monoT5… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  7. arXiv:2401.05273  [pdf, other

    cs.CL cs.AI

    INACIA: Integrating Large Language Models in Brazilian Audit Courts: Opportunities and Challenges

    Authors: Jayr Pereira, Andre Assumpcao, Julio Trecenti, Luiz Airosa, Caio Lente, Jhonatan Cléto, Guilherme Dobins, Rodrigo Nogueira, Luis Mitchell, Roberto Lotufo

    Abstract: This paper introduces INACIA (Instrução Assistida com Inteligência Artificial), a groundbreaking system designed to integrate Large Language Models (LLMs) into the operational framework of Brazilian Federal Court of Accounts (TCU). The system automates various stages of case analysis, including basic information extraction, admissibility examination, Periculum in mora and Fumus boni iuris analyses… ▽ More

    Submitted 26 February, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

  8. arXiv:2312.06907  [pdf, other

    eess.AS cs.SD

    w2v-SELD: A Sound Event Localization and Detection Framework for Self-Supervised Spatial Audio Pre-Training

    Authors: Orlem Lima dos Santos, Karen Rosero, Roberto de Alencar Lotufo

    Abstract: Sound Event Detection and Localization (SELD) constitutes a complex task that depends on extensive multichannel audio recordings with annotated sound events and their respective locations. In this paper, we introduce a self-supervised approach for SELD adapted from the pre-training methodology of wav2vec 2.0, which learns representations directly from raw audio data, eliminating the need for super… ▽ More

    Submitted 29 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: 17 pages, 5 figures

  9. arXiv:2312.02365  [pdf, other

    eess.IV cs.CV

    MEDPSeg: Hierarchical polymorphic multitask learning for the segmentation of ground-glass opacities, consolidation, and pulmonary structures on computed tomography

    Authors: Diedre S. Carmo, Jean A. Ribeiro, Alejandro P. Comellas, Joseph M. Reinhardt, Sarah E. Gerard, Letícia Rittner, Roberto A. Lotufo

    Abstract: The COVID-19 pandemic response highlighted the potential of deep learning methods in facilitating the diagnosis, prognosis and understanding of lung diseases through automated segmentation of pulmonary structures and lesions in chest computed tomography (CT). Automated separation of lung lesion into ground-glass opacity (GGO) and consolidation is hindered due to the labor-intensive and subjective… ▽ More

    Submitted 25 March, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: This manuscript is under review and might change in the future

  10. arXiv:2310.09446  [pdf, other

    eess.IV cs.CV

    Automatic segmentation of lung findings in CT and application to Long COVID

    Authors: Diedre S. Carmo, Rosarie A. Tudas, Alejandro P. Comellas, Leticia Rittner, Roberto A. Lotufo, Joseph M. Reinhardt, Sarah E. Gerard

    Abstract: Automated segmentation of lung abnormalities in computed tomography is an important step for diagnosing and characterizing lung disease. In this work, we improve upon a previous method and propose S-MEDSeg, a deep learning based approach for accurate segmentation of lung lesions in chest CT images. S-MEDSeg combines a pre-trained EfficientNet backbone, bidirectional feature pyramid network, and mo… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: Versao em português brasileiro submetida para o XV EADCA 2023. Brazilian portuguese verson submitted to XV EADCA 2023

  11. arXiv:2307.04601  [pdf, ps, other

    cs.IR

    InPars Toolkit: A Unified and Reproducible Synthetic Data Generation Pipeline for Neural Information Retrieval

    Authors: Hugo Abonizio, Luiz Bonifacio, Vitor Jeronymo, Roberto Lotufo, Jakub Zavrel, Rodrigo Nogueira

    Abstract: Recent work has explored Large Language Models (LLMs) to overcome the lack of training data for Information Retrieval (IR) tasks. The generalization abilities of these models have enabled the creation of synthetic in-domain data by providing instructions and a few examples on a prompt. InPars and Promptagator have pioneered this approach and both methods have demonstrated the potential of using LL… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

  12. arXiv:2304.05901  [pdf, other

    eess.IV cs.CV

    Automated computed tomography and magnetic resonance imaging segmentation using deep learning: a beginner's guide

    Authors: Diedre Carmo, Gustavo Pinheiro, Lívia Rodrigues, Thays Abreu, Roberto Lotufo, Letícia Rittner

    Abstract: Medical image segmentation is an increasingly popular area of research in medical imaging processing and analysis. However, many researchers who are new to the field struggle with basic concepts. This tutorial paper aims to provide an overview of the fundamental concepts of medical imaging, with a focus on Magnetic Resonance and Computerized Tomography. We will also discuss deep learning algorithm… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: Equal contribution from Diedre Carmo, Gustavo Pinheiro, and Lívia Rodrigues

  13. arXiv:2303.17003  [pdf, other

    cs.CL cs.AI cs.LG

    Evaluating GPT-3.5 and GPT-4 Models on Brazilian University Admission Exams

    Authors: Desnes Nunes, Ricardo Primi, Ramon Pires, Roberto Lotufo, Rodrigo Nogueira

    Abstract: The present study aims to explore the capabilities of Language Models (LMs) in tackling high-stakes multiple-choice tests, represented here by the Exame Nacional do Ensino Médio (ENEM), a multidisciplinary entrance examination widely adopted by Brazilian universities. This exam poses challenging tasks for LMs, since its questions may span into multiple fields of knowledge, requiring understanding… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

  14. arXiv:2303.16145  [pdf, other

    cs.IR

    NeuralMind-UNICAMP at 2022 TREC NeuCLIR: Large Boring Rerankers for Cross-lingual Retrieval

    Authors: Vitor Jeronymo, Roberto Lotufo, Rodrigo Nogueira

    Abstract: This paper reports on a study of cross-lingual information retrieval (CLIR) using the mT5-XXL reranker on the NeuCLIR track of TREC 2022. Perhaps the biggest contribution of this study is the finding that despite the mT5 model being fine-tuned only on query-document pairs of the same language it proved to be viable for CLIR tasks, where query-document pairs are in different languages, even in the… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

  15. arXiv:2301.10521  [pdf, other

    cs.CL cs.AI cs.IR

    ExaRanker: Explanation-Augmented Neural Ranker

    Authors: Fernando Ferraretto, Thiago Laitz, Roberto Lotufo, Rodrigo Nogueira

    Abstract: Recent work has shown that inducing a large language model (LLM) to generate explanations prior to outputting an answer is an effective strategy to improve performance on a wide range of reasoning tasks. In this work, we show that neural rankers also benefit from explanations. We use LLMs such as GPT-3.5 to augment retrieval datasets with explanations and train a sequence-to-sequence ranking model… ▽ More

    Submitted 3 June, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

  16. arXiv:2301.01820  [pdf, ps, other

    cs.IR cs.AI

    InPars-v2: Large Language Models as Efficient Dataset Generators for Information Retrieval

    Authors: Vitor Jeronymo, Luiz Bonifacio, Hugo Abonizio, Marzieh Fadaee, Roberto Lotufo, Jakub Zavrel, Rodrigo Nogueira

    Abstract: Recently, InPars introduced a method to efficiently use large language models (LLMs) in information retrieval tasks: via few-shot examples, an LLM is induced to generate relevant queries for documents. These synthetic query-document pairs can then be used to train a retriever. However, InPars and, more recently, Promptagator, rely on proprietary LLMs such as GPT-3 and FLAN to generate such dataset… ▽ More

    Submitted 26 May, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

  17. arXiv:2212.09656  [pdf, other

    cs.CL cs.IR

    Visconde: Multi-document QA with GPT-3 and Neural Reranking

    Authors: Jayr Pereira, Robson Fidalgo, Roberto Lotufo, Rodrigo Nogueira

    Abstract: This paper proposes a question-answering system that can answer questions whose supporting evidence is spread over multiple (potentially long) documents. The system, called Visconde, uses a three-step pipeline to perform the task: decompose, retrieve, and aggregate. The first step decomposes the question into simpler questions using a few-shot large language model (LLM). Then, a state-of-the-art s… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.

  18. arXiv:2212.06121  [pdf, other

    cs.IR cs.CL

    In Defense of Cross-Encoders for Zero-Shot Retrieval

    Authors: Guilherme Rosa, Luiz Bonifacio, Vitor Jeronymo, Hugo Abonizio, Marzieh Fadaee, Roberto Lotufo, Rodrigo Nogueira

    Abstract: Bi-encoders and cross-encoders are widely used in many state-of-the-art retrieval pipelines. In this work we study the generalization ability of these two types of architectures on a wide range of parameter count on both in-domain and out-of-domain scenarios. We find that the number of parameters and early query-document interactions of cross-encoders play a significant role in the generalization… ▽ More

    Submitted 12 December, 2022; originally announced December 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2206.02873

  19. arXiv:2210.14837  [pdf, other

    cs.IR cs.LG

    NeuralSearchX: Serving a Multi-billion-parameter Reranker for Multilingual Metasearch at a Low Cost

    Authors: Thales Sales Almeida, Thiago Laitz, João Seródio, Luiz Henrique Bonifacio, Roberto Lotufo, Rodrigo Nogueira

    Abstract: The widespread availability of search API's (both free and commercial) brings the promise of increased coverage and quality of search results for metasearch engines, while decreasing the maintenance costs of the crawling and indexing infrastructures. However, merging strategies frequently comprise complex pipelines that require careful tuning, which is often overlooked in the literature. In this w… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: published as a full paper at the DESIRES 2022 Conference. 13 pages

    Journal ref: DESIRES 2022-3rd International Conference on Design of Experimental Search and Information REtrieval Systems, 30-31,August 2022, San Jose, CA, USA

  20. arXiv:2209.15094  [pdf, other

    eess.IV cs.CV

    Open-source tool for Airway Segmentation in Computed Tomography using 2.5D Modified EfficientDet: Contribution to the ATM22 Challenge

    Authors: Diedre Carmo, Leticia Rittner, Roberto Lotufo

    Abstract: Airway segmentation in computed tomography images can be used to analyze pulmonary diseases, however, manual segmentation is labor intensive and relies on expert knowledge. This manuscript details our contribution to MICCAI's 2022 Airway Tree Modelling challenge, a competition of fully automated methods for airway segmentation. We employed a previously developed deep learning architecture based on… ▽ More

    Submitted 3 October, 2022; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: Open source code, graphical user interface, and a pip package for predictions with our model and trained weights are in https://github.com/MICLab-Unicamp/medseg

  21. arXiv:2209.13738  [pdf, ps, other

    cs.CL cs.AI

    mRobust04: A Multilingual Version of the TREC Robust 2004 Benchmark

    Authors: Vitor Jeronymo, Mauricio Nascimento, Roberto Lotufo, Rodrigo Nogueira

    Abstract: Robust 2004 is an information retrieval benchmark whose large number of judgments per query make it a reliable evaluation dataset. In this paper, we present mRobust04, a multilingual version of Robust04 that was translated to 8 languages using Google Translate. We also provide results of three different multilingual retrievers on this dataset. The dataset is available at https://huggingface.co/dat… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: 4 pages

  22. arXiv:2209.11035  [pdf, other

    cs.CL

    MonoByte: A Pool of Monolingual Byte-level Language Models

    Authors: Hugo Abonizio, Leandro Rodrigues de Souza, Roberto Lotufo, Rodrigo Nogueira

    Abstract: The zero-shot cross-lingual ability of models pretrained on multilingual and even monolingual corpora has spurred many hypotheses to explain this intriguing empirical result. However, due to the costs of pretraining, most research uses public models whose pretraining methodology, such as the choice of tokenization, corpus size, and computational budget, might differ drastically. When researchers p… ▽ More

    Submitted 27 September, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

  23. arXiv:2208.11445  [pdf, other

    cs.CL

    Induced Natural Language Rationales and Interleaved Markup Tokens Enable Extrapolation in Large Language Models

    Authors: Mirelle Bueno, Carlos Gemmell, Jeffrey Dalton, Roberto Lotufo, Rodrigo Nogueira

    Abstract: The ability to extrapolate, i.e., to make predictions on sequences that are longer than those presented as training examples, is a challenging problem for current deep learning models. Recent work shows that this limitation persists in state-of-the-art Transformer-based models. Most solutions to this problem use specific architectures or training methods that do not generalize to other tasks. We d… ▽ More

    Submitted 28 November, 2022; v1 submitted 24 August, 2022; originally announced August 2022.

  24. arXiv:2208.06264  [pdf, ps, other

    cs.IR

    A Boring-yet-effective Approach for the Product Ranking Task of the Amazon KDD Cup 2022

    Authors: Vitor Jeronymo, Guilherme Rosa, Surya Kallumadi, Roberto Lotufo, Rodrigo Nogueira

    Abstract: In this work we describe our submission to the product ranking task of the Amazon KDD Cup 2022. We rely on a receipt that showed to be effective in previous competitions: we focus our efforts towards efficiently training and deploying large language odels, such as mT5, while reducing to a minimum the number of task-specific adaptations. Despite the simplicity of our approach, our best model was le… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

  25. arXiv:2206.02873  [pdf, other

    cs.IR cs.CL cs.PF

    No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval

    Authors: Guilherme Moraes Rosa, Luiz Bonifacio, Vitor Jeronymo, Hugo Abonizio, Marzieh Fadaee, Roberto Lotufo, Rodrigo Nogueira

    Abstract: Recent work has shown that small distilled language models are strong competitors to models that are orders of magnitude larger and slower in a wide range of information retrieval tasks. This has made distilled and dense models, due to latency constraints, the go-to choice for deployment in real-world retrieval applications. In this work, we question this practice by showing that the number of par… ▽ More

    Submitted 12 December, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

  26. arXiv:2205.15172  [pdf, ps, other

    cs.CL

    Billions of Parameters Are Worth More Than In-domain Training Data: A case study in the Legal Case Entailment Task

    Authors: Guilherme Moraes Rosa, Luiz Bonifacio, Vitor Jeronymo, Hugo Abonizio, Roberto Lotufo, Rodrigo Nogueira

    Abstract: Recent work has shown that language models scaled to billions of parameters, such as GPT-3, perform remarkably well in zero-shot and few-shot scenarios. In this work, we experiment with zero-shot models in the legal case entailment task of the COLIEE 2022 competition. Our experiments show that scaling the number of parameters in a language model improves the F1 score of our previous zero-shot resu… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

  27. To Tune or Not To Tune? Zero-shot Models for Legal Case Entailment

    Authors: Guilherme Moraes Rosa, Ruan Chaves Rodrigues, Roberto de Alencar Lotufo, Rodrigo Nogueira

    Abstract: There has been mounting evidence that pretrained language models fine-tuned on large and diverse supervised datasets can transfer well to a variety of out-of-domain tasks. In this work, we investigate this transfer ability to the legal domain. For that, we participated in the legal case entailment task of COLIEE 2021, in which we use such models with no adaptations to the target domain. Our submis… ▽ More

    Submitted 7 February, 2022; originally announced February 2022.

  28. arXiv:2201.05658  [pdf, other

    cs.AI cs.CL

    Sequence-to-Sequence Models for Extracting Information from Registration and Legal Documents

    Authors: Ramon Pires, Fábio C. de Souza, Guilherme Rosa, Roberto A. Lotufo, Rodrigo Nogueira

    Abstract: A typical information extraction pipeline consists of token- or span-level classification models coupled with a series of pre- and post-processing scripts. In a production pipeline, requirements often change, with classes being added and removed, which leads to nontrivial modifications to the source code and the possible introduction of bugs. In this work, we evaluate sequence-to-sequence models a… ▽ More

    Submitted 14 January, 2022; originally announced January 2022.

  29. arXiv:2109.01942  [pdf, other

    cs.CL

    On the ability of monolingual models to learn language-agnostic representations

    Authors: Leandro Rodrigues de Souza, Rodrigo Nogueira, Roberto Lotufo

    Abstract: Pretrained multilingual models have become a de facto default approach for zero-shot cross-lingual transfer. Previous work has shown that these models are able to achieve cross-lingual representations when pretrained on two or more languages with shared parameters. In this work, we provide evidence that a model can achieve language-agnostic representations even when pretrained on a single language… ▽ More

    Submitted 25 October, 2021; v1 submitted 4 September, 2021; originally announced September 2021.

  30. arXiv:2108.13897  [pdf, other

    cs.CL cs.AI

    mMARCO: A Multilingual Version of the MS MARCO Passage Ranking Dataset

    Authors: Luiz Bonifacio, Vitor Jeronymo, Hugo Queiroz Abonizio, Israel Campiotti, Marzieh Fadaee, Roberto Lotufo, Rodrigo Nogueira

    Abstract: The MS MARCO ranking dataset has been widely used for training deep learning models for IR tasks, achieving considerable effectiveness on diverse zero-shot scenarios. However, this type of resource is scarce in languages other than English. In this work, we present mMARCO, a multilingual version of the MS MARCO passage ranking dataset comprising 13 languages that was created using machine translat… ▽ More

    Submitted 17 August, 2022; v1 submitted 31 August, 2021; originally announced August 2021.

  31. arXiv:2105.06813  [pdf, other

    cs.CL cs.IR cs.LG

    A cost-benefit analysis of cross-lingual transfer methods

    Authors: Guilherme Moraes Rosa, Luiz Henrique Bonifacio, Leandro Rodrigues de Souza, Roberto Lotufo, Rodrigo Nogueira

    Abstract: An effective method for cross-lingual transfer is to fine-tune a bilingual or multilingual model on a supervised dataset in one language and evaluating it on another language in a zero-shot manner. Translating examples at training time or inference time are also viable alternatives. However, there are costs associated with these methods that are rarely addressed in the literature. In this work, we… ▽ More

    Submitted 14 December, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

  32. arXiv:2105.05686  [pdf, ps, other

    cs.IR cs.CL cs.LG

    Yes, BM25 is a Strong Baseline for Legal Case Retrieval

    Authors: Guilherme Moraes Rosa, Ruan Chaves Rodrigues, Roberto Lotufo, Rodrigo Nogueira

    Abstract: We describe our single submission to task 1 of COLIEE 2021. Our vanilla BM25 got second place, well above the median of submissions. Code is available at https://github.com/neuralmind-ai/coliee.

    Submitted 25 October, 2021; v1 submitted 26 April, 2021; originally announced May 2021.

  33. arXiv:2009.09290  [pdf, other

    cs.IR cs.CL cs.LG

    Can questions summarize a corpus? Using question generation for characterizing COVID-19 research

    Authors: Gabriela Surita, Rodrigo Nogueira, Roberto Lotufo

    Abstract: What are the latent questions on some textual data? In this work, we investigate using question generation models for exploring a collection of documents. Our method, dubbed corpus2question, consists of applying a pre-trained question generation model over a corpus and aggregating the resulting questions by frequency and time. This technique is an alternative to methods such as topic modelling and… ▽ More

    Submitted 19 September, 2020; originally announced September 2020.

    Comments: 11 pages, 5 figures

    ACM Class: H.3.3

  34. arXiv:2008.09144  [pdf, other

    cs.CL

    PTT5: Pretraining and validating the T5 model on Brazilian Portuguese data

    Authors: Diedre Carmo, Marcos Piau, Israel Campiotti, Rodrigo Nogueira, Roberto Lotufo

    Abstract: In natural language processing (NLP), there is a need for more resources in Portuguese, since much of the data used in the state-of-the-art research is in other languages. In this paper, we pretrain a T5 model on the BrWac corpus, an extensive collection of web pages in Portuguese, and evaluate its performance against other Portuguese pretrained models and multilingual models on three different ta… ▽ More

    Submitted 8 October, 2020; v1 submitted 20 August, 2020; originally announced August 2020.

  35. arXiv:2008.08769  [pdf, other

    cs.CL

    Lite Training Strategies for Portuguese-English and English-Portuguese Translation

    Authors: Alexandre Lopes, Rodrigo Nogueira, Roberto Lotufo, Helio Pedrini

    Abstract: Despite the widespread adoption of deep learning for machine translation, it is still expensive to develop high-quality translation models. In this work, we investigate the use of pre-trained models, such as T5 for Portuguese-English and English-Portuguese translation tasks using low-cost hardware. We explore the use of Portuguese and English pre-trained language models and propose an adaptation o… ▽ More

    Submitted 20 August, 2020; originally announced August 2020.

    Comments: for code and weights, visit https://github.com/unicamp-dl/Lite-T5-Translation

  36. arXiv:2002.06219  [pdf, other

    cs.LG eess.SP stat.ML

    Electricity Theft Detection with self-attention

    Authors: Paulo Finardi, Israel Campiotti, Gustavo Plensack, Rafael Derradi de Souza, Rodrigo Nogueira, Gustavo Pinheiro, Roberto Lotufo

    Abstract: In this work we propose a novel self-attention mechanism model to address electricity theft detection on an imbalanced realistic dataset that presents a daily electricity consumption provided by State Grid Corporation of China. Our key contribution is the introduction of a multi-head self-attention mechanism concatenated with dilated convolutions and unified by a convolution of kernel size $1$. Mo… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

  37. Hippocampus Segmentation on Epilepsy and Alzheimer's Disease Studies with Multiple Convolutional Neural Networks

    Authors: Diedre Carmo, Bruna Silva, Clarissa Yasuda, Letícia Rittner, Roberto Lotufo

    Abstract: Hippocampus segmentation on magnetic resonance imaging is of key importance for the diagnosis, treatment decision and investigation of neuropsychiatric disorders. Automatic segmentation is an active research field, with many recent models using deep learning. Most current state-of-the art hippocampus segmentation methods train their methods on healthy or Alzheimer's disease patients from public da… ▽ More

    Submitted 10 February, 2021; v1 submitted 14 January, 2020; originally announced January 2020.

    Comments: Code is available at https://github.com/dscarmo/e2dhipseg Published in Heliyon: https://www.sciencedirect.com/science/article/pii/S2405844021003315

    Journal ref: Heliyon, Volume 7, Issue 2, 2021

  38. arXiv:1909.10649  [pdf, other

    cs.CL cs.IR cs.LG

    Portuguese Named Entity Recognition using BERT-CRF

    Authors: Fábio Souza, Rodrigo Nogueira, Roberto Lotufo

    Abstract: Recent advances in language representation using neural networks have made it viable to transfer the learned internal states of a trained model to downstream natural language processing tasks, such as named entity recognition (NER) and question answering. It has been shown that the leverage of pre-trained language models improves the overall performance on many tasks and is highly beneficial when… ▽ More

    Submitted 27 February, 2020; v1 submitted 23 September, 2019; originally announced September 2019.

  39. arXiv:1902.04487  [pdf, other

    eess.IV cs.CV cs.LG

    Extended 2D Consensus Hippocampus Segmentation

    Authors: Diedre Carmo, Bruna Silva, Clarissa Yasuda, Letícia Rittner, Roberto Lotufo

    Abstract: Hippocampus segmentation plays a key role in diagnosing various brain disorders such as Alzheimer's disease, epilepsy, multiple sclerosis, cancer, depression and others. Nowadays, segmentation is still mainly performed manually by specialists. Segmentation done by experts is considered to be a gold-standard when evaluating automated methods, buts it is a time consuming and arduos task, requiring s… ▽ More

    Submitted 13 May, 2020; v1 submitted 12 February, 2019; originally announced February 2019.

    Comments: This was published as an extended abstract in MIDL 2019 [arXiv:1907.08612]. An alpha version of the code is available at https://github.com/dscarmo/e2dhipseg. More experiments on improvements to the method and code are ongoing. Future updates are to be expected. A new, more complete paper is published in arXiv:2001.05058

    Report number: MIDL/2019/ExtendedAbstract/Sygx97DaKV

  40. arXiv:1804.04988  [pdf, other

    cs.CV

    Convolutional Neural Networks for Skull-strip** in Brain MR Imaging using Consensus-based Silver standard Masks

    Authors: Oeslle Lucena, Roberto Souza, Leticia Rittner, Richard Frayne, Roberto Lotufo

    Abstract: Convolutional neural networks (CNN) for medical imaging are constrained by the number of annotated data required in the training stage. Usually, manual annotation is considered to be the "gold standard". However, medical imaging datasets that include expert manual segmentation are scarce as this step is time-consuming, and therefore expensive. Moreover, single-rater manual annotation is most often… ▽ More

    Submitted 13 April, 2018; originally announced April 2018.

  41. Evaluating software-based fingerprint liveness detection using Convolutional Networks and Local Binary Patterns

    Authors: Rodrigo Frassetto Nogueira, Roberto de Alencar Lotufo, Rubens Campos Machado

    Abstract: With the growing use of biometric authentication systems in the past years, spoof fingerprint detection has become increasingly important. In this work, we implement and evaluate two different feature extraction techniques for software-based fingerprint liveness detection: Convolutional Networks with random weights and Local Binary Patterns. Both techniques were used in conjunction with a Support… ▽ More

    Submitted 3 August, 2015; originally announced August 2015.

    Comments: arXiv admin note: text overlap with arXiv:1301.3557 by other authors

    Journal ref: Biometric Measurements and Systems for Security and Medical Applications (BIOMS) Proceedings, 2014 IEEE Workshop on