Showing 1–4 of 4 results for author: Jodas, D S

  1. arXiv:2401.02909  [pdf, other]

    cs.CL

    Introducing Bode: A Fine-Tuned Large Language Model for Portuguese Prompt-Based Task

    Authors: Gabriel Lino Garcia, Pedro Henrique Paiola, Luis Henrique Morelli, Giovani Candido, Arnaldo Cândido Júnior, Danilo Samuel Jodas, Luis C. S. Afonso, Ivan Rizzo Guilherme, Bruno Elias Penteado, João Paulo Papa

    Abstract: Large Language Models (LLMs) are increasingly bringing advances to Natural Language Processing. However, low-resource languages, those lacking extensive prominence in datasets for various NLP tasks, or where existing datasets are not as substantial, such as Portuguese, already obtain several benefits from LLMs, but not to the same extent. LLMs trained on multilingual datasets normally struggle to…

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: 10 pages, 3 figures

  2. PL-kNN: A Parameterless Nearest Neighbors Classifier

    Authors: Danilo Samuel Jodas, Leandro Aparecido Passos, Ahsan Adeel, João Paulo Papa

    Abstract: Demands for minimum parameter setup in machine learning models are desirable to avoid time-consuming optimization processes. The $k$-Nearest Neighbors is one of the most effective and straightforward models employed in numerous problems. Despite its well-known performance, it requires the value of $k$ for specific data distribution, thus demanding expensive computational efforts. This paper propos…

    Submitted 30 September, 2022; v1 submitted 26 September, 2022; originally announced September 2022.

  3. Handling Imbalanced Datasets Through Optimum-Path Forest

    Authors: Leandro Aparecido Passos, Danilo S. Jodas, Luiz C. F. Ribeiro, Marco Akio, Andre Nunes de Souza, João Paulo Papa

    Abstract: In the last decade, machine learning-based approaches became capable of performing a wide range of complex tasks sometimes better than humans, demanding a fraction of the time. Such an advance is partially due to the exponential growth in the amount of data available, which makes it possible to extract trustworthy real-world information from them. However, such data is generally imbalanced since s…

    Submitted 17 February, 2022; originally announced February 2022.

  4. $\text{O}^2$PF: Oversampling via Optimum-Path Forest for Breast Cancer Detection

    Authors: Leandro Aparecido Passos, Danilo Samuel Jodas, Luiz C. F. Ribeiro, Thierry Pinheiro, João P. Papa

    Abstract: Breast cancer is among the most deadly diseases, distressing mostly women worldwide. Although traditional methods for detection have presented themselves as valid for the task, they still commonly present low accuracies and demand considerable time and effort from professionals. Therefore, a computer-aided diagnosis (CAD) system capable of providing early detection becomes hugely desirable. In the…

    Submitted 14 January, 2021; originally announced January 2021.

    Comments: 6 pages, 3 figures. 2020 IEEE 33rd International Symposium on Computer-Based Medical Systems (CBMS)