Showing 1–6 of 6 results for author: Amadeus, M

  1. arXiv:2402.06766  [pdf, other]

    cs.CL cs.AI

    Evaluation Metrics for Text Data Augmentation in NLP

    Authors: Marcellus Amadeus, William Alberto Cruz Castañeda

    Abstract: Recent surveys on data augmentation for natural language processing have reported different techniques and advancements in the field. Several frameworks, tools, and repositories promote the implementation of text data augmentation pipelines. However, a lack of evaluation criteria and standards for method comparison due to different tasks, metrics, datasets, architectures, and experimental settings…

    Submitted 9 February, 2024; originally announced February 2024.

  2. arXiv:2402.05794  [pdf, other]

    cs.CL cs.AI

    Phonetically rich corpus construction for a low-resourced language

    Authors: Marcellus Amadeus, William Alberto Cruz Castañeda, Wilmer Lobato, Niasche Aquino

    Abstract: Speech technologies rely on capturing a speaker's voice variability while obtaining comprehensive language information. Textual prompts and sentence selection methods have been proposed in the literature to comprise such adequate phonetic data, referred to as a phonetically rich corpus. However, they are still insufficient for acoustic modeling, especially critical for languages with limi…

    Submitted 8 February, 2024; originally announced February 2024.

  3. arXiv:2402.05106  [pdf, other]

    cs.CV cs.AI cs.CL

    Image captioning for Brazilian Portuguese using GRIT model

    Authors: Rafael Silva de Alencar, William Alberto Cruz Castañeda, Marcellus Amadeus

    Abstract: This work presents the early development of an image captioning model for the Brazilian Portuguese language. We used the GRIT (Grid- and Region-based Image captioning Transformer) model to accomplish this work. GRIT is a Transformer-only neural architecture that effectively utilizes two visual features to generate better captions. The GRIT method emerged as a proposal to be a more efficient way…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2207.09666 by other authors

  4. arXiv:2402.03501  [pdf, other]

    cs.CV cs.AI cs.CL

    An Inpainting-Infused Pipeline for Attire and Background Replacement

    Authors: Felipe Rodrigues Perche-Mahlow, André Felipe-Zanella, William Alberto Cruz-Castañeda, Marcellus Amadeus

    Abstract: In recent years, groundbreaking advancements in Generative Artificial Intelligence (GenAI) have triggered a transformative paradigm shift, significantly influencing various domains. In this work, we specifically explore an integrated approach, leveraging advanced techniques in GenAI and computer vision emphasizing image manipulation. The methodology unfolds through several stages, including depth…

    Submitted 5 February, 2024; originally announced February 2024.

  5. arXiv:2401.05520  [pdf, other]

    cs.CV cs.AI cs.CL

    From Pampas to Pixels: Fine-Tuning Diffusion Models for Gaúcho Heritage

    Authors: Marcellus Amadeus, William Alberto Cruz Castañeda, André Felipe Zanella, Felipe Rodrigues Perche Mahlow

    Abstract: Generative AI has become pervasive in society, witnessing significant advancements in various domains. Particularly in the realm of Text-to-Image (TTI) models, Latent Diffusion Models (LDMs) showcase remarkable capabilities in generating visual content based on textual prompts. This paper addresses the potential of LDMs in representing local cultural concepts, historical figures, and endangered s…

    Submitted 10 January, 2024; originally announced January 2024.

  6. arXiv:2304.02785  [pdf, other]

    cs.CL

    Performance of Data Augmentation Methods for Brazilian Portuguese Text Classification

    Authors: Marcellus Amadeus, Paulo Branco

    Abstract: Improving machine learning performance while increasing model generalization has been a goal constantly pursued by AI researchers. Data augmentation techniques are often used towards achieving this target, and most of their evaluation is done using English corpora. In this work, we took advantage of different existing data augmentation methods to analyze their performance when applied to text classifica…

    Submitted 5 April, 2023; originally announced April 2023.