Skip to main content

Showing 1–12 of 12 results for author: Colcher, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.06790  [pdf, other

    cs.CL cs.AI

    Using Zero-shot Prompting in the Automatic Creation and Expansion of Topic Taxonomies for Tagging Retail Banking Transactions

    Authors: Daniel de S. Moraes, Pedro T. C. Santos, Polyana B. da Costa, Matheus A. S. Pinto, Ivan de J. P. Pinto, Álvaro M. G. da Veiga, Sergio Colcher, Antonio J. G. Busson, Rafael H. Rocha, Rennan Gaio, Rafael Miceli, Gabriela Tourinho, Marcos Rabaioli, Leandro Santos, Fellipe Marques, David Favaro

    Abstract: This work presents an unsupervised method for automatically constructing and expanding topic taxonomies using instruction-based fine-tuned LLMs (Large Language Models). We apply topic modeling and keyword extraction techniques to create initial topic taxonomies and LLMs to post-process the resulting terms and create a hierarchy. To expand an existing taxonomy with new terms, we use zero-shot promp… ▽ More

    Submitted 11 February, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

  2. Hierarchical Classification of Financial Transactions Through Context-Fusion of Transformer-based Embeddings and Taxonomy-aware Attention Layer

    Authors: Antonio J. G. Busson, Rafael Rocha, Rennan Gaio, Rafael Miceli, Ivan Pereira, Daniel de S. Moraes, Sérgio Colcher, Alvaro Veiga, Bruno Rizzi, Francisco Evangelista, Leandro Santos, Fellipe Marques, Marcos Rabaioli, Diego Feldberg, Debora Mattos, João Pasqua, Diogo Dias

    Abstract: This work proposes the Two-headed DragoNet, a Transformer-based model for hierarchical multi-label classification of financial transactions. Our model is based on a stack of Transformers encoder layers that generate contextual embeddings from two short textual descriptors (merchant name and business activity), followed by a Context Fusion layer and two output heads that classify transactions accor… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  3. arXiv:2110.01425  [pdf, other

    cs.SD cs.CL eess.AS

    Building a Noisy Audio Dataset to Evaluate Machine Learning Approaches for Automatic Speech Recognition Systems

    Authors: Julio Cesar Duarte, Sérgio Colcher

    Abstract: Automatic speech recognition systems are part of people's daily lives, embedded in personal assistants and mobile phones, hel** as a facilitator for human-machine interaction while allowing access to information in a practically intuitive way. Such systems are usually implemented using machine learning techniques, especially with deep neural networks. Even with its high performance in the task o… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: Tech report series Monografias em Ciência da Computação, september, 2021, Dep. Informática PUC-Rio, RJ, BRAZIL, ISSN 0103-9741

    Report number: MCC no. 05/2021

  4. arXiv:2106.08269  [pdf, other

    cs.CV cs.LG

    Generating Data Augmentation samples for Semantic Segmentation of Salt Bodies in a Synthetic Seismic Image Dataset

    Authors: Luis Felipe Henriques, Sérgio Colcher, Ruy Luiz Milidiú, André Bulcão, Pablo Barros

    Abstract: Nowadays, subsurface salt body localization and delineation, also called semantic segmentation of salt bodies, are among the most challenging geophysicist tasks. Thus, identifying large salt bodies is notoriously tricky and is crucial for identifying hydrocarbon reservoirs and drill path planning. This work proposes a Data Augmentation method based on training two generative models to augment the… ▽ More

    Submitted 17 June, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

  5. arXiv:2011.14870  [pdf, other

    cs.LG cs.AI cs.CV

    Prior Flow Variational Autoencoder: A density estimation model for Non-Intrusive Load Monitoring

    Authors: Luis Felipe M. O. Henriques, Eduardo Morgan, Sergio Colcher, Ruy Luiz Milidiú

    Abstract: Non-Intrusive Load Monitoring (NILM) is a computational technique to estimate the power loads' appliance-by-appliance from the whole consumption measured by a single meter. In this paper, we propose a conditional density estimation model, based on deep neural networks, that joins a Conditional Variational Autoencoder with a Conditional Invertible Normalizing Flow model to estimate the individual a… ▽ More

    Submitted 27 June, 2021; v1 submitted 30 November, 2020; originally announced November 2020.

  6. arXiv:2010.11732  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    A Cluster-Matching-Based Method for Video Face Recognition

    Authors: Paulo R C Mendes, Antonio J G Busson, Sérgio Colcher, Daniel Schwabe, Álan L V Guedes, Carlos Laufer

    Abstract: Face recognition systems are present in many modern solutions and thousands of applications in our daily lives. However, current solutions are not easily scalable, especially when it comes to the addition of new targeted people. We propose a cluster-matching-based approach for face recognition in video. In our approach, we use unsupervised learning to cluster the faces present in both the dataset… ▽ More

    Submitted 19 October, 2020; originally announced October 2020.

    Comments: 13 pages

  7. arXiv:2010.05760  [pdf, other

    eess.IV cs.CV cs.LG cs.MM

    Video Quality Enhancement Using Deep Learning-Based Prediction Models for Quantized DCT Coefficients in MPEG I-frames

    Authors: Antonio J G Busson, Paulo R C Mendes, Daniel de S Moraes, Álvaro M da Veiga, Álan L V Guedes, Sérgio Colcher

    Abstract: Recent works have successfully applied some types of Convolutional Neural Networks (CNNs) to reduce the noticeable distortion resulting from the lossy JPEG/MPEG compression technique. Most of them are built upon the processing made on the spatial domain. In this work, we propose a MPEG video decoder that is purely based on the frequency-to-frequency domain: it reads the quantized DCT coefficients… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

  8. arXiv:2010.04676  [pdf, other

    cs.MM cs.LG

    A Clustering-Based Method for Automatic Educational Video Recommendation Using Deep Face-Features of Lecturers

    Authors: Paulo R. C. Mendes, Eduardo S. Vieira, Álan L. V. Guedes, Antonio J. G. Busson, Sérgio Colcher

    Abstract: Discovering and accessing specific content within educational video bases is a challenging task, mainly because of the abundance of video content and its diversity. Recommender systems are often used to enhance the ability to find and select content. But, recommendation mechanisms, especially those based on textual information, exhibit some limitations, such as being error-prone to manually create… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

  9. arXiv:2005.03626  [pdf, other

    cs.CV cs.LG

    Seismic Shot Gather Noise Localization Using a Multi-Scale Feature-Fusion-Based Neural Network

    Authors: Antonio José G. Busson, Sérgio Colcher, Ruy Luiz Milidiú, Bruno Pereira Dias, André Bulcão

    Abstract: Deep learning-based models, such as convolutional neural networks, have advanced various segments of computer vision. However, this technology is rarely applied to seismic shot gather noise localization problem. This letter presents an investigation on the effectiveness of a multi-scale feature-fusion-based network for seismic shot-gather noise localization. Herein, we describe the following: (1)… ▽ More

    Submitted 7 May, 2020; originally announced May 2020.

  10. arXiv:1912.01148  [pdf, other

    cs.CV cs.LG eess.IV

    A Deep Convolutional Network for Seismic Shot-Gather Image Quality Classification

    Authors: Eduardo Betine Bucker, Antonio José Grandson Busson, Ruy Luiz Milidiú, Sérgio Colcher, Bruno Pereira Dias, André Bulcão

    Abstract: Deep Learning-based models such as Convolutional Neural Networks, have led to significant advancements in several areas of computing applications. Seismogram quality assurance is a relevant Geophysics task, since in the early stages of seismic processing, we are required to identify and fix noisy sail lines. In this work, we introduce a real-world seismogram quality classification dataset based on… ▽ More

    Submitted 2 December, 2019; originally announced December 2019.

  11. arXiv:1911.03974  [pdf, other

    cs.MM cs.IR

    A Multimodal CNN-based Tool to Censure Inappropriate Video Scenes

    Authors: Pedro V. A. de Freitas, Paulo R. C. Mendes, Gabriel N. P. dos Santos, Antonio José G. Busson, Álan Livio Guedes, Sérgio Colcher, Ruy Luiz Milidiú

    Abstract: Due to the extensive use of video-sharing platforms and services for their storage, the amount of such media on the internet has become massive. This volume of data makes it difficult to control the kind of content that may be present in such video files. One of the main concerns regarding the video content is if it has an inappropriate subject matter, such as nudity, violence, or other potentiall… ▽ More

    Submitted 10 November, 2019; originally announced November 2019.

  12. arXiv:1811.04193  [pdf, other

    cs.MM

    A Ginga-enabled Digital Radio Mondiale Broadcasting chain: Signaling and Definitions

    Authors: Rafael Diniz, Alan L. V. Guedes, Sergio Colcher

    Abstract: ISDB-T International standard is currently adopted by most Latin America countries and is already installed in most TV sets sold in recent years in the region. To support interactive applications in Digital TV receivers, ISDB-T defines the middleware Ginga. Similar to Digital TV, Digital Radio standards also provide the means to carry interactive applications; however, their specifications for int… ▽ More

    Submitted 12 June, 2019; v1 submitted 9 November, 2018; originally announced November 2018.

    Comments: 15 pages

    Report number: ISSN 0103-9741 ISSN 0103-9741