Showing 1–20 of 20 results for author: da Costa, P

Searching in archive cs.
  1. arXiv:2401.06790  [pdf, other]

    cs.CL cs.AI

    Using Zero-shot Prompting in the Automatic Creation and Expansion of Topic Taxonomies for Tagging Retail Banking Transactions

    Authors: Daniel de S. Moraes, Pedro T. C. Santos, Polyana B. da Costa, Matheus A. S. Pinto, Ivan de J. P. Pinto, Álvaro M. G. da Veiga, Sergio Colcher, Antonio J. G. Busson, Rafael H. Rocha, Rennan Gaio, Rafael Miceli, Gabriela Tourinho, Marcos Rabaioli, Leandro Santos, Fellipe Marques, David Favaro

    Abstract: This work presents an unsupervised method for automatically constructing and expanding topic taxonomies using instruction-based fine-tuned LLMs (Large Language Models). We apply topic modeling and keyword extraction techniques to create initial topic taxonomies and LLMs to post-process the resulting terms and create a hierarchy. To expand an existing taxonomy with new terms, we use zero-shot promp…

    Submitted 11 February, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

  2. arXiv:2307.15208  [pdf, other]

    eess.IV cs.CV

    Generative AI for Medical Imaging: extending the MONAI Framework

    Authors: Walter H. L. Pinaya, Mark S. Graham, Eric Kerfoot, Petru-Daniel Tudosiu, Jessica Dafflon, Virginia Fernandez, Pedro Sanchez, Julia Wolleb, Pedro F. da Costa, Ashay Patel, Hyungjin Chung, Can Zhao, Wei Peng, Zelong Liu, Xueyan Mei, Oeslle Lucena, Jong Chul Ye, Sotirios A. Tsaftaris, Prerna Dogra, Andrew Feng, Marc Modat, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso

    Abstract: Recent advances in generative AI have brought incredible breakthroughs in several areas, including medical imaging. These generative models have tremendous potential not only to help safely share medical data via synthetic datasets but also to perform an array of diverse applications, such as anomaly detection, image-to-image translation, denoising, and MRI reconstruction. However, due to the comp…

    Submitted 27 July, 2023; originally announced July 2023.

  3. arXiv:2212.10913  [pdf]

    cs.CR cs.LG

    Ensemble learning techniques for intrusion detection system in the context of cybersecurity

    Authors: Andricson Abeline Moreira, Carlos A. C. Tojeiro, Carlos J. Reis, Gustavo Henrique Massaro, Igor Andrade Brito, Kelton A. P. da Costa

    Abstract: Recently, there has been an interest in improving the resources available in Intrusion Detection System (IDS) techniques. In this sense, several studies related to cybersecurity show that the environment invasions and information kidnapping are increasingly recurrent and complex. The criticality of the business involving operations in an environment using computing resources does not allow the vul…

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: in Portuguese language. CIACA - Conferencia Ibero-Americana Computação Aplicada 2022 Proceedings

  4. Extractive Text Summarization Using Generalized Additive Models with Interactions for Sentence Selection

    Authors: Vinícius Camargo da Silva, João Paulo Papa, Kelton Augusto Pontara da Costa

    Abstract: Automatic Text Summarization (ATS) is becoming relevant with the growth of textual data; however, with the popularization of public large-scale datasets, some recent machine learning approaches have focused on dense models and architectures that, despite producing notable results, usually turn out in models difficult to interpret. Given the challenge behind interpretable learning-based text summar…

    Submitted 20 December, 2022; originally announced December 2022.

  5. arXiv:2212.04984  [pdf, other]

    cs.LG cs.AI

    Transformer-based normative modelling for anomaly detection of early schizophrenia

    Authors: Pedro F Da Costa, Jessica Dafflon, Sergio Leonardo Mendes, João Ricardo Sato, M. Jorge Cardoso, Robert Leech, Emily JH Jones, Walter H. L. Pinaya

    Abstract: Despite the impact of psychiatric disorders on clinical health, early-stage diagnosis remains a challenge. Machine learning studies have shown that classifiers tend to be overly narrow in the diagnosis prediction task. The overlap between conditions leads to high heterogeneity among participants that is not adequately captured by classification models. To address this issue, normative approaches h…

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: 10 pages, 2 figures, 2 tables, presented at NeurIPS22@PAI4MH

  6. arXiv:2209.07162  [pdf, other]

    eess.IV cs.CV q-bio.QM

    Brain Imaging Generation with Latent Diffusion Models

    Authors: Walter H. L. Pinaya, Petru-Daniel Tudosiu, Jessica Dafflon, Pedro F da Costa, Virginia Fernandez, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso

    Abstract: Deep neural networks have brought remarkable breakthroughs in medical image analysis. However, due to their data-hungry nature, the modest dataset sizes in medical imaging projects might be hindering their full potential. Generating synthetic data provides a promising alternative, allowing to complement training datasets and conducting medical image research at a larger scale. Diffusion models rec…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: 10 pages, 3 figures, Accepted in the Deep Generative Models workshop @ MICCAI 2022

  7. arXiv:2206.03461  [pdf, other]

    cs.CV eess.IV q-bio.QM

    Fast Unsupervised Brain Anomaly Detection and Segmentation with Diffusion Models

    Authors: Walter H. L. Pinaya, Mark S. Graham, Robert Gray, Pedro F Da Costa, Petru-Daniel Tudosiu, Paul Wright, Yee H. Mah, Andrew D. MacKinnon, James T. Teo, Rolf Jager, David Werring, Geraint Rees, Parashkev Nachev, Sebastien Ourselin, M. Jorge Cardoso

    Abstract: Deep generative models have emerged as promising tools for detecting arbitrary anomalies in data, dispensing with the necessity for manual labelling. Recently, autoregressive transformers have achieved state-of-the-art performance for anomaly detection in medical imaging. Nonetheless, these models still have some intrinsic weaknesses, such as requiring images to be modelled as 1D sequences, the ac…

    Submitted 7 June, 2022; originally announced June 2022.

  8. arXiv:2205.09185  [pdf, other]

    physics.ins-det cs.LG hep-ex nucl-ex physics.comp-ph

    AI-assisted Optimization of the ECCE Tracking System at the Electron Ion Collider

    Authors: C. Fanelli, Z. Papandreou, K. Suresh, J. K. Adkins, Y. Akiba, A. Albataineh, M. Amaryan, I. C. Arsene, C. Ayerbe Gayoso, J. Bae, X. Bai, M. D. Baker, M. Bashkanov, R. Bellwied, F. Benmokhtar, V. Berdnikov, J. C. Bernauer, F. Bock, W. Boeglin, M. Borysova, E. Brash, P. Brindza, W. J. Briscoe, M. Brooks, S. Bueltmann, et al. (258 additional authors not shown)

    Abstract: The Electron-Ion Collider (EIC) is a cutting-edge accelerator facility that will study the nature of the "glue" that binds the building blocks of the visible matter in the universe. The proposed experiment will be realized at Brookhaven National Laboratory in approximately 10 years from now, with detector design and R&D currently ongoing. Notably, EIC is one of the first large-scale facilities to…

    Submitted 19 May, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: 16 pages, 18 figures, 2 appendices, 3 tables

  9. arXiv:2203.02728  [pdf, other]

    cs.CV

    An End-to-End Approach for Seam Carving Detection using Deep Neural Networks

    Authors: Thierry P. Moreira, Marcos Cleison S. Santana, Leandro A. Passos, João Paulo Papa, Kelton Augusto P. da Costa

    Abstract: Seam carving is a computational method capable of resizing images for both reduction and expansion based on its content, instead of the image geometry. Although the technique is mostly employed to deal with redundant information, i.e., regions composed of pixels with similar intensity, it can also be used for tampering images by inserting or removing relevant objects. Therefore, detecting such a p…

    Submitted 5 March, 2022; originally announced March 2022.

  10. A Review of Deep Learning-based Approaches for Deepfake Content Detection

    Authors: Leandro A. Passos, Danilo Jodas, Kelton A. P. da Costa, Luis A. Souza Júnior, Douglas Rodrigues, Javier Del Ser, David Camacho, João Paulo Papa

    Abstract: Recent advancements in deep learning generative models have raised concerns as they can create highly convincing counterfeit images and videos. This poses a threat to people's integrity and can lead to social instability. To address this issue, there is a pressing need to develop new computational models that can efficiently detect forged content and alert users to potential image and video manipu…

    Submitted 15 February, 2024; v1 submitted 12 February, 2022; originally announced February 2022.

  11. arXiv:2201.10453  [pdf, other]

    cs.AI

    The First AI4TSP Competition: Learning to Solve Stochastic Routing Problems

    Authors: Laurens Bliek, Paulo da Costa, Reza Refaei Afshar, Yingqian Zhang, Tom Catshoek, Daniël Vos, Sicco Verwer, Fynn Schmitt-Ulms, André Hottung, Tapan Shah, Meinolf Sellmann, Kevin Tierney, Carl Perreault-Lafleur, Caroline Leboeuf, Federico Bobbio, Justine Pepin, Warley Almeida Silva, Ricardo Gama, Hugo L. Fernandes, Martin Zaefferer, Manuel López-Ibáñez, Ekhine Irurozki

    Abstract: This paper reports on the first international competition on AI for the traveling salesman problem (TSP) at the International Joint Conference on Artificial Intelligence 2021 (IJCAI-21). The TSP is one of the classical combinatorial optimization problems, with many variants inspired by real-world applications. This first competition asked the participants to develop algorithms to solve a time-depe…

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: 21 pages

    MSC Class: 68T05

  12. arXiv:2105.15119  [pdf, other]

    math.OC cs.LG

    Policies for the Dynamic Traveling Maintainer Problem with Alerts

    Authors: Paulo da Costa, Peter Verleijsdonk, Simon Voorberg, Alp Akcay, Stella Kapodistria, Willem van Jaarsveld, Yingqian Zhang

    Abstract: Downtime of industrial assets such as wind turbines and medical imaging devices comes at a sharp cost. To avoid such downtime costs, companies seek to initiate maintenance just before failure. Unfortunately, this is challenging for the following two reasons: On the one hand, because asset failures are notoriously difficult to predict, even in the presence of real-time monitoring devices which sign…

    Submitted 20 May, 2022; v1 submitted 31 May, 2021; originally announced May 2021.

  13. arXiv:2007.09989  [pdf, other]

    cs.LG cs.HC stat.ML

    Bayesian optimization for automatic design of face stimuli

    Authors: Pedro F. da Costa, Romy Lorenz, Ricardo Pio Monti, Emily Jones, Robert Leech

    Abstract: Investigating the cognitive and neural mechanisms involved with face processing is a fundamental task in modern neuroscience and psychology. To date, the majority of such studies have focused on the use of pre-selected stimuli. The absence of personalized stimuli presents a serious limitation as it fails to account for how each individual face processing system is tuned to cultural embeddings or h…

    Submitted 20 July, 2020; originally announced July 2020.

    Comments: Accepted at ICML2020 workshop track

  14. arXiv:2004.01608  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning 2-opt Heuristics for the Traveling Salesman Problem via Deep Reinforcement Learning

    Authors: Paulo R. de O. da Costa, Jason Rhuggenaath, Yingqian Zhang, Alp Akcay

    Abstract: Recent works using deep learning to solve the Traveling Salesman Problem (TSP) have focused on learning construction heuristics. Such approaches find TSP solutions of good quality but require additional procedures such as beam search and sampling to improve solutions and achieve state-of-the-art performance. However, few studies have focused on improvement heuristics, where a given solution is imp…

    Submitted 14 September, 2020; v1 submitted 3 April, 2020; originally announced April 2020.

    Comments: To appear in Proceedings Machine Learning Research - ACML 2020

  15. arXiv:1907.07568  [pdf, other]

    cs.NE cs.LG

    Machine Learning based Simulation Optimisation for Trailer Management

    Authors: Dylan Rijnen, Jason Rhuggenaath, Paulo R. de O. da Costa, Yingqian Zhang

    Abstract: In many situations, simulation models are developed to handle complex real-world business optimisation problems. For example, a discrete-event simulation model is used to simulate the trailer management process in a big Fast-Moving Consumer Goods company. To address the problem of finding suitable inputs to this simulator for optimising fleet configuration, we propose a simulation optimisation app…

    Submitted 17 July, 2019; originally announced July 2019.

    Comments: Submitted to IEEE SMC 2019

  16. arXiv:1907.07480  [pdf, other]

    cs.LG stat.ML

    Remaining Useful Lifetime Prediction via Deep Domain Adaptation

    Authors: Paulo R. de O. da Costa, Alp Akcay, Yingqian Zhang, Uzay Kaymak

    Abstract: In Prognostics and Health Management (PHM) sufficient prior observed degradation data is usually critical for Remaining Useful Lifetime (RUL) prediction. Most previous data-driven prediction methods assume that training (source) and testing (target) condition monitoring data have similar distributions. However, due to different operating conditions, fault modes, noise and equipment updates distrib…

    Submitted 17 July, 2019; originally announced July 2019.

  17. arXiv:1907.04711  [pdf, other]

    cs.AI cs.LG stat.ML

    Data-driven Policy on Feasibility Determination for the Train Shunting Problem

    Authors: Paulo R. de O. da Costa, J. Rhuggenaath, Y. Zhang, A. Akcay, W. Lee, U. Kaymak

    Abstract: Parking, matching, scheduling, and routing are common problems in train maintenance. In particular, train units are commonly maintained and cleaned at dedicated shunting yards. The planning problem that results from such situations is referred to as the Train Unit Shunting Problem (TUSP). This problem involves matching arriving train units to service tasks and determining the schedule for departin…

    Submitted 10 July, 2019; originally announced July 2019.

    Comments: Accepted as conference paper at ECML PKDD 2019

  18. arXiv:1806.07908  [pdf, other]

    cs.LG cs.CV stat.ML

    Como funciona o Deep Learning [How Deep Learning Works]

    Authors: Moacir Antonelli Ponti, Gabriel B. Paranhos da Costa

    Abstract: Deep Learning methods are currently the state-of-the-art in many problems which can be tackled via machine learning, in particular classification problems. However there is still lack of understanding on how those methods work, why they work and what are the limitations involved in using them. In this chapter we will describe in detail the transition from shallow to deep networks, include examples…

    Submitted 20 June, 2018; originally announced June 2018.

    Comments: Book chapter, in Portuguese, 31 pages

    Journal ref: In: Tópicos em Gerenciamento de Dados e Informações, SBC, Cap.3, ISBN 978-85-7669-400-7, pp.63-93, 2017

  19. arXiv:1609.02781  [pdf, other]

    cs.CV

    An empirical study on the effects of different types of noise in image classification tasks

    Authors: Gabriel B. Paranhos da Costa, Welinton A. Contato, Tiago S. Nazare, João E. S. Batista Neto, Moacir Ponti

    Abstract: Image classification is one of the main research problems in computer vision and machine learning. Since in most real-world image classification applications there is no control over how the images are captured, it is necessary to consider the possibility that these images might be affected by noise (e.g. sensor noise in a low-quality surveillance camera). In this paper we analyse the impact of th…

    Submitted 9 September, 2016; originally announced September 2016.

  20. arXiv:1207.1354  [pdf]

    cs.AI

    Of Starships and Klingons: Bayesian Logic for the 23rd Century

    Authors: Kathryn Blackmond Laskey, Paulo da Costa

    Abstract: Intelligent systems in an open world must reason about many interacting entities related to each other in diverse ways and having uncertain features and relationships. Traditional probabilistic languages lack the expressive power to handle relational domains. Classical first-order logic is sufficiently expressive, but lacks a coherent plausible reasoning capability. Recent years have seen the emer…

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

    Report number: UAI-P-2005-PG-346-353