Skip to main content

Showing 1–50 of 147 results for author: Oliveira, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17526  [pdf, other

    cs.CL cs.IR

    LumberChunker: Long-Form Narrative Document Segmentation

    Authors: André V. Duarte, João Marques, Miguel Graça, Miguel Freire, Lei Li, Arlindo L. Oliveira

    Abstract: Modern NLP tasks increasingly rely on dense retrieval methods to access up-to-date and relevant contextual information. We are motivated by the premise that retrieval benefits from segments that can vary in size such that a content's semantic independence is better captured. We propose LumberChunker, a method leveraging an LLM to dynamically segment documents, which iteratively prompts the LLM to… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    ACM Class: I.2

  2. arXiv:2406.02748  [pdf, other

    cs.CV cs.AI

    Story Generation from Visual Inputs: Techniques, Related Tasks, and Challenges

    Authors: Daniel A. P. Oliveira, Eugénio Ribeiro, David Martins de Matos

    Abstract: Creating engaging narratives from visual data is crucial for automated digital media consumption, assistive technologies, and interactive entertainment. This survey covers methodologies used in the generation of these narratives, focusing on their principles, strengths, and limitations. The survey also covers tasks related to automatic story generation, such as image and video captioning, and vi… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    ACM Class: I.2.7; I.2.10

  3. arXiv:2405.18435  [pdf, other

    eess.IV cs.CV

    QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge

    Authors: Hongwei Bran Li, Fernando Navarro, Ivan Ezhov, Amirhossein Bayat, Dhritiman Das, Florian Kofler, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Xiaobin Hu, Benedikt Wiestler, Lucas Zimmer, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Jonas Weidner, Michelle Alonso-Basant, Arif Rashid, Ujjwal Baid, Wesam Adel, Deniz Ali, Bhakti Baheti, Yingbin Bai, Ishaan Bhatt, Sabri Can Cetindag , et al. (55 additional authors not shown)

    Abstract: Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de… ▽ More

    Submitted 24 June, 2024; v1 submitted 19 March, 2024; originally announced May 2024.

    Comments: initial technical report

  4. arXiv:2405.17202  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Efficient multi-prompt evaluation of LLMs

    Authors: Felipe Maia Polo, Ronald Xu, Lucas Weber, Mírian Silva, Onkar Bhardwaj, Leshem Choshen, Allysson Flavio Melo de Oliveira, Yuekai Sun, Mikhail Yurochkin

    Abstract: Most popular benchmarks for comparing LLMs rely on a limited set of prompt templates, which may not fully capture the LLMs' abilities and can affect the reproducibility of results on leaderboards. Many recent works empirically verify prompt sensitivity and advocate for changes in LLM evaluation. In this paper, we consider the problem of estimating the performance distribution across many prompt va… ▽ More

    Submitted 7 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  5. arXiv:2405.15645  [pdf, other

    cs.PF cs.DC

    An Online Probabilistic Distributed Tracing System

    Authors: M. Toslali, S. Qasim, S. Parthasarathy, F. A. Oliveira, H. Huang, G. Stringhini, Z. Liu, A. K. Coskun

    Abstract: Distributed tracing has become a fundamental tool for diagnosing performance issues in the cloud by recording causally ordered, end-to-end workflows of request executions. However, tracing in production workloads can introduce significant overheads due to the extensive instrumentation needed for identifying performance variations. This paper addresses the trade-off between the cost of tracing and… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  6. arXiv:2404.16049  [pdf, other

    physics.med-ph cs.CV cs.LG eess.IV eess.SP

    Exploring the limitations of blood pressure estimation using the photoplethysmography signal

    Authors: Felipe M. Dias, Diego A. C. Cardenas, Marcelo A. F. Toledo, Filipe A. C. Oliveira, Estela Ribeiro, Jose E. Krieger, Marco A. Gutierrez

    Abstract: Hypertension, a leading contributor to cardiovascular morbidity, underscores the need for accurate and continuous blood pressure (BP) monitoring. Photoplethysmography (PPG) presents a promising approach to this end. However, the precision of BP estimates derived from PPG signals has been the subject of ongoing debate, necessitating a comprehensive evaluation of their effectiveness and constraints.… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 17 pages, 7 figures, 3 tables

  7. arXiv:2404.06389  [pdf, other

    eess.IV cs.CV cs.CY cs.MS

    Raster Forge: Interactive Raster Manipulation Library and GUI for Python

    Authors: Afonso Oliveira, Nuno Fachada, João P. Matos-Carvalho

    Abstract: Raster Forge is a Python library and graphical user interface for raster data manipulation and analysis. The tool is focused on remote sensing applications, particularly in wildfire management. It allows users to import, visualize, and process raster layers for tasks such as image compositing or topographical analysis. For wildfire management, it generates fuel maps using predefined models. Its im… ▽ More

    Submitted 19 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    ACM Class: I.4; I.5; J.2; D.2; H.5.2

    Journal ref: Software Impacts, 20, 100657, 2024

  8. arXiv:2404.04385  [pdf, other

    cs.CR

    Reconfigurable and Scalable Honeynet for Cyber-Physical Systems

    Authors: Luís Sousa, José Cecílio, Pedro Ferreira, Alan Oliveira

    Abstract: Industrial Control Systems (ICS) constitute the backbone of contemporary industrial operations, ranging from modest heating, ventilation, and air conditioning systems to expansive national power grids. Given their pivotal role in critical infrastructure, there has been a concerted effort to enhance security measures and deepen our comprehension of potential cyber threats within this domain. To add… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  9. arXiv:2404.03754  [pdf, other

    eess.IV cs.CV physics.geo-ph

    Data Science for Geographic Information Systems

    Authors: Afonso Oliveira, Nuno Fachada, João P. Matos-Carvalho

    Abstract: The integration of data science into Geographic Information Systems (GIS) has facilitated the evolution of these tools into complete spatial analysis platforms. The adoption of machine learning and big data techniques has equipped these platforms with the capacity to handle larger amounts of increasingly complex data, transcending the limitations of more traditional approaches. This work traces th… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    ACM Class: I.2.10; I.4; I.5; J.2

  10. arXiv:2404.02659  [pdf, other

    cs.CV cs.NE

    A Satellite Band Selection Framework for Amazon Forest Deforestation Detection Task

    Authors: Eduardo Neto, Fabio A. Faria, Amanda A. S. de Oliveira, Álvaro L. Fazenda

    Abstract: The conservation of tropical forests is a topic of significant social and ecological relevance due to their crucial role in the global ecosystem. Unfortunately, deforestation and degradation impact millions of hectares annually, necessitating government or private initiatives for effective forest monitoring. This study introduces a novel framework that employs the Univariate Marginal Distribution… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 9 pages, 4 figures, paper accepted for presentation at GECCO 2024

  11. arXiv:2404.01446  [pdf, other

    cs.CV cs.AI

    Finding Regions of Interest in Whole Slide Images Using Multiple Instance Learning

    Authors: Martim Afonso, Praphulla M. S. Bhawsar, Monjoy Saha, Jonas S. Almeida, Arlindo L. Oliveira

    Abstract: Whole Slide Images (WSI), obtained by high-resolution digital scanning of microscope slides at multiple scales, are the cornerstone of modern Digital Pathology. However, they represent a particular challenge to AI-based/AI-mediated analysis because pathology labeling is typically done at slide-level, instead of tile-level. It is not just that medical diagnostics is recorded at the specimen level,… ▽ More

    Submitted 11 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

  12. arXiv:2403.08795  [pdf

    cs.HC cs.CL

    Ontologia para monitorar a deficiência mental em seus déficts no processamento da informação por declínio cognitivo e evitar agressões psicológicas e físicas em ambientes educacionais com ajuda da I.A*

    Authors: Bruna Araújo de Castro Oliveira

    Abstract: The intention of this article is to propose the use of artificial intelligence to detect through analysis by UFO ontology the emergence of verbal and physical aggression related to psychosocial deficiencies and their provoking agents, in an attempt to prevent catastrophic consequences within school environments.

    Submitted 31 January, 2024; originally announced March 2024.

    Comments: in Portuguese language. Minha vez de falar sobre a realidade

  13. arXiv:2402.18511  [pdf

    cs.RO

    Leveraging Compliant Tactile Perception for Haptic Blind Surface Reconstruction

    Authors: Laurent Yves Emile Ramos Cheret, Vinicius Prado da Fonseca, Thiago Eustaquio Alves de Oliveira

    Abstract: Non-flat surfaces pose difficulties for robots operating in unstructured environments. Reconstructions of uneven surfaces may only be partially possible due to non-compliant end-effectors and limitations on vision systems such as transparency, reflections, and occlusions. This study achieves blind surface reconstruction by harnessing the robotic manipulator's kinematic data and a compliant tactile… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 7 pages, 9 figures, 2024 IEEE International Conference on Robotics and Automation (ICRA 2024)

  14. arXiv:2402.10889  [pdf, other

    cs.CR

    Evaluation of EAP Usage for Authenticating Eduroam Users in 5G Networks

    Authors: Leonardo Azalim de Oliveira, Edelberto Franco Silva

    Abstract: The fifth generation of the telecommunication networks (5G) established the service-oriented paradigm on the mobile networks. In this new context, the 5G Core component has become extremely flexible so, in addition to serving mobile networks, it can also be used to connect devices from the so-called non-3GPP networks, which contains technologies such as WiFi. The implementation of this connectivit… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    ACM Class: C.2.0

  15. arXiv:2402.09910  [pdf, other

    cs.CL cs.LG

    DE-COP: Detecting Copyrighted Content in Language Models Training Data

    Authors: André V. Duarte, Xuandong Zhao, Arlindo L. Oliveira, Lei Li

    Abstract: How can we detect if copyrighted content was used in the training process of a language model, considering that the training data is typically undisclosed? We are motivated by the premise that a language model is likely to identify verbatim excerpts from its training text. We propose DE-COP, a method to determine whether a piece of copyrighted content was included in training. DE-COP's core approa… ▽ More

    Submitted 25 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    ACM Class: I.2

  16. arXiv:2402.06653  [pdf, other

    cs.LG physics.ao-ph

    Using remotely sensed data for air pollution assessment

    Authors: Teresa Bernardino, Maria Alexandra Oliveira, João Nuno Silva

    Abstract: Air pollution constitutes a global problem of paramount importance that affects not only human health, but also the environment. The existence of spatial and temporal data regarding the concentrations of pollutants is crucial for performing air pollution studies and monitor emissions. However, although observation data presents great temporal coverage, the number of stations is very limited and th… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  17. arXiv:2402.04884  [pdf, other

    cs.DB

    Topological relations in water quality monitoring

    Authors: Bruno Chaves Figueiredo, Maria Alexandra Oliveira, João Nuno Silva

    Abstract: The Alqueva Multi-Purpose Project (EFMA) is a massive abduction and storage infrastructure system in the Alentejo, which has a water quality monitoring network with almost thousands of water quality stations distributed across three subsystems: Alqueva, Pedrogão, and Ardila. Identification of pollution sources in complex infrastructure systems, such as the EFMA, requires recognition of water flow… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  18. arXiv:2402.02582  [pdf, other

    cs.CY cs.DB

    On the development of an application for the compilation of global sea level changes

    Authors: Mihir Odhavji, Maria Alexandra Oliveira, João Nuno Silva

    Abstract: There is a lot of data about mean sea level variation from studies conducted around the globe. This data is dispersed, lacks organization along with standardization, and in most cases, it is not available online. In some instances, when it is available, it is often in unpractical ways and different formats. Analyzing it would be inefficient and very time-consuming. In addition to all of that, to s… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  19. arXiv:2401.12980  [pdf, other

    cs.CL

    Identifying Risk Patterns in Brazilian Police Reports Preceding Femicides: A Long Short Term Memory (LSTM) Based Analysis

    Authors: Vinicius Lima, Jaque Almeida de Oliveira

    Abstract: Femicide refers to the killing of a female victim, often perpetrated by an intimate partner or family member, and is also associated with gender-based violence. Studies have shown that there is a pattern of escalating violence leading up to these killings, highlighting the potential for prevention if the level of danger to the victim can be assessed. Machine learning offers a promising approach to… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: IEEE Global Humanitarian Technology Conference (GHTC) 2023

  20. arXiv:2401.05891  [pdf, other

    cs.CV

    LiDAR data acquisition and processing for ecology applications

    Authors: Ion Ciobotari, Adriana Príncipe, Maria Alexandra Oliveira, João Nuno Silva

    Abstract: The collection of ecological data in the field is essential to diagnose, monitor and manage ecosystems in a sustainable way. Since acquisition of this information through traditional methods are generally time-consuming, due to the capability of recording large volumes of data in short time periods, automation of data acquisition sees a growing trend. Terrestrial laser scanners (TLS), particularly… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  21. arXiv:2401.03005  [pdf, other

    physics.soc-ph cs.CV

    Evolution of urban areas and land surface temperature

    Authors: Sudipan Saha, Tushar Verma, Dario Augusto Borges Oliveira

    Abstract: With the global population on the rise, our cities have been expanding to accommodate the growing number of people. The expansion of cities generally leads to the engulfment of peripheral areas. However, such expansion of urban areas is likely to cause increment in areas with increased land surface temperature (LST). By considering each summer as a data point, we form LST multi-year time-series an… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  22. arXiv:2312.09358  [pdf, other

    cs.SI physics.soc-ph

    Echo chamber formation sharpened by priority users

    Authors: Henrique F. de Arruda, Kleber A. Oliveira, Yamir Moreno

    Abstract: Priority users (e.g., verified profiles on Twitter) are social media users whose content is promoted by recommendation algorithms. However, the impact of this heterogeneous user influence on opinion dynamics, such as polarization phenomena, is unknown. We conduct a computational mechanistic investigation of such consequences in a stylized setting. First, we allow priority users, whose content has… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  23. arXiv:2311.08547  [pdf, other

    cs.AI

    DeepThought: An Architecture for Autonomous Self-motivated Systems

    Authors: Arlindo L. Oliveira, Tiago Domingos, Mário Figueiredo, Pedro U. Lima

    Abstract: The ability of large language models (LLMs) to engage in credible dialogues with humans, taking into account the training data and the context of the conversation, has raised discussions about their ability to exhibit intrinsic motivations, agency, or even some degree of consciousness. We argue that the internal architecture of LLMs and their finite and volatile state cannot support any of these p… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

    ACM Class: I.2

  24. arXiv:2311.02082  [pdf

    cs.AI cs.IR

    Semantic Modelling of Organizational Knowledge as a Basis for Enterprise Data Governance 4.0 -- Application to a Unified Clinical Data Model

    Authors: Miguel AP Oliveira, Stephane Manara, Bruno Molé, Thomas Muller, Aurélien Guillouche, Lysann Hesske, Bruce Jordan, Gilles Hubert, Chinmay Kulkarni, Pralipta Jagdev, Cedric R. Berger

    Abstract: Individuals and organizations cope with an always-growing amount of data, which is heterogeneous in its contents and formats. An adequate data management process yielding data quality and control over its lifecycle is a prerequisite to getting value out of this data and minimizing inherent risks related to multiple usages. Common data governance frameworks rely on people, policies, and processes t… ▽ More

    Submitted 23 November, 2023; v1 submitted 20 October, 2023; originally announced November 2023.

  25. arXiv:2310.12112  [pdf, other

    cs.CR cs.AI cs.LG

    A Cautionary Tale: On the Role of Reference Data in Empirical Privacy Defenses

    Authors: Caelin G. Kaplan, Chuan Xu, Othmane Marfoq, Giovanni Neglia, Anderson Santana de Oliveira

    Abstract: Within the realm of privacy-preserving machine learning, empirical privacy defenses have been proposed as a solution to achieve satisfactory levels of training data privacy without a significant drop in model utility. Most existing defenses against membership inference attacks assume access to reference data, defined as an additional dataset coming from the same (or a similar) underlying distribut… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

  26. arXiv:2310.10575  [pdf, other

    cs.CV q-bio.NC

    Matching the Neuronal Representations of V1 is Necessary to Improve Robustness in CNNs with V1-like Front-ends

    Authors: Ruxandra Barbulescu, Tiago Marques, Arlindo L. Oliveira

    Abstract: While some convolutional neural networks (CNNs) have achieved great success in object recognition, they struggle to identify objects in images corrupted with different types of common noise patterns. Recently, it was shown that simulating computations in early visual areas at the front of CNNs leads to improvements in robustness to image corruptions. Here, we further explore this result and show t… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  27. arXiv:2309.01751  [pdf, other

    eess.IV cs.CV physics.geo-ph

    Multispectral Indices for Wildfire Management

    Authors: Afonso Oliveira, João P. Matos-Carvalho, Filipe Moutinho, Nuno Fachada

    Abstract: This paper highlights and summarizes the most important multispectral indices and associated methodologies for fire management. Various fields of study are examined where multispectral indices align with wildfire prevention and management, including vegetation and soil attribute extraction, water feature map**, artificial structure identification, and post-fire burnt area estimation. The versati… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

    ACM Class: I.2.10; I.4; I.5; J.2

  28. arXiv:2308.16323  [pdf, other

    eess.IV cs.CV cs.HC

    Software multiplataforma para a segmentação de vasos sanguíneos em imagens da retina

    Authors: João Henrique Pereira Machado, Gilson Adamczuk Oliveira, Érick Oliveira Rodrigues

    Abstract: In this work, we utilize image segmentation to visually identify blood vessels in retinal examination images. This process is typically carried out manually. However, we can employ heuristic methods and machine learning to automate or at least expedite the process. In this context, we propose a cross-platform, open-source, and responsive software that allows users to manually segment a retinal ima… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: in Portuguese language. International Conference on Production Research - Americas 2022. https://www.even3.com.br/anais/foreigners_subscription_icpr_americas22/664603-software-multiplataforma-para-a-segmentacao-de-vasos-sanguineos-em-imagens-da-retina/

  29. arXiv:2308.05759  [pdf, ps, other

    eess.SP cs.AI cs.LG

    A machine-learning sleep-wake classification model using a reduced number of features derived from photoplethysmography and activity signals

    Authors: Douglas A. Almeida, Felipe M. Dias, Marcelo A. F. Toledo, Diego A. C. Cardenas, Filipe A. C. Oliveira, Estela Ribeiro, Jose E. Krieger, Marco A. Gutierrez

    Abstract: Sleep is a crucial aspect of our overall health and well-being. It plays a vital role in regulating our mental and physical health, impacting our mood, memory, and cognitive function to our physical resilience and immune system. The classification of sleep stages is a mandatory step to assess sleep quality, providing the metrics to estimate the quality of sleep and how well our body is functioning… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: 8 pages, 3 figures

  30. arXiv:2308.03584  [pdf, other

    cs.DB

    A Polystore Architecture Using Knowledge Graphs to Support Queries on Heterogeneous Data Stores

    Authors: Leonardo Guerreiro Azevedo, Renan Francisco Santos Souza, Elton F. de S. Soares, Raphael M. Thiago, Julio Cesar Cardoso Tesolin, Ann C. Oliveira, Marcio Ferreira Moreno

    Abstract: Modern applications commonly need to manage dataset types composed of heterogeneous data and schemas, making it difficult to access them in an integrated way. A single data store to manage heterogeneous data using a common data model is not effective in such a scenario, which results in the domain data being fragmented in the data stores that best fit their storage and access requirements (e.g., N… ▽ More

    Submitted 15 March, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: Reference the paper as L. G. Azevedo, R. Souza, E. F. de S. Soares, R. M. Thiago, J. C. D. Tesolin, A. C. Oliveira, M. F. Moreno, A Polystore Architecture Using Knowledge Graphs to Support Queries on Heterogeneous Data Stores. Proceedings of 20th Brazilian Symposium in Information Systems, 2024 (to be published)

  31. arXiv:2308.01930  [pdf, other

    cs.LG cs.AI eess.SP

    Machine Learning-Based Diabetes Detection Using Photoplethysmography Signal Features

    Authors: Filipe A. C. Oliveira, Felipe M. Dias, Marcelo A. F. Toledo, Diego A. C. Cardenas, Douglas A. Almeida, Estela Ribeiro, Jose E. Krieger, Marco A. Gutierrez

    Abstract: Diabetes is a prevalent chronic condition that compromises the health of millions of people worldwide. Minimally invasive methods are needed to prevent and control diabetes but most devices for measuring glucose levels are invasive and not amenable for continuous monitoring. Here, we present an alternative method to overcome these shortcomings based on non-invasive optical photoplethysmography (PP… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: 11 pages, 6 figures

  32. arXiv:2307.10018  [pdf, other

    cs.RO cs.AI

    RobôCIn Small Size League Extended Team Description Paper for RoboCup 2023

    Authors: Aline Lima de Oliveira, Cauê Addae da Silva Gomes, Cecília Virginia Santos da Silva, Charles Matheus de Sousa Alves, Danilo Andrade Martins de Souza, Driele Pires Ferreira Araújo Xavier, Edgleyson Pereira da Silva, Felipe Bezerra Martins, Lucas Henrique Cavalcanti Santos, Lucas Dias Maciel, Matheus Paixão Gumercindo dos Santos, Matheus Lafayette Vasconcelos, Matheus Vinícius Teotonio do Nascimento Andrade, João Guilherme Oliveira Carvalho de Melo, João Pedro Souza Pereira de Moura, José Ronald da Silva, José Victor Silva Cruz, Pedro Henrique Santana de Morais, Pedro Paulo Salman de Oliveira, Riei Joaquim Matos Rodrigues, Roberto Costa Fernandes, Ryan Vinicius Santos Morais, Tamara Mayara Ramos Teobaldo, Washington Igor dos Santos Silva, Edna Natividade Silva Barros

    Abstract: RobôCIn has participated in RoboCup Small Size League since 2019, won its first world title in 2022 (Division B), and is currently a three-times Latin-American champion. This paper presents our improvements to defend the Small Size League (SSL) division B title in RoboCup 2023 in Bordeaux, France. This paper aims to share some of the academic research that our team developed over the past year. Ou… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  33. arXiv:2307.08766  [pdf, other

    cs.LG cs.AI eess.SP

    Quality Assessment of Photoplethysmography Signals For Cardiovascular Biomarkers Monitoring Using Wearable Devices

    Authors: Felipe M. Dias, Marcelo A. F. Toledo, Diego A. C. Cardenas, Douglas A. Almeida, Filipe A. C. Oliveira, Estela Ribeiro, Jose E. Krieger, Marco A. Gutierrez

    Abstract: Photoplethysmography (PPG) is a non-invasive technology that measures changes in blood volume in the microvascular bed of tissue. It is commonly used in medical devices such as pulse oximeters and wrist worn heart rate monitors to monitor cardiovascular hemodynamics. PPG allows for the assessment of parameters (e.g., heart rate, pulse waveform, and peripheral perfusion) that can indicate condition… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 9 pages

  34. arXiv:2307.02300  [pdf, other

    cs.LG cs.IR

    Improving Address Matching using Siamese Transformer Networks

    Authors: André V. Duarte, Arlindo L. Oliveira

    Abstract: Matching addresses is a critical task for companies and post offices involved in the processing and delivery of packages. The ramifications of incorrectly delivering a package to the wrong recipient are numerous, ranging from harm to the company's reputation to economic and environmental costs. This research introduces a deep learning-based model designed to increase the efficiency of address matc… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: To be published in the 22nd EPIA Conference on Artificial Intelligence, EPIA 2023, Faial Island - Azores, Portugal, 5-8 September 2023, Proceedings

    ACM Class: I.2

  35. arXiv:2306.06834  [pdf, other

    cs.SE

    Motivational models for validating agile requirements in Software Engineering subjects

    Authors: Eduardo A. Oliveira, Leon Sterling

    Abstract: This paper describes how motivational models can be used to cross check agile requirements artifacts to improve consistency and completeness of software requirements. Motivational models provide a high level understanding of the purposes of a software system. They complement personas and user stories which focus more on user needs rather than on system features. We present an exploratory case stud… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: 9 pages, 2 figures, SERP'21 - The 19th International Conference on Software Engineering Research and Practice

  36. arXiv:2305.09904  [pdf, ps, other

    cs.LG eess.SY

    On the ISS Property of the Gradient Flow for Single Hidden-Layer Neural Networks with Linear Activations

    Authors: Arthur Castello B. de Oliveira, Milad Siami, Eduardo D. Sontag

    Abstract: Recent research in neural networks and machine learning suggests that using many more parameters than strictly required by the initial complexity of a regression problem can result in more accurate or faster-converging models -- contrary to classical statistical belief. This phenomenon, sometimes known as ``benign overfitting'', raises questions regarding in what other ways might overparameterizat… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 10 pages, 1 figure, extended conference version

  37. arXiv:2305.06129  [pdf, other

    cs.SE

    Do code refactorings influence the merge effort?

    Authors: Andre Oliveira, Vania Neves, Alexandre Plastino, Ana Carla Bibiano, Alessandro Garcia, Leonardo Murta

    Abstract: In collaborative software development, multiple contributors frequently change the source code in parallel to implement new features, fix bugs, refactor existing code, and make other changes. These simultaneous changes need to be merged into the same version of the source code. However, the merge operation can fail, and developer intervention is required to resolve the conflicts. Studies in the li… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: 11 pages + 2 for citations, 7 figures, 3 tables. Preprint of a paper that will be published in the IEEE/ACM 45th International Conference on Software Engineering (ICSE 2023) - Authors' version of the work

  38. arXiv:2304.12226  [pdf, other

    cs.IT

    Algebraic and Geometric Characterizations Related to the Quantization Problem of the $C_{2,8}$ Channel

    Authors: Anderson José de Oliveira, Giuliano Gadioli La Guardia, Reginaldo Palazzo Jr., Clarice Dias de Albuquerque, Cátia Regina de Oliveira Quilles Queiroz, Leandro Bezerra de Lima, Vandenberg Lopes Vieira

    Abstract: In this paper, we consider the steps to be followed in the analysis and interpretation of the quantization problem related to the $C_{2,8}$ channel, where the Fuchsian differential equations, the generators of the Fuchsian groups, and the tessellations associated with the cases $g=2$ and $g=3$, related to the hyperbolic case, are determined. In order to obtain these results, it is necessary to det… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: 31 pages, 9 figures

  39. arXiv:2303.08572  [pdf, other

    cs.LG cs.CL cs.IT

    Distinguishing Cause from Effect on Categorical Data: The Uniform Channel Model

    Authors: Mário A. T. Figueiredo, Catarina A. Oliveira

    Abstract: Distinguishing cause from effect using observations of a pair of random variables is a core problem in causal discovery. Most approaches proposed for this task, namely additive noise models (ANM), are only adequate for quantitative data. We propose a criterion to address the cause-effect problem with categorical variables (living in sets with no meaningful order), inspired by seeing a conditional… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: 20 pages, 2 appendices

    MSC Class: 62D20

  40. arXiv:2303.07975  [pdf, other

    cs.CR

    Software-based security approach for networked embedded devices

    Authors: José Ferreira, Alan Oliveira, André Souto, José Cecílio

    Abstract: As the Internet of Things (IoT) continues to expand, data security has become increasingly important for ensuring privacy and safety, especially given the sensitive and, sometimes, critical nature of the data handled by IoT devices. There exist hardware-based trusted execution environments used to protect data, but they are not compatible with low-cost devices that lack hardware-assisted security… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: 4

  41. arXiv:2302.02910  [pdf, other

    cs.LG

    An Empirical Analysis of Fairness Notions under Differential Privacy

    Authors: Anderson Santana de Oliveira, Caelin Kaplan, Khawla Mallat, Tanmay Chakraborty

    Abstract: Recent works have shown that selecting an optimal model architecture suited to the differential privacy setting is necessary to achieve the best possible utility for a given privacy budget using differentially private stochastic gradient descent (DP-SGD)(Tramer and Boneh 2020; Cheng et al. 2022). In light of these findings, we empirically analyse how different fairness notions, belonging to distin… ▽ More

    Submitted 6 February, 2023; originally announced February 2023.

    Comments: Accepted for oral presentation at the The Fourth AAAI Workshop on Privacy-Preserving Artificial Intelligence (PPAI-23) https://aaai-ppai23.github.io/#accepted_papers

  42. arXiv:2301.10608  [pdf, other

    cs.CV cs.LG

    Connecting metrics for shape-texture knowledge in computer vision

    Authors: Tiago Oliveira, Tiago Marques, Arlindo L. Oliveira

    Abstract: Modern artificial neural networks, including convolutional neural networks and vision transformers, have mastered several computer vision tasks, including object recognition. However, there are many significant differences between the behavior and robustness of these systems and of the human visual system. Deep neural networks remain brittle and susceptible to many changes in the image that do not… ▽ More

    Submitted 25 January, 2023; originally announced January 2023.

    Comments: 7 pages, 3 figures

  43. arXiv:2212.08568  [pdf, other

    cs.CV cs.LG

    Biomedical image analysis competitions: The state of current participation practice

    Authors: Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Patrick Godau, Veronika Cheplygina, Michal Kozubek, Sharib Ali, Anubha Gupta, Jan Kybic, Alison Noble, Carlos Ortiz de Solórzano, Samiksha Pachade, Caroline Petitjean, Daniel Sage, Donglai Wei, Elizabeth Wilden, Deepak Alapatt, Vincent Andrearczyk, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano , et al. (331 additional authors not shown)

    Abstract: The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis,… ▽ More

    Submitted 12 September, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

  44. arXiv:2211.02627  [pdf

    eess.SP cs.AI cs.LG

    An IoT Cloud and Big Data Architecture for the Maintenance of Home Appliances

    Authors: Pedro Chaves, Tiago Fonseca, Luis Lino Ferreira, Bernardo Cabral, Orlando Sousa, Andre Oliveira, Jorge Landeck

    Abstract: Billions of interconnected Internet of Things (IoT) sensors and devices collect tremendous amounts of data from real-world scenarios. Big data is generating increasing interest in a wide range of industries. Once data is analyzed through compute-intensive Machine Learning (ML) methods, it can derive critical business value for organizations. Powerfulplatforms are essential to handle and process su… ▽ More

    Submitted 25 October, 2022; originally announced November 2022.

    Comments: 6 pages, 6 figures, IECON 2022

  45. arXiv:2210.13167  [pdf, other

    cs.CV

    Exploring Self-Attention for Crop-type Classification Explainability

    Authors: Ivica Obadic, Ribana Roscher, Dario Augusto Borges Oliveira, Xiao Xiang Zhu

    Abstract: Automated crop-type classification using Sentinel-2 satellite time series is essential to support agriculture monitoring. Recently, deep learning models based on transformer encoders became a promising approach for crop-type classification. Using explainable machine learning to reveal the inner workings of these models is an important step towards improving stakeholders' trust and efficient agricu… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

  46. arXiv:2210.11327  [pdf, other

    cs.LG stat.ML

    Improving Data Quality with Training Dynamics of Gradient Boosting Decision Trees

    Authors: Moacir Antonelli Ponti, Lucas de Angelis Oliveira, Mathias Esteban, Valentina Garcia, Juan Martín Román, Luis Argerich

    Abstract: Real world datasets contain incorrectly labeled instances that hamper the performance of the model and, in particular, the ability to generalize out of distribution. Also, each example might have different contribution towards learning. This motivates studies to better understanding of the role of data instances with respect to their contribution in good metrics in models. In this paper we propose… ▽ More

    Submitted 22 February, 2024; v1 submitted 20 October, 2022; originally announced October 2022.

  47. arXiv:2209.10901  [pdf, other

    cs.LG

    Pretraining the Vision Transformer using self-supervised methods for vision based Deep Reinforcement Learning

    Authors: Manuel Goulão, Arlindo L. Oliveira

    Abstract: The Vision Transformer architecture has shown to be competitive in the computer vision (CV) space where it has dethroned convolution-based networks in several benchmarks. Nevertheless, convolutional neural networks (CNN) remain the preferential architecture for the representation module in reinforcement learning. In this work, we study pretraining a Vision Transformer using several state-of-the-ar… ▽ More

    Submitted 18 July, 2023; v1 submitted 22 September, 2022; originally announced September 2022.

  48. arXiv:2209.07928  [pdf, other

    cs.AI cs.CL eess.SY

    The BLue Amazon Brain (BLAB): A Modular Architecture of Services about the Brazilian Maritime Territory

    Authors: Paulo Pirozelli, Ais B. R. Castro, Ana Luiza C. de Oliveira, André S. Oliveira, Flávio N. Cação, Igor C. Silveira, João G. M. Campos, Laura C. Motheo, Leticia F. Figueiredo, Lucas F. A. O. Pellicer, Marcelo A. José, Marcos M. José, Pedro de M. Ligabue, Ricardo S. Grava, Rodrigo M. Tavares, Vinícius B. Matos, Yan V. Sym, Anna H. R. Costa, Anarosa A. F. Brandão, Denis D. Mauá, Fabio G. Cozman, Sarajane M. Peres

    Abstract: We describe the first steps in the development of an artificial agent focused on the Brazilian maritime territory, a large region within the South Atlantic also known as the Blue Amazon. The "BLue Amazon Brain" (BLAB) integrates a number of services aimed at disseminating information about this region and its importance, functioning as a tool for environmental awareness. The main service provided… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

    Journal ref: AI: Modeling Oceans and Climate Change (IJCAI-ECAI), 2022

  49. arXiv:2209.06932  [pdf, other

    cs.LG

    Optimizing Connectivity through Network Gradients for Restricted Boltzmann Machines

    Authors: A. C. N. de Oliveira, D. R. Figueiredo

    Abstract: Leveraging sparse networks to connect successive layers in deep neural networks has recently been shown to provide benefits to large scale state-of-the-art models. However, network connectivity also plays a significant role on the learning performance of shallow networks, such as the classic Restricted Boltzmann Machines (RBM). Efficiently finding sparse connectivity patterns that improve the lear… ▽ More

    Submitted 3 December, 2022; v1 submitted 14 September, 2022; originally announced September 2022.

  50. arXiv:2208.11607  [pdf, other

    cs.CV

    Learning crop type map** from regional label proportions in large-scale SAR and optical imagery

    Authors: Laura E. C. La Rosa, Dario A. B. Oliveira, Pedram Ghamisi

    Abstract: The application of deep learning algorithms to Earth observation (EO) in recent years has enabled substantial progress in fields that rely on remotely sensed data. However, given the data scale in EO, creating large datasets with pixel-level annotations by experts is expensive and highly time-consuming. In this context, priors are seen as an attractive way to alleviate the burden of manual labelin… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.