Search | arXiv e-print repository

Feature-oriented Test Case Selection and Prioritization During the Evolution of Highly-Configurable Systems

Authors: Willian D. F. Mendonça, Wesley K. G. Assunção, Silvia R. Vergilio

Abstract: Testing Highly Configurable Systems (HCSs) is a challenging task, especially in an evolution scenario where features are added, changed, or removed, which hampers test case selection and prioritization. Existing work is usually based on the variability model, which is not always available or updated. Yet, the few existing approaches rely on links between test cases and changed files (or lines of c… ▽ More Testing Highly Configurable Systems (HCSs) is a challenging task, especially in an evolution scenario where features are added, changed, or removed, which hampers test case selection and prioritization. Existing work is usually based on the variability model, which is not always available or updated. Yet, the few existing approaches rely on links between test cases and changed files (or lines of code), not considering how features are implemented, usually spread over several and unchanged files. To overcome these limitations, we introduce FeaTestSelPrio, a feature-oriented test case selection and prioritization approach for HCSs. The approach links test cases to feature implementations, using HCS pre-processor directives, to select test cases based on features affected by changes in each commit. After, the test cases are prioritized according to the number of features they cover. Our approach selects a greater number of tests and takes longer to execute than a changed-file-oriented approach, used as baseline, but FeaTestSelPrio performs better regarding detected failures. By adding the approach execution time to the execution time of the selected test cases, we reached a reduction of $\approx$50%, in comparison with retest-all. The prioritization step allows reducing the average test budget in 86% of the failed commits. △ Less

Submitted 21 June, 2024; originally announced June 2024.

arXiv:2312.16854 [pdf, other]

doi 10.1145/3597503.3639164

TRIAD: Automated Traceability Recovery based on Biterm-enhanced Deduction of Transitive Links among Artifacts

Authors: Hui Gao, Hongyu Kuang, Wesley K. G. Assunção, Christoph Mayr-Dorn, Guo** Rong, He Zhang, Xiaoxing Ma, Alexander Egyed

Abstract: Traceability allows stakeholders to extract and comprehend the trace links among software artifacts introduced across the software life cycle, to provide significant support for software engineering tasks. Despite its proven benefits, software traceability is challenging to recover and maintain manually. Hence, plenty of approaches for automated traceability have been proposed. Most rely on textua… ▽ More Traceability allows stakeholders to extract and comprehend the trace links among software artifacts introduced across the software life cycle, to provide significant support for software engineering tasks. Despite its proven benefits, software traceability is challenging to recover and maintain manually. Hence, plenty of approaches for automated traceability have been proposed. Most rely on textual similarities among software artifacts, such as those based on Information Retrieval (IR). However, artifacts in different abstraction levels usually have different textual descriptions, which can greatly hinder the performance of IR-based approaches (e.g., a requirement in natural language may have a small textual similarity to a Java class). In this work, we leverage the consensual biterms and transitive relationships (i.e., inner- and outer-transitive links) based on intermediate artifacts to improve IR-based traceability recovery. We first extract and filter biterms from all source, intermediate, and target artifacts. We then use the consensual biterms from the intermediate artifacts to extend the biterms of both source and target artifacts, and finally deduce outer and inner-transitive links to adjust text similarities between source and target artifacts. We conducted a comprehensive empirical evaluation based on five systems widely used in other literature to show that our approach can outperform four state-of-the-art approaches, and how its performance is affected by different conditions of source, intermediate, and target artifacts. The results indicate that our approach can outperform baseline approaches in AP over 15% and MAP over 10% on average. △ Less

Submitted 16 January, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

Comments: Accepted by the 46th International Conference on Software Engineering (ICSE 2024)

arXiv:2304.10283 [pdf, other]

Is augmentation effective to improve prediction in imbalanced text datasets?

Authors: Gabriel O. Assunção, Rafael Izbicki, Marcos O. Prates

Abstract: Imbalanced datasets present a significant challenge for machine learning models, often leading to biased predictions. To address this issue, data augmentation techniques are widely used in natural language processing (NLP) to generate new samples for the minority class. However, in this paper, we challenge the common assumption that data augmentation is always necessary to improve predictions on i… ▽ More Imbalanced datasets present a significant challenge for machine learning models, often leading to biased predictions. To address this issue, data augmentation techniques are widely used in natural language processing (NLP) to generate new samples for the minority class. However, in this paper, we challenge the common assumption that data augmentation is always necessary to improve predictions on imbalanced datasets. Instead, we argue that adjusting the classifier cutoffs without data augmentation can produce similar results to oversampling techniques. Our study provides theoretical and empirical evidence to support this claim. Our findings contribute to a better understanding of the strengths and limitations of different approaches to dealing with imbalanced data, and help researchers and practitioners make informed decisions about which methods to use for a given task. △ Less

Submitted 20 April, 2023; originally announced April 2023.

Comments: 21 pages, 5 figures

arXiv:2302.06615 [pdf, other]

Self-mediated exploration in artificial intelligence inspired by cognitive psychology

Authors: Gustavo Assunção, Miguel Castelo-Branco, Paulo Menezes

Abstract: Exploration of the physical environment is an indispensable precursor to data acquisition and enables knowledge generation via analytical or direct trialing. Artificial Intelligence lacks the exploratory capabilities of even the most underdeveloped organisms, hindering its autonomy and adaptability. Supported by cognitive psychology, this works links human behavior and artificial agents to endorse… ▽ More Exploration of the physical environment is an indispensable precursor to data acquisition and enables knowledge generation via analytical or direct trialing. Artificial Intelligence lacks the exploratory capabilities of even the most underdeveloped organisms, hindering its autonomy and adaptability. Supported by cognitive psychology, this works links human behavior and artificial agents to endorse self-development. In accordance with reported data, paradigms of epistemic and achievement emotion are embedded to machine-learning methodology contingent on their impact when decision making. A study is subsequently designed to mirror previous human trials, which artificial agents are made to undergo repeatedly towards convergence. Results demonstrate causality, learned by the vast majority of agents, between their internal states and exploration to match those reported for human counterparts. The ramifications of these findings are pondered for both research into human cognition and betterment of artificial intelligence. △ Less

Submitted 13 February, 2023; originally announced February 2023.

Comments: 21 pages, 5 figures, journal

arXiv:2003.00063 [pdf, other]

doi 10.3390/app11083397

Bio-Inspired Modality Fusion for Active Speaker Detection

Authors: Gustavo Assunção, Nuno Gonçalves, Paulo Menezes

Abstract: Human beings have developed fantastic abilities to integrate information from various sensory sources exploring their inherent complementarity. Perceptual capabilities are therefore heightened, enabling, for instance, the well-known "cocktail party" and McGurk effects, i.e., speech disambiguation from a panoply of sound signals. This fusion ability is also key in refining the perception of sound s… ▽ More Human beings have developed fantastic abilities to integrate information from various sensory sources exploring their inherent complementarity. Perceptual capabilities are therefore heightened, enabling, for instance, the well-known "cocktail party" and McGurk effects, i.e., speech disambiguation from a panoply of sound signals. This fusion ability is also key in refining the perception of sound source location, as in distinguishing whose voice is being heard in a group conversation. Furthermore, neuroscience has successfully identified the superior colliculus region in the brain as the one responsible for this modality fusion, with a handful of biological models having been proposed to approach its underlying neurophysiological process. Deriving inspiration from one of these models, this paper presents a methodology for effectively fusing correlated auditory and visual information for active speaker detection. Such an ability can have a wide range of applications, from teleconferencing systems to social robotics. The detection approach initially routes auditory and visual information through two specialized neural network structures. The resulting embeddings are fused via a novel layer based on the superior colliculus, whose topological structure emulates spatial neuron cross-map** of unimodal perceptual fields. The validation process employed two publicly available datasets, with achieved results confirming and greatly surpassing initial expectations. △ Less

Submitted 13 April, 2021; v1 submitted 28 February, 2020; originally announced March 2020.

Journal ref: Appl. Sci. 2021, 11(8), 3397

arXiv:1904.01861 [pdf]

Adopting a software product line engineering approach in industrial development contexts: A protocol for a systematic literature review

Authors: José L. Barros-Justo, Luisa Rincón, Ángela Villota, Wesley K. G. Assunção

Abstract: The value of a systematic secondary study (a systematic map** study (SMS) or a systematic literature review (SLR)) comes, directly, from its systematic nature. The formal, well-defined, objective and unbiased process guarantees that the results from these systematically conducted studies are valid. This process is embodied in an action protocol, which must be agreed upon by all the researchers b… ▽ More The value of a systematic secondary study (a systematic map** study (SMS) or a systematic literature review (SLR)) comes, directly, from its systematic nature. The formal, well-defined, objective and unbiased process guarantees that the results from these systematically conducted studies are valid. This process is embodied in an action protocol, which must be agreed upon by all the researchers before conducting the secondary study. The protocol is, therefore, a detailed action plan, which contains all the tasks and the ordered sequence of steps to be executed. This document details that protocol for a SLR on the adoption of the software product line engineering (SPLE) approach in industrial development contexts. The goal of that SLR is to identify and analyse the benefits and drawbacks that this adoption has had in industrial development contexts, in contrast to the experiences reported in academic environments. △ Less

Submitted 3 April, 2019; originally announced April 2019.

Comments: 15 pages, 2 Figures and 4 Tables

Showing 1–6 of 6 results for author: Assunção, G