Skip to main content

Showing 1–50 of 69 results for author: Papadakis, M

.
  1. arXiv:2406.09843  [pdf, other

    cs.SE

    An Exploratory Study on Using Large Language Models for Mutation Testing

    Authors: Bo Wang, Mingda Chen, Youfang Lin, Mike Papadakis, Jie M. Zhang

    Abstract: The question of how to generate high-utility mutations, to be used for testing purposes, forms a key challenge in mutation testing literature. %Existing approaches rely either on human-specified syntactic rules or learning-based approaches, all of which produce large numbers of redundant mutants. Large Language Models (LLMs) have shown great potential in code-related tasks but their utility in mut… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 13 pages, 3 figures

    ACM Class: D.2.5

  2. arXiv:2402.15769  [pdf, other

    cs.SE cs.AI

    Importance Guided Data Augmentation for Neural-Based Code Understanding

    Authors: Zeming Dong, Qiang Hu, Xiaofei Xie, Maxime Cordy, Mike Papadakis, Jianjun Zhao

    Abstract: Pre-trained code models lead the era of code intelligence. Many models have been designed with impressive performance recently. However, one important problem, data augmentation for code data that automatically helps developers prepare training data lacks study in the field of code learning. In this paper, we introduce a general data augmentation framework, GenCode, to enhance the training of code… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  3. arXiv:2402.05102  [pdf, other

    cs.SE

    You Can REST Now: Automated Specification Inference and Black-Box Testing of RESTful APIs with Large Language Models

    Authors: Alix Decrop, Gilles Perrouin, Mike Papadakis, Xavier Devroey, Pierre-Yves Schobbens

    Abstract: RESTful APIs are popular web services, requiring documentation to ease their comprehension, reusability and testing practices. The OpenAPI Specification (OAS) is a widely adopted and machine-readable format used to document such APIs. However, manually documenting RESTful APIs is a time-consuming and error-prone task, resulting in unavailable, incomplete, or imprecise documentation. As RESTful API… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  4. arXiv:2309.05381  [pdf, other

    cs.SE cs.AI

    Hazards in Deep Learning Testing: Prevalence, Impact and Recommendations

    Authors: Salah Ghamizi, Maxime Cordy, Yuejun Guo, Mike Papadakis, And Yves Le Traon

    Abstract: Much research on Machine Learning testing relies on empirical studies that evaluate and show their potential. However, in this context empirical results are sensitive to a number of parameters that can adversely impact the results of the experiments and potentially lead to wrong conclusions (Type I errors, i.e., incorrectly rejecting the Null Hypothesis). To this end, we survey the related literat… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  5. arXiv:2308.15276  [pdf, other

    cs.SE

    Large Language Models in Fault Localisation

    Authors: Yonghao Wu, Zheng Li, Jie M. Zhang, Mike Papadakis, Mark Harman, Yong Liu

    Abstract: Large Language Models (LLMs) have shown promise in multiple software engineering tasks including code generation, program repair, code summarisation, and test generation. Fault localisation is instrumental in enabling automated debugging and repair of programs and was prominently featured as a highlight during the launch event of ChatGPT-4. Nevertheless, the performance of LLMs compared to state-o… ▽ More

    Submitted 2 October, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

  6. arXiv:2308.01314  [pdf, other

    cs.LG cs.SE stat.ML

    Evaluating the Robustness of Test Selection Methods for Deep Neural Networks

    Authors: Qiang Hu, Yuejun Guo, Xiaofei Xie, Maxime Cordy, Wei Ma, Mike Papadakis, Yves Le Traon

    Abstract: Testing deep learning-based systems is crucial but challenging due to the required time and labor for labeling collected raw data. To alleviate the labeling effort, multiple test selection methods have been proposed where only a subset of test data needs to be labeled while satisfying testing requirements. However, we observe that such methods with reported promising results are only evaluated und… ▽ More

    Submitted 29 July, 2023; originally announced August 2023.

    Comments: 12 pages

  7. arXiv:2307.12155  [pdf, ps, other

    cs.CL

    Identifying Misinformation on YouTube through Transcript Contextual Analysis with Transformer Models

    Authors: Christos Christodoulou, Nikos Salamanos, Pantelitsa Leonidou, Michail Papadakis, Michael Sirivianos

    Abstract: Misinformation on YouTube is a significant concern, necessitating robust detection strategies. In this paper, we introduce a novel methodology for video classification, focusing on the veracity of the content. We convert the conventional video classification task into a text classification task by leveraging the textual content derived from the video transcripts. We employ advanced machine learnin… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

  8. arXiv:2306.01250  [pdf, other

    cs.SE

    Active Code Learning: Benchmarking Sample-Efficient Training of Code Models

    Authors: Qiang Hu, Yuejun Guo, Xiaofei Xie, Maxime Cordy, Lei Ma, Mike Papadakis, Yves Le Traon

    Abstract: The costly human effort required to prepare the training data of machine learning (ML) models hinders their practical development and usage in software engineering (ML4Code), especially for those with limited budgets. Therefore, efficiently training models of code with less human effort has become an emergent problem. Active learning is such a technique to address this issue that allows developers… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 12 pages, ongoing work

  9. arXiv:2305.08592  [pdf, other

    cs.SE

    Time-based Repair for Asynchronous Wait Flaky Tests in Web Testing

    Authors: Yu Pei, Jeongju Sohn, Sarra Habchi, Mike Papadakis

    Abstract: Asynchronous waits are one of the most prevalent root causes of flaky tests and a major time-influential factor of web application testing. To investigate the characteristics of asynchronous wait flaky tests and their fixes in web testing, we build a dataset of 49 reproducible flaky tests, from 26 open-source projects, caused by asynchronous waits, along with their corresponding developer-written… ▽ More

    Submitted 19 May, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

  10. arXiv:2303.06808  [pdf, other

    cs.SE cs.AI

    Boosting Source Code Learning with Data Augmentation: An Empirical Study

    Authors: Zeming Dong, Qiang Hu, Yuejun Guo, Zhenya Zhang, Maxime Cordy, Mike Papadakis, Yves Le Traon, Jianjun Zhao

    Abstract: The next era of program understanding is being propelled by the use of machine learning to solve software problems. Recent studies have shown surprising results of source code learning, which applies deep neural networks (DNNs) to various critical software tasks, e.g., bug detection and clone detection. This success can be greatly attributed to the utilization of massive high-quality training data… ▽ More

    Submitted 12 March, 2023; originally announced March 2023.

  11. arXiv:2303.05213  [pdf, other

    cs.SE

    ACoRe: Automated Goal-Conflict Resolution

    Authors: Luiz Carvalho, Renzo Degiovanni, Matìas Brizzio, Maxime Cordy, Nazareno Aguirre, Yves Le Traon, Mike Papadakis

    Abstract: System goals are the statements that, in the context of software requirements specification, capture how the software should behave. Many times, the understanding of stakeholders on what the system should do, as captured in the goals, can lead to different problems, from clearly contradicting goals, to more subtle situations in which the satisfaction of some goals inhibits the satisfaction of othe… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

  12. arXiv:2303.04247  [pdf, other

    cs.SE cs.CR

    Vulnerability Mimicking Mutants

    Authors: Aayush Garg, Renzo Degiovanni, Mike Papadakis, Yves Le Traon

    Abstract: With the increasing release of powerful language models trained on large code corpus (e.g. CodeBERT was trained on 6.4 million programs), a new family of mutation testing tools has arisen with the promise to generate more "natural" mutants in the sense that the mutated code aims at following the implicit rules and coding conventions typically produced by programmers. In this paper, we study to wha… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: text overlap with arXiv:2301.12284

  13. arXiv:2302.10594  [pdf, other

    cs.SE

    The Importance of Discerning Flaky from Fault-triggering Test Failures: A Case Study on the Chromium CI

    Authors: Guillaume Haben, Sarra Habchi, Mike Papadakis, Maxime Cordy, Yves Le Traon

    Abstract: Flaky tests are tests that pass and fail on different executions of the same version of a program under test. They waste valuable developer time by making developers investigate false alerts (flaky test failures). To deal with this problem, many prediction methods that identify flaky tests have been proposed. While promising, the actual utility of these methods remains unclear since they have not… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

  14. arXiv:2302.02907  [pdf, other

    cs.CV cs.CR cs.LG

    GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks

    Authors: Salah Ghamizi, **gfeng Zhang, Maxime Cordy, Mike Papadakis, Masashi Sugiyama, Yves Le Traon

    Abstract: While leveraging additional training data is well established to improve adversarial robustness, it incurs the unavoidable cost of data collection and the heavy computation to train models. To mitigate the costs, we propose Guided Adversarial Training (GAT), a novel adversarial training technique that exploits auxiliary tasks under a limited set of training data. Our approach extends single-task m… ▽ More

    Submitted 25 May, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  15. arXiv:2301.12403  [pdf, other

    cs.SE

    Specification Inference for Evolving Systems

    Authors: Renzo Degiovanni, Facundo Molina, Agustin Nolasco, Nazareno Aguirre, Mike Papadakis

    Abstract: In this paper, we propose an assertion-based approach to capture software evolution, through the notion of commit-relevant specification. A commit-relevant specification summarises the program properties that have changed as a consequence of a commit (understood as a specific software modification), via two sets of assertions, the delta-added assertions, properties that did not hold in the pre-com… ▽ More

    Submitted 29 January, 2023; originally announced January 2023.

  16. arXiv:2301.12284  [pdf, other

    cs.SE

    Assertion Inferring Mutants

    Authors: Aayush Garg, Renzo Degiovanni, Facundo Molina, Mike Papadakis, Nazareno Aguirre, Maxime Cordy, Yves Le Traon

    Abstract: Specification inference techniques aim at (automatically) inferring a set of assertions that capture the exhibited software behaviour by generating and filtering assertions through dynamic test executions and mutation testing. Although powerful, such techniques are computationally expensive due to a large number of assertions, test cases and mutated versions that need to be executed. To overcome t… ▽ More

    Submitted 28 January, 2023; originally announced January 2023.

  17. arXiv:2301.03543  [pdf, other

    cs.SE

    Efficient Mutation Testing via Pre-Trained Language Models

    Authors: Ahmed Khanfir, Renzo Degiovanni, Mike Papadakis, Yves Le Traon

    Abstract: Mutation testing is an established fault-based testing technique. It operates by seeding faults into the programs under test and asking developers to write tests that reveal these faults. These tests have the potential to reveal a large number of faults -- those that couple with the seeded ones -- and thus are deemed important. To this end, mutation testing should seed faults that are both "natura… ▽ More

    Submitted 9 January, 2023; originally announced January 2023.

  18. arXiv:2212.11762  [pdf, other

    cs.SE

    Kee** Mutation Test Suites Consistent and Relevant with Long-Standing Mutants

    Authors: Milos Ojdanic, Mike Papadakis, Mark Harman

    Abstract: Mutation testing has been demonstrated to be one of the most powerful fault-revealing tools in the tester's tool kit. Much previous work implicitly assumed it to be sufficient to re-compute mutant suites per release. Sadly, this makes mutation results inconsistent; mutant scores from each release cannot be directly compared, making it harder to measure test improvement. Furthermore, regular code c… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

  19. arXiv:2212.08130  [pdf, other

    eess.IV cs.CV cs.LG

    On Evaluating Adversarial Robustness of Chest X-ray Classification: Pitfalls and Best Practices

    Authors: Salah Ghamizi, Maxime Cordy, Michail Papadakis, Yves Le Traon

    Abstract: Vulnerability to adversarial attacks is a well-known weakness of Deep Neural Networks. While most of the studies focus on natural images with standardized benchmarks like ImageNet and CIFAR, little research has considered real world applications, in particular in the medical domain. Our research shows that, contrary to previous claims, robustness of chest x-ray classification is much harder to eva… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  20. arXiv:2210.03123  [pdf, other

    cs.LG cs.AI

    On the Effectiveness of Hybrid Pooling in Mixup-Based Graph Learning for Language Processing

    Authors: Zeming Dong, Qiang Hu, Zhenya Zhang, Yuejun Guo, Maxime Cordy, Mike Papadakis, Yves Le Traon, Jianjun Zhao

    Abstract: Graph neural network (GNN)-based graph learning has been popular in natural language and programming language processing, particularly in text and source code classification. Typically, GNNs are constructed by incorporating alternating layers which learn transformations of graph node features, along with graph pooling layers that use graph pooling operators (e.g., Max-pooling) to effectively reduc… ▽ More

    Submitted 21 May, 2024; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: Accepted by Journal of Systems and Software (JSS) 2024

  21. arXiv:2210.03003  [pdf, other

    cs.SE cs.AI

    MIXCODE: Enhancing Code Classification by Mixup-Based Data Augmentation

    Authors: Zeming Dong, Qiang Hu, Yuejun Guo, Maxime Cordy, Mike Papadakis, Zhenya Zhang, Yves Le Traon, Jianjun Zhao

    Abstract: Inspired by the great success of Deep Neural Networks (DNNs) in natural language processing (NLP), DNNs have been increasingly applied in source code analysis and attracted significant attention from the software engineering community. Due to its data-driven nature, a DNN model requires massive and high-quality labeled training data to achieve expert-level performance. Collecting such data is ofte… ▽ More

    Submitted 10 January, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: Accepted by SANER 2023

  22. arXiv:2208.14799  [pdf, other

    cs.SE

    Predicting Flaky Tests Categories using Few-Shot Learning

    Authors: Amal Akli, Guillaume Haben, Sarra Habchi, Mike Papadakis, Yves Le Traon

    Abstract: Flaky tests are tests that yield different outcomes when run on the same version of a program. This non-deterministic behaviour plagues continuous integration with false signals, wasting developers' time and reducing their trust in test suites. Studies highlighted the importance of kee** tests flakiness-free. Recently, the research community has been pushing forward the detection of flaky tests… ▽ More

    Submitted 31 August, 2022; originally announced August 2022.

  23. arXiv:2208.06042  [pdf, other

    cs.SE

    CodeBERT-nt: code naturalness via CodeBERT

    Authors: Ahmed Khanfir, Matthieu Jimenez, Mike Papadakis, Yves Le Traon

    Abstract: Much of software-engineering research relies on the naturalness of code, the fact that code, in small code snippets, is repetitive and can be predicted using statistical language models like n-gram. Although powerful, training such models on large code corpus is tedious, time-consuming and sensitive to code patterns (and practices) encountered during training. Consequently, these models are often… ▽ More

    Submitted 11 August, 2022; originally announced August 2022.

  24. arXiv:2207.13129  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    LGV: Boosting Adversarial Example Transferability from Large Geometric Vicinity

    Authors: Martin Gubri, Maxime Cordy, Mike Papadakis, Yves Le Traon, Koushik Sen

    Abstract: We propose transferability from Large Geometric Vicinity (LGV), a new technique to increase the transferability of black-box adversarial attacks. LGV starts from a pretrained surrogate model and collects multiple weight sets from a few additional training epochs with a constant and high learning rate. LGV exploits two geometric properties that we relate to transferability. First, models that belon… ▽ More

    Submitted 26 July, 2022; originally announced July 2022.

    Comments: Accepted at ECCV 2022

  25. arXiv:2207.11018   

    cs.SE

    Learning from what we know: How to perform vulnerability prediction using noisy historical data

    Authors: Aayush Garg, Renzo Degiovanni, Matthieu Jimenez, Maxime Cordy, Mike Papadakis, Yves LeTraon

    Abstract: Vulnerability prediction refers to the problem of identifying system components that are most likely to be vulnerable. Typically, this problem is tackled by training binary classifiers on historical data. Unfortunately, recent research has shown that such approaches underperform due to the following two reasons: a) the imbalanced nature of the problem, and b) the inherently noisy historical data,… ▽ More

    Submitted 25 July, 2022; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: Please do not consider this new version of the article for citations. The article (with its previous versions) is already available here: arXiv:2012.11701

  26. arXiv:2207.10942  [pdf, other

    cs.SE cs.AI

    Aries: Efficient Testing of Deep Neural Networks via Labeling-Free Accuracy Estimation

    Authors: Qiang Hu, Yuejun Guo, Xiaofei Xie, Maxime Cordy, Lei Ma, Mike Papadakis, Yves Le Traon

    Abstract: Deep learning (DL) plays a more and more important role in our daily life due to its competitive performance in industrial application domains. As the core of DL-enabled systems, deep neural networks (DNNs) need to be carefully evaluated to ensure the produced models match the expected requirements. In practice, the \emph{de facto standard} to assess the quality of DNNs in the industry is to check… ▽ More

    Submitted 3 February, 2023; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: accepted to ICSE'23, preprint version

  27. arXiv:2207.10143  [pdf, other

    cs.SE

    What Made This Test Flake? Pinpointing Classes Responsible for Test Flakiness

    Authors: Sarra Habchi, Guillaume Haben, Jeongju Sohn, Adriano Franci, Mike Papadakis, Maxime Cordy, Yves Le Traon

    Abstract: Flaky tests are defined as tests that manifest non-deterministic behaviour by passing and failing intermittently for the same version of the code. These tests cripple continuous integration with false alerts that waste developers' time and break their trust in regression testing. To mitigate the effects of flakiness, both researchers and industrial experts proposed strategies and tools to detect a… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted at the 38th IEEE International Conference on Software Maintenance and Evolution (ICSME)

  28. arXiv:2206.05480  [pdf, ps, other

    cs.SE cs.AI

    CodeS: Towards Code Model Generalization Under Distribution Shift

    Authors: Qiang Hu, Yuejun Guo, Xiaofei Xie, Maxime Cordy, Lei Ma, Mike Papadakis, Yves Le Traon

    Abstract: Distribution shift has been a longstanding challenge for the reliable deployment of deep learning (DL) models due to unexpected accuracy degradation. Although DL has been becoming a driving force for large-scale source code analysis in the big code era, limited progress has been made on distribution shift analysis and benchmarking for source code tasks. To fill this gap, this paper initiates to pr… ▽ More

    Submitted 4 February, 2023; v1 submitted 11 June, 2022; originally announced June 2022.

    Comments: accepted by ICSE'23-NIER

  29. arXiv:2205.08809  [pdf

    cs.SE

    Software Fairness: An Analysis and Survey

    Authors: Ezekiel Soremekun, Mike Papadakis, Maxime Cordy, Yves Le Traon

    Abstract: In the last decade, researchers have studied fairness as a software property. In particular, how to engineer fair software systems? This includes specifying, designing, and validating fairness properties. However, the landscape of works addressing bias as a software engineering concern is unclear, i.e., techniques and studies that analyze the fairness properties of learning-based software. In this… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

  30. arXiv:2204.04220  [pdf, other

    cs.LG cs.AI cs.SE

    Characterizing and Understanding the Behavior of Quantized Models for Reliable Deployment

    Authors: Qiang Hu, Yuejun Guo, Maxime Cordy, Xiaofei Xie, Wei Ma, Mike Papadakis, Yves Le Traon

    Abstract: Deep Neural Networks (DNNs) have gained considerable attention in the past decades due to their astounding performance in different applications, such as natural language modeling, self-driving assistance, and source code understanding. With rapid exploration, more and more complex DNN architectures have been proposed along with huge pre-trained model parameters. The common way to use such DNN mod… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: 12 pages

  31. arXiv:2204.03994  [pdf, other

    cs.LG cs.AI cs.SE

    LaF: Labeling-Free Model Selection for Automated Deep Neural Network Reusing

    Authors: Qiang Hu, Yuejun Guo, Maxime Cordy, Xiaofei Xie, Mike Papadakis, Yves Le Traon

    Abstract: Applying deep learning to science is a new trend in recent years which leads DL engineering to become an important problem. Although training data preparation, model architecture design, and model training are the normal processes to build DL models, all of them are complex and costly. Therefore, reusing the open-sourced pre-trained model is a practical way to bypass this hurdle for developers. Gi… ▽ More

    Submitted 20 January, 2023; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: 22 pages

  32. arXiv:2203.11343  [pdf, ps, other

    cs.SE

    Using Evolutionary Coupling to Establish Relevance Links Between Tests and Code Units. A case study on fault localization

    Authors: Jeongju Sohn, Mike Papadakis

    Abstract: Many software engineering techniques, such as fault localization, operate based on relevance relationships between tests and code. These relationships are often inferred through the use of dynamic test execution information (test execution traces) that approximate the link between relevant code units and asserted, by the tests, program behaviour. Unfortunately, in practice dynamic information is n… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

  33. arXiv:2203.03289  [pdf, other

    cs.SE

    $μ$BERT: Mutation Testing using Pre-Trained Language Models

    Authors: Renzo Degiovanni, Mike Papadakis

    Abstract: We introduce $μ$BERT, a mutation testing tool that uses a pre-trained language model (CodeBERT) to generate mutants. This is done by masking a token from the expression given as input and using CodeBERT to predict it. Thus, the mutants are generated by replacing the masked tokens with the predicted ones. We evaluate $μ$BERT on 40 real faults from Defects4J and show that it can detect 27 out of the… ▽ More

    Submitted 7 March, 2022; originally announced March 2022.

  34. arXiv:2202.04007  [pdf, other

    cs.CV cs.LG

    Results and findings of the 2021 Image Similarity Challenge

    Authors: Zoë Papakipos, Giorgos Tolias, Tomas Jenicek, Ed Pizzi, Shuhei Yokoo, Wenhao Wang, Yifan Sun, Weipu Zhang, Yi Yang, Sanjay Addicam, Sergio Manuel Papadakis, Cristian Canton Ferrer, Ondrej Chum, Matthijs Douze

    Abstract: The 2021 Image Similarity Challenge introduced a dataset to serve as a new benchmark to evaluate recent image copy detection methods. There were 200 participants to the competition. This paper presents a quantitative and qualitative analysis of the top submissions. It appears that the most difficult image transformations involve either severe image crops or hiding into unrelated images, combined w… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

  35. arXiv:2112.14566  [pdf, other

    cs.SE

    Mutation Testing in Evolving Systems: Studying the relevance of mutants to code evolution

    Authors: Milos Ojdanic, Ezekiel Soremekun, Renzo Degiovanni, Mike Papadakis, Yves Le Traon

    Abstract: When software evolves, opportunities for introducing faults appear. Therefore, it is important to test the evolved program behaviors during each evolution cycle. We conduct an exploratory study to investigate the properties of commit-relevant mutants, i.e., the test elements of commit-aware mutation testing, by offering a general definition and an experimental approach to identify them. We thus, a… ▽ More

    Submitted 29 December, 2021; originally announced December 2021.

  36. arXiv:2112.14508  [pdf, other

    cs.SE

    Syntactic Vs. Semantic similarity of Artificial and Real Faults in Mutation Testing Studies

    Authors: Milos Ojdanic, Aayush Garg, Ahmed Khanfir, Renzo Degiovanni, Mike Papadakis, Yves Le Traon

    Abstract: Fault seeding is typically used in controlled studies to evaluate and compare test techniques. Central to these techniques lies the hypothesis that artificially seeded faults involve some form of realistic properties and thus provide realistic experimental results. In an attempt to strengthen realism, a recent line of research uses advanced machine learning techniques, such as deep learning and Na… ▽ More

    Submitted 29 December, 2021; originally announced December 2021.

  37. Cerebro: Static Subsuming Mutant Selection

    Authors: Aayush Garg, Milos Ojdanic, Renzo Degiovanni, Thierry Titcheu Chekam, Mike Papadakis, Yves Le Traon

    Abstract: Mutation testing research has indicated that a major part of its application cost is due to the large number of low utility mutants that it introduces. Although previous research has identified this issue, no previous study has proposed any effective solution to the problem. Thus, it remains unclear how to mutate and test a given piece of code in a best effort way, i.e., achieving a good trade-off… ▽ More

    Submitted 1 March, 2022; v1 submitted 28 December, 2021; originally announced December 2021.

  38. arXiv:2112.04919  [pdf, ps, other

    cs.SE

    A Qualitative Study on the Sources, Impacts, and Mitigation Strategies of Flaky Tests

    Authors: Sarra Habchi, Guillaume Haben, Mike Papadakis, Maxime Cordy, Yves Le Traon

    Abstract: Test flakiness forms a major testing concern. Flaky tests manifest non-deterministic outcomes that cripple continuous integration and lead developers to investigate false alerts. Industrial reports indicate that on a large scale, the accrual of flaky tests breaks the trust in test suites and entails significant computational cost. To alleviate this, practitioners are constrained to identify flaky… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

  39. arXiv:2112.03415  [pdf, other

    cs.CV cs.AI

    Producing augmentation-invariant embeddings from real-life imagery

    Authors: Sergio Manuel Papadakis, Sanjay Addicam

    Abstract: This article presents an efficient way to produce feature-rich, high-dimensionality embedding spaces from real-life images. The features produced are designed to be independent from augmentations used in real-life cases which appear on social media. Our approach uses convolutional neural networks (CNN) to produce an embedding space. An ArcFace head was used to train the model by employing automati… ▽ More

    Submitted 10 December, 2021; v1 submitted 6 December, 2021; originally announced December 2021.

  40. arXiv:2112.02542  [pdf, other

    cs.LG cs.AI

    Robust Active Learning: Sample-Efficient Training of Robust Deep Learning Models

    Authors: Yuejun Guo, Qiang Hu, Maxime Cordy, Mike Papadakis, Yves Le Traon

    Abstract: Active learning is an established technique to reduce the labeling cost to build high-quality machine learning models. A core component of active learning is the acquisition function that determines which data should be selected to annotate. State-of-the-art acquisition functions -- and more largely, active learning techniques -- have been designed to maximize the clean performance (e.g. accuracy)… ▽ More

    Submitted 5 December, 2021; originally announced December 2021.

    Comments: 10 pages

  41. arXiv:2112.01218  [pdf, other

    cs.SE

    GraphCode2Vec: Generic Code Embedding via Lexical and Program Dependence Analyses

    Authors: Wei Ma, Mengjie Zhao, Ezekiel Soremekun, Qiang Hu, Jie Zhang, Mike Papadakis, Maxime Cordy, Xiaofei Xie, Yves Le Traon

    Abstract: Code embedding is a keystone in the application of machine learning on several Software Engineering (SE) tasks. To effectively support a plethora of SE tasks, the embedding needs to capture program syntax and semantics in a way that is generic. To this end, we propose the first self-supervised pre-training approach (called GraphCode2Vec) which produces task-agnostic embedding of lexical and progra… ▽ More

    Submitted 21 January, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

  42. arXiv:2111.03382  [pdf, other

    cs.SE

    Discerning Legitimate Failures From False Alerts: A Study of Chromium's Continuous Integration

    Authors: Guillaume Haben, Sarra Habchi, Mike Papadakis, Maxime Cordy, Yves Le Traon

    Abstract: Flakiness is a major concern in Software testing. Flaky tests pass and fail for the same version of a program and mislead developers who spend time and resources investigating test failures only to discover that they are false alerts. In practice, the defacto approach to address this concern is to rerun failing tests ho** that they would pass and manifest as false alerts. Nonetheless, completely… ▽ More

    Submitted 5 November, 2021; originally announced November 2021.

  43. arXiv:2111.02317  [pdf, other

    cs.SE

    Smells in System User Interactive Tests

    Authors: Renaud Rwemalika, Sarra Habchi, Mike Papadakis, Yves Le Traon, Marie-Claude Brasseur

    Abstract: Test smells are known as bad development practices that reflect poor design and implementation choices in software tests. Over the last decade, test smells were heavily studied to measure their prevalence and impacts on test maintainability. However, these studies focused mainly on the unit level and to this day, the work on system tests that interact with the System Under Test through a Graphical… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

  44. arXiv:2110.15053  [pdf, other

    cs.LG cs.AI cs.CV

    Adversarial Robustness in Multi-Task Learning: Promises and Illusions

    Authors: Salah Ghamizi, Maxime Cordy, Mike Papadakis, Yves Le Traon

    Abstract: Vulnerability to adversarial attacks is a well-known weakness of Deep Neural networks. While most of the studies focus on single-task neural networks with computer vision datasets, very little research has considered complex multi-task models that are common in real applications. In this paper, we evaluate the design choices that impact the robustness of multi-task deep learning networks. We provi… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

  45. arXiv:2109.12838  [pdf, other

    cs.LG cs.CR cs.CV

    MUTEN: Boosting Gradient-Based Adversarial Attacks via Mutant-Based Ensembles

    Authors: Yuejun Guo, Qiang Hu, Maxime Cordy, Michail Papadakis, Yves Le Traon

    Abstract: Deep Neural Networks (DNNs) are vulnerable to adversarial examples, which causes serious threats to security-critical applications. This motivated much research on providing mechanisms to make models more robust against adversarial attacks. Unfortunately, most of these defenses, such as gradient masking, are easily overcome through different attack means. In this paper, we propose MUTEN, a low-cos… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

  46. Automated Repair of Unrealisable LTL Specifications Guided by Model Counting

    Authors: Matías Brizzio, Maxime Cordy, Mike Papadakis, César Sánchez, Nazareno Aguirre, Renzo Degiovanni

    Abstract: The reactive synthesis problem consists of automatically producing correct-by-construction operational models of systems from high-level formal specifications of their behaviours. However, specifications are often unrealisable, meaning that no system can be synthesised from the specification. To deal with this problem, we present AuRUS, a search-based approach to repair unrealisable Linear-Time Te… ▽ More

    Submitted 14 April, 2023; v1 submitted 26 May, 2021; originally announced May 2021.

  47. arXiv:2104.07441  [pdf, other

    cs.SE

    On the Use of Mutation in Injecting Test Order-Dependency

    Authors: Sarra Habchi, Maxime Cordy, Mike Papadakis, Yves Le Traon

    Abstract: Background: Test flakiness is identified as a major issue that compromises the regression testing process of complex software systems. Flaky tests manifest non-deterministic behaviour, send confusing signals to developers, and break their trust in test suites. Both industrial reports and research studies highlighted the negative impact of flakiness on software quality and developers' productivity.… ▽ More

    Submitted 15 April, 2021; originally announced April 2021.

  48. arXiv:2012.11701  [pdf, other

    cs.CR cs.SE

    Learning from What We Know: How to Perform Vulnerability Prediction using Noisy Historical Data

    Authors: Aayush Garg, Renzo Degiovanni, Matthieu Jimenez, Maxime Cordy, Mike Papadakis, Yves Le Traon

    Abstract: Vulnerability prediction refers to the problem of identifying system components that are most likely to be vulnerable. Typically, this problem is tackled by training binary classifiers on historical data. Unfortunately, recent research has shown that such approaches underperform due to the following two reasons: a) the imbalanced nature of the problem, and b) the inherently noisy historical data,… ▽ More

    Submitted 25 July, 2022; v1 submitted 21 December, 2020; originally announced December 2020.

    Comments: The article was accepted in Empirical Software Engineering (EMSE) on July 02, 2022

  49. Influence-Driven Data Poisoning in Graph-Based Semi-Supervised Classifiers

    Authors: Adriano Franci, Maxime Cordy, Martin Gubri, Mike Papadakis, Yves Le Traon

    Abstract: Graph-based Semi-Supervised Learning (GSSL) is a practical solution to learn from a limited amount of labelled data together with a vast amount of unlabelled data. However, due to their reliance on the known labels to infer the unknown labels, these algorithms are sensitive to data quality. It is therefore essential to study the potential threats related to the labelled data, more specifically, la… ▽ More

    Submitted 11 May, 2022; v1 submitted 14 December, 2020; originally announced December 2020.

  50. IBIR: Bug Report driven Fault Injection

    Authors: Ahmed Khanfir, Anil Koyuncu, Mike Papadakis, Maxime Cordy, Tegawendé F. Bissyandé, Jacques Klein, Yves Le Traon

    Abstract: Much research on software engineering and software testing relies on experimental studies based on fault injection. Fault injection, however, is not often relevant to emulate real-world software faults since it "blindly" injects large numbers of faults. It remains indeed challenging to inject few but realistic faults that target a particular functionality in a program. In this work, we introduce I… ▽ More

    Submitted 11 December, 2020; originally announced December 2020.