Search | arXiv e-print repository

doi 10.7717/peerj-cs.187

Too Trivial To Test? An Inverse View on Defect Prediction to Identify Methods with Low Fault Risk

Authors: Rainer Niedermayr, Tobias Röhm, Stefan Wagner

Abstract: Background. Test resources are usually limited and therefore it is often not possible to completely test an application before a release. To cope with the problem of scarce resources, development teams can apply defect prediction to identify fault-prone code regions. However, defect prediction tends to low precision in cross-project prediction scenarios. Aims. We take an inverse view on defect p… ▽ More Background. Test resources are usually limited and therefore it is often not possible to completely test an application before a release. To cope with the problem of scarce resources, development teams can apply defect prediction to identify fault-prone code regions. However, defect prediction tends to low precision in cross-project prediction scenarios. Aims. We take an inverse view on defect prediction and aim to identify methods that can be deferred when testing because they contain hardly any faults due to their code being "trivial". We expect that characteristics of such methods might be project-independent, so that our approach could improve cross-project predictions. Method. We compute code metrics and apply association rule mining to create rules for identifying methods with low fault risk. We conduct an empirical study to assess our approach with six Java open-source projects containing precise fault data at the method level. Results. Our results show that inverse defect prediction can identify approx. 32-44% of the methods of a project to have a low fault risk; on average, they are about six times less likely to contain a fault than other methods. In cross-project predictions with larger, more diversified training sets, identified methods are even eleven times less likely to contain a fault. Conclusions. Inverse defect prediction supports the efficient allocation of test resources by identifying methods that can be treated with less priority in testing activities and is well applicable in cross-project prediction scenarios. △ Less

Submitted 2 November, 2018; originally announced November 2018.

Comments: Submitted to PeerJ CS

ACM Class: D.2.5; D.2.8

Journal ref: PeerJ Computer Science 5:e187, 2019

arXiv:1806.04556 [pdf, ps, other]

doi 10.1007/978-3-030-05767-1_10

Evaluating Maintainability Prejudices with a Large-Scale Study of Open-Source Projects

Authors: Tobias Roehm, Daniel Veihelmann, Stefan Wagner, Elmar Juergens

Abstract: Exaggeration or context changes can render maintainability experience into prejudice. For example, JavaScript is often seen as least elegant language and hence of lowest maintainability. Such prejudice should not guide decisions without prior empirical validation. We formulated 10 hypotheses about maintainability based on prejudices and test them in a large set of open-source projects (6,897 GitHu… ▽ More Exaggeration or context changes can render maintainability experience into prejudice. For example, JavaScript is often seen as least elegant language and hence of lowest maintainability. Such prejudice should not guide decisions without prior empirical validation. We formulated 10 hypotheses about maintainability based on prejudices and test them in a large set of open-source projects (6,897 GitHub repositories, 402 million lines, 5 programming languages). We operationalize maintainability with five static analysis metrics. We found that JavaScript code is not worse than other code, Java code shows higher maintainability than C# code and C code has longer methods than other code. The quality of interface documentation is better in Java code than in other code. Code developed by teams is not of higher and large code bases not of lower maintainability. Projects with high maintainability are not more popular or more often forked. Overall, most hypotheses are not supported by open-source data. △ Less

Submitted 12 June, 2018; originally announced June 2018.

Comments: 20 pages

Journal ref: In D. Winkler, S. Biffl, and J. Bergsmann, editors, Software Quality: The Complexity and Challenges of Software Engineering and Software Quality in the Cloud, Springer International Publishing, 2019

arXiv:1805.01132 [pdf, other]

doi 10.1145/3183440.3195022

Poster: Identification of Methods with Low Fault Risk

Authors: Rainer Niedermayr, Tobias Röhm, Stefan Wagner

Abstract: Test resources are usually limited and therefore it is often not possible to completely test an application before a release. Therefore, testers need to focus their activities on the relevant code regions. In this paper, we introduce an inverse defect prediction approach to identify methods that contain hardly any faults. We applied our approach to six Java open-source projects and show that on av… ▽ More Test resources are usually limited and therefore it is often not possible to completely test an application before a release. Therefore, testers need to focus their activities on the relevant code regions. In this paper, we introduce an inverse defect prediction approach to identify methods that contain hardly any faults. We applied our approach to six Java open-source projects and show that on average 31.6% of the methods of a project have a low fault risk; they contain in total, on average, only 5.8% of all faults. Furthermore, the results suggest that, unlike defect prediction, our approach can also be applied in cross-project prediction scenarios. Therefore, inverse defect prediction can help prioritize untested code areas and guide testers to increase the fault detection probability. △ Less

Submitted 3 May, 2018; originally announced May 2018.

Comments: ICSE 2018 Poster Track

ACM Class: D.2.5; D.2.8

Journal ref: 2018 IEEE/ACM International Conference on Software Engineering Companion (ICSE Companion)

Showing 1–3 of 3 results for author: Roehm, T