Can Transformer Models Effectively Detect Software Aspects in StackOverflow Discussion?

Mandal, Nibir Chandra; Muhammad, Tashreef; Shahariar, G. M.

doi:10.1007/978-3-031-34622-4_18

Computer Science > Software Engineering

arXiv:2209.12065 (cs)

[Submitted on 24 Sep 2022]

Title:Can Transformer Models Effectively Detect Software Aspects in StackOverflow Discussion?

Authors:Nibir Chandra Mandal, Tashreef Muhammad, G. M. Shahariar

View PDF

Abstract:Dozens of new tools and technologies are being incorporated to help developers, which is becoming a source of consternation as they struggle to choose one over the others. For example, there are at least ten frameworks available to developers for develo** web applications, posing a conundrum in selecting the best one that meets their needs. As a result, developers are continuously searching for all of the benefits and drawbacks of each API, framework, tool, and so on. One of the typical approaches is to examine all of the features through official documentation and discussion. This approach is time-consuming, often makes it difficult to determine which aspects are the most important to a particular developer and whether a particular aspect is important to the community at large. In this paper, we have used a benchmark API aspects dataset (Opiner) collected from StackOverflow posts and observed how Transformer models (BERT, RoBERTa, DistilBERT, and XLNet) perform in detecting software aspects in textual developer discussion with respect to the baseline Support Vector Machine (SVM) model. Through extensive experimentation, we have found that transformer models improve the performance of baseline SVM for most of the aspects, i.e., `Performance', `Security', `Usability', `Documentation', `Bug', `Legal', `OnlySentiment', and `Others'. However, the models fail to apprehend some of the aspects (e.g., `Community' and `Potability') and their performance varies depending on the aspects. Also, larger architectures like XLNet are ineffective in interpreting software aspects compared to smaller architectures like DistilBERT.

Comments:	15 pages, 2 figures, submitted to International Conference on Machine Intelligence and Emerging Technologies (MIET 2022)
Subjects:	Software Engineering (cs.SE); Computation and Language (cs.CL)
Cite as:	arXiv:2209.12065 [cs.SE]
	(or arXiv:2209.12065v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2209.12065
Journal reference:	Preprint of an article published in Machine Intelligence and Emerging Technologies. MIET 2022. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 491. Springer, Cham
Related DOI:	https://doi.org/10.1007/978-3-031-34622-4_18

Submission history

From: Tashreef Muhammad [view email]
[v1] Sat, 24 Sep 2022 18:28:14 UTC (3,558 KB)

Computer Science > Software Engineering

Title:Can Transformer Models Effectively Detect Software Aspects in StackOverflow Discussion?

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:Can Transformer Models Effectively Detect Software Aspects in StackOverflow Discussion?

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators