Showing 1–13 of 13 results for author: Kamei, Y

  1. arXiv:2405.09165  [pdf, other]

    cs.SE

    An Empirical Study of Token-based Micro Commits

    Authors: Masanari Kondo, Daniel M. German, Yasutaka Kamei, Naoyasu Ubayashi, Osamu Mizuno

    Abstract: In software development, developers frequently apply maintenance activities to the source code that change a few lines in a single commit. A good understanding of the characteristics of such small changes can support quality assurance approaches (e.g., automated program repair), as it is likely that small changes are addressing deficiencies in other changes; thus, understanding the reasons for cre…

    Submitted 15 May, 2024; originally announced May 2024.

  2. arXiv:2404.09223  [pdf, other]

    cs.SE

    OSS Myths and Facts

    Authors: Yukako Iimura, Masanari Kondo, Kazushi Tomoto, Yasutaka Kamei, Naoyasu Ubayashi, Shinobu Saito

    Abstract: We have selected six myths about the OSS community and have tested whether they are true or not. The purpose of this report is to identify the lessons that can be learned from the development style of the OSS community and the issues that need to be addressed in order to achieve better Employee Experience (EX) in software development within companies and organizations. The OSS community has been l…

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: English Version: 28 pages + Japanese Version: 23 pages

  3. arXiv:2402.01438  [pdf, ps, other]

    cs.SE

    Exploring the Effect of Multiple Natural Languages on Code Suggestion Using GitHub Copilot

    Authors: Kei Koyanagi, Dong Wang, Kotaro Noguchi, Masanari Kondo, Alexander Serebrenik, Yasutaka Kamei, Naoyasu Ubayashi

    Abstract: GitHub Copilot is an AI-enabled tool that automates program synthesis. It has gained significant attention since its launch in 2021. Recent studies have extensively examined Copilot's capabilities in various programming tasks, as well as its security issues. However, little is known about the effect of different natural languages on code suggestion. Natural language is considered a social bias in…

    Submitted 2 February, 2024; originally announced February 2024.

  4. arXiv:2308.10078  [pdf, other]

    cs.SE

    Repeated Builds During Code Review: An Empirical Study of the OpenStack Community

    Authors: Rungroj Maipradit, Dong Wang, Patanamon Thongtanunam, Raula Gaikovina Kula, Yasutaka Kamei, Shane McIntosh

    Abstract: Code review is a popular practice where developers critique each other's changes. Since automated builds can identify low-level issues (e.g., syntactic errors, regression bugs), it is not uncommon for software organizations to incorporate automated builds in the code review process. In such code review deployment scenarios, submitted change sets must be approved for integration by both peer code r…

    Submitted 19 August, 2023; originally announced August 2023.

    Comments: conference

  5. arXiv:2307.07117  [pdf, other]

    cs.SE

    When Conversations Turn Into Work: A Taxonomy of Converted Discussions and Issues in GitHub

    Authors: Dong Wang, Masanari Kondo, Yasutaka Kamei, Raula Gaikovina Kula, Naoyasu Ubayashi

    Abstract: Popular and large contemporary open-source projects now embrace a diverse set of documentation for communication channels. Examples include contribution guidelines (i.e., commit message guidelines, coding rules, submission guidelines), code of conduct (i.e., rules and behavior expectations), governance policies, and Q&A forums. In 2020, GitHub released Discussion to distinguish between communicatio…

    Submitted 13 July, 2023; originally announced July 2023.

  6. arXiv:2307.07111  [pdf, other]

    cs.SE

    More Than React: Investigating The Role of Emoji Reaction in GitHub Pull Requests

    Authors: Dong Wang, Tao Xiao, Teyon Son, Raula Gaikovina Kula, Takashi Ishio, Yasutaka Kamei, Kenichi Matsumoto

    Abstract: Open source software development has become more social and collaborative, as is evident on GitHub. Since 2016, GitHub has supported more informal methods such as emoji reactions, with the goal of reducing commenting noise when reviewing any code changes to a repository. From a code review context, the extent to which emoji reactions facilitate a more efficient review process is unknown. We conduct an e…

    Submitted 13 July, 2023; originally announced July 2023.

  7. arXiv:2303.15684  [pdf, other]

    cs.SE

    Understanding the Role of Images on Stack Overflow

    Authors: Dong Wang, Tao Xiao, Christoph Treude, Raula Gaikovina Kula, Hideaki Hata, Yasutaka Kamei

    Abstract: Images are increasingly being shared by software developers in diverse channels including question-and-answer forums like Stack Overflow. Although prior work has pointed out that these images are meaningful and provide complementary information compared to their associated text, how images are used to support questions is empirically unknown. To address this knowledge gap, in this paper we specifi…

    Submitted 27 March, 2023; originally announced March 2023.

  8. The Impact of Using Regression Models to Build Defect Classifiers

    Authors: Gopi Krishnan Rajbahadur, Shaowei Wang, Yasutaka Kamei, Ahmed E. Hassan

    Abstract: It is common practice to discretize continuous defect counts into defective and non-defective classes and use them as a target variable when building defect classifiers (discretized classifiers). However, this discretization of continuous defect counts leads to information loss that might affect the performance and interpretation of defect classifiers. Another possible approach to build defect cla…

    Submitted 12 February, 2022; originally announced February 2022.

    Journal ref: IEEE/ACM 14th International Conference on Mining Software Repositories (MSR), 2017, pp. 135-145

  9. Impact of Discretization Noise of the Dependent variable on Machine Learning Classifiers in Software Engineering

    Authors: Gopi Krishnan Rajbahadur, Shaowei Wang, Yasutaka Kamei, Ahmed E. Hassan

    Abstract: Researchers usually discretize a continuous dependent variable into two target classes by introducing an artificial discretization threshold (e.g., median). However, such discretization may introduce noise (i.e., discretization noise) due to ambiguous class loyalty of data points that are close to the artificial threshold. Previous studies do not provide a clear directive on the impact of discreti…

    Submitted 12 February, 2022; originally announced February 2022.

    Journal ref: IEEE Transactions on Software Engineering, Vol 47, Issue 7 (2021), 1414-1430

  10. arXiv:2202.02389  [pdf, other]

    cs.LG cs.AI cs.SE

    The impact of feature importance methods on the interpretation of defect classifiers

    Authors: Gopi Krishnan Rajbahadur, Shaowei Wang, Yasutaka Kamei, Ahmed E. Hassan

    Abstract: Classifier-specific (CS) and classifier-agnostic (CA) feature importance methods are widely used (often interchangeably) by prior studies to derive feature importance ranks from a defect classifier. However, different feature importance methods are likely to compute different feature importance ranks even for the same dataset and classifier. Hence, such interchangeable use of feature importance met…

    Submitted 4 February, 2022; originally announced February 2022.

  11. arXiv:2012.08053  [pdf, other]

    cs.SE

    A Quantitative Study of Security Bug Fixes of GitHub Repositories

    Authors: Daito Nakano, Mingyang Yin, Ryosuke Sato, Abram Hindle, Yasutaka Kamei, Naoyasu Ubayashi

    Abstract: Software is prone to bugs and failures. Security bugs are those that expose or share privileged information and access in violation of the software's requirements. Given the seriousness of security bugs, there are centralized mechanisms for supporting and tracking these bugs across multiple products; one such mechanism is the Common Vulnerabilities and Exposures (CVE) ID description. When a bug ge…

    Submitted 14 December, 2020; originally announced December 2020.

  12. arXiv:1812.10578  [pdf, other]

    cs.SE cs.AI

    Towards effective AI-powered agile project management

    Authors: Hoa Khanh Dam, Truyen Tran, John Grundy, Aditya Ghose, Yasutaka Kamei

    Abstract: The rise of artificial intelligence (AI) has the potential to significantly transform the practice of project management. Project management has a large socio-technical element with many uncertainties arising from variability in human aspects, e.g., customers' needs, developers' performance and team dynamics. AI can assist project managers and team members by automating repetitive, high-volume task…

    Submitted 26 December, 2018; originally announced December 2018.

    Comments: In Proceedings of International Conference on Software Engineering (ICSE 2019), (To appear), NIER track, May 2019 (Montreal, Canada)

  13. arXiv:1810.09723  [pdf, ps, other]

    cs.SE

    Bridging Semantic Gaps between Natural Languages and APIs with Word Embedding

    Authors: Xiaochen Li, He Jiang, Yasutaka Kamei, Xin Chen

    Abstract: Developers increasingly rely on text matching tools to analyze the relation between natural language words and APIs. However, semantic gaps, namely textual mismatches between words and APIs, negatively affect these tools. Previous studies have transformed words or APIs into low-dimensional vectors for matching; however, inaccurate results were obtained due to the failure of modeling words and APIs…

    Submitted 23 October, 2018; originally announced October 2018.

    Comments: accepted by IEEE Transactions on Software Engineering