Showing 1–47 of 47 results for author: Roy, C K

  1. arXiv:2402.04575  [pdf, other]

    cs.SE

    Can We Identify Stack Overflow Questions Requiring Code Snippets? Investigating the Cause & Effect of Missing Code Snippets

    Authors: Saikat Mondal, Mohammad Masudur Rahman, Chanchal K. Roy

    Abstract: On the Stack Overflow (SO) Q&A site, users often request solutions to their code-related problems (e.g., errors, unexpected behavior). Unfortunately, they often miss required code snippets during their question submission, which could prevent their questions from getting prompt and appropriate answers. In this study, we conduct an empirical study investigating the cause & effect of missing code sn…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: This paper has been accepted for inclusion in the International Conference on Software Analysis, Evolution, and Reengineering (SANER 2024) technical program

  2. arXiv:2402.04568  [pdf, other]

    cs.SE

    Enhancing User Interaction in ChatGPT: Characterizing and Consolidating Multiple Prompts for Issue Resolution

    Authors: Saikat Mondal, Suborno Deb Bappon, Chanchal K. Roy

    Abstract: Prompt design plays a crucial role in shaping the efficacy of ChatGPT, influencing the model's ability to extract contextually accurate responses. Thus, optimal prompt construction is essential for maximizing the utility and performance of ChatGPT. However, sub-optimal prompt design may necessitate iterative refinement, as imprecise or ambiguous instructions can lead to undesired responses from Ch…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: This paper has been accepted at the 21st International Conference on Mining Software Repositories (MSR 2024)

  3. arXiv:2402.03735  [pdf, other]

    cs.SE

    Investigating the Utility of ChatGPT in the Issue Tracking System: An Exploratory Study

    Authors: Joy Krishan Das, Saikat Mondal, Chanchal K. Roy

    Abstract: Issue tracking systems serve as the primary tool for incorporating external users and customizing a software project to meet the users' requirements. However, the limited number of contributors and the challenge of identifying the best approach for each issue often impede effective resolution. Recently, an increasing number of developers are turning to AI tools like ChatGPT to enhance problem-solv…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Accepted in MSR 2024

  4. arXiv:2312.03182  [pdf, other]

    cs.SE

    Investigating Technology Usage Span by Analyzing Users' Q&A Traces in Stack Overflow

    Authors: Saikat Mondal, Debajyoti Mondal, Chanchal K. Roy

    Abstract: Choosing an appropriate software development technology (e.g., programming language) is challenging due to the proliferation of diverse options. The selection of inappropriate technologies for development may have a far-reaching effect on software developers' career growth. Switching to a different technology after working with one may lead to a complex learning curve and, thus, be more challengin…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: Accepted in the 30th Asia-Pacific Software Engineering Conference (APSEC 2023)

  5. arXiv:2309.06424  [pdf]

    cs.SE cs.AI cs.LG

    Unveiling the potential of large language models in generating semantic and cross-language clones

    Authors: Palash R. Roy, Ajmain I. Alam, Farouq Al-omari, Banani Roy, Chanchal K. Roy, Kevin A. Schneider

    Abstract: Semantic and Cross-language code clone generation may be useful for code reuse, code comprehension, refactoring and benchmarking. OpenAI's GPT model has potential in such clone generation as GPT is used for text generation. When developers copy/paste codes from Stack Overflow (SO) or within a system, there might be inconsistent changes leading to unexpected behaviours. Similarly, if someone posses…

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted in IWSC

  6. arXiv:2308.13963  [pdf]

    cs.SE

    GPTCloneBench: A comprehensive benchmark of semantic clones and cross-language clones using GPT-3 model and SemanticCloneBench

    Authors: Ajmain Inqiad Alam, Palash Ranjan Roy, Farouq Al-omari, Chanchal Kumar Roy, Banani Roy, Kevin Schneider

    Abstract: With the emergence of Machine Learning, there has been a surge in leveraging its capabilities for problem-solving across various domains. In the code clone realm, the identification of type-4 or semantic clones has emerged as a crucial yet challenging task. Researchers aim to utilize Machine Learning to tackle this challenge, often relying on the BigCloneBench dataset. However, it's worth noting t…

    Submitted 1 September, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: Accepted in 39th IEEE International Conference on Software Maintenance and Evolution (ICSME 2023)

  7. arXiv:2304.03563  [pdf, other]

    cs.SE

    Do Subjectivity and Objectivity Always Agree? A Case Study with Stack Overflow Questions

    Authors: Saikat Mondal, Mohammad Masudur Rahman, Chanchal K. Roy

    Abstract: In Stack Overflow (SO), the quality of posts (i.e., questions and answers) is subjectively evaluated by users through a voting mechanism. The net votes (upvotes - downvotes) obtained by a post are often considered an approximation of its quality. However, about half of the questions that received working solutions got more downvotes than upvotes. Furthermore, about 18% of the accepted answers (i.e…

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: Accepted in the International Conference on Mining Software Repositories (MSR 2023)

  8. arXiv:2303.01435  [pdf, other]

    cs.SE

    Pathways to Leverage Transcompiler based Data Augmentation for Cross-Language Clone Detection

    Authors: Subroto Nag Pinku, Debajyoti Mondal, Chanchal K. Roy

    Abstract: Software clones are often introduced when developers reuse code fragments to implement similar functionalities in the same or different software systems. Many high-performing clone detection tools today are based on deep learning techniques and are mostly used for detecting clones written in the same programming language, whereas clone detection tools for detecting cross-language clones are also e…

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: Accepted at the 31st IEEE/ACM International Conference on Program Comprehension (ICPC 2023)

    ACM Class: D.2; D.2.13

  9. arXiv:2204.11449  [pdf, other]

    cs.CV

    OCFormer: One-Class Transformer Network for Image Classification

    Authors: Prerana Mukherjee, Chandan Kumar Roy, Swalpa Kumar Roy

    Abstract: We propose a novel deep learning framework based on Vision Transformers (ViT) for one-class classification. The core idea is to use zero-centered Gaussian noise as a pseudo-negative class for latent space representation and then train the network using the optimal loss function. In prior works, there have been tremendous efforts to learn a good representation using varieties of loss functions, whi…

    Submitted 25 April, 2022; originally announced April 2022.

  10. Backports: Change Types, Challenges and Strategies

    Authors: Debasish Chakroborti, Kevin A. Schneider, Chanchal K. Roy

    Abstract: Source code repositories allow developers to manage multiple versions (or branches) of a software system. Pull-requests are used to modify a branch, and backporting is a regular activity used to port changes from a current development branch to other versions. In open-source software, backports are common and often need to be adapted by hand, which motivates us to explore backports and backporting…

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: In 30th International Conference on Program Comprehension (ICPC 22), May 16 to 17, 2022, Virtual Event, Pittsburgh

  11. arXiv:2201.10137  [pdf, other]

    cs.SE cs.LG

    Leveraging Structural Properties of Source Code Graphs for Just-In-Time Bug Prediction

    Authors: Md Nadim, Debajyoti Mondal, Chanchal K. Roy

    Abstract: The most common use of data visualization is to minimize the complexity for proper understanding. A graph is one of the most commonly used representations for understanding relational data. It produces a simplified representation of data that is challenging to comprehend if kept in a textual format. In this study, we propose a methodology to utilize the relational properties of source code in the…

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: Has been accepted for publication Automated Software Engineering (AUSE), an International Journal published by Springer

  12. Evaluating the Performance of Clone Detection Tools in Detecting Cloned Co-change Candidates

    Authors: Md Nadim, Manishankar Mondal, Chanchal K. Roy, Kevin Schneider

    Abstract: Co-change candidates are the group of code fragments that require a change if any of these fragments experience a modification in a commit operation during software evolution. The cloned co-change candidates are a subset of the co-change candidates, and the members in this subset are clones of one another. The cloned co-change candidates are usually created by reusing existing code fragments in a…

    Submitted 19 January, 2022; originally announced January 2022.

    Comments: Has been accepted for publication in The Journal of Systems & Software (JSS)

  13. arXiv:2111.12204  [pdf, other]

    cs.SE

    The Reproducibility of Programming-Related Issues in Stack Overflow Questions

    Authors: Saikat Mondal, Mohammad Masudur Rahman, Chanchal K. Roy, Kevin Schneider

    Abstract: Software developers often look for solutions to their code-level problems using the Stack Overflow Q&A website. To receive help, developers frequently submit questions containing sample code segments and the description of the programming issue. Unfortunately, it is not always possible to reproduce the issues from the code segments that may impede questions from receiving prompt and appropriate so…

    Submitted 25 December, 2021; v1 submitted 23 November, 2021; originally announced November 2021.

    Comments: This study has been accepted for publication in the Empirical Software Engineering (EMSE) journal

  14. arXiv:2111.03196  [pdf, other]

    cs.SE cs.LG

    An Empirical Study of the Effectiveness of an Ensemble of Stand-alone Sentiment Detection Tools for Software Engineering Datasets

    Authors: Gias Uddin, Yann-Gael Gueheneuc, Foutse Khomh, Chanchal K Roy

    Abstract: Sentiment analysis in software engineering (SE) has shown promise to analyze and support diverse development activities. We report the results of an empirical study that we conducted to determine the feasibility of developing an ensemble engine by combining the polarity labels of stand-alone SE-specific sentiment detectors. Our study has two phases. In the first phase, we pick five SE-specific sen…

    Submitted 4 November, 2021; originally announced November 2021.

    Journal ref: ACM Transactions on Software Engineering and Methodology (TOSEM), 2021

  15. arXiv:2109.00659  [pdf, other]

    cs.SE

    Semantic Slicing of Architectural Change Commits: Towards Semantic Design Review

    Authors: Amit Kumar Mondal, Chanchal K. Roy, Kevin A. Schneider, Banani Roy, Sristy Sumana Nath

    Abstract: Software architectural changes involve more than one module or component and are complex to analyze compared to local code changes. Development teams aiming to review architectural aspects (design) of a change commit consider many essential scenarios such as access rules and restrictions on usage of program entities across modules. Moreover, design review is essential when proper architectural for…

    Submitted 1 September, 2021; originally announced September 2021.

  16. arXiv:2108.09646  [pdf, other]

    cs.SE cs.IR cs.LG cs.NE

    A Systematic Review of Automated Query Reformulations in Source Code Search

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy

    Abstract: Fixing software bugs and adding new features are two of the major maintenance tasks. Software bugs and features are reported as change requests. Developers consult these requests and often choose a few keywords from them as an ad hoc query. Then they execute the query with a search engine to find the exact locations within software code that need to be changed. Unfortunately, even experienced deve…

    Submitted 8 June, 2023; v1 submitted 22 August, 2021; originally announced August 2021.

    Comments: 81 pages, accepted at TOSEM

    ACM Class: D.2.5; D.2.1; D.2.7; D.2.13

  17. arXiv:2108.05341  [pdf, other]

    cs.SE cs.IR cs.LG

    The Forgotten Role of Search Queries in IR-based Bug Localization: An Empirical Study

    Authors: Mohammad Masudur Rahman, Foutse Khomh, Shamima Yeasmin, Chanchal K. Roy

    Abstract: Being light-weight and cost-effective, IR-based approaches for bug localization have shown promise in finding software bugs. However, the accuracy of these approaches heavily depends on their used bug reports. A significant number of bug reports contain only plain natural language texts. According to existing studies, IR-based approaches cannot perform well when they use these bug reports as searc…

    Submitted 11 August, 2021; originally announced August 2021.

    Comments: 57 pages, EMSE (2021)

    ACM Class: D.2; D.2.5; D.2.7

  18. arXiv:2102.08874  [pdf, other]

    cs.SE

    Mining API Usage Scenarios from Stack Overflow

    Authors: Gias Uddin, Foutse Khomh, Chanchal K Roy

    Abstract: We propose a framework to mine API usage scenarios from Stack Overflow. Each task consists of a code example, the task description, and the reactions of developers towards the code example. First, we present an algorithm to automatically link a code example in a forum post to an API mentioned in the textual contents of the forum post. Second, we generate a natural language description of the task…

    Submitted 17 February, 2021; originally announced February 2021.

    Journal ref: 2020 Information and Software Technology (IST)

  19. arXiv:2102.08502  [pdf, other]

    cs.SE

    Automatic API Usage Scenario Documentation from Technical Q&A Sites

    Authors: Gias Uddin, Foutse Khomh, Chanchal K Roy

    Abstract: The online technical Q&A site Stack Overflow (SO) is popular among developers to support their coding and diverse development needs. To address shortcomings in API official documentation resources, several research has thus focused on augmenting official API documentation with insights (e.g., code examples) from SO. The techniques propose to add code examples/insights about APIs into its official…

    Submitted 16 February, 2021; originally announced February 2021.

    Journal ref: 2021 ACM Transactions on Software Engineering and Methodology (TOSEM)

  20. arXiv:2006.15682  [pdf, other]

    cs.SE

    A Survey on the Evaluation of Clone Detection Performance and Benchmarking

    Authors: Jeffrey Svajlenko, Chanchal K. Roy

    Abstract: There are a great many clone detection tools proposed in the literature. In this paper, we investigate the state of clone detection tool evaluation. We begin by surveying the clone detection benchmarks, and performing a multi-faceted evaluation and comparison of their features and capabilities. We then survey the existing clone detection tool and technique publications, and evaluate how the author…

    Submitted 28 June, 2020; originally announced June 2020.

    Comments: 109 pages, review article, several figures and tables, and 277 references. It covers the whole area of clone detection and evaluation literature

  21. The Vision of Software Clone Management: Past, Present, and Future

    Authors: Chanchal K. Roy, Minhaz F. Zibran, Rainer Koschke

    Abstract: Duplicated code or code clones are a kind of code smell that have both positive and negative impacts on the development and maintenance of software systems. Software clone research in the past mostly focused on the detection and analysis of code clones, while research in recent years extends to the whole spectrum of clone management. In the last decade, three surveys appeared in the literature, wh…

    Submitted 3 May, 2020; originally announced May 2020.

    Comments: 16 pages

    Journal ref: 2014 Software Evolution Week - IEEE Conference on Software Maintenance, Reengineering, and Reverse Engineering (CSMR-WCRE), Antwerp, 2014, pp. 18-33

  22. An Exploratory Study to Find Motives Behind Cross-platform Forks from Software Heritage Dataset

    Authors: Avijit Bhattacharjee, Sristy Sumana Nath, Shurui Zhou, Debasish Chakroborti, Banani Roy, Chanchal K. Roy, Kevin Schneider

    Abstract: The fork-based development mechanism provides the flexibility and the unified processes for software teams to collaborate easily in a distributed setting without too much coordination overhead. Currently, multiple social coding platforms support fork-based development, such as GitHub, GitLab, and Bitbucket. Although these different platforms virtually share the same features, they have different em…

    Submitted 17 March, 2020; originally announced March 2020.

    Comments: Accepted at 17th International Conference on Mining Software Repositories, October 5--6, 2020, Seoul, Republic of Korea

  23. arXiv:1910.11125  [pdf, other]

    cs.DC cs.SE

    Micro-level Modularity of Computaion-intensive Programs in Big Data Platforms: A Case Study with Image Data

    Authors: Amit Kumar Mondal, Banani Roy, Chanchal K. Roy, Kevin A. Schneider

    Abstract: With the rapid advancement of Big Data platforms such as Hadoop, Spark, and Dataflow, many tools are being developed that are intended to provide end users with an interactive environment for large-scale data analysis (e.g., IQmulus). However, there are challenges using these platforms. For example, developers find it difficult to use these platforms when developing interactive and reusable data a…

    Submitted 19 October, 2019; originally announced October 2019.

  24. arXiv:1909.04238  [pdf, other]

    cs.SE

    LVMapper: A Large-variance Clone Detector Using Sequencing Alignment Approach

    Authors: Ming Wu, Pengcheng Wang, Kangqi Yin, Haoyu Cheng, Yun Xu, Chanchal K. Roy

    Abstract: To detect large-variance code clones (i.e. clones with relatively more differences) in large-scale code repositories is difficult because most current tools can only detect almost identical or very similar clones. It will make promotion and changes to some software applications such as bug detection, code completion, software analysis, etc. Recently, CCAligner made an attempt to detect clones with…

    Submitted 9 September, 2019; originally announced September 2019.

  25. arXiv:1903.07662  [pdf, other]

    cs.SE

    Recommending Comprehensive Solutions for Programming Tasks by Mining Crowd Knowledge

    Authors: Rodrigo F. G. Silva, Chanchal K. Roy, Mohammad Masudur Rahman, Kevin A. Schneider, Klerisson Paixao, Marcelo de Almeida Maia

    Abstract: Developers often search for relevant code examples on the web for their programming tasks. Unfortunately, they face two major problems. First, the search is impaired due to a lexical gap between their query (task description) and the information associated with the solution. Second, the retrieved solution may not be comprehensive, i.e., the code segment might miss a succinct explanation. These pro…

    Submitted 20 March, 2019; v1 submitted 18 March, 2019; originally announced March 2019.

    Comments: Accepted at ICPC, 12 pages, 2019

  26. arXiv:1808.00594  [pdf, other]

    cs.SE

    Improving IR-Based Bug Localization with Context-Aware Query Reformulation

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy

    Abstract: Recent findings suggest that Information Retrieval (IR)-based bug localization techniques do not perform well if the bug report lacks rich structured information (eg relevant program entity names). Conversely, excessive structured information (eg stack traces) in the bug report might not always help the automated localization either. In this paper, we propose a novel technique--BLIZZARD-- that aut…

    Submitted 1 August, 2018; originally announced August 2018.

    Comments: To be presented at The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2018), FL, USA

    Journal ref: In Proc. ESEC/FSE 2018

  27. arXiv:1807.08798  [pdf, other]

    cs.SE

    Effective Reformulation of Query for Code Search using Crowdsourced Knowledge and Extra-Large Data Analytics

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy

    Abstract: Software developers frequently issue generic natural language queries for code search while using code search engines (e.g., GitHub native search, Krugle). Such queries often do not lead to any relevant results due to vocabulary mismatch problems. In this paper, we propose a novel technique that automatically identifies relevant and specific API classes from Stack Overflow Q & A site for a program…

    Submitted 23 July, 2018; originally announced July 2018.

    Comments: The 34th International Conference on Software Maintenance and Evolution (ICSME 2018), pp. 12, Madrid, Spain, September, 2018

    Journal ref: Proc. ICSME 2018

  28. Poster: Improving Bug Localization with Report Quality Dynamics and Query Reformulation

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy

    Abstract: Recent findings from a user study suggest that IR-based bug localization techniques do not perform well if the bug report lacks rich structured information such as relevant program entity names. On the contrary, excessive structured information such as stack traces in the bug report might always not be helpful for the automated bug localization. In this paper, we conduct a large empirical study us…

    Submitted 19 July, 2018; originally announced July 2018.

    Comments: The 40th International Conference on Software Engineering (Companion volume, Poster Track) (ICSE 2018), pp. 348--349, Gothenburg, Sweden, May, 2018

    Journal ref: Proc. ICSE-C 2018, pp. 348--349

  29. arXiv:1807.04488  [pdf, other]

    cs.SE

    Improved Query Reformulation for Concept Location using CodeRank and Document Structures

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy

    Abstract: During software maintenance, developers usually deal with a significant number of software change requests. As a part of this, they often formulate an initial query from the request texts, and then attempt to map the concepts discussed in the request to relevant source code locations in the software system (a.k.a., concept location). Unfortunately, studies suggest that they often perform poorly in…

    Submitted 12 July, 2018; originally announced July 2018.

    Comments: The 32nd International Conference on Automated Software Engineering (ASE 2017), pp. 428-439, Urbana-Champaign, Illinois, USA, October, 2017

    Report number: 10.1109/ASE.2017.8115655

    Journal ref: Proc. ASE 2017, pp. 428-439

  30. arXiv:1807.04485  [pdf, other]

    cs.SE

    Predicting Usefulness of Code Review Comments using Textual Features and Developer Experience

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy, Raula G. Kula

    Abstract: Although peer code review is widely adopted in both commercial and open source development, existing studies suggest that such code reviews often contain a significant amount of non-useful review comments. Unfortunately, to date, no tools or techniques exist that can provide automatic support in improving those non-useful comments. In this paper, we first report a comparative study between useful…

    Submitted 12 July, 2018; originally announced July 2018.

    Comments: The 14th International Conference on Mining Software Repositories (MSR 2017), pp. 215--226, Buenos Aires, Argentina, May, 2017

    Report number: 10.1109/MSR.2017.17

    Journal ref: Proc. MSR 2017, pp. 215--226

  31. arXiv:1807.04479  [pdf, other]

    cs.SE

    RACK: Code Search in the IDE using Crowdsourced Knowledge

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy, David Lo

    Abstract: Traditional code search engines often do not perform well with natural language queries since they mostly apply keyword matching. These engines thus require carefully designed queries containing information about programming APIs for code search. Unfortunately, existing studies suggest that preparing an effective query for code search is both challenging and time consuming for the developers. In t…

    Submitted 12 July, 2018; originally announced July 2018.

    Comments: The 39th International Conference on Software Engineering (Companion volume) (ICSE 2017), pp. 51--54, Buenos Aires, Argentina, May, 2017

    Report number: 10.1109/ICSE-C.2017.11

    Journal ref: Proc. ICSE-C 2017, pp. 51--54

  32. arXiv:1807.04475  [pdf, other]

    cs.SE

    STRICT: Information Retrieval Based Search Term Identification for Concept Location

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy

    Abstract: During maintenance, software developers deal with numerous change requests that are written in an unstructured fashion using natural language. Such natural language texts illustrate the change requirement involving various domain related concepts. Software developers need to find appropriate search terms from those concepts so that they could locate the possible locations in the source code using…

    Submitted 12 July, 2018; originally announced July 2018.

    Comments: The 24th IEEE International Conference on Software Analysis, Evolution, and Reengineering (SANER 2017), pp. 79--90, Klagenfurt, Austria, February 2017

    Report number: 10.1109/SANER.2017.7884611

    Journal ref: Proc. SANER 2017, pp. 79--90

  33. CORRECT: Code Reviewer Recommendation at GitHub for Vendasta Technologies

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy, Jesse Redl, Jason A. Collins

    Abstract: Peer code review locates common coding standard violations and simple logical errors in the early phases of software development, and thus, reduces overall cost. Unfortunately, at GitHub, identifying an appropriate code reviewer for a pull request is challenging given that reliable information for reviewer identification is often not readily available. In this paper, we propose a code reviewer rec…

    Submitted 9 July, 2018; originally announced July 2018.

    Comments: The 31st IEEE/ACM International Conference on Automated Software Engineering (ASE 2016), pp. 792--797, Singapore, September 2016. arXiv admin note: substantial text overlap with arXiv:1807.02965

    Journal ref: Proc. ASE 2016, pp. 792--797

  34. CORRECT: Code Reviewer Recommendation in GitHub Based on Cross-Project and Technology Experience

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy, Jason A. Collins

    Abstract: Peer code review locates common coding rule violations and simple logical errors in the early phases of software development, and thus reduces overall cost. However, in GitHub, identifying an appropriate code reviewer for a pull request is a non-trivial task given that reliable information for reviewer identification is often not readily available. In this paper, we propose a code reviewer recomme…

    Submitted 9 July, 2018; originally announced July 2018.

    Comments: The 38th International Conference on Software Engineering (Companion volume) (ICSE 2016), pp. 222--231, Austin Texas, USA, May 2016

    Journal ref: Proc. ICSE-C 2016, pp. 222--231

  35. QUICKAR: Automatic Query Reformulation for Concept Location using Crowdsourced Knowledge

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy

    Abstract: During maintenance, software developers deal with numerous change requests made by the users of a software system. Studies show that the developers find it challenging to select appropriate search terms from a change request during concept location. In this paper, we propose a novel technique--QUICKAR--that automatically suggests helpful reformulations for a given query by leveraging the crowdsour…

    Submitted 9 July, 2018; originally announced July 2018.

    Comments: The 31st IEEE/ACM International Conference on Automated Software Engineering (ASE 2016), pp. 220--225, Singapore, September 2016

    Journal ref: Proc. ASE 2016, pp. 220--225

  36. RACK: Automatic API Recommendation using Crowdsourced Knowledge

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy, David Lo

    Abstract: Traditional code search engines often do not perform well with natural language queries since they mostly apply keyword matching. These engines thus need carefully designed queries containing information about programming APIs for code search. Unfortunately, existing studies suggest that preparing an effective code search query is both challenging and time consuming for the developers. In this pap…

    Submitted 9 July, 2018; originally announced July 2018.

    Comments: The 23rd IEEE International Conference on Software Analysis, Evolution, and Reengineering (SANER 2016), pp. 349--359, Osaka, Japan, March 2016

    Journal ref: Proc. SANER 2016, pp. 349--359

  37. Recommending Insightful Comments for Source Code using Crowdsourced Knowledge

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy, Iman Keivanloo

    Abstract: Recently, automatic code comment generation is proposed to facilitate program comprehension. Existing code comment generation techniques focus on describing the functionality of the source code. However, there are other aspects such as insights about quality or issues of the code, which are overlooked by earlier approaches. In this paper, we describe a mining approach that recommends insightful co…

    Submitted 6 July, 2018; originally announced July 2018.

    Comments: The 15th IEEE International Working Conference on Source Code Analysis and Manipulation (SCAM 2015), pp. 81--90, Bremen, Germany, September 2015

    Journal ref: Proc. SCAM 2015, pp. 81--90

  38. arXiv:1807.02274  [pdf, other]

    cs.SE cs.IR

    Recommending Relevant Sections from a Webpage about Programming Errors and Exceptions

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy

    Abstract: Programming errors or exceptions are inherent in software development and maintenance, and given today's Internet era, software developers often look at web for finding working solutions. They make use of a search engine for retrieving relevant pages, and then look for the appropriate solutions by manually going through the pages one by one. However, both the manual checking of a page's content ag…

    Submitted 6 July, 2018; originally announced July 2018.

    Comments: 25th Annual International Conference on Computer Science and Software Engineering (CASCON 2015), pp. 181--190, Markham, Canada, November 2015

    Journal ref: Proc. CASCON 2015, pp. 181--190

  39. TextRank Based Search Term Identification for Software Change Tasks

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy

    Abstract: During maintenance, software developers deal with a number of software change requests. Each of those requests is generally written using natural language texts, and it involves one or more domain related concepts. A developer needs to map those concepts to exact source code locations within the project in order to implement the requested change. This mapping generally starts with a search within…

    Submitted 6 July, 2018; originally announced July 2018.

    Comments: The 22nd IEEE International Conference on Software Analysis, Evolution, and Reengineering (SANER 2015), pp. 540--544, Montreal, Canada, March 2015

    Journal ref: Proc. SANER 2015, pp. 540--544

  40. On the Use of Context in Recommending Exception Handling Code Examples

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy

    Abstract: Studies show that software developers often either misuse exception handling features or use them inefficiently, and such a practice may lead an undergoing software project to a fragile, insecure and non-robust application system. In this paper, we propose a context-aware code recommendation approach that recommends exception handling code examples from a number of popular open source code reposit…

    Submitted 6 July, 2018; originally announced July 2018.

    Comments: The 14th IEEE International Working Conference on Source Code Analysis and Manipulation (SCAM 2014), pp. 285--294, Victoria, Canada, September 2014

    Journal ref: Proc. SCAM 2014, pp. 285--294

  41. SurfClipse: Context-Aware Meta Search in the IDE

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy

    Abstract: Despite various debugging supports of the existing IDEs for programming errors and exceptions, software developers often look at web for working solutions or any up-to-date information. Traditional web search does not consider the context of the problems that they search solutions for, and thus it often does not help much in problem solving. In this paper, we propose a context-aware meta search to…

    Submitted 6 July, 2018; originally announced July 2018.

    Comments: The 30th International Conference on Software Maintenance and Evolution (ICSME 2014), pp. 617--620, Victoria, Canada, September 2014

    Journal ref: Proc. ICSME 2014, pp. 617--620

  42. Towards a Context-Aware IDE-Based Meta Search Engine for Recommendation about Programming Errors and Exceptions

    Authors: Mohammad Masudur Rahman, Shamima Yeasmin, Chanchal K. Roy

    Abstract: Study shows that software developers spend about 19% of their time looking for information in the web during software development and maintenance. Traditional web search forces them to leave the working environment (e.g., IDE) and look for information in the web browser. It also does not consider the context of the problems that the developers search solutions for. The frequent switching between w…

    Submitted 6 July, 2018; originally announced July 2018.

    Comments: IEEE CSMR-18/WCRE-21 Software Evolution Week (CSMR-WCRE 2014), pp. 194--203, Antwerp, Belgium, February 2014

    Journal ref: Proc. CSMR-WCRE 2014, pp. 194--203

  43. An IDE-Based Context-Aware Meta Search Engine

    Authors: Mohammad Masudur Rahman, Shamima Yeasmin, Chanchal K. Roy

    Abstract: Traditional web search forces the developers to leave their working environments and look for solutions in the web browsers. It often does not consider the context of their programming problems. The context-switching between the web browser and the working environment is time-consuming and distracting, and the keyword-based traditional search often does not help much in problem solving. In this pa…

    Submitted 5 July, 2018; originally announced July 2018.

    Comments: 20th Working Conference on Reverse Engineering (WCRE 2013), Koblenz, Germany, October 2013, pp. 467--471

  44. An Insight into the Pull Requests of GitHub

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy

    Abstract: Given the increasing number of unsuccessful pull requests in GitHub projects, insights into the success and failure of these requests are essential for the developers. In this paper, we provide a comparative study between successful and unsuccessful pull requests made to 78 GitHub base projects by 20,142 developers from 103,192 forked projects. In the study, we analyze pull request discussion text…

    Submitted 5 July, 2018; originally announced July 2018.

    Comments: 11th Working Conference on Mining Software Repositories (MSR 2014), pp. 364--367, Hyderabad, India, May 2014

  45. Impact of Continuous Integration on Code Reviews

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy

    Abstract: Peer code review and continuous integration often interleave with each other in the modern software quality management. Although several studies investigate how non-technical factors (e.g., reviewer workload), developer participation and even patch size affect the code review process, the impact of continuous integration on code reviews is not yet properly understood. In this paper, we report an e…

    Submitted 5 July, 2018; originally announced July 2018.

    Comments: International Conference on Mining Software Repositories (MSR 2017), pp. 499--502, Buenos Aires, Argentina, May, 2017

    Journal ref: International Conference on Mining Software Repositories (MSR 2017), pp. 499--502, Buenos Aires, Argentina, May, 2017

  46. An Insight into the Unresolved Questions at Stack Overflow

    Authors: Mohammad Masudur Rahman, Chanchal K. Roy

    Abstract: For a significant number of questions at Stack Overflow, none of the posted answers were accepted as solutions. Acceptance of an answer indicates that the answer actually solves the discussed problem in the question, and the question is answered sufficiently. In this paper, we investigate 3,956 such unresolved questions using an exploratory study where we analyze four important aspects of those qu…

    Submitted 5 July, 2018; originally announced July 2018.

    Comments: 12th Working Conference on Mining Software Repositories (MSR 2015), pp. 426--429, Florence, Italy, May 2015

    Journal ref: Proc. MSR (2015), 426-429

  47. SourcererCC: Scaling Code Clone Detection to Big Code

    Authors: Hitesh Sajnani, Vaibhav Saini, Jeffrey Svajlenko, Chanchal K. Roy, Cristina V. Lopes

    Abstract: Despite a decade of active research, there is a marked lack in clone detectors that scale to very large repositories of source code, in particular for detecting near-miss clones where significant editing activities may take place in the cloned code. We present SourcererCC, a token-based clone detector that targets three clone types, and exploits an index to achieve scalability to large inter-proje…

    Submitted 20 December, 2015; originally announced December 2015.

    Comments: Accepted for publication at ICSE'16 (preprint, unrevised)