Skip to main content

Showing 1–6 of 6 results for author: Shrikanth, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2110.08512  [pdf, other

    cs.SE cs.AI

    AugmentedCode: Examining the Effects of Natural Language Resources in Code Retrieval Models

    Authors: Mehdi Bahrami, N. C. Shrikanth, Yuji Mizobuchi, Lei Liu, Masahiro Fukuyori, Wei-Peng Chen, Kazuki Munakata

    Abstract: Code retrieval is allowing software engineers to search codes through a natural language query, which relies on both natural language processing and software engineering techniques. There have been several attempts on code retrieval from searching snippet codes to function codes. In this paper, we introduce Augmented Code (AugmentedCode) retrieval which takes advantage of existing information with… ▽ More

    Submitted 16 October, 2021; originally announced October 2021.

    Comments: 7 pages, 2 figures, 5 tables, 1 video

  2. arXiv:2110.01710  [pdf, other

    cs.SE

    PyTorrent: A Python Library Corpus for Large-scale Language Models

    Authors: Mehdi Bahrami, N. C. Shrikanth, Shade Ruangwan, Lei Liu, Yuji Mizobuchi, Masahiro Fukuyori, Wei-Peng Chen, Kazuki Munakata, Tim Menzies

    Abstract: A large scale collection of both semantic and natural language resources is essential to leverage active Software Engineering research areas such as code reuse and code comprehensibility. Existing machine learning models ingest data from Open Source repositories (like GitHub projects) and forum discussions (like Stackoverflow.com), whereas, in this showcase, we took a step backward to orchestrate… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: 10 pages, 2 figures, 5 tables

  3. arXiv:2105.11082  [pdf, other

    cs.SE cs.AI cs.LG

    Assessing the Early Bird Heuristic (for Predicting Project Quality)

    Authors: N. C. Shrikanth, Tim Menzies

    Abstract: Before researchers rush to reason across all available data or try complex methods, perhaps it is prudent to first check for simpler alternatives. Specifically, if the historical data has the most information in some small region, perhaps a model learned from that region would suffice for the rest of the project. To support this claim, we offer a case study with 240 projects, where we find that… ▽ More

    Submitted 11 January, 2023; v1 submitted 23 May, 2021; originally announced May 2021.

    Comments: 38 pages (Accepted TOSEM Jan 2023)

  4. arXiv:2011.13071  [pdf, other

    cs.SE cs.LG

    Early Life Cycle Software Defect Prediction. Why? How?

    Authors: N. C. Shrikanth, Suvodeep Majumder, Tim Menzies

    Abstract: Many researchers assume that, for software analytics, "more data is better." We write to show that, at least for learning defect predictors, this may not be true. To demonstrate this, we analyzed hundreds of popular GitHub projects. These projects ran for 84 months and contained 3,728 commits (median values). Across these projects, most of the defects occur very early in their life cycle. Hence, d… ▽ More

    Submitted 8 February, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: 12 pages (To appear ICSE 2021)

  5. Assessing Practitioner Beliefs about Software Engineering

    Authors: N. C. Shrikanth, William Nichols, Fahmid Morshed Fahid, Tim Menzies

    Abstract: Software engineering is a highly dynamic discipline. Hence, as times change, so too might our beliefs about core processes in this field. This paper checks some five beliefs that originated in the past decades that comment on the relationships between (i) developer productivity; (ii) software quality and (iii) years of developer experience. Using data collected from 1,356 developers in the period… ▽ More

    Submitted 24 May, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: 32 pages, published https://link.springer.com/article/10.1007/s10664-021-09957-5

  6. Assessing Practitioner Beliefs about Software Defect Prediction

    Authors: N. C. Shrikanth, Tim Menzies

    Abstract: Just because software developers say they believe in "X", that does not necessarily mean that "X" is true. As shown here, there exist numerous beliefs listed in the recent Software Engineering literature which are only supported by small portions of the available data. Hence we ask what is the source of this disconnect between beliefs and evidence?. To answer this question we look for evidence for… ▽ More

    Submitted 8 April, 2020; v1 submitted 20 December, 2019; originally announced December 2019.

    Comments: 9 pages, 3 Figures, 4 Tables, ICSE SEIP 2020