-
A RAG Method for Source Code Inquiry Tailored to Long-Context LLMs
Abstract: Although the context length limitation of large language models (LLMs) has been mitigated, it still hinders their application to software development tasks. This study proposes a method incorporating execution traces into RAG for inquiries about source code. Small-scale experiments confirm a tendency for the method to contribute to improving LLM response quality.
Submitted 9 April, 2024; originally announced April 2024.
Comments: 6 pages, 2 columns, English translation of the manuscript originally presented in Japanese at a domestic workshop
MSC Class: 68-04 ACM Class: D.2.3; D.2.5
-
MSCCD: Grammar Pluggable Clone Detection Based on ANTLR Parser Generation
Abstract: For various reasons, programming languages continue to multiply and evolve. It has become necessary to have a multilingual clone detection tool that can easily expand supported programming languages and detect various code clones is needed. However, research on multilingual code clone detection has not received sufficient attention. In this study, we propose MSCCD (Multilingual Syntactic Code Clon… ▽ More
Submitted 6 April, 2022; v1 submitted 3 April, 2022; originally announced April 2022.
Comments: ICPC2022
-
arXiv:2110.01092 [pdf, ps, other]
Towards Informative Tagging of Code Fragments to Support the Investigation of Code Clones
Abstract: Investigating the code fragments of code clones detected by code clone detection tools is a time-consuming task, especially when a large number of reference source files are available. This paper proposes (i) a method for clustering a clone class, which is detected by code clone detection tools using syntactic similarity, based on topic similarity by considering its code fragments as sequences of… ▽ More
Submitted 3 October, 2021; originally announced October 2021.
Comments: 7 pages, 15th International Workshop on Software Clones