Showing 1–6 of 6 results for author: Prenner, J A

  1. arXiv:2312.04986  [pdf, other]

    cs.SE cs.AI

    Out of Context: How important is Local Context in Neural Program Repair?

    Authors: Julian Aron Prenner, Romain Robbes

    Abstract: Deep learning source code models have been applied very successfully to the problem of automated program repair. One of the standing issues is the small input window of current models which often cannot fully fit the context code required for a bug fix (e.g., method or class declarations of a project). Instead, input is often restricted to the local context, that is, the lines below and above the…

    Submitted 8 December, 2023; originally announced December 2023.

  2. arXiv:2304.01102  [pdf, other]

    cs.SE cs.LG

    RunBugRun -- An Executable Dataset for Automated Program Repair

    Authors: Julian Aron Prenner, Romain Robbes

    Abstract: Recently, we can notice a transition to data-driven techniques in Automated Program Repair (APR), in particular towards deep neural networks. This entails training on hundreds of thousands or even millions of non-executable code fragments. We would like to bring more attention to an aspect of code often neglected in Neural Program Repair (NPR), namely its execution. Code execution has several sign…

    Submitted 3 April, 2023; originally announced April 2023.

  3. arXiv:2212.02684  [pdf, other]

    cs.SE cs.LG

    Codex Hacks HackerRank: Memorization Issues and a Framework for Code Synthesis Evaluation

    Authors: Anjan Karmakar, Julian Aron Prenner, Marco D'Ambros, Romain Robbes

    Abstract: The Codex model has demonstrated extraordinary competence in synthesizing code from natural language problem descriptions. However, in order to reveal unknown failure modes and hidden biases, such large-scale models must be systematically subjected to multiple and diverse evaluation studies. In this work, we evaluate the code synthesis capabilities of the Codex model based on a set of 115 Python…

    Submitted 5 December, 2022; originally announced December 2022.

  4. arXiv:2111.03922  [pdf, ps, other]

    cs.SE

    Automatic Program Repair with OpenAI's Codex: Evaluating QuixBugs

    Authors: Julian Aron Prenner, Romain Robbes

    Abstract: OpenAI's Codex, a GPT-3 like model trained on a large code corpus, has made headlines in and outside of academia. Given a short user-provided description, it is capable of synthesizing code snippets that are syntactically and semantically valid in most cases. In this work, we want to investigate whether Codex is able to localize and fix bugs, a task of central interest in the field of automated pr…

    Submitted 6 November, 2021; originally announced November 2021.

  5. arXiv:2106.15209  [pdf, other]

    cs.SE

    Making the most of small Software Engineering datasets with modern machine learning

    Authors: Julian Aron Prenner, Romain Robbes

    Abstract: This paper provides a starting point for Software Engineering (SE) researchers and practitioners faced with the problem of training machine learning models on small datasets. Due to the high costs associated with labeling data, in Software Engineering, there exist many small (< 1 000 samples) and medium-sized (< 100 000 samples) datasets. While deep learning has set the state of the art in many mac…

    Submitted 29 June, 2021; originally announced June 2021.

  6. arXiv:2103.01722  [pdf, other]

    cs.SE

    Mining Software Repositories with a Collaborative Heuristic Repository

    Authors: Hlib Babii, Julian Aron Prenner, Laurin Stricker, Anjan Karmakar, Andrea Janes, Romain Robbes

    Abstract: Many software engineering studies or tasks rely on categorizing software engineering artifacts. In practice, this is done either by defining simple but often imprecise heuristics, or by manual labelling of the artifacts. Unfortunately, errors in these categorizations impact the tasks that rely on them. To improve the precision of these categorizations, we propose to gather heuristics in a collabor…

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: 5 pages; to appear in Proceedings of ICSE NIER 2021