Skip to main content

Showing 1–9 of 9 results for author: Fakhoury, S

.
  1. arXiv:2405.01787  [pdf, other

    cs.PL cs.AI cs.SE

    Towards Neural Synthesis for SMT-Assisted Proof-Oriented Programming

    Authors: Saikat Chakraborty, Gabriel Ebner, Siddharth Bhat, Sarah Fakhoury, Sakina Fatima, Shuvendu Lahiri, Nikhil Swamy

    Abstract: Proof-oriented programs mix computational content with proofs of program correctness. However, the human effort involved in programming and proving is still substantial, despite the use of Satisfiability Modulo Theories (SMT) solvers to automate proofs in languages such as F*. Seeking to spur research on using AI to automate the construction of proof-oriented programs, we curate a dataset of 600… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  2. arXiv:2404.10362  [pdf, other

    cs.SE

    3DGen: AI-Assisted Generation of Provably Correct Binary Format Parsers

    Authors: Sarah Fakhoury, Markus Kuppe, Shuvendu K. Lahiri, Tahina Ramananandro, Nikhil Swamy

    Abstract: Improper parsing of attacker-controlled input is a leading source of software security vulnerabilities, especially when programmers transcribe informal format descriptions in RFCs into efficient parsing logic in low-level, memory unsafe languages. Several researchers have proposed formal specification languages for data formats from which efficient code can be extracted. However, distilling inform… ▽ More

    Submitted 6 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  3. arXiv:2404.10100  [pdf, other

    cs.SE

    LLM-based Test-driven Interactive Code Generation: User Study and Empirical Evaluation

    Authors: Sarah Fakhoury, Aaditya Naik, Georgios Sakkas, Saikat Chakraborty, Shuvendu K. Lahiri

    Abstract: Large language models (LLMs) have shown great potential in automating significant aspects of coding by producing natural code from informal natural language (NL) intent. However, given NL is informal, it does not lend easily to checking that the generated code correctly satisfies the user intent. In this paper, we propose a novel interactive workflow TiCoder for guided intent clarification (i.e.,… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  4. arXiv:2310.09342  [pdf, other

    cs.PL cs.AI cs.CL cs.SE

    Ranking LLM-Generated Loop Invariants for Program Verification

    Authors: Saikat Chakraborty, Shuvendu K. Lahiri, Sarah Fakhoury, Madanlal Musuvathi, Akash Lal, Aseem Rastogi, Aditya Senthilnathan, Rahul Sharma, Nikhil Swamy

    Abstract: Synthesizing inductive loop invariants is fundamental to automating program verification. In this work, we observe that Large Language Models (such as gpt-3.5 or gpt-4) are capable of synthesizing loop invariants for a class of programs in a 0-shot setting, yet require several samples to generate the correct invariants. This can lead to a large number of calls to a program verifier to establish an… ▽ More

    Submitted 12 February, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Findings of The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP-findings 2023)

  5. arXiv:2310.01831  [pdf, other

    cs.SE cs.AI cs.PL

    Can Large Language Models Transform Natural Language Intent into Formal Method Postconditions?

    Authors: Madeline Endres, Sarah Fakhoury, Saikat Chakraborty, Shuvendu K. Lahiri

    Abstract: Informal natural language that describes code functionality, such as code comments or function documentation, may contain substantial information about a programs intent. However, there is typically no guarantee that a programs implementation and natural language documentation are aligned. In the case of a conflict, leveraging information in code-adjacent natural language has the potential to enha… ▽ More

    Submitted 15 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: To appear at the Proceedings of the ACM on Software Engineering (PACMSE), Issue Foundations of Software Engineering (FSE) 2024

  6. arXiv:2304.03816  [pdf, other

    cs.SE cs.LG

    Towards Generating Functionally Correct Code Edits from Natural Language Issue Descriptions

    Authors: Sarah Fakhoury, Saikat Chakraborty, Madan Musuvathi, Shuvendu K. Lahiri

    Abstract: Large language models (LLMs), such as OpenAI's Codex, have demonstrated their potential to generate code from natural language descriptions across a wide range of programming tasks. Several benchmarks have recently emerged to evaluate the ability of LLMs to generate functionally correct code from natural language intent with respect to a set of hidden test cases. This has enabled the research comm… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

  7. arXiv:2208.05950  [pdf, other

    cs.SE cs.LG cs.PL

    Interactive Code Generation via Test-Driven User-Intent Formalization

    Authors: Shuvendu K. Lahiri, Sarah Fakhoury, Aaditya Naik, Georgios Sakkas, Saikat Chakraborty, Madanlal Musuvathi, Piali Choudhury, Curtis von Veh, Jeevana Priya Inala, Chenglong Wang, Jianfeng Gao

    Abstract: Large language models (LLMs) have shown great potential in automating significant aspects of coding by producing natural code from informal natural language (NL) intent. However, when interacting with LLMs, users have no guarantees that the code suggestions produced correctly satisfy the intent they provided. In fact, it is hard to define a notion of correctness since natural language can be ambig… ▽ More

    Submitted 3 October, 2023; v1 submitted 11 August, 2022; originally announced August 2022.

    Comments: 18 pages

  8. Program Merge Conflict Resolution via Neural Transformers

    Authors: Alexey Svyatkovskiy, Sarah Fakhoury, Negar Ghorbani, Todd Mytkowicz, Elizabeth Dinella, Christian Bird, **u Jang, Neel Sundaresan, Shuvendu Lahiri

    Abstract: Collaborative software development is an integral part of the modern software development life cycle, essential to the success of large-scale software projects. When multiple developers make concurrent changes around the same lines of code, a merge conflict may occur. Such conflicts stall pull requests and continuous integration pipelines for hours to several days, seriously hurting developer prod… ▽ More

    Submitted 29 November, 2022; v1 submitted 31 August, 2021; originally announced September 2021.

    Comments: ESEC/FSE '22 camera ready version. 12 pages, 4 figures, online appendix

  9. gazel: Supporting Source Code Edits in Eye-Tracking Studies

    Authors: Sarah Fakhoury, Devjeet Roy, Harry Pines, Tyler Cleveland, Cole Peterson, Venera Arnaoudova, Bonita Sharif, Jonathan Maletic

    Abstract: Eye tracking tools are used in software engineering research to study various software development activities. However, a major limitation of these tools is their inability to track gaze data for activities that involve source code editing. We present a novel solution to support eye tracking experiments for tasks involving source code edits as an extension of the iTrace community infrastructure. W… ▽ More

    Submitted 19 June, 2021; originally announced June 2021.

    Comments: 4 pages, 2 figures

    Journal ref: International Conference on Software Engineering (ICSE) 2021