Showing 1–1 of 1 results for author: La Rosa, R

  1. arXiv:2405.13057  [pdf, other]

    cs.SE cs.AI

    Can Github issues be solved with Tree Of Thoughts?

    Authors: Ricardo La Rosa, Corey Hulse, Bangdi Liu

    Abstract: While there have been extensive studies in code generation by large language models (LLM), where benchmarks like HumanEval have been surpassed with an impressive 96.3% success rate, these benchmarks predominantly judge a model's performance on basic function-level code generation and lack the critical thinking and concept of scope required of real-world scenarios such as solving GitHub issues. Thi…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 8 pages, 2 figures, 7 tables