The Larger the Better? Improved LLM Code-Generation via Budget Reallocation

Hassid, Michael; Remez, Tal; Gehring, Jonas; Schwartz, Roy; Adi, Yossi

Computer Science > Software Engineering

arXiv:2404.00725 (cs)

[Submitted on 31 Mar 2024]

Title:The Larger the Better? Improved LLM Code-Generation via Budget Reallocation

Authors:Michael Hassid, Tal Remez, Jonas Gehring, Roy Schwartz, Yossi Adi

View PDF HTML (experimental)

Abstract:It is a common belief that large language models (LLMs) are better than smaller-sized ones. However, larger models also require significantly more time and compute during inference. This begs the question: what happens when both models operate under the same budget? (e.g., compute, run-time). To address this question, we analyze code generation LLMs of various sizes and make comparisons such as running a 70B model once vs. generating five outputs from a 13B model and selecting one. Our findings reveal that, in a standard unit-test setup, the repeated use of smaller models can yield consistent improvements, with gains of up to 15% across five tasks. On the other hand, in scenarios where unit-tests are unavailable, a ranking-based selection of candidates from the smaller model falls short of the performance of a single output from larger ones. Our results highlight the potential of using smaller models instead of larger ones, and the importance of studying approaches for ranking LLM outputs.

Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2404.00725 [cs.SE]
	(or arXiv:2404.00725v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2404.00725

Submission history

From: Yossi Adi [view email]
[v1] Sun, 31 Mar 2024 15:55:49 UTC (1,030 KB)

Computer Science > Software Engineering

Title:The Larger the Better? Improved LLM Code-Generation via Budget Reallocation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Software Engineering

Title:The Larger the Better? Improved LLM Code-Generation via Budget Reallocation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators