Skip to main content

Showing 1–1 of 1 results for author: Hanin, B

Searching in archive eess. Search in all archives.
.
  1. arXiv:2403.02419  [pdf, other

    cs.LG cs.AI cs.CL eess.SY

    Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems

    Authors: Lingjiao Chen, Jared Quincy Davis, Boris Hanin, Peter Bailis, Ion Stoica, Matei Zaharia, James Zou

    Abstract: Many recent state-of-the-art results in language tasks were achieved using compound systems that perform multiple Language Model (LM) calls and aggregate their responses. However, there is little understanding of how the number of LM calls - e.g., when asking the LM to answer each question multiple times and taking a majority vote - affects such a compound system's performance. In this paper, we i… ▽ More

    Submitted 4 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.