Showing 1–3 of 3 results for author: Stoica, I

Searching in archive eess.
  1. arXiv:2403.02419

    cs.LG cs.AI cs.CL eess.SY

    Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems

    Authors: Lingjiao Chen, Jared Quincy Davis, Boris Hanin, Peter Bailis, Ion Stoica, Matei Zaharia, James Zou

    Abstract: Many recent state-of-the-art results in language tasks were achieved using compound systems that perform multiple Language Model (LM) calls and aggregate their responses. However, there is little understanding of how the number of LM calls - e.g., when asking the LM to answer each question multiple times and taking a majority vote - affects such a compound system's performance. In this paper, we i…

    Submitted 4 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  2. arXiv:2112.07238

    eess.SY

    Composing MPC with LQR and Neural Network for Amortized Efficiency and Stable Control

    Authors: Fangyu Wu, Guanhua Wang, Siyuan Zhuang, Kehan Wang, Alexander Keimer, Ion Stoica, Alexandre Bayen

    Abstract: Model predictive control (MPC) is a powerful control method that handles dynamical systems with constraints. However, solving MPC iteratively in real time, i.e., implicit MPC, remains a computational challenge. To address this, common solutions include explicit MPC and function approximation. Both methods, whenever applicable, may improve the computational efficiency of the implicit MPC by several…

    Submitted 2 August, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: 13 pages, 10 figures, 2 tables

  3. arXiv:1908.01275

    cs.LG cs.AI eess.SY

    A View on Deep Reinforcement Learning in System Optimization

    Authors: Ameer Haj-Ali, Nesreen K. Ahmed, Ted Willke, Joseph Gonzalez, Krste Asanovic, Ion Stoica

    Abstract: Many real-world systems problems require reasoning about the long-term consequences of actions taken to configure and manage the system. These problems, with delayed and often sequentially aggregated reward, are often inherently reinforcement learning problems and present the opportunity to leverage the recent substantial advances in deep reinforcement learning. However, in some cases, it is not cl…

    Submitted 4 September, 2019; v1 submitted 4 August, 2019; originally announced August 2019.