Skip to main content

Showing 1–2 of 2 results for author: Zadouri, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.05444  [pdf, other

    cs.CL cs.LG

    Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning

    Authors: Ted Zadouri, Ahmet Üstün, Arash Ahmadian, Beyza Ermiş, Acyr Locatelli, Sara Hooker

    Abstract: The Mixture of Experts (MoE) is a widely known neural architecture where an ensemble of specialized sub-models optimizes overall performance with a constant computational cost. However, conventional MoEs pose challenges at scale due to the need to store all experts in memory. In this paper, we push MoE to the limit. We propose extremely parameter-efficient MoE by uniquely combining MoE architectur… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

  2. arXiv:2303.11937  [pdf, other

    cs.DS cs.LG math.OC

    High Probability Bounds for Stochastic Continuous Submodular Maximization

    Authors: Evan Becker, **gdong Gao, Ted Zadouri, Baharan Mirzasoleiman

    Abstract: We consider maximization of stochastic monotone continuous submodular functions (CSF) with a diminishing return property. Existing algorithms only guarantee the performance \textit{in expectation}, and do not bound the probability of getting a bad solution. This implies that for a particular run of the algorithms, the solution may be much worse than the provided guarantee in expectation. In this p… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS) 2023