Skip to main content

Showing 1–2 of 2 results for author: Manaa, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2310.06993  [pdf

    cs.DC cs.NI

    Ultima: Robust and Tail-Optimal AllReduce for Distributed Deep Learning in the Cloud

    Authors: Ertza Warraich, Omer Shabtai, Khalid Manaa, Shay Vargaftik, Yonatan Piasetzky, Matty Kadosh, Lalith Suresh, Muhammad Shahbaz

    Abstract: We present Ultima, a new collective-communication system for the cloud with bounded, predictable completion times for deep-learning jobs in the presence of varying computation (stragglers) and communication (congestion and gradient drops) variabilities. Ultima exploits the inherent resiliency and the stochastic nature of distributed deep-learning (DDL) training to work with approximated gradients,… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 12 pages

  2. arXiv:2307.08816  [pdf, other

    cs.LG cs.AI math.OC

    Accelerating Cutting-Plane Algorithms via Reinforcement Learning Surrogates

    Authors: Kyle Mana, Fernando Acero, Stephen Mak, Parisa Zehtabi, Michael Cashmore, Daniele Magazzeni, Manuela Veloso

    Abstract: Discrete optimization belongs to the set of $\mathcal{NP}$-hard problems, spanning fields such as mixed-integer programming and combinatorial optimization. A current standard approach to solving convex discrete optimization problems is the use of cutting-plane algorithms, which reach optimal solutions by iteratively adding inequalities known as \textit{cuts} to refine a feasible set. Despite the e… ▽ More

    Submitted 27 February, 2024; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: Extended version (includes Supplementary Material). Accepted at AAAI 24 Main Track with Oral Presentation