Skip to main content

Showing 1–5 of 5 results for author: Sahu, A N

Searching in archive cs. Search in all archives.
.
  1. Resource-Efficient Federated Learning

    Authors: Ahmed M. Abdelmoniem, Atal Narayan Sahu, Marco Canini, Suhaib A. Fahmy

    Abstract: Federated Learning (FL) enables distributed training by learners using local data, thereby enhancing privacy and reducing communication. However, it presents numerous challenges relating to the heterogeneity of the data distribution, device capabilities, and participant availability as deployments scale, which can impact both model convergence and bias. Existing FL schemes use random participant s… ▽ More

    Submitted 4 November, 2022; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: Accepted to appear in ACM EuroSys 2023

  2. arXiv:2108.00951  [pdf, other

    cs.LG cs.DC math.OC

    Rethinking gradient sparsification as total error minimization

    Authors: Atal Narayan Sahu, Aritra Dutta, Ahmed M. Abdelmoniem, Trambak Banerjee, Marco Canini, Panos Kalnis

    Abstract: Gradient compression is a widely-established remedy to tackle the communication bottleneck in distributed training of large deep neural networks (DNNs). Under the error-feedback framework, Top-$k$ sparsification, sometimes with $k$ as little as $0.1\%$ of the gradient size, enables training to the same model quality as the uncompressed case for a similar iteration count. From the optimization pers… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: 33 pages, 31 figures

  3. arXiv:2004.02163  [pdf, other

    math.OC cs.CC cs.DC math.NA

    On the Convergence Analysis of Asynchronous SGD for Solving Consistent Linear Systems

    Authors: Atal Narayan Sahu, Aritra Dutta, Aashutosh Tiwari, Peter Richtárik

    Abstract: In the realm of big data and machine learning, data-parallel, distributed stochastic algorithms have drawn significant attention in the present days.~While the synchronous versions of these algorithms are well understood in terms of their convergence, the convergence analyses of their asynchronous counterparts are not widely studied. In this paper, we propose and analyze a {\it distributed, asynch… ▽ More

    Submitted 5 April, 2020; originally announced April 2020.

    MSC Class: 15A06; 15B52; 65F10; 65Y20; 68Q25; 68W20; 68W40; 90C20

  4. arXiv:1911.08250  [pdf, other

    cs.DC cs.LG math.OC

    On the Discrepancy between the Theoretical Analysis and Practical Implementations of Compressed Communication for Distributed Deep Learning

    Authors: Aritra Dutta, El Houcine Bergou, Ahmed M. Abdelmoniem, Chen-Yu Ho, Atal Narayan Sahu, Marco Canini, Panos Kalnis

    Abstract: Compressed communication, in the form of sparsification or quantization of stochastic gradients, is employed to reduce communication costs in distributed data-parallel training of deep neural networks. However, there exists a discrepancy between theory and practice: while theoretical analysis of most existing compression methods assumes compression is applied to the gradients of the entire model,… ▽ More

    Submitted 19 November, 2019; originally announced November 2019.

    Comments: To Appear In Proceedings of Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

    Journal ref: In Proceedings of Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

  5. arXiv:1905.10988  [pdf, other

    cs.LG math.OC stat.ML

    Natural Compression for Distributed Deep Learning

    Authors: Samuel Horvath, Chen-Yu Ho, Ludovit Horvath, Atal Narayan Sahu, Marco Canini, Peter Richtarik

    Abstract: Modern deep learning models are often trained in parallel over a collection of distributed machines to reduce training time. In such settings, communication of model updates among machines becomes a significant performance bottleneck and various lossy update compression techniques have been proposed to alleviate this problem. In this work, we introduce a new, simple yet theoretically and practical… ▽ More

    Submitted 5 September, 2022; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: Proceedings of 3${}^{\text{rd}}$ Annual Conference on Mathematical and Scientific Machine Learning (MSML 2022)