Skip to main content

Showing 1–5 of 5 results for author: Islamov, R

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.20114  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Near Optimal Decentralized Optimization with Compression and Momentum Tracking

    Authors: Rustem Islamov, Yuan Gao, Sebastian U. Stich

    Abstract: Communication efficiency has garnered significant attention as it is considered the main bottleneck for large-scale decentralized Machine Learning applications in distributed and federated settings. In this regime, clients are restricted to transmitting small amounts of quantized information to their neighbors over a communication graph. Numerous endeavors have been made to address this challengin… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  2. arXiv:2311.05645  [pdf, other

    math.OC cs.LG stat.ML

    EControl: Fast Distributed Optimization with Compression and Error Control

    Authors: Yuan Gao, Rustem Islamov, Sebastian Stich

    Abstract: Modern distributed training relies heavily on communication compression to reduce the communication overhead. In this work, we study algorithms employing a popular class of contractive compressors in order to reduce communication overhead. However, the naive implementation often leads to unstable convergence or even exponential divergence due to the compression bias. Error Compensation (EC) is an… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  3. arXiv:2310.20452  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    AsGrad: A Sharp Unified Analysis of Asynchronous-SGD Algorithms

    Authors: Rustem Islamov, Mher Safaryan, Dan Alistarh

    Abstract: We analyze asynchronous-type algorithms for distributed SGD in the heterogeneous setting, where each worker has its own computation and communication speeds, as well as data distribution. In these algorithms, workers compute possibly stale and stochastic gradients associated with their local data at some iteration back in history and then return those gradients to the server without synchronizing… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  4. arXiv:2305.18929  [pdf, other

    cs.LG math.OC stat.ML

    Clip21: Error Feedback for Gradient Clip**

    Authors: Sarit Khirirat, Eduard Gorbunov, Samuel Horváth, Rustem Islamov, Fakhri Karray, Peter Richtárik

    Abstract: Motivated by the increasing popularity and importance of large-scale training under differential privacy (DP) constraints, we study distributed gradient methods with gradient clip**, i.e., clip** applied to the gradients computed from local information at the nodes. While gradient clip** is an essential tool for injecting formal DP guarantees into gradient-based methods [1], it also induces… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  5. arXiv:2305.18285  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    Partially Personalized Federated Learning: Breaking the Curse of Data Heterogeneity

    Authors: Konstantin Mishchenko, Rustem Islamov, Eduard Gorbunov, Samuel Horváth

    Abstract: We present a partially personalized formulation of Federated Learning (FL) that strikes a balance between the flexibility of personalization and cooperativeness of global training. In our framework, we split the variables into global parameters, which are shared across all clients, and individual local parameters, which are kept private. We prove that under the right split of parameters, it is pos… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.