Skip to main content

Showing 1–16 of 16 results for author: Condat, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.20623  [pdf, other

    cs.LG math.OC

    Prune at the Clients, Not the Server: Accelerated Sparse Training in Federated Learning

    Authors: Georg Meinhardt, Kai Yi, Laurent Condat, Peter Richtárik

    Abstract: In the recent paradigm of Federated Learning (FL), multiple clients train a shared model while kee** their local data private. Resource constraints of clients and communication costs pose major problems for training large models in FL. On the one hand, addressing the resource limitations of the clients, sparse training has proven to be a powerful tool in the centralized setting. On the other han… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  2. arXiv:2403.09904  [pdf, other

    cs.LG cs.AI cs.DC

    FedComLoc: Communication-Efficient Distributed Training of Sparse and Quantized Models

    Authors: Kai Yi, Georg Meinhardt, Laurent Condat, Peter Richtárik

    Abstract: Federated Learning (FL) has garnered increasing attention due to its unique characteristic of allowing heterogeneous clients to process their private data locally and interact with a central server, while being respectful of privacy. A critical bottleneck in FL is the communication cost. A pivotal strategy to mitigate this burden is \emph{Local Training}, which involves running multiple local stoc… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  3. arXiv:2403.04348  [pdf, other

    math.OC cs.DC cs.LG

    LoCoDL: Communication-Efficient Distributed Learning with Local Training and Compression

    Authors: Laurent Condat, Artavazd Maranjyan, Peter Richtárik

    Abstract: In Distributed optimization and Learning, and even more in the modern framework of federated learning, communication, which is slow and costly, is critical. We introduce LoCoDL, a communication-efficient algorithm that leverages the two popular and effective techniques of Local training, which reduces the communication frequency, and Compression, in which short bitstreams are sent instead of full-… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  4. arXiv:2310.07983  [pdf, other

    cs.LG math.OC stat.ML

    Revisiting Decentralized ProxSkip: Achieving Linear Speedup

    Authors: Luyao Guo, Sulaiman A. Alghunaim, Kun Yuan, Laurent Condat, **de Cao

    Abstract: The ProxSkip algorithm for decentralized and federated learning is gaining increasing attention due to its proven benefits in accelerating communication complexity while maintaining robustness against data heterogeneity. However, existing analyses of ProxSkip are limited to the strongly convex setting and do not achieve linear speedup, where convergence performance increases linearly with respect… ▽ More

    Submitted 19 April, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

  5. arXiv:2307.09836  [pdf, other

    cs.LG math.OC

    Near-Linear Time Projection onto the $\ell_{1,\infty}$ Ball; Application to Sparse Autoencoders

    Authors: Guillaume Perez, Laurent Condat, Michel Barlaud

    Abstract: Looking for sparsity is nowadays crucial to speed up the training of large-scale neural networks. Projections onto the $\ell_{1,2}$ and $\ell_{1,\infty}$ are among the most efficient techniques to sparsify and reduce the overall cost of neural networks. In this paper, we introduce a new projection algorithm for the $\ell_{1,\infty}$ norm ball. The worst-case time complexity of this algorithm is… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: 22 pages, 8 figures

  6. arXiv:2305.13170  [pdf, other

    cs.LG

    Explicit Personalization and Local Training: Double Communication Acceleration in Federated Learning

    Authors: Kai Yi, Laurent Condat, Peter Richtárik

    Abstract: Federated Learning is an evolving machine learning paradigm, in which multiple clients perform computations based on their individual private data, interspersed by communication with a remote server. A common strategy to curtail communication costs is Local Training, which consists in performing multiple local stochastic gradient descent steps between successive communication rounds. However, the… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  7. arXiv:2302.09832  [pdf, other

    cs.LG math.OC

    TAMUNA: Doubly Accelerated Distributed Optimization with Local Training, Compression, and Partial Participation

    Authors: Laurent Condat, Ivan Agarský, Grigory Malinovsky, Peter Richtárik

    Abstract: In distributed optimization and learning, several machines alternate between local computations in parallel and communication with a distant server. Communication is usually slow and costly and forms the main bottleneck. This is particularly true in federated learning, where a large number of users collaborate toward a global training task. In addition, it is desirable for a robust algorithm to al… ▽ More

    Submitted 27 April, 2024; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: This work is a follow-up of our previous work introducing CompressedScaffnew in paper arXiv:2210.13277

  8. arXiv:2210.13277  [pdf, other

    cs.LG cs.DC math.OC

    Provably Doubly Accelerated Federated Learning: The First Theoretically Successful Combination of Local Training and Communication Compression

    Authors: Laurent Condat, Ivan Agarský, Peter Richtárik

    Abstract: In federated learning, a large number of users are involved in a global learning task, in a collaborative way. They alternate local computations and two-way communication with a distant orchestrating server. Communication, which can be slow and costly, is the main bottleneck in this setting. To reduce the communication load and therefore accelerate distributed gradient descent, two strategies are… ▽ More

    Submitted 2 February, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

  9. arXiv:2205.04180  [pdf, other

    cs.LG cs.DC math.OC

    EF-BV: A Unified Theory of Error Feedback and Variance Reduction Mechanisms for Biased and Unbiased Compression in Distributed Optimization

    Authors: Laurent Condat, Kai Yi, Peter Richtárik

    Abstract: In distributed or federated optimization and learning, communication between the different computing units is often the bottleneck and gradient compression is widely used to reduce the number of bits sent within each communication round of iterative methods. There are two classes of compression operators and separate algorithms making use of them. In the case of unbiased random compressors with bo… ▽ More

    Submitted 6 March, 2023; v1 submitted 9 May, 2022; originally announced May 2022.

    Comments: Conference NeurIPS 2022

  10. arXiv:2108.02602  [pdf, ps, other

    math.OC cs.CV eess.SP

    Tikhonov Regularization of Circle-Valued Signals

    Authors: Laurent Condat

    Abstract: It is common to have to process signals or images whose values are cyclic and can be represented as points on the complex circle, like wrapped phases, angles, orientations, or color hues. We consider a Tikhonov-type regularization model to smoothen or interpolate circle-valued signals defined on arbitrary graphs. We propose a convex relaxation of this nonconvex problem as a semidefinite program, a… ▽ More

    Submitted 7 June, 2022; v1 submitted 5 August, 2021; originally announced August 2021.

  11. arXiv:2106.03056  [pdf, ps, other

    math.OC cs.LG

    MURANA: A Generic Framework for Stochastic Variance-Reduced Optimization

    Authors: Laurent Condat, Peter Richtárik

    Abstract: We propose a generic variance-reduced algorithm, which we call MUltiple RANdomized Algorithm (MURANA), for minimizing a sum of several smooth functions plus a regularizer, in a sequential or distributed manner. Our method is formulated with general stochastic operators, which allow us to model various strategies for reducing the computational complexity. For example, MURANA supports sparse activat… ▽ More

    Submitted 7 March, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

  12. arXiv:2010.03246  [pdf, other

    cs.LG math.OC

    Optimal Gradient Compression for Distributed and Federated Learning

    Authors: Alyazeed Albasyoni, Mher Safaryan, Laurent Condat, Peter Richtárik

    Abstract: Communicating information, like gradient vectors, between computing nodes in distributed and federated learning is typically an unavoidable burden, resulting in scalability issues. Indeed, communication might be slow and costly. Recent advances in communication-efficient training algorithms have reduced this bottleneck by using compression techniques, in the form of sparsification, quantization, o… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

  13. arXiv:2010.00952  [pdf, other

    math.OC cs.LG math.NA

    Distributed Proximal Splitting Algorithms with Rates and Acceleration

    Authors: Laurent Condat, Grigory Malinovsky, Peter Richtárik

    Abstract: We analyze several generic proximal splitting algorithms well suited for large-scale convex nonsmooth optimization. We derive sublinear and linear convergence results with new rates on the function value suboptimality or distance to the solution, as well as new accelerated versions, using varying stepsizes. In addition, we propose distributed variants of these algorithms, which can be accelerated… ▽ More

    Submitted 27 January, 2022; v1 submitted 2 October, 2020; originally announced October 2020.

  14. arXiv:2004.02635  [pdf, other

    math.OC cs.LG stat.ML

    Dualize, Split, Randomize: Toward Fast Nonsmooth Optimization Algorithms

    Authors: Adil Salim, Laurent Condat, Konstantin Mishchenko, Peter Richtárik

    Abstract: We consider minimizing the sum of three convex functions, where the first one F is smooth, the second one is nonsmooth and proximable and the third one is the composition of a nonsmooth proximable function with a linear operator L. This template problem has many applications, for instance, in image processing and machine learning. First, we propose a new primal-dual algorithm, which we call PDDY,… ▽ More

    Submitted 26 July, 2022; v1 submitted 3 April, 2020; originally announced April 2020.

  15. arXiv:2004.01442  [pdf, other

    cs.LG math.OC stat.ML

    From Local SGD to Local Fixed-Point Methods for Federated Learning

    Authors: Grigory Malinovsky, Dmitry Kovalev, Elnur Gasanov, Laurent Condat, Peter Richtárik

    Abstract: Most algorithms for solving optimization problems or finding saddle points of convex-concave functions are fixed-point algorithms. In this work we consider the generic problem of finding a fixed point of an average of operators, or an approximation thereof, in a distributed setting. Our work is motivated by the needs of federated learning. In this context, each local operator models the computatio… ▽ More

    Submitted 16 June, 2020; v1 submitted 3 April, 2020; originally announced April 2020.

    Comments: Accepted to ICML 2020

  16. arXiv:1504.05854  [pdf, ps, other

    cs.LG math.NA math.OC

    On-the-fly Approximation of Multivariate Total Variation Minimization

    Authors: Jordan Frecon, Nelly Pustelnik, Patrice Abry, Laurent Condat

    Abstract: In the context of change-point detection, addressed by Total Variation minimization strategies, an efficient on-the-fly algorithm has been designed leading to exact solutions for univariate data. In this contribution, an extension of such an on-the-fly strategy to multivariate data is investigated. The proposed algorithm relies on the local validation of the Karush-Kuhn-Tucker conditions on the du… ▽ More

    Submitted 28 August, 2016; v1 submitted 22 April, 2015; originally announced April 2015.