Skip to main content

Showing 1–6 of 6 results for author: Jhunjhunwala, D

.
  1. arXiv:2403.12329  [pdf, other

    cs.LG cs.DC stat.ML

    FedFisher: Leveraging Fisher Information for One-Shot Federated Learning

    Authors: Divyansh Jhunjhunwala, Shiqiang Wang, Gauri Joshi

    Abstract: Standard federated learning (FL) algorithms typically require multiple rounds of communication between the server and the clients, which has several drawbacks, including requiring constant network connectivity, repeated investment of computational resources, and susceptibility to privacy attacks. One-Shot FL is a new paradigm that aims to address this challenge by enabling the server to train a gl… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted to AISTATS 2024

  2. arXiv:2301.09604  [pdf, other

    cs.LG

    FedExP: Speeding Up Federated Averaging via Extrapolation

    Authors: Divyansh Jhunjhunwala, Shiqiang Wang, Gauri Joshi

    Abstract: Federated Averaging (FedAvg) remains the most popular algorithm for Federated Learning (FL) optimization due to its simple implementation, stateless nature, and privacy guarantees combined with secure aggregation. Recent work has sought to generalize the vanilla averaging in FedAvg to a generalized gradient descent step by treating client updates as pseudo-gradients and using a server step size. W… ▽ More

    Submitted 6 March, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Comments: Accepted to ICLR 2023. V2 fixes minor typos and cleans up proofs

  3. arXiv:2207.14130  [pdf, other

    cs.LG

    FedVARP: Tackling the Variance Due to Partial Client Participation in Federated Learning

    Authors: Divyansh Jhunjhunwala, Pranay Sharma, Aushim Nagarkatti, Gauri Joshi

    Abstract: Data-heterogeneous federated learning (FL) systems suffer from two significant sources of convergence error: 1) client drift error caused by performing multiple local optimization steps at clients, and 2) partial client participation error caused by the fact that only a small subset of the edge clients participate in every training round. We find that among these, only the former has received sign… ▽ More

    Submitted 28 July, 2022; originally announced July 2022.

    Comments: Accepted to UAI 2022

  4. arXiv:2205.14840  [pdf, other

    cs.LG

    Maximizing Global Model Appeal in Federated Learning

    Authors: Yae Jee Cho, Divyansh Jhunjhunwala, Tian Li, Virginia Smith, Gauri Joshi

    Abstract: Federated learning typically considers collaboratively training a global model using local data at edge clients. Clients may have their own individual requirements, such as having a minimal training loss threshold, which they expect to be met by the global model. However, due to client heterogeneity, the global model may not meet each client's requirements, and only a small subset may find the glo… ▽ More

    Submitted 4 February, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

  5. arXiv:2110.07751  [pdf, other

    cs.LG stat.ML

    Leveraging Spatial and Temporal Correlations in Sparsified Mean Estimation

    Authors: Divyansh Jhunjhunwala, Ankur Mallick, Advait Gadhikar, Swanand Kadhe, Gauri Joshi

    Abstract: We study the problem of estimating at a central server the mean of a set of vectors distributed across several nodes (one vector per node). When the vectors are high-dimensional, the communication cost of sending entire vectors may be prohibitive, and it may be imperative for them to use sparsification techniques. While most existing work on sparsified mean estimation is agnostic to the characteri… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: Accepted to NeurIPS 2021

  6. arXiv:2102.04487  [pdf, other

    cs.LG cs.DC stat.ML

    Adaptive Quantization of Model Updates for Communication-Efficient Federated Learning

    Authors: Divyansh Jhunjhunwala, Advait Gadhikar, Gauri Joshi, Yonina C. Eldar

    Abstract: Communication of model updates between client nodes and the central aggregating server is a major bottleneck in federated learning, especially in bandwidth-limited settings and high-dimensional models. Gradient quantization is an effective way of reducing the number of bits required to communicate each model update, albeit at the cost of having a higher error floor due to the higher variance of th… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

    Comments: Accepted to ICASSP 2021