Skip to main content

Showing 1–14 of 14 results for author: Reisizadeh, A

.
  1. arXiv:2405.00965  [pdf, other

    cs.LG cs.DC

    Robust Decentralized Learning with Local Updates and Gradient Tracking

    Authors: Sajjad Ghiasvand, Amirhossein Reisizadeh, Mahnoosh Alizadeh, Ramtin Pedarsani

    Abstract: As distributed learning applications such as Federated Learning, the Internet of Things (IoT), and Edge Computing grow, it is critical to address the shortcomings of such technologies from a theoretical perspective. As an abstraction, we consider decentralized learning over a network of communicating clients or nodes and tackle two major challenges: data heterogeneity and adversarial robustness. W… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  2. arXiv:2308.11518  [pdf, ps, other

    cs.LG stat.ML

    EM for Mixture of Linear Regression with Clustered Data

    Authors: Amirhossein Reisizadeh, Khashayar Gatmiry, Asuman Ozdaglar

    Abstract: Modern data-driven and distributed learning frameworks deal with diverse massive data generated by clients spread across heterogeneous environments. Indeed, data heterogeneity is a major bottleneck in scaling up many distributed learning paradigms. In many settings however, heterogeneous data may be generated in clusters with shared structures, as is the case in several applications such as federa… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  3. arXiv:2303.00883  [pdf, other

    cs.LG math.OC stat.ML

    Variance-reduced Clip** for Non-convex Optimization

    Authors: Amirhossein Reisizadeh, Haochuan Li, Subhro Das, Ali Jadbabaie

    Abstract: Gradient clip** is a standard training technique used in deep learning applications such as large-scale language modeling to mitigate exploding gradients. Recent experimental studies have demonstrated a fairly special behavior in the smoothness of the training objective along its trajectory when trained with gradient clip**. That is, the smoothness grows with the gradient norm. This is in clea… ▽ More

    Submitted 2 June, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

  4. arXiv:2206.08257  [pdf, other

    cs.LG math.OC

    Gradient Descent for Low-Rank Functions

    Authors: Romain Cosson, Ali Jadbabaie, Anuran Makur, Amirhossein Reisizadeh, Devavrat Shah

    Abstract: Several recent empirical studies demonstrate that important machine learning tasks, e.g., training deep neural networks, exhibit low-rank structure, where the loss function varies significantly in only a few directions of the input space. In this paper, we leverage such low-rank structure to reduce the high computational cost of canonical gradient-based methods such as gradient descent (GD). Our p… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: 26 pages, 2 figures

  5. arXiv:2206.02468  [pdf, ps, other

    cs.LG cs.AI stat.ML

    An Optimal Transport Approach to Personalized Federated Learning

    Authors: Farzan Farnia, Amirhossein Reisizadeh, Ramtin Pedarsani, Ali Jadbabaie

    Abstract: Federated learning is a distributed machine learning paradigm, which aims to train a model using the local data of many distributed clients. A key challenge in federated learning is that the data samples across the clients may not be identically distributed. To address this challenge, personalized federated learning with the goal of tailoring the learned model to the data distribution of every ind… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

  6. arXiv:2012.14453  [pdf, other

    cs.LG cs.DC stat.ML

    Straggler-Resilient Federated Learning: Leveraging the Interplay Between Statistical Accuracy and System Heterogeneity

    Authors: Amirhossein Reisizadeh, Isidoros Tziotis, Hamed Hassani, Aryan Mokhtari, Ramtin Pedarsani

    Abstract: Federated Learning is a novel paradigm that involves learning from data samples distributed across a large network of clients while the data remains local. It is, however, known that federated learning is prone to multiple system challenges including system heterogeneity where clients have different computation and communication capabilities. Such heterogeneity in clients' computation speeds has a… ▽ More

    Submitted 28 December, 2020; originally announced December 2020.

  7. arXiv:2006.08907  [pdf, other

    cs.LG math.OC stat.ML

    Robust Federated Learning: The Case of Affine Distribution Shifts

    Authors: Amirhossein Reisizadeh, Farzan Farnia, Ramtin Pedarsani, Ali Jadbabaie

    Abstract: Federated learning is a distributed paradigm that aims at training models using samples distributed across multiple users in a network while kee** the samples on users' devices with the aim of efficiency and protecting users privacy. In such settings, the training data is often statistically heterogeneous and manifests various distribution shifts across users, which degrades the performance of t… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

  8. arXiv:1909.13014  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    FedPAQ: A Communication-Efficient Federated Learning Method with Periodic Averaging and Quantization

    Authors: Amirhossein Reisizadeh, Aryan Mokhtari, Hamed Hassani, Ali Jadbabaie, Ramtin Pedarsani

    Abstract: Federated learning is a distributed framework according to which a model is trained over a set of devices, while kee** data localized. This framework faces several systems-oriented challenges which include (i) communication bottleneck since a large number of devices upload their local updates to a parameter server, and (ii) scalability as the federated network consists of millions of devices. Du… ▽ More

    Submitted 7 June, 2020; v1 submitted 27 September, 2019; originally announced September 2019.

  9. arXiv:1907.10595  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    Robust and Communication-Efficient Collaborative Learning

    Authors: Amirhossein Reisizadeh, Hossein Taheri, Aryan Mokhtari, Hamed Hassani, Ramtin Pedarsani

    Abstract: We consider a decentralized learning problem, where a set of computing nodes aim at solving a non-convex optimization problem collaboratively. It is well-known that decentralized optimization schemes face two major system bottlenecks: stragglers' delay and communication overhead. In this paper, we tackle these bottlenecks by proposing a novel decentralized and gradient-based optimization algorithm… ▽ More

    Submitted 31 October, 2019; v1 submitted 24 July, 2019; originally announced July 2019.

  10. arXiv:1902.01981  [pdf, other

    stat.ML cs.DC cs.IT cs.LG stat.CO

    CodedReduce: A Fast and Robust Framework for Gradient Aggregation in Distributed Learning

    Authors: Amirhossein Reisizadeh, Saurav Prakash, Ramtin Pedarsani, Amir Salman Avestimehr

    Abstract: We focus on the commonly used synchronous Gradient Descent paradigm for large-scale distributed learning, for which there has been a growing interest to develop efficient and robust gradient aggregation strategies that overcome two key system bottlenecks: communication bandwidth and stragglers' delays. In particular, Ring-AllReduce (RAR) design has been proposed to avoid bandwidth bottleneck at an… ▽ More

    Submitted 29 September, 2021; v1 submitted 5 February, 2019; originally announced February 2019.

    Comments: Final version to appear in IEEE Transactions on Networking

  11. arXiv:1806.11536  [pdf, other

    cs.LG cs.DC math.OC stat.ML

    An Exact Quantized Decentralized Gradient Descent Algorithm

    Authors: Amirhossein Reisizadeh, Aryan Mokhtari, Hamed Hassani, Ramtin Pedarsani

    Abstract: We consider the problem of decentralized consensus optimization, where the sum of $n$ smooth and strongly convex functions are minimized over $n$ distributed agents that form a connected network. In particular, we consider the case that the communicated local decision variables among nodes are quantized in order to alleviate the communication bottleneck in distributed optimization. We propose the… ▽ More

    Submitted 1 August, 2019; v1 submitted 29 June, 2018; originally announced June 2018.

  12. Coded Computing for Distributed Graph Analytics

    Authors: Saurav Prakash, Amirhossein Reisizadeh, Ramtin Pedarsani, Amir Salman Avestimehr

    Abstract: Performance of distributed graph processing systems significantly suffers from 'communication bottleneck' as a large number of messages are exchanged among servers at each step of the computation. Motivated by graph based MapReduce, we propose a coded computing framework that leverages computation redundancy to alleviate the communication bottleneck in distributed graph processing. We develop a no… ▽ More

    Submitted 9 June, 2020; v1 submitted 16 January, 2018; originally announced January 2018.

    Comments: Accepted for publication in the IEEE Transactions on Information Theory

  13. arXiv:1707.00040  [pdf, other

    cs.IT

    Latency Analysis of Coded Computation Schemes over Wireless Networks

    Authors: Amirhossein Reisizadeh, Ramtin Pedarsani

    Abstract: Large-scale distributed computing systems face two major bottlenecks that limit their scalability: straggler delay caused by the variability of computation times at different worker nodes and communication bottlenecks caused by shuffling data across many nodes in the network. Recently, it has been shown that codes can provide significant gains in overcoming these bottlenecks. In particular, optima… ▽ More

    Submitted 30 June, 2017; originally announced July 2017.

  14. arXiv:1701.05973  [pdf, ps, other

    cs.DC cs.IT

    Coded Computation over Heterogeneous Clusters

    Authors: Amirhossein Reisizadeh, Saurav Prakash, Ramtin Pedarsani, Amir Salman Avestimehr

    Abstract: In large-scale distributed computing clusters, such as Amazon EC2, there are several types of "system noise" that can result in major degradation of performance: bottlenecks due to limited communication bandwidth, latency due to straggler nodes, etc. On the other hand, these systems enjoy abundance of redundancy - a vast number of computing nodes and large storage capacity. There have been recent… ▽ More

    Submitted 19 June, 2019; v1 submitted 20 January, 2017; originally announced January 2017.

    Comments: This work is published in IEEE Transaction on Information Theory (2019). A preliminary version of this work was published in IEEE International Symposium on Information Theory (ISIT) 2017