Showing 1–12 of 12 results for author: Kamani, M M

  1. arXiv:2310.17761  [pdf, other]

    cs.LG

    Distributed Personalized Empirical Risk Minimization

    Authors: Yuyang Deng, Mohammad Mahdi Kamani, Pouria Mahdavinia, Mehrdad Mahdavi

    Abstract: This paper advocates a new paradigm, Personalized Empirical Risk Minimization (PERM), to facilitate learning from heterogeneous data sources without imposing stringent constraints on computational resources shared by participating devices. In PERM, we aim to learn a distinct model for each client by learning who to learn with and personalizing the aggregation of local empirical losses by effectively…

    Submitted 26 October, 2023; originally announced October 2023.

  2. arXiv:2211.06812  [pdf, other]

    cs.LG cs.DC stat.ML

    FedRule: Federated Rule Recommendation System with Graph Neural Networks

    Authors: Yuhang Yao, Mohammad Mahdi Kamani, Zhongwei Cheng, Lin Chen, Carlee Joe-Wong, Tianqiang Liu

    Abstract: Much of the value that IoT (Internet-of-Things) devices bring to "smart" homes lies in their ability to automatically trigger other devices' actions: for example, a smart camera triggering a smart lock to unlock a door. Manually setting up these rules for smart devices or applications, however, is time-consuming and inefficient. Rule recommendation systems can automatically suggest rules for use…

    Submitted 12 November, 2022; originally announced November 2022.

  3. arXiv:2203.09607  [pdf, other]

    cs.LG stat.ML

    Learning Distributionally Robust Models at Scale via Composite Optimization

    Authors: Farzin Haddadpour, Mohammad Mahdi Kamani, Mehrdad Mahdavi, Amin Karbasi

    Abstract: To train machine learning models that are robust to distribution shifts in the data, distributionally robust optimization (DRO) has been proven very effective. However, the existing approaches to learning a distributionally robust model either require solving complex optimization problems such as semidefinite programming or a first-order method whose convergence scales linearly with the number of…

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: Accepted to ICLR2022 as a conference paper. International Conference on Learning Representations (2022)

  4. arXiv:2110.09674  [pdf, other]

    cs.CV cs.LG

    Adaptive Distillation: Aggregating Knowledge from Multiple Paths for Efficient Distillation

    Authors: Sumanth Chennupati, Mohammad Mahdi Kamani, Zhongwei Cheng, Lin Chen

    Abstract: Knowledge Distillation is becoming one of the primary trends among neural network compression algorithms to improve the generalization performance of a smaller student model with guidance from a larger teacher model. This momentous rise in applications of knowledge distillation is accompanied by the introduction of numerous algorithms for distilling the knowledge such as soft targets and hint laye…

    Submitted 23 October, 2021; v1 submitted 18 October, 2021; originally announced October 2021.

    Comments: Accepted to BMVC 2021 for publication. V2. Added more results for ImageNet-1K

  5. arXiv:2107.10868  [pdf, other]

    cs.LG math.OC

    Local SGD Optimizes Overparameterized Neural Networks in Polynomial Time

    Authors: Yuyang Deng, Mohammad Mahdi Kamani, Mehrdad Mahdavi

    Abstract: In this paper we prove that Local (S)GD (or FedAvg) can optimize deep neural networks with Rectified Linear Unit (ReLU) activation function in polynomial time. Despite the established convergence theory of Local SGD on optimizing general smooth functions in communication-efficient distributed optimization, its convergence on non-smooth ReLU networks still eludes full theoretical understanding. The…

    Submitted 22 February, 2022; v1 submitted 22 July, 2021; originally announced July 2021.

  6. arXiv:2104.01634  [pdf, other]

    cs.LG cs.AI stat.ML

    Pareto Efficient Fairness in Supervised Learning: From Extraction to Tracing

    Authors: Mohammad Mahdi Kamani, Rana Forsati, James Z. Wang, Mehrdad Mahdavi

    Abstract: As algorithmic decision-making systems are becoming more pervasive, it is crucial to ensure such systems do not become mechanisms of unfair discrimination on the basis of gender, race, ethnicity, religion, etc. Moreover, due to the inherent trade-off between fairness measures and accuracy, it is desirable to learn fairness-enhanced models without significantly compromising the accuracy. In this pa…

    Submitted 4 April, 2021; originally announced April 2021.

  7. arXiv:2102.12660  [pdf, other]

    cs.LG cs.DC stat.ML

    Distributionally Robust Federated Averaging

    Authors: Yuyang Deng, Mohammad Mahdi Kamani, Mehrdad Mahdavi

    Abstract: In this paper, we study communication efficient distributed algorithms for distributionally robust federated learning via periodic averaging with adaptive sampling. In contrast to standard empirical risk minimization, due to the minimax structure of the underlying optimization problem, a key difficulty arises from the fact that the global parameter that controls the mixture of local losses can onl…

    Submitted 24 February, 2021; originally announced February 2021.

    Comments: Published in NeurIPS 2020: https://proceedings.neurips.cc/paper/2020/hash/ac450d10e166657ec8f93a1b65ca1b14-Abstract.html

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS), Vol. 33, 2020

  8. arXiv:2007.01154  [pdf, other]

    cs.LG cs.DC stat.ML

    Federated Learning with Compression: Unified Analysis and Sharp Guarantees

    Authors: Farzin Haddadpour, Mohammad Mahdi Kamani, Aryan Mokhtari, Mehrdad Mahdavi

    Abstract: In federated learning, communication cost is often a critical bottleneck to scale up distributed optimization algorithms to collaboratively learn a model from millions of devices with potentially unreliable or limited communication and heterogeneous data distributions. Two notable trends to deal with the communication overhead of federated algorithms are gradient compression and local computation…

    Submitted 20 November, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: version 2. more experiments and comparisons

  9. arXiv:2003.13461  [pdf, other]

    cs.LG cs.DC stat.ML

    Adaptive Personalized Federated Learning

    Authors: Yuyang Deng, Mohammad Mahdi Kamani, Mehrdad Mahdavi

    Abstract: Investigation of the degree of personalization in federated learning algorithms has shown that only maximizing the performance of the global model will confine the capacity of the local models to personalize. In this paper, we advocate an adaptive personalized federated learning (APFL) algorithm, where each client will train their local models while contributing to the global model. We derive the…

    Submitted 5 November, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: [v3] Added convergence analysis for nonconvex losses and additional experiments along with new baselines [v2] A new generalization analysis is provided. Also, additional experiments are added

  10. arXiv:1911.04931  [pdf, other]

    cs.LG cs.DS math.OC stat.ML

    Efficient Fair Principal Component Analysis

    Authors: Mohammad Mahdi Kamani, Farzin Haddadpour, Rana Forsati, Mehrdad Mahdavi

    Abstract: It has been shown that dimension reduction methods such as PCA may be inherently prone to unfairness and treat data from different sensitive groups such as race, color, sex, etc., unfairly. In pursuit of fairness-enhancing dimensionality reduction, using the notion of Pareto optimality, we propose an adaptive first-order algorithm to learn a subspace that preserves fairness, while slightly comprom…

    Submitted 7 March, 2020; v1 submitted 12 November, 2019; originally announced November 2019.

  11. arXiv:1910.13598  [pdf, other]

    cs.LG cs.DC stat.ML

    Local SGD with Periodic Averaging: Tighter Analysis and Adaptive Synchronization

    Authors: Farzin Haddadpour, Mohammad Mahdi Kamani, Mehrdad Mahdavi, Viveck R. Cadambe

    Abstract: Communication overhead is one of the key challenges that hinder the scalability of distributed optimization algorithms. In this paper, we study local distributed SGD, where data is partitioned among computation nodes, and the computation nodes perform local updates while periodically exchanging the model among the workers to perform averaging. While local SGD is empirically shown to provide promis…

    Submitted 14 May, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: Paper accepted to NeurIPS 2019 - We fixed a flaw in the earlier version regarding the dependency on constants but this change does not affect the communication complexity

  12. arXiv:1811.04184  [pdf, other]

    cs.CV cs.AI cs.LG

    CAPTAIN: Comprehensive Composition Assistance for Photo Taking

    Authors: Farshid Farhat, Mohammad Mahdi Kamani, James Z. Wang

    Abstract: Many people are interested in taking astonishing photos and sharing them with others. Emerging high-tech hardware and software facilitate the ubiquity and functionality of digital photography. Because composition matters in photography, researchers have leveraged some common composition techniques to assess the aesthetic quality of photos computationally. However, composition techniques developed by p…

    Submitted 9 November, 2018; originally announced November 2018.

    Comments: 30 pages, 21 figures, 4 tables, submitted to IJCV (International Journal of Computer Vision)