Skip to main content

Showing 1–50 of 79 results for author: Mahdavi, M

.
  1. arXiv:2405.03792  [pdf, ps, other

    cs.DS

    Prize-Collecting Steiner Tree: A 1.79 Approximation

    Authors: Ali Ahmadi, Iman Gholami, MohammadTaghi Hajiaghayi, Peyman Jabbarzade, Mohammad Mahdavi

    Abstract: Prize-Collecting Steiner Tree (PCST) is a generalization of the Steiner Tree problem, a fundamental problem in computer science. In the classic Steiner Tree problem, we aim to connect a set of vertices known as terminals using the minimum-weight tree in a given weighted graph. In this generalized version, each vertex has a penalty, and there is flexibility to decide whether to connect each vertex… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  2. arXiv:2403.06871  [pdf, other

    cs.LG stat.ML

    On the Generalization Ability of Unsupervised Pretraining

    Authors: Yuyang Deng, Junyuan Hong, Jiayu Zhou, Mehrdad Mahdavi

    Abstract: Recent advances in unsupervised learning have shown that unsupervised pre-training, followed by fine-tuning, can improve model generalization. However, a rigorous understanding of how the representation function learned on an unlabeled dataset affects the generalization of the fine-tuned model is lacking. Existing theoretical research does not adequately account for the heterogeneity of the distri… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  3. arXiv:2402.16387  [pdf, other

    cs.LG cs.AI

    On the Generalization Capability of Temporal Graph Learning Algorithms: Theoretical Insights and a Simpler Method

    Authors: Weilin Cong, Jian Kang, Hanghang Tong, Mehrdad Mahdavi

    Abstract: Temporal Graph Learning (TGL) has become a prevalent technique across diverse real-world applications, especially in domains where data can be represented as a graph and evolves over time. Although TGL has recently seen notable progress in algorithmic solutions, its theoretical foundations remain largely unexplored. This paper aims at bridging this gap by investigating the generalization ability o… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  4. arXiv:2310.17761  [pdf, other

    cs.LG

    Distributed Personalized Empirical Risk Minimization

    Authors: Yuyang Deng, Mohammad Mahdi Kamani, Pouria Mahdavinia, Mehrdad Mahdavi

    Abstract: This paper advocates a new paradigm Personalized Empirical Risk Minimization (PERM) to facilitate learning from heterogeneous data sources without imposing stringent constraints on computational resources shared by participating devices. In PERM, we aim to learn a distinct model for each client by learning who to learn with and personalizing the aggregation of local empirical losses by effectively… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

  5. arXiv:2310.11445  [pdf, ps, other

    quant-ph cs.LG math.OC

    Stochastic Quantum Sampling for Non-Logconcave Distributions and Estimating Partition Functions

    Authors: Guneykan Ozgul, Xiantao Li, Mehrdad Mahdavi, Chunhao Wang

    Abstract: We present quantum algorithms for sampling from non-logconcave probability distributions in the form of $π(x) \propto \exp(-βf(x))$. Here, $f$ can be written as a finite sum $f(x):= \frac{1}{N}\sum_{k=1}^N f_k(x)$. Our approach is based on quantum simulated annealing on slowly varying Markov chains derived from unadjusted Langevin algorithms, removing the necessity for function evaluations which c… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 32 pages

  6. arXiv:2310.04884  [pdf, ps, other

    cs.GT cs.LG

    Regret Analysis of Repeated Delegated Choice

    Authors: MohammadTaghi Hajiaghayi, Mohammad Mahdavi, Keivan Rezaei, Suho Shin

    Abstract: We present a study on a repeated delegated choice problem, which is the first to consider an online learning variant of Kleinberg and Kleinberg, EC'18. In this model, a principal interacts repeatedly with an agent who possesses an exogenous set of solutions to search for efficient ones. Each solution can yield varying utility for both the principal and the agent, and the agent may propose a soluti… ▽ More

    Submitted 13 February, 2024; v1 submitted 7 October, 2023; originally announced October 2023.

  7. arXiv:2309.13016  [pdf, other

    cs.LG cs.CR

    Understanding Deep Gradient Leakage via Inversion Influence Functions

    Authors: Haobo Zhang, Junyuan Hong, Yuyang Deng, Mehrdad Mahdavi, Jiayu Zhou

    Abstract: Deep Gradient Leakage (DGL) is a highly effective attack that recovers private training images from gradient vectors. This attack casts significant privacy challenges on distributed learning from clients with sensitive data, where clients are required to share gradients. Defending against such attacks requires but lacks an understanding of when and how privacy leakage happens, mostly because of th… ▽ More

    Submitted 8 January, 2024; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: 24 pages, 18 figures, accepted to NeurIPS2023

  8. arXiv:2309.10736  [pdf, other

    cs.LG

    Mixture Weight Estimation and Model Prediction in Multi-source Multi-target Domain Adaptation

    Authors: Yuyang Deng, Ilja Kuzborskij, Mehrdad Mahdavi

    Abstract: We consider the problem of learning a model from multiple heterogeneous sources with the goal of performing well on a new target distribution. The goal of learner is to mix these data sources in a target-distribution aware way and simultaneously minimize the empirical risk on the mixed source. The literature has made some tangible advancements in establishing theory of learning on mixture domain.… ▽ More

    Submitted 12 November, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

  9. arXiv:2309.05172  [pdf, ps, other

    cs.DS

    2-Approximation for Prize-Collecting Steiner Forest

    Authors: Ali Ahmadi, Iman Gholami, MohammadTaghi Hajiaghayi, Peyman Jabbarzade, Mohammad Mahdavi

    Abstract: Approximation algorithms for the prize-collecting Steiner forest problem (PCSF) have been a subject of research for over three decades, starting with the seminal works of Agrawal, Klein, and Ravi and Goemans and Williamson on Steiner forest and prize-collecting problems. In this paper, we propose and analyze a natural deterministic algorithm for PCSF that achieves a $2$-approximate solution in pol… ▽ More

    Submitted 6 May, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

  10. arXiv:2308.07970  [pdf

    cs.CR cs.MM

    Introducing a New Evaluation Criteria for EMD-Base Steganography Method

    Authors: Hanieh Rafiee, Mojtaba Mahdavi, AhmadReza NaghshNilchi

    Abstract: Steganography is a technique to hide the presence of secret communication. When one of the communication elements is under the influence of the enemy, it can be used. The main measure to evaluate steganography methods in a certain capacity is security. Therefore, in a certain capacity, reducing the amount of changes in the cover media, creates a higher embedding efficiency and thus more security o… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

  11. arXiv:2307.12229  [pdf, other

    cs.CV cs.LG

    EchoGLAD: Hierarchical Graph Neural Networks for Left Ventricle Landmark Detection on Echocardiograms

    Authors: Masoud Mokhtari, Mobina Mahdavi, Hooman Vaseli, Christina Luong, Purang Abolmaesumi, Teresa S. M. Tsang, Renjie Liao

    Abstract: The functional assessment of the left ventricle chamber of the heart requires detecting four landmark locations and measuring the internal dimension of the left ventricle and the approximate mass of the surrounding muscle. The key challenge of automating this task with machine learning is the sparsity of clinical labels, i.e., only a few landmark pixels in a high-dimensional image are annotated, l… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

    Comments: To be published in MICCAI 2023

  12. arXiv:2302.12351  [pdf, other

    cs.LG stat.ML

    On the Hardness of Robustness Transfer: A Perspective from Rademacher Complexity over Symmetric Difference Hypothesis Space

    Authors: Yuyang Deng, Nidham Gazagnadou, Junyuan Hong, Mehrdad Mahdavi, Lingjuan Lyu

    Abstract: Recent studies demonstrated that the adversarially robust learning under $\ell_\infty$ attack is harder to generalize to different domains than standard domain adaptation. How to transfer robustness across different domains has been a key question in domain adaptation field. To investigate the fundamental difficulty behind adversarially robust domain adaptation (or robustness transfer), we propose… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

  13. arXiv:2302.11636  [pdf, other

    cs.LG cs.AI

    Do We Really Need Complicated Model Architectures For Temporal Networks?

    Authors: Weilin Cong, Si Zhang, Jian Kang, Baichuan Yuan, Hao Wu, Xin Zhou, Hanghang Tong, Mehrdad Mahdavi

    Abstract: Recurrent neural network (RNN) and self-attention mechanism (SAM) are the de facto methods to extract spatial-temporal information for temporal graph learning. Interestingly, we found that although both RNN and SAM could lead to a good performance, in practice neither of them is always necessary. In this paper, we propose GraphMixer, a conceptually and technically simple architecture that consists… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

  14. arXiv:2302.08990  [pdf, other

    cs.LG cs.AI

    Efficiently Forgetting What You Have Learned in Graph Representation Learning via Projection

    Authors: Weilin Cong, Mehrdad Mahdavi

    Abstract: As privacy protection receives much attention, unlearning the effect of a specific node from a pre-trained graph learning model has become equally important. However, due to the node dependency in the graph-structured data, representation unlearning in Graph Neural Networks (GNNs) is challenging and less well explored. In this paper, we fill in this gap by first studying the unlearning problem in… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

  15. arXiv:2210.09382  [pdf, other

    cs.LG math.OC stat.ML

    Tight Analysis of Extra-gradient and Optimistic Gradient Methods For Nonconvex Minimax Problems

    Authors: Pouria Mahdavinia, Yuyang Deng, Haochuan Li, Mehrdad Mahdavi

    Abstract: Despite the established convergence theory of Optimistic Gradient Descent Ascent (OGDA) and Extragradient (EG) methods for the convex-concave minimax problems, little is known about the theoretical guarantees of these methods in nonconvex settings. To bridge this gap, for the first time, this paper establishes the convergence of OGDA and EG methods under the nonconvex-strongly-concave (NC-SC) and… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

  16. A Novel Hybrid Backscatter and Conventional Algorithm for Multi-Hop IoT Networks

    Authors: Mahmoud Raeisi, Mehdi Mahdavi, Ali Mohammad Doost Hosseini

    Abstract: This paper investigates a multi-hop cognitive radio network in terms of end-to-end bit delivery. The network exploits backscatter communication (BackCom) and harvest-then-transmit (HTT) mode in a hybrid manner. Such a network can be used in internet of things (IoT) applications in which IoT users coexist with a primary network (PN) and use the primary spectrum to transmit data in both BackCom and… ▽ More

    Submitted 1 May, 2022; originally announced May 2022.

    Journal ref: Transactions on Emerging Telecommunications Technologies, Volume 33, Issue 12, 2022, 19

  17. arXiv:2203.09607  [pdf, other

    cs.LG stat.ML

    Learning Distributionally Robust Models at Scale via Composite Optimization

    Authors: Farzin Haddadpour, Mohammad Mahdi Kamani, Mehrdad Mahdavi, Amin Karbasi

    Abstract: To train machine learning models that are robust to distribution shifts in the data, distributionally robust optimization (DRO) has been proven very effective. However, the existing approaches to learning a distributionally robust model either require solving complex optimization problems such as semidefinite programming or a first-order method whose convergence scales linearly with the number of… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: Accepted to ICLR2022 as a conference paper. International Conference on Learning Representations (2022)

  18. arXiv:2111.10447  [pdf, other

    cs.LG

    DyFormer: A Scalable Dynamic Graph Transformer with Provable Benefits on Generalization Ability

    Authors: Weilin Cong, Yanhong Wu, Yuandong Tian, Mengting Gu, Yinglong Xia, Chun-cheng Jason Chen, Mehrdad Mahdavi

    Abstract: Transformers have achieved great success in several domains, including Natural Language Processing and Computer Vision. However, its application to real-world graphs is less explored, mainly due to its high computation cost and its poor generalizability caused by the lack of enough training data in the graph domain. To fill in this gap, we propose a scalable Transformer-like dynamic graph learning… ▽ More

    Submitted 29 January, 2023; v1 submitted 19 November, 2021; originally announced November 2021.

  19. arXiv:2111.08202  [pdf, other

    cs.LG

    Learn Locally, Correct Globally: A Distributed Algorithm for Training Graph Neural Networks

    Authors: Morteza Ramezani, Weilin Cong, Mehrdad Mahdavi, Mahmut T. Kandemir, Anand Sivasubramaniam

    Abstract: Despite the recent success of Graph Neural Networks (GNNs), training GNNs on large graphs remains challenging. The limited resource capacities of the existing servers, the dependency between nodes in a graph, and the privacy concern due to the centralized storage and model learning have spurred the need to design an effective distributed algorithm for GNN training. However, existing distributed GN… ▽ More

    Submitted 13 March, 2022; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: The Tenth International Conference on Learning Representations (ICLR 2022)

  20. arXiv:2110.15174  [pdf, other

    cs.LG

    On Provable Benefits of Depth in Training Graph Convolutional Networks

    Authors: Weilin Cong, Morteza Ramezani, Mehrdad Mahdavi

    Abstract: Graph Convolutional Networks (GCNs) are known to suffer from performance degradation as the number of layers increases, which is usually attributed to over-smoothing. Despite the apparent consensus, we observe that there exists a discrepancy between the theoretical understanding of over-smoothing and the practical capabilities of GCNs. Specifically, we argue that over-smoothing does not necessaril… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

  21. arXiv:2110.14057  [pdf, other

    cs.LG

    Meta-learning with an Adaptive Task Scheduler

    Authors: Huaxiu Yao, Yu Wang, Ying Wei, Peilin Zhao, Mehrdad Mahdavi, Defu Lian, Chelsea Finn

    Abstract: To benefit the learning of a new task, meta-learning has been proposed to transfer a well-generalized meta-model learned from various meta-training tasks. Existing meta-learning algorithms randomly sample meta-training tasks with a uniform probability, under the assumption that tasks are of equal importance. However, it is likely that tasks are detrimental with noise or imbalanced given a limited… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted by NeurIPS 2021

  22. arXiv:2107.10868  [pdf, other

    cs.LG math.OC

    Local SGD Optimizes Overparameterized Neural Networks in Polynomial Time

    Authors: Yuyang Deng, Mohammad Mahdi Kamani, Mehrdad Mahdavi

    Abstract: In this paper we prove that Local (S)GD (or FedAvg) can optimize deep neural networks with Rectified Linear Unit (ReLU) activation function in polynomial time. Despite the established convergence theory of Local SGD on optimizing general smooth functions in communication-efficient distributed optimization, its convergence on non-smooth ReLU networks still eludes full theoretical understanding. The… ▽ More

    Submitted 22 February, 2022; v1 submitted 22 July, 2021; originally announced July 2021.

  23. arXiv:2106.14150  [pdf

    cs.CV cs.MM

    Image content dependent semi-fragile watermarking with localized tamper detection

    Authors: Samira Hosseini, Mojtaba Mahdavi

    Abstract: Content-independent watermarks and block-wise independency can be considered as vulnerabilities in semi-fragile watermarking methods. In this paper to achieve the objectives of semi-fragile watermarking techniques, a method is proposed to not have the mentioned shortcomings. In the proposed method, the watermark is generated by relying on image content and a key. Furthermore, the embedding scheme… ▽ More

    Submitted 27 June, 2021; originally announced June 2021.

    Comments: 32 pages, 11 figures, 5 tables

  24. arXiv:2104.01634  [pdf, other

    cs.LG cs.AI stat.ML

    Pareto Efficient Fairness in Supervised Learning: From Extraction to Tracing

    Authors: Mohammad Mahdi Kamani, Rana Forsati, James Z. Wang, Mehrdad Mahdavi

    Abstract: As algorithmic decision-making systems are becoming more pervasive, it is crucial to ensure such systems do not become mechanisms of unfair discrimination on the basis of gender, race, ethnicity, religion, etc. Moreover, due to the inherent trade-off between fairness measures and accuracy, it is desirable to learn fairness-enhanced models without significantly compromising the accuracy. In this pa… ▽ More

    Submitted 4 April, 2021; originally announced April 2021.

  25. arXiv:2103.02696  [pdf, other

    cs.LG cs.AI cs.CV

    On the Importance of Sampling in Training GCNs: Tighter Analysis and Variance Reduction

    Authors: Weilin Cong, Morteza Ramezani, Mehrdad Mahdavi

    Abstract: Graph Convolutional Networks (GCNs) have achieved impressive empirical advancement across a wide variety of semi-supervised node classification tasks. Despite their great success, training GCNs on large graphs suffers from computational and memory issues. A potential path to circumvent these obstacles is sampling-based methods, where at each layer a subset of nodes is sampled. Although recent stud… ▽ More

    Submitted 1 November, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

  26. arXiv:2102.13152  [pdf, other

    cs.LG math.OC

    Local Stochastic Gradient Descent Ascent: Convergence Analysis and Communication Efficiency

    Authors: Yuyang Deng, Mehrdad Mahdavi

    Abstract: Local SGD is a promising approach to overcome the communication overhead in distributed learning by reducing the synchronization frequency among worker nodes. Despite the recent theoretical advances of local SGD in empirical risk minimization, the efficiency of its counterpart in minimax optimization remains unexplored. Motivated by large scale minimax learning problems, such as adversarial robust… ▽ More

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: This paper has been accepted to AISTATS 2021

  27. arXiv:2102.12660  [pdf, other

    cs.LG cs.DC stat.ML

    Distributionally Robust Federated Averaging

    Authors: Yuyang Deng, Mohammad Mahdi Kamani, Mehrdad Mahdavi

    Abstract: In this paper, we study communication efficient distributed algorithms for distributionally robust federated learning via periodic averaging with adaptive sampling. In contrast to standard empirical risk minimization, due to the minimax structure of the underlying optimization problem, a key difficulty arises from the fact that the global parameter that controls the mixture of local losses can onl… ▽ More

    Submitted 24 February, 2021; originally announced February 2021.

    Comments: Published in NeurIPS 2020: https://proceedings.neurips.cc/paper/2020/hash/ac450d10e166657ec8f93a1b65ca1b14-Abstract.html

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS), Vol. 33, 2020

  28. arXiv:2102.04282  [pdf, other

    cs.LG

    Communication-efficient k-Means for Edge-based Machine Learning

    Authors: Hanlin Lu, Ting He, Shiqiang Wang, Changchang Liu, Mehrdad Mahdavi, Vijaykrishnan Narayanan, Kevin S. Chan, Stephen Pasteris

    Abstract: We consider the problem of computing the k-means centers for a large high-dimensional dataset in the context of edge-based machine learning, where data sources offload machine learning computation to nearby edge servers. k-Means computation is fundamental to many data analytics, and the capability of computing provably accurate k-means centers by leveraging the computation power of the edge server… ▽ More

    Submitted 21 January, 2022; v1 submitted 8 February, 2021; originally announced February 2021.

  29. arXiv:2101.03103  [pdf, other

    cs.CR

    Blockchain for steganography: advantages, new algorithms and open challenges

    Authors: Omid Torki, Maede Ashouri-Talouki, Mojtaba Mahdavi

    Abstract: Steganography is a solution for covert communication and blockchain is a p2p network for data transmission, so the benefits of blockchain can be used in steganography. In this paper, we discuss the advantages of blockchain in steganography, which include the ability to embed hidden data without manual change in the original data, as well as the readiness of the blockchain platform for data transmi… ▽ More

    Submitted 8 January, 2021; originally announced January 2021.

  30. arXiv:2010.11545  [pdf, other

    cs.LG

    Online Structured Meta-learning

    Authors: Huaxiu Yao, Yingbo Zhou, Mehrdad Mahdavi, Zhenhui Li, Richard Socher, Caiming Xiong

    Abstract: Learning quickly is of great importance for machine intelligence deployed in online platforms. With the capability of transferring knowledge from learned tasks, meta-learning has shown its effectiveness in online scenarios by continuously updating the model with the learned prior. However, current online meta-learning algorithms are limited to learn a globally-shared meta-learner, which may lead t… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted by NeurIPS 2020

  31. arXiv:2007.14259  [pdf

    cs.PL cs.LO cs.SE

    Inductive Reachability Witnesses

    Authors: Ali Asadi, Krishnendu Chatterjee, Hongfei Fu, Amir Kafshdar Goharshady, Mohammad Mahdavi

    Abstract: In this work, we consider the fundamental problem of reachability analysis over imperative programs with real variables. The reachability property requires that a program can reach certain target states during its execution. Previous works that tackle reachability analysis are either unable to handle programs consisting of general loops (e.g. symbolic execution), or lack completeness guarantees (e… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

  32. arXiv:2007.09758  [pdf, other

    eess.IV cs.CV cs.MM

    Full Quaternion Representation of Color images: A Case Study on QSVD-based Color Image Compression

    Authors: Alireza Parchami, Mojtaba Mahdavi

    Abstract: For many years, channels of a color image have been processed individually, or the image has been converted to grayscale one with respect to color image processing. Pure quaternion representation of color images solves this issue as it allows images to be processed in a holistic space. Nevertheless, it brings additional costs due to the extra fourth dimension. In this paper, we propose an approach… ▽ More

    Submitted 19 July, 2020; originally announced July 2020.

    Comments: 15 pages, 16 figures, 1 table, submitted to Signal Processing journal

  33. arXiv:2007.01154  [pdf, other

    cs.LG cs.DC stat.ML

    Federated Learning with Compression: Unified Analysis and Sharp Guarantees

    Authors: Farzin Haddadpour, Mohammad Mahdi Kamani, Aryan Mokhtari, Mehrdad Mahdavi

    Abstract: In federated learning, communication cost is often a critical bottleneck to scale up distributed optimization algorithms to collaboratively learn a model from millions of devices with potentially unreliable or limited communication and heterogeneous data distributions. Two notable trends to deal with the communication overhead of federated algorithms are gradient compression and local computation… ▽ More

    Submitted 20 November, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: version 2. more experiments and comparisons

  34. arXiv:2006.13866  [pdf, other

    cs.LG stat.ML

    Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

    Authors: Weilin Cong, Rana Forsati, Mahmut Kandemir, Mehrdad Mahdavi

    Abstract: Sampling methods (e.g., node-wise, layer-wise, or subgraph) has become an indispensable strategy to speed up training large-scale Graph Neural Networks (GNNs). However, existing sampling methods are mostly based on the graph structural information and ignore the dynamicity of optimization, which leads to high variance in estimating the stochastic gradients. The high variance issue can be very pron… ▽ More

    Submitted 5 September, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

  35. The Effect of Coupling Memory and Block Length on Spatially Coupled Serially Concatenated Codes

    Authors: Mojtaba Mahdavi, Muhammad Umar Farooq, Liang Liu, Ove Edfors, Viktor Öwall, Michael Lentmaier

    Abstract: Spatially coupled serially concatenated codes (SC-SCCs) are a class of spatially coupled turbo-like codes, which have a close-to-capacity performance and low error floor. In this paper we investigate the impact of coupling memory, block length, decoding window size, and number of iterations on the performance, complexity, and latency of SC-SCCs. Several design tradeoffs are presented to see the re… ▽ More

    Submitted 25 July, 2021; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: Presented at the IEEE 93rd Vehicular Technology Conference (VTC) 2021-Spring

  36. arXiv:2003.13461  [pdf, other

    cs.LG cs.DC stat.ML

    Adaptive Personalized Federated Learning

    Authors: Yuyang Deng, Mohammad Mahdi Kamani, Mehrdad Mahdavi

    Abstract: Investigation of the degree of personalization in federated learning algorithms has shown that only maximizing the performance of the global model will confine the capacity of the local models to personalize. In this paper, we advocate an adaptive personalized federated learning (APFL) algorithm, where each client will train their local models while contributing to the global model. We derive the… ▽ More

    Submitted 5 November, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: [v3] Added convergence analysis for nonconvex losses and additional experiments along with new baselines [v2] A new generalization analysis is provided. Also, additional experiments are added

  37. arXiv:2003.08005  [pdf, other

    cs.CV

    ScanSSD: Scanning Single Shot Detector for Mathematical Formulas in PDF Document Images

    Authors: Parag Mali, Puneeth Kukkadapu, Mahshad Mahdavi, Richard Zanibbi

    Abstract: We introduce the Scanning Single Shot Detector (ScanSSD) for locating math formulas offset from text and embedded in textlines. ScanSSD uses only visual features for detection: no formatting or typesetting information such as layout, font, or character labels are employed. Given a 600 dpi document page image, a Single Shot Detector (SSD) locates formulas at multiple scales using sliding windows, a… ▽ More

    Submitted 17 March, 2020; originally announced March 2020.

    Comments: 8 pages, 7 figures

  38. Manipulation and exchange of Light with Orbital Angular Momentum in Quantum Dot Molecules

    Authors: Mahboubeh Mahdavi, Zahra Amini Sabegh, Mohammad Mohammadi, Hamid Reza Hamedi, Mohammad Mahmoudi

    Abstract: We study the interaction of laser pulses carrying orbital angular momentum (OAM) with structural asymmetry quantum dot molecules characterized by four energy levels. We demonstrate how the inter-dot tunneling endows exchange of optical vortices between different frequencies. We consider a case where a weak probe beam has an optical vortex and thus has a zero intensity at the center. The presence o… ▽ More

    Submitted 17 February, 2020; originally announced February 2020.

    Journal ref: Phys. Rev. A 101, 063811 (2020)

  39. arXiv:2002.01686  [pdf, other

    cs.IT

    Analysis of D2D Communication with RF Energy Harvesting and Interference Management

    Authors: Nasrin Razmi, Mehdi Mahdavi, Mohammadali Mohammadi, Petar Popovski

    Abstract: Device-to-device (D2D) underlaid cellular network, enabled with radio frequency energy harvesting (RFEH), and enhanced interference management schemes is a promising candidate to improve spectral and energy efficiency of next generation wireless networks. In this paper, we propose a time division duplexing (TDD)-based protocol, in which allows the devices to harvest energy from the downlink transm… ▽ More

    Submitted 5 February, 2020; originally announced February 2020.

  40. arXiv:1912.12082  [pdf, other

    cs.CV

    Pointwise Attention-Based Atrous Convolutional Neural Networks

    Authors: Mobina Mahdavi, Fahimeh Fooladgar, Shohreh Kasaei

    Abstract: With the rapid progress of deep convolutional neural networks, in almost all robotic applications, the availability of 3D point clouds improves the accuracy of 3D semantic segmentation methods. Rendering of these irregular, unstructured, and unordered 3D points to 2D images from multiple viewpoints imposes some issues such as loss of information due to 3D to 2D projection, discretizing artifacts,… ▽ More

    Submitted 27 December, 2019; originally announced December 2019.

    Comments: 7 pages, 6 figures. Author one and author two contributed equally

  41. arXiv:1911.04931  [pdf, other

    cs.LG cs.DS math.OC stat.ML

    Efficient Fair Principal Component Analysis

    Authors: Mohammad Mahdi Kamani, Farzin Haddadpour, Rana Forsati, Mehrdad Mahdavi

    Abstract: It has been shown that dimension reduction methods such as PCA may be inherently prone to unfairness and treat data from different sensitive groups such as race, color, sex, etc., unfairly. In pursuit of fairness-enhancing dimensionality reduction, using the notion of Pareto optimality, we propose an adaptive first-order algorithm to learn a subspace that preserves fairness, while slightly comprom… ▽ More

    Submitted 7 March, 2020; v1 submitted 12 November, 2019; originally announced November 2019.

  42. arXiv:1910.14425  [pdf, other

    cs.LG cs.DC stat.ML

    On the Convergence of Local Descent Methods in Federated Learning

    Authors: Farzin Haddadpour, Mehrdad Mahdavi

    Abstract: In federated distributed learning, the goal is to optimize a global training objective defined over distributed devices, where the data shard at each device is sampled from a possibly different distribution (a.k.a., heterogeneous or non i.i.d. data samples). In this paper, we generalize the local stochastic and full gradient descent with periodic averaging-- originally designed for homogeneous dis… ▽ More

    Submitted 6 December, 2019; v1 submitted 31 October, 2019; originally announced October 2019.

    Comments: 47 pages, "Updates from v1: A technical error in Lemma B3 is corrected"

  43. arXiv:1910.13598  [pdf, other

    cs.LG cs.DC stat.ML

    Local SGD with Periodic Averaging: Tighter Analysis and Adaptive Synchronization

    Authors: Farzin Haddadpour, Mohammad Mahdi Kamani, Mehrdad Mahdavi, Viveck R. Cadambe

    Abstract: Communication overhead is one of the key challenges that hinders the scalability of distributed optimization algorithms. In this paper, we study local distributed SGD, where data is partitioned among computation nodes, and the computation nodes perform local updates with periodically exchanging the model among the workers to perform averaging. While local SGD is empirically shown to provide promis… ▽ More

    Submitted 14 May, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: Paper accepted to NeurIPS 2019 - We fixed a flaw in the earlier version regarding the dependency on constants but this change does not affect the communication complexity

  44. arXiv:1908.06309  [pdf, other

    cs.LG cs.DB stat.ML

    ED2: Two-stage Active Learning for Error Detection -- Technical Report

    Authors: Felix Neutatz, Mohammad Mahdavi, Ziawasch Abedjan

    Abstract: Traditional error detection approaches require user-defined parameters and rules. Thus, the user has to know both the error detection system and the data. However, we can also formulate error detection as a semi-supervised classification problem that only requires domain expertise. The challenges for such an approach are twofold: (1) to represent the data in a way that enables a classification mod… ▽ More

    Submitted 17 August, 2019; originally announced August 2019.

  45. arXiv:1712.05904  [pdf

    physics.chem-ph

    Copper-catalyzed efficient synthesis of 5-arylindazolo[3,2-b]quinazolin-7(5H)-ones from 2-nitrobenzaldehydes

    Authors: Zahra ArastehFard, Karim Akbari Dilmaghani, Mehdi Soheilizad, Mohammad Mahdavi

    Abstract: A novel and practical copper-catalyzed approach is developed for the preparation of 5-arylindazolo[3,2-b]quinazolin-7(5H)-ones. The 2-amino-N'-arylbenzohydrazide, which easily prepared by reaction of isatoic anhydride with arylhydrazine, through a condensation/intramolecular cyclization reacted by 2-nitrobenzaldehydes in the present of CuI to afford corresponding 5-arylindazolo[3,2-b]quinazolin-7(… ▽ More

    Submitted 15 December, 2017; originally announced December 2017.

    Comments: 5 pages, 3 schemes, 2 tables

  46. arXiv:1712.04648  [pdf, ps, other

    physics.plasm-ph

    Temperature Equilibration Rate of Quasi-Monoenergetic Deuteron Beam in a Fusion Plasmas

    Authors: M. Mahdavi, R. Azadifar, T. Khoorokhi

    Abstract: Thermal equilibrium rate can play an important role in the energy deposition of beam to the fuel in fast ignition due to high temperature difference between projectile ions and background plasma ions. In this study the temperature equilibration rate of a quasi-monoenergetic deuteron beam with an equimolar Deuterium-Tritium fusion plasma with a Maxwellian energy distribution is calculated by kineti… ▽ More

    Submitted 13 December, 2017; originally announced December 2017.

  47. arXiv:1711.07200  [pdf, ps, other

    physics.plasm-ph

    The electromagnetic instabilities propagation in weak relativistic quantum plasmas

    Authors: M. Mahdavi, H. Khanzadeh

    Abstract: The electromagnetic instabilities excited by the temperature anisotropy have been always one of the interesting issues in real high-density physical systems, where the relativistic and quantum effects due to spin can be important. This paper discusses the case where plasma is not strongly coupled but is still in regimes where a classic plasma description is not fully adequate. The length scale of… ▽ More

    Submitted 20 November, 2017; originally announced November 2017.

  48. arXiv:1705.07256  [pdf, other

    cs.LG cs.IT math.OC stat.ML

    Learning Feature Nonlinearities with Non-Convex Regularized Binned Regression

    Authors: Samet Oymak, Mehrdad Mahdavi, Jiasi Chen

    Abstract: For various applications, the relations between the dependent and independent variables are highly nonlinear. Consequently, for large scale complex problems, neural networks and regression trees are commonly preferred over linear models such as Lasso. This work proposes learning the feature nonlinearities by binning feature values and finding the best fit in each quantile using non-convex regulari… ▽ More

    Submitted 19 May, 2017; originally announced May 2017.

    Comments: 22 pages, 7 figures

  49. arXiv:1610.03045  [pdf, other

    cs.LG math.OC stat.ML

    Sketching Meets Random Projection in the Dual: A Provable Recovery Algorithm for Big and High-dimensional Data

    Authors: Jialei Wang, Jason D. Lee, Mehrdad Mahdavi, Mladen Kolar, Nathan Srebro

    Abstract: Sketching techniques have become popular for scaling up machine learning algorithms by reducing the sample size or dimensionality of massive data sets, while still maintaining the statistical power of big data. In this paper, we study sketching from an optimization point of view: we first show that the iterative Hessian sketch is an optimization process with preconditioning, and develop accelerate… ▽ More

    Submitted 10 October, 2016; originally announced October 2016.

  50. QCD analysis of nucleon structure functions in deep-inelastic neutrino-nucleon scattering: Laplace transform and Jacobi polynomials approach

    Authors: S. Mohammad Moosavi Nejad, Hamzeh Khanpour, S. Atashbar Tehrani, Mahdi Mahdavi

    Abstract: We present a detailed QCD analysis of nucleon structure functions $xF_3 (x, Q^2)$, based on Laplace transforms and Jacobi polynomials approach. The analysis corresponds to the next-to-leading order and next-to-next-to-leading order approximation of perturbative QCD. The Laplace transform technique, as an exact analytical solution, is used for the solution of nonsinglet Dokshitzer-Gribov-Lipatov-Al… ▽ More

    Submitted 4 October, 2016; v1 submitted 17 September, 2016; originally announced September 2016.

    Comments: 14 Pages, 8 Figures, 4 Tables

    Journal ref: Phys. Rev. C 94, 045201 (2016)