Skip to main content

Showing 1–21 of 21 results for author: Razaviyayn, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2401.15604  [pdf, ps, other

    cs.LG stat.ML

    Neural Network-Based Score Estimation in Diffusion Models: Optimization and Generalization

    Authors: Yinbin Han, Meisam Razaviyayn, Renyuan Xu

    Abstract: Diffusion models have emerged as a powerful tool rivaling GANs in generating high-quality samples with improved fidelity, flexibility, and robustness. A key component of these models is to learn the score function through score matching. Despite empirical success on various tasks, it remains unclear whether gradient-based algorithms can learn the score function with a provable accuracy. As a first… ▽ More

    Submitted 12 March, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

    Comments: 39 pages

  2. arXiv:2309.11682  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    Dr. FERMI: A Stochastic Distributionally Robust Fair Empirical Risk Minimization Framework

    Authors: Sina Baharlouei, Meisam Razaviyayn

    Abstract: While training fair machine learning models has been studied extensively in recent years, most developed methods rely on the assumption that the training and test data have similar distributions. In the presence of distribution shifts, fair models may behave unfairly on test data. There have been some developments for fair learning robust to distribution shifts to address this shortcoming. However… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: 22 pages, 3 figures

  3. arXiv:2306.15056  [pdf, other

    cs.LG cs.CR math.OC stat.ML

    Optimal Differentially Private Model Training with Public Data

    Authors: Andrew Lowy, Zeman Li, Tianjian Huang, Meisam Razaviyayn

    Abstract: Differential privacy (DP) ensures that training a machine learning model does not leak private data. In practice, we may have access to auxiliary public data that is free of privacy concerns. In this work, we assume access to a given amount of public data and settle the following fundamental open questions: 1. What is the optimal (worst-case) error of a DP model trained over a private data set whi… ▽ More

    Submitted 13 February, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: V2 changed the title and added high-dimensional approximate semi-DP lower bounds

  4. arXiv:2303.08431  [pdf, other

    cs.LG math.OC stat.ML

    Policy Gradient Converges to the Globally Optimal Policy for Nearly Linear-Quadratic Regulators

    Authors: Yinbin Han, Meisam Razaviyayn, Renyuan Xu

    Abstract: Nonlinear control systems with partial information to the decision maker are prevalent in a variety of applications. As a step toward studying such nonlinear systems, this work explores reinforcement learning methods for finding the optimal policy in the nearly linear-quadratic regulator systems. In particular, we consider a dynamic system that combines linear and nonlinear components, and is gove… ▽ More

    Submitted 16 February, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 34 pages

  5. arXiv:2209.07403  [pdf, other

    cs.LG cs.CR math.OC stat.ML

    Private Stochastic Optimization With Large Worst-Case Lipschitz Parameter: Optimal Rates for (Non-Smooth) Convex Losses and Extension to Non-Convex Losses

    Authors: Andrew Lowy, Meisam Razaviyayn

    Abstract: We study differentially private (DP) stochastic optimization (SO) with loss functions whose worst-case Lipschitz parameter over all data points may be extremely large. To date, the vast majority of work on DP SO assumes that the loss is uniformly Lipschitz continuous over data (i.e. stochastic gradients are uniformly bounded over all data points). While this assumption is convenient, it often lead… ▽ More

    Submitted 27 October, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: Appeared in the International Conference on Algorithmic Learning Theory (ALT) 2023. This version improves the runtime bound in Theorem 6

  6. arXiv:2106.09779  [pdf, other

    cs.LG cs.CR math.OC stat.ML

    Private Federated Learning Without a Trusted Server: Optimal Algorithms for Convex Losses

    Authors: Andrew Lowy, Meisam Razaviyayn

    Abstract: This paper studies federated learning (FL)--especially cross-silo FL--with data from people who do not trust the server or other silos. In this setting, each silo (e.g. hospital) has data from different people (e.g. patients) and must maintain the privacy of each person's data (e.g. medical record), even if the server or other silos act as adversarial eavesdroppers. This requirement motivates the… ▽ More

    Submitted 14 June, 2023; v1 submitted 17 June, 2021; originally announced June 2021.

    Comments: ICLR 2023

  7. arXiv:2105.05953  [pdf, other

    stat.ML cs.LG math.OC stat.ME

    Efficient Algorithms for Estimating the Parameters of Mixed Linear Regression Models

    Authors: Babak Barazandeh, Ali Ghafelebashi, Meisam Razaviyayn, Ram Sriharsha

    Abstract: Mixed linear regression (MLR) model is among the most exemplary statistical tools for modeling non-linear distributions using a mixture of linear models. When the additive noise in MLR model is Gaussian, Expectation-Maximization (EM) algorithm is a widely-used algorithm for maximum likelihood estimation of MLR parameters. However, when noise is non-Gaussian, the steps of EM algorithm may not have… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

  8. arXiv:2102.04704  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Output Perturbation for Differentially Private Convex Optimization with Improved Population Loss Bounds, Runtimes and Applications to Private Adversarial Training

    Authors: Andrew Lowy, Meisam Razaviyayn

    Abstract: Finding efficient, easily implementable differentially private (DP) algorithms that offer strong excess risk bounds is an important problem in modern machine learning. To date, most work has focused on private empirical risk minimization (ERM) or private population loss minimization. However, there are often other objectives--such as fairness, adversarial robustness, or sensitivity to outliers--be… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

  9. arXiv:2012.02901  [pdf, other

    math.ST stat.AP stat.ME stat.ML

    Near-Optimal Procedures for Model Discrimination with Non-Disclosure Properties

    Authors: Dmitrii M. Ostrovskii, Mohamed Ndaoud, Adel Javanmard, Meisam Razaviyayn

    Abstract: Let $θ_0,θ_1 \in \mathbb{R}^d$ be the population risk minimizers associated to some loss $\ell:\mathbb{R}^d\times \mathcal{Z}\to\mathbb{R}$ and two distributions $\mathbb{P}_0,\mathbb{P}_1$ on $\mathcal{Z}$. The models $θ_0,θ_1$ are unknown, and $\mathbb{P}_0,\mathbb{P}_1$ can be accessed by drawing i.i.d samples from them. Our work is motivated by the following model discrimination question: "Wha… ▽ More

    Submitted 10 July, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

    Comments: 52 pages, 2 figures; corrected the proof of the lower bound; added new applications and the Fisher information-based argument in Appendix F

  10. arXiv:2006.08141  [pdf, other

    math.OC cs.LG stat.ML

    Non-convex Min-Max Optimization: Applications, Challenges, and Recent Theoretical Advances

    Authors: Meisam Razaviyayn, Tianjian Huang, Songtao Lu, Maher Nouiehed, Maziar Sanjabi, Mingyi Hong

    Abstract: The min-max optimization problem, also known as the saddle point problem, is a classical optimization problem which is also studied in the context of zero-sum games. Given a class of objective functions, the goal is to find a value for the argument which leads to a small objective value even for the worst case function in the given class. Min-max optimization problems have recently become very pop… ▽ More

    Submitted 18 August, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

    Journal ref: IEEE Signal Processing Magazine (Volume: 37, Issue: 5, Sept. 2020)

  11. arXiv:2003.08093  [pdf, other

    math.OC cs.GT cs.LG stat.ML

    Solving Non-Convex Non-Differentiable Min-Max Games using Proximal Gradient Method

    Authors: Babak Barazandeh, Meisam Razaviyayn

    Abstract: Min-max saddle point games appear in a wide range of applications in machine leaning and signal processing. Despite their wide applicability, theoretical studies are mostly limited to the special convex-concave structure. While some recent works generalized these results to special smooth non-convex cases, our understanding of non-smooth scenarios is still limited. In this work, we study special f… ▽ More

    Submitted 18 March, 2020; originally announced March 2020.

  12. arXiv:2001.07819  [pdf, other

    stat.ML cs.DS cs.LG math.OC

    Zeroth-Order Algorithms for Nonconvex Minimax Problems with Improved Complexities

    Authors: Zhongruo Wang, Krishnakumar Balasubramanian, Shiqian Ma, Meisam Razaviyayn

    Abstract: In this paper, we study zeroth-order algorithms for minimax optimization problems that are nonconvex in one variable and strongly-concave in the other variable. Such minimax optimization problems have attracted significant attention lately due to their applications in modern machine learning tasks. We first consider a deterministic version of the problem. We design and analyze the Zeroth-Order Gra… ▽ More

    Submitted 4 April, 2022; v1 submitted 21 January, 2020; originally announced January 2020.

    Comments: To appear in the Journal of Global Optimization

  13. arXiv:1907.04450  [pdf, ps, other

    math.OC cs.CC stat.ML

    SNAP: Finding Approximate Second-Order Stationary Solutions Efficiently for Non-convex Linearly Constrained Problems

    Authors: Songtao Lu, Meisam Razaviyayn, Bo Yang, Kejun Huang, Mingyi Hong

    Abstract: This paper proposes low-complexity algorithms for finding approximate second-order stationary points (SOSPs) of problems with smooth non-convex objective and linear constraints. While finding (approximate) SOSPs is computationally intractable, we first show that generic instances of the problem can be solved efficiently. More specifically, for a generic problem instance, certain strict complementa… ▽ More

    Submitted 9 July, 2019; originally announced July 2019.

  14. arXiv:1906.12005  [pdf, other

    cs.LG stat.ML

    Rényi Fair Inference

    Authors: Sina Baharlouei, Maher Nouiehed, Ahmad Beirami, Meisam Razaviyayn

    Abstract: Machine learning algorithms have been increasingly deployed in critical automated decision-making systems that directly affect human lives. When these algorithms are only trained to minimize the training/test error, they could suffer from systematic discrimination against individuals based on their sensitive attributes such as gender or race. Recently, there has been a surge in machine learning so… ▽ More

    Submitted 13 January, 2020; v1 submitted 27 June, 2019; originally announced June 2019.

    Comments: 11 pages, 1 figure

    Journal ref: International Conference on Learning Representation, 2020

  15. arXiv:1904.09775  [pdf, other

    cs.LG cs.AI stat.ML

    Training generative networks using random discriminators

    Authors: Babak Barazandeh, Meisam Razaviyayn, Maziar Sanjabi

    Abstract: In recent years, Generative Adversarial Networks (GANs) have drawn a lot of attentions for learning the underlying distribution of data in various applications. Despite their wide applicability, training GANs is notoriously difficult. This difficulty is due to the min-max nature of the resulting optimization problem and the lack of proper tools of solving general (non-convex, non-concave) min-max… ▽ More

    Submitted 22 April, 2019; originally announced April 2019.

  16. arXiv:1902.08297  [pdf, other

    math.OC cs.LG stat.ML

    Solving a Class of Non-Convex Min-Max Games Using Iterative First Order Methods

    Authors: Maher Nouiehed, Maziar Sanjabi, Tianjian Huang, Jason D. Lee, Meisam Razaviyayn

    Abstract: Recent applications that arise in machine learning have surged significant interest in solving min-max saddle point games. This problem has been extensively studied in the convex-concave regime for which a global equilibrium solution can be computed efficiently. In this paper, we study the problem in the non-convex regime and show that an \varepsilon--first order stationary point of the game can b… ▽ More

    Submitted 30 October, 2019; v1 submitted 21 February, 2019; originally announced February 2019.

  17. arXiv:1809.08705  [pdf, other

    cs.LG stat.ML

    On the Behavior of the Expectation-Maximization Algorithm for Mixture Models

    Authors: Babak Barazandeh, Meisam Razaviyayn

    Abstract: Finite mixture models are among the most popular statistical models used in different data science disciplines. Despite their broad applicability, inference under these models typically leads to computationally challenging non-convex problems. While the Expectation-Maximization (EM) algorithm is the most popular approach for solving these non-convex problems, the behavior of this algorithm is not… ▽ More

    Submitted 23 September, 2018; originally announced September 2018.

  18. arXiv:1802.08249  [pdf, other

    cs.LG math.OC stat.ML

    On the Convergence and Robustness of Training GANs with Regularized Optimal Transport

    Authors: Maziar Sanjabi, Jimmy Ba, Meisam Razaviyayn, Jason D. Lee

    Abstract: Generative Adversarial Networks (GANs) are one of the most practical methods for learning data distributions. A popular GAN formulation is based on the use of Wasserstein distance as a metric between probability distributions. Unfortunately, minimizing the Wasserstein distance between the data distribution and the generative model distribution is a computationally challenging problem as its object… ▽ More

    Submitted 22 May, 2018; v1 submitted 21 February, 2018; originally announced February 2018.

  19. arXiv:1711.05323  [pdf, other

    stat.ML cs.LG

    On Optimal Generalizability in Parametric Learning

    Authors: Ahmad Beirami, Meisam Razaviyayn, Shahin Shahrampour, Vahid Tarokh

    Abstract: We consider the parametric learning problem, where the objective of the learner is determined by a parametric loss function. Employing empirical risk minimization with possibly regularization, the inferred parameter vector will be biased toward the training samples. Such bias is measured by the cross validation procedure in practice where the data set is partitioned into a training set used for tr… ▽ More

    Submitted 14 November, 2017; originally announced November 2017.

    Comments: Proc. of 2017 Advances in Neural Information Processing Systems (NIPS 2017)

  20. arXiv:1511.01776  [pdf, ps, other

    cs.LG stat.ML

    Computational Intractability of Dictionary Learning for Sparse Representation

    Authors: Meisam Razaviyayn, Hung-Wei Tseng, Zhi-Quan Luo

    Abstract: In this paper we consider the dictionary learning problem for sparse representation. We first show that this problem is NP-hard by polynomial time reduction of the densest cut problem. Then, using successive convex approximation strategies, we propose efficient dictionary learning schemes to solve several practical formulations of this problem to stationary points. Unlike many existing algorithms… ▽ More

    Submitted 5 November, 2015; originally announced November 2015.

  21. arXiv:1310.6957  [pdf, ps, other

    math.OC stat.ME

    Iteration Complexity Analysis of Block Coordinate Descent Methods

    Authors: Mingyi Hong, Xiangfeng Wang, Meisam Razaviyayn, Zhi-Quan Luo

    Abstract: In this paper, we provide a unified iteration complexity analysis for a family of general block coordinate descent (BCD) methods, covering popular methods such as the block coordinate gradient descent (BCGD) and the block coordinate proximal gradient (BCPG), under various different coordinate update rules. We unify these algorithms under the so-called Block Successive Upper-bound Minimization (BSU… ▽ More

    Submitted 28 April, 2015; v1 submitted 25 October, 2013; originally announced October 2013.