Skip to main content

Showing 1–50 of 113 results for author: Toh, K

.
  1. arXiv:2407.03294  [pdf, ps, other

    math.OC cs.LG

    Vertex Exchange Method for a Class of Quadratic Programming Problems

    Authors: Ling Liang, Kim-Chuan Toh, Haizhao Yang

    Abstract: A vertex exchange method is proposed for solving the strongly convex quadratic program subject to the generalized simplex constraint. We conduct rigorous convergence analysis for the proposed algorithm and demonstrate its essential roles in solving some important classes of constrained convex optimization. To get a feasible initial point to execute the algorithm, we also present and analyze a high… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 32 pages, 5 tables

    MSC Class: 90C06; 90C22; 90C25

  2. arXiv:2407.03272  [pdf, other

    math.OC math.NA

    Nesterov's Accelerated Jacobi-Type Methods for Large-scale Symmetric Positive Semidefinite Linear Systems

    Authors: Ling Liang, Qiyuan Pang, Kim-Chuan Toh, Haizhao Yang

    Abstract: Solving symmetric positive semidefinite linear systems is an essential task in many scientific computing problems. While Jacobi-type methods, including the classical Jacobi method and the weighted Jacobi method, exhibit simplicity in their forms and friendliness to parallelization, they are not attractive either because of the potential convergence failure or their slow convergence rate. This pape… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 20 pages

    MSC Class: 90C06; 90C22; 90C25

  3. arXiv:2406.18287  [pdf, other

    math.OC

    Learning-rate-free Momentum SGD with Reshuffling Converges in Nonsmooth Nonconvex Optimization

    Authors: Xiaoyin Hu, Nachuan Xiao, Xin Liu, Kim-Chuan Toh

    Abstract: In this paper, we propose a generalized framework for develo** learning-rate-free momentum stochastic gradient descent (SGD) methods in the minimization of nonsmooth nonconvex functions, especially in training nonsmooth neural networks. Our framework adaptively generates learning rates based on the historical data of stochastic subgradients and iterates. Under mild conditions, we prove that our… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 26 pages

  4. arXiv:2406.12013  [pdf, other

    math.OC

    Convergence rates of S.O.S hierarchies for polynomial semidefinite programs

    Authors: Hoang Anh Tran, Kim-Chuan Toh

    Abstract: We introduce a S.O.S hierarchy of lower bounds for a polynomial optimization problem whose constraint is expressed as a matrix polynomial semidefinite condition. Our approach involves utilizing a penalty function framework to directly address the matrix-based constraint, making it applicable to both discrete and continuous polynomial optimization problems. We investigate the convergence rates of t… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    MSC Class: 90C22; 90C26; 41A10; 41A50

  5. arXiv:2406.04646  [pdf, other

    math.OC

    An Inexact Bregman Proximal Difference-of-Convex Algorithm with Two Types of Relative Stop** Criteria

    Authors: Lei Yang, **g**g Hu, Kim-Chuan Toh

    Abstract: In this paper, we consider a class of difference-of-convex (DC) optimization problems, where the global Lipschitz gradient continuity assumption on the smooth part of the objective function is not required. Such problems are prevalent in many contemporary applications such as compressed sensing, statistical regression, and machine learning, and can be solved by a general Bregman proximal DC algori… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  6. arXiv:2404.17386  [pdf, other

    math.OC

    Stochastic Bregman Subgradient Methods for Nonsmooth Nonconvex Optimization Problems

    Authors: Kuangyu Ding, Kim-Chuan Toh

    Abstract: This paper focuses on the problem of minimizing a locally Lipschitz continuous function. Motivated by the effectiveness of Bregman gradient methods in training nonsmooth deep neural networks and the recent progress in stochastic subgradient methods for nonsmooth nonconvex optimization problems \cite{bolte2021conservative,bolte2022subgradient,xiao2023adam}, we investigate the long-term behavior of… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 28 pages, 6 figures

  7. arXiv:2404.09438  [pdf, other

    math.OC cs.LG stat.ML

    Develo** Lagrangian-based Methods for Nonsmooth Nonconvex Optimization

    Authors: Nachuan Xiao, Kuangyu Ding, Xiaoyin Hu, Kim-Chuan Toh

    Abstract: In this paper, we consider the minimization of a nonsmooth nonconvex objective function $f(x)$ over a closed convex subset $\mathcal{X}$ of $\mathbb{R}^n$, with additional nonsmooth nonconvex constraints $c(x) = 0$. We develop a unified framework for develo** Lagrangian-based methods, which takes a single-step update to the primal variables by some subgradient methods in each iteration. These su… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 30 pages, 4 figures

  8. arXiv:2402.15619  [pdf, other

    stat.AP stat.CO stat.ME

    Towards Improved Uncertainty Quantification of Stochastic Epidemic Models Using Sequential Monte Carlo

    Authors: Arindam Fadikar, Abby Stevens, Nicholson Collier, Kok Ben Toh, Olga Morozova, Anna Hotton, Jared Clark, David Higdon, Jonathan Ozik

    Abstract: Sequential Monte Carlo (SMC) algorithms represent a suite of robust computational methodologies utilized for state estimation and parameter inference within dynamical systems, particularly in real-time or online environments where data arrives sequentially over time. In this research endeavor, we propose an integrated framework that combines a stochastic epidemic simulator with a sequential import… ▽ More

    Submitted 6 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 10 pages, 5 figures

  9. arXiv:2402.06033  [pdf, ps, other

    math.OC cs.LG

    An Inexact Halpern Iteration with Application to Distributionally Robust Optimization

    Authors: Ling Liang, Kim-Chuan Toh, Jia-Jie Zhu

    Abstract: The Halpern iteration for solving monotone inclusion problems has gained increasing interests in recent years due to its simple form and appealing convergence properties. In this paper, we investigate the inexact variants of the scheme in both deterministic and stochastic settings. We conduct extensive convergence analysis and show that by choosing the inexactness tolerances appropriately, the ine… ▽ More

    Submitted 12 February, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Correct a typo in the title and update authors' information

  10. arXiv:2402.03942  [pdf, other

    math.OC

    Wasserstein distributionally robust optimization and its tractable regularization formulations

    Authors: Hong T. M. Chu, Meixia Lin, Kim-Chuan Toh

    Abstract: We study a variety of Wasserstein distributionally robust optimization (WDRO) problems where the distributions in the ambiguity set are chosen by constraining their Wasserstein discrepancies to the empirical distribution. Using the notion of weak Lipschitz property, we derive lower and upper bounds of the corresponding worst-case loss quantity and propose sufficient conditions under which this qua… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  11. arXiv:2312.13970  [pdf, other

    cs.LG cs.AI math.OC

    On Partial Optimal Transport: Revising the Infeasibility of Sinkhorn and Efficient Gradient Methods

    Authors: Anh Duc Nguyen, Tuan Dung Nguyen, Quang Minh Nguyen, Hoang H. Nguyen, Lam M. Nguyen, Kim-Chuan Toh

    Abstract: This paper studies the Partial Optimal Transport (POT) problem between two unbalanced measures with at most $n$ supports and its applications in various AI tasks such as color transfer or domain adaptation. There is hence the need for fast approximations of POT with increasingly large problem sizes in arising applications. We first theoretically and experimentally investigate the infeasibility of… ▽ More

    Submitted 22 December, 2023; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024

  12. arXiv:2312.07908  [pdf, ps, other

    math.OC

    A feasible method for general convex low-rank SDP problems

    Authors: Tianyun Tang, Kim-Chuan Toh

    Abstract: In this work, we consider the low rank decomposition (SDPR) of general convex semidefinite programming problems (SDP) that contain both a positive semidefinite matrix and a nonnegative vector as variables. We develop a rank-support-adaptive feasible method to solve (SDPR) based on Riemannian optimization. The method is able to escape from a saddle point to ensure its convergence to a global optima… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: 36 pages, 1 figure

    MSC Class: 90C06; 90C22; 90C30

  13. arXiv:2312.05801  [pdf, other

    cond-mat.mtrl-sci

    Stability and Character of Zero Field Skyrmionic States in Hybrid Magnetic Multilayer Nanodots

    Authors: Alexander Kang-Jun Toh, McCoy W. Lim, T. S. Suraj, Xiaoye Chen, Hang Khume Tan, Royston Lim, Xuan Min Cheng, Nelson Lim, Sherry Yap, Durgesh Kumar, S. N. Piramanayagam, Pin Ho, Anjan Soumyanarayanan

    Abstract: Ambient magnetic skyrmions stabilized in multilayer nanostructures are of immense interest due to their relevance to magnetic tunnel junction (MTJ) devices for memory and unconventional computing applications. However, existing skyrmionic nanostructures built using conventional metallic or oxide multilayer nanodots are unable to concurrently fulfill the requirements of nanoscale skyrmion stability… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  14. arXiv:2311.06448  [pdf, other

    math.OC

    A Sparse Smoothing Newton Method for Solving Discrete Optimal Transport Problems

    Authors: Di Hou, Ling Liang, Kim-Chuan Toh

    Abstract: The discrete optimal transport (OT) problem, which offers an effective computational tool for comparing two discrete probability distributions, has recently attracted much attention and played essential roles in many modern applications. This paper proposes to solve the discrete OT problem by applying a squared smoothing Newton method via the Huber smoothing function for solving the corresponding… ▽ More

    Submitted 16 May, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

    Comments: 29 pages, 17 figures

    MSC Class: 90C05; 90C06; 90C25

  15. arXiv:2311.01976  [pdf, other

    math.OC

    A Corrected Inexact Proximal Augmented Lagrangian Method with a Relative Error Criterion for a Class of Group-quadratic Regularized Optimal Transport Problems

    Authors: Lei Yang, Ling Liang, Hong T. M. Chu, Kim-Chuan Toh

    Abstract: The optimal transport (OT) problem and its related problems have attracted significant attention and have been extensively studied in various applications. In this paper, we focus on a class of group-quadratic regularized OT problems which aim to find solutions with specialized structures that are advantageous in practical scenarios. To solve this class of problems, we propose a corrected inexact… ▽ More

    Submitted 2 April, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: 37 pages, 6 figures

    MSC Class: 90C05; 90C06; 90C25

  16. arXiv:2310.08858  [pdf, other

    math.OC cs.AI cs.LG stat.ML

    Adam-family Methods with Decoupled Weight Decay in Deep Learning

    Authors: Kuangyu Ding, Nachuan Xiao, Kim-Chuan Toh

    Abstract: In this paper, we investigate the convergence properties of a wide class of Adam-family methods for minimizing quadratically regularized nonsmooth nonconvex optimization problems, especially in the context of training nonsmooth neural networks with weight decay. Motivated by the AdamW method, we propose a novel framework for Adam-family methods with decoupled weight decay. Within our framework, th… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 26 pages

  17. arXiv:2310.00376  [pdf, other

    math.OC

    Self-adaptive ADMM for semi-strongly convex problems

    Authors: Tianyun Tang, Kim-Chuan Toh

    Abstract: In this paper, we develop a self-adaptive ADMM that updates the penalty parameter adaptively. When one part of the objective function is strongly convex i.e., the problem is semi-strongly convex, our algorithm can update the penalty parameter adaptively with guaranteed convergence. We establish various types of convergence results including accelerated convergence rate of O(1/k^2), linear converge… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: 36 pages, 2 figures

    MSC Class: 90C06; 90C25; 90C90

  18. arXiv:2308.16690  [pdf, other

    math.OC

    On solving a rank regularized minimization problem via equivalent factorized column-sparse regularized models

    Authors: Wen**g Li, Wei Bian, Kim-Chuan Toh

    Abstract: Rank regularized minimization problem is an ideal model for the low-rank matrix completion/recovery problem. The matrix factorization approach can transform the high-dimensional rank regularized problem to a low-dimensional factorized column-sparse regularized problem. The latter can greatly facilitate fast computations in applicable algorithms, but needs to overcome the simultaneous non-convexity… ▽ More

    Submitted 20 May, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: 46 pages

    MSC Class: 90C46; 90C26; 65K05

  19. arXiv:2307.10855  [pdf, ps, other

    math.OC

    Quantifying low rank approximations of third order symmetric tensors

    Authors: Shenglong Hu, Defeng Sun, Kim-Chuan Toh

    Abstract: In this paper, we present a method to certify the approximation quality of a low rank tensor to a given third order symmetric tensor. Under mild assumptions, best low rank approximation is attained if a control parameter is zero or quantified quasi-optimal low rank approximation is obtained if the control parameter is positive.This is based on a primal-dual method for computing a low rank approxim… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: 46pages

  20. arXiv:2307.10053  [pdf, other

    math.OC cs.AI cs.LG stat.ML

    SGD-type Methods with Guaranteed Global Stability in Nonsmooth Nonconvex Optimization

    Authors: Nachuan Xiao, Xiaoyin Hu, Kim-Chuan Toh

    Abstract: In this paper, we focus on providing convergence guarantees for variants of the stochastic subgradient descent (SGD) method in minimizing nonsmooth nonconvex functions. We first develop a general framework to establish global stability for general stochastic subgradient methods, where the corresponding differential inclusion admits a coercive Lyapunov function. We prove that, with sufficiently sma… ▽ More

    Submitted 13 May, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: 36 pages

  21. arXiv:2306.17369  [pdf, other

    math.OC

    Adaptive sieving: A dimension reduction technique for sparse optimization problems

    Authors: Yancheng Yuan, Meixia Lin, Defeng Sun, Kim-Chuan Toh

    Abstract: In this paper, we propose an adaptive sieving (AS) strategy for solving general sparse machine learning models by effectively exploring the intrinsic sparsity of the solutions, wherein only a sequence of reduced problems with much smaller sizes need to be solved. We further apply the proposed AS strategy to generate solution paths for large-scale sparse optimization problems efficiently. We establ… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  22. arXiv:2306.14522  [pdf, other

    math.OC cs.LG

    Nonconvex Stochastic Bregman Proximal Gradient Method with Application to Deep Learning

    Authors: Kuangyu Ding, **gyang Li, Kim-Chuan Toh

    Abstract: The widely used stochastic gradient methods for minimizing nonconvex composite objective functions require the Lipschitz smoothness of the differentiable part. But the requirement does not hold true for problem classes including quadratic inverse problems and training neural networks. To address this issue, we investigate a family of stochastic Bregman proximal gradient (SBPG) methods, which only… ▽ More

    Submitted 29 June, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 37 pages

  23. arXiv:2306.14196  [pdf, other

    math.OC

    A Highly Efficient Algorithm for Solving Exclusive Lasso Problems

    Authors: Meixia Lin, Yancheng Yuan, Defeng Sun, Kim-Chuan Toh

    Abstract: The exclusive lasso (also known as elitist lasso) regularizer has become popular recently due to its superior performance on intra-group feature selection. Its complex nature poses difficulties for the computation of high-dimensional machine learning models involving such a regularizer. In this paper, we propose a highly efficient dual Newton method based proximal point algorithm (PPDNA) for solvi… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2009.08719

  24. arXiv:2306.11003  [pdf, other

    q-bio.PE physics.soc-ph

    Agent-based modeling of the COVID-19 pandemic in Florida

    Authors: Alexander N. Pillai, Kok Ben Toh, Dianela Perdomo, Sanjana Bhargava, Arlin Stoltzfus, Ira M. Longini Jr., Carl A. B. Pearson, Thomas J. Hladish

    Abstract: The onset of the COVID-19 pandemic drove a widespread, often uncoordinated effort by research groups to develop mathematical models of SARS-CoV-2 to study its spread and inform control efforts. The urgent demand for insight at the outset of the pandemic meant early models were typically either simple or repurposed from existing research agendas. Our group predominantly uses agent-based models (ABM… ▽ More

    Submitted 19 June, 2023; originally announced June 2023.

  25. arXiv:2305.03938  [pdf, other

    math.OC cs.LG stat.ML

    Adam-family Methods for Nonsmooth Optimization with Convergence Guarantees

    Authors: Nachuan Xiao, Xiaoyin Hu, Xin Liu, Kim-Chuan Toh

    Abstract: In this paper, we present a comprehensive study on the convergence properties of Adam-family methods for nonsmooth optimization, especially in the training of nonsmooth neural networks. We introduce a novel two-timescale framework that adopts a two-timescale updating scheme, and prove its convergence properties under mild assumptions. Our proposed framework encompasses various popular Adam-family… ▽ More

    Submitted 19 February, 2024; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: 53 pages

  26. arXiv:2305.03926  [pdf, other

    stat.AP stat.ME stat.ML

    Trajectory-oriented optimization of stochastic epidemiological models

    Authors: Arindam Fadikar, Mickael Binois, Nicholson Collier, Abby Stevens, Kok Ben Toh, Jonathan Ozik

    Abstract: Epidemiological models must be calibrated to ground truth for downstream tasks such as producing forward projections or running what-if scenarios. The meaning of calibration changes in case of a stochastic model since output from such a model is generally described via an ensemble or a distribution. Each member of the ensemble is usually mapped to a random number seed (explicitly or implicitly). W… ▽ More

    Submitted 13 September, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

  27. arXiv:2304.10092  [pdf, ps, other

    math.OC

    A Riemannian Dimension-reduced Second Order Method with Application in Sensor Network Localization

    Authors: Tianyun Tang, Kim-Chuan Toh, Nachuan Xiao, Yinyu Ye

    Abstract: In this paper, we propose a cubic-regularized Riemannian optimization method (RDRSOM), which partially exploits the second order information and achieves the iteration complexity of $\mathcal{O}(1/ε^{3/2})$. In order to reduce the per-iteration computational cost, we further propose a practical version of (RDRSOM), which is an extension of the well known Barzilai-Borwein method and achieves the it… ▽ More

    Submitted 24 April, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: 19 pages

  28. arXiv:2304.01467  [pdf, ps, other

    math.OC

    A Partial Exact Penalty Function Approach for Constrained Optimization

    Authors: Nachuan Xiao, Xin Liu, Kim-Chuan Toh

    Abstract: In this paper, we focus on a class of constrained nonlinear optimization problems (NLP), where some of its equality constraints define a closed embedded submanifold $\mathcal{M}$ in $\mathbb{R}^n$. Although NLP can be solved directly by various existing approaches for constrained optimization in Euclidean space, these approaches usually fail to recognize the manifold structure of $\mathcal{M}$. To… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 27 pages

  29. arXiv:2303.06893  [pdf, ps, other

    math.OC

    On proximal augmented Lagrangian based decomposition methods for dual block-angular convex composite programming problems

    Authors: Kuang-Yu Ding, Xin-Yee Lam, Kim-Chuan Toh

    Abstract: We design inexact proximal augmented Lagrangian based decomposition methods for convex composite programming problems with dual block-angular structures. Our methods are particularly well suited for convex quadratic programming problems arising from stochastic programming models. The algorithmic framework is based on the application of the abstract inexact proximal ADMM framework developed in [Che… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

  30. arXiv:2303.06599  [pdf, ps, other

    math.OC

    A feasible method for solving an SDP relaxation of the quadratic knapsack problem

    Authors: Tianyun Tang, Kim-Chuan Toh

    Abstract: In this paper, we consider an SDP relaxation of the quadratic knapsack problem (QKP). After using the Burer-Monteiro factorization, we get a non-convex optimization problem, whose feasible region is an algebraic variety. Although there might be non-regular points on the algebraic variety, we prove that the algebraic variety is a smooth manifold except for a trivial point for a generic input data.… ▽ More

    Submitted 12 March, 2023; originally announced March 2023.

    MSC Class: 90C06; 90C10; 90C22

  31. arXiv:2303.05825  [pdf, ps, other

    math.OC

    A squared smoothing Newton method for semidefinite programming

    Authors: Ling Liang, Defeng Sun, Kim-Chuan Toh

    Abstract: This paper proposes a squared smoothing Newton method via the Huber smoothing function for solving semidefinite programming problems (SDPs). We first study the fundamental properties of the matrix-valued map** defined upon the Huber function. Using these results and existing ones in the literature, we then conduct rigorous convergence analysis and establish convergence properties for the propose… ▽ More

    Submitted 2 July, 2024; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: 49 pages

    MSC Class: 90C06; 90C22; 90C25

  32. arXiv:2302.08020  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    All-Electrical Skyrmionic Bits in a Chiral Magnetic Tunnel Junction

    Authors: Shaohai Chen, Pin Ho, James Lourembam, Alexander K. J. Toh, Jifei Huang, Xiaoye Chen, Hang Khume Tan, Sherry K. L. Yap, Royston J. J. Lim, Hui Ru Tan, T. S. Suraj, Yeow Teck Toh, Idayu Lim, **g Zhou, Hong **g Chung, Sze Ter Lim, Anjan Soumyanarayanan

    Abstract: Topological spin textures such as magnetic skyrmions hold considerable promise as robust, nanometre-scale, mobile bits for sustainable computing. A longstanding roadblock to unleashing their potential is the absence of a device enabling deterministic electrical readout of individual spin textures. Here we present the wafer-scale realization of a nanoscale chiral magnetic tunnel junction (MTJ) host… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: 8 pages, 5 figures

    Journal ref: Nature (2024) 627, 522

  33. arXiv:2212.02698  [pdf, other

    math.OC cs.MS

    CDOpt: A Python Package for a Class of Riemannian Optimization

    Authors: Nachuan Xiao, Xiaoyin Hu, Xin Liu, Kim-Chuan Toh

    Abstract: Optimization over the embedded submanifold defined by constraints $c(x) = 0$ has attracted much interest over the past few decades due to its wide applications in various areas. Plenty of related optimization packages have been developed based on Riemannian optimization approaches, which rely on some basic geometrical materials of Riemannian manifolds, including retractions, vector transports, etc… ▽ More

    Submitted 28 March, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: 31 pages

  34. arXiv:2209.06175  [pdf, ps, other

    math.OC cs.LG math.AG

    Tractable hierarchies of convex relaxations for polynomial optimization on the nonnegative orthant

    Authors: Ngoc Hoang Anh Mai, Victor Magron, Jean-Bernard Lasserre, Kim-Chuan Toh

    Abstract: We consider polynomial optimization problems (POP) on a semialgebraic set contained in the nonnegative orthant (every POP on a compact set can be put in this format by a simple translation of the origin). Such a POP can be converted to an equivalent POP by squaring each variable. Using even symmetry and the concept of factor width, we propose a hierarchy of semidefinite relaxations based on the ex… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 39 pages, 15 tables

  35. arXiv:2208.07514  [pdf, other

    stat.ME math.OC math.ST

    On Efficient and Scalable Computation of the Nonparametric Maximum Likelihood Estimator in Mixture Models

    Authors: Yang**g Zhang, Ying Cui, Bodhisattva Sen, Kim-Chuan Toh

    Abstract: In this paper we study the computation of the nonparametric maximum likelihood estimator (NPMLE) in multivariate mixture models. Our first approach discretizes this infinite dimensional convex optimization problem by fixing the support points of the NPMLE and optimizing over the mixture proportions. In this context we propose, leveraging the sparsity of the solution, an efficient and scalable semi… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

    Journal ref: Journal of Machine Learning Research, 25 (2024), pp. 1-46

  36. arXiv:2208.00732  [pdf, ps, other

    math.OC

    An Improved Unconstrained Approach for Bilevel Optimization

    Authors: Xiaoyin Hu, Nachuan Xiao, Xin Liu, Kim-Chuan Toh

    Abstract: In this paper, we focus on the nonconvex-strongly-convex bilevel optimization problem (BLO). In this BLO, the objective function of the upper-level problem is nonconvex and possibly nonsmooth, and the lower-level problem is smooth and strongly convex with respect to the underlying variable $y$. We show that the feasible region of BLO is a Riemannian manifold. Then we transform BLO to its correspon… ▽ More

    Submitted 23 December, 2022; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: 27 pages, revised version

    MSC Class: 15A18; 65F15; 65K05; 90C06

  37. arXiv:2205.14922  [pdf, other

    cs.LG cs.CV

    ACIL: Analytic Class-Incremental Learning with Absolute Memorization and Privacy Protection

    Authors: Hui** Zhuang, Zhenyu Weng, Hongxin Wei, Renchunzi Xie, Kar-Ann Toh, Zhi** Lin

    Abstract: Class-incremental learning (CIL) learns a classification model with training data of different classes arising progressively. Existing CIL either suffers from serious accuracy loss due to catastrophic forgetting, or invades data privacy by revisiting used exemplars. Inspired by linear learning formulations, we propose an analytic class-incremental learning (ACIL) with absolute memorization of past… ▽ More

    Submitted 10 December, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: published in NeurIPS 2022

  38. arXiv:2205.10500  [pdf, other

    math.OC

    A Constraint Dissolving Approach for Nonsmooth Optimization over the Stiefel Manifold

    Authors: Xiaoyin Hu, Nachuan Xiao, Xin Liu, Kim-Chuan Toh

    Abstract: This paper focus on the minimization of a possibly nonsmooth objective function over the Stiefel manifold. The existing approaches either lack efficiency or can only tackle prox-friendly objective functions. We propose a constraint dissolving function named NCDF and show that it has the same first-order stationary points and local minimizers as the original problem in a neighborhood of the Stiefel… ▽ More

    Submitted 20 January, 2023; v1 submitted 21 May, 2022; originally announced May 2022.

    Comments: Revised version, 26 pages

  39. arXiv:2204.14067  [pdf, other

    cs.LG math.OC

    Accelerating nuclear-norm regularized low-rank matrix optimization through Burer-Monteiro decomposition

    Authors: Ching-pei Lee, Ling Liang, Tianyun Tang, Kim-Chuan Toh

    Abstract: This work proposes a rapid algorithm, BM-Global, for nuclear-norm-regularized convex and low-rank matrix optimization problems. BM-Global efficiently decreases the objective value via low-cost steps leveraging the nonconvex but smooth Burer-Monteiro (BM) decomposition, while effectively escapes saddle points and spurious local minima ubiquitous in the BM form to obtain guarantees of fast convergen… ▽ More

    Submitted 13 January, 2023; v1 submitted 29 April, 2022; originally announced April 2022.

    Comments: 51 pages, including 16 pages of supplementary materials

  40. arXiv:2203.10319  [pdf, ps, other

    math.OC

    Dissolving Constraints for Riemannian Optimization

    Authors: Nachuan Xiao, Xin Liu, Kim-Chuan Toh

    Abstract: In this paper, we consider optimization problems over closed embedded submanifolds of $\mathbb{R}^n$, which are defined by the constraints $c(x) = 0$. We propose a class of constraint dissolving approaches for these Riemannian optimization problems. In these proposed approaches, solving a Riemannian optimization problem is transferred into the unconstrained minimization of a constraint dissolving… ▽ More

    Submitted 14 October, 2022; v1 submitted 19 March, 2022; originally announced March 2022.

    Comments: 38 pages

  41. arXiv:2203.09721  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Deterministic Bridge Regression for Compressive Classification

    Authors: Kar-Ann Toh, Giuseppe Molteni, Zhi** Lin

    Abstract: Pattern classification with compact representation is an important component in machine intelligence. In this work, an analytic bridge solution is proposed for compressive classification. The proposal has been based upon solving a penalized error formulation utilizing an approximated $\ell_p$-norm. The solution comes in a primal form for over-determined systems and in a dual form for under-determi… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

  42. arXiv:2202.06504  [pdf, other

    cs.CV

    Analytic Learning of Convolutional Neural Network For Pattern Recognition

    Authors: Hui** Zhuang, Zhi** Lin, Yimin Yang, Kar-Ann Toh

    Abstract: Training convolutional neural networks (CNNs) with back-propagation (BP) is time-consuming and resource-intensive particularly in view of the need to visit the dataset multiple times. In contrast, analytic learning attempts to obtain the weights in one epoch. However, existing attempts to analytic learning considered only the multilayer perceptron (MLP). In this article, we propose an analytic con… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  43. arXiv:2112.04256  [pdf, ps, other

    math.OC

    Solving graph equipartition SDPs on an algebraic variety

    Authors: Tianyun Tang, Kim-Chuan Toh

    Abstract: Semidefinite programs are generally challenging to solve due to their high dimensionality. Burer and Monteiro developed a non-convex approach to solve linear SDP problems by applying its low rank property. Their approach is fast because they used factorization to reduce the problem size. In this paper, we focus on solving the SDP relaxation of a graph equipartition problem, which involves an addit… ▽ More

    Submitted 2 August, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: 44 pages, 0 figure

    MSC Class: 90C06; 90C22; 90C27

  44. arXiv:2109.05690  [pdf, other

    math.OC

    Inexact Bregman Proximal Gradient Method and its Inertial Variant with Absolute and Relative Stop** Criteria

    Authors: Lei Yang, Kim-Chuan Toh

    Abstract: The Bregman proximal gradient method (BPGM), which uses the Bregman distance as a proximity measure in the iterative scheme, has recently been re-developed for minimizing convex composite problems \textit{without} the global Lipschitz gradient continuity assumption. This makes the BPGM appealing for a wide range of applications, and hence it has received growing attention in recent years. However,… ▽ More

    Submitted 23 October, 2023; v1 submitted 12 September, 2021; originally announced September 2021.

  45. arXiv:2109.05251  [pdf, other

    math.OC

    DC algorithms for a class of sparse group $\ell_0$ regularized optimization problems

    Authors: Wen**g Li, Wei Bian, Kim-Chuan Toh

    Abstract: In this paper, we consider a class of sparse group $\ell_0$ regularized optimization problems. Firstly, we give a continuous relaxation model of the considered problem and establish the equivalence of these two problems in the sense of global minimizers. Then, we define a class of stationary points of the relaxation problem, and prove that any defined stationary point is a local minimizer of the c… ▽ More

    Submitted 5 May, 2022; v1 submitted 11 September, 2021; originally announced September 2021.

  46. arXiv:2109.03632  [pdf, other

    math.OC

    On Regularized Square-root Regression Problems: Distributionally Robust Interpretation and Fast Computations

    Authors: Hong T. M. Chu, Kim-Chuan Toh, Yang**g Zhang

    Abstract: Square-root (loss) regularized models have recently become popular in linear regression due to their nice statistical properties. Moreover, some of these models can be interpreted as the distributionally robust optimization counterparts of the traditional least-squares regularized models. In this paper, we give a unified proof to show that any square-root regularized model whose penalty function b… ▽ More

    Submitted 5 October, 2023; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: 39 pages, 7 figures

    Journal ref: Journal of Machine Learning Research, 23 (2022), pp 1-39

  47. arXiv:2108.07462  [pdf, ps, other

    math.OC

    A Dimension Reduction Technique for Large-scale Structured Sparse Optimization Problems with Application to Convex Clustering

    Authors: Yancheng Yuan, Tsung-Hui Chang, Defeng Sun, Kim-Chuan Toh

    Abstract: In this paper, we propose a novel adaptive sieving (AS) technique and an enhanced AS (EAS) technique, which are solver independent and could accelerate optimization algorithms for solving large scale convex optimization problems with intrinsic structured sparsity. We establish the finite convergence property of the AS technique and the EAS technique with inexact solutions of the reduced subproblem… ▽ More

    Submitted 17 August, 2021; originally announced August 2021.

    MSC Class: 90C06; 90C25; 90C90

  48. arXiv:2105.14033  [pdf, other

    math.OC cs.CV cs.LG

    An Inexact Projected Gradient Method with Rounding and Lifting by Nonlinear Programming for Solving Rank-One Semidefinite Relaxation of Polynomial Optimization

    Authors: Heng Yang, Ling Liang, Luca Carlone, Kim-Chuan Toh

    Abstract: We consider solving high-order semidefinite programming (SDP) relaxations of nonconvex polynomial optimization problems (POPs) that often admit degenerate rank-one optimal solutions. Instead of solving the SDP alone, we propose a new algorithmic framework that blends local search using the nonconvex POP into global descent using the convex SDP. In particular, we first design a globally convergent… ▽ More

    Submitted 26 October, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: Code available at https://github.com/MIT-SPARK/STRIDE

    MSC Class: 90C06; 90C22; 90C23; 90C55

  49. arXiv:2105.10370  [pdf, other

    math.OC

    Bregman Proximal Point Algorithm Revisited: A New Inexact Version and its Inertial Variant

    Authors: Lei Yang, Kim-Chuan Toh

    Abstract: We study a general convex optimization problem, which covers various classic problems in different areas and particularly includes many optimal transport related problems arising in recent years. To solve this problem, we revisit the classic Bregman proximal point algorithm (BPPA) and introduce a new inexact stop** condition for solving the subproblems, which can circumvent the underlying feasib… ▽ More

    Submitted 16 May, 2022; v1 submitted 21 May, 2021; originally announced May 2021.

  50. arXiv:2103.13108  [pdf, ps, other

    math.OC

    QPPAL: A two-phase proximal augmented Lagrangian method for high dimensional convex quadratic programming problems

    Authors: Ling Liang, Xudong Li, Defeng Sun, Kim-Chuan Toh

    Abstract: In this paper, we aim to solve high dimensional convex quadratic programming (QP) problems with a large number of quadratic terms, linear equality and inequality constraints. In order to solve the targeted {\bf QP} problems to a desired accuracy efficiently, we develop a two-phase {\bf P}roximal {\bf A}ugmented {\bf L}agrangian method {(QPPAL)}, with Phase I to generate a reasonably good initial p… ▽ More

    Submitted 28 January, 2022; v1 submitted 24 March, 2021; originally announced March 2021.

    Comments: 28 pages, 4 figures

    MSC Class: 90C06; 90C22; 90C25