Skip to main content

Showing 51–100 of 114 results for author: Toh, K

.
  1. arXiv:2103.13108  [pdf, ps, other

    math.OC

    QPPAL: A two-phase proximal augmented Lagrangian method for high dimensional convex quadratic programming problems

    Authors: Ling Liang, Xudong Li, Defeng Sun, Kim-Chuan Toh

    Abstract: In this paper, we aim to solve high dimensional convex quadratic programming (QP) problems with a large number of quadratic terms, linear equality and inequality constraints. In order to solve the targeted {\bf QP} problems to a desired accuracy efficiently, we develop a two-phase {\bf P}roximal {\bf A}ugmented {\bf L}agrangian method {(QPPAL)}, with Phase I to generate a reasonably good initial p… ▽ More

    Submitted 28 January, 2022; v1 submitted 24 March, 2021; originally announced March 2021.

    Comments: 28 pages, 4 figures

    MSC Class: 90C06; 90C22; 90C25

  2. An Analytic Layer-wise Deep Learning Framework with Applications to Robotics

    Authors: Huu-Thiet Nguyen, Chien Chern Cheah, Kar-Ann Toh

    Abstract: Deep learning (DL) has achieved great success in many applications, but it has been less well analyzed from the theoretical perspective. The unexplainable success of black-box DL models has raised questions among scientists and promoted the emergence of the field of explainable artificial intelligence (XAI). In robotics, it is particularly important to deploy DL algorithms in a predictable and sta… ▽ More

    Submitted 24 August, 2023; v1 submitted 6 February, 2021; originally announced February 2021.

    Comments: The paper has been published in Automatica

    Journal ref: Automatica, vol. 135, Jan. 2022

  3. Solving Challenging Large Scale QAPs

    Authors: Koichi Fujii, Naoki Ito, Sunyoung Kim, Masakazu Kojima, Yuji Shinano, Kim-Chuan Toh

    Abstract: We report our progress on the project for solving larger scale quadratic assignment problems (QAPs). Our main approach to solve large scale NP-hard combinatorial optimization problems such as QAPs is a parallel branch-and-bound method efficiently implemented on a powerful computer system using the Ubiquity Generator (UG) framework that can utilize more than 100,000 cores. Lower bounding procedures… ▽ More

    Submitted 23 January, 2021; originally announced January 2021.

    Comments: 15 pages

    Report number: ZIB-Report (21-02) MSC Class: 90C20; 90C22

  4. arXiv:2012.04862  [pdf, other

    math.OC

    An augmented Lagrangian method with constraint generation for shape-constrained convex regression problems

    Authors: Meixia Lin, Defeng Sun, Kim-Chuan Toh

    Abstract: Shape-constrained convex regression problem deals with fitting a convex function to the observed data, where additional constraints are imposed, such as component-wise monotonicity and uniform Lipschitz continuity. This paper provides a unified framework for computing the least squares estimator of a multivariate shape-constrained convex regression function in $\mathbb{R}^d$. We prove that the lea… ▽ More

    Submitted 20 November, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2002.11410

  5. arXiv:2012.03747  [pdf, other

    cs.LG cs.DC

    Accumulated Decoupled Learning: Mitigating Gradient Staleness in Inter-Layer Model Parallelization

    Authors: Hui** Zhuang, Zhi** Lin, Kar-Ann Toh

    Abstract: Decoupled learning is a branch of model parallelism which parallelizes the training of a network by splitting it depth-wise into multiple modules. Techniques from decoupled learning usually lead to stale gradient effect because of their asynchronous implementation, thereby causing performance degradation. In this paper, we propose an accumulated decoupled learning (ADL) which incorporates the grad… ▽ More

    Submitted 3 December, 2020; originally announced December 2020.

  6. arXiv:2011.14312  [pdf, other

    math.OC

    An efficient implementable inexact entropic proximal point algorithm for a class of linear programming problems

    Authors: Hong T. M. Chu, Ling Liang, Kim-Chuan Toh, Lei Yang

    Abstract: We introduce a class of specially structured linear programming (LP) problems, which has favorable modeling capability for important application problems in different areas such as optimal transport, discrete tomography and economics. To solve these generally large-scale LP problems efficiently, we design an implementable inexact entropic proximal point algorithm (iEPPA) combined with an easy-to-i… ▽ More

    Submitted 23 April, 2022; v1 submitted 29 November, 2020; originally announced November 2020.

    Comments: 28 pages, 6 figures

  7. Learning Graph Laplacian with MCP

    Authors: Yang**g Zhang, Kim-Chuan Toh, Defeng Sun

    Abstract: We consider the problem of learning a graph under the Laplacian constraint with a non-convex penalty: minimax concave penalty (MCP). For solving the MCP penalized graphical model, we design an inexact proximal difference-of-convex algorithm (DCA) and prove its convergence to critical points. We note that each subproblem of the proximal DCA enjoys the nice property that the objective function in it… ▽ More

    Submitted 5 October, 2023; v1 submitted 22 October, 2020; originally announced October 2020.

    Comments: 32 pages

  8. arXiv:2010.08772  [pdf, ps, other

    math.OC

    An Inexact Augmented Lagrangian Method for Second-order Cone Programming with Applications

    Authors: Ling Liang, Defeng Sun, Kim-Chuan Toh

    Abstract: In this paper, we adopt the augmented Lagrangian method (ALM) to solve convex quadratic second-order cone programming problems (SOCPs). Fruitful results on the efficiency of the ALM have been established in the literature. Recently, it has been shown in [Cui, Sun, and Toh, {\em Math. Program.}, 178 (2019), pp. 381--415] that if the quadratic growth condition holds at an optimal solution for the du… ▽ More

    Submitted 22 October, 2021; v1 submitted 17 October, 2020; originally announced October 2020.

    Comments: 25 pages, 0 figure

    MSC Class: 90C06; 90C22; 90C25

  9. arXiv:2009.11272  [pdf, ps, other

    math.OC

    On Degenerate Doubly Nonnegative Projection Problems

    Authors: Ying Cui, Ling Liang, Defeng Sun, Kim-Chuan Toh

    Abstract: The doubly nonnegative (DNN) cone, being the set of all positive semidefinite matrices whose elements are nonnegative, is a popular approximation of the computationally intractable completely positive cone. The major difficulty for implementing a Newton-type method to compute the projection of a given large scale matrix onto the DNN cone lies in the possible failure of the constraint nondegeneracy… ▽ More

    Submitted 1 September, 2021; v1 submitted 23 September, 2020; originally announced September 2020.

    Comments: 28 pages, 0 figure

    MSC Class: 90C06; 90C22; 90C25

  10. arXiv:2009.08719  [pdf, other

    math.OC

    Adaptive Sieving with PPDNA: Generating Solution Paths of Exclusive Lasso Models

    Authors: Meixia Lin, Yancheng Yuan, Defeng Sun, Kim-Chuan Toh

    Abstract: The exclusive lasso (also known as elitist lasso) regularization has become popular recently due to its superior performance on structured sparsity. Its complex nature poses difficulties for the computation of high-dimensional machine learning models involving such a regularizer. In this paper, we propose an adaptive sieving (AS) strategy for generating solution paths of machine learning models wi… ▽ More

    Submitted 18 September, 2020; originally announced September 2020.

    MSC Class: 90C06; 90C25; 90C90

  11. arXiv:2004.08115  [pdf, other

    math.OC stat.AP stat.CO stat.ML

    Estimation of sparse Gaussian graphical models with hidden clustering structure

    Authors: Meixia Lin, Defeng Sun, Kim-Chuan Toh, Cheng**g Wang

    Abstract: Estimation of Gaussian graphical models is important in natural science when modeling the statistical relationships between variables in the form of a graph. The sparsity and clustering structure of the concentration matrix is enforced to reduce model complexity and describe inherent regularities. We propose a model to estimate the sparse Gaussian graphical models with hidden clustering structure,… ▽ More

    Submitted 17 April, 2020; originally announced April 2020.

  12. arXiv:2002.11410  [pdf, other

    math.OC stat.ML

    Efficient algorithms for multivariate shape-constrained convex regression problems

    Authors: Meixia Lin, Defeng Sun, Kim-Chuan Toh

    Abstract: Shape-constrained convex regression problem deals with fitting a convex function to the observed data, where additional constraints are imposed, such as component-wise monotonicity and uniform Lipschitz continuity. This paper provides a comprehensive mechanism for computing the least squares estimator of a multivariate shape-constrained convex regression function in $\mathbb{R}^d$. We prove that t… ▽ More

    Submitted 26 February, 2020; originally announced February 2020.

  13. arXiv:2001.02118  [pdf, ps, other

    math.OC

    Mesh Independence of a Majorized ABCD Method for Sparse PDE-constrained Optimization Problems

    Authors: Xiaoliang Song, Defeng Sun, Kim-Chuan Toh

    Abstract: A majorized accelerated block coordinate descent (mABCD) method in Hilbert space is analyzed to solve a sparse PDE-constrained optimization problem via its dual. The finite element approximation method is investigated. The attractive $O(1/k^2)$ iteration complexity of {the mABCD} method for the dual objective function values can be achieved. Based on the convergence result, we prove the robustness… ▽ More

    Submitted 3 January, 2020; originally announced January 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1709.00005, arXiv:1708.09094, arXiv:1709.09539

  14. A Proximal Point Dual Newton Algorithm for Solving Group Graphical Lasso Problems

    Authors: Yang**g Zhang, Ning Zhang, Defeng Sun, Kim-Chuan Toh

    Abstract: Undirected graphical models have been especially popular for learning the conditional independence structure among a large number of variables where the observations are drawn independently and identically from the same distribution. However, many modern statistical problems would involve categorical data or time-varying data, which might follow different but related underlying distributions. In o… ▽ More

    Submitted 17 August, 2020; v1 submitted 11 June, 2019; originally announced June 2019.

    Comments: 24 pages

    MSC Class: 90C22; 90C25; 90C31; 62J10

    Journal ref: SIAM Journal on Optimization, 30 (2020) , 2197-2220

  15. arXiv:1905.12840  [pdf, other

    math.OC

    A Newton-bracketing method for a simple conic optimization problem

    Authors: Sunyoung Kim, Masakazu Kojima, Kim-Chuan Toh

    Abstract: For the Lagrangian-DNN relaxation of quadratic optimization problems (QOPs), we propose a Newton-bracketing method to improve the performance of the bisection-projection method implemented in BBCPOP [to appear in ACM Tran. Softw., 2019]. The relaxation problem is converted into the problem of finding the largest zero $y^*$ of a continuously differentiable (except at $y^*$) convex function… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

    Comments: 19 pages, 2 figures

    MSC Class: 90C20; 90C22; 90C25

  16. arXiv:1903.11460  [pdf, ps, other

    math.OC cs.LG math.NA stat.CO stat.ML

    A sparse semismooth Newton based proximal majorization-minimization algorithm for nonconvex square-root-loss regression problems

    Authors: Peipei Tang, Cheng**g Wang, Defeng Sun, Kim-Chuan Toh

    Abstract: In this paper, we consider high-dimensional nonconvex square-root-loss regression problems and introduce a proximal majorization-minimization (PMM) algorithm for these problems. Our key idea for making the proposed PMM to be efficient is to develop a sparse semismooth Newton method to solve the corresponding subproblems. By using the Kurdyka-Łojasiewicz property exhibited in the underlining proble… ▽ More

    Submitted 27 May, 2020; v1 submitted 27 March, 2019; originally announced March 2019.

    Comments: 34 pages, 8 tables

  17. arXiv:1903.09546  [pdf, ps, other

    math.OC

    An asymptotically superlinearly convergent semismooth Newton augmented Lagrangian method for Linear Programming

    Authors: Xudong Li, Defeng Sun, Kim-Chuan Toh

    Abstract: Powerful interior-point methods (IPM) based commercial solvers, such as Gurobi and Mosek, have been hugely successful in solving large-scale linear programming (LP) problems. The high efficiency of these solvers depends critically on the sparsity of the problem data and advanced matrix factorization techniques. For a large scale LP problem with data matrix $A$ that is dense (possibly structured) o… ▽ More

    Submitted 19 March, 2020; v1 submitted 22 March, 2019; originally announced March 2019.

    Comments: Due to the limitation "The abstract field cannot be longer than 1,920 characters", the abstract appearing here is slightly shorter than that in the PDF file

    MSC Class: 90C05; 90C06; 90C25; 65F10

  18. arXiv:1903.07325  [pdf, other

    math.OC

    Doubly nonnegative relaxations are equivalent to completely positive reformulations of quadratic optimization problems with block-clique graph structures

    Authors: Sunyoung Kim, Masakazu Kojima, Kim-Chuan Toh

    Abstract: We study the equivalence among a nonconvex QOP, its CPP and DNN relaxations under the assumption that the aggregated and correlative sparsity of the data matrices of the CPP relaxation is represented by a block-clique graph $G$. By exploiting the correlative sparsity, we decompose the CPP relaxation problem into a clique-tree structured family of smaller subproblems. Each subproblem is associated… ▽ More

    Submitted 18 March, 2019; originally announced March 2019.

    Comments: 25 pages, 4 figures

  19. An Efficient Linearly Convergent Regularized Proximal Point Algorithm for Fused Multiple Graphical Lasso Problems

    Authors: Ning Zhang, Yang**g Zhang, Defeng Sun, Kim-Chuan Toh

    Abstract: Nowadays, analysing data from different classes or over a temporal grid has attracted a great deal of interest. As a result, various multiple graphical models for learning a collection of graphical models simultaneously have been derived by introducing sparsity in graphs and similarity across multiple graphs. This paper focuses on the fused multiple graphical Lasso model which encourages not only… ▽ More

    Submitted 19 February, 2019; originally announced February 2019.

    Journal ref: SIAM Journal on Mathematics of Data Science, 3(2021), pp. 524-543

  20. arXiv:1902.00151  [pdf, ps, other

    math.OC cs.LG

    A dual Newton based preconditioned proximal point algorithm for exclusive lasso models

    Authors: Meixia Lin, Defeng Sun, Kim-Chuan Toh, Yancheng Yuan

    Abstract: The exclusive lasso (also known as elitist lasso) regularization has become popular recently due to its superior performance on group sparsity. Compared to the group lasso regularization which enforces the competition on variables among different groups, the exclusive lasso regularization also enforces the competition within each group. In this paper, we propose a highly efficient dual Newton base… ▽ More

    Submitted 6 December, 2019; v1 submitted 31 January, 2019; originally announced February 2019.

  21. arXiv:1901.02179  [pdf, other

    math.OC

    A Geometrical Analysis of a Class of Nonconvex Conic Programs for Convex Conic Reformulations of Quadratic and Polynomial Optimization Problems

    Authors: Sunyoung Kim, Masakazu Kojima, Kim-Chuan Toh

    Abstract: We present a geometrical analysis on the completely positive programming reformulation of quadratic optimization problems and its extension to polynomial optimization problems with a class of geometrically defined nonconvex conic programs and their covexification. The class of nonconvex conic programs is described with a linear objective functionin a linear space $V$, and the constraint set is rep… ▽ More

    Submitted 8 January, 2019; originally announced January 2019.

    Comments: 27 pages, 2 figures

    MSC Class: 90C20; 90C25; 90C26

  22. A Unified Algorithmic Framework of Symmetric Gauss-Seidel Decomposition based Proximal ADMMs for Convex Composite Programming

    Authors: Liang Chen, Defeng Sun, Kim-Chuan Toh, Ning Zhang

    Abstract: This paper aims to present a fairly accessible generalization of several symmetric Gauss-Seidel decomposition based multi-block proximal alternating direction methods of multipliers (ADMMs) for convex composite optimization problems. The proposed method unifies and refines many constructive techniques that were separately developed for the computational efficiency of multi-block ADMM-type algorith… ▽ More

    Submitted 4 April, 2019; v1 submitted 16 December, 2018; originally announced December 2018.

    MSC Class: 90C25; 90C22; 90C06; 65K05

    Journal ref: Journal of Computational Mathematics, 37(2019), 739--757

  23. arXiv:1812.05243  [pdf, other

    math.OC

    A New Homotopy Proximal Variable-Metric Framework for Composite Convex Minimization

    Authors: Quoc Tran-Dinh, Liang Ling, Kim-Chuan Toh

    Abstract: This paper suggests two novel ideas to develop new proximal variable-metric methods for solving a class of composite convex optimization problems. The first idea is a new parameterization of the optimality condition which allows us to develop a class of homotopy proximal variable-metric methods. We show that under appropriate assumptions such as strong convexity-type and smoothness, or self-concor… ▽ More

    Submitted 12 December, 2018; originally announced December 2018.

    Comments: 35 pages, 1 figure, and 6 tables

    Report number: UNC-STOR-3.12.2018 MSC Class: 90C25; 90C06; 90-08

  24. arXiv:1812.04941  [pdf, ps, other

    math.OC

    A semi-proximal augmented Lagrangian based decomposition method for primal block angular convex composite quadratic conic programming problems

    Authors: Xin-Yee Lam, Defeng Sun, Kim-Chuan Toh

    Abstract: We propose a semi-proximal augmented Lagrangian based decomposition method for convex composite quadratic conic programming problems with primal block angular structures. Using our algorithmic framework, we are able to naturally derive several well known augmented Lagrangian based decomposition methods for stochastic programming such as the diagonal quadratic approximation method of Mulvey and Rus… ▽ More

    Submitted 12 December, 2018; originally announced December 2018.

    Comments: 32 pages

  25. arXiv:1811.08227  [pdf, ps, other

    cs.LG stat.ML

    Analytic Network Learning

    Authors: Kar-Ann Toh

    Abstract: Based on the property that solving the system of linear matrix equations via the column space and the row space projections boils down to an approximation in the least squares error sense, a formulation for learning the weight matrices of the multilayer network can be derived. By exploiting into the vast number of feasible solutions of these interdependent weight matrices, the learning can be perf… ▽ More

    Submitted 20 November, 2018; originally announced November 2018.

    Comments: Some of the preliminary ideas of this work has been presented in the IEEE/ACIS 17th International Conference on Computer and Information Science: "Learning from the kernel and the range space" (ICIS 2018)

  26. arXiv:1810.13372  [pdf, ps, other

    math.OC

    Best Nonnegative Rank-One Approximations of Tensors

    Authors: Shenglong Hu, Defeng Sun, Kim-Chuan Toh

    Abstract: In this paper, we study the polynomial optimization problem of multi-forms over the intersection of the multi-spheres and the nonnegative orthants. This class of problems is NP-hard in general, and includes the problem of finding the best nonnegative rank-one approximation of a given tensor. A Positivstellensatz is given for this class of polynomial optimization problems, based on which a globally… ▽ More

    Submitted 31 October, 2018; originally announced October 2018.

    Comments: 27 pages

    MSC Class: 15A18; 15A42; 15A69; 90C22

  27. arXiv:1810.11581  [pdf, ps, other

    cs.LG stat.ML

    Gradient-Free Learning Based on the Kernel and the Range Space

    Authors: Kar-Ann Toh, Zhi** Lin, Zhengguo Li, Beomseok Oh, Lei Sun

    Abstract: In this article, we show that solving the system of linear equations by manipulating the kernel and the range space is equivalent to solving the problem of least squares error approximation. This establishes the ground for a gradient-free learning search when the system can be expressed in the form of a linear matrix equation. When the nonlinear activation function is invertible, the learning prob… ▽ More

    Submitted 26 October, 2018; originally announced October 2018.

    Comments: The idea of kernel and range projection was first introduced in the IEEE/ACIS ICIS conference which was held in Singapore in June 2018. This article presents a full development of the method supported by extensive numerical results

  28. arXiv:1810.09856  [pdf, ps, other

    math.OC

    Spectral operators of matrices: semismoothness and characterizations of the generalized Jacobian

    Authors: Chao Ding, Defeng Sun, Jie Sun, Kim-Chuan Toh

    Abstract: Spectral operators of matrices proposed recently in [C. Ding, D.F. Sun, J. Sun, and K.C. Toh, Math. Program. {\bf 168}, 509--531 (2018)] are a class of matrix valued functions, which map matrices to matrices by applying a vector-to-vector function to all eigenvalues/singular values of the underlying matrices. Spectral operators play a crucial role in the study of various applications involving mat… ▽ More

    Submitted 22 October, 2018; originally announced October 2018.

    Comments: 25 pages. arXiv admin note: substantial text overlap with arXiv:1401.2269

    MSC Class: 90C25; 90C06; 65K05; 49J50; 49J52

  29. arXiv:1810.09071  [pdf, ps, other

    cs.LG stat.ML

    Learning from the Kernel and the Range Space

    Authors: Kar-Ann Toh

    Abstract: In this article, a novel approach to learning a complex function which can be written as the system of linear equations is introduced. This learning is grounded upon the observation that solving the system of linear equations by a manipulation in the kernel and the range space boils down to an estimation based on the least squares error approximation. The learning approach is applied to learn a de… ▽ More

    Submitted 21 October, 2018; originally announced October 2018.

    Comments: Camera-ready finalized on 22 April 2018, paper presented on 07 June 2018 in the 17th IEEE/ACIS International Conference on Computer and Information Science (ICIS) 2018

  30. arXiv:1810.02677  [pdf, other

    cs.LG math.OC stat.ML

    Convex Clustering: Model, Theoretical Guarantee and Efficient Algorithm

    Authors: Defeng Sun, Kim-Chuan Toh, Yancheng Yuan

    Abstract: Clustering is a fundamental problem in unsupervised learning. Popular methods like K-means, may suffer from poor performance as they are prone to get stuck in its local minima. Recently, the sum-of-norms (SON) model (also known as the clustering path) has been proposed in Pelckmans et al. (2005), Lindsten et al. (2011) and Hocking et al. (2011). The perfect recovery properties of the convex cluste… ▽ More

    Submitted 4 October, 2018; originally announced October 2018.

    Comments: arXiv admin note: substantial text overlap with arXiv:1802.07091

  31. arXiv:1809.04249  [pdf, other

    math.OC stat.ML

    A Fast Globally Linearly Convergent Algorithm for the Computation of Wasserstein Barycenters

    Authors: Lei Yang, Jia Li, Defeng Sun, Kim-Chuan Toh

    Abstract: We consider the problem of computing a Wasserstein barycenter for a set of discrete probability distributions with finite supports, which finds many applications in areas such as statistics, machine learning and image processing. When the support points of the barycenter are pre-specified, this problem can be modeled as a linear programming (LP) problem whose size can be extremely large. To handle… ▽ More

    Submitted 26 December, 2020; v1 submitted 12 September, 2018; originally announced September 2018.

  32. arXiv:1808.07181  [pdf, other

    math.OC stat.ML

    Efficient sparse semismooth Newton methods for the clustered lasso problem

    Authors: Meixia Lin, Yong-** Liu, Defeng Sun, Kim-Chuan Toh

    Abstract: We focus on solving the clustered lasso problem, which is a least squares problem with the $\ell_1$-type penalties imposed on both the coefficients and their pairwise differences to learn the group structure of the regression parameters. Here we first reformulate the clustered lasso regularizer as a weighted ordered-lasso regularizer, which is essential in reducing the computational cost from… ▽ More

    Submitted 1 May, 2019; v1 submitted 21 August, 2018; originally announced August 2018.

  33. arXiv:1806.03404  [pdf, ps, other

    cs.LG stat.ML

    Deterministic Stretchy Regression

    Authors: Kar-Ann Toh, Lei Sun, Zhi** Lin

    Abstract: An extension of the regularized least-squares in which the estimation parameters are stretchable is introduced and studied in this paper. The solution of this ridge regression with stretchable parameters is given in primal and dual spaces and in closed-form. Essentially, the proposed solution stretches the covariance computation by a power term, thereby compressing or amplifying the estimation par… ▽ More

    Submitted 8 June, 2018; originally announced June 2018.

    Comments: Submitted for journal (JMLR) review since 28-Sept-2017

  34. arXiv:1804.00761  [pdf, other

    math.OC

    BBCPOP: A Sparse Doubly Nonnegative Relaxation of Polynomial Optimization Problems with Binary, Box and Complementarity Constraints

    Authors: Naoki Ito, Sunyoung Kim, Masakazu Kojima, Akiko Takeda, Kim-Chuan Toh

    Abstract: The software package BBCPOP is a MATLAB implementation of a hierarchy of sparse doubly nonnegative (DNN) relaxations of a class of polynomial optimization (minimization) problems (POPs) with binary, box and complementarity (BBC) constraints. Given a POP in the class and a relaxation order, BBCPOP constructs a simple conic optimization problem (COP), which serves as a DNN relaxation of the POP, and… ▽ More

    Submitted 2 April, 2018; originally announced April 2018.

    Comments: 28 pages, 4 figures

    MSC Class: 90C20; 90C22; 90C25; 90C26

  35. arXiv:1803.10803  [pdf, other

    math.OC

    On the Equivalence of Inexact Proximal ALM and ADMM for a Class of Convex Composite Programming

    Authors: Liang Chen, Xudong Li, Defeng Sun, Kim-Chuan Toh

    Abstract: In this paper, we show that for a class of linearly constrained convex composite optimization problems, an (inexact) symmetric Gauss-Seidel based majorized multi-block proximal alternating direction method of multipliers (ADMM) is equivalent to an {\em inexact} proximal augmented Lagrangian method (ALM). This equivalence not only provides new perspectives for understanding some ADMM-type algorithm… ▽ More

    Submitted 28 January, 2019; v1 submitted 28 March, 2018; originally announced March 2018.

    MSC Class: 90C25; 65K05; 90C06; 49M27; 90C20

  36. arXiv:1803.10740  [pdf, other

    math.OC

    Solving the OSCAR and SLOPE Models Using a Semismooth Newton-Based Augmented Lagrangian Method

    Authors: Ziyan Luo, Defeng Sun, Kim-Chuan Toh, Naihua Xiu

    Abstract: The octagonal shrinkage and clustering algorithm for regression (OSCAR), equipped with the $\ell_1$-norm and a pair-wise $\ell_{\infty}$-norm regularizer, is a useful tool for feature selection and grou** in high-dimensional data analysis. The computational challenge posed by OSCAR, for high dimensional and/or large sample size data, has not yet been well resolved due to the non-smoothness and i… ▽ More

    Submitted 28 March, 2018; originally announced March 2018.

  37. arXiv:1803.06566  [pdf, other

    math.OC

    Computing the Best Approximation Over the Intersection of a Polyhedral Set and the Doubly Nonnegative Cone

    Authors: Ying Cui, Defeng Sun, Kim-Chuan Toh

    Abstract: This paper introduces an efficient algorithm for computing the best approximation of a given matrix onto the intersection of linear equalities, inequalities and the doubly nonnegative cone (the cone of all positive semidefinite matrices whose elements are nonnegative). In contrast to directly applying the block coordinate descent type methods, we propose an inexact accelerated (two-)block coordina… ▽ More

    Submitted 17 March, 2018; originally announced March 2018.

  38. arXiv:1802.07091  [pdf, other

    math.OC cs.LG

    An Efficient Semismooth Newton Based Algorithm for Convex Clustering

    Authors: Yancheng Yuan, Defeng Sun, Kim-Chuan Toh

    Abstract: Clustering may be the most fundamental problem in unsupervised learning which is still active in machine learning research because its importance in many applications. Popular methods like K-means, may suffer from instability as they are prone to get stuck in its local minima. Recently, the sum-of-norms (SON) model (also known as clustering path), which is a convex relaxation of hierarchical clust… ▽ More

    Submitted 20 February, 2018; originally announced February 2018.

  39. An efficient Hessian based algorithm for solving large-scale sparse group Lasso problems

    Authors: Yang**g Zhang, Ning Zhang, Defeng Sun, Kim-Chuan Toh

    Abstract: The sparse group Lasso is a widely used statistical model which encourages the sparsity both on a group and within the group level. In this paper, we develop an efficient augmented Lagrangian method for large-scale non-overlap** sparse group Lasso problems with each subproblem being solved by a superlinearly convergent inexact semismooth Newton method. Theoretically, we prove that, if the penalt… ▽ More

    Submitted 16 December, 2017; originally announced December 2017.

    Journal ref: Mathematical Programming, 179 (2020), pp. 223-263

  40. arXiv:1710.10604  [pdf, other

    math.OC

    SDPNAL+: A Matlab software for semidefinite programming with bound constraints (version 1.0)

    Authors: Defeng Sun, Kim-Chuan Toh, Yancheng Yuan, Xin-Yuan Zhao

    Abstract: SDPNAL+ is a {\sc Matlab} software package that implements an augmented Lagrangian based method to solve large scale semidefinite programming problems with bound constraints. The implementation was initially based on a majorized semismooth Newton-CG augmented Lagrangian method, here we designed it within an inexact symmetric Gauss-Seidel based semi-proximal ADMM/ALM (alternating direction method o… ▽ More

    Submitted 16 May, 2019; v1 submitted 29 October, 2017; originally announced October 2017.

    Journal ref: Optimization Methods and Software (2019) [https://doi.org/10.1080/10556788.2019.1576176]

  41. arXiv:1706.08800  [pdf, other

    math.OC

    On the R-superlinear convergence of the KKT residues generated by the augmented Lagrangian method for convex composite conic programming

    Authors: Ying Cui, Defeng Sun, Kim-Chuan Toh

    Abstract: Due to the possible lack of primal-dual-type error bounds, the superlinear convergence for the Karush-Kuhn-Tucker (KKT) residues of the sequence generated by augmented Lagrangian method (ALM) for solving convex composite conic programming (CCCP) has long been an outstanding open question. In this paper, we aim to resolve this issue by first conducting convergence rate analysis for the ALM with Roc… ▽ More

    Submitted 27 June, 2017; originally announced June 2017.

  42. arXiv:1706.08732  [pdf, other

    math.OC

    On efficiently solving the subproblems of a level-set method for fused lasso problems

    Authors: Xudong Li, Defeng Sun, Kim-Chuan Toh

    Abstract: In applying the level-set method developed in [Van den Berg and Friedlander, SIAM J. on Scientific Computing, 31 (2008), pp.~890--912 and SIAM J. on Optimization, 21 (2011), pp.~1201--1229] to solve the fused lasso problems, one needs to solve a sequence of regularized least squares subproblems. In order to make the level-set method practical, we develop a highly efficient inexact semismooth Newto… ▽ More

    Submitted 27 June, 2017; originally announced June 2017.

    MSC Class: 90C06; 90C20; 90C22; 90C25

  43. arXiv:1703.06629  [pdf, ps, other

    math.NA

    A block symmetric Gauss-Seidel decomposition theorem for convex composite quadratic programming and its applications

    Authors: Xudong Li, Defeng Sun, Kim-Chuan Toh

    Abstract: For a symmetric positive semidefinite linear system of equations $\mathcal{Q} {\bf x} = {\bf b}$, where ${\bf x} = (x_1,\ldots,x_s)$ is partitioned into $s$ blocks, with $s \geq 2$, we show that each cycle of the classical block symmetric Gauss-Seidel (block sGS) method exactly solves the associated quadratic programming (QP) problem but added with an extra proximal term of the form… ▽ More

    Submitted 22 May, 2017; v1 submitted 20 March, 2017; originally announced March 2017.

    MSC Class: 90C06; 90C20; 90C25; 65F10

  44. arXiv:1702.05934  [pdf, other

    math.OC

    On the efficient computation of a generalized Jacobian of the projector over the Birkhoff polytope

    Authors: Xudong Li, Defeng Sun, Kim-Chuan Toh

    Abstract: We derive an explicit formula, as well as an efficient procedure, for constructing a generalized Jacobian for the projector of a given square matrix onto the Birkhoff polytope, i.e., the set of doubly stochastic matrices. To guarantee the high efficiency of our procedure, a semismooth Newton method for solving the dual of the projection problem is proposed and efficiently implemented. Extensive nu… ▽ More

    Submitted 31 August, 2018; v1 submitted 20 February, 2017; originally announced February 2017.

    MSC Class: 90C06; 90C20; 90C25; 65F10

  45. arXiv:1611.09065  [pdf, other

    cs.CY

    DrivingStyles: A mobile platform for driving styles and fuel consumption characterization

    Authors: Javier E. Meseguer, C. K. Toh, Carlos T. Calafate, Juan Carlos Cano, Pietro Manzoni

    Abstract: Intelligent Transportation Systems (ITS) rely on connected vehicle applications to address real-world problems. Research is currently being conducted to support safety, mobility and environmental applications. This paper presents the DrivingStyles architecture, which adopts data mining techniques and neural networks to analyze and generate a classification of driving styles and fuel consumption ba… ▽ More

    Submitted 28 November, 2016; originally announced November 2016.

    Comments: Journal of Communications and Networks

  46. arXiv:1610.00875  [pdf, ps, other

    math.OC

    On the Asymptotic Superlinear Convergence of the Augmented Lagrangian Method for Semidefinite Programming with Multiple Solutions

    Authors: Ying Cui, Defeng Sun, Kim-Chuan Toh

    Abstract: Solving large scale convex semidefinite programming (SDP) problems has long been a challenging task numerically. Fortunately, several powerful solvers including SDPNAL, SDPNAL+ and QSDPNAL have recently been developed to solve linear and convex quadratic SDP problems to high accuracy successfully. These solvers are based on the augmented Lagrangian method (ALM) applied to the dual problems with th… ▽ More

    Submitted 4 October, 2016; originally announced October 2016.

  47. arXiv:1609.07664  [pdf, other

    stat.ML math.OC

    Max-Norm Optimization for Robust Matrix Recovery

    Authors: Ethan X. Fang, Han Liu, Kim-Chuan Toh, Wen-Xin Zhou

    Abstract: This paper studies the matrix completion problem under arbitrary sampling schemes. We propose a new estimator incorporating both max-norm and nuclear-norm regularization, based on which we can conduct efficient low-rank matrix recovery using a random subset of entries observed with additive noise under general non-uniform and unknown sampling distributions. This method significantly relaxes the un… ▽ More

    Submitted 24 September, 2016; originally announced September 2016.

    Comments: 32 pages, 4 figures

  48. arXiv:1607.05428  [pdf, ps, other

    math.OC

    A highly efficient semismooth Newton augmented Lagrangian method for solving Lasso problems

    Authors: Xudong Li, Defeng Sun, Kim-Chuan Toh

    Abstract: We develop a fast and robust algorithm for solving large scale convex composite optimization models with an emphasis on the $\ell_1$-regularized least squares regression (Lasso) problems. Despite the fact that there exist a large number of solvers in the literature for the Lasso problems, we found that no solver can efficiently handle difficult large scale regression problems with real data. By le… ▽ More

    Submitted 3 May, 2017; v1 submitted 19 July, 2016; originally announced July 2016.

    MSC Class: 65F10; 90C06; 90C25; 90C31

  49. arXiv:1607.01151  [pdf, ps, other

    math.OC

    Sparse-BSOS: a bounded degree SOS hierarchy for large scale polynomial optimization with sparsity

    Authors: Tillmann Weisser, Jean-Bernard Lasserre, Kim-Chuan Toh

    Abstract: We provide a sparse version of the bounded degree SOS hierarchy BSOS [7] for polynomial optimization problems. It permits to treat large scale problems which satisfy a structured sparsity pattern. When the sparsity pattern satisfies the running intersection property this Sparse-BSOS hierarchy of semidefinite programs (with semidefinite constraints of fixed size) converges to the global optimum of… ▽ More

    Submitted 27 May, 2017; v1 submitted 5 July, 2016; originally announced July 2016.

    Report number: Rapport LAAS n{\textdegree} 16193

  50. arXiv:1604.05473  [pdf, ps, other

    math.OC

    Fast algorithms for large scale generalized distance weighted discrimination

    Authors: Xin Yee Lam, J. S. Marron, Defeng Sun, Kim-Chuan Toh

    Abstract: High dimension low sample size statistical analysis is important in a wide range of applications. In such situations, the highly appealing discrimination method, support vector machine, can be improved to alleviate data piling at the margin. This leads naturally to the development of distance weighted discrimination (DWD), which can be modeled as a second-order cone programming problem and solved… ▽ More

    Submitted 16 August, 2017; v1 submitted 19 April, 2016; originally announced April 2016.

    MSC Class: 90C25; 90C06; 90C90