Skip to main content

Showing 1–20 of 20 results for author: Freund, R M

.
  1. arXiv:2406.01942  [pdf, other

    math.OC

    The Role of Level-Set Geometry on the Performance of PDHG for Conic Linear Optimization

    Authors: Zikai Xiong, Robert M. Freund

    Abstract: We consider solving huge-scale instances of (convex) conic linear optimization problems, at the scale where matrix-factorization-free methods are attractive or necessary. The restarted primal-dual hybrid gradient method (rPDHG) -- with heuristic enhancements and GPU implementation -- has been very successful in solving huge-scale linear programming (LP) problems; however its application to more ge… ▽ More

    Submitted 23 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 68 pages, 10 figures, 3 tables

  2. arXiv:2312.14774  [pdf, other

    math.OC

    Computational Guarantees for Restarted PDHG for LP based on "Limiting Error Ratios" and LP Sharpness

    Authors: Zikai Xiong, Robert Michael Freund

    Abstract: In recent years, there has been growing interest in solving linear optimization problems - or more simply "LP" - using first-order methods. The restarted primal-dual hybrid gradient method (PDHG) - together with some heuristic techniques - has emerged as a powerful tool for solving huge-scale LPs. However, the theoretical understanding of it and the validation of various heuristic implementation t… ▽ More

    Submitted 29 April, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

  3. arXiv:2312.13773  [pdf, other

    math.OC

    On the Relation Between LP Sharpness and Limiting Error Ratio and Complexity Implications for Restarted PDHG

    Authors: Zikai Xiong, Robert M. Freund

    Abstract: There has been a recent surge in development of first-order methods (FOMs) for solving huge-scale linear programming (LP) problems. The attractiveness of FOMs for LP stems in part from the fact that they avoid costly matrix factorization computation. However, the efficiency of FOMs is significantly influenced - both in theory and in practice - by certain instance-specific LP condition measures. Xi… ▽ More

    Submitted 27 December, 2023; v1 submitted 21 December, 2023; originally announced December 2023.

  4. arXiv:2301.01530  [pdf, other

    math.OC

    Nonlinear conjugate gradient methods: worst-case convergence rates via computer-assisted analyses

    Authors: Shuvomoy Das Gupta, Robert M. Freund, Xu Andy Sun, Adrien Taylor

    Abstract: We propose a computer-assisted approach to the analysis of the worst-case convergence of nonlinear conjugate gradient methods (NCGMs). Those methods are known for their generally good empirical performances for large-scale optimization, while having relatively incomplete analyses. Using our computer-assisted approach, we establish novel complexity bounds for the Polak-Ribière-Polyak (PRP) and the… ▽ More

    Submitted 18 April, 2024; v1 submitted 4 January, 2023; originally announced January 2023.

    Comments: 48 pages, 6 figures, 9 tables

  5. arXiv:2208.13933  [pdf, other

    cs.LG math.OC stat.ML

    Using Taylor-Approximated Gradients to Improve the Frank-Wolfe Method for Empirical Risk Minimization

    Authors: Zikai Xiong, Robert M. Freund

    Abstract: The Frank-Wolfe method has become increasingly useful in statistical and machine learning applications, due to the structure-inducing properties of the iterates, and especially in settings where linear minimization over the feasible set is more computationally efficient than projection. In the setting of Empirical Risk Minimization -- one of the fundamental optimization problems in statistical and… ▽ More

    Submitted 21 November, 2023; v1 submitted 29 August, 2022; originally announced August 2022.

    Comments: 30 pages, 2 figures

  6. arXiv:2010.08999  [pdf, other

    math.OC

    Analysis of the Frank-Wolfe Method for Convex Composite Optimization involving a Logarithmically-Homogeneous Barrier

    Authors: Renbo Zhao, Robert M. Freund

    Abstract: We present and analyze a new generalized Frank-Wolfe method for the composite optimization problem $(P):{\min}_{x\in\mathbb{R}^n}\; f(\mathsf{A} x) + h(x)$, where $f$ is a $θ$-logarithmically-homogeneous self-concordant barrier, $\mathsf{A}$ is a linear operator and the function $h$ has bounded domain but is possibly non-smooth. We show that our generalized Frank-Wolfe method requires… ▽ More

    Submitted 5 December, 2021; v1 submitted 18 October, 2020; originally announced October 2020.

    Comments: See Version 1 (v1) for the analysis of the Frank-Wolfe method with adaptive step-size applied to the Hölder smooth functions

  7. arXiv:2002.11860  [pdf, other

    math.OC cs.LG

    Stochastic Frank-Wolfe for Constrained Finite-Sum Minimization

    Authors: Geoffrey Négiar, Gideon Dresdner, Alicia Tsai, Laurent El Ghaoui, Francesco Locatello, Robert M. Freund, Fabian Pedregosa

    Abstract: We propose a novel Stochastic Frank-Wolfe (a.k.a. conditional gradient) algorithm for constrained smooth finite-sum minimization with a generalized linear prediction/structure. This class of problems includes empirical risk minimization with sparse, low-rank, or other structured constraints. The proposed method is simple to implement, does not require step-size tuning, and has a constant per-itera… ▽ More

    Submitted 8 September, 2022; v1 submitted 26 February, 2020; originally announced February 2020.

    Comments: Proceedings of the 37th International Conference on Machine Learning, 2020

  8. arXiv:1910.03114  [pdf, ps, other

    math.OC

    An Oblivious Ellipsoid Algorithm for Solving a System of (In)Feasible Linear Inequalities

    Authors: Jourdain Lamperski, Robert M. Freund, Michael J. Todd

    Abstract: The ellipsoid algorithm is a fundamental algorithm for computing a solution to the system of $m$ linear inequalities in $n$ variables $(P): A^{\top}x \le u$ when its set of solutions has positive volume. However, when $(P)$ is infeasible, the ellipsoid algorithm has no mechanism for proving that $(P)$ is infeasible. This is in contrast to the other two fundamental algorithms for tackling $(P)$, na… ▽ More

    Submitted 28 December, 2020; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: 49 pages, 2 figures

    MSC Class: 90C05; 90C60; 68Q25

  9. arXiv:1810.08727  [pdf, ps, other

    math.OC cs.LG stat.CO stat.ML

    Condition Number Analysis of Logistic Regression, and its Implications for Standard First-Order Solution Methods

    Authors: Robert M. Freund, Paul Grigas, Rahul Mazumder

    Abstract: Logistic regression is one of the most popular methods in binary classification, wherein estimation of model parameters is carried out by solving the maximum likelihood (ML) optimization problem, and the ML estimator is defined to be the optimal solution of this problem. It is well known that the ML estimator exists when the data is non-separable, but fails to exist when the data is separable. Fir… ▽ More

    Submitted 19 October, 2018; originally announced October 2018.

    Comments: 38 pages

  10. arXiv:1807.07680  [pdf, other

    math.OC

    Generalized Stochastic Frank-Wolfe Algorithm with Stochastic "Substitute" Gradient for Structured Convex Optimization

    Authors: Haihao Lu, Robert M. Freund

    Abstract: The stochastic Frank-Wolfe method has recently attracted much general interest in the context of optimization for statistical and machine learning due to its ability to work with a more general feasible region. However, there has been a complexity gap in the guaranteed convergence rate for stochastic Frank-Wolfe compared to its deterministic counterpart. In this work, we present a new generalized… ▽ More

    Submitted 4 November, 2019; v1 submitted 19 July, 2018; originally announced July 2018.

  11. arXiv:1806.02476  [pdf, ps, other

    math.OC

    Accelerating Greedy Coordinate Descent Methods

    Authors: Haihao Lu, Robert M. Freund, Vahab Mirrokni

    Abstract: We study ways to accelerate greedy coordinate descent in theory and in practice, where "accelerate" refers either to $O(1/k^2)$ convergence in theory, in practice, or both. We introduce and study two algorithms: Accelerated Semi-Greedy Coordinate Descent (ASCD) and Accelerated Greedy Coordinate Descent (AGCD). While ASCD takes greedy steps in the $x$-updates and randomized steps in the $z$-updates… ▽ More

    Submitted 6 June, 2018; originally announced June 2018.

  12. arXiv:1610.05708  [pdf, ps, other

    math.OC

    Relatively-Smooth Convex Optimization by First-Order Methods, and Applications

    Authors: Haihao Lu, Robert M. Freund, Yurii Nesterov

    Abstract: The usual approach to develo** and analyzing first-order methods for smooth convex optimization assumes that the gradient of the objective function is uniformly smooth with some Lipschitz constant $L$. However, in many settings the differentiable convex function $f(\cdot)$ is not uniformly smooth -- for example in $D$-optimal design where $f(x):=-\ln \det(HXH^T)$, or even the univariate setting… ▽ More

    Submitted 10 October, 2017; v1 submitted 18 October, 2016; originally announced October 2016.

  13. arXiv:1511.02974  [pdf, other

    math.OC

    New Computational Guarantees for Solving Convex Optimization Problems with First Order Methods, via a Function Growth Condition Measure

    Authors: Robert M. Freund, Haihao Lu

    Abstract: Motivated by recent work of Renegar, we present new computational methods and associated computational guarantees for solving convex optimization problems using first-order methods. Our problem of interest is the general convex optimization problem $f^* = \min_{x \in Q} f(x)$, where we presume knowledge of a strict lower bound $f_{\mathrm{slb}} < f^*$. [Indeed, $f_{\mathrm{slb}}$ is naturally know… ▽ More

    Submitted 8 November, 2016; v1 submitted 9 November, 2015; originally announced November 2015.

    Comments: 1 figure

    MSC Class: 90C25

  14. arXiv:1511.02204  [pdf, other

    math.OC stat.CO stat.ML

    An Extended Frank-Wolfe Method with "In-Face" Directions, and its Application to Low-Rank Matrix Completion

    Authors: Robert M. Freund, Paul Grigas, Rahul Mazumder

    Abstract: Motivated principally by the low-rank matrix completion problem, we present an extension of the Frank-Wolfe method that is designed to induce near-optimal solutions on low-dimensional faces of the feasible region. This is accomplished by a new approach to generating ``in-face" directions at each iteration, as well as through new choice rules for selecting between in-face and ``regular" Frank-Wolfe… ▽ More

    Submitted 6 November, 2015; originally announced November 2015.

    Comments: 25 pages, 3 tables and 2 figues

    MSC Class: 90C25 ACM Class: G.1.6

  15. arXiv:1505.04243  [pdf, other

    math.ST cs.LG math.OC stat.ML

    A New Perspective on Boosting in Linear Regression via Subgradient Optimization and Relatives

    Authors: Robert M. Freund, Paul Grigas, Rahul Mazumder

    Abstract: In this paper we analyze boosting algorithms in linear regression from a new perspective: that of modern first-order methods in convex optimization. We show that classic boosting algorithms in linear regression, namely the incremental forward stagewise algorithm (FS$_\varepsilon$) and least squares boosting (LS-Boost($\varepsilon$)), can be viewed as subgradient descent to minimize the loss functi… ▽ More

    Submitted 16 May, 2015; originally announced May 2015.

    MSC Class: 62J05; 62J07; 90C25

  16. arXiv:1405.4350  [pdf, other

    physics.comp-ph math.OC physics.optics

    Robust topology optimization of three-dimensional photonic-crystal band-gap structures

    Authors: Han Men, Karen Y. K. Lee, Robert M. Freund, Jaime Peraire, Steven G. Johnson

    Abstract: We perform full 3D topology optimization (in which "every voxel" of the unit cell is a degree of freedom) of photonic-crystal structures in order to find optimal omnidirectional band gaps for various symmetry groups, including fcc (including diamond), bcc, and simple-cubic lattices. Even without imposing the constraints of any fabrication process, the resulting optimal gaps are only slightly large… ▽ More

    Submitted 19 May, 2014; v1 submitted 16 May, 2014; originally announced May 2014.

    Comments: 17 pages, 9 figures, submitted to Optics Express

    Journal ref: Optics Express, Vol. 22, Issue 19, pp. 22632-22648 (2014)

  17. Fabrication-Adaptive Optimization, with an Application to Photonic Crystal Design

    Authors: Han Men, Robert M. Freund, Ngoc C. Nguyen, Joel Saa-Seoane, Jaime Peraire

    Abstract: It is often the case that the computed optimal solution of an optimization problem cannot be implemented directly, irrespective of data accuracy, due to either (i) technological limitations (such as physical tolerances of machines or processes), (ii) the deliberate simplification of a model to keep it tractable (by ignoring certain types of constraints that pose computational difficulties), and/or… ▽ More

    Submitted 19 May, 2014; v1 submitted 21 July, 2013; originally announced July 2013.

    Journal ref: Operations research 62, 418-434 (2014)

  18. arXiv:1307.1192  [pdf, ps, other

    stat.ML cs.LG math.OC

    AdaBoost and Forward Stagewise Regression are First-Order Convex Optimization Methods

    Authors: Robert M. Freund, Paul Grigas, Rahul Mazumder

    Abstract: Boosting methods are highly popular and effective supervised learning methods which combine weak learners into a single accurate model with good statistical performance. In this paper, we analyze two well-known boosting methods, AdaBoost and Incremental Forward Stagewise Regression (FS$_\varepsilon$), by establishing their precise connections to the Mirror Descent algorithm, which is a first-order… ▽ More

    Submitted 3 July, 2013; originally announced July 2013.

    MSC Class: 68Q32; 68T05; 62J05; 90C25 ACM Class: I.2.6; I.5.1; G.3; G.1.6

  19. arXiv:1307.0873  [pdf, ps, other

    math.OC

    New Analysis and Results for the Frank-Wolfe Method

    Authors: Robert M. Freund, Paul Grigas

    Abstract: We present new results for the Frank-Wolfe method (also known as the conditional gradient method). We derive computational guarantees for arbitrary step-size sequences, which are then applied to various step-size rules, including simple averaging and constant step-sizes. We also develop step-size rules and computational guarantees that depend naturally on the warm-start quality of the initial (and… ▽ More

    Submitted 2 June, 2014; v1 submitted 2 July, 2013; originally announced July 2013.

    Comments: Changed the name of the method from "conditional gradient" to "Frank-Wolfe"

    Report number: MIT Operations Research Center Working Paper OR395-13 MSC Class: 90C25 ACM Class: G.1.6

  20. arXiv:0907.2267  [pdf, other

    math.OC math-ph physics.comp-ph physics.optics

    Band Gap Optimization of Two-Dimensional Photonic Crystals Using Semidefinite Programming and Subspace Methods

    Authors: Han Men, Ngoc-Cuong Nguyen, Robert M. Freund, Pablo A. Parrilo, Jaume Peraire

    Abstract: In this paper, we consider the optimal design of photonic crystal band structures for two-dimensional square lattices. The mathematical formulation of the band gap optimization problem leads to an infinite-dimensional Hermitian eigenvalue optimization problem parametrized by the dielectric material and the wave vector. To make the problem tractable, the original eigenvalue problem is discretized… ▽ More

    Submitted 13 July, 2009; originally announced July 2009.

    Comments: 23 pages, submitted

    Journal ref: Journal of Computational Physics 229 (2010) pp. 3706-3725