Showing 1–15 of 15 results for author: Kronqvist, J

  1. arXiv:2405.16267  [pdf, other]

    cs.LG

    A GPU-Accelerated Bi-linear ADMM Algorithm for Distributed Sparse Machine Learning

    Authors: Alireza Olama, Andreas Lundell, Jan Kronqvist, Elham Ahmadi, Eduardo Camponogara

    Abstract: This paper introduces the Bi-linear consensus Alternating Direction Method of Multipliers (Bi-cADMM), aimed at solving large-scale regularized Sparse Machine Learning (SML) problems defined over a network of computational nodes. Mathematically, these are stated as minimization problems with convex local loss functions over a global decision vector, subject to an explicit $\ell_0$ norm constraint t…

    Submitted 26 June, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  2. arXiv:2402.13595  [pdf, ps, other]

    math.OC cs.LG stat.ML

    A cutting plane algorithm for globally solving low dimensional k-means clustering problems

    Authors: Martin Ryner, Jan Kronqvist, Johan Karlsson

    Abstract: Clustering is one of the most fundamental tools in data science and machine learning, and k-means clustering is one of the most common such methods. There is a variety of approximate algorithms for the k-means problem, but computing the globally optimal solution is in general NP-hard. In this paper we consider the k-means problem for instances with low dimensional data and formulate it as a struct…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 12 pages, 3 figures

    MSC Class: 90C26 (Primary) 90C27 (Secondary)

    ACM Class: G.1.6

  3. arXiv:2402.12283  [pdf, other]

    math.OC

    Solution Polishing via Path Relinking for Continuous Black-Box Optimization

    Authors: Dimitri Papageorgiou, Jan Kronqvist, Asha Ramanujam, James Kor, Youngdae Kim, Can Li

    Abstract: When faced with a limited budget of function evaluations, state-of-the-art black-box optimization (BBO) solvers struggle to obtain globally, or sometimes even locally, optimal solutions. In such cases, one may pursue solution polishing, i.e., a computational method to improve (or "polish") an incumbent solution, typically via some sort of evolutionary algorithm involving two or more solutions. W…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 20 pages of main text, 7 figures in main text, 26 total figures

    MSC Class: 90

  4. arXiv:2307.10463  [pdf, other]

    math.OC

    LineWalker: Line Search for Black Box Derivative-Free Optimization and Surrogate Model Construction

    Authors: Dimitri J. Papageorgiou, Jan Kronqvist, Krishnan Kumaran

    Abstract: This paper describes a simple, but effective sampling method for optimizing and learning a discrete approximation (or surrogate) of a multi-dimensional function along a one-dimensional line segment of interest. The method does not rely on derivative information and the function to be learned can be a computationally-expensive "black box" function that must be queried via simulation or other mean…

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: 58 pages, 7 main figures, 29 total figures

  5. arXiv:2307.09057  [pdf, ps, other]

    math.OC cs.LG stat.ML

    Globally solving the Gromov-Wasserstein problem for point clouds in low dimensional Euclidean spaces

    Authors: Martin Ryner, Jan Kronqvist, Johan Karlsson

    Abstract: This paper presents a framework for computing the Gromov-Wasserstein problem between two sets of points in low dimensional spaces, where the discrepancy is the squared Euclidean norm. The Gromov-Wasserstein problem is a generalization of the optimal transport problem that finds the assignment between two sets preserving pairwise distances as much as possible. This can be used to quantify the simil…

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 20 pages, 5 figures

    MSC Class: 90C26

  6. arXiv:2305.06785  [pdf, other]

    math.OC

    Alternating mixed-integer programming and neural network training for approximating stochastic two-stage problems

    Authors: Jan Kronqvist, Boda Li, Jan Rolfes, Shudian Zhao

    Abstract: The presented work addresses two-stage stochastic programs (2SPs), a broadly applicable model to capture optimization problems subject to uncertain parameters with adjustable decision variables. In case the adjustable or second-stage variables contain discrete decisions, the corresponding 2SPs are known to be NP-complete. The standard approach of forming a single-stage deterministic equivalent pro…

    Submitted 19 July, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: 16 pages, 2 figures

    MSC Class: 90C15; 90C11; 90-10

  7. arXiv:2303.10985  [pdf, other]

    math.OC

    A Column Generation Approach for Radiation Therapy Patient Scheduling with Planned Machine Unavailability and Uncertain Future Arrivals

    Authors: Sara Frimodig, Per Enqvist, Jan Kronqvist

    Abstract: The number of cancer cases per year is rapidly increasing worldwide. In radiation therapy (RT), radiation from linear accelerators is used to kill malignant tumor cells. Scheduling patients for RT is difficult both due to the numerous medical and technical constraints, and because of the stochastic inflow of patients with different urgency levels. In this paper, a Column Generation (CG) approach i…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: 17 pages, 1 appendix

  8. A mixed-integer approximation of robust optimization problems with mixed-integer adjustments

    Authors: Jan Kronqvist, Boda Li, Jan Rolfes

    Abstract: In the present article we propose a mixed-integer approximation of adjustable-robust optimization (ARO) problems, that have both, continuous and discrete variables on the lowest level. As these trilevel problems are notoriously hard to solve, we restrict ourselves to weakly-connected instances. Our approach allows us to approximate, and in some cases exactly represent, the trilevel problem as a si…

    Submitted 19 July, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: 17 pages, 4 figures

    MSC Class: 90C17; 90C11; 90-10

  9. arXiv:2302.10344  [pdf, other]

    math.OC cs.LG

    Model-based feature selection for neural networks: A mixed-integer programming approach

    Authors: Shudian Zhao, Calvin Tsay, Jan Kronqvist

    Abstract: In this work, we develop a novel input feature selection framework for ReLU-based deep neural networks (DNNs), which builds upon a mixed-integer optimization approach. While the method is generally applicable to various classification tasks, we focus on finding input features for image classification for clarity of presentation. The idea is to use a trained DNN, or an ensemble of trained DNNs, to…

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: 15 pages, 3 figures, 5 tables

  10. arXiv:2210.16896  [pdf, other]

    math.OC

    Sparse Convex Optimization Toolkit: A Mixed-Integer Framework

    Authors: Alireza Olama, Eduardo Camponogara, Jan Kronqvist

    Abstract: This paper proposes an open-source distributed solver for solving Sparse Convex Optimization (SCO) problems over computational networks. Motivated by past algorithmic advances in mixed-integer optimization, the Sparse Convex Optimization Toolkit (SCOT) adopts a mixed-integer approach to find exact solutions to SCO problems. In particular, SCOT brings together various techniques to transform the or…

    Submitted 30 October, 2022; originally announced October 2022.

  11. arXiv:2202.05198  [pdf, other]

    math.OC cs.LG

    P-split formulations: A class of intermediate formulations between big-M and convex hull for disjunctive constraints

    Authors: Jan Kronqvist, Ruth Misener, Calvin Tsay

    Abstract: We develop a class of mixed-integer formulations for disjunctive constraints intermediate to the big-M and convex hull formulations in terms of relaxation strength. The main idea is to capture the best of both the big-M and convex hull formulations: a computationally light formulation with a tight relaxation. The "P-split" formulations are based on a lifted transformation that splits convex additi…

    Submitted 27 May, 2024; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: 29 pages, 6 figures

  12. arXiv:2201.10035  [pdf, other]

    stat.ML cs.AI cs.LG math.OC

    Maximizing information from chemical engineering data sets: Applications to machine learning

    Authors: Alexander Thebelt, Johannes Wiebe, Jan Kronqvist, Calvin Tsay, Ruth Misener

    Abstract: It is well-documented how artificial intelligence can have (and already is having) a big impact on chemical engineering. But classical machine learning approaches may be weak for many chemical engineering applications. This review discusses how challenging data characteristics arise in chemical engineering applications. We identify four characteristics of data arising in chemical engineering appli…

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: 34 pages, 3 figures, 1 table

  13. arXiv:2102.04373  [pdf, other]

    math.OC cs.LG stat.ML

    Partition-based formulations for mixed-integer optimization of trained ReLU neural networks

    Authors: Calvin Tsay, Jan Kronqvist, Alexander Thebelt, Ruth Misener

    Abstract: This paper introduces a class of mixed-integer formulations for trained ReLU neural networks. The approach balances model size and tightness by partitioning node inputs into a number of groups and forming the convex hull over the partitions via disjunctive programming. At one extreme, one partition per input recovers the convex hull of a node, i.e., the tightest possible formulation for each node.…

    Submitted 20 October, 2021; v1 submitted 8 February, 2021; originally announced February 2021.

    Comments: Conference on Neural Information Processing Systems (NeurIPS) 2021

  14. arXiv:2101.12708  [pdf, other]

    math.OC cs.LG

    Between steps: Intermediate relaxations between big-M and convex hull formulations

    Authors: Jan Kronqvist, Ruth Misener, Calvin Tsay

    Abstract: This work develops a class of relaxations in between the big-M and convex hull formulations of disjunctions, drawing advantages from both. The proposed "P-split" formulations split convex additively separable constraints into P partitions and form the convex hull of the partitioned disjuncts. Parameter P represents the trade-off of model size vs. relaxation strength. We examine the novel formulati…

    Submitted 29 January, 2021; originally announced January 2021.

    Comments: 16 pages

  15. arXiv:2003.04774  [pdf, other]

    stat.ML cs.AI cs.LG math.OC

    ENTMOOT: A Framework for Optimization over Ensemble Tree Models

    Authors: Alexander Thebelt, Jan Kronqvist, Miten Mistry, Robert M. Lee, Nathan Sudermann-Merx, Ruth Misener

    Abstract: Gradient boosted trees and other regression tree models perform well in a wide range of real-world, industrial applications. These tree models (i) offer insight into important prediction features, (ii) effectively manage sparse data, and (iii) have excellent prediction capabilities. Despite their advantages, they are generally unpopular for decision-making tasks and black-box optimization, which i…

    Submitted 18 May, 2021; v1 submitted 10 March, 2020; originally announced March 2020.

    Comments: 33 pages, 10 figures, 2 tables