Skip to main content

Showing 1–50 of 59 results for author: Bertsimas, D

Searching in archive math. Search in all archives.
.
  1. arXiv:2405.07068  [pdf, other

    math.OC cs.LG

    Catastrophe Insurance: An Adaptive Robust Optimization Approach

    Authors: Dimitris Bertsimas, Cynthia Zeng

    Abstract: The escalating frequency and severity of natural disasters, exacerbated by climate change, underscore the critical role of insurance in facilitating recovery and promoting investments in risk reduction. This work introduces a novel Adaptive Robust Optimization (ARO) framework tailored for the calculation of catastrophe insurance premiums, with a case study applied to the United States National Flo… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  2. arXiv:2403.19871  [pdf, other

    cs.LG cs.AI math.OC

    Towards Stable Machine Learning Model Retraining via Slowly Varying Sequences

    Authors: Dimitris Bertsimas, Vassilis Digalakis Jr, Yu Ma, Phevos Paschalidis

    Abstract: We consider the task of retraining machine learning (ML) models when new batches of data become available. Existing methods focus largely on greedy approaches to find the best-performing model for each batch, without considering the stability of the model's structure across retraining iterations. In this study, we propose a methodology for finding sequences of ML models that are stable across retr… ▽ More

    Submitted 22 May, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  3. arXiv:2311.06960  [pdf, other

    cs.LG math.OC

    Robust Regression over Averaged Uncertainty

    Authors: Dimitris Bertsimas, Yu Ma

    Abstract: We propose a new formulation of robust regression by integrating all realizations of the uncertainty set and taking an averaged approach to obtain the optimal solution for the ordinary least-squared regression problem. We show that this formulation surprisingly recovers ridge regression and establishes the missing link between robust optimization and the mean squared error approaches for existing… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  4. arXiv:2311.01742  [pdf, other

    math.OC cs.LG

    Global Optimization: A Machine Learning Approach

    Authors: Dimitris Bertsimas, Georgios Margaritis

    Abstract: Many approaches for addressing Global Optimization problems typically rely on relaxations of nonlinear constraints over specific mathematical primitives. This is restricting in applications with constraints that are black-box, implicit or consist of more general primitives. Trying to address such limitations, Bertsimas and Ozturk (2023) proposed OCTHaGOn as a way of solving black-box global optimi… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: Submitted to the Journal of Global Optimization. 35 pages

  5. arXiv:2309.08162  [pdf, other

    math.OC

    Adaptive Pricing in Unit Commitment Under Load and Capacity Uncertainty

    Authors: Dimitris Bertsimas, Angelos G. Koulouras

    Abstract: The increase of renewables in the grid and the volatility of the load create uncertainties in the day-ahead prices of electricity markets. Adaptive robust optimization (ARO) and stochastic optimization have been used to make commitment and dispatch decisions that adapt to the load and capacity uncertainty. These approaches have been successfully applied in practice but current pricing approaches u… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  6. arXiv:2307.12409  [pdf, other

    cs.LG math.OC

    A Machine Learning Approach to Two-Stage Adaptive Robust Optimization

    Authors: Dimitris Bertsimas, Cheol Woo Kim

    Abstract: We propose an approach based on machine learning to solve two-stage linear adaptive robust optimization (ARO) problems with binary here-and-now variables and polyhedral uncertainty sets. We encode the optimal here-and-now decisions, the worst-case scenarios associated with the optimal here-and-now decisions, and the optimal wait-and-see decisions into what we denote as the strategy. We solve multi… ▽ More

    Submitted 7 December, 2023; v1 submitted 23 July, 2023; originally announced July 2023.

  7. arXiv:2305.17299  [pdf, other

    stat.ML cs.AI cs.LG math.OC

    Improving Stability in Decision Tree Models

    Authors: Dimitris Bertsimas, Vassilis Digalakis Jr

    Abstract: Owing to their inherently interpretable structure, decision trees are commonly used in applications where interpretability is essential. Recent work has focused on improving various aspects of decision trees, including their predictive power and robustness; however, their instability, albeit well-documented, has been addressed to a lesser extent. In this paper, we take a step towards the stabiliza… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

  8. arXiv:2305.12292  [pdf, other

    cs.LG math.OC stat.ML

    Optimal Low-Rank Matrix Completion: Semidefinite Relaxations and Eigenvector Disjunctions

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Sean Lo, Jean Pauphilet

    Abstract: Low-rank matrix completion consists of computing a matrix of minimal complexity that recovers a given set of observations as accurately as possible. Unfortunately, existing methods for matrix completion are heuristics that, while highly scalable and often identifying high-quality solutions, do not possess any optimality guarantees. We reexamine matrix completion with an optimality-oriented eye. We… ▽ More

    Submitted 26 January, 2024; v1 submitted 20 May, 2023; originally announced May 2023.

    Comments: Updated version with new numerics showcasing relaxation for rank k>1

  9. arXiv:2304.04308  [pdf, other

    cs.LG cs.AI math.OC

    Ensemble Modeling for Time Series Forecasting: an Adaptive Robust Optimization Approach

    Authors: Dimitris Bertsimas, Leonard Boussioux

    Abstract: Accurate time series forecasting is critical for a wide range of problems with temporal data. Ensemble modeling is a well-established technique for leveraging multiple predictive models to increase accuracy and robustness, as the performance of a single predictor can be highly variable due to shifts in the underlying data distribution. This paper proposes a new methodology for building robust ense… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

  10. arXiv:2303.07695  [pdf, other

    math.OC

    A Stochastic Benders Decomposition Scheme for Large-Scale Stochastic Network Design

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet, Periklis Petridis

    Abstract: Network design problems involve constructing edges in a transportation or supply chain network to minimize construction and daily operational costs. We study a stochastic version where operational costs are uncertain due to fluctuating demand and estimated as a sample average from historical data. This problem is computationally challenging, and instances with as few as 100 nodes often cannot be s… ▽ More

    Submitted 29 April, 2024; v1 submitted 14 March, 2023; originally announced March 2023.

  11. arXiv:2303.06515  [pdf, other

    math.OC cs.LG

    Multistage Stochastic Optimization via Kernels

    Authors: Dimitris Bertsimas, Kimberly Villalobos Carballo

    Abstract: We develop a non-parametric, data-driven, tractable approach for solving multistage stochastic optimization problems in which decisions do not affect the uncertainty. The proposed framework represents the decision variables as elements of a reproducing kernel Hilbert space and performs functional stochastic gradient descent to minimize the empirical regularized loss. By incorporating sparsificatio… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

  12. arXiv:2302.10369  [pdf, other

    math.OC

    The Benefit of Uncertainty Coupling in Robust and Adaptive Robust Optimization

    Authors: Dimitris Bertsimas, Liangyuan Na, Bartolomeo Stellato

    Abstract: Despite the modeling power for problems under uncertainty, robust optimization (RO) and adaptive robust optimization (ARO) can exhibit too conservative solutions in terms of objective value degradation compared to the nominal case. One of the main reasons behind this conservatism is that, in many practical applications, uncertain constraints are directly designed as constraint-wise without taking… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Comments: 51 pages, 13 figures

  13. arXiv:2210.08326  [pdf, ps, other

    stat.ME cs.LG math.OC stat.ML

    Distributionally Robust Causal Inference with Observational Data

    Authors: Dimitris Bertsimas, Kosuke Imai, Michael Lingzhi Li

    Abstract: We consider the estimation of average treatment effects in observational studies and propose a new framework of robust causal inference with unobserved confounders. Our approach is based on distributionally robust optimization and proceeds in two steps. We first specify the maximal degree to which the distribution of unobserved potential outcomes may deviate from that of observed outcomes. We then… ▽ More

    Submitted 2 February, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

  14. arXiv:2210.06590  [pdf, other

    math.OC math.ST

    Sparse PCA: a Geometric Approach

    Authors: Dimitris Bertsimas, Driss Lahlou Kitane

    Abstract: We consider the problem of maximizing the variance explained from a data matrix using orthogonal sparse principal components that have a support of fixed cardinality. While most existing methods focus on building principal components (PCs) iteratively through deflation, we propose GeoSPCA, a novel algorithm to build all PCs at once while satisfying the orthogonality constraints which brings substa… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: 35 pages, 10 figures

  15. arXiv:2209.06341  [pdf, other

    math.OC

    Decarbonizing OCP

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Vassilis Digalakis Jr

    Abstract: We present our collaboration with the OCP Group, one of the world's largest producers of phosphate and phosphate-based products, to reduce OCP's carbon emissions significantly. We study the problem of decarbonizing OCP's electricity supply by installing a mixture of solar panels and batteries to minimize its time-discounted investment cost plus the cost of satisfying its remaining demand via the n… ▽ More

    Submitted 16 June, 2023; v1 submitted 13 September, 2022; originally announced September 2022.

    Comments: Submitted to MSOM on 08/2022

  16. arXiv:2206.00176  [pdf, other

    cs.LG eess.SY math.OC

    Learning Sparse Nonlinear Dynamics via Mixed-Integer Optimization

    Authors: Dimitris Bertsimas, Wes Gurnee

    Abstract: Discovering governing equations of complex dynamical systems directly from data is a central problem in scientific machine learning. In recent years, the sparse identification of nonlinear dynamics (SINDy) framework, powered by heuristic sparse regression methods, has become a dominant tool for learning parsimonious models. We propose an exact formulation of the SINDy problem using mixed-integer o… ▽ More

    Submitted 31 May, 2022; originally announced June 2022.

  17. arXiv:2202.06017  [pdf, other

    math.OC

    Global Optimization via Optimal Decision Trees

    Authors: Dimitris Bertsimas, Berk Öztürk

    Abstract: The global optimization literature places large emphasis on reducing intractable optimization problems into more tractable structured optimization forms. In order to achieve this goal, many existing methods are restricted to optimization over explicit constraints and objectives that use a subset of possible mathematical primitives. These are limiting in real-world contexts where more general expli… ▽ More

    Submitted 12 February, 2022; originally announced February 2022.

    Comments: 52 pages, 9 figures, 10 tables. Submitted to Operations Research

  18. arXiv:2112.09279  [pdf, other

    cs.LG math.OC stat.ML

    Robust Upper Bounds for Adversarial Training

    Authors: Dimitris Bertsimas, Xavier Boix, Kimberly Villalobos Carballo, Dick den Hertog

    Abstract: Many state-of-the-art adversarial training methods for deep learning leverage upper bounds of the adversarial loss to provide security guarantees against adversarial attacks. Yet, these methods rely on convex relaxations to propagate lower and upper bounds for intermediate layers, which affect the tightness of the bound at the output layer. We introduce a new approach to adversarial training by mi… ▽ More

    Submitted 5 April, 2023; v1 submitted 16 December, 2021; originally announced December 2021.

  19. arXiv:2111.04469  [pdf, other

    math.OC cs.LG stat.ML

    Mixed-Integer Optimization with Constraint Learning

    Authors: Donato Maragno, Holly Wiberg, Dimitris Bertsimas, S. Ilker Birbil, Dick den Hertog, Adejuyigbe Fajemisin

    Abstract: We establish a broad methodological foundation for mixed-integer optimization with learned constraints. We propose an end-to-end pipeline for data-driven decision making in which constraints and objectives are directly learned from data using machine learning, and the trained models are embedded in an optimization formulation. We exploit the mixed-integer optimization-representability of many mach… ▽ More

    Submitted 26 October, 2023; v1 submitted 4 November, 2021; originally announced November 2021.

  20. arXiv:2109.12701  [pdf, other

    stat.ML cs.LG math.OC

    Sparse Plus Low Rank Matrix Decomposition: A Discrete Optimization Approach

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Nicholas A. G. Johnson

    Abstract: We study the Sparse Plus Low-Rank decomposition problem (SLR), which is the problem of decomposing a corrupted data matrix into a sparse matrix of perturbations plus a low-rank matrix containing the ground truth. SLR is a fundamental problem in Operations Research and Machine Learning which arises in various applications, including data compression, latent semantic indexing, collaborative filterin… ▽ More

    Submitted 1 October, 2023; v1 submitted 26 September, 2021; originally announced September 2021.

    Journal ref: Journal of Machine Learning Research, 24(267), 1-51 (2023)

  21. arXiv:2105.05947  [pdf, other

    math.OC cs.LG stat.ML

    A new perspective on low-rank optimization

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

    Abstract: A key question in many low-rank problems throughout optimization, machine learning, and statistics is to characterize the convex hulls of simple low-rank sets and judiciously apply these convex hulls to obtain strong yet computationally tractable convex relaxations. We invoke the matrix perspective function - the matrix analog of the perspective function - and characterize explicitly the convex hu… ▽ More

    Submitted 2 March, 2022; v1 submitted 12 May, 2021; originally announced May 2021.

    Comments: Major revision submitted to Mathematical Programming

  22. arXiv:2103.02506  [pdf, ps, other

    math.OC stat.CO stat.ML

    Stochastic Cutting Planes for Data-Driven Optimization

    Authors: Dimitris Bertsimas, Michael Lingzhi Li

    Abstract: We introduce a stochastic version of the cutting-plane method for a large class of data-driven Mixed-Integer Nonlinear Optimization (MINLO) problems. We show that under very weak assumptions the stochastic algorithm is able to converge to an $ε$-optimal solution with high probability. Numerical experiments on several problems show that stochastic cutting planes is able to deliver a multiple order-… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

  23. arXiv:2102.10773  [pdf, other

    cs.LG math.OC stat.CO stat.ML

    Slowly Varying Regression under Sparsity

    Authors: Dimitris Bertsimas, Vassilis Digalakis Jr, Michael Linghzi Li, Omar Skali Lami

    Abstract: We present the framework of slowly varying regression under sparsity, allowing sparse regression models to exhibit slow and sparse variations. The problem of parameter estimation is formulated as a mixed-integer optimization problem. We demonstrate that it can be precisely reformulated as a binary convex optimization problem through a novel relaxation technique. This relaxation involves a new equa… ▽ More

    Submitted 11 November, 2023; v1 submitted 21 February, 2021; originally announced February 2021.

    Comments: Submitted to Operations Research. First submission: 02/2021

  24. arXiv:2102.07309  [pdf, other

    q-bio.PE math.DS math.OC

    Where to locate COVID-19 mass vaccination facilities?

    Authors: Dimitris Bertsimas, Vassilis Digalakis Jr., Alexander Jacquillat, Michael Lingzhi Li, Alessandro Previero

    Abstract: The outbreak of COVID-19 led to a record-breaking race to develop a vaccine. However, the limited vaccine capacity creates another massive challenge: how to distribute vaccines to mitigate the near-end impact of the pandemic? In the United States in particular, the new Biden administration is launching mass vaccination sites across the country, raising the obvious question of where to locate these… ▽ More

    Submitted 18 July, 2021; v1 submitted 14 February, 2021; originally announced February 2021.

  25. arXiv:2012.13331  [pdf, other

    math.OC econ.GN eess.SY

    Computation of Convex Hull Prices in Electricity Markets with Non-Convexities using Dantzig-Wolfe Decomposition

    Authors: Panagiotis Andrianesis, Dimitris Bertsimas, Michael C. Caramanis, William W. Hogan

    Abstract: The presence of non-convexities in electricity markets has been an active research area for about two decades. The -- inevitable under current marginal cost pricing -- problem of guaranteeing that no market participant incurs losses in the day-ahead market is addressed in current practice through make-whole payments a.k.a. uplift. Alternative pricing rules have been studied to deal with this probl… ▽ More

    Submitted 24 October, 2021; v1 submitted 24 December, 2020; originally announced December 2020.

    Comments: 11 pages

  26. arXiv:2012.04419  [pdf, ps, other

    math.OC

    Pareto Adaptive Robust Optimality via a Fourier-Motzkin Elimination Lens

    Authors: Dimitris Bertsimas, Stefan ten Eikelder, Dick den Hertog, Nikolaos Trichakis

    Abstract: We formalize the concept of Pareto Adaptive Robust Optimality (PARO) for linear Adaptive Robust Optimization (ARO) problems. A worst-case optimal solution pair of here-and-now decisions and wait-and-see decisions is PARO if it cannot be Pareto dominated by another solution, i.e., there does not exist another such pair that performs at least as good in all scenarios in the uncertainty set and stric… ▽ More

    Submitted 5 May, 2022; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Revised version. 41 pages, 3 figures

  27. arXiv:2009.10395  [pdf, other

    math.OC cs.LG stat.ML

    Mixed-Projection Conic Optimization: A New Paradigm for Modeling Rank Constraints

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

    Abstract: We propose a framework for modeling and solving low-rank optimization problems to certifiable optimality. We introduce symmetric projection matrices that satisfy $Y^2=Y$, the matrix analog of binary variables that satisfy $z^2=z$, to model rank constraints. By leveraging regularization and strong duality, we prove that this modeling paradigm yields tractable convex optimization problems over the n… ▽ More

    Submitted 2 April, 2021; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: major revision submitted to Operations Research

    Journal ref: Operations Research, Articles in Advance 2021

  28. arXiv:2006.16509  [pdf, other

    stat.AP math.OC q-bio.PE stat.ML

    From predictions to prescriptions: A data-driven response to COVID-19

    Authors: Dimitris Bertsimas, Léonard Boussioux, Ryan Cory Wright, Arthur Delarue, Vassilis Digalakis Jr., Alexandre Jacquillat, Driss Lahlou Kitane, Galit Lukin, Michael Lingzhi Li, Luca Mingardi, Omid Nohadani, Agni Orfanoudaki, Theodore Papalexopoulos, Ivan Paskov, Jean Pauphilet, Omar Skali Lami, Bartolomeo Stellato, Hamza Tazi Bouardi, Kimberly Villalobos Carballo, Holly Wiberg, Cynthia Zeng

    Abstract: The COVID-19 pandemic has created unprecedented challenges worldwide. Strained healthcare providers make difficult decisions on patient triage, treatment and care management on a daily basis. Policy makers have imposed social distancing measures to slow the disease, at a steep economic price. We design analytical tools to support these decisions and combat the pandemic. Specifically, we propose a… ▽ More

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: Submitted to PNAS

  29. arXiv:2005.05195  [pdf, other

    math.OC cs.LG math.ST stat.CO

    Solving Large-Scale Sparse PCA to Certifiable (Near) Optimality

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

    Abstract: Sparse principal component analysis (PCA) is a popular dimensionality reduction technique for obtaining principal components which are linear combinations of a small subset of the original features. Existing approaches cannot supply certifiably optimal principal components with more than $p=100s$ of variables. By reformulating sparse PCA as a convex mixed-integer semidefinite optimization problem,… ▽ More

    Submitted 25 August, 2021; v1 submitted 11 May, 2020; originally announced May 2020.

    Comments: Revision submitted to JMLR

    Journal ref: Journal of Machine Learning Research 23(13):1-35, 2022

  30. arXiv:1910.09092  [pdf, ps, other

    cs.LG math.OC stat.ME stat.ML

    Fast Exact Matrix Completion: A Unified Optimization Framework for Matrix Completion

    Authors: Dimitris Bertsimas, Michael Lingzhi Li

    Abstract: We formulate the problem of matrix completion with and without side information as a non-convex optimization problem. We design fastImpute based on non-convex gradient descent and show it converges to a global minimum that is guaranteed to recover closely the underlying matrix while it scales to matrices of sizes beyond $10^5 \times 10^5$. We report experiments on both synthetic and real-world dat… ▽ More

    Submitted 31 December, 2020; v1 submitted 20 October, 2019; originally announced October 2019.

    Journal ref: Journal of Machine Learning Research 21 (2020) 1-43

  31. arXiv:1910.03143  [pdf, other

    math.OC cs.LG stat.ML

    On Polyhedral and Second-Order Cone Decompositions of Semidefinite Optimization Problems

    Authors: Dimitris Bertsimas, Ryan Cory-Wright

    Abstract: We study a cutting-plane method for semidefinite optimization problems (SDOs), and supply a proof of the method's convergence, under a boundedness assumption. By relating the method's rate of convergence to an initial outer approximation's diameter, we argue that the method performs well when initialized with a second-order-cone approximation, instead of a linear approximation. We invoke the metho… ▽ More

    Submitted 2 December, 2019; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: Submitted minor revision to Operations Research Letters; removed footnotes and corrected some minor typos in previous version

    Journal ref: Operations Research Letters 48(1):78-85 (2020)

  32. arXiv:1907.07307  [pdf, other

    math.OC cs.LG stat.ML

    Dynamic optimization with side information

    Authors: Dimitris Bertsimas, Christopher McCord, Bradley Sturt

    Abstract: We develop a tractable and flexible approach for incorporating side information into dynamic optimization under uncertainty. The proposed framework uses predictive machine learning methods (such as $k$-nearest neighbors, kernel regression, and random forests) to weight the relative importance of various data-driven uncertainty sets in a robust optimization formulation. Through a novel measure conc… ▽ More

    Submitted 21 July, 2020; v1 submitted 16 July, 2019; originally announced July 2019.

  33. arXiv:1907.07142  [pdf, other

    math.OC

    Two-stage sample robust optimization

    Authors: Dimitris Bertsimas, Shimrit Shtern, Bradley Sturt

    Abstract: We investigate a simple approximation scheme, based on overlap** linear decision rules, for solving data-driven two-stage distributionally robust optimization problems with the type-$\infty$ Wasserstein ambiguity set. Our main result establishes that this approximation scheme is asymptotically optimal for two-stage stochastic linear optimization problems; that is, under mild assumptions, the opt… ▽ More

    Submitted 3 November, 2020; v1 submitted 16 July, 2019; originally announced July 2019.

  34. arXiv:1907.02206  [pdf, other

    math.OC cs.LG

    Online Mixed-Integer Optimization in Milliseconds

    Authors: Dimitris Bertsimas, Bartolomeo Stellato

    Abstract: We propose a method to solve online mixed-integer optimization (MIO) problems at very high speed using machine learning. By exploiting the repetitive nature of online optimization, we are able to greatly speedup the solution time. Our approach encodes the optimal solution into a small amount of information denoted as strategy using the Voice of Optimization framework proposed in [BS21]. In this wa… ▽ More

    Submitted 22 March, 2021; v1 submitted 3 July, 2019; originally announced July 2019.

  35. arXiv:1907.02109  [pdf, other

    math.OC cs.LG stat.ML

    A unified approach to mixed-integer optimization problems with logical constraints

    Authors: Dimitris Bertsimas, Ryan Cory-Wright, Jean Pauphilet

    Abstract: We propose a unified framework to address a family of classical mixed-integer optimization problems with logically constrained decision variables, including network design, facility location, unit commitment, sparse portfolio selection, binary quadratic optimization, sparse principal analysis and sparse learning problems. These problems exhibit logical relationships between continuous and discrete… ▽ More

    Submitted 25 January, 2021; v1 submitted 3 July, 2019; originally announced July 2019.

    Comments: Revised version (including title change). The old title was "A unified approach to mixed-integer optimization: Nonlinear formulations and scalable algorithms"

  36. arXiv:1906.10283  [pdf, other

    stat.ML cs.LG math.OC

    Certifiably Optimal Sparse Inverse Covariance Estimation

    Authors: Dimitris Bertsimas, Jourdain Lamperski, Jean Pauphilet

    Abstract: We consider the maximum likelihood estimation of sparse inverse covariance matrices. We demonstrate that current heuristic approaches primarily encourage robustness, instead of the desired sparsity. We give a novel approach that solves the cardinality constrained likelihood problem to certifiable optimality. The approach uses techniques from mixed-integer optimization and convex optimization, and… ▽ More

    Submitted 24 June, 2019; originally announced June 2019.

    MSC Class: 90C11; 90C22; 62H12

    Journal ref: Mathematical Programming 184 (2020) 491-530

  37. arXiv:1902.03272  [pdf, ps, other

    stat.ML cs.LG math.OC

    Scalable Holistic Linear Regression

    Authors: Dimitris Bertsimas, Michael Lingzhi Li

    Abstract: We propose a new scalable algorithm for holistic linear regression building on Bertsimas & King (2016). Specifically, we develop new theory to model significance and multicollinearity as lazy constraints rather than checking the conditions iteratively. The resulting algorithm scales with the number of samples $n$ in the 10,000s, compared to the low 100s in the previous framework. Computational res… ▽ More

    Submitted 3 March, 2020; v1 submitted 8 February, 2019; originally announced February 2019.

    Comments: Accepted by Operation Research Letters

  38. arXiv:1812.09991  [pdf, other

    math.OC

    The Voice of Optimization

    Authors: Dimitris Bertsimas, Bartolomeo Stellato

    Abstract: We introduce the idea that using optimal classification trees (OCTs) and optimal classification trees with-hyperplanes (OCT-Hs), interpretable machine learning algorithms developed by Bertsimas and Dunn [2017, 2018], we are able to obtain insight on the strategy behind the optimal solution in continuous and mixed-integer convex optimization problem as a function of key parameters that affect the p… ▽ More

    Submitted 2 June, 2020; v1 submitted 24 December, 2018; originally announced December 2018.

  39. arXiv:1812.06647  [pdf, ps, other

    math.OC cs.LG stat.ML

    Interpretable Matrix Completion: A Discrete Optimization Approach

    Authors: Dimitris Bertsimas, Michael Lingzhi Li

    Abstract: We consider the problem of matrix completion on an $n \times m$ matrix. We introduce the problem of Interpretable Matrix Completion that aims to provide meaningful insights for the low-rank matrix using side information. We show that the problem can be reformulated as a binary convex optimization problem. We design OptComplete, based on a novel concept of stochastic cutting planes to enable effici… ▽ More

    Submitted 3 March, 2020; v1 submitted 17 December, 2018; originally announced December 2018.

    Comments: Submitted to Operational Research

  40. A Scalable Algorithm For Sparse Portfolio Selection

    Authors: Dimitris Bertsimas, Ryan Cory-Wright

    Abstract: The sparse portfolio selection problem is one of the most famous and frequently-studied problems in the optimization and financial economics literatures. In a universe of risky assets, the goal is to construct a portfolio with maximal expected return and minimum variance, subject to an upper bound on the number of positions, linear inequalities and minimum investment constraints. Existing certifia… ▽ More

    Submitted 28 March, 2021; v1 submitted 31 October, 2018; originally announced November 2018.

    Comments: Minor revision submitted to INFORMS Journal on Computing

    Journal ref: INFORMS Journal on Computing, Articles in Advance, 2022

  41. arXiv:1807.02812  [pdf, other

    math.OC

    A Scalable Algorithm for Two-Stage Adaptive Linear Optimization

    Authors: Dimitris Bertsimas, Shimrit Shtern

    Abstract: The column-and-constraint generation (CCG) method was introduced by \citet{Zeng2013} for solving two-stage adaptive optimization. We found that the CCG method is quite scalable, but sometimes, and in some applications often, produces infeasible first-stage solutions, even though the problem is feasible. In this research, we extend the CCG method in a way that (a) maintains scalability and (b) alwa… ▽ More

    Submitted 8 July, 2018; originally announced July 2018.

  42. arXiv:1711.09974  [pdf, other

    math.OC math.PR stat.ML

    Bootstrap Robust Prescriptive Analytics

    Authors: Dimitris Bertsimas, Bart Van Parys

    Abstract: We address the problem of prescribing an optimal decision in a framework where the cost function depends on uncertain problem parameters that need to be learned from data. Earlier work proposed prescriptive formulations based on supervised machine learning methods. These prescriptive methods can factor in contextual information on a potentially large number of covariates to take context specific a… ▽ More

    Submitted 8 June, 2021; v1 submitted 27 November, 2017; originally announced November 2017.

  43. Sparse Classification: a scalable discrete optimization perspective

    Authors: Dimitris Bertsimas, Jean Pauphilet, Bart Van Parys

    Abstract: We formulate the sparse classification problem of $n$ samples with $p$ features as a binary convex optimization problem and propose a cutting-plane algorithm to solve it exactly. For sparse logistic regression and sparse SVM, our algorithm finds optimal solutions for $n$ and $p$ in the $10,000$s within minutes. On synthetic data our algorithm achieves perfect support recovery in the large sample r… ▽ More

    Submitted 30 June, 2020; v1 submitted 3 October, 2017; originally announced October 2017.

    Journal ref: Machine Learning, 2021

  44. arXiv:1709.10030  [pdf, other

    math.OC cs.LG stat.ML

    Sparse Hierarchical Regression with Polynomials

    Authors: Dimitris Bertsimas, Bart Van Parys

    Abstract: We present a novel method for exact hierarchical sparse polynomial regression. Our regressor is that degree $r$ polynomial which depends on at most $k$ inputs, counting at most $\ell$ monomial terms, which minimizes the sum of the squares of its prediction errors. The previous hierarchical sparse specification aligns well with modern big data settings where many inputs are not relevant for predict… ▽ More

    Submitted 28 September, 2017; originally announced September 2017.

  45. arXiv:1709.10029  [pdf, other

    math.OC stat.ML

    Sparse High-Dimensional Regression: Exact Scalable Algorithms and Phase Transitions

    Authors: Dimitris Bertsimas, Bart Van Parys

    Abstract: We present a novel binary convex reformulation of the sparse regression problem that constitutes a new duality perspective. We devise a new cutting plane method and provide evidence that it can solve to provable optimality the sparse regression problem for sample sizes n and number of regressors p in the 100,000s, that is two orders of magnitude better than the current state of the art, in seconds… ▽ More

    Submitted 28 September, 2017; originally announced September 2017.

  46. arXiv:1708.04527  [pdf, other

    stat.ME math.OC math.ST stat.CO stat.ML

    The Trimmed Lasso: Sparsity and Robustness

    Authors: Dimitris Bertsimas, Martin S. Copenhaver, Rahul Mazumder

    Abstract: Nonconvex penalty methods for sparse modeling in linear regression have been a topic of fervent interest in recent years. Herein, we study a family of nonconvex penalty functions that we call the trimmed Lasso and that offers exact control over the desired level of sparsity of estimators. We analyze its structural properties and in doing so show the following: 1) Drawing parallels between robust… ▽ More

    Submitted 15 August, 2017; originally announced August 2017.

    Comments: 32 pages (excluding appendix); 4 figures

  47. arXiv:1605.02347  [pdf, other

    math.OC

    The Power and Limits of Predictive Approaches to Observational-Data-Driven Optimization

    Authors: Dimitris Bertsimas, Nathan Kallus

    Abstract: While data-driven decision-making is transforming modern operations, most large-scale data is of an observational nature, such as transactional records. These data pose unique challenges in a variety of operational problems posed as stochastic optimization problems, including pricing and inventory management, where one must evaluate the effect of a decision, such as price or order quantity, on an… ▽ More

    Submitted 20 May, 2017; v1 submitted 8 May, 2016; originally announced May 2016.

  48. arXiv:1604.06837  [pdf, other

    stat.ME math.OC stat.CO

    Certifiably Optimal Low Rank Factor Analysis

    Authors: Dimitris Bertsimas, Martin S. Copenhaver, Rahul Mazumder

    Abstract: Factor Analysis (FA) is a technique of fundamental importance that is widely used in classical and modern multivariate statistics, psychometrics and econometrics. In this paper, we revisit the classical rank-constrained FA problem, which seeks to approximate an observed covariance matrix ($\boldsymbolΣ$), by the sum of a Positive Semidefinite (PSD) low-rank component ($\boldsymbolΘ$) and a diagona… ▽ More

    Submitted 22 April, 2016; originally announced April 2016.

    Journal ref: JMLR 18(29) (2017)

  49. arXiv:1507.03133  [pdf, other

    stat.ME math.OC stat.CO stat.ML

    Best Subset Selection via a Modern Optimization Lens

    Authors: Dimitris Bertsimas, Angela King, Rahul Mazumder

    Abstract: In the last twenty-five years (1990-2014), algorithmic advances in integer optimization combined with hardware improvements have resulted in an astonishing 200 billion factor speedup in solving Mixed Integer Optimization (MIO) problems. We present a MIO approach for solving the classical best subset selection problem of choosing $k$ out of $p$ features in linear regression given $n$ observations.… ▽ More

    Submitted 11 July, 2015; originally announced July 2015.

    Comments: This is a revised version (May, 2015) of the first submission in June 2014

  50. arXiv:1411.6160  [pdf, ps, other

    math.ST cs.LG math.OC stat.ML

    Characterization of the equivalence of robustification and regularization in linear and matrix regression

    Authors: Dimitris Bertsimas, Martin S. Copenhaver

    Abstract: The notion of develo** statistical methods in machine learning which are robust to adversarial perturbations in the underlying data has been the subject of increasing interest in recent years. A common feature of this work is that the adversarial robustification often corresponds exactly to regularization methods which appear as a loss function plus a penalty. In this paper we deepen and extend… ▽ More

    Submitted 25 February, 2017; v1 submitted 22 November, 2014; originally announced November 2014.

    MSC Class: 62J; 90C25; 49M29; 90C11; 15A83