Search | arXiv e-print repository

Differentiable Distributionally Robust Optimization Layers

Abstract: In recent years, there has been a growing research interest in decision-focused learning, which embeds optimization problems as a layer in learning pipelines and demonstrates a superior performance than the prediction-focused approach. However, for distributionally robust optimization (DRO), a popular paradigm for decision-making under uncertainty, it is still unknown how to embed it as a layer, i… ▽ More In recent years, there has been a growing research interest in decision-focused learning, which embeds optimization problems as a layer in learning pipelines and demonstrates a superior performance than the prediction-focused approach. However, for distributionally robust optimization (DRO), a popular paradigm for decision-making under uncertainty, it is still unknown how to embed it as a layer, i.e., how to differentiate decisions with respect to an ambiguity set. In this paper, we develop such differentiable DRO layers for generic mixed-integer DRO problems with parameterized second-order conic ambiguity sets and discuss its extension to Wasserstein ambiguity sets. To differentiate the mixed-integer decisions, we propose a novel dual-view methodology by handling continuous and discrete parts of decisions via different principles. Specifically, we construct a differentiable energy-based surrogate to implement the dual-view methodology and use importance sampling to estimate its gradient. We further prove that such a surrogate enjoys the asymptotic convergency under regularization. As an application of the proposed differentiable DRO layers, we develop a novel decision-focused learning pipeline for contextual distributionally robust decision-making tasks and compare it with the prediction-focused approach in experiments. △ Less

Submitted 24 June, 2024; originally announced June 2024.

Comments: In Forty-first International Conference on Machine Learning (2024)

arXiv:2311.02953 [pdf, other]

Data-Driven Bayesian Nonparametric Wasserstein Distributionally Robust Optimization

Authors: Chao Ning, Xutao Ma

Abstract: In this work, we develop a novel data-driven Bayesian nonparametric Wasserstein distributionally robust optimization (BNWDRO) framework for decision-making under uncertainty. The proposed framework unifies a Bayesian nonparametric method and the Wasserstein metric to decipher the global-local features of uncertainty data and encode these features into a novel data-driven ambiguity set. By establis… ▽ More In this work, we develop a novel data-driven Bayesian nonparametric Wasserstein distributionally robust optimization (BNWDRO) framework for decision-making under uncertainty. The proposed framework unifies a Bayesian nonparametric method and the Wasserstein metric to decipher the global-local features of uncertainty data and encode these features into a novel data-driven ambiguity set. By establishing the theoretical connection between this data-driven ambiguity set and the conventional Wasserstein ambiguity set, we prove that the proposed framework enjoys the finite sample guarantee and asymptotic consistency. To efficiently solve the resulting distributionally robust optimization problem under the BNWDRO framework, we derive for this optimization problem an equivalent reformulation, which is kept tractable for many practical scenarios. Numerical experiments on a unit commitment problem verify the effectiveness of the proposed BNWDRO framework compared with existing methods. △ Less

Submitted 6 November, 2023; originally announced November 2023.

Comments: 9 pages, including Supplementary Material

arXiv:2303.09337 [pdf, other]

A Unified Early Termination Technique for Primal-dual Algorithms in Mixed Integer Conic Programming

Authors: Yuwen Chen, Catherine Ning, Paul Goulart

Abstract: We propose an early termination technique for mixed integer conic programming for use within branch-and-bound based solvers. Our approach generalizes previous early termination results for ADMM-based solvers to a broader class of primal-dual algorithms, including both operator splitting methods and interior point methods. The complexity for checking early termination is $O(n)$ for each termination… ▽ More We propose an early termination technique for mixed integer conic programming for use within branch-and-bound based solvers. Our approach generalizes previous early termination results for ADMM-based solvers to a broader class of primal-dual algorithms, including both operator splitting methods and interior point methods. The complexity for checking early termination is $O(n)$ for each termination check assuming a bounded problem domain. We show that this domain restriction can be relaxed for problems whose data satisfies a simple rank condition, in which case each check requires an $O(n^2)$ solve using a linear system that must be factored only once at the root node. We further show how this approach can be used in hybrid model predictive control as long as system inputs are bounded. Numerical results show that our method leads to a moderate reduction in the total iterations required for branch-and-bound conic solvers with interior-point based subsolvers. △ Less

Submitted 16 March, 2023; originally announced March 2023.

arXiv:2011.11441 [pdf]

doi 10.1016/j.automatica.2020.109402

Online Learning Based Risk-Averse Stochastic MPC of Constrained Linear Uncertain Systems

Authors: Chao Ning, Fengqi You

Abstract: This paper investigates the problem of designing data-driven stochastic Model Predictive Control (MPC) for linear time-invariant systems under additive stochastic disturbance, whose probability distribution is unknown but can be partially inferred from data. We propose a novel online learning based risk-averse stochastic MPC framework in which Conditional Value-at-Risk (CVaR) constraints on system… ▽ More This paper investigates the problem of designing data-driven stochastic Model Predictive Control (MPC) for linear time-invariant systems under additive stochastic disturbance, whose probability distribution is unknown but can be partially inferred from data. We propose a novel online learning based risk-averse stochastic MPC framework in which Conditional Value-at-Risk (CVaR) constraints on system states are required to hold for a family of distributions called an ambiguity set. The ambiguity set is constructed from disturbance data by leveraging a Dirichlet process mixture model that is self-adaptive to the underlying data structure and complexity. Specifically, the structural property of multimodality is exploit-ed, so that the first- and second-order moment information of each mixture component is incorporated into the ambiguity set. A novel constraint tightening strategy is then developed based on an equivalent reformulation of distributionally ro-bust CVaR constraints over the proposed ambiguity set. As more data are gathered during the runtime of the controller, the ambiguity set is updated online using real-time disturbance data, which enables the risk-averse stochastic MPC to cope with time-varying disturbance distributions. The online variational inference algorithm employed does not require all collected data be learned from scratch, and therefore the proposed MPC is endowed with the guaranteed computational complexity of online learning. The guarantees on recursive feasibility and closed-loop stability of the proposed MPC are established via a safe update scheme. Numerical examples are used to illustrate the effectiveness and advantages of the proposed MPC. △ Less

Submitted 20 November, 2020; originally announced November 2020.

Journal ref: Automatica, Volume 125, March 2021, 109402

arXiv:1904.01934 [pdf]

doi 10.1016/j.compchemeng.2019.03.034

Optimization under Uncertainty in the Era of Big Data and Deep Learning: When Machine Learning Meets Mathematical Programming

Authors: Chao Ning, Fengqi You

Abstract: This paper reviews recent advances in the field of optimization under uncertainty via a modern data lens, highlights key research challenges and promise of data-driven optimization that organically integrates machine learning and mathematical programming for decision-making under uncertainty, and identifies potential research opportunities. A brief review of classical mathematical programming tech… ▽ More This paper reviews recent advances in the field of optimization under uncertainty via a modern data lens, highlights key research challenges and promise of data-driven optimization that organically integrates machine learning and mathematical programming for decision-making under uncertainty, and identifies potential research opportunities. A brief review of classical mathematical programming techniques for hedging against uncertainty is first presented, along with their wide spectrum of applications in Process Systems Engineering. A comprehensive review and classification of the relevant publications on data-driven distributionally robust optimization, data-driven chance constrained program, data-driven robust optimization, and data-driven scenario-based optimization is then presented. This paper also identifies fertile avenues for future research that focuses on a closed-loop data-driven optimization framework, which allows the feedback from mathematical programming to machine learning, as well as scenario-based optimization leveraging the power of deep learning techniques. Perspectives on online learning-based data-driven multistage optimization with a learning-while-optimizing scheme is presented. △ Less

Submitted 3 April, 2019; originally announced April 2019.

Journal ref: Comput. Chem. Eng., Volume 125, 9 June 2019, Pages 434-448

arXiv:1810.05931 [pdf]

doi 10.1016/j.automatica.2019.108802

A Transformation-Proximal Bundle Algorithm for Multistage Adaptive Robust Optimization and Application to Constrained Robust Optimal Control

Authors: Chao Ning, Fengqi You

Abstract: This paper presents a novel transformation-proximal bundle algorithm for multistage adaptive robust optimization problems. By partitioning recourse decisions into state and control decisions, the proposed algorithm applies affine control policy only to state decisions and allows control decisions to be fully adaptive, thus transforming the original problem into an equivalent two-stage Adaptive Rob… ▽ More This paper presents a novel transformation-proximal bundle algorithm for multistage adaptive robust optimization problems. By partitioning recourse decisions into state and control decisions, the proposed algorithm applies affine control policy only to state decisions and allows control decisions to be fully adaptive, thus transforming the original problem into an equivalent two-stage Adaptive Robust Optimization (ARO) problem. Importantly, this multi-to-two transformation is general enough to be employed with any two-stage ARO solution algorithms, thus opening a new avenue for a variety of multistage ARO algorithms. The proximal bundle method is developed for the resulting two-stage problem along with convergence analysis. In an inventory control application, the affine disturbance-feedback control policy suffers from a severe suboptimality with an average gap of 34.88%, while the proposed algorithm generates an average gap of merely 1.68%. △ Less

Submitted 29 December, 2019; v1 submitted 13 October, 2018; originally announced October 2018.

Journal ref: Automatica, Volume 113, March 2020, 108802

arXiv:1804.02738 [pdf, ps, other]

Instability of the solitary wave solutions for the generalized derivative nonlinear Schrödinger equation in the endpoint case

Authors: Bing Li, Cui Ning

Abstract: We consider the stability theory of solitary wave solutions for the generalized derivative nonlinear Schrödinger equation $$ i\partial_{t}u+\partial_{x}^{2}u+i|u|^{2σ}\partial_x u=0, $$ where $1<σ<2$. The equation has a two-parameter family of solitary wave solutions of the form… ▽ More We consider the stability theory of solitary wave solutions for the generalized derivative nonlinear Schrödinger equation $$ i\partial_{t}u+\partial_{x}^{2}u+i|u|^{2σ}\partial_x u=0, $$ where $1<σ<2$. The equation has a two-parameter family of solitary wave solutions of the form $$ u_{ω,c}(t,x)=e^{iωt+i\frac c2(x-ct)-\frac{i}{2σ+2}\int_{-\infty}^{x-ct}\varphi^{2σ}_{ω,c}(y)dy}\varphi_{ω,c}(x-ct). $$ The stability theory in the frequency region of $|c|<2\sqrtω$ was studied previously. In this paper, we prove the instability of the solitary wave solutions in the endpoint case $c=2\sqrtω$. △ Less

Submitted 8 April, 2018; originally announced April 2018.

Comments: 18

MSC Class: 35B35; 35L70

arXiv:1803.07700 [pdf, ps, other]

Instability of the solitary wave solutions for the genenalized derivative Nonlinear Schrödinger equation in the critical frequency case

Authors: Zihua Guo, Cui Ning, Yifei Wu

Abstract: We study the stability theory of solitary wave solutions for the generalized derivative nonlinear Schrödinger equation $$ i\partial_{t}u+\partial_{x}^{2}u+i|u|^{2σ}\partial_x u=0. $$ The equation has a two-parameter family of solitary wave solutions of the form \begin{align*} φ_{ω,c}(x)=\varphi_{ω,c}(x)\exp{\big\{ i\frac c2 x-\frac{i}{2σ+2}\int_{-\infty}^{x}\varphi^{2σ}_{ω,c}(y)dy\big\}}. \end{ali… ▽ More We study the stability theory of solitary wave solutions for the generalized derivative nonlinear Schrödinger equation $$ i\partial_{t}u+\partial_{x}^{2}u+i|u|^{2σ}\partial_x u=0. $$ The equation has a two-parameter family of solitary wave solutions of the form \begin{align*} φ_{ω,c}(x)=\varphi_{ω,c}(x)\exp{\big\{ i\frac c2 x-\frac{i}{2σ+2}\int_{-\infty}^{x}\varphi^{2σ}_{ω,c}(y)dy\big\}}. \end{align*} Here $ \varphi_{ω,c}$ is some real-valued function. It was proved in \cite{LiSiSu1} that the solitary wave solutions are stable if $-2\sqrt{ω}<c <2z_0\sqrt{ω}$, and unstable if $2z_0\sqrt{ω}<c <2\sqrt{ω}$ for some $z_0\in(0,1)$. We prove the instability at the borderline case $c =2z_0\sqrt{ω}$ for $1<σ<2$, improving the previous results in \cite{Fu-16-DNLS} where $3/2<σ<2$. △ Less

Submitted 20 March, 2018; originally announced March 2018.

Comments: The same result was obtained independently by Miao-Tang-Xu (paper appeared on arXiv on Mar 20, 2018) by different method. They used the third derivative of the energy around the solitary wave. Our method does not require higher regularity of the energy but constructs a delicate virial identity

arXiv:1707.09198 [pdf]

doi 10.1016/j.compchemeng.2017.12.015

Data-Driven Stochastic Robust Optimization: A General Computational Framework and Algorithm for Optimization under Uncertainty in the Big Data Era

Authors: Chao Ning, Fengqi You

Abstract: A novel data-driven stochastic robust optimization (DDSRO) framework is proposed for optimization under uncertainty leveraging labeled multi-class uncertainty data. Uncertainty data in large datasets are often collected from various conditions, which are encoded by class labels. Machine learning methods including Dirichlet process mixture model and maximum likelihood estimation are employed for un… ▽ More A novel data-driven stochastic robust optimization (DDSRO) framework is proposed for optimization under uncertainty leveraging labeled multi-class uncertainty data. Uncertainty data in large datasets are often collected from various conditions, which are encoded by class labels. Machine learning methods including Dirichlet process mixture model and maximum likelihood estimation are employed for uncertainty modeling. A DDSRO framework is further proposed based on the data-driven uncertainty model through a bi-level optimization structure. The outer optimization problem follows a two-stage stochastic programming approach to optimize the expected objective across different data classes; adaptive robust optimization is nested as the inner problem to ensure the robustness of the solution while maintaining computational tractability. A decomposition-based algorithm is further developed to solve the resulting multi-level optimization problem efficiently. Case studies on process network design and planning are presented to demonstrate the applicability of the proposed framework and algorithm. △ Less

Submitted 29 December, 2017; v1 submitted 28 July, 2017; originally announced July 2017.

Journal ref: Computers & Chemical Engineering, Volume 111, Pages 115-133, 4 March 2018,

Showing 1–9 of 9 results for author: Ning, C