-
Online Matching and Contention Resolution for Edge Arrivals with Vanishing Probabilities
Authors:
Will Ma,
Calum MacRury,
Pranav Nuti
Abstract:
We study the performance of sequential contention resolution and matching algorithms on random graphs with vanishing edge probabilities. When the edges of the graph are processed in an adversarially-chosen order, we derive a new OCRS that is $0.382$-selectable, attaining the "independence benchmark" from the literature under the vanishing edge probabilities assumption. Complementary to this positive result, we show that no OCRS can be more than $0.390$-selectable, significantly improving upon the upper bound of $0.428$ from the literature. We also derive negative results that are specialized to bipartite graphs or subfamilies of OCRS's. Meanwhile, when the edges of the graph are processed in a uniformly random order, we show that the simple greedy contention resolution scheme which accepts all active and feasible edges is $1/2$-selectable. This result is tight due to a known upper bound. Finally, when the algorithm can choose the processing order, we show that a slight tweak to the random order -- give each vertex a random priority and process edges in lexicographic order -- results in a strictly better contention resolution scheme that is $1-\ln(2-1/e)\approx0.510$-selectable. Our positive results also apply to online matching on $1$-uniform random graphs with vanishing (non-identical) edge probabilities, extending and unifying some results from the random graphs literature.
Submitted 20 June, 2024;
originally announced June 2024.
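
The greedy scheme described in the abstract above (accept every edge that is active and still feasible, with edges processed in uniformly random order) can be sketched as a small simulation. This is a minimal illustration and not the authors' code: the complete-graph instance with edge probability 1/(n-1), the function names, and the Monte Carlo estimator are all assumptions chosen for the example.

```python
import random


def complete_graph_instance(n):
    """Hypothetical test instance: complete graph on n vertices.

    Every edge is active with probability 1/(n-1), so the activation
    probabilities at each vertex sum to 1 and vanish as n grows.
    """
    edges = [(u, v) for u in range(n) for v in range(u + 1, n)]
    x = {e: 1.0 / (n - 1) for e in edges}
    return edges, x


def greedy_crs_random_order(edges, x, rng):
    """One run of the greedy scheme; returns (active, selected)."""
    # Each edge is active independently with probability x[e].
    active = [e for e in edges if rng.random() < x[e]]
    # Inactive edges are never accepted, so shuffling only the active
    # edges gives the same outcome as a random order over all edges.
    order = active[:]
    rng.shuffle(order)
    matched = set()
    selected = []
    for u, v in order:
        # Feasible means both endpoints are still unmatched.
        if u not in matched and v not in matched:
            selected.append((u, v))
            matched.update((u, v))
    return active, selected


def estimate_selectability(n, trials, seed=0):
    """Estimate Pr[edge (0, 1) selected | edge (0, 1) active]."""
    rng = random.Random(seed)
    edges, x = complete_graph_instance(n)
    times_active = 0
    times_selected = 0
    for _ in range(trials):
        active, selected = greedy_crs_random_order(edges, x, rng)
        if (0, 1) in active:
            times_active += 1
            if (0, 1) in selected:
                times_selected += 1
    if times_active == 0:
        return float("nan")
    return times_selected / times_active
```

Calling `estimate_selectability(n, trials)` for growing `n` gives an empirical conditional acceptance rate that can be compared against the $1/2$ guarantee stated in the abstract; the estimate is noisy for small `trials`.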
-
Inference under covariate-adaptive randomization with many strata
Authors:
Jiahui Xin,
Hanzhong Liu,
Wei Ma
Abstract:
Covariate-adaptive randomization is widely employed to balance baseline covariates in interventional studies such as clinical trials and experiments in development economics. Recent years have witnessed substantial progress in inference under covariate-adaptive randomization with a fixed number of strata. However, concerns have been raised about the impact of a large number of strata on its design and analysis, which is a common scenario in practice, such as in multicenter randomized clinical trials. In this paper, we propose a general framework for inference under covariate-adaptive randomization, which extends the seminal works of Bugni et al. (2018, 2019) by allowing for a diverging number of strata. Furthermore, we introduce a novel weighted regression adjustment that ensures efficiency improvement. On top of establishing the asymptotic theory, practical algorithms for handling situations involving an extremely large number of strata are also developed. Moreover, by linking design balance and inference robustness, we highlight the advantages of stratified block randomization, which enforces better covariate balance within strata compared to simple randomization. This paper offers a comprehensive landscape of inference under covariate-adaptive randomization, spanning from fixed to diverging to extremely large numbers of strata.
Submitted 29 May, 2024;
originally announced May 2024.
-
Extreme Point Pursuit -- Part II: Further Error Bound Analysis and Applications
Authors:
Junbin Liu,
Ya Liu,
Wing-Kin Ma,
Mingjie Shao,
Anthony Man-Cho So
Abstract:
In the first part of this study, a convex-constrained penalized formulation was studied for a class of constant modulus (CM) problems. In particular, the error bound techniques were shown to play a vital role in providing exact penalization results. In this second part of the study, we continue our error bound analysis for the cases of partial permutation matrices, size-constrained assignment matrices and non-negative semi-orthogonal matrices. We develop new error bounds and penalized formulations for these three cases, and the new formulations possess good structures for building computationally efficient algorithms. Moreover, we provide numerical results to demonstrate our framework in a variety of applications such as the densest k-subgraph problem, graph matching, size-constrained clustering, non-negative orthogonal matrix factorization and sparse fair principal component analysis.
Submitted 11 March, 2024;
originally announced March 2024.
-
Extreme Point Pursuit -- Part I: A Framework for Constant Modulus Optimization
Authors:
Junbin Liu,
Ya Liu,
Wing-Kin Ma,
Mingjie Shao,
Anthony Man-Cho So
Abstract:
This study develops a framework for a class of constant modulus (CM) optimization problems, which covers binary constraints, discrete phase constraints, semi-orthogonal matrix constraints, non-negative semi-orthogonal matrix constraints, and several types of binary assignment constraints. Capitalizing on the basic principles of concave minimization and error bounds, we study a convex-constrained penalized formulation for general CM problems. The advantage of such a formulation is that it allows us to leverage non-convex optimization techniques, such as the simple projected gradient method, to build algorithms. As the first part of this study, we explore the theory of this framework. We study conditions under which the formulation provides exact penalization results. We also examine computational aspects relating to the use of the projected gradient method for each type of CM constraint. Our study suggests that the proposed framework has a broad scope of applicability.
Submitted 11 March, 2024;
originally announced March 2024.
-
Online Contention Resolution Schemes for Network Revenue Management and Combinatorial Auctions
Authors:
Will Ma,
Calum MacRury,
**gwei Zhang
Abstract:
In the Network Revenue Management (NRM) problem, products composed of up to L resources are sold to stochastically arriving customers. We take a randomized rounding approach to NRM, motivated by developments in Online Contention Resolution Schemes (OCRS). The goal is to take a fractional solution to NRM that satisfies the resource constraints in expectation, and implement it in an online policy that satisfies the resource constraints in any state, while (approximately) preserving all of the sales that were prescribed by the fractional solution.
OCRS cannot be naively applied to NRM or revenue management problems in general, because customer substitution induces a negative correlation in products being demanded. We start by deriving an OCRS that achieves a guarantee of 1/(1+L) for NRM with customer substitution, matching a common benchmark in the literature. We then show how to beat this benchmark for all integers L>1 assuming no substitution, i.e., in the standard OCRS setting. By contrast, we show that this benchmark is unbeatable using OCRS or any fractional relaxation if there is customer substitution, for all integers L that are the power of a prime number. Finally, we show how to beat 1/(1+L) even with customer substitution, if the products comprise one item from each of up to L groups.
Our results have corresponding implications for Online Combinatorial Auctions, in which buyers bid for bundles of up to L items, and buyers being single-minded is akin to no substitution. Our final result also beats 1/(1+L) for Prophet Inequality on the intersection of L partition matroids. All in all, our paper provides a unifying framework for applying OCRS to these problems, delineating the impact of substitution, and establishing a separation between the guarantees achievable with vs. without substitution under general resource constraints parametrized by L.
Submitted 8 March, 2024;
originally announced March 2024.
-
A unified framework for covariate adjustment under stratified randomization
Authors:
Fuyi Tu,
Wei Ma,
Hanzhong Liu
Abstract:
Randomization, as a key technique in clinical trials, can eliminate sources of bias and produce comparable treatment groups. In randomized experiments, the treatment effect is a parameter of general interest. Researchers have explored the validity of using linear models to estimate the treatment effect and perform covariate adjustment and thus improve the estimation efficiency. However, the relationship between covariates and outcomes is not necessarily linear, and is often intricate. Advances in statistical theory and related computer technology allow us to use nonparametric and machine learning methods to better estimate the relationship between covariates and outcomes and thus obtain further efficiency gains. However, theoretical studies on how to draw valid inferences when using nonparametric and machine learning methods under stratified randomization are yet to be conducted. In this paper, we discuss a unified framework for covariate adjustment and corresponding statistical inference under stratified randomization and present a detailed proof of the validity of using local linear kernel-weighted least squares regression for covariate adjustment in treatment effect estimators as a special case. In the case of high-dimensional data, we additionally propose an algorithm for statistical inference using machine learning methods under stratified randomization, which makes use of sample splitting to alleviate the requirements on the asymptotic properties of machine learning methods. Finally, we compare the performances of treatment effect estimators using different machine learning methods by considering various data generation scenarios, to guide practical research.
Submitted 2 December, 2023;
originally announced December 2023.
-
Multiple Testing of Linear Forms for Noisy Matrix Completion
Authors:
Wanteng Ma,
Lilun Du,
Dong Xia,
Ming Yuan
Abstract:
Many important tasks of large-scale recommender systems can be naturally cast as testing multiple linear forms for noisy matrix completion. These problems, however, present unique challenges because of the subtle bias-and-variance tradeoff and the intricate dependence among the estimated entries induced by the low-rank structure. In this paper, we develop a general approach to overcome these difficulties by introducing new statistics for individual tests with sharp asymptotics both marginally and jointly, and utilizing them to control the false discovery rate (FDR) via a data splitting and symmetric aggregation scheme. We show that valid FDR control can be achieved with guaranteed power under nearly optimal sample size requirements using the proposed methodology. Extensive numerical simulations and real data examples are also presented to further illustrate its practical merits.
Submitted 30 November, 2023;
originally announced December 2023.
-
Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning Algorithms
Authors:
Xiangyuan Zhang,
Weichao Mao,
Saviz Mowlavi,
Mouhacine Benosman,
Tamer Başar
Abstract:
We introduce controlgym, a library of thirty-six industrial control settings, and ten infinite-dimensional partial differential equation (PDE)-based control problems. Integrated within the OpenAI Gym/Gymnasium (Gym) framework, controlgym allows direct applications of standard reinforcement learning (RL) algorithms like stable-baselines3. Our control environments complement those in Gym with continuous, unbounded action and observation spaces, motivated by real-world control applications. Moreover, the PDE control environments uniquely allow the users to extend the state dimensionality of the system to infinity while preserving the intrinsic dynamics. This feature is crucial for evaluating the scalability of RL algorithms for control. This project serves the learning for dynamics & control (L4DC) community, aiming to explore key questions: the convergence of RL algorithms in learning control policies; the stability and robustness issues of learning-based controllers; and the scalability of RL algorithms to high- and potentially infinite-dimensional systems. We open-source the controlgym project at https://github.com/xiangyuan-zhang/controlgym.
Submitted 23 April, 2024; v1 submitted 30 November, 2023;
originally announced November 2023.
-
Interaction tests with covariate-adaptive randomization
Authors:
Likun Zhang,
Wei Ma
Abstract:
Treatment-covariate interaction tests are commonly applied by researchers to examine whether the treatment effect varies across patient subgroups defined by baseline characteristics. The objective of this study is to explore treatment-covariate interaction tests involving covariate-adaptive randomization. Without assuming a parametric data generating model, we investigate usual interaction tests and observe that they tend to be conservative: specifically, their limiting rejection probabilities under the null hypothesis do not exceed the nominal level and are typically strictly lower than it. To address this problem, we propose modifications to the usual tests to obtain corresponding valid tests. Moreover, we introduce a novel class of stratified-adjusted interaction tests that are simple, more powerful than the usual and modified tests, and broadly applicable to most covariate-adaptive randomization methods. The results are general to encompass two types of interaction tests: one involving stratification covariates and the other involving additional covariates that are not used for randomization. Our study clarifies the application of interaction tests in clinical trials and offers valuable tools for revealing treatment heterogeneity, crucial for advancing personalized medicine.
Submitted 10 March, 2024; v1 submitted 29 November, 2023;
originally announced November 2023.
-
Random-order Contention Resolution via Continuous Induction: Tightness for Bipartite Matching under Vertex Arrivals
Authors:
Calum MacRury,
Will Ma
Abstract:
We introduce a new approach for designing Random-order Contention Resolution Schemes (RCRS) via exact solution in continuous time. Given a function $c(y):[0,1] \rightarrow [0,1]$, we show how to select each element which arrives at time $y \in [0,1]$ with probability exactly $c(y)$. We provide a rigorous algorithmic framework for achieving this, which discretizes the time interval and also needs to sample its past execution to ensure these exact selection probabilities. We showcase our framework in the context of online contention resolution schemes for matching with random-order vertex arrivals. For bipartite graphs with two-sided arrivals, we design a $(1+e^{-2})/2 \approx 0.567$-selectable RCRS, which we also show to be tight. Next, we show that the presence of short odd-length cycles is the only barrier to attaining a (tight) $(1+e^{-2})/2$-selectable RCRS on general graphs. By generalizing our bipartite RCRS, we design an RCRS for graphs with odd-length girth $g$ which is $(1+e^{-2})/2$-selectable as $g \rightarrow \infty$. This convergence happens very rapidly: for triangle-free graphs (i.e., $g \ge 5$), we attain a $121/240 + 7/(16 e^2) \approx 0.563$-selectable RCRS. Finally, for general graphs we improve on the $8/15 \approx 0.533$-selectable RCRS of Fu et al. (ICALP, 2021) and design an RCRS which is at least $0.535$-selectable. Due to the reduction of Ezra et al. (EC, 2020), our bounds yield a $0.535$-competitive (respectively, $(1+e^{-2})/2$-competitive) algorithm for prophet secretary matching on general (respectively, bipartite) graphs under vertex arrivals.
Submitted 12 December, 2023; v1 submitted 16 October, 2023;
originally announced October 2023.
-
Multi-Agent Search for a Moving and Camouflaging Target
Authors:
Miguel Lejeune,
Johannes O. Royset,
Wenbo Ma
Abstract:
In multi-agent search planning for a randomly moving and camouflaging target, we examine heterogeneous searchers that differ in terms of their endurance level, travel speed, and detection ability. This leads to a convex mixed-integer nonlinear program, which we reformulate using three linearization techniques. We develop preprocessing steps, outer approximations via lazy constraints, and bundle-based cutting plane methods to address large-scale instances. Further specializations emerge when the target moves according to a Markov chain. We carry out an extensive numerical study to show the computational efficiency of our methods and to derive insights regarding which approach should be favored for which type of problem instance.
Submitted 1 November, 2023; v1 submitted 5 September, 2023;
originally announced September 2023.
-
Dynamic Pricing for Reusable Resources: The Power of Two Prices
Authors:
Santiago R. Balseiro,
Will Ma,
Wenxin Zhang
Abstract:
Motivated by real-world applications such as rental and cloud computing services, we investigate pricing for reusable resources. We consider a system where a single resource with a fixed number of identical copies serves customers with heterogeneous willingness-to-pay (WTP), and the usage duration distribution is general. Optimal dynamic policies are computationally intractable when usage durations are not memoryless, so existing literature has focused on static pricing, whose steady-state reward rate converges to optimality at rate $\mathcal{O}(c^{-1/2})$ when supply and demand scale with $c$. We show, however, that this convergence rate is suboptimal, and propose a class of dynamic "stock-dependent" policies that 1) preserves computational tractability and 2) has a steady-state reward rate converging to optimality faster than $c^{-1/2}$. We characterize the tight convergence rate for stock-dependent policies and show that it can in fact be achieved by a simple two-price policy, which sets a higher price when the stock is below some threshold and a lower price otherwise. Finally, we demonstrate that this "minimally dynamic" class of two-price policies performs well numerically, even in non-asymptotic settings, suggesting that a little dynamicity can go a long way.
Submitted 26 August, 2023;
originally announced August 2023.
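
The two-price rule in the abstract above (a higher price when the stock is below a threshold, a lower price otherwise) can be illustrated with a small discrete-event simulation. This is only a sketch under assumptions that are not taken from the paper: Poisson arrivals, WTP drawn uniformly from [0, 1], exponential usage durations, and a one-time fee collected per rental. The names `two_price` and `simulate_reward_rate` are hypothetical.

```python
import heapq
import random


def two_price(stock, threshold, p_low, p_high):
    """Charge p_high when stock is below the threshold, else p_low."""
    return p_high if stock < threshold else p_low


def simulate_reward_rate(c, threshold, p_low, p_high,
                         arrival_rate, mean_usage, horizon, seed=0):
    """Average revenue per unit time over [0, horizon].

    Assumptions (illustrative only): Poisson arrivals, WTP ~ Uniform[0, 1],
    exponential usage durations, one-time fee per rental.
    """
    rng = random.Random(seed)
    t = 0.0
    stock = c
    revenue = 0.0
    returns = []  # min-heap of times at which rented units come back
    while True:
        t += rng.expovariate(arrival_rate)  # next customer arrival
        if t > horizon:
            break
        # Return every unit whose usage has finished by time t.
        while returns and returns[0] <= t:
            heapq.heappop(returns)
            stock += 1
        if stock == 0:
            continue  # no unit available, customer is lost
        price = two_price(stock, threshold, p_low, p_high)
        wtp = rng.random()
        if wtp >= price:  # customer rents only if WTP covers the price
            revenue += price
            stock -= 1
            heapq.heappush(returns, t + rng.expovariate(1.0 / mean_usage))
    return revenue / horizon
```

Setting `p_low == p_high` recovers a static price, so the same function can be used to compare a static policy with a two-price policy on identical random seeds.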
-
Data-driven Approximation of Distributionally Robust Chance Constraints using Bayesian Credible Intervals
Authors:
Zhi** Chen,
Wentao Ma,
Bingbing Ji
Abstract:
The non-convexity and intractability of distributionally robust chance constraints make them challenging to cope with. From a data-driven perspective, we propose formulating such a constraint as a robust optimization problem to ensure that the distributionally robust chance constraint is satisfied with high probability. To incorporate available data and prior distribution knowledge, we construct ambiguity sets for the distributionally robust chance constraint using Bayesian credible intervals. We establish the congruent relationship between the ambiguity set in Bayesian distributionally robust chance constraints and the uncertainty set in a specific robust optimization. In contrast to most existing uncertainty set construction methods, which are only applicable to particular settings, our approach provides a unified framework for constructing uncertainty sets under different marginal distribution assumptions, thus making it more flexible and widely applicable. Additionally, under the concavity assumption, our method provides strong finite sample probability guarantees for optimal solutions. The practicality and effectiveness of our approach are illustrated with numerical experiments on portfolio management and queuing system problems. Overall, our approach offers a promising solution to distributionally robust chance constrained problems and has potential applications in other fields.
Submitted 22 June, 2023;
originally announced June 2023.
-
Decay of geometry for a class of cubic polynomials
Authors:
Haoyang Ji,
Wenxiu Ma
Abstract:
In this paper we study a class of bimodal cubic polynomials whose critical points have the same $\omega$-limit set, which is an invariant Cantor set. These maps have generalized Fibonacci combinatorics in terms of generalized renormalization on the twin principal nest. It is proved that such maps possess `decay of geometry' in the sense that the scaling factor of the twin principal nest decreases at least exponentially fast. As an application, we prove that they have no Cantor attractor.
Submitted 5 July, 2023; v1 submitted 20 April, 2023;
originally announced April 2023.
-
Efficient Solution of Bimaterial Riemann Problems for Compressible Multi-Material Flow Simulations
Authors:
Wentao Ma,
Xuning Zhao,
Shafquat Islam,
Aditya Narkhede,
Kevin Wang
Abstract:
When solving compressible multi-material flow problems, an unresolved challenge is the computation of advective fluxes across material interfaces that separate drastically different thermodynamic states and relations. A popular idea in this regard is to locally construct bimaterial Riemann problems, and to apply their exact solutions in flux computation. For general equations of state, however, finding the exact solution of a Riemann problem is expensive as it requires nested loops. Multiplied by the large number of Riemann problems constructed during a simulation, the computational cost often becomes prohibitive. The work presented in this paper aims to accelerate the solution of bimaterial Riemann problems without introducing approximations or offline precomputation tasks. The basic idea is to exploit some special properties of the Riemann problem equations, and to recycle previous solutions as much as possible. Following this idea, four acceleration methods are developed, including (1) a change of integration variable through rarefaction fans, (2) storing and reusing integration trajectory data, (3) step size adaptation, and (4) constructing an R-tree on the fly to generate initial guesses. The performance of these acceleration methods is assessed using four example problems in underwater explosion, laser-induced cavitation, and hypervelocity impact. These problems exhibit strong shock waves, large interface deformation, contact of multiple (>2) interfaces, and interaction between gases and condensed matter. In these challenging cases, the solution of bimaterial Riemann problems is accelerated by 37 to 87 times. As a result, the total cost of advective flux computation, which includes the exact Riemann problem solution at material interfaces and the numerical flux calculation over the entire computational domain, is accelerated by 18 to 81 times.
Submitted 22 August, 2023; v1 submitted 15 March, 2023;
originally announced March 2023.
-
Noda Iteration for Computing Generalized Tensor Eigenpairs
Authors:
Wanli Ma,
Weiyang Ding,
Yimin Wei
Abstract:
In this paper, we propose the tensor Noda iteration (NI) and its inexact version for solving the eigenvalue problem of a particular class of tensor pairs called generalized $\mathcal{M}$-tensor pairs. A generalized $\mathcal{M}$-tensor pair consists of a weakly irreducible nonnegative tensor and a nonsingular $\mathcal{M}$-tensor within a linear combination. It is shown that any generalized $\mathcal{M}$-tensor pair admits a unique positive generalized eigenvalue with a positive eigenvector. A modified tensor Noda iteration (MTNI) is developed, extending the Noda iteration for nonnegative matrix eigenproblems. In addition, the inexact generalized tensor Noda iteration method (IGTNI) and the generalized Newton-Noda iteration method (GNNI) are also introduced for more efficient implementations and faster convergence. Under a mild assumption on the initial values, the convergence of these algorithms is guaranteed. The efficiency of these algorithms is illustrated by numerical experiments.
Submitted 2 March, 2023;
originally announced March 2023.
-
From Contextual Data to Newsvendor Decisions: On the Actual Performance of Data-Driven Algorithms
Authors:
Omar Besbes,
Will Ma,
Omar Mouchtaki
Abstract:
In this work, we explore a framework for contextual decision-making to study how the relevance and quantity of past data affects the performance of a data-driven policy. We analyze a contextual Newsvendor problem in which a decision-maker needs to trade-off between an underage and an overage cost in the face of uncertain demand. We consider a setting in which past demands observed under ``close by…
▽ More
In this work, we explore a framework for contextual decision-making to study how the relevance and quantity of past data affects the performance of a data-driven policy. We analyze a contextual Newsvendor problem in which a decision-maker needs to trade-off between an underage and an overage cost in the face of uncertain demand. We consider a setting in which past demands observed under ``close by'' contexts come from close by distributions and analyze the performance of data-driven algorithms through a notion of context-dependent worst-case expected regret. We analyze the broad class of Weighted Empirical Risk Minimization (WERM) policies which weigh past data according to their similarity in the contextual space. This class includes classical policies such as ERM, k-Nearest Neighbors and kernel-based policies. Our main methodological contribution is to characterize exactly the worst-case regret of any WERM policy on any given configuration of contexts. To the best of our knowledge, this provides the first understanding of tight performance guarantees in any contextual decision-making problem, with past literature focusing on upper bounds via concentration inequalities. We instead take an optimization approach, and isolate a structure in the Newsvendor loss function that allows to reduce the infinite-dimensional optimization problem over worst-case distributions to a simple line search.
This in turn allows us to unveil fundamental insights that were obfuscated by previous general-purpose bounds. We characterize actual guaranteed performance as a function of the contexts, as well as granular insights on the learning curve of algorithms.
Submitted 27 July, 2023; v1 submitted 16 February, 2023;
originally announced February 2023.
-
Theory of generating spaces of convex sets and their applications to solvability of convex programs in Banach spaces
Authors:
Lixin Cheng,
Weihao Mao
Abstract:
When optimization theorists consider optimization problems in infinite dimensional spaces, they need to deal with closed convex subsets (usually cones) which mostly have empty interior. These subsets often prevent optimization theorists from applying powerful techniques to study these optimization problems. In this paper, using nonsupport points, we present generating spaces, which are defined relative to a Banach space and a nonsupport point of a closed convex subset of it. Then, for optimization problems in infinite dimensional spaces, in some general cases, we replace the original spaces by generating spaces that still contain the solutions. Thus this method enables us to apply powerful classical techniques to optimization problems in a very general class of infinite dimensional spaces. Based on functional analysis, from classical Banach spaces to separable Banach spaces, from Banach lattice to latticization, we give characterizations of generating spaces and conclude that they are actually linearly isometric to $L_\infty$ ($\ell_\infty$) or their closed subspaces. Thus the continuous linear functionals involved in these techniques can be chosen from $L_\infty^*$ ($\ell_\infty^*$). After that, applications to the penalty principle, Lagrange duality and scalarization functions are further studied using this method.
Submitted 18 October, 2022;
originally announced October 2022.
-
Degeneracy is OK: Logarithmic Regret for Network Revenue Management with Indiscrete Distributions
Authors:
Jiashuo Jiang,
Will Ma,
Jiawei Zhang
Abstract:
We study the classical Network Revenue Management (NRM) problem with accept/reject decisions and $T$ IID arrivals. We consider a distributional form where each arrival must fall under a finite number of possible categories, each with a deterministic resource consumption vector, but a random value distributed continuously over an interval. We develop an online algorithm that achieves $O(\log^2 T)$ regret under this model, with the only (necessary) assumption being that the probability densities are bounded away from 0. We derive a second result that achieves $O(\log T)$ regret under an additional assumption of second-order growth. To our knowledge, these are the first results achieving logarithmic-level regret in an NRM model with continuous values that do not require any kind of ``non-degeneracy'' assumptions. Our results are achieved via new techniques including a new method of bounding myopic regret, a ``semi-fluid'' relaxation of the offline allocation, and an improved bound on the ``dual convergence''.
Submitted 8 February, 2024; v1 submitted 14 October, 2022;
originally announced October 2022.
-
On (Random-order) Online Contention Resolution Schemes for the Matching Polytope of (Bipartite) Graphs
Authors:
Calum MacRury,
Will Ma,
Nathaniel Grammel
Abstract:
Online Contention Resolution Schemes (OCRS's) represent a modern tool for selecting a subset of elements, subject to resource constraints, when the elements are presented to the algorithm sequentially. OCRS's have led to some of the best-known competitive ratio guarantees for online resource allocation problems, with the added benefit of treating different online decisions -- accept/reject, probing, pricing -- in a unified manner. This paper analyzes OCRS's for resource constraints defined by matchings in graphs, a fundamental structure in combinatorial optimization. We consider two dimensions of variants: the elements being presented in adversarial or random order; and the graph being bipartite or general. We improve the state of the art for all combinations of variants, both in terms of algorithmic guarantees and impossibility results. Some of our algorithmic guarantees are best-known even compared to Contention Resolution Schemes that can choose the order of arrival or are offline. All in all, our results for OCRS directly improve the best-known competitive ratios for online accept/reject, probing, and pricing problems on graphs in a unified manner.
Submitted 1 April, 2024; v1 submitted 15 September, 2022;
originally announced September 2022.
-
Optimal Regularized Online Allocation by Adaptive Re-Solving
Authors:
Wanteng Ma,
Ying Cao,
Danny H. K. Tsang,
Dong Xia
Abstract:
This paper introduces a dual-based algorithm framework for solving the regularized online resource allocation problems, which have potentially non-concave cumulative rewards, hard resource constraints, and a non-separable regularizer. Under a strategy of adaptively updating the resource constraints, the proposed framework only requests approximate solutions to the empirical dual problems up to a certain accuracy and yet delivers an optimal logarithmic regret under a locally second-order growth condition. Surprisingly, a delicate analysis of the dual objective function enables us to eliminate the notorious log-log factor in regret bound. The flexible framework renders renowned and computationally fast algorithms immediately applicable, e.g., dual stochastic gradient descent. Additionally, an infrequent re-solving scheme is proposed, which significantly reduces computational demands without compromising the optimal regret performance. A worst-case square-root regret lower bound is established if the resource constraints are not adaptively updated during dual optimization, which underscores the critical role of adaptive dual variable update. Comprehensive numerical experiments demonstrate the merits of the proposed algorithm framework.
Submitted 15 July, 2023; v1 submitted 1 September, 2022;
originally announced September 2022.
-
Drone-Delivery Network for Opioid Overdose -- Nonlinear Integer Queueing-Optimization Models and Methods
Authors:
Miguel Lejeune,
Wenbo Ma
Abstract:
We propose a new stochastic emergency network design model that uses a fleet of drones to quickly deliver naloxone in response to opioid overdoses. The network is represented as a collection of M/G/K queuing systems in which the capacity K of each system is a decision variable and the service time is modelled as a decision-dependent random variable. The model is an optimization-based queuing problem which locates fixed (drone bases) and mobile (drones) servers and determines the drone dispatching decisions, and takes the form of a nonlinear integer problem, which is intractable in its original form. We develop an efficient reformulation and algorithmic framework. Our approach reformulates the multiple nonlinearities (fractional, polynomial, exponential, factorial terms) to give a mixed-integer linear programming (MILP) formulation. We demonstrate its generalizability and show that the problem of minimizing the average response time of a network of M/G/K queuing systems with unknown capacity K is always MILP-representable. We design two algorithms and demonstrate that the outer approximation branch-and-cut method is the most efficient and scales well. The analysis based on real-life overdose data reveals that, in Virginia Beach, drones can: 1) decrease the response time by 78%, 2) increase the survival chance by 432%, 3) save up to 34 additional lives per year, and 4) provide annually up to 287 additional quality-adjusted life years.
Submitted 25 January, 2024; v1 submitted 28 June, 2022;
originally announced June 2022.
-
Online Bipartite Matching with Advice: Tight Robustness-Consistency Tradeoffs for the Two-Stage Model
Authors:
Billy Jin,
Will Ma
Abstract:
We study the two-stage vertex-weighted online bipartite matching problem of Feng, Niazadeh, and Saberi (SODA 2021) in a setting where the algorithm has access to a suggested matching that is recommended in the first stage. We evaluate an algorithm by its robustness $R$, which is its performance relative to that of the optimal offline matching, and its consistency $C$, which is its performance when the advice or the prediction given is correct. We characterize for this problem the Pareto-efficient frontier between robustness and consistency, which is rare in the literature on advice-augmented algorithms, yet necessary for quantifying such an algorithm to be optimal. Specifically, we propose an algorithm that is $R$-robust and $C$-consistent for any $(R,C)$ with $0 \leq R \leq \frac{3}{4}$ and $\sqrt{1-R} + \sqrt{1-C} = 1$, and prove that no other algorithm can achieve a better tradeoff.
Submitted 23 November, 2022; v1 submitted 22 June, 2022;
originally announced June 2022.
-
Beyond IID: data-driven decision-making in heterogeneous environments
Authors:
Omar Besbes,
Will Ma,
Omar Mouchtaki
Abstract:
How should one leverage historical data when past observations are not perfectly indicative of the future, e.g., due to the presence of unobserved confounders which one cannot "correct" for? Motivated by this question, we study a data-driven decision-making framework in which historical samples are generated from unknown and different distributions assumed to lie in a heterogeneity ball with known radius and centered around the (also) unknown future (out-of-sample) distribution on which the performance of a decision will be evaluated. This work aims at analyzing the performance of central data-driven policies but also near-optimal ones in these heterogeneous environments and understanding key drivers of performance. We establish a first result which allows us to upper bound the asymptotic worst-case regret of a broad class of policies. Leveraging this result, for any integral probability metric, we provide a general analysis of the performance achieved by Sample Average Approximation (SAA) as a function of the radius of the heterogeneity ball. This analysis is centered around the approximation parameter, a notion of complexity we introduce to capture how the interplay between the heterogeneity and the problem structure impacts the performance of SAA. In turn, we illustrate through several widely-studied problems -- e.g., newsvendor, pricing -- how this methodology can be applied and find that the performance of SAA varies considerably depending on the combinations of problem classes and heterogeneity. The failure of SAA for certain instances motivates the design of alternative policies to achieve rate-optimality. We derive problem-dependent policies achieving strong guarantees for the illustrative problems described above and provide initial results towards a principled approach for the design and analysis of general rate-optimal algorithms.
Submitted 19 June, 2024; v1 submitted 20 June, 2022;
originally announced June 2022.
-
Estimating probabilistic dynamic origin-destination demands using multi-day traffic data on computational graphs
Authors:
Wei Ma,
Sean Qian
Abstract:
System-level decision making in transportation needs to understand day-to-day variation of network flows, which calls for accurate modeling and estimation of probabilistic dynamic travel demand on networks. Most existing studies estimate deterministic dynamic origin-destination (OD) demand, while the day-to-day variation of demand and flow is overlooked. Estimating probabilistic distributions of dynamic OD demand is challenging due to the complexity of the spatio-temporal networks and the computational intensity of the high-dimensional problems. With the availability of massive traffic data and the emergence of advanced computational methods, this paper develops a data-driven framework that solves the probabilistic dynamic origin-destination demand estimation (PDODE) problem using multi-day data. Different statistical distances (e.g., lp-norm, Wasserstein distance, KL divergence, Bhattacharyya distance) are used and compared to measure the gap between the estimated and the observed traffic conditions, and it is found that 2-Wasserstein distance achieves a balanced accuracy in estimating both mean and standard deviation. The proposed framework is cast into the computational graph and a reparametrization trick is developed to estimate the mean and standard deviation of the probabilistic dynamic OD demand simultaneously. We demonstrate the effectiveness and efficiency of the proposed PDODE framework on both small and real-world networks. In particular, it is demonstrated that the proposed PDODE framework can mitigate the overfitting issues by considering the demand variation. Overall, the developed PDODE framework provides a practical tool for public agencies to understand the sources of demand stochasticity, evaluate day-to-day variation of network flow, and make reliable decisions for intelligent transportation systems.
Submitted 20 April, 2022;
originally announced April 2022.
-
Optimization of bus scheduling and bus-berth matching at curbside stops under connected vehicle environment
Authors:
Wanjing Ma,
Shiqi Ou,
Chunhui Yu
Abstract:
It is commonly seen that buses are blocked by the ones in front serving passengers and have to queue outside a curbside bus stop although there are vacant berths at the stop. The resultant bus delays degrade the service level of urban public transportation. A potential solution is to reschedule the arrivals of the buses at the stop for full utilization of the berths with the aid of connected vehicle technologies. This study proposes a mixed-integer linear programming model to optimize the scheduling of bus arrivals and the bus-berth matching at a curbside stop under connected vehicle environment. The objective is the minimization of the bus delays weighted by the number of passengers on the buses. Bus arrival times at the stop and the assignment of berths are optimized together with bus departure times from the stop. Bus punctuality is also taken into consideration. The proposed model could be applied dynamically to cater to time-varying traffic conditions. Numerical studies validate the advantages of the proposed model over the first-come-first-service strategy and the relaxed model without bus punctuality in terms of weighted bus delays and bus punctuality. Sensitivity analyses show that: 1) the proposed model is robust to the fluctuation of bus service time; and 2) a smaller number of berths may be preferred on condition that the bus demand does not exceed the stop capacity.
Submitted 29 November, 2021; v1 submitted 22 June, 2021;
originally announced June 2021.
-
Optimizing for Strategy Diversity in the Design of Video Games
Authors:
Oussama Hanguir,
Will Ma,
Christopher Thomas Ryan,
Jiangze Han
Abstract:
We consider the problem of designing a linear program that has diverse solutions as the right-hand side varies. This problem arises in video game settings where designers aim to have players use different "weapons" or "tactics" as they progress. We model this design question as a choice over the constraint matrix $A$ and cost vector $c$ to maximize the number of possible \emph{supports} of unique optimal solutions (what we call "loadouts") of Linear Programs $\max\{c^\top x \mid Ax \le b, x \ge 0\}$ with nonnegative data considered over all resource vectors $b$. We provide an upper bound on the optimal number of loadouts and provide a family of constructions that have an asymptotically optimal number of loadouts. The upper bound is based on a connection between our problem and the study of triangulations of point sets arising from polyhedral combinatorics, and specifically the combinatorics of the cyclic polytope. Our asymptotically optimal construction also draws inspiration from the properties of the cyclic polytope.
Submitted 30 June, 2024; v1 submitted 22 June, 2021;
originally announced June 2021.
-
The convergence rate of of multivariate operators on simplex in Orlicz space
Authors:
Wan Ma,
Lihong Chang,
Yongxia Qiang
Abstract:
The approximation of functions in Orlicz space by multivariate operators on simplex is considered. The convergence rate is given by using modulus of smoothness.
Submitted 21 January, 2021;
originally announced January 2021.
-
A $\dbar$-steepest descent method for oscillatory Riemann-Hilbert problems
Authors:
Fudong Wang,
Wen-Xiu Ma
Abstract:
We study the asymptotic behavior of Riemann-Hilbert problems (RHP) arising in the AKNS hierarchy of integrable equations. Our analysis is based on the $\dbar$-steepest descent method. We consider RHPs arising from the inverse scattering transform of the AKNS hierarchy with $H^{1,1}(\R)$ initial data. The analysis will be divided into three regions: fast decay region, oscillating region and self-similarity region (the Painlevé region). The resulting formulas can be directly applied to study the long-time asymptotic of the solutions of integrable equations such as NLS, mKdV and their higher-order generalizations.
Submitted 11 June, 2021; v1 submitted 28 November, 2020;
originally announced November 2020.
-
A general theory of regression adjustment for covariate-adaptive randomization: OLS, Lasso, and beyond
Authors:
Hanzhong Liu,
Fuyi Tu,
Wei Ma
Abstract:
We consider the problem of estimating and inferring treatment effects in randomized experiments. In practice, stratified randomization, or more generally, covariate-adaptive randomization, is routinely used in the design stage to balance the treatment allocations with respect to a few variables that are most relevant to the outcomes. Then, regression is performed in the analysis stage to adjust the remaining imbalances to yield more efficient treatment effect estimators. Building upon and unifying the recent results obtained for ordinary least squares adjusted estimators under covariate-adaptive randomization, this paper presents a general theory of regression adjustment that allows for arbitrary model misspecification and the presence of a large number of baseline covariates. We exemplify the theory on two Lasso-adjusted treatment effect estimators, both of which are optimal in their respective classes. In addition, nonparametric consistent variance estimators are proposed to facilitate valid inferences, which work irrespective of the specific randomization methods used. The robustness and improved efficiency of the proposed estimators are demonstrated through a simulation study and a clinical trial example. This study sheds light on improving treatment effect estimation efficiency by implementing machine learning methods in covariate-adaptive randomized experiments.
Submitted 19 November, 2020;
originally announced November 2020.
-
Batalin--Vilkovisky algebra structures on the Hochschild cohomology of generalized Weyl algebras
Authors:
Liyu Liu,
Wen Ma
Abstract:
This paper is devoted to the calculation of Batalin-Vilkovisky algebra structures on the Hochschild cohomology of skew Calabi-Yau generalized Weyl algebras. We first establish a Van den Bergh duality at the level of complexes. Then, based on the results of Solotar et al., we apply Kowalzig and Krähmer's method to the Hochschild homology of generalized Weyl algebras, and translate the homological information into cohomological information by virtue of the Van den Bergh duality, obtaining the desired Batalin-Vilkovisky algebra structures. Finally, we apply our results to quantum weighted projective lines and Podleś quantum spheres, and the Batalin-Vilkovisky algebra structures for them are described completely.
Submitted 12 October, 2021; v1 submitted 12 September, 2020;
originally announced September 2020.
-
Testing for Treatment Effect in Covariate-Adaptive Randomized Clinical Trials with Generalized Linear Models and Omitted Covariates
Authors:
Li Yang,
Wei Ma,
Yichen Qin,
Feifang Hu
Abstract:
Concerns have been expressed over the validity of statistical inference under covariate-adaptive randomization despite its extensive use in clinical trials. In the literature, the inferential properties under covariate-adaptive randomization have been mainly studied for continuous responses; in particular, it is well known that the usual two sample t-test for treatment effect is typically conservative, in the sense that the actual test size is smaller than the nominal level. This phenomenon of invalid tests has also been found for generalized linear models without adjusting for the covariates and is sometimes more worrisome due to inflated Type I error. The purpose of this study is to examine the unadjusted test for treatment effect under generalized linear models and covariate-adaptive randomization. For a large class of covariate-adaptive randomization methods, we obtain the asymptotic distribution of the test statistic under the null hypothesis and derive the conditions under which the test is conservative, valid, or anti-conservative. Several commonly used generalized linear models, such as logistic regression and Poisson regression, are discussed in detail. An adjustment method is also proposed to achieve a valid size based on the asymptotic results. Numerical studies confirm the theoretical findings and demonstrate the effectiveness of the proposed adjustment method.
Submitted 2 May, 2021; v1 submitted 9 September, 2020;
originally announced September 2020.
-
Managing connected and automated vehicles with flexible routing at "lane-allocation-free'' intersections
Authors:
Wanjing Ma,
Ruochen Hao,
Chunhui Yu,
Tuo Sun,
Bart van Arem
Abstract:
Trajectory planning and coordination for connected and automated vehicles (CAVs) have been studied at isolated ``signal-free'' intersections and in ``signal-free'' corridors under the fully CAV environment in the literature. Most of the existing studies are based on the definition of approaching and exit lanes. The route a vehicle takes to pass through an intersection is determined from its movement. That is, only the origin and destination arms are included. This study proposes a mixed-integer linear programming (MILP) model to optimize vehicle trajectories at an isolated ``signal-free'' intersection without lane allocation, which is denoted as ``lane-allocation-free'' (LAF) control. Each lane can be used as both approaching and exit lanes for all vehicle movements including left-turn, through, and right-turn. A vehicle can take a flexible route by way of multiple arms to pass through the intersection. In this way, the spatial-temporal resources are expected to be fully utilized. The interactions between vehicle trajectories are modeled explicitly at the microscopic level. Vehicle routes and trajectories (i.e., car-following and lane-changing behaviors) at the intersection are optimized in one unified framework for system optimality in terms of total vehicle delay. Considering varying traffic conditions, the planning horizon is adaptively adjusted in the implementation procedure of the proposed model to make a balance between solution feasibility and computational burden. Numerical studies validate the advantages of the proposed LAF control in terms of both vehicle delay and throughput with different demand structures and temporal safety gaps.
Submitted 24 August, 2020;
originally announced August 2020.
-
Understanding Notions of Stationarity in Non-Smooth Optimization
Authors:
Jiajin Li,
Anthony Man-Cho So,
Wing-Kin Ma
Abstract:
Many contemporary applications in signal processing and machine learning give rise to structured non-convex non-smooth optimization problems that can often be tackled by simple iterative methods quite effectively. One of the keys to understanding such a phenomenon---and, in fact, one of the very difficult conundrums even for experts---lies in the study of "stationary points" of the problem in question. Unlike smooth optimization, for which the definition of a stationary point is rather standard, there is a myriad of definitions of stationarity in non-smooth optimization. In this article, we give an introduction to different stationarity concepts for several important classes of non-convex non-smooth functions, discuss their geometric interpretations, and further clarify the relationship among these different concepts. We then demonstrate the relevance of these constructions in some representative applications and how they could affect the performance of iterative methods for tackling these applications.
Submitted 26 June, 2020;
originally announced June 2020.
-
Coupled Control Systems: Periodic Orbit Generation with Application to Quadrupedal Locomotion
Authors:
Wen-Loong Ma,
Noel Csomay-Shanklin,
Aaron D. Ames
Abstract:
A robotic system can be viewed as a collection of lower-dimensional systems that are coupled via reaction forces (Lagrange multipliers) enforcing holonomic constraints. Inspired by this viewpoint, this paper presents a novel formulation for nonlinear control systems that are subject to coupling constraints via virtual "coupling" inputs that abstractly play the role of Lagrange multipliers. The main contribution of this paper is a process---mirroring solving for Lagrange multipliers in robotic systems---wherein we isolate subsystems free of coupling constraints that provably encode the full-order dynamics of the coupled control system from which it was derived. This dimension reduction is leveraged in the formulation of a nonlinear optimization problem for the isolated subsystem that yields periodic orbits for the full-order coupled system. We consider the application of these ideas to robotic systems, which can be decomposed into subsystems. Specifically, we view a quadruped as a coupled control system consisting of two bipedal robots, wherein applying the framework developed allows for gaits (periodic orbits) to be generated for the individual biped yielding a gait for the full-order quadruped. This is demonstrated through walking experiments of a quadrupedal robot in simulation and on rough terrains.
Submitted 18 March, 2020;
originally announced March 2020.
-
A Distributionally Robust Area Under Curve Maximization Model
Authors:
Wenbo Ma,
Miguel A. Lejeune
Abstract:
Area under ROC curve (AUC) is a widely used performance measure for classification models. We propose two new distributionally robust AUC maximization models (DR-AUC) that rely on the Kantorovich metric and approximate the AUC with the hinge loss function. We consider the two cases with respectively fixed and variable support for the worst-case distribution. We use duality theory to reformulate the DR-AUC models and derive tractable convex optimization problems. The numerical experiments show that the proposed DR-AUC models -- benchmarked with the standard deterministic AUC and the support vector machine models -- perform better in general and in particular improve the worst-case out-of-sample performance over the majority of the considered datasets, thereby showing their robustness. The results are particularly encouraging since our numerical experiments are conducted with training sets of small size, which have been known to be conducive to low out-of-sample performance.
Submitted 7 May, 2020; v1 submitted 17 February, 2020;
originally announced February 2020.
-
Hybrid Inexact BCD for Coupled Structured Matrix Factorization in Hyperspectral Super-Resolution
Authors:
Ruiyuan Wu,
Hoi-To Wai,
Wing-Kin Ma
Abstract:
This paper develops a first-order optimization method for coupled structured matrix factorization (CoSMF) problems that arise in the context of hyperspectral super-resolution (HSR) in remote sensing. To best leverage the problem structures for computational efficiency, we introduce a hybrid inexact block coordinate descent (HiBCD) scheme wherein one coordinate is updated via the fast proximal gradient (FPG) method, while another via the Frank-Wolfe (FW) method. The FPG-type methods are known, by numerical experience, to take fewer iterations to converge, while the FW-type methods can offer lower per-iteration complexity in certain cases; and we wish to take the best of both. We show that the limit points of this HiBCD scheme are stationary. Our proof treats HiBCD as an optimization framework for a class of multi-block structured optimization problems, and our stationarity claim is applicable not only to CoSMF but also to many other problems. Previous optimization research showed the same stationarity result for inexact block coordinate descent with either FPG or FW updates only. Numerical results indicate that the proposed HiBCD scheme is computationally much more efficient than the state-of-the-art CoSMF schemes in HSR.
Submitted 20 February, 2020; v1 submitted 19 September, 2019;
originally announced September 2019.
-
Multi-stage and Multi-customer Assortment Optimization with Inventory Constraints
Authors:
Elaheh Fata,
Will Ma,
David Simchi-Levi
Abstract:
We consider an assortment optimization problem where a customer chooses a single item from a sequence of sets shown to her, while limited inventories constrain the items offered to customers over time. In the special case where all of the assortments have size one, our problem captures the online stochastic matching with timeouts problem. For this problem, we derive a polynomial-time approximation algorithm which earns at least 1-ln(2-1/e), or 0.51, of the optimum. This improves upon the previous-best approximation ratio of 0.46, and furthermore, we show that it is tight. For the general assortment problem, we establish the first constant-factor approximation ratio of 0.09 for the case that different types of customers value items differently, and an approximation ratio of 0.15 for the case that different customers value each item the same. Our algorithms are based on rounding an LP relaxation for multi-stage assortment optimization, and improve upon previous randomized rounding schemes to derive the tight ratio of 1-ln(2-1/e).
Submitted 26 July, 2020; v1 submitted 26 August, 2019;
originally announced August 2019.
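As a quick arithmetic check on the ratio $1-\ln(2-1/e)$ quoted in the abstract above, the following minimal sketch evaluates the constant numerically. The function name `timeouts_ratio` is a hypothetical label chosen here for illustration and does not come from the paper.

```python
import math


def timeouts_ratio() -> float:
    """Evaluate 1 - ln(2 - 1/e), the constant quoted in the abstract.

    The function name is illustrative only; it is not taken from the paper.
    """
    return 1.0 - math.log(2.0 - 1.0 / math.e)


ratio = timeouts_ratio()
# Print to four decimals; the abstract rounds this constant to 0.51.
print(f"1 - ln(2 - 1/e) = {ratio:.4f}")
```

The value sits just above one half, which is consistent with the abstract's comparison against the previous-best ratio of 0.46.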
-
Long-time asymptotic behaviour for the fifth order modified Korteweg-de Vries equation
Authors:
Fudong Wang,
Wen-Xiu Ma
Abstract:
Following Deift-Zhou's nonlinear steepest descent method, the long-time asymptotic behavior for the Cauchy problem of the 5th order modified Korteweg-de Vries equation is analyzed. Based on the inverse scattering transform, the 5th order MKdV is transformed to a 2 by 2 oscillatory Riemann-Hilbert problem, then by manipulating the Cauchy operator and reducing the degree of the phase function, the long-time asymptotics of the solution is given in terms of solutions of the parabolic cylinder equation.
Submitted 30 July, 2019;
originally announced July 2019.
-
Online Matching Frameworks under Stochastic Rewards, Product Ranking, and Unknown Patience
Authors:
Brian Brubach,
Nathaniel Grammel,
Will Ma,
Aravind Srinivasan
Abstract:
We study generalizations of online bipartite matching in which each arriving vertex (customer) views a ranked list of offline vertices (products) and matches to (purchases) the first one they deem acceptable. The number of products that the customer has patience to view can be stochastic and dependent on the products seen. We develop a framework that views the interaction with each customer as an abstract resource consumption process, and derive new results for these online matching problems under the adversarial, non-stationary, and IID arrival models, assuming we can (approximately) solve the product ranking problem for each single customer. To that end, we show new results for product ranking under two cascade-click models: an optimal algorithm when each item has its own hazard rate for making the customer depart, and a 1/2-approximate algorithm when the customer has a general item-independent patience distribution. We also present a constant-factor 0.027-approximate algorithm in a new model where items are not initially available and arrive over time. We complement these positive results by presenting three additional negative results relating to these problems.
Submitted 26 June, 2023; v1 submitted 8 July, 2019;
originally announced July 2019.
-
A Fine-Grained Variant of the Hierarchy of Lasserre
Authors:
Wann-Jiun Ma,
Jakub Marecek,
Martin Mevissen
Abstract:
There has been much recent interest in hierarchies of progressively stronger convexifications of polynomial optimisation problems (POP). These often converge to the global optimum of the POP, asymptotically, but prove challenging to solve beyond the first level in the hierarchy for modest instances. We present a finer-grained variant of the Lasserre hierarchy, together with first-order methods for solving the convexifications, which allow for efficient warm-starting with solutions from lower levels in the hierarchy.
Submitted 23 June, 2019;
originally announced June 2019.
-
Hierarchical and Safe Motion Control for Cooperative Locomotion of Robotic Guide Dogs and Humans: A Hybrid Systems Approach
Authors:
Kaveh Akbari Hamed,
Vinay R. Kamidi,
Wen-Loong Ma,
Alexander Leonessa,
Aaron D. Ames
Abstract:
This paper presents a hierarchical control strategy based on hybrid systems theory, nonlinear control, and safety-critical systems to enable cooperative locomotion of robotic guide dogs and visually impaired people. We address high-dimensional and complex hybrid dynamical models that represent collaborative locomotion. At the high level of the control scheme, local and nonlinear baseline controllers, based on the virtual constraints approach, are designed to induce exponentially stable dynamic gaits. The baseline controller for the leash is assumed to be a nonlinear controller that keeps the human at a safe distance from the dog while following it. At the lower level, a real-time quadratic program (QP) is solved to modify the baseline controllers of the robot as well as the leash to avoid obstacles. In particular, the QP framework is set up based on control barrier functions (CBFs) to compute optimal control inputs that guarantee safety while being close to the baseline controllers. The stability of the complex periodic gaits is investigated through the Poincare return map. To demonstrate the power of the analytical foundation, the control algorithms are transferred into an extensive numerical simulation of a complex model that represents cooperative locomotion of a quadrupedal robot, referred to as Vision 60, and a human model. The complex model has 16 continuous-time domains with 60 state variables and 20 control inputs.
Submitted 5 April, 2019;
originally announced April 2019.
-
Distributed Feedback Controllers for Stable Cooperative Locomotion of Quadrupedal Robots: A Virtual Constraint Approach
Authors:
Kaveh Akbari Hamed,
Vinay R. Kamidi,
Abhishek Pandala,
Wen-Loong Ma,
Aaron D. Ames
Abstract:
This paper aims to develop distributed feedback control algorithms that allow cooperative locomotion of quadrupedal robots which are coupled to each other by holonomic constraints. These constraints can arise from collaborative manipulation of objects during locomotion. In addressing this problem, the complex hybrid dynamical models that describe collaborative legged locomotion are studied. The complex periodic orbits (i.e., gaits) of these sophisticated and high-dimensional hybrid systems are investigated. We consider a set of virtual constraints that stabilizes locomotion of a single agent. The paper then generates modified and local virtual constraints for each agent that allow stable collaborative locomotion. Optimal distributed feedback controllers, based on nonlinear control and quadratic programming, are developed to impose the local virtual constraints. To demonstrate the power of the analytical foundation, an extensive numerical simulation for cooperative locomotion of two quadrupedal robots with robotic manipulators is presented. The numerical complex hybrid model has 64 continuous-time domains, 192 discrete-time transitions, 96 state variables, and 36 control inputs.
Submitted 1 October, 2019; v1 submitted 10 February, 2019;
originally announced February 2019.
-
Statistical inference of probabilistic origin-destination demand using day-to-day traffic data
Authors:
Wei Ma,
Zhen Qian
Abstract:
Recent transportation network studies on uncertainty and reliability call for modeling the probabilistic O-D demand and probabilistic network flow. Making the best use of day-to-day traffic data collected over many years, this paper develops a novel theoretical framework for estimating the mean and variance/covariance matrix of O-D demand considering the day-to-day variation induced by travelers' independent route choices. It also estimates the probability distributions of link/path flow and their travel cost, where the variance stems from three sources: O-D demand, route choice, and unknown errors. The framework estimates the O-D demand mean and variance/covariance matrix iteratively, an approach also known as iterative generalized least squares (IGLS) in statistics. Lasso regularization is employed to obtain a sparse covariance matrix for better interpretation and computational efficiency. Though the probabilistic O-D estimation (ODE) works with a much larger solution space than the deterministic ODE, we show that its estimator for O-D demand mean is no worse than the best possible estimator by an error that reduces with the increase in sample size. The probabilistic ODE is examined on two small networks and two real-world large-scale networks. The solution converges quickly under the IGLS framework. In all those experiments, the results of the probabilistic ODE are compelling, satisfactory and computationally plausible. Lasso regularization on the covariance matrix estimation tends to underestimate most of the variance/covariance entries. A proper Lasso penalty ensures a good trade-off between bias and variance of the estimation.
Submitted 28 January, 2019;
originally announced January 2019.
-
Strong mixed-integer programming formulations for trained neural networks
Authors:
Ross Anderson,
Joey Huchette,
Will Ma,
Christian Tjandraatmadja,
Juan Pablo Vielma
Abstract:
We present strong mixed-integer programming (MIP) formulations for high-dimensional piecewise linear functions that correspond to trained neural networks. These formulations can be used for a number of important tasks, such as verifying that an image classification network is robust to adversarial inputs, or solving decision problems where the objective function is a machine learning model. We present a generic framework, which may be of independent interest, that provides a way to construct sharp or ideal formulations for the maximum of d affine functions over arbitrary polyhedral input domains. We apply this result to derive MIP formulations for a number of the most popular nonlinear operations (e.g. ReLU and max pooling) that are strictly stronger than other approaches from the literature. We corroborate this computationally, showing that our formulations are able to offer substantial improvements in solve time on verification tasks for image classification networks.
Submitted 21 January, 2020; v1 submitted 5 November, 2018;
originally announced November 2018.
-
Nakayama automorphisms of Ore extensions over polynomial algebras
Authors:
Liyu Liu,
Wen Ma
Abstract:
Nakayama automorphisms, which are known to be tough to compute in general, play an important role in several branches of mathematics. We compute the Nakayama automorphism $ν$ of any Ore extension $R[x; σ, δ]$ over a polynomial algebra $R$ in $n$ variables for an arbitrary $n$. The formula of $ν$ is obtained explicitly. When $σ$ is not the identity map, the invariant $E^G$ is also investigated in terms of Zhang's twist, where $G$ is a cyclic group sharing the same order with $σ$.
Submitted 10 August, 2018;
originally announced August 2018.
-
Statistical Inference for Covariate-Adaptive Randomization Procedures
Authors:
Wei Ma,
Yichen Qin,
Yang Li,
Feifang Hu
Abstract:
Covariate-adaptive randomization (CAR) procedures are frequently used in comparative studies to increase the covariate balance across treatment groups. However, because randomization inevitably uses the covariate information when forming balanced treatment groups, the validity of classical statistical methods after such randomization is often unclear. In this article, we derive the theoretical properties of statistical methods based on general CAR under the linear model framework. More importantly, we explicitly unveil the relationship between covariate-adaptive and inference properties by deriving the asymptotic representations of the corresponding estimators. We apply the proposed general theory to various randomization procedures such as complete randomization, rerandomization, pairwise sequential randomization, and Atkinson's $D_A$-biased coin design and compare their performance analytically. Based on the theoretical results, we then propose a new approach to obtain valid and more powerful tests. These results open a door to understand and analyze experiments based on CAR. Simulation studies provide further evidence of the advantages of the proposed framework and the theoretical results. Supplementary materials for this article are available online.
Submitted 7 September, 2020; v1 submitted 25 July, 2018;
originally announced July 2018.
-
Loss of information in feedforward social networks
Authors:
Simon Stolarczyk,
Manisha Bhardwaj,
Kevin E. Bassler,
Wei Ji Ma,
Kresimir Josic
Abstract:
We consider model social networks in which information propagates directionally across layers of rational agents. Each agent makes a locally optimal estimate of the state of the world, and communicates this estimate to agents downstream. When agents receive information from the same source their estimates are correlated. We show that the resulting redundancy can lead to the loss of information about the state of the world across layers of the network, even when all agents have full knowledge of the network's structure. A simple algebraic condition identifies networks in which information loss occurs, and we show that all such networks must contain a particular network motif. We also study random networks asymptotically as the number of agents increases, and find a sharp transition in the probability of information loss at the point at which the number of agents in one layer exceeds the number in the previous layer.
Submitted 26 September, 2016;
originally announced September 2016.
-
Trigonal curves and algebro-geometric solutions to soliton hierarchies
Authors:
Wen-Xiu Ma
Abstract:
Using linear combinations of Lax matrices of soliton hierarchies, we introduce trigonal curves by their characteristic equations, and determine Dubrovin type equations for zeros and poles of meromorphic functions defined as ratios of the Baker-Akhiezer functions. We straighten out all flows in soliton hierarchies under the Abel-Jacobi coordinates associated with Lax pairs, and generate algebro-geometric solutions to soliton hierarchies in terms of the Riemann theta functions, through observing asymptotic behaviors of the Baker-Akhiezer functions. We analyze the four-component AKNS soliton hierarchy in such a way that it leads to a general theory of trigonal curves applicable to construction of algebro-geometric solutions of an arbitrary soliton hierarchy.
Submitted 25 July, 2016;
originally announced July 2016.
-
Special Values of Motivic $L$-Functions and Zeta-Polynomials for Symmetric Powers of Elliptic Curves
Authors:
Steffen Löbrich,
Wenjun Ma,
Jesse Thorner
Abstract:
Let $\mathcal{M}$ be a pure motive over $\mathbb{Q}$ of odd weight $w\geq 3$, even rank $d\geq 2$, and global conductor $N$ whose $L$-function $L(s,\mathcal{M})$ coincides with the $L$-function of a self-dual algebraic tempered cuspidal symplectic representation of $\mathrm{GL}_{d}(\mathbb{A}_{\mathbb{Q}})$. We show that a certain polynomial which generates special values of $L(s,\mathcal{M})$ (including all of the critical values) has all of its zeros equidistributed on the unit circle, provided that $N$ or $w$ are sufficiently large with respect to $d$. These special values have arithmetic significance in the context of the Bloch-Kato conjecture. We focus on applications to symmetric powers of semistable elliptic curves over $\mathbb{Q}$. Using the Rodriguez-Villegas transform, we use these results to construct large classes of "zeta-polynomials" (in the sense of Manin) arising from symmetric powers of semistable elliptic curves; these polynomials have a functional equation relating $s\mapsto 1-s$, and all of their zeros on the line $\operatorname{Re}(s)=1/2$.
Submitted 1 June, 2017; v1 submitted 23 June, 2016;
originally announced June 2016.