Skip to main content

Showing 1–28 of 28 results for author: Haskell, W B

.
  1. arXiv:2401.06020  [pdf, other

    math.OC

    Dynamic Capital Requirements for Markov Decision Processes

    Authors: William B. Haskell, Abhishek Gupta, Shi** Shao

    Abstract: We build on the theory of capital requirements (CRs) to create a new framework for modeling dynamic risk preferences. The key question is how to evaluate the risk of a payoff stream sequentially as new information is revealed. In our model, we associate each payoff stream with a disbursement strategy and a premium schedule to form a triple of stochastic processes. We characterize risk preferences… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  2. arXiv:2211.04586  [pdf, other

    cs.LG cs.GT cs.MA econ.TH stat.ML

    Learning to Price Supply Chain Contracts against a Learning Retailer

    Authors: Xuejun Zhao, Ruihao Zhu, William B. Haskell

    Abstract: The rise of big data analytics has automated the decision-making of companies and increased supply chain agility. In this paper, we study the supply chain contract design problem faced by a data-driven supplier who needs to respond to the inventory decisions of the downstream retailer. Both the supplier and the retailer are uncertain about the market demand and need to learn about it sequentially.… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

  3. arXiv:2209.12937  [pdf, ps, other

    math.OC eess.SY

    Robustness to Modeling Errors in Risk-Sensitive Markov Decision Problems with Markov Risk Measures

    Authors: Shi** Shao, Abhishek Gupta, William B. Haskell

    Abstract: We consider risk-sensitive Markov decision processes (MDPs), where the MDP model is influenced by a parameter which takes values in a compact metric space. We identify sufficient conditions under which small perturbations in the model parameters lead to small changes in the optimal value function and optimal policy. We further establish the robustness of the risk-sensitive optimal policies to mode… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: 24 pages, submitted to SIAM Journal on Control and Optimization

  4. arXiv:2008.13309  [pdf, other

    q-fin.RM math.OC

    Preference Robust Optimization with Quasi-Concave Choice Functions for Multi-Attribute Prospects

    Authors: Jian Wu, William B. Haskell, Wenjie Huang, Huifu Xu

    Abstract: Preference robust choice models concern decision-making problems where the decision maker's (DM) utility/risk preferences are ambiguous and the evaluation is based on the worst-case utility function/risk measure from a set of plausible utility functions/risk measures. The current preference robust choice models are mostly built upon von Neumann-Morgenstern expected utility theory, the theory of co… ▽ More

    Submitted 5 April, 2022; v1 submitted 30 August, 2020; originally announced August 2020.

    Comments: 59 pages, 6 figures, submitted

  5. arXiv:2008.08275   

    math.ST

    Asymptotic Analysis for Data-Driven Inventory Policies

    Authors: Xun Zhang, Zhisheng Ye, William B. Haskell

    Abstract: We study periodic review stochastic inventory control in the data-driven setting where the retailer makes ordering decisions based only on historical demand observations without any knowledge of the probability distribution of the demand. Since an (s, S)-policy is optimal when the demand distribution is known, we investigate the statistical properties of the data-driven (s, S)-policy obtained by r… ▽ More

    Submitted 4 November, 2021; v1 submitted 19 August, 2020; originally announced August 2020.

    Comments: The authors plan to include the updated version into a research proposal. To avoid the possible inconvinence, the authors decided to remove the updated version for now

  6. arXiv:2006.12450  [pdf, other

    math.OC

    A dynamic analytic method for risk-aware controlled martingale problems

    Authors: Jukka Isohätälä, William B. Haskell

    Abstract: We present a new, tractable method for solving and analyzing risk-aware control problems over finite and infinite, discounted time-horizons where the dynamics of the controlled process are described as a martingale problem. Supposing general Polish state and action spaces, and using generalized, relaxed controls, we state a risk-aware dynamic optimal control problem of minimizing risk of costs des… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

    MSC Class: Primary; 93E20; 60J25; Secondary; 60J35; 90C30

  7. arXiv:2003.11403  [pdf, ps, other

    cs.LG eess.SY math.OC math.PR stat.ML

    Convergence of Recursive Stochastic Algorithms using Wasserstein Divergence

    Authors: Abhishek Gupta, William B. Haskell

    Abstract: This paper develops a unified framework, based on iterated random operator theory, to analyze the convergence of constant stepsize recursive stochastic algorithms (RSAs). RSAs use randomization to efficiently compute expectations, and so their iterates form a stochastic process. The key idea of our analysis is to lift the RSA into an appropriate higher-dimensional space and then express it as an e… ▽ More

    Submitted 5 January, 2021; v1 submitted 25 March, 2020; originally announced March 2020.

    Comments: 34 pages, submitted to SIMODS

    MSC Class: 93E35; 60J20; 68Q32

  8. arXiv:2003.10888  [pdf, other

    math.OC

    A Randomized Nonlinear Rescaling Method in Large-Scale Constrained Convex Optimization

    Authors: Bo Wei, William B. Haskell, Sixiang Zhao

    Abstract: We propose a new randomized algorithm for solving convex optimization problems that have a large number of constraints (with high probability). Existing methods like interior-point or Newton-type algorithms are hard to apply to such problems because they have expensive computation and storage requirements for Hessians and matrix inversions. Our algorithm is based on nonlinear rescaling (NLR), whic… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

  9. arXiv:1906.09437  [pdf, ps, other

    stat.ML cs.LG

    A Unifying Framework for Variance Reduction Algorithms for Finding Zeroes of Monotone Operators

    Authors: Xun Zhang, William B. Haskell, Zhisheng Ye

    Abstract: It is common to encounter large-scale monotone inclusion problems where the objective has a finite sum structure. We develop a general framework for variance-reduced forward-backward splitting algorithms for this problem. This framework includes a number of existing deterministic and variance-reduced algorithms for function minimization as special cases, and it is also applicable to more general p… ▽ More

    Submitted 16 March, 2021; v1 submitted 22 June, 2019; originally announced June 2019.

  10. arXiv:1905.05328  [pdf, other

    math.OC

    A Flexible Multi-Facility Capacity Expansion Problem with Risk Aversion

    Authors: Sixiang Zhao, William B. Haskell, Michel-Alexandre Cardin

    Abstract: This paper studies flexible multi-facility capacity expansion with risk aversion. In this setting, the decision maker can periodically expand the capacity of facilities given observations of uncertain demand. We model this situation as a multi-stage stochastic programming problem. We express risk aversion in this problem through conditional value-at-risk (CVaR), and we formulate a mean-CVaR object… ▽ More

    Submitted 13 May, 2019; originally announced May 2019.

  11. arXiv:1901.05768  [pdf, other

    math.OC stat.ME

    A Multi-Level Simulation Optimization Approach for Quantile Functions

    Authors: Songhao Wang, Szu Hui Ng, William Benjamin Haskell

    Abstract: Quantile is a popular performance measure for a stochastic system to evaluate its variability and risk. To reduce the risk, selecting the actions that minimize the tail quantiles of some loss distributions is typically of interest for decision makers. When the loss distribution is observed via simulations, evaluating and optimizing its quantile functions can be challenging, especially when the sim… ▽ More

    Submitted 17 January, 2019; originally announced January 2019.

  12. arXiv:1901.05154  [pdf, other

    math.OC

    An Accelerated Fitted Value Iteration Algorithm for MDPs with Finite and Vector-Valued Action Space

    Authors: Sixiang Zhao, William B. Haskell, Michel-Alexandre Cardin

    Abstract: This paper studies an accelerated fitted value iteration (FVI) algorithm to solve high-dimensional Markov decision processes (MDPs). FVI is an approximate dynamic programming algorithm that has desirable theoretical properties. However, it can be intractable when the action space is finite but vector-valued. To solve such MDPs via FVI, we first approximate the value functions by a two-layer neural… ▽ More

    Submitted 25 November, 2020; v1 submitted 16 January, 2019; originally announced January 2019.

  13. arXiv:1901.04882  [pdf, other

    cs.GT cs.MA math.OC

    Model and Reinforcement Learning for Markov Games with Risk Preferences

    Authors: Wenjie Huang, Pham Viet Hai, William B. Haskell

    Abstract: We motivate and propose a new model for non-cooperative Markov game which considers the interactions of risk-aware players. This model characterizes the time-consistent dynamic "risk" from both stochastic state transitions (inherent to the game) and randomized mixed strategies (due to all other players). An appropriate risk-aware equilibrium concept is proposed and the existence of such equilibria… ▽ More

    Submitted 21 November, 2019; v1 submitted 15 January, 2019; originally announced January 2019.

    Comments: 38 pages, 6 tables, 5 figures

  14. arXiv:1812.09179  [pdf, ps, other

    math.OC

    Risk aware minimum principle for optimal control of stochastic differential equations

    Authors: Jukka Isohätälä, William B. Haskell

    Abstract: We present a probabilistic formulation of risk aware optimal control problems for stochastic differential equations. Risk awareness is in our framework captured by objective functions in which the risk neutral expectation is replaced by a risk function, a nonlinear functional of random variables that account for the controller's risk preferences. We state and prove a risk aware minimum principle t… ▽ More

    Submitted 18 October, 2019; v1 submitted 21 December, 2018; originally announced December 2018.

  15. arXiv:1812.09017  [pdf, ps, other

    math.OC

    Corporative Stochastic Approximation with Random Constraint Sampling for Semi-Infinite Programming

    Authors: Bo Wei, William B. Haskell, Sixiang Zhao

    Abstract: We developed a corporative stochastic approximation (CSA) type algorithm for semi-infinite programming (SIP), where the cut generation problem is solved inexactly. First, we provide general error bounds for inexact CSA. Then, we propose two specific random constraint sampling schemes to approximately solve the cut generation problem. When the objective and constraint functions are generally convex… ▽ More

    Submitted 21 December, 2018; originally announced December 2018.

  16. arXiv:1809.05385  [pdf, ps, other

    math.OC

    Index-Based Policy for Risk-Averse Multi-Armed Bandit

    Authors: Jianyu Xu, William B. Haskell, Zhisheng Ye

    Abstract: The multi-armed bandit (MAB) is a classical online optimization model for the trade-off between exploration and exploitation. The traditional MAB is concerned with finding the arm that minimizes the mean cost. However, minimizing the mean does not take the risk of the problem into account. We now want to accommodate risk-averse decision makers. In this work, we introduce a coherent risk measure as… ▽ More

    Submitted 14 September, 2018; originally announced September 2018.

  17. arXiv:1805.06632  [pdf, other

    q-fin.RM cs.AI cs.IR math.OC

    Preference Elicitation and Robust Optimization with Multi-Attribute Quasi-Concave Choice Functions

    Authors: William B. Haskell, Wenjie Huang, Huifu Xu

    Abstract: Decision maker's preferences are often captured by some choice functions which are used to rank prospects. In this paper, we consider ambiguity in choice functions over a multi-attribute prospect space. Our main result is a robust preference model where the optimal decision is based on the worst-case choice function from an ambiguity set constructed through preference elicitation with pairwise com… ▽ More

    Submitted 17 May, 2018; originally announced May 2018.

    Comments: 36 pages, 4 figures, submitted to Operations Research

  18. arXiv:1805.04238  [pdf, other

    math.OC cs.AI

    Stochastic Approximation for Risk-aware Markov Decision Processes

    Authors: Wenjie Huang, William B. Haskell

    Abstract: We develop a stochastic approximation-type algorithm to solve finite state/action, infinite-horizon, risk-aware Markov decision processes. Our algorithm has two loops. The inner loop computes the risk by solving a stochastic saddle-point problem. The outer loop performs $Q$-learning to compute an optimal risk-aware policy. Several widely investigated risk measures (e.g. conditional value-at-risk,… ▽ More

    Submitted 3 December, 2019; v1 submitted 11 May, 2018; originally announced May 2018.

    Comments: 34 pages, 4 figures, 2 tables

  19. arXiv:1803.10898  [pdf, other

    math.OC

    An Inexact Primal-Dual Algorithm for Semi-Infinite Programming

    Authors: Bo Wei, William B. Haskell, Sixiang Zhao

    Abstract: This paper considers an inexact primal-dual algorithm for semi-infinite programming (SIP) for which it provides general error bounds. To implement the dual variable update, we create a new prox function for nonnegative measures which turns out to be a generalization of the Kullback-Leibler divergence for probability distributions. We show that under suitable conditions on the error, this algorithm… ▽ More

    Submitted 15 January, 2019; v1 submitted 28 March, 2018; originally announced March 2018.

  20. arXiv:1801.04745  [pdf, ps, other

    eess.SY math.OC

    Distributionally Robust Optimization for Sequential Decision Making

    Authors: Zhi Chen, Pengqian Yu, William B. Haskell

    Abstract: The distributionally robust Markov Decision Process (MDP) approach asks for a distributionally robust policy that achieves the maximal expected total reward under the most adversarial distribution of uncertain parameters. In this paper, we study distributionally robust MDPs where ambiguity sets for the uncertain parameters are of a format that can easily incorporate in its description the uncertai… ▽ More

    Submitted 9 October, 2018; v1 submitted 15 January, 2018; originally announced January 2018.

  21. arXiv:1711.03669  [pdf, other

    math.OC

    An Inexact Primal-Dual Smoothing Framework for Large-Scale Non-Bilinear Saddle Point Problems

    Authors: Le Thi Khanh Hien, Renbo Zhao, William B. Haskell

    Abstract: We develop an inexact primal-dual first-order smoothing framework to solve a class of non-bilinear saddle point problems with primal strong convexity. Compared with existing methods, our framework yields a significant improvement over the primal oracle complexity, while it has competitive dual oracle complexity. In addition, we consider the situation where the primal-dual coupling term has a large… ▽ More

    Submitted 24 July, 2023; v1 submitted 9 November, 2017; originally announced November 2017.

  22. arXiv:1709.07506  [pdf, other

    math.OC

    An Empirical Dynamic Programming Algorithm for Continuous MDPs

    Authors: William B. Haskell, Rahul Jain, Hiteshi Sharma, Pengqian Yu

    Abstract: We propose universal randomized function approximation-based empirical value iteration (EVI) algorithms for Markov decision processes. The `empirical' nature comes from each iteration being done empirically from samples available from simulations of the next state. This makes the Bellman operator a random operator. A parametric and a non-parametric method for function approximation using a paramet… ▽ More

    Submitted 23 April, 2019; v1 submitted 21 September, 2017; originally announced September 2017.

    Comments: Accepted for publication in IEEE Transactions on Automatic Control

  23. arXiv:1705.06884  [pdf, other

    stat.ML cs.LG math.OC

    A Unified Framework for Stochastic Matrix Factorization via Variance Reduction

    Authors: Renbo Zhao, William B. Haskell, Jiashi Feng

    Abstract: We propose a unified framework to speed up the existing stochastic matrix factorization (SMF) algorithms via variance reduction. Our framework is general and it subsumes several well-known SMF formulations in the literature. We perform a non-asymptotic convergence analysis of our framework and derive computational and sample complexities for our algorithm to converge to an $ε$-stationary point in… ▽ More

    Submitted 21 May, 2017; v1 submitted 19 May, 2017; originally announced May 2017.

  24. arXiv:1704.00116  [pdf, other

    math.OC cs.IT stat.ML

    Stochastic L-BFGS: Improved Convergence Rates and Practical Acceleration Strategies

    Authors: Renbo Zhao, William B. Haskell, Vincent Y. F. Tan

    Abstract: We revisit the stochastic limited-memory BFGS (L-BFGS) algorithm. By proposing a new framework for the convergence analysis, we prove improved convergence rates and computational complexities of the stochastic L-BFGS algorithms compared to previous works. In addition, we propose several practical acceleration strategies to speed up the empirical performance of such algorithms. We also provide theo… ▽ More

    Submitted 24 October, 2017; v1 submitted 31 March, 2017; originally announced April 2017.

  25. arXiv:1701.01290  [pdf, other

    eess.SY math.OC

    Approximate Value Iteration for Risk-aware Markov Decision Processes

    Authors: Pengqian Yu, William B. Haskell, Huan Xu

    Abstract: We consider large-scale Markov decision processes (MDPs) with a risk measure of variability in cost, under the risk-aware MDPs paradigm. Previous studies showed that risk-aware MDPs, based on a minimax approach to handling risk, can be solved using dynamic programming for small to medium sized problems. However, due to the "curse of dimensionality", MDPs that model real-life problems are typically… ▽ More

    Submitted 16 May, 2017; v1 submitted 5 January, 2017; originally announced January 2017.

  26. arXiv:1610.06702   

    math.OC

    Random constraint sampling and duality for convex optimization

    Authors: William B. Haskell, Yu Pengqian

    Abstract: We are interested in solving convex optimization problems with large numbers of constraints. Randomized algorithms, such as random constraint sampling, have been very successful in giving nearly optimal solutions to such problems. In this paper, we combine random constraint sampling with the classical primal-dual algorithm for convex optimization problems with large numbers of constraints, and we… ▽ More

    Submitted 26 November, 2016; v1 submitted 21 October, 2016; originally announced October 2016.

    Comments: Substantially revised draft in preparation, with much stronger results

  27. arXiv:1311.5918  [pdf, other

    math.OC

    Empirical Dynamic Programming

    Authors: William B. Haskell, Rahul Jain, Dileep Kalathil

    Abstract: We propose empirical dynamic programming algorithms for Markov decision processes (MDPs). In these algorithms, the exact expectation in the Bellman operator in classical value iteration is replaced by an empirical estimate to get `empirical value iteration' (EVI). Policy evaluation and policy improvement in classical policy iteration are also replaced by simulation to get `empirical policy iterati… ▽ More

    Submitted 22 November, 2013; originally announced November 2013.

    Comments: 34 Pages, 1 Figure

  28. arXiv:1206.4568  [pdf, ps, other

    math.OC

    Stochastic dominance-constrained Markov decision processes

    Authors: William B. Haskell, Rahul Jain

    Abstract: We are interested in risk constraints for infinite horizon discrete time Markov decision processes (MDPs). Starting with average reward MDPs, we show that increasing concave stochastic dominance constraints on the empirical distribution of reward lead to linear constraints on occupation measures. The optimal policy for the resulting class of dominance-constrained MDPs is obtained by solving a line… ▽ More

    Submitted 20 June, 2012; originally announced June 2012.