Skip to main content

Showing 1–29 of 29 results for author: Powell, W B

.
  1. arXiv:2302.11386  [pdf, other

    math.OC

    Entropy Minimization for Optimization of Expensive, Unimodal Functions

    Authors: Xiaohe Luo, Warren B. Powell

    Abstract: Maximization of an expensive, unimodal function under random observations has been an important problem in hyperparameter tuning. It features expensive function evaluations (which means small budgets) and a high level of noise. We develop an algorithm based on entropy reduction of a probabilistic belief about the optimum. The algorithm provides an efficient way of estimating the computationally in… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

  2. arXiv:2301.07013  [pdf, other

    math.OC

    An Information-Collecting Drone Management Problem for Wildfire Mitigation

    Authors: Lawrence Thul, Warren B Powell

    Abstract: We present a formal mathematical multi-agent modeling framework for autonomously combating a wildland fire with unmanned aerial vehicles. The problem is formulated as a collaboration between a drone and a helicopter equipped with a tanker. The modeling solutions are designed to capture the communication between agents and the information processes between the agents and their environment. The dron… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: 42 pages, 6 figures

  3. arXiv:2301.06497  [pdf, other

    math.OC

    The Information-Collecting Vehicle Routing Problem: Stochastic Optimization for Emergency Storm Response

    Authors: Lina Al-Kanj, Warren B. Powell, Belgacem Bouzaiene-Ayari

    Abstract: We address the problem of mitigating damage to a power grid following a storm by managing a vehicle that has to be routed while simultaneously performing two tasks: learning about damage from the grid (which requires direct observation) and repairing damage that it observes. The learning process is assisted by calls from customers notifying the utility that they have lost power (``lights-out calls… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

    Comments: 44 pages, 7 figures. arXiv admin note: substantial text overlap with arXiv:1605.05711

  4. arXiv:2204.07317  [pdf, other

    math.OC

    Stochastic Search for a Parametric Cost Function Approximation: Energy storage with rolling forecasts

    Authors: Saeed Ghadimi, Warren B. Powell

    Abstract: Rolling forecasts have been almost overlooked in the renewable energy storage literature. In this paper, we provide a new approach for handling uncertainty not just in the accuracy of a forecast, but in the evolution of forecasts over time. Our approach shifts the focus from modeling the uncertainty in a lookahead model to accurate simulations in a stochastic base model. We develop a robust policy… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

  5. arXiv:2201.00258  [pdf, other

    math.OC cs.AI

    The Parametric Cost Function Approximation: A new approach for multistage stochastic programming

    Authors: Warren B Powell, Saeed Ghadimi

    Abstract: The most common approaches for solving multistage stochastic programming problems in the research literature have been to either use value functions ("dynamic programming") or scenario trees ("stochastic programming") to approximate the impact of a decision now on the future. By contrast, common industry practice is to use a deterministic approximation of the future which is easier to understand a… ▽ More

    Submitted 1 January, 2022; originally announced January 2022.

    Comments: 3 figures

    MSC Class: 68 ACM Class: F.2; I.2

  6. arXiv:2004.05417  [pdf, other

    cs.LG cs.AI

    Optimal Learning for Sequential Decisions in Laboratory Experimentation

    Authors: Kristopher Reyes, Warren B Powell

    Abstract: The process of discovery in the physical, biological and medical sciences can be painstakingly slow. Most experiments fail, and the time from initiation of research until a new advance reaches commercial production can span 20 years. This tutorial is aimed to provide experimental scientists with a foundation in the science of making decisions. Using numerical examples drawn from the experiences of… ▽ More

    Submitted 13 April, 2020; v1 submitted 11 April, 2020; originally announced April 2020.

  7. arXiv:2002.06238  [pdf, other

    cs.LG cs.AI stat.ML

    On State Variables, Bandit Problems and POMDPs

    Authors: Warren B Powell

    Abstract: State variables are easily the most subtle dimension of sequential decision problems. This is especially true in the context of active learning problems (bandit problems") where decisions affect what we observe and learn. We describe our canonical framework that models {\it any} sequential decision problem, and present our definition of state variables that allows us to claim: Any properly modeled… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

  8. arXiv:2001.06026  [pdf, other

    math.OC

    Risk Directed Importance Sampling in Stochastic Dual Dynamic Programming with Hidden Markov Models for Grid Level Energy Storage

    Authors: Joseph L. Durante, Juliana Nascimento, Warren B. Powell

    Abstract: Power systems that need to integrate renewables at a large scale must account for the high levels of uncertainty introduced by these power sources. This can be accomplished with a system of many distributed grid-level storage devices. However, develo** a cost-effective and robust control policy in this setting is a challenge due to the high dimensionality of the resource state and the highly vol… ▽ More

    Submitted 1 February, 2020; v1 submitted 16 January, 2020; originally announced January 2020.

    Comments: 42 pages, 7 figures, Replacement: added additional references

  9. arXiv:2001.00831  [pdf, other

    math.OC

    Reinforcement Learning via Parametric Cost Function Approximation for Multistage Stochastic Programming

    Authors: Saeed Ghadimi, Raymond T. Perkins, Warren B. Powell

    Abstract: The most common approaches for solving stochastic resource allocation problems in the research literature is to either use value functions ("dynamic programming") or scenario trees ("stochastic programming") to approximate the impact of a decision now on the future. By contrast, common industry practice is to use a deterministic approximation of the future which is easier to understand and solve,… ▽ More

    Submitted 2 January, 2020; originally announced January 2020.

    Comments: arXiv admin note: text overlap with arXiv:1703.04644

  10. arXiv:1912.09484  [pdf, other

    math.OC cs.LG eess.SP eess.SY math.PR stat.ML

    Zeroth-order Stochastic Compositional Algorithms for Risk-Aware Learning

    Authors: Dionysios S. Kalogerias, Warren B. Powell

    Abstract: We present $\textit{Free-MESSAGE}^{p}$, the first zeroth-order algorithm for (weakly-)convex mean-semideviation-based risk-aware learning, which is also the first three-level zeroth-order compositional stochastic optimization algorithm whatsoever. Using a non-trivial extension of Nesterov's classical results on Gaussian smoothing, we develop the $\textit{Free-MESSAGE}^{p}$ algorithm from first pri… ▽ More

    Submitted 13 December, 2021; v1 submitted 19 December, 2019; originally announced December 2019.

    Comments: 31 pages, major revision of the first version

  11. arXiv:1912.03513  [pdf, other

    cs.AI cs.LG eess.SY stat.ML

    From Reinforcement Learning to Optimal Control: A unified framework for sequential decisions

    Authors: Warren B Powell

    Abstract: There are over 15 distinct communities that work in the general area of sequential decisions and information, often referred to as decisions under uncertainty or stochastic optimization. We focus on two of the most important fields: stochastic optimal control, with its roots in deterministic optimal control, and reinforcement learning, with its roots in Markov decision processes. Building on prior… ▽ More

    Submitted 18 December, 2019; v1 submitted 7 December, 2019; originally announced December 2019.

    Comments: 47 pages, 6 figures

  12. arXiv:1810.08124  [pdf, ps, other

    cs.AI eess.SY

    Approximate Dynamic Programming for Planning a Ride-Sharing System using Autonomous Fleets of Electric Vehicles

    Authors: Lina Al-Kanj, Juliana Nascimento, Warren B. Powell

    Abstract: Within a decade, almost every major auto company, along with fleet operators such as Uber, have announced plans to put autonomous vehicles on the road. At the same time, electric vehicles are quickly emerging as a next-generation technology that is cost effective, in addition to offering the benefits of reducing the carbon footprint. The combination of a centrally managed fleet of driverless vehic… ▽ More

    Submitted 11 December, 2018; v1 submitted 18 October, 2018; originally announced October 2018.

  13. arXiv:1804.00636  [pdf, other

    math.OC stat.AP stat.ME stat.ML

    Recursive Optimization of Convex Risk Measures: Mean-Semideviation Models

    Authors: Dionysios S. Kalogerias, Warren B. Powell

    Abstract: We develop recursive, data-driven, stochastic subgradient methods for optimizing a new, versatile, and application-driven class of convex risk measures, termed here as mean-semideviations, strictly generalizing the well-known and popular mean-upper-semideviation. We introduce the MESSAGEp algorithm, which is an efficient compositional subgradient procedure for iteratively solving convex mean-semid… ▽ More

    Submitted 29 October, 2018; v1 submitted 2 April, 2018; originally announced April 2018.

    Comments: 90 pages, 3 figures. Update: Substantial revision of the technical content, with an additional fully detailed analysis in regard to the rate of convergence of the MESSAGEp algorithm. NOTE: Please open in browser to see the math in the abstract!

  14. arXiv:1710.03914  [pdf, other

    math.OC

    Backward Approximate Dynamic Programming with Hidden Semi-Markov Stochastic Models in Energy Storage Optimization

    Authors: Joseph L. Durante, Juliana Nascimento, Warren B. Powell

    Abstract: We consider an energy storage problem involving a wind farm with a forecasted power output, a stochastic load, an energy storage device, and a connection to the larger power grid with stochastic prices. Electricity prices and wind power forecast errors are modeled using a novel hidden semi-Markov model that accurately replicates not just the distribution of the errors, but also crossing times, cap… ▽ More

    Submitted 1 February, 2020; v1 submitted 11 October, 2017; originally announced October 2017.

    Comments: 36 pages, 7 figures, replacement: up to date version, additional references added

  15. arXiv:1704.05963  [pdf, other

    math.OC cs.AI cs.LG

    Monte Carlo Tree Search with Sampled Information Relaxation Dual Bounds

    Authors: Daniel R. Jiang, Lina Al-Kanj, Warren B. Powell

    Abstract: Monte Carlo Tree Search (MCTS), most famously used in game-play artificial intelligence (e.g., the game of Go), is a well-known strategy for constructing approximate solutions to sequential decision problems. Its primary innovation is the use of a heuristic, known as a default policy, to obtain Monte Carlo estimates of downstream values for states in a decision tree. This information is used to it… ▽ More

    Submitted 19 April, 2017; originally announced April 2017.

    Comments: 33 pages, 6 figures

  16. arXiv:1703.04644  [pdf, other

    math.OC

    Stochastic Optimization with Parametric Cost Function Approximations

    Authors: Raymond T. Perkins III, Warren B. Powell

    Abstract: A widely used heuristic for solving stochastic optimization problems is to use a deterministic rolling horizon procedure, which has been modified to handle uncertainty (e.g. buffer stocks, schedule slack). This approach has been criticized for its use of a deterministic approximation of a stochastic problem, which is the major motivation for stochastic programming. We recast this debate by identif… ▽ More

    Submitted 14 March, 2017; originally announced March 2017.

  17. arXiv:1611.07161  [pdf, other

    stat.ML

    Optimal Learning for Stochastic Optimization with Nonlinear Parametric Belief Models

    Authors: Xinyu He, Warren B. Powell

    Abstract: We consider the problem of estimating the expected value of information (the knowledge gradient) for Bayesian learning problems where the belief model is nonlinear in the parameters. Our goal is to maximize some metric, while simultaneously learning the unknown parameters of the nonlinear belief model, by guiding a sequential experimentation process which is expensive. We overcome the problem of c… ▽ More

    Submitted 22 November, 2016; originally announced November 2016.

  18. arXiv:1605.05711  [pdf, ps, other

    math.OC cs.AI eess.SY

    The Information-Collecting Vehicle Routing Problem: Stochastic Optimization for Emergency Storm Response

    Authors: Lina Al-Kanj, Warren B. Powell, Belgacem Bouzaiene-Ayari

    Abstract: Utilities face the challenge of responding to power outages due to storms and ice damage, but most power grids are not equipped with sensors to pinpoint the precise location of the faults causing the outage. Instead, utilities have to depend primarily on phone calls (trouble calls) from customers who have lost power to guide the dispatching of utility trucks. In this paper, we develop a policy tha… ▽ More

    Submitted 18 May, 2016; originally announced May 2016.

  19. arXiv:1605.02848  [pdf, other

    math.OC

    Practicality of Nested Risk Measures for Dynamic Electric Vehicle Charging

    Authors: Daniel R. Jiang, Warren B. Powell

    Abstract: We consider the sequential decision problem faced by the manager of an electric vehicle (EV) charging station, who aims to satisfy the charging demand of the customer while minimizing cost. Since the total time needed to charge the EV up to capacity is often less than the amount of time that the customer is away, there are opportunities to exploit electricity spot price variations within some rese… ▽ More

    Submitted 3 October, 2017; v1 submitted 10 May, 2016; originally announced May 2016.

    Comments: 45 pages, 15 figures

  20. arXiv:1605.01521  [pdf, ps, other

    math.OC stat.CO

    SDDP vs. ADP: The Effect of Dimensionality in Multistage Stochastic Optimization for Grid Level Energy Storage

    Authors: Tsvetan Asamov, Daniel F. Salas, Warren B. Powell

    Abstract: There has been widespread interest in the use of grid-level storage to handle the variability from increasing penetrations of wind and solar energy. This problem setting requires optimizing energy storage and release decisions for anywhere from a half-dozen, to potentially hundreds of storage devices spread around the grid as new technologies evolve. We approach this problem using two competing al… ▽ More

    Submitted 5 May, 2016; originally announced May 2016.

  21. arXiv:1509.01920  [pdf, other

    math.OC cs.AI

    Risk-Averse Approximate Dynamic Programming with Quantile-Based Risk Measures

    Authors: Daniel R. Jiang, Warren B. Powell

    Abstract: In this paper, we consider a finite-horizon Markov decision process (MDP) for which the objective at each stage is to minimize a quantile-based risk measure (QBRM) of the sequence of future costs; we call the overall objective a dynamic quantile-based risk measure (DQBRM). In particular, we consider optimizing dynamic risk measures where the one-step risk measures are QBRMs, a class of risk measur… ▽ More

    Submitted 8 May, 2017; v1 submitted 7 September, 2015; originally announced September 2015.

    Comments: 39 pages, 7 figures

  22. arXiv:1508.01551  [pdf, ps, other

    math.OC stat.AP stat.ML

    A Knowledge Gradient Policy for Sequencing Experiments to Identify the Structure of RNA Molecules Using a Sparse Additive Belief Model

    Authors: Yan Li, Kristofer G. Reyes, Jorge Vazquez-Anderson, Yingfei Wang, Lydia M. Contreras, Warren B. Powell

    Abstract: We present a sparse knowledge gradient (SpKG) algorithm for adaptively selecting the targeted regions within a large RNA molecule to identify which regions are most amenable to interactions with other molecules. Experimentally, such regions can be inferred from fluorescence measurements obtained by binding a complementary probe with fluorescence markers to the targeted regions. We use a biophysica… ▽ More

    Submitted 6 August, 2015; originally announced August 2015.

  23. arXiv:1505.02227  [pdf, ps, other

    math.OC stat.CO

    Regularized Decomposition of High-Dimensional Multistage Stochastic Programs with Markov Uncertainty

    Authors: Tsvetan Asamov, Warren B. Powell

    Abstract: We develop a quadratic regularization approach for the solution of high-dimensional multistage stochastic optimization problems characterized by a potentially large number of time periods/stages (e.g. hundreds), a high-dimensional resource state variable, and a Markov information process. The resulting algorithms are shown to converge to an optimal policy after a finite number of iterations under… ▽ More

    Submitted 26 February, 2017; v1 submitted 8 May, 2015; originally announced May 2015.

  24. arXiv:1407.2676  [pdf, other

    math.OC cs.AI cs.LG eess.SY stat.ML

    A New Optimal Stepsize For Approximate Dynamic Programming

    Authors: Ilya O. Ryzhov, Peter I. Frazier, Warren B. Powell

    Abstract: Approximate dynamic programming (ADP) has proven itself in a wide range of applications spanning large-scale transportation problems, health care, revenue management, and energy systems. The design of effective ADP algorithms has many dimensions, but one crucial factor is the stepsize rule used to update a value function approximation. Many operations research applications are computationally inte… ▽ More

    Submitted 13 July, 2014; v1 submitted 9 July, 2014; originally announced July 2014.

    Comments: Matlab files are included with the paper source

  25. arXiv:1402.3575  [pdf, other

    math.OC

    Optimal Hour-Ahead Bidding in the Real-Time Electricity Market with Battery Storage using Approximate Dynamic Programming

    Authors: Daniel R. Jiang, Warren B. Powell

    Abstract: There is growing interest in the use of grid-level storage to smooth variations in supply that are likely to arise with increased use of wind and solar energy. Energy arbitrage, the process of buying, storing, and selling electricity to exploit variations in electricity spot prices, is becoming an important way of paying for expensive investments into grid-level storage. Independent system operato… ▽ More

    Submitted 31 August, 2015; v1 submitted 14 February, 2014; originally announced February 2014.

    Comments: 28 pages, 11 figures

    Journal ref: INFORMS Journal on Computing. Volume 27, Issue 3, pp. 525-543, 2015

  26. arXiv:1401.1590  [pdf, other

    math.OC

    An Approximate Dynamic Programming Algorithm for Monotone Value Functions

    Authors: Daniel R. Jiang, Warren B. Powell

    Abstract: Many sequential decision problems can be formulated as Markov Decision Processes (MDPs) where the optimal value function (or cost-to-go function) can be shown to satisfy a monotone structure in some or all of its dimensions. When the state space becomes large, traditional techniques, such as the backward dynamic programming algorithm (i.e., backward induction or value iteration), may no longer be… ▽ More

    Submitted 1 September, 2015; v1 submitted 8 January, 2014; originally announced January 2014.

    Comments: 35 pages, 11 figures

  27. arXiv:1401.0843  [pdf, other

    math.OC cs.LG

    Least Squares Policy Iteration with Instrumental Variables vs. Direct Policy Search: Comparison Against Optimal Benchmarks Using Energy Storage

    Authors: Warren R. Scott, Warren B. Powell, Somayeh Moazehi

    Abstract: This paper studies approximate policy iteration (API) methods which use least-squares Bellman error minimization for policy evaluation. We address several of its enhancements, namely, Bellman error minimization using instrumental variables, least-squares projected Bellman error minimization, and projected Bellman error minimization using instrumental variables. We prove that for a general discrete… ▽ More

    Submitted 4 January, 2014; originally announced January 2014.

    Comments: 37 pages, 9 figures

  28. arXiv:1006.4338  [pdf, other

    math.OC stat.ML

    Stochastic Search with an Observable State Variable

    Authors: Lauren A. Hannah, Warren B. Powell, David M. Blei

    Abstract: In this paper we study convex stochastic search problems where a noisy objective function value is observed after a decision is made. There are many stochastic search problems whose behavior depends on an exogenous state variable which affects the shape of the objective function. Currently, there is no general purpose algorithm to solve this class of problems. We use nonparametric density estimati… ▽ More

    Submitted 15 July, 2010; v1 submitted 22 June, 2010; originally announced June 2010.

  29. arXiv:0909.5194  [pdf, other

    stat.ML

    Dirichlet Process Mixtures of Generalized Linear Models

    Authors: Lauren A. Hannah, David M. Blei, Warren B. Powell

    Abstract: We propose Dirichlet Process mixtures of Generalized Linear Models (DP-GLM), a new method of nonparametric regression that accommodates continuous and categorical inputs, and responses that can be modeled by a generalized linear model. We prove conditions for the asymptotic unbiasedness of the DP-GLM regression mean function estimate. We also give examples for when those conditions hold, including… ▽ More

    Submitted 15 July, 2010; v1 submitted 28 September, 2009; originally announced September 2009.