-
Switch Updating in SPSA Algorithm for Stochastic Optimization with Inequality Constraints
Authors:
Zhichao Jia,
Ziyi Wei,
James C. Spall
Abstract:
Simultaneous perturbation stochastic approximation (SPSA) is widely used in stochastic optimization due to its high efficiency, asymptotic stability, and reduced number of required loss function measurements. However, the standard SPSA algorithm needs to be modified to deal with constrained problems. In recent years, sequential quadratic programming (SQP)-based projection ideas and penalty ideas h…
▽ More
Simultaneous perturbation stochastic approximation (SPSA) is widely used in stochastic optimization due to its high efficiency, asymptotic stability, and reduced number of required loss function measurements. However, the standard SPSA algorithm needs to be modified to deal with constrained problems. In recent years, sequential quadratic programming (SQP)-based projection ideas and penalty ideas have been analyzed. Both ideas have convergence results and a potentially wide range of applications, but with some limitations in practical consideration, such as computation time, complexity, and feasibility guarantee. We propose an SPSA-based switch updating algorithm, which updates based on the loss function or the inequality constraints, depending on current feasibility in each iteration. We show convergence results for the algorithm, and analyze its properties relative to other methods. We also numerically compare the switch updating algorithm with the penalty function approach for two constrained examples.
△ Less
Submitted 5 February, 2023;
originally announced February 2023.
-
Discrete Stochastic Optimization for Public Health Interventions with Constraints
Authors:
Zewei Li,
James C. Spall
Abstract:
Many public health threats exist, motivating the need to find optimal intervention strategies. Given the stochastic nature of the threats (e.g., the spread of pandemic influenza, the occurrence of drug overdoses, and the prevalence of alcohol-related threats), deterministic optimization approaches may be inappropriate. In this paper, we implement a stochastic optimization method to address aspects…
▽ More
Many public health threats exist, motivating the need to find optimal intervention strategies. Given the stochastic nature of the threats (e.g., the spread of pandemic influenza, the occurrence of drug overdoses, and the prevalence of alcohol-related threats), deterministic optimization approaches may be inappropriate. In this paper, we implement a stochastic optimization method to address aspects of the 2009 H1N1 and the COVID-19 pandemics, with the spread of disease modeled by the open source Monte Carlo simulations, FluTE and Covasim, respectively. Without testing every possible option, the objective of the optimization is to determine the best combination of intervention strategies so as to result in minimal economic loss to society. To reach our objective, this application-oriented paper uses the discrete simultaneous perturbation stochastic approximation method (DSPSA), a recursive simulation-based optimization algorithm, to update the input parameters in the disease simulation software so that the output iteratively approaches minimal economic loss. Assuming that the simulation models for the spread of disease (FluTE for H1N1 and Covasim for COVID-19 in our case) are accurate representations for the population being studied, the simulation-based strategy we present provides decision makers a powerful tool to mitigate potential human and economic losses from any epidemic. The basic approach is also applicable in other public health problems, such as opioid abuse and drunk driving.
△ Less
Submitted 27 June, 2022;
originally announced June 2022.
-
Efficient Implementation of Second-Order Stochastic Approximation Algorithms in High-Dimensional Problems
Authors:
**gyi Zhu,
Long Wang,
James C. Spall
Abstract:
Stochastic approximation (SA) algorithms have been widely applied in minimization problems when the loss functions and/or the gradient information are only accessible through noisy evaluations. Stochastic gradient (SG) descent---a first-order algorithm and a workhorse of much machine learning---is perhaps the most famous form of SA. Among all SA algorithms, the second-order simultaneous perturbati…
▽ More
Stochastic approximation (SA) algorithms have been widely applied in minimization problems when the loss functions and/or the gradient information are only accessible through noisy evaluations. Stochastic gradient (SG) descent---a first-order algorithm and a workhorse of much machine learning---is perhaps the most famous form of SA. Among all SA algorithms, the second-order simultaneous perturbation stochastic approximation (2SPSA) and the second-order stochastic gradient (2SG) are particularly efficient in handling high-dimensional problems, covering both gradient-free and gradient-based scenarios. However, due to the necessary matrix operations, the per-iteration floating-point-operations (FLOPs) cost of the standard 2SPSA/2SG is $O(p^3)$, where $p$ is the dimension of the underlying parameter. Note that the $O(p^3)$ FLOPs cost is distinct from the classical SPSA-based per-iteration $O(1)$ cost in terms of the number of noisy function evaluations. In this work, we propose a technique to efficiently implement the 2SPSA/2SG algorithms via the symmetric indefinite matrix factorization and show that the FLOPs cost is reduced from $O(p^3)$ to $O(p^2)$. The formal almost sure convergence and rate of convergence for the newly proposed approach are directly inherited from the standard 2SPSA/2SG. The improvement in efficiency and numerical stability is demonstrated in two numerical studies.
△ Less
Submitted 22 August, 2019; v1 submitted 22 June, 2019;
originally announced June 2019.
-
Error Analysis for the Particle Filter: Methods and Theoretical Support
Authors:
Ziyu Liu,
Shihong Wei,
James C. Spall
Abstract:
The particle filter is a popular Bayesian filtering algorithm for use in cases where the state-space model is nonlinear and/or the random terms (initial state or noises) are non-Gaussian distributed. We study the behavior of the error in the particle filter algorithm as the number of particles gets large. After a decomposition of the error into two terms, we show that the difference between the es…
▽ More
The particle filter is a popular Bayesian filtering algorithm for use in cases where the state-space model is nonlinear and/or the random terms (initial state or noises) are non-Gaussian distributed. We study the behavior of the error in the particle filter algorithm as the number of particles gets large. After a decomposition of the error into two terms, we show that the difference between the estimator and the conditional mean is asymptotically normal when the resampling is done at every step in the filtering process. Two nonlinear/non-Gaussian examples are tested to verify this conclusion.
△ Less
Submitted 26 March, 2019;
originally announced March 2019.
-
Modeling Traffic Networks Using Integrated Route and Link Data
Authors:
Xilei Zhao,
James C. Spall
Abstract:
Real-time navigation services, such as Google Maps and Waze, are widely used in daily life. These services provide rich data resources in real-time traffic conditions and travel time predictions; however, they have not been fully applied in transportation modeling. This paper aims to use traffic data from Google Maps and applying cutting-edge technologies in maximum likelihood estimation to model…
▽ More
Real-time navigation services, such as Google Maps and Waze, are widely used in daily life. These services provide rich data resources in real-time traffic conditions and travel time predictions; however, they have not been fully applied in transportation modeling. This paper aims to use traffic data from Google Maps and applying cutting-edge technologies in maximum likelihood estimation to model traffic networks and travel time reliability. This paper integrates Google Maps travel time data for routes and traffic condition data for links to model the complexities of traffic networks. We then formulate the Fisher information matrix and apply the asymptotic normality to obtain the probability distribution of the travel time estimates for a random route within the network of interest. We also derive the travel time reliability by considering two levels of uncertainties, i.e., the uncertainty of the route's travel time and the uncertainty of its travel time estimates. The proposed method could provide a more realistic and precise travel time reliability estimate. The methodology is applied to a small network in the downtown Baltimore area, where we propose a link data collection strategy and provide empirical evidence to show data independence by following this strategy. We also show results for maximum likelihood estimates and travel time reliability measures for different routes within the network. Furthermore, we use the historical data from a different network to validate this approach, showing our method provides a more accurate and precise estimate compared to the sample mean of the empirical data.
△ Less
Submitted 3 November, 2018;
originally announced November 2018.