Search | arXiv e-print repository

Optimal Experimental Design for Partially Observable Pure Birth Processes

Authors: Ali Eshragh, Matthew P. Skerritt, Bruno Salvy, Thomas McCallum

Abstract: We develop an efficient algorithm to find optimal observation times by maximizing the Fisher information for the birth rate of a partially observable pure birth process involving $n$ observations. Partially observable implies that at each of the $n$ observation time points for counting the number of individuals present in the pure birth process, each individual is observed independently with a fix… ▽ More We develop an efficient algorithm to find optimal observation times by maximizing the Fisher information for the birth rate of a partially observable pure birth process involving $n$ observations. Partially observable implies that at each of the $n$ observation time points for counting the number of individuals present in the pure birth process, each individual is observed independently with a fixed probability $p$, modeling detection difficulties or constraints on resources. We apply concepts and techniques from generating functions, using a combination of symbolic and numeric computation, to establish a recursion for evaluating and optimizing the Fisher information. Our numerical results reveal the efficacy of this new method. An implementation of the algorithm is available publicly. △ Less

Submitted 20 February, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

arXiv:2402.03525 [pdf, ps, other]

Deep Reinforcement Learning for Picker Routing Problem in Warehousing

Authors: George Dunn, Hadi Charkhgard, Ali Eshragh, Sasan Mahmoudinazlou, Elizabeth Stojanovski

Abstract: Order Picker Routing is a critical issue in Warehouse Operations Management. Due to the complexity of the problem and the need for quick solutions, suboptimal algorithms are frequently employed in practice. However, Reinforcement Learning offers an appealing alternative to traditional heuristics, potentially outperforming existing methods in terms of speed and accuracy. We introduce an attention b… ▽ More Order Picker Routing is a critical issue in Warehouse Operations Management. Due to the complexity of the problem and the need for quick solutions, suboptimal algorithms are frequently employed in practice. However, Reinforcement Learning offers an appealing alternative to traditional heuristics, potentially outperforming existing methods in terms of speed and accuracy. We introduce an attention based neural network for modeling picker tours, which is trained using Reinforcement Learning. Our method is evaluated against existing heuristics across a range of problem parameters to demonstrate its efficacy. A key advantage of our proposed method is its ability to offer an option to reduce the perceived complexity of routes. △ Less

Submitted 5 February, 2024; originally announced February 2024.

arXiv:2401.00122 [pdf, other]

SALSA: Sequential Approximate Leverage-Score Algorithm with Application in Analyzing Big Time Series Data

Authors: Ali Eshragh, Luke Yerbury, Asef Nazari, Fred Roosta, Michael W. Mahoney

Abstract: We develop a new efficient sequential approximate leverage score algorithm, SALSA, using methods from randomized numerical linear algebra (RandNLA) for large matrices. We demonstrate that, with high probability, the accuracy of SALSA's approximations is within $(1 + O({\varepsilon}))$ of the true leverage scores. In addition, we show that the theoretical computational complexity and numerical accu… ▽ More We develop a new efficient sequential approximate leverage score algorithm, SALSA, using methods from randomized numerical linear algebra (RandNLA) for large matrices. We demonstrate that, with high probability, the accuracy of SALSA's approximations is within $(1 + O({\varepsilon}))$ of the true leverage scores. In addition, we show that the theoretical computational complexity and numerical accuracy of SALSA surpass existing approximations. These theoretical results are subsequently utilized to develop an efficient algorithm, named LSARMA, for fitting an appropriate ARMA model to large-scale time series data. Our proposed algorithm is, with high probability, guaranteed to find the maximum likelihood estimates of the parameters for the true underlying ARMA model. Furthermore, it has a worst-case running time that significantly improves those of the state-of-the-art alternatives in big data regimes. Empirical results on large-scale data strongly support these theoretical results and underscore the efficacy of our new approach. △ Less

Submitted 29 December, 2023; originally announced January 2024.

Comments: 42 pages, 7 figures

MSC Class: 62M10

arXiv:2212.12832 [pdf, ps, other]

Large Markov Decision Processes and Combinatorial Optimization

Authors: Ali Eshragh

Abstract: Markov decision processes continue to gain in popularity for modeling a wide range of applications ranging from analysis of supply chains and queuing networks to cognitive science and control of autonomous vehicles. Nonetheless, they tend to become numerically intractable as the size of the model grows fast. Recent works use machine learning techniques to overcome this crucial issue, but with no c… ▽ More Markov decision processes continue to gain in popularity for modeling a wide range of applications ranging from analysis of supply chains and queuing networks to cognitive science and control of autonomous vehicles. Nonetheless, they tend to become numerically intractable as the size of the model grows fast. Recent works use machine learning techniques to overcome this crucial issue, but with no convergence guarantee. This note provides a brief overview of literature on solving large Markov decision processes, and exploiting them to solve important combinatorial optimization problems. △ Less

Submitted 6 December, 2023; v1 submitted 24 December, 2022; originally announced December 2022.

arXiv:2212.02255 [pdf, other]

A Hybrid Statistical-Machine Learning Approach for Analysing Online Customer Behavior: An Empirical Study

Authors: Saed Alizamir, Kasun Bandara, Ali Eshragh, Foaad Iravani

Abstract: We apply classical statistical methods in conjunction with the state-of-the-art machine learning techniques to develop a hybrid interpretable model to analyse 454,897 online customers' behavior for a particular product category at the largest online retailer in China, that is JD. While most mere machine learning methods are plagued by the lack of interpretability in practice, our novel hybrid appr… ▽ More We apply classical statistical methods in conjunction with the state-of-the-art machine learning techniques to develop a hybrid interpretable model to analyse 454,897 online customers' behavior for a particular product category at the largest online retailer in China, that is JD. While most mere machine learning methods are plagued by the lack of interpretability in practice, our novel hybrid approach will address this practical issue by generating explainable output. This analysis involves identifying what features and characteristics have the most significant impact on customers' purchase behavior, thereby enabling us to predict future sales with a high level of accuracy, and identify the most impactful variables. Our results reveal that customers' product choice is insensitive to the promised delivery time, but this factor significantly impacts customers' order quantity. We also show that the effectiveness of various discounting methods depends on the specific product and the discount size. We identify product classes for which certain discounting approaches are more effective and provide recommendations on better use of different discounting tools. Customers' choice behavior across different product classes is mostly driven by price, and to a lesser extent, by customer demographics. The former finding asks for exercising care in deciding when and how much discount should be offered, whereas the latter identifies opportunities for personalized ads and targeted marketing. Further, to curb customers' batch ordering behavior and avoid the undesirable Bullwhip effect, JD should improve its logistics to ensure faster delivery of orders. △ Less

Submitted 1 December, 2022; originally announced December 2022.

arXiv:2112.12994 [pdf, other]

Toeplitz Least Squares Problems, Fast Algorithms and Big Data

Authors: Ali Eshragh, Oliver Di Pietro, Michael A. Saunders

Abstract: In time series analysis, when fitting an autoregressive model, one must solve a Toeplitz ordinary least squares problem numerous times to find an appropriate model, which can severely affect computational times with large data sets. Two recent algorithms (LSAR and Repeated Halving) have applied randomized numerical linear algebra (RandNLA) techniques to fitting an autoregressive model to big time-… ▽ More In time series analysis, when fitting an autoregressive model, one must solve a Toeplitz ordinary least squares problem numerous times to find an appropriate model, which can severely affect computational times with large data sets. Two recent algorithms (LSAR and Repeated Halving) have applied randomized numerical linear algebra (RandNLA) techniques to fitting an autoregressive model to big time-series data. We investigate and compare the quality of these two approximation algorithms on large-scale synthetic and real-world data. While both algorithms display comparable results for synthetic datasets, the LSAR algorithm appears to be more robust when applied to real-world time series data. We conclude that RandNLA is effective in the context of big-data time series. △ Less

Submitted 24 December, 2021; originally announced December 2021.

Comments: 28 pages, 11 figures

MSC Class: 62M10; 68T09; 62R07

arXiv:2103.09175 [pdf, other]

Rollage: Efficient Rolling Average Algorithm to Estimate ARMA Models for Big Time Series Data

Authors: Ali Eshragh, Glen Livingston, Thomas McCarthy McCann, Luke Yerbury

Abstract: We develop a new efficient algorithm for the analysis of large-scale time series data. We firstly define rolling averages, derive their analytical properties, and establish their asymptotic distribution. These theoretical results are subsequently exploited to develop an efficient algorithm, called Rollage, for fitting an appropriate AR model to big time series data. When used in conjunction with t… ▽ More We develop a new efficient algorithm for the analysis of large-scale time series data. We firstly define rolling averages, derive their analytical properties, and establish their asymptotic distribution. These theoretical results are subsequently exploited to develop an efficient algorithm, called Rollage, for fitting an appropriate AR model to big time series data. When used in conjunction with the Durbin's algorithm, we show that the Rollage algorithm can be used as a criterion to optimally fit ARMA models to big time series data. Empirical experiments on large-scale synthetic time series data support the theoretical results and reveal the efficacy of this new approach, especially when compared to existing methodology. △ Less

Submitted 23 December, 2022; v1 submitted 16 March, 2021; originally announced March 2021.

MSC Class: 62M10; 62R07

arXiv:2012.14933 [pdf, ps, other]

Surprise Maximization: A Dynamic Programming Approach

Authors: Ali Eshragh

Abstract: Borwein et al. (2000) solved a surprise maximization problem by applying results from convex analysis and mathematical programming. Although, their proof is elegant, it requires advanced knowledge from both areas to understand it. Here, we provide another approach to derive an optimal solution of the problem by utilizing dynamic programming. Borwein et al. (2000) solved a surprise maximization problem by applying results from convex analysis and mathematical programming. Although, their proof is elegant, it requires advanced knowledge from both areas to understand it. Here, we provide another approach to derive an optimal solution of the problem by utilizing dynamic programming. △ Less

Submitted 29 December, 2020; originally announced December 2020.

MSC Class: 90C39 (Primary) 90C30; 90C25 (Secondary)

arXiv:2010.08920 [pdf, ps, other]

Average-reward model-free reinforcement learning: a systematic review and literature map**

Authors: Vektor Dewanto, George Dunn, Ali Eshragh, Marcus Gallagher, Fred Roosta

Abstract: Reinforcement learning is important part of artificial intelligence. In this paper, we review model-free reinforcement learning that utilizes the average reward optimality criterion in the infinite horizon setting. Motivated by the solo survey by Mahadevan (1996a), we provide an updated review of work in this area and extend it to cover policy-iteration and function approximation methods (in addit… ▽ More Reinforcement learning is important part of artificial intelligence. In this paper, we review model-free reinforcement learning that utilizes the average reward optimality criterion in the infinite horizon setting. Motivated by the solo survey by Mahadevan (1996a), we provide an updated review of work in this area and extend it to cover policy-iteration and function approximation methods (in addition to the value-iteration and tabular counterparts). We present a comprehensive literature map**. We also identify and discuss opportunities for future work. △ Less

Submitted 3 August, 2021; v1 submitted 18 October, 2020; originally announced October 2020.

Comments: 36 pages, refined prelim and politer sections

arXiv:2005.12455 [pdf, other]

doi 10.1371/journal.pone.0240153

Modeling the Dynamics of the COVID-19 Population in Australia: A Probabilistic Analysis

Authors: Ali Eshragh, Saed Alizamir, Peter Howley, Elizabeth Stojanovski

Abstract: The novel Corona Virus COVID-19 arrived on Australian shores around 25 January 2020. This paper presents a novel method of dynamically modeling and forecasting the COVID-19 pandemic in Australia with a high degree of accuracy and in a timely manner using limited data; a valuable resource that can be used to guide government decision-making on societal restrictions on a daily and/or weekly basis. T… ▽ More The novel Corona Virus COVID-19 arrived on Australian shores around 25 January 2020. This paper presents a novel method of dynamically modeling and forecasting the COVID-19 pandemic in Australia with a high degree of accuracy and in a timely manner using limited data; a valuable resource that can be used to guide government decision-making on societal restrictions on a daily and/or weekly basis. The "partially-observable stochastic process" used in this study predicts not only the future actual values with extremely low error, but also the percentage of unobserved COVID-19 cases in the population. The model can further assist policy makers to assess the effectiveness of several possible alternative scenarios in their decision-making processes. △ Less

Submitted 25 May, 2020; originally announced May 2020.

Comments: 25 pages, 7 figures, 3 tables

MSC Class: 92D30; 62M20; 60J28 ACM Class: G.3

arXiv:1911.12321 [pdf, other]

LSAR: Efficient Leverage Score Sampling Algorithm for the Analysis of Big Time Series Data

Authors: Ali Eshragh, Fred Roosta, Asef Nazari, Michael W. Mahoney

Abstract: We apply methods from randomized numerical linear algebra (RandNLA) to develop improved algorithms for the analysis of large-scale time series data. We first develop a new fast algorithm to estimate the leverage scores of an autoregressive (AR) model in big data regimes. We show that the accuracy of approximations lies within $(1+\bigO{\varepsilon})$ of the true leverage scores with high probabili… ▽ More We apply methods from randomized numerical linear algebra (RandNLA) to develop improved algorithms for the analysis of large-scale time series data. We first develop a new fast algorithm to estimate the leverage scores of an autoregressive (AR) model in big data regimes. We show that the accuracy of approximations lies within $(1+\bigO{\varepsilon})$ of the true leverage scores with high probability. These theoretical results are subsequently exploited to develop an efficient algorithm, called LSAR, for fitting an appropriate AR model to big time series data. Our proposed algorithm is guaranteed, with high probability, to find the maximum likelihood estimates of the parameters of the underlying true AR model and has a worst case running time that significantly improves those of the state-of-the-art alternatives in big data regimes. Empirical results on large-scale synthetic as well as real data highly support the theoretical results and reveal the efficacy of this new approach. △ Less

Submitted 30 October, 2021; v1 submitted 27 November, 2019; originally announced November 2019.

Comments: 38 pages, 8 figures

arXiv:1911.00817 [pdf, other]

doi 10.1007/s10666-021-09806-1

The Importance of Environmental Factors in Forecasting Australian Power Demand

Authors: Ali Eshragh, Benjamin Ganim, Terry Perkins, Kasun Bandara

Abstract: We develop a time series model to forecast weekly peak power demand for three main states of Australia for a yearly time-scale, and show the crucial role of environmental factors in improving the forecasts. More precisely, we construct a seasonal autoregressive integrated moving average (SARIMA) model and reinforce it by employing the exogenous environmental variables including, maximum temperatur… ▽ More We develop a time series model to forecast weekly peak power demand for three main states of Australia for a yearly time-scale, and show the crucial role of environmental factors in improving the forecasts. More precisely, we construct a seasonal autoregressive integrated moving average (SARIMA) model and reinforce it by employing the exogenous environmental variables including, maximum temperature, minimum temperature, and solar exposure. The estimated hybrid SARIMA-regression model exhibits an excellent mean absolute percentage error (MAPE) of 3.41%. Moreover, our analysis demonstrates the importance of the environmental factors by showing a remarkable improvement of 46.3% in MAPE for the hybrid model over the crude SARIMA model which merely includes the power demand variables. In order to illustrate the efficacy of our model, we compare our outcome with the state-of-the-art machine learning methods in forecasting. The results reveal that our model outperforms the latter approach. △ Less

Submitted 30 October, 2021; v1 submitted 2 November, 2019; originally announced November 2019.

Comments: Keywords: Electricity power peak demand forecasting, Environmental factors, SARIMA-regression Model

MSC Class: 62M10; 97K80

arXiv:1909.02716 [pdf]

Demand Forecasting in the Presence of Systematic Events: Cases in Capturing Sales Promotions

Authors: Mahdi Abolghasemi, Ali Eshragh, Jason Hurley, Behnam Fahimnia

Abstract: Reliable demand forecasts are critical for the effective supply chain management. Several endogenous and exogenous variables can influence the dynamics of demand, and hence a single statistical model that only consists of historical sales data is often insufficient to produce accurate forecasts. In practice, the forecasts generated by baseline statistical models are often judgmentally adjusted by… ▽ More Reliable demand forecasts are critical for the effective supply chain management. Several endogenous and exogenous variables can influence the dynamics of demand, and hence a single statistical model that only consists of historical sales data is often insufficient to produce accurate forecasts. In practice, the forecasts generated by baseline statistical models are often judgmentally adjusted by forecasters to incorporate factors and information that are not incorporated in the baseline models. There are however systematic events whose effect can be effectively quantified and modeled to help minimize human intervention in adjusting the baseline forecasts. In this paper, we develop and test a novel regime-switching approach to quantify systematic information/events and objectively incorporate them into the baseline statistical model. Our simple yet practical and effective model can help limit forecast adjustments to only focus on the impact of less systematic events such as sudden climate change or dynamic market activities. The proposed model and approach is validated empirically using sales and promotional data from two Australian companies. Discussions focus on a thorough analysis of the forecasting and benchmarking results. Our analysis indicates that the proposed model can successfully improve the forecast accuracy when compared to the current industry practice which heavily relies on human judgment to factor in all types of information/events. △ Less

Submitted 6 September, 2019; originally announced September 2019.

arXiv:1905.05557 [pdf, ps, other]

doi 10.1016/j.orl.2020.04.007

An analytical bound on the fleet size in vehicle routing problems: a dynamic programming approach

Authors: Ali Eshragh, Rasul Esmaeilbeigi, Richard Middleton

Abstract: We present an analytical upper bound on the number of required vehicles for vehicle routing problems with split deliveries and any number of capacitated depots. We show that a fleet size greater than the proposed bound is not achievable based on a set of common assumptions. This property of the upper bound is proved through a dynamic programming approach. We also discuss the validity of the bound… ▽ More We present an analytical upper bound on the number of required vehicles for vehicle routing problems with split deliveries and any number of capacitated depots. We show that a fleet size greater than the proposed bound is not achievable based on a set of common assumptions. This property of the upper bound is proved through a dynamic programming approach. We also discuss the validity of the bound for a wide variety of routing problems with or without split deliveries. △ Less

Submitted 11 April, 2020; v1 submitted 14 May, 2019; originally announced May 2019.

arXiv:1901.10868 [pdf, ps, other]

Learning to Project in Multi-Objective Binary Linear Programming

Authors: Alvaro Sierra-Altamiranda, Hadi Charkhgard, Iman Dayarian, Ali Eshragh, Sorna Javadi

Abstract: In this paper, we investigate the possibility of improving the performance of multi-objective optimization solution approaches using machine learning techniques. Specifically, we focus on multi-objective binary linear programs and employ one of the most effective and recently developed criterion space search algorithms, the so-called KSA, during our study. This algorithm computes all nondominated… ▽ More In this paper, we investigate the possibility of improving the performance of multi-objective optimization solution approaches using machine learning techniques. Specifically, we focus on multi-objective binary linear programs and employ one of the most effective and recently developed criterion space search algorithms, the so-called KSA, during our study. This algorithm computes all nondominated points of a problem with p objectives by searching on a projected criterion space, i.e., a (p-1)-dimensional criterion apace. We present an effective and fast learning approach to identify on which projected space the KSA should work. We also present several generic features/variables that can be used in machine learning techniques for identifying the best projected space. Finally, we present an effective bi-objective optimization based heuristic for selecting the best subset of the features to overcome the issue of overfitting in learning. Through an extensive computational study over 2000 instances of tri-objective Knapsack and Assignment problems, we demonstrate that an improvement of up to 12% in time can be achieved by the proposed learning method compared to a random selection of the projected space. △ Less

Submitted 30 January, 2019; originally announced January 2019.

arXiv:1805.04725 [pdf, ps, other]

doi 10.1287/moor.2019.1009

Hamiltonian cycles and subsets of discounted occupational measures

Authors: Ali Eshragh, Jerzy A. Filar, Thomas Kalinowski, Sogol Mohammadian

Abstract: We study a certain polytope arising from embedding the Hamiltonian cycle problem in a discounted Markov decision process. The Hamiltonian cycle problem can be reduced to finding particular extreme points of a certain polytope associated with the input graph. This polytope is a subset of the space of discounted occupational measures. We characterize the feasible bases of the polytope for a general… ▽ More We study a certain polytope arising from embedding the Hamiltonian cycle problem in a discounted Markov decision process. The Hamiltonian cycle problem can be reduced to finding particular extreme points of a certain polytope associated with the input graph. This polytope is a subset of the space of discounted occupational measures. We characterize the feasible bases of the polytope for a general input graph $G$, and determine the expected numbers of different types of feasible bases when the underlying graph is random. We utilize these results to demonstrate that augmenting certain additional constraints to reduce the polyhedral domain can eliminate a large number of feasible bases that do not correspond to Hamiltonian cycles. Finally, we develop a random walk algorithm on the feasible bases of the reduced polytope and present some numerical results. We conclude with a conjecture on the feasible bases of the reduced polytope. △ Less

Submitted 25 January, 2019; v1 submitted 12 May, 2018; originally announced May 2018.

Comments: revised based on referees comments

MSC Class: 90C27; 90C35

arXiv:1804.07935 [pdf, ps, other]

Best subset selection in linear regression via bi-objective mixed integer linear programming

Authors: Hadi Charkhgard, Ali Eshragh

Abstract: We study the problem of choosing the best subset of p features in linear regression given n observations. This problem naturally contains two objective functions including minimizing the amount of bias and minimizing the number of predictors. The existing approaches transform the problem into a single-objective optimization problem. We explain the main weaknesses of existing approaches, and to ove… ▽ More We study the problem of choosing the best subset of p features in linear regression given n observations. This problem naturally contains two objective functions including minimizing the amount of bias and minimizing the number of predictors. The existing approaches transform the problem into a single-objective optimization problem. We explain the main weaknesses of existing approaches, and to overcome their drawbacks, we propose a bi-objective mixed integer linear programming approach. A computational study shows the efficacy of the proposed approach. △ Less

Submitted 21 April, 2018; originally announced April 2018.

Comments: 13 pages, 4 figures, 1 table

MSC Class: 62J05; 90C29

Showing 1–17 of 17 results for author: Eshragh, A