-
Diffusion Approximations of Speed-Aware Join-the-Shortest-Queue Scheme: Transient and Stationary Analysis
Authors:
Sanidhay Bhambay,
Burak Büke,
Arpan Mukhopadhyay
Abstract:
The Join-the-Shortest-Queue (JSQ) load balancing scheme is widely acknowledged for its effectiveness in minimizing the average response time for jobs in systems with identical servers. However, when applied to a heterogeneous server system with servers of different processing speeds, the JSQ scheme exhibits suboptimal performance. Recently, a variation of JSQ called the Speed-Aware-Join-the-Shorte…
▽ More
The Join-the-Shortest-Queue (JSQ) load balancing scheme is widely acknowledged for its effectiveness in minimizing the average response time for jobs in systems with identical servers. However, when applied to a heterogeneous server system with servers of different processing speeds, the JSQ scheme exhibits suboptimal performance. Recently, a variation of JSQ called the Speed-Aware-Join-the-Shortest-Queue (SA-JSQ) scheme has been shown to attain fluid limit optimality for systems with heterogeneous servers. In this paper, we examine the SA-JSQ scheme for heterogeneous server systems under the Halfin-Whitt regime. Our analysis begins by establishing that the scaled and centered version of the system state weakly converges to a diffusion process characterized by stochastic integral equations. Furthermore, we prove that the diffusion process is positive recurrent and the sequence of stationary measures for the scaled and centered queue length processes converge to the stationary measure for the limiting diffusion process. To achieve this result, we employ Stein's method with a generator expansion approach.
△ Less
Submitted 16 December, 2023;
originally announced December 2023.
-
Many-Server Queueing Systems with Heterogeneous Strategic Servers in Heavy Traffic
Authors:
Burak Büke,
Goncalo dos Reis,
Vadim Platonov
Abstract:
In most service systems, the servers are humans who desire to experience a certain level of idleness. In call centers, this manifests itself as the call avoidance behavior, where servers strategically adjust their service rate to strike a balance between the idleness they receive and effort to work harder. Moreover, being humans, each server values this trade-off differently and has different capa…
▽ More
In most service systems, the servers are humans who desire to experience a certain level of idleness. In call centers, this manifests itself as the call avoidance behavior, where servers strategically adjust their service rate to strike a balance between the idleness they receive and effort to work harder. Moreover, being humans, each server values this trade-off differently and has different capabilities. Drawing ideas on mean-field games we develop a novel framework relying on measure-valued processes to simultaneously address strategic server behavior and inherent server heterogeneity in service systems. This framework enables us to extend the recent literature on strategic servers in four new directions by: (i) incorporating individual choices of servers, (ii) incorporating individual abilities of servers, (iii) modeling the discomfort experienced by servers due to low levels of idleness, and (iv) considering more general routing policies. Using our framework, we are able to asymptotically characterize asymmetric Nash equilibria for many-server systems with strategic servers.
In simpler cases, it has been shown that the purely quality-driven regime is asymptotically optimal. However, we show that if the discomfort increases fast enough as the idleness approaches zero, the quality-and-efficiency-driven regime and other quality driven regimes can be optimal. This is the first time this conclusion appears in the literature.
△ Less
Submitted 8 November, 2022;
originally announced November 2022.
-
Many-Server Queues with Random Service Rates in the Halfin-Whitt Regime: A Measure-Valued Process Approach
Authors:
Burak Büke,
Wenyi Qin
Abstract:
We consider many-server queueing systems with heterogeneous exponential servers and renewal arrivals. The service rate of each server is a random variable drawn from a given distribution. We develop a framework for analyzing the heavy traffic limit of these queues in random environment using probability measure-valued stochastic processes. We introduce the measure-valued fairness process which den…
▽ More
We consider many-server queueing systems with heterogeneous exponential servers and renewal arrivals. The service rate of each server is a random variable drawn from a given distribution. We develop a framework for analyzing the heavy traffic limit of these queues in random environment using probability measure-valued stochastic processes. We introduce the measure-valued fairness process which denotes the proportion of cumulative idleness experienced by servers whose rates fall in a Borel subset of the support of the service rates. It can be shown that these fairness processes do not converge in the usual Skorokhod-$J_1$ topology, hence we introduce a new notion of convergence based on shifted versions of these processes. We also introduce some useful martingales to identify limiting fairness processes under different routing policies.
△ Less
Submitted 10 May, 2019;
originally announced May 2019.
-
Separable Approximations and Decomposition Methods for the Augmented Lagrangian
Authors:
Rachael Tappenden,
Peter Richtarik,
Burak Buke
Abstract:
In this paper we study decomposition methods based on separable approximations for minimizing the augmented Lagrangian. In particular, we study and compare the Diagonal Quadratic Approximation Method (DQAM) of Mulvey and Ruszczyński and the Parallel Coordinate Descent Method (PCDM) of Richtárik and Takáč. We show that the two methods are equivalent for feasibility problems up to the selection of a…
▽ More
In this paper we study decomposition methods based on separable approximations for minimizing the augmented Lagrangian. In particular, we study and compare the Diagonal Quadratic Approximation Method (DQAM) of Mulvey and Ruszczyński and the Parallel Coordinate Descent Method (PCDM) of Richtárik and Takáč. We show that the two methods are equivalent for feasibility problems up to the selection of a single step-size parameter. Furthermore, we prove an improved complexity bound for PCDM under strong convexity, and show that this bound is at least $8(L'/\bar{L})(ω-1)^2$ times better than the best known bound for DQAM, where $ω$ is the degree of partial separability and $L'$ and $\bar{L}$ are the maximum and average of the block Lipschitz constants of the gradient of the quadratic penalty appearing in the augmented Lagrangian.
△ Less
Submitted 30 August, 2013;
originally announced August 2013.
-
Cross-training with Imperfect training Schemes
Authors:
Burak Buke,
Ozgur M. Araz,
John W. Fowler
Abstract:
Cross-training workers is one of the most efficient ways to achieve flexibility in manufacturing and service systems to increase responsiveness to demand variability. However, it is generally the case that cross-trained employees are not as productive as employees who are originally trained on a specific task. Also, the productivity of the cross-trained workers depend on when they are cross-traine…
▽ More
Cross-training workers is one of the most efficient ways to achieve flexibility in manufacturing and service systems to increase responsiveness to demand variability. However, it is generally the case that cross-trained employees are not as productive as employees who are originally trained on a specific task. Also, the productivity of the cross-trained workers depend on when they are cross-trained. In this work, we consider a two-stage model to analyze the affect of variations in productivity levels of workers on cross-training policies. Our results indicate that the most important factor determining the problem structure is the consistency in productivity levels of workers trained at different times. As long as cross-training can be done in a consistent manner, the productivity differences between cross-trained workers and workers originally trained on the task plays a minor role. We also analyze the effect of the variabilities in demand and producivity levels. We show that if the productivity levels of workers trained at different times are consistent, the decision maker is inclined to defer the cross-training decisions as the variability of demand or productivity levels increases. However, when the productivities of workers trained at different times differ, the decision maker may prefer to invest more in cross-training earlier as variability increases.
△ Less
Submitted 26 August, 2013;
originally announced August 2013.