Search | arXiv e-print repository

arXiv:2010.15525 [pdf, other]

doi 10.1287/ijoc.2021.1100

Self-Learning Threshold-Based Load Balancing

Authors: Diego Goldsztajn, Sem C. Borst, Johan S. H. van Leeuwaarden, Debankur Mukherjee, Philip A. Whiting

Abstract: We consider a large-scale service system where incoming tasks have to be instantaneously dispatched to one out of many parallel server pools. The user-perceived performance degrades with the number of concurrent tasks and the dispatcher aims at maximizing the overall quality-of-service by balancing the load through a simple threshold policy. We demonstrate that such a policy is optimal on the flui… ▽ More We consider a large-scale service system where incoming tasks have to be instantaneously dispatched to one out of many parallel server pools. The user-perceived performance degrades with the number of concurrent tasks and the dispatcher aims at maximizing the overall quality-of-service by balancing the load through a simple threshold policy. We demonstrate that such a policy is optimal on the fluid and diffusion scales, while only involving a small communication overhead, which is crucial for large-scale deployments. In order to set the threshold optimally, it is important, however, to learn the load of the system, which may be unknown. For that purpose, we design a control rule for tuning the threshold in an online manner. We derive conditions which guarantee that this adaptive threshold settles at the optimal value, along with estimates for the time until this happens. In addition, we provide numerical experiments which support the theoretical results and further indicate that our policy copes effectively with time-varying demand patterns. △ Less

Submitted 11 September, 2023; v1 submitted 29 October, 2020; originally announced October 2020.

Comments: 52 pages, 6 figures

MSC Class: 60F17; 60K25 (Primary) 68M20 (Secondary) ACM Class: C.4; G.3

Journal ref: INFORMS Journal on Computing, 34(1):39-54, 2022

arXiv:1612.00723 [pdf, ps, other]

doi 10.1287/stsy.2018.0016

Universality of Power-of-$d$ Load Balancing in Many-Server Systems

Authors: Debankur Mukherjee, Sem C. Borst, Johan S. H. van Leeuwaarden, Philip A. Whiting

Abstract: We consider a system of $N$ parallel single-server queues with unit exponential service rates and a single dispatcher where tasks arrive as a Poisson process of rate $λ(N)$. When a task arrives, the dispatcher assigns it to a server with the shortest queue among $d(N)$ randomly selected servers ($1 \leq d(N) \leq N$). This load balancing strategy is referred to as a JSQ($d(N)$) scheme, marking tha… ▽ More We consider a system of $N$ parallel single-server queues with unit exponential service rates and a single dispatcher where tasks arrive as a Poisson process of rate $λ(N)$. When a task arrives, the dispatcher assigns it to a server with the shortest queue among $d(N)$ randomly selected servers ($1 \leq d(N) \leq N$). This load balancing strategy is referred to as a JSQ($d(N)$) scheme, marking that it subsumes the celebrated Join-the-Shortest Queue (JSQ) policy as a crucial special case for $d(N) = N$. We construct a stochastic coupling to bound the difference in the queue length processes between the JSQ policy and a scheme with an arbitrary value of $d(N)$. We use the coupling to derive the fluid limit in the regime where $λ(N) / N \to λ< 1$ as $N \to \infty$ with $d(N) \to\infty$, along with the associated fixed point. The fluid limit turns out not to depend on the exact growth rate of $d(N)$, and in particular coincides with that for the JSQ policy. We further leverage the coupling to establish that the diffusion limit in the critical regime where $(N - λ(N)) / \sqrt{N} \to β> 0$ as $N \to \infty$ with $d(N)/(\sqrt{N} \log (N))\to\infty$ corresponds to that for the JSQ policy. These results indicate that the optimality of the JSQ policy can be preserved at the fluid-level and diffusion-level while reducing the overhead by nearly a factor O($N$) and O($\sqrt{N}/\log(N)$), respectively. △ Less

Submitted 16 November, 2018; v1 submitted 2 December, 2016; originally announced December 2016.

Comments: 39 pages, 2 figures, companion paper of arXiv:1612.00722

Journal ref: Stoch.Syst. 8 4 (2018) 265-292

arXiv:1612.00722 [pdf, ps, other]

doi 10.1287/moor.2019.1042

Asymptotic Optimality of Power-of-$d$ Load Balancing in Large-Scale Systems

Authors: Debankur Mukherjee, Sem C. Borst, Johan S. H. van Leeuwaarden, Philip A. Whiting

Abstract: We consider a system of $N$ identical server pools and a single dispatcher where tasks arrive as a Poisson process of rate $λ(N)$. Arriving tasks cannot be queued, and must immediately be assigned to one of the server pools to start execution, or discarded. The execution times are assumed to be exponentially distributed with unit mean, and do not depend on the number of other tasks receiving servi… ▽ More We consider a system of $N$ identical server pools and a single dispatcher where tasks arrive as a Poisson process of rate $λ(N)$. Arriving tasks cannot be queued, and must immediately be assigned to one of the server pools to start execution, or discarded. The execution times are assumed to be exponentially distributed with unit mean, and do not depend on the number of other tasks receiving service. However, the experienced performance (e.g. in terms of received throughput) does degrade with an increasing number of concurrent tasks at the same server pool. The dispatcher therefore aims to evenly distribute the tasks across the various server pools. Specifically, when a task arrives, the dispatcher assigns it to the server pool with the minimum number of tasks among $d(N)$ randomly selected server pools. This assignment strategy is called the JSQ$(d(N))$ scheme, as it resembles the power-of-$d$ version of the Join-the-Shortest-Queue (JSQ) policy, and will also be referred to as such in the special case $d(N) = N$. We construct a stochastic coupling to bound the difference in the system occupancy processes between the JSQ policy and a scheme with an arbitrary value of $d(N)$. We use the coupling to derive the fluid limit in case $d(N) \to \infty$ and $λ(N)/N \to λ$ as $N \to \infty$, along with the associated fixed point. The fluid limit turns out to be insensitive to the exact growth rate of $d(N)$, and coincides with that for the JSQ policy. We further leverage the coupling to establish that the diffusion limit corresponds to that for the JSQ policy as well, as long as $d(N)/\sqrt{N} \log(N) \to \infty$, and characterize the common limiting diffusion process. These results indicate that the JSQ optimality can be preserved at the fluid-level and diffusion-level while reducing the overhead by nearly a factor O($N$) and O($\sqrt{N}/\log(N)$), respectively. △ Less

Submitted 2 December, 2016; originally announced December 2016.

Comments: 48 pages, 3 figures, companion paper of arXiv:1612.00723

Journal ref: Math. Oper. Res. 45 4 (2020) 1535-1571

arXiv:1611.09723 [pdf, other]

Mean-field limits for large-scale random-access networks

Authors: Fabio Cecchi, Sem C. Borst, Johan S. H. van Leeuwaarden, Philip A. Whiting

Abstract: We establish mean-field limits for large-scale random-access networks with buffer dynamics and arbitrary interference graphs. While saturated-buffer scenarios have been widely investigated and yield useful throughput estimates for persistent sessions, they fail to capture the fluctuations in buffer contents over time, and provide no insight in the delay performance of flows with intermittent packe… ▽ More We establish mean-field limits for large-scale random-access networks with buffer dynamics and arbitrary interference graphs. While saturated-buffer scenarios have been widely investigated and yield useful throughput estimates for persistent sessions, they fail to capture the fluctuations in buffer contents over time, and provide no insight in the delay performance of flows with intermittent packet arrivals. Motivated by that issue, we explore in the present paper random-access networks with buffer dynamics, where flows with empty buffers refrain from competition for the medium. The occurrence of empty buffers thus results in a complex dynamic interaction between activity states and buffer contents, which severely complicates the performance analysis. Hence we focus on a many-sources regime where the total number of nodes grows large, which not only offers mathematical tractability but is also highly relevant with the densification of wireless networks as the Internet of Things emerges. We exploit time scale separation properties to prove that the properly scaled buffer occupancy process converges to the solution of a deterministic initial-value problem, and establish the existence and uniqueness of the associated fixed point. This approach simplifies the performance analysis of networks with huge numbers of nodes to a low-dimensional fixed-point calculation. For the case of a complete interference graph, we demonstrate asymptotic stability, provide a simple closed-form expression for the fixed point, and prove interchange of the mean-field and steady-state limits. This yields asymptotically exact approximations for key performance metrics, in particular the stationary buffer content and packet delay distributions. The methodological framework that we develop easily extends to various model refinements as will be illustrated by several examples. △ Less

Submitted 24 April, 2019; v1 submitted 29 November, 2016; originally announced November 2016.

arXiv:1510.02657 [pdf, ps, other]

doi 10.1017/jpr.2016.68

Universality of Load Balancing Schemes on Diffusion Scale

Authors: D. Mukherjee, S. C. Borst, J. S. H. van Leeuwaarden, P. A. Whiting

Abstract: We consider a system of $N$ parallel queues with identical exponential service rates and a single dispatcher where tasks arrive as a Poisson process. When a task arrives, the dispatcher always assigns it to an idle server, if there is any, and to a server with the shortest queue among $d$ randomly selected servers otherwise $(1 \leq d \leq N)$. This load balancing scheme subsumes the so-called Joi… ▽ More We consider a system of $N$ parallel queues with identical exponential service rates and a single dispatcher where tasks arrive as a Poisson process. When a task arrives, the dispatcher always assigns it to an idle server, if there is any, and to a server with the shortest queue among $d$ randomly selected servers otherwise $(1 \leq d \leq N)$. This load balancing scheme subsumes the so-called Join-the-Idle Queue (JIQ) policy $(d = 1)$ and the celebrated Join-the-Shortest Queue (JSQ) policy $(d = N)$ as two crucial special cases. We develop a stochastic coupling construction to obtain the diffusion limit of the queue process in the Halfin-Whitt heavy-traffic regime, and establish that it does not depend on the value of $d$, implying that assigning tasks to idle servers is sufficient for diffusion level optimality. △ Less

Submitted 3 March, 2016; v1 submitted 9 October, 2015; originally announced October 2015.

Journal ref: J.Appl.Probab. 53 4 (2016) 1111-1124

arXiv:1211.3958 [pdf, other]

Asymptotic Results for Random Polynomials on the Unit Circle

Authors: Gabriel H. Tucci, Philip A. Whiting

Abstract: In this paper we study the asymptotic behavior of the maximum magnitude of a complex random polynomial with i.i.d. uniformly distributed random roots on the unit circle. More specifically, let $\{n_k\}_{k=1}^{\infty}$ be an infinite sequence of positive integers and let $\{z_{k}\}_{k=1}^{\infty}$ be a sequence of i.i.d. uniform distributed random variables on the unit circle. The above pair of seq… ▽ More In this paper we study the asymptotic behavior of the maximum magnitude of a complex random polynomial with i.i.d. uniformly distributed random roots on the unit circle. More specifically, let $\{n_k\}_{k=1}^{\infty}$ be an infinite sequence of positive integers and let $\{z_{k}\}_{k=1}^{\infty}$ be a sequence of i.i.d. uniform distributed random variables on the unit circle. The above pair of sequences determine a sequence of random polynomials $P_{N}(z) = \prod_{k=1}^{N}{(z-z_k)^{n_k}}$ with random roots on the unit circle and their corresponding multiplicities. In this work, we show that subject to a certain regularity condition on the sequence $\{n_k\}_{k=1}^{\infty}$, the log maximum magnitude of these polynomials scales as $s_{N}I^{*}$ where $s_{N}^{2}=\sum_{k=1}^{N}{n_{k}^{2}}$ and $I^{*}$ is a strictly positive random variable. △ Less

Submitted 16 November, 2012; originally announced November 2012.

Comments: 14 pages. arXiv admin note: text overlap with arXiv:1202.3184

arXiv:1202.3184 [pdf, ps, other]

Asymptotic Behavior of the Maximum and Minimum Singular Value of Random Vandermonde Matrices

Authors: Gabriel H. Tucci, Philip A. Whiting

Abstract: This work examines various statistical distributions in connection with random Vandermonde matrices and their extension to $d$--dimensional phase distributions. Upper and lower bound asymptotics for the maximum singular value are found to be $O(\log^{1/2}{N^{d}})$ and $Ω((\log N^{d} /(\log \log N^d))^{1/2})$ respectively where $N$ is the dimension of the matrix, generalizing the results in \cite{T… ▽ More This work examines various statistical distributions in connection with random Vandermonde matrices and their extension to $d$--dimensional phase distributions. Upper and lower bound asymptotics for the maximum singular value are found to be $O(\log^{1/2}{N^{d}})$ and $Ω((\log N^{d} /(\log \log N^d))^{1/2})$ respectively where $N$ is the dimension of the matrix, generalizing the results in \cite{TW}. We further study the behavior of the minimum singular value of these random matrices. In particular, we prove that the minimum singular value is at most $N\exp(-C\sqrt{N}))$ with high probability where $C$ is a constant independent on $N$. Furthermore, the value of the constant $C$ is determined explicitly. The main result is obtained in two different ways. One approach uses techniques from stochastic processes and in particular, a construction related to the Brownian bridge. The other one is a more direct analytical approach involving combinatorics and complex analysis. As a consequence, we obtain a lower bound for the maximum absolute value of a random complex polynomial on the unit circle, which may be of independent mathematical interest. Lastly, for each sequence of positive integers ${k_p}_{p=1}^{\infty}$ we present a generalized version of the previously discussed matrices. The classical random Vandermonde matrix corresponds to the sequence $k_{p}=p-1$. We find a combinatorial formula for their moments and we show that the limit eigenvalue distribution converges to a probability measure supported on $[0,\infty)$. Finally, we show that for the sequence $k_p=2^{p}$ the limit eigenvalue distribution is the famous Marchenko--Pastur distribution. △ Less

Submitted 16 November, 2012; v1 submitted 14 February, 2012; originally announced February 2012.

Comments: Accepted for publication at Journal of Theoretical Probability

MSC Class: 15B52; 15B51; 60B20

arXiv:1010.0624 [pdf, other]

Eigenvalue Results for Large Scale Random Vandermonde Matrices with Unit Complex Entries

Authors: Gabriel H. Tucci, Philip A. Whiting

Abstract: This paper centers on the limit eigenvalue distribution for random Vandermonde matrices with unit magnitude complex entries. The phases of the entries are chosen independently and identically distributed from the interval $[-π,π]$. Various types of distribution for the phase are considered and we establish the existence of the empirical eigenvalue distribution in the large matrix limit on a wide r… ▽ More This paper centers on the limit eigenvalue distribution for random Vandermonde matrices with unit magnitude complex entries. The phases of the entries are chosen independently and identically distributed from the interval $[-π,π]$. Various types of distribution for the phase are considered and we establish the existence of the empirical eigenvalue distribution in the large matrix limit on a wide range of cases. The rate of growth of the maximum eigenvalue is examined and shown to be no greater than $O(\log N)$ and no slower than $O(\log N/\log\log N)$ where $N$ is the dimension of the matrix. Additional results include the existence of the capacity of the Vandermonde channel (limit integral for the expected log determinant). △ Less

Submitted 4 October, 2010; originally announced October 2010.

Comments: Submitted to Transactions of Information Theory

Showing 1–8 of 8 results for author: Whiting, P A