Markov Decision Process and Approximate Dynamic Programming for a Patient Assignment Scheduling problem

Małgorzata M. O’Reilly       Sebastian Krasnicki  22footnotemark: 2   James Montgomery 
  Mojtaba Heydar    Richard Turner    Pieter Van Dam    Peter Maree 
email: [email protected] of Natural Sciences, University of Tasmania, Hobart TAS 7001, Australia.School of Information & Communication Technology, University of Tasmania, Hobart TAS 7001, Australia.BHP, Perth, Western Australia.School of Medicine, University of Tasmania, Hobart TAS 7001, Australia.School of Nursing, University of Tasmania, Hobart TAS 7001, Australia.Strategy and Planning, Department of Health, Hobart TAS 7001, Australia.
(June 26, 2024)
Abstract

We study the Patient Assignment Scheduling (PAS) problem in a random environment that arises in the management of patient flow in the hospital systems, due to the stochastic nature of the arrivals as well as the Length of Stay distribution.

We develop a Markov Decision Process (MDP) which aims to assign the newly arrived patients in an optimal way so as to minimise the total expected long-run cost per unit time over an infinite horizon. We assume Poisson arrival rates that depend on patient types, and Length of Stay distributions that depend on whether patients stay in their primary wards or not.

Since the instances of realistic size of this problem are not easy to solve, we develop numerical methods based on Approximate Dynamic Programming. We illustrate the theory with numerical examples with parameters obtained by fitting to data from a tertiary referral hospital in Australia, and demonstrate the application potential of our methodology under practical considerations.

Keywords: Patient Assignment Scheduling problem, Poisson arrivals, Length of Stay distribution, Markov chains, Markov Decision Process, Approximate Dynamic Programming.

Mathematics Subject Classification: 60J80 – 60J22 – 92D25 – 65H10

Funding:  This research was supported by funding through the Australian Research Council Linkage Project LP140100152.

1 Introduction

In this paper we consider the Patient Assignment Scheduling (PAS) problem in which, at the start of each day, newly arrived patients are assigned to beds in different wards, considering their needs, priority, and available resources, in a way so as to optimise total expected daily cost over an infinite time horizon. We use the term ‘bed’ in the sense of the resource that consists of the staff (nurses and clinicians) available to attend to a patient in a physical bed, rather than the physical bed itself.

We assume that that patients are assigned to the beds at the start of time period d=0,1,2,𝑑012d=0,1,2,\ldotsitalic_d = 0 , 1 , 2 , …, see Figure 1. The allocation involves a group of patients, waiting to be admitted to a suitable bed. Upon ceompleting their treatments, patients are discharged. The duration of time between arrival and discharge is a random variable referred to as the Length of Stay (LoS). We assume an infinite horizon problem and develop a Markov Decision Process (MDP) to solve this stochastic problem so as to minimise the total expected long-run cost per unit time.

Here, without loss of generality we assume that the index d=0,1,2,𝑑012d=0,1,2,\ldotsitalic_d = 0 , 1 , 2 , … corresponds to day d𝑑ditalic_d, but note that this could denote some other time interval of interest such as an 8888-hour time block.

s𝑠sitalic_ss(a)superscript𝑠𝑎s^{(a)}italic_s start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPTssuperscript𝑠s^{\prime}italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT\vdots\ldotsd𝑑ditalic_dd+1𝑑1d+1italic_d + 1\ldots
Figure 1: Decision epochs of the PAS problem. At the start of time period d𝑑ditalic_d we observe some state s𝑠sitalic_s and then make some decision a𝑎aitalic_a about how to allocate/transfer patients. This transforms the system into a post decision state s(a)superscript𝑠𝑎s^{(a)}italic_s start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT and then the system evolves in a stochastic manner, until new state ssuperscript𝑠s^{\prime}italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is observed at the start of the next time period d+1𝑑1d+1italic_d + 1.

Patient assignment is a key process in the management of hospitals, whereby bed managers allocate inbound patients to appropriate wards, rooms and beds. These decisions are made by teams of ward and bed managers at intermittent times throughout the day, using their experience and understanding of the natural dynamics of the hospital. From 2018 to 2019, the Australian hospital system recorded 11.5 millions hospitalisations, and 8.4 million emergency department presentations, resulting in 365,000 emergency admissions [2]. This corresponded to 31,500 patient assignment decisions made daily across Australia, with 1000 of these being unplanned [2].

These decisions are not trivial, since complications can arise due to variations in hospital policy and the highly stochastic and complex nature of healthcare. Potential patients can arrive at any time of day, with a variety of conditions and specific needs. Hospital policy may affect where patients can be assigned, on the basis of a patient’s gender or to comply with prescribed healthcare standards. Managers may also evaluate whether it is best to transfer patients between wards. Because of these complicating factors it can be unclear if the assignments made by bed managers are optimal.

The importance of maintaining good patient flow cannot be understated. Carter, Puch and Larson in their literature review of emergency department (ED) crowding found that ED overcrowding has a significant positive correlation with patient mortality and with patients leaving the hospital untreated [8]. Morley et al. in their literature review also found an increase in patient mortality, as well as a higher exposure to error, poorer patient outcomes and increased patient length of stay, both in the ED, and in the ward to which a patient is eventually assigned [20]. They also noted that the inability to quickly assign patients from the ED to an inpatient bed, was a significant contributor to overcrowding.

Regarding overcrowding in non-emergency wards, there is a long standing debate on what level of hospital occupancy is too high. Bagust et al. [4] found in their simulation-based approach that hospital occupancy above 85%percent8585\%85 % created discernible risk, while occupancy above 90%percent9090\%90 % caused regular bed crises. This figure has since been adopted by many as a standard benchmark for hospital occupancy [9]. However, multiple other authors [5, 9] have emphasized that adopting 85%percent8585\%85 % as the figure at which hospitals operate most efficiently, misinterprets Bagust et al. [4], as this figure only applies to the specific example in [4]. Further, Bain et al. [5] state that using any single figure as a target for average occupancy is overly simplistic. Cummings et al. [9] note that while the results of [4] have been misinterpreted, there is a positive correlation between hospital occupancy and emergency department delays.

Solutions to these problems require improvements in the management of patient flow. Howell et al. [17] found that by assigning additional resources to the tasks of bed management and patient assignment, ED length of stay was reduced by an average of 98 minutes for admitted patients. Clearly, tangible benefits are made to patients if they are assigned correctly.

We call the mathematical formulation of patient assignment, the Patient Assignment Scheduling (PAS) problem, where we attempt to find optimal assignments of all patients as they arrive to the hospital. This problem was described by Demeester et al in [11], Bilgin et al. in [6], and Vancroonenburg et al. in [26].

Demeester et al. [11] define a useful approach of hard and soft constraints. Patient assignments that violate hard constraints are infeasible, while assignments that violate soft constraints incur some cost, which contribute to the objective function. An example of a hard constraint is that two patients cannot be assigned to the same bed, while an example of a soft constraint is that a patient should be assigned to the ward that corresponds to the their condition. In their model, the variable for patient length of stay (LoS) used is deterministic, and so too is the cost of assigning a patient. This approach ignores the inherent stochasticity of a hospital system, but provides a useful framework for building upon.

Bilgin et al. [6] and Vancroonenburg et al. in [26] use a very similar approach, defining a set of constraints to form an objective function, but each adding their own novel idea. Bilgin et al. [6] combine the PAS problem with a nurse scheduling problem, while Vancroonenburg et al. in [26] introduce parameters to study the effect of uncertainty and so their model is no longer fully deterministic. However, to truly capture the stochasticity of the system, consideration must be given to how the system evolves in time. The model must incorporate current assignment decisions having an impact on future assignments.

Using a similar integer programming approach to [6, 11, 26], Abera et al. [3] demonstrate how the PAS problem can be solved in a stochastic scenario. They consider first that a patient’s LoS is a random variable that may take any value on some distribution, and is unknown at the time of decision making. They then consider that patient arrivals to the system are also random, meaning that the cost of making any given assignment is stochastic. That is, the cost of an assignment depends on how long a patient stays, and what types of patients will arrive next. To evaluate the assignment cost, and hence the optimal assignment, Abera et al. [3] simulate the system several times to determine the expected cost over some planning horizon. The planning horizons used as examples in [3] are 14, 28 and 56 days.

One shared result that is found in [3, 6, 11, 26], is that the integer linear program of any non-trivial size problem, cannot be solved explicitly. That is, it was not achievable to find the exact optimal solution. Demeester et al. [11] found in their first example of a hospital with similar-to\scriptstyle\sim120120120120 beds, an optimal solution was not found after a whole week of computation, which is an unacceptable amount of time for assigning patients. To overcome this, various heuristics were required in all of the above work. The authors of the papers listed above all begin by using a neighbourhood search technique. That is, starting with some initial solution, and searching ‘neighbouring’ solutions to see if there is an improvement. The exact methodology of how the neighbourhood is defined and searched, differs between authors. Demeester et al. [11] use a Tabu search, Abera et al. [3] use simulated annealing, while Bilgin et al. [6] use a hyper-heuristic approach (heuristics for selecting heuristics). Regardless of the approach, it is clear that heuristics are a necessity for solving problems of any realistic size.

While the formulation of the PAS problem as an integer program is well researched, alternative formulations exist. Hulshof et al. [18] and Dai and Shi [10] build models related to the PAS problem, based on Markovian Decision Processes (MDP). Such an alternative formulation allows for an exploration of possible decisions that a hospital manager can make. Hulshof et al. [18] explore resource assignment in hospitals and decision making on how many patients should be assigned to different queues. Dai and Shi [10] explore the use of patient overflow (allowing wards to ‘overflow’ into one another when full) and take a ‘higher-level’ approach to decision making. Rather than making assignment decisions for each patient, decisions are made on whether to allow patient overflow for each time period.

Similarly to the integer program approaches, the exact solutions of these MDPs cannot be obtained for large-size problems. Approximate Dynamic Programming (ADP) approaches are applied in [10, 18] to overcome what is called the curse of dimensionality. The ADP approach, described by Powell in [23], allows the cost in each state to be approximated by a set of features, which record some partial information about states, and corresponding weights. In addition to ADP, Dai and Shi [10] utilize a least-squares temporal difference (LSTD) learning algorithm to solve the problem.

We now discuss the work of Heydar et al. [15], which is the basis of the model proposed here. Heydar et al. [15] models the PAS as a continuous-time MDP, where the system is observed at every individual patient arrival or departure. The goal of the model is to identify the best possible patient assignment given information about the arriving patient and the occupancy of the hospital. Heterogeneity is included in the model by defining different patient classes and wards, and assuming that each patient class has an ideal ward, while other wards are unsuitable. In addition to assigning patients directly, Heydar et al. [15] also consider transfers of patients between wards to best fit patients, so as to minimise their appropriately defined objective function.

However, Heydar et al. [15] note that transfers are positively correlated with patient mortality and LoS, as shown in [14], meaning that transfers are given careful consideration and an additional cost in [15]. We adopt this feature as a key component in our model.

Similarly to Abera et al. [3], Heydar et al. [15] evaluate total expected cost over a finite horizon, meaning limited consideration is given to the state of the system in the long run. In order to address this gap, we generalise the model used in Heydar et al. [15] and extend it to an infinite horizon problem. In order to address the curse of dimensionality, we propose methodology based on ADP and the approximate policy iteration algorithm used by Dai and Shi in [10].

Furthermore, we assume that patient assignments are made at discrete time points such as at the start of each day. This is arguably a more realistic approach involving a group of patients to be assigned, which also means that assignment decisions are far more complex, as multiple assignments must be made, and an order of assignments must be decided.

Finally, we demonstrate that the algorithm used by Dai and Shi in [10] can be adapted to our model to find near-optimal solutions to the PAS problem.

The rest of this paper is structured as follows. In Section 2 we describe the key components of our model, such as state space, transition probabilities, decision variables, constraints, and cost variables. In Section 3 we give the details of the Approximate Dynamic Programming approach, and illustrate the application of the theory through our numerical examples in Section 4. This is followed by concluding remarks in Section 5.

2 Markov Decision Process

Let ={1,2,,I}12𝐼\mathcal{I}=\{1,2,\dots,I\}caligraphic_I = { 1 , 2 , … , italic_I } be the set of all patient types, where type i𝑖i\in\mathcal{I}italic_i ∈ caligraphic_I may correspond to the medical needs of the patients, their age, gender, and other aspects of inclusive care, see e.g. [13]. We assume that type-i𝑖iitalic_i patients arrive at a rate λisubscript𝜆𝑖\lambda_{i}italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT per day on day d𝑑ditalic_d. Further, we assume that there are K𝐾Kitalic_K wards in the hospital, each with capacity mksubscript𝑚𝑘m_{k}italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, k𝒦={1,,K}𝑘𝒦1𝐾k\in\mathcal{K}=\{1,\ldots,K\}italic_k ∈ caligraphic_K = { 1 , … , italic_K }. The waiting room labelled (K+1)𝐾1(K+1)( italic_K + 1 ) has capacity mK+1subscript𝑚𝐾1m_{K+1}italic_m start_POSTSUBSCRIPT italic_K + 1 end_POSTSUBSCRIPT.

The set of wards that are suitable for patients type i𝑖iitalic_i is denoted 𝒦(i)𝒦𝑖\mathcal{K}(i)caligraphic_K ( italic_i ), for some 𝒦(i)𝒦𝒦𝑖𝒦\mathcal{K}(i)\subset\mathcal{K}caligraphic_K ( italic_i ) ⊂ caligraphic_K. We denote by w(1,i)𝒦(i)𝑤1𝑖𝒦𝑖w(1,i)\in\mathcal{K}(i)italic_w ( 1 , italic_i ) ∈ caligraphic_K ( italic_i ) the best ward for type-i𝑖iitalic_i patient, by w(2,i)𝒦(i)𝑤2𝑖𝒦𝑖w(2,i)\in\mathcal{K}(i)italic_w ( 2 , italic_i ) ∈ caligraphic_K ( italic_i ) the second-best ward for type-i𝑖iitalic_i patient, and so on, and let 𝒲(i)=(w(1,i),,w(K,i))𝒲𝑖𝑤1𝑖𝑤𝐾𝑖\mathcal{W}(i)=(w(1,i),\ldots,w(K,i))caligraphic_W ( italic_i ) = ( italic_w ( 1 , italic_i ) , … , italic_w ( italic_K , italic_i ) ) be the ordered sequence of wards in 𝒦(i)𝒦𝑖\mathcal{K}(i)caligraphic_K ( italic_i ) corresponding to type-i𝑖iitalic_i patients. For example, if 𝒦(i)={1,2,3}𝒦={1,2,3,4,5}𝒦𝑖123𝒦12345\mathcal{K}(i)=\{1,2,3\}\subset\mathcal{K}=\{1,2,3,4,5\}caligraphic_K ( italic_i ) = { 1 , 2 , 3 } ⊂ caligraphic_K = { 1 , 2 , 3 , 4 , 5 } and 𝒲(i)=(3,1,2)𝒲𝑖312\mathcal{W}(i)=(3,1,2)caligraphic_W ( italic_i ) = ( 3 , 1 , 2 ), then this means that ward 3333 is the best ward for type-i𝑖iitalic_i patient, ward 1111 is the second best, ward 2222 is the third best, and wards 4444 and 5555 are not suitable for type-i𝑖iitalic_i patients.

In Section 2.1 below, we contruct a model based on a suitable Markov Decision Process to find the optimal policy 𝝅superscript𝝅\mbox{\boldmath$\pi$}^{*}bold_italic_π start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT from the set of all policies 𝝅=(a𝝅(s))s𝒮𝝅subscriptsuperscript𝑎𝝅𝑠𝑠𝒮\mbox{\boldmath$\pi$}=(a^{\mbox{\boldmath$\pi$}}(s))_{s\in\mathcal{S}}bold_italic_π = ( italic_a start_POSTSUPERSCRIPT bold_italic_π end_POSTSUPERSCRIPT ( italic_s ) ) start_POSTSUBSCRIPT italic_s ∈ caligraphic_S end_POSTSUBSCRIPT consisting of decisions a𝝅(s)superscript𝑎𝝅𝑠a^{\mbox{\boldmath$\pi$}}(s)italic_a start_POSTSUPERSCRIPT bold_italic_π end_POSTSUPERSCRIPT ( italic_s ) taken whenever state s𝑠sitalic_s in state space 𝒮𝒮\mathcal{S}caligraphic_S is observed, so as to minimise the long-run expected cost per unit time,

E=min𝝅E𝝅=min𝝅limD(1D𝔼(d=0D1C(Sd,a𝝅(Sd)))),𝝅=argmin𝝅E𝝅,formulae-sequencesuperscript𝐸subscript𝝅superscript𝐸𝝅subscript𝝅subscript𝐷1𝐷𝔼superscriptsubscript𝑑0𝐷1𝐶subscript𝑆𝑑superscript𝑎𝝅subscript𝑆𝑑superscript𝝅subscript𝝅superscript𝐸𝝅\displaystyle E^{*}=\min_{\mbox{\boldmath$\pi$}}E^{\mbox{\boldmath$\pi$}}=\min% _{\mbox{\boldmath$\pi$}}\lim_{D\to\infty}\left(\frac{1}{D}\ \mathbb{E}\left(% \sum_{d=0}^{D-1}C(S_{d},a^{\mbox{\boldmath$\pi$}}(S_{d}))\right)\right),\ % \mbox{\boldmath$\pi$}^{*}=\arg\min_{\mbox{\boldmath$\pi$}}E^{\mbox{\boldmath$% \pi$}},italic_E start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_min start_POSTSUBSCRIPT bold_italic_π end_POSTSUBSCRIPT italic_E start_POSTSUPERSCRIPT bold_italic_π end_POSTSUPERSCRIPT = roman_min start_POSTSUBSCRIPT bold_italic_π end_POSTSUBSCRIPT roman_lim start_POSTSUBSCRIPT italic_D → ∞ end_POSTSUBSCRIPT ( divide start_ARG 1 end_ARG start_ARG italic_D end_ARG blackboard_E ( ∑ start_POSTSUBSCRIPT italic_d = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_D - 1 end_POSTSUPERSCRIPT italic_C ( italic_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT bold_italic_π end_POSTSUPERSCRIPT ( italic_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) ) ) ) , bold_italic_π start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_arg roman_min start_POSTSUBSCRIPT bold_italic_π end_POSTSUBSCRIPT italic_E start_POSTSUPERSCRIPT bold_italic_π end_POSTSUPERSCRIPT , (1)

where Sd𝒮subscript𝑆𝑑𝒮S_{d}\in\mathcal{S}italic_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ∈ caligraphic_S is a state of the system observed at the start of day d𝑑ditalic_d, and C(Sd,a𝝅(s))𝐶subscript𝑆𝑑superscript𝑎𝝅𝑠C(S_{d},a^{\mbox{\boldmath$\pi$}}(s))italic_C ( italic_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT , italic_a start_POSTSUPERSCRIPT bold_italic_π end_POSTSUPERSCRIPT ( italic_s ) ) is the cost of a decision then taken assuming policy 𝝅𝝅\pibold_italic_π is in place.

In practice, we find 𝝅superscript𝝅\mbox{\boldmath$\pi$}^{*}bold_italic_π start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT by solving the Bellman’s optimality equation for all states s𝒮𝑠𝒮s\in\mathcal{S}italic_s ∈ caligraphic_S,

E+V(s)superscript𝐸𝑉𝑠\displaystyle E^{*}+V(s)italic_E start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + italic_V ( italic_s ) =\displaystyle== mina𝒜(s){C(s,a)+𝔼(V(s)|(s,a))}=mina𝒜(s){C(s,a)+s𝒮(s|(s,a))V(s)},subscript𝑎𝒜𝑠𝐶𝑠𝑎𝔼conditional𝑉superscript𝑠𝑠𝑎subscript𝑎𝒜𝑠𝐶𝑠𝑎subscriptsuperscript𝑠𝒮conditionalsuperscript𝑠𝑠𝑎𝑉superscript𝑠\displaystyle\min_{a\in\mathcal{A}(s)}\Bigg{\{}C(s,a)+\mathbb{E}\left(V(s^{% \prime})\ |\ (s,a)\right)\Bigg{\}}=\min_{a\in\mathcal{A}(s)}\Bigg{\{}C(s,a)+% \sum_{s^{\prime}\in\mathcal{S}}\mathbb{P}\left(s^{\prime}\ |\ (s,a)\right)V(s^% {\prime})\Bigg{\}},roman_min start_POSTSUBSCRIPT italic_a ∈ caligraphic_A ( italic_s ) end_POSTSUBSCRIPT { italic_C ( italic_s , italic_a ) + blackboard_E ( italic_V ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) | ( italic_s , italic_a ) ) } = roman_min start_POSTSUBSCRIPT italic_a ∈ caligraphic_A ( italic_s ) end_POSTSUBSCRIPT { italic_C ( italic_s , italic_a ) + ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_S end_POSTSUBSCRIPT blackboard_P ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | ( italic_s , italic_a ) ) italic_V ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) } ,

where V(s)𝑉𝑠V(s)italic_V ( italic_s ) is the minimum long-run average cost given current state s𝑠sitalic_s, and 𝒜(s)𝒜𝑠\mathcal{A}(s)caligraphic_A ( italic_s ) is the set of all decisions that are possible when state s𝑠sitalic_s is observed. We use Approximate Dynamic Programming methods [21, 22, 24] to solve Equation (2) in real-sized problems, in which the number of states and decisions are intractable, as noted in [10, 12, 15, 18].

2.1 Model

Consider a discrete-time Markov chain {Sd:d=0,1,}conditional-setsubscript𝑆𝑑𝑑01\{S_{d}:d=0,1,\ldots\}{ italic_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT : italic_d = 0 , 1 , … }, where Sd=([Nk,i]𝒦×,[Qi]1×)𝒮subscript𝑆𝑑subscriptdelimited-[]subscript𝑁𝑘𝑖𝒦subscriptdelimited-[]subscript𝑄𝑖1𝒮S_{d}=\left([N_{k,i}]_{\mathcal{K}\times\mathcal{I}},[Q_{i}]_{1\times\mathcal{% I}}\right)\in\mathcal{S}italic_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT = ( [ italic_N start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT , [ italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 × caligraphic_I end_POSTSUBSCRIPT ) ∈ caligraphic_S is the state of the system at the start of day d=0,1,2,𝑑012d=0,1,2,\ldotsitalic_d = 0 , 1 , 2 , … recording Nk,isubscript𝑁𝑘𝑖N_{k,i}italic_N start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT, the number of type-i𝑖iitalic_i patients in ward k𝑘kitalic_k, and Qisubscript𝑄𝑖Q_{i}italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, the number of newly arrived type-i𝑖iitalic_i patients who are yet to be assigned to the wards. The state space 𝒮𝒮\mathcal{S}caligraphic_S of the process is given by,

𝒮𝒮\displaystyle\mathcal{S}caligraphic_S =\displaystyle== {([nk,i]𝒦×,[qi]1×):nk,i0,qi0,i=1Ink,imk,i=1IqimK+1},conditional-setsubscriptdelimited-[]subscript𝑛𝑘𝑖𝒦subscriptdelimited-[]subscript𝑞𝑖1formulae-sequencesubscript𝑛𝑘𝑖0formulae-sequencesubscript𝑞𝑖0formulae-sequencesuperscriptsubscript𝑖1𝐼subscript𝑛𝑘𝑖subscript𝑚𝑘superscriptsubscript𝑖1𝐼subscript𝑞𝑖subscript𝑚𝐾1\displaystyle\{\left([n_{k,i}]_{\mathcal{K}\times\mathcal{I}},[q_{i}]_{1\times% \mathcal{I}}\right):n_{k,i}\geq 0,q_{i}\geq 0,\sum_{i=1}^{I}n_{k,i}\leq m_{k},% \sum_{i=1}^{I}q_{i}\leq m_{K+1}\},{ ( [ italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT , [ italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 × caligraphic_I end_POSTSUBSCRIPT ) : italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ≥ 0 , italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ 0 , ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ≤ italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_m start_POSTSUBSCRIPT italic_K + 1 end_POSTSUBSCRIPT } ,

with possibly additional constraints on the total number of accepted arrivals, as discussed below.

2.2 Decisions

Suppose that we observe state s=([nk,i]𝒦×,[qi]1×)𝒮𝑠subscriptdelimited-[]subscript𝑛𝑘𝑖𝒦subscriptdelimited-[]subscript𝑞𝑖1𝒮s=\left([n_{k,i}]_{\mathcal{K}\times\mathcal{I}},[q_{i}]_{1\times\mathcal{I}}% \right)\in\mathcal{S}italic_s = ( [ italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT , [ italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 × caligraphic_I end_POSTSUBSCRIPT ) ∈ caligraphic_S at the start of a day and choose a suitable decision a𝒜(s)𝑎𝒜𝑠a\in\mathcal{A}(s)italic_a ∈ caligraphic_A ( italic_s ) from the set of available decisions 𝒜(s)𝒜𝑠\mathcal{A}(s)caligraphic_A ( italic_s ), such that a=(xk,i,yk,l,i)i;k,=1,K𝑎subscriptsubscript𝑥𝑘𝑖subscript𝑦𝑘𝑙𝑖formulae-sequence𝑖𝑘1𝐾a=(x_{k,i},y_{k,l,i})_{i\in\mathcal{I};k,\ell=1,\ldots K}italic_a = ( italic_x start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_k , italic_l , italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ caligraphic_I ; italic_k , roman_ℓ = 1 , … italic_K end_POSTSUBSCRIPT, where

  • xk,isubscript𝑥𝑘𝑖x_{k,i}italic_x start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT is the number of type-i𝑖iitalic_i newly arrived patients to assign to ward k𝑘kitalic_k, and,

  • yk,,isubscript𝑦𝑘𝑖y_{k,\ell,i}italic_y start_POSTSUBSCRIPT italic_k , roman_ℓ , italic_i end_POSTSUBSCRIPT is the number of type-i𝑖iitalic_i patients to be transferred from ward k𝑘kitalic_k to ward \ellroman_ℓ.

Each decision should satisfy the following sets of constraints. First, all newly arrived type-i𝑖iitalic_i patients have to be assigned to some ward, and so

k=1Kxk,i=qi,i.formulae-sequencesuperscriptsubscript𝑘1𝐾subscript𝑥𝑘𝑖subscript𝑞𝑖for-all𝑖\displaystyle\sum_{k=1}^{K}x_{k,i}=q_{i},\ \forall i\in\mathcal{I}.∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , ∀ italic_i ∈ caligraphic_I . (3)

Next, the total number of patients in ward k𝑘kitalic_k, denoted nk,,tsubscript𝑛𝑘𝑡n_{k,\bullet,t}italic_n start_POSTSUBSCRIPT italic_k , ∙ , italic_t end_POSTSUBSCRIPT, and given by,

nk,subscript𝑛𝑘\displaystyle n_{k,\bullet}italic_n start_POSTSUBSCRIPT italic_k , ∙ end_POSTSUBSCRIPT =\displaystyle== i=1Ink,i,superscriptsubscript𝑖1𝐼subscript𝑛𝑘𝑖\displaystyle\sum_{i=1}^{I}n_{k,i},∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT , (4)
nk,isubscript𝑛𝑘𝑖\displaystyle n_{k,i}italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT =\displaystyle== nk,i+xk,i+=1kK(y,k,iyk,,i),subscript𝑛𝑘𝑖subscript𝑥𝑘𝑖superscriptsubscript1𝑘𝐾subscript𝑦𝑘𝑖subscript𝑦𝑘𝑖\displaystyle n_{k,i}+x_{k,i}+\sum_{\begin{subarray}{c}\ell=1\\ \ell\neq k\end{subarray}}^{K}\big{(}y_{\ell,k,i}-y_{k,\ell,i}\big{)},italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT + italic_x start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL roman_ℓ = 1 end_CELL end_ROW start_ROW start_CELL roman_ℓ ≠ italic_k end_CELL end_ROW end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT roman_ℓ , italic_k , italic_i end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_k , roman_ℓ , italic_i end_POSTSUBSCRIPT ) , (5)

where nk,isubscript𝑛𝑘𝑖n_{k,i}italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT is the total number of type-i𝑖iitalic_i patients in ward k𝑘kitalic_k, cannot exceed the capacity of the ward, and so,

nk,mk,k𝒦.formulae-sequencesubscript𝑛𝑘subscript𝑚𝑘for-all𝑘𝒦\displaystyle n_{k,\bullet}\leq m_{k},\ \forall k\in\mathcal{K}.italic_n start_POSTSUBSCRIPT italic_k , ∙ end_POSTSUBSCRIPT ≤ italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , ∀ italic_k ∈ caligraphic_K . (6)

Further, we may only transfer the available patients, which gives,

=1kKyk,,ink,i,k𝒦,i,formulae-sequencesuperscriptsubscript1𝑘𝐾subscript𝑦𝑘𝑖subscript𝑛𝑘𝑖formulae-sequencefor-all𝑘𝒦for-all𝑖\displaystyle\sum_{\begin{subarray}{c}\ell=1\\ \ell\neq k\end{subarray}}^{K}y_{k,\ell,i}\leq n_{k,i},\ \forall k\in\mathcal{K% },\ \forall i\in\mathcal{I},∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL roman_ℓ = 1 end_CELL end_ROW start_ROW start_CELL roman_ℓ ≠ italic_k end_CELL end_ROW end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_k , roman_ℓ , italic_i end_POSTSUBSCRIPT ≤ italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT , ∀ italic_k ∈ caligraphic_K , ∀ italic_i ∈ caligraphic_I , (7)

and if a transfer occurs, it should be in one direction, that is,

I{yk,,i0}+I{y,k,i0}𝐼subscript𝑦𝑘𝑖0𝐼subscript𝑦𝑘𝑖0\displaystyle I\{y_{k,\ell,i}\not=0\}+I\{y_{\ell,k,i}\not=0\}italic_I { italic_y start_POSTSUBSCRIPT italic_k , roman_ℓ , italic_i end_POSTSUBSCRIPT ≠ 0 } + italic_I { italic_y start_POSTSUBSCRIPT roman_ℓ , italic_k , italic_i end_POSTSUBSCRIPT ≠ 0 } \displaystyle\leq 1,k,𝒦,i,formulae-sequence1for-all𝑘𝒦for-all𝑖\displaystyle 1,\ \forall k,\ell\in\mathcal{K},\ \forall i\in\mathcal{I},1 , ∀ italic_k , roman_ℓ ∈ caligraphic_K , ∀ italic_i ∈ caligraphic_I , (8)

where I{}𝐼I\{\cdot\}italic_I { ⋅ } is an indicator function taking value 1111 if the statement in the brackets is true, and 00 otherwise, which ensures that only one of yk,,isubscript𝑦𝑘𝑖y_{k,\ell,i}italic_y start_POSTSUBSCRIPT italic_k , roman_ℓ , italic_i end_POSTSUBSCRIPT and y,k,isubscript𝑦𝑘𝑖y_{\ell,k,i}italic_y start_POSTSUBSCRIPT roman_ℓ , italic_k , italic_i end_POSTSUBSCRIPT takes a nonzero value.

2.3 Transition probabilities

Now, given decision a𝑎aitalic_a and current state s𝒮𝑠𝒮s\in\mathcal{S}italic_s ∈ caligraphic_S, we define the following key random variables:

  • Zk,isubscript𝑍𝑘𝑖Z_{k,i}italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT, the number of type-i𝑖iitalic_i patients who depart from ward k𝑘kitalic_k during one day;

  • Z=k=1Ki=1IZk,i𝑍superscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼subscript𝑍𝑘𝑖Z=\sum_{k=1}^{K}\sum_{i=1}^{I}Z_{k,i}italic_Z = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT, the total number of departures;

  • B=k=1KmkZ𝐵superscriptsubscript𝑘1𝐾subscript𝑚𝑘𝑍B=\sum_{k=1}^{K}m_{k}-Zitalic_B = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_Z , the total number of available beds after departures;

  • Q=i=1IQi𝑄superscriptsubscript𝑖1𝐼subscript𝑄𝑖Q=\sum_{i=1}^{I}Q_{i}italic_Q = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, the total number of arrivals;

which we use below to determine the transition probabilities of the process {Sd:d=0,1,}conditional-setsubscript𝑆𝑑𝑑01\{S_{d}:d=0,1,\ldots\}{ italic_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT : italic_d = 0 , 1 , … }. The distribution of these variables depends on (s,a)𝑠𝑎(s,a)( italic_s , italic_a ), as we discuss below.

Assuming state s=([nk,i]𝒦×,[qi]1×)𝑠subscriptdelimited-[]subscript𝑛𝑘𝑖𝒦subscriptdelimited-[]subscript𝑞𝑖1s=\left([n_{k,i}]_{\mathcal{K}\times\mathcal{I}},[q_{i}]_{1\times\mathcal{I}}\right)italic_s = ( [ italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT , [ italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 × caligraphic_I end_POSTSUBSCRIPT ) and decision a=(xk,i,yk,l,i)i;k=1,K𝑎subscriptsubscript𝑥𝑘𝑖subscript𝑦𝑘𝑙𝑖formulae-sequence𝑖𝑘1𝐾a=(x_{k,i},y_{k,l,i})_{i\in\mathcal{I};k\ell=1,\ldots K}italic_a = ( italic_x start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_k , italic_l , italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ caligraphic_I ; italic_k roman_ℓ = 1 , … italic_K end_POSTSUBSCRIPT, the resulting post-decision state is s(a)=([nk,i(a)]𝒦×)([nk,i(a)]𝒦×,[0]1×)superscript𝑠𝑎subscriptdelimited-[]subscriptsuperscript𝑛𝑎𝑘𝑖𝒦subscriptdelimited-[]subscriptsuperscript𝑛𝑎𝑘𝑖𝒦subscriptdelimited-[]01s^{(a)}=\left([n^{(a)}_{k,i}]_{\mathcal{K}\times\mathcal{I}}\right)\equiv\left% ([n^{(a)}_{k,i}]_{\mathcal{K}\times\mathcal{I}},[0]_{1\times\mathcal{I}}\right)italic_s start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT = ( [ italic_n start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT ) ≡ ( [ italic_n start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT , [ 0 ] start_POSTSUBSCRIPT 1 × caligraphic_I end_POSTSUBSCRIPT ) such that,

nk,i(a)=nk,i+xk,i+=1kK(y,k,iyk,,i),k𝒦,i.formulae-sequencesubscriptsuperscript𝑛𝑎𝑘𝑖subscript𝑛𝑘𝑖subscript𝑥𝑘𝑖superscriptsubscript1𝑘𝐾subscript𝑦𝑘𝑖subscript𝑦𝑘𝑖formulae-sequencefor-all𝑘𝒦for-all𝑖n^{(a)}_{k,i}=n_{k,i}+x_{k,i}+\sum_{\begin{subarray}{c}\ell=1\\ \ell\neq k\end{subarray}}^{K}\big{(}y_{\ell,k,i}-y_{k,\ell,i}\big{)},\ \forall k% \in\mathcal{K},\ \forall i\in\mathcal{I}.italic_n start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT + italic_x start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL roman_ℓ = 1 end_CELL end_ROW start_ROW start_CELL roman_ℓ ≠ italic_k end_CELL end_ROW end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT roman_ℓ , italic_k , italic_i end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_k , roman_ℓ , italic_i end_POSTSUBSCRIPT ) , ∀ italic_k ∈ caligraphic_K , ∀ italic_i ∈ caligraphic_I . (9)

Next, the process transitions from state s(a)superscript𝑠𝑎s^{(a)}italic_s start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT on day d𝑑ditalic_d to some state s=([nk,i]𝒦×,[qi]1×)superscript𝑠subscriptdelimited-[]subscriptsuperscript𝑛𝑘𝑖𝒦subscriptdelimited-[]subscriptsuperscript𝑞𝑖1s^{\prime}=\left([n^{\prime}_{k,i}]_{\mathcal{K}\times\mathcal{I}},[q^{\prime}% _{i}]_{1\times\mathcal{I}}\right)italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = ( [ italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT , [ italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 × caligraphic_I end_POSTSUBSCRIPT ) on day (d+1)𝑑1(d+1)( italic_d + 1 ), for d=0,1,2,𝑑012d=0,1,2,\ldotsitalic_d = 0 , 1 , 2 , …, with probability given by

(s|(s,a))conditionalsuperscript𝑠𝑠𝑎\displaystyle\mathbb{P}\left(s^{\prime}\ |\ (s,a)\right)blackboard_P ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | ( italic_s , italic_a ) ) =\displaystyle== ((Zk,i=zk,i)k=1,,K;i=1,,I and (Qi=qi)i=1,,I)subscriptsubscript𝑍𝑘𝑖subscript𝑧𝑘𝑖formulae-sequence𝑘1𝐾𝑖1𝐼 and subscriptsubscript𝑄𝑖subscriptsuperscript𝑞𝑖𝑖1𝐼\displaystyle\mathbb{P}\left((Z_{k,i}=z_{k,i})_{k=1,\ldots,K;i=1,\ldots,I}% \mbox{ and }(Q_{i}=q^{\prime}_{i})_{i=1,\ldots,I}\right)blackboard_P ( ( italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = italic_z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k = 1 , … , italic_K ; italic_i = 1 , … , italic_I end_POSTSUBSCRIPT and ( italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i = 1 , … , italic_I end_POSTSUBSCRIPT ) (10)
=\displaystyle== ((Zk,i=zk,i)k=1,,K;i=1,,I)×((Qi=qi)i=1,,I|(Zk,i=zk,i)k=1,,K;i=1,,I)subscriptsubscript𝑍𝑘𝑖subscript𝑧𝑘𝑖formulae-sequence𝑘1𝐾𝑖1𝐼conditionalsubscriptsubscript𝑄𝑖subscriptsuperscript𝑞𝑖𝑖1𝐼subscriptsubscript𝑍𝑘𝑖subscript𝑧𝑘𝑖formulae-sequence𝑘1𝐾𝑖1𝐼\displaystyle\mathbb{P}\left((Z_{k,i}=z_{k,i})_{k=1,\ldots,K;i=1,\ldots,I}% \right)\times\mathbb{P}\left((Q_{i}=q^{\prime}_{i})_{i=1,\ldots,I}\ |\ (Z_{k,i% }=z_{k,i})_{k=1,\ldots,K;i=1,\ldots,I}\right)blackboard_P ( ( italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = italic_z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k = 1 , … , italic_K ; italic_i = 1 , … , italic_I end_POSTSUBSCRIPT ) × blackboard_P ( ( italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i = 1 , … , italic_I end_POSTSUBSCRIPT | ( italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = italic_z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k = 1 , … , italic_K ; italic_i = 1 , … , italic_I end_POSTSUBSCRIPT )
=\displaystyle== (k=1Ki=1I(Zk,i=zk,i))×((Qi=qi)i=1,,I|(Zk,i=zk,i)k=1,,K;i=1,,I)superscriptsubscriptproduct𝑘1𝐾superscriptsubscriptproduct𝑖1𝐼subscript𝑍𝑘𝑖subscript𝑧𝑘𝑖conditionalsubscriptsubscript𝑄𝑖subscriptsuperscript𝑞𝑖𝑖1𝐼subscriptsubscript𝑍𝑘𝑖subscript𝑧𝑘𝑖formulae-sequence𝑘1𝐾𝑖1𝐼\displaystyle\left(\prod_{k=1}^{K}\prod_{i=1}^{I}{\mathbb{P}(Z_{k,i}=z_{k,i})}% \right)\times\mathbb{P}\left((Q_{i}=q^{\prime}_{i})_{i=1,\ldots,I}\ |\ (Z_{k,i% }=z_{k,i})_{k=1,\ldots,K;i=1,\ldots,I}\right)( ∏ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT blackboard_P ( italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = italic_z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) ) × blackboard_P ( ( italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i = 1 , … , italic_I end_POSTSUBSCRIPT | ( italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = italic_z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k = 1 , … , italic_K ; italic_i = 1 , … , italic_I end_POSTSUBSCRIPT )

when the condition

nk,i=nk,i(a)zk,isubscriptsuperscript𝑛𝑘𝑖subscriptsuperscript𝑛𝑎𝑘𝑖subscript𝑧𝑘𝑖n^{\prime}_{k,i}=n^{(a)}_{k,i}-z_{k,i}italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = italic_n start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT - italic_z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT (11)

is met for all k𝒦𝑘𝒦k\in\mathcal{K}italic_k ∈ caligraphic_K, i𝑖i\in\mathcal{I}italic_i ∈ caligraphic_I; and (s|(s,a))=0conditionalsuperscript𝑠𝑠𝑎0\mathbb{P}\left(s^{\prime}\ |\ (s,a)\right)=0blackboard_P ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | ( italic_s , italic_a ) ) = 0 otherwise.

We apply conditional probabilities ((Qi=qi)i=1,,I|(Zk,i=zk,i)k=1,,K;i=1,,I)conditionalsubscriptsubscript𝑄𝑖subscriptsuperscript𝑞𝑖𝑖1𝐼subscriptsubscript𝑍𝑘𝑖subscript𝑧𝑘𝑖formulae-sequence𝑘1𝐾𝑖1𝐼\mathbb{P}\left((Q_{i}=q^{\prime}_{i})_{i=1,\ldots,I}\ |\ (Z_{k,i}=z_{k,i})_{k% =1,\ldots,K;i=1,\ldots,I}\right)blackboard_P ( ( italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i = 1 , … , italic_I end_POSTSUBSCRIPT | ( italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = italic_z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k = 1 , … , italic_K ; italic_i = 1 , … , italic_I end_POSTSUBSCRIPT ) in (10) since the number of accepted type-i𝑖iitalic_i arrivals may depend on the number of available beds after departures, and consider the following alternative modelling approaches:

  • Suppose that there is no restriction on the total number of arrivals, as in Dai and Shi [10]. Then Qisubscript𝑄𝑖Q_{i}italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT follows Poisson distribution, QiPoi(λi)similar-tosubscript𝑄𝑖𝑃𝑜𝑖subscript𝜆𝑖Q_{i}\sim Poi(\lambda_{i})italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∼ italic_P italic_o italic_i ( italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), and so

    ((Qi=qi)i=1,,I|(Zk,i=zk,i)k=1,,K;i=1,,I)conditionalsubscriptsubscript𝑄𝑖subscriptsuperscript𝑞𝑖𝑖1𝐼subscriptsubscript𝑍𝑘𝑖subscript𝑧𝑘𝑖formulae-sequence𝑘1𝐾𝑖1𝐼\displaystyle\mathbb{P}\left((Q_{i}=q^{\prime}_{i})_{i=1,\ldots,I}\ |\ (Z_{k,i% }=z_{k,i})_{k=1,\ldots,K;i=1,\ldots,I}\right)blackboard_P ( ( italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i = 1 , … , italic_I end_POSTSUBSCRIPT | ( italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = italic_z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k = 1 , … , italic_K ; italic_i = 1 , … , italic_I end_POSTSUBSCRIPT ) =\displaystyle== i=1I(Qi=qi)=i=1I(λi)qieλi(qi)!.superscriptsubscriptproduct𝑖1𝐼subscript𝑄𝑖subscript𝑞𝑖superscriptsubscriptproduct𝑖1𝐼superscriptsubscript𝜆𝑖subscript𝑞𝑖superscript𝑒subscript𝜆𝑖subscript𝑞𝑖\displaystyle\prod_{i=1}^{I}\mathbb{P}(Q_{i}=q_{i})=\prod_{i=1}^{I}\dfrac{(% \lambda_{i})^{q_{i}}e^{-\lambda_{i}}}{(q_{i})!}.∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT blackboard_P ( italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT divide start_ARG ( italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT - italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG ( italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ! end_ARG . (12)

    In this approach, arriving patients are not lost to the system, however the number of patients still waiting to be assigned and their total waiting times could be very large (and exceed many days). This may not be a realistic assumption since in practice the capacity of the system is limited (due to the availability of beds and staffing) and there are limits on the accepted maximum waiting times for various patient types (e.g. 24242424-hour limit for some types of emergency patients).

  • Suppose that the total number of arrivals Q𝑄Qitalic_Q may not exceed the total number of available beds. Then, with b=k=1Kmkk=1Ki=1Izk,i𝑏superscriptsubscript𝑘1𝐾subscript𝑚𝑘superscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼subscript𝑧𝑘𝑖b=\sum_{k=1}^{K}m_{k}-\sum_{k=1}^{K}\sum_{i=1}^{I}z_{k,i}italic_b = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT recording the total number of available beds after the departures, we have,

    (Q=q|B=b)𝑄conditional𝑞𝐵𝑏\displaystyle\mathbb{P}(Q=q\ |\ B=b)blackboard_P ( italic_Q = italic_q | italic_B = italic_b ) =\displaystyle== (λ)qeλ(q)!,q=0,1,2,b1,formulae-sequencesuperscript𝜆𝑞superscript𝑒𝜆𝑞𝑞012𝑏1\displaystyle\dfrac{(\lambda)^{q}e^{-\lambda}}{(q)!},\ q=0,1,2\ldots,b-1,divide start_ARG ( italic_λ ) start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT end_ARG start_ARG ( italic_q ) ! end_ARG , italic_q = 0 , 1 , 2 … , italic_b - 1 , (13)
    (Q=b|B=b)𝑄conditional𝑏𝐵𝑏\displaystyle\mathbb{P}(Q=b\ |\ B=b)blackboard_P ( italic_Q = italic_b | italic_B = italic_b ) =\displaystyle== 1q=0b1(λ)qeλ(q)!,1superscriptsubscript𝑞0𝑏1superscript𝜆𝑞superscript𝑒𝜆𝑞\displaystyle 1-\sum_{q=0}^{b-1}\dfrac{(\lambda)^{q}e^{-\lambda}}{(q)!},1 - ∑ start_POSTSUBSCRIPT italic_q = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b - 1 end_POSTSUPERSCRIPT divide start_ARG ( italic_λ ) start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT end_ARG start_ARG ( italic_q ) ! end_ARG , (14)

    and, with q=i=1Iqi𝑞superscriptsubscript𝑖1𝐼subscript𝑞𝑖q=\sum_{i=1}^{I}q_{i}italic_q = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT,

    ((Qi=qi)i=1,,I|(Zk,i=zk,i)k=1,,K;i=1,,I)conditionalsubscriptsubscript𝑄𝑖subscriptsuperscript𝑞𝑖𝑖1𝐼subscriptsubscript𝑍𝑘𝑖subscript𝑧𝑘𝑖formulae-sequence𝑘1𝐾𝑖1𝐼\displaystyle\mathbb{P}\left((Q_{i}=q^{\prime}_{i})_{i=1,\ldots,I}\ |\ (Z_{k,i% }=z_{k,i})_{k=1,\ldots,K;i=1,\ldots,I}\right)blackboard_P ( ( italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i = 1 , … , italic_I end_POSTSUBSCRIPT | ( italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = italic_z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k = 1 , … , italic_K ; italic_i = 1 , … , italic_I end_POSTSUBSCRIPT ) (15)
    =\displaystyle== ((Qi=qi)i=1,,I|B=b)conditionalsubscriptsubscript𝑄𝑖subscriptsuperscript𝑞𝑖𝑖1𝐼𝐵𝑏\displaystyle\mathbb{P}\left((Q_{i}=q^{\prime}_{i})_{i=1,\ldots,I}\ |\ B=b\right)blackboard_P ( ( italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i = 1 , … , italic_I end_POSTSUBSCRIPT | italic_B = italic_b )
    =\displaystyle== ((Qi=qi)i=1,,I|Q=q,B=b)×(Q=q|B=b)formulae-sequenceconditionalsubscriptsubscript𝑄𝑖subscriptsuperscript𝑞𝑖𝑖1𝐼𝑄𝑞𝐵𝑏𝑄conditional𝑞𝐵𝑏\displaystyle\mathbb{P}\left((Q_{i}=q^{\prime}_{i})_{i=1,\ldots,I}\ |\ Q=q,B=b% \right)\times\mathbb{P}(Q=q\ |\ B=b)blackboard_P ( ( italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i = 1 , … , italic_I end_POSTSUBSCRIPT | italic_Q = italic_q , italic_B = italic_b ) × blackboard_P ( italic_Q = italic_q | italic_B = italic_b )
    =\displaystyle== ((Qi=qi)i=1,,I|Q=q)×(Q=q|B=b)conditionalsubscriptsubscript𝑄𝑖subscriptsuperscript𝑞𝑖𝑖1𝐼𝑄𝑞𝑄conditional𝑞𝐵𝑏\displaystyle\mathbb{P}\left((Q_{i}=q^{\prime}_{i})_{i=1,\ldots,I}\ |\ Q=q% \right)\times\mathbb{P}(Q=q\ |\ B=b)blackboard_P ( ( italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i = 1 , … , italic_I end_POSTSUBSCRIPT | italic_Q = italic_q ) × blackboard_P ( italic_Q = italic_q | italic_B = italic_b )
    =\displaystyle== q!i=1Iqi!×i=1I(λiλ)qi×(Q=q|B=b),𝑞superscriptsubscriptproduct𝑖1𝐼subscript𝑞𝑖superscriptsubscriptproduct𝑖1𝐼superscriptsubscript𝜆𝑖𝜆subscript𝑞𝑖𝑄conditional𝑞𝐵𝑏\displaystyle\frac{q!}{\prod_{i=1}^{I}q_{i}!}\times\prod_{i=1}^{I}\left(\frac{% \lambda_{i}}{\lambda}\right)^{q_{i}}\times\mathbb{P}(Q=q\ |\ B=b),divide start_ARG italic_q ! end_ARG start_ARG ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ! end_ARG × ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT ( divide start_ARG italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_λ end_ARG ) start_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT × blackboard_P ( italic_Q = italic_q | italic_B = italic_b ) ,

    since the conditional distribution of (Qi=qi)i=1,,Isubscriptsubscript𝑄𝑖subscriptsuperscript𝑞𝑖𝑖1𝐼(Q_{i}=q^{\prime}_{i})_{i=1,\ldots,I}( italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i = 1 , … , italic_I end_POSTSUBSCRIPT given Q=q𝑄𝑞Q=qitalic_Q = italic_q is multinomial, where λi/λsubscript𝜆𝑖𝜆\lambda_{i}/\lambdaitalic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT / italic_λ is the probability that an arriving patient is of type i𝑖iitalic_i. In this approach, patients arriving to a system with no free beds, are redirected to other health systems. Such approach may be more realistic and can be used to determine the rate of patients that are redirected to help decide a suitable size of the hospital system.

To model random variables Zk,isubscript𝑍𝑘𝑖Z_{k,i}italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT, we assume that departures are independent of one another and so Zk,isubscript𝑍𝑘𝑖Z_{k,i}italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT follows Binomial distribution, Zk,iBin(nk,i,pk,i)similar-tosubscript𝑍𝑘𝑖𝐵𝑖𝑛subscript𝑛𝑘𝑖subscript𝑝𝑘𝑖Z_{k,i}\sim Bin(n_{k,i},p_{k,i})italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ∼ italic_B italic_i italic_n ( italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ), with

(Zk,i=z)subscript𝑍𝑘𝑖𝑧\displaystyle\mathbb{P}(Z_{k,i}=z)blackboard_P ( italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = italic_z ) =\displaystyle== (nk,iz)(pk,i)z(1pk,i)nk,iz,z=0,1,,nk,i,formulae-sequencebinomialsubscript𝑛𝑘𝑖𝑧superscriptsubscript𝑝𝑘𝑖𝑧superscript1subscript𝑝𝑘𝑖subscript𝑛𝑘𝑖𝑧𝑧01subscript𝑛𝑘𝑖\displaystyle\binom{n_{k,i}}{z}(p_{k,i})^{z}(1-p_{k,i})^{n_{k,i}-z},\ z=0,1,% \ldots,n_{k,i},( FRACOP start_ARG italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_z end_ARG ) ( italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_z end_POSTSUPERSCRIPT ( 1 - italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT - italic_z end_POSTSUPERSCRIPT , italic_z = 0 , 1 , … , italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT , (16)
pk,isubscript𝑝𝑘𝑖\displaystyle p_{k,i}italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT =\displaystyle== (RLOSk,i1),𝑅𝐿𝑂subscript𝑆𝑘𝑖1\displaystyle\mathbb{P}(RLOS_{k,i}\leq 1),blackboard_P ( italic_R italic_L italic_O italic_S start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ≤ 1 ) , (17)

where RLOSk,i𝑅𝐿𝑂subscript𝑆𝑘𝑖RLOS_{k,i}italic_R italic_L italic_O italic_S start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT is a random variable recording the remaining length of stay of type-i𝑖iitalic_i patient that is in ward k𝑘kitalic_k at the start of the day. That is, for each type-i𝑖iitalic_i patient in ward k𝑘kitalic_k, we perform a Bernoulli trial with probability of success pk,isubscript𝑝𝑘𝑖p_{k,i}italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT to determine if the patient leaves the system on the following day or not.

2.4 Costs

There is an immediate cost C(s,a)𝐶𝑠𝑎C(s,a)italic_C ( italic_s , italic_a ) associated with decision a𝑎aitalic_a given current state s𝒮𝑠𝒮s\in\mathcal{S}italic_s ∈ caligraphic_S, which may include costs of assignment, transfer, and patients being in nonprimary wards, defined as follows.

  • Assignment cost: k=1Ki=1Ixk,i×ck,i(σ)superscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼subscript𝑥𝑘𝑖subscriptsuperscript𝑐𝜎𝑘𝑖\sum_{k=1}^{K}\sum_{i=1}^{I}x_{k,i}\times c^{(\sigma)}_{k,i}∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT × italic_c start_POSTSUPERSCRIPT ( italic_σ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT.

  • Transfer cost: k=1K=1Ki=1Iyk,,i×ck,,i(t)superscriptsubscript𝑘1𝐾superscriptsubscript1𝐾superscriptsubscript𝑖1𝐼subscript𝑦𝑘𝑖subscriptsuperscript𝑐𝑡𝑘𝑖\sum_{k=1}^{K}\sum_{\ell=1}^{K}\sum_{i=1}^{I}y_{k,\ell,i}\times c^{(t)}_{k,% \ell,i}∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_k , roman_ℓ , italic_i end_POSTSUBSCRIPT × italic_c start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , roman_ℓ , italic_i end_POSTSUBSCRIPT.

  • Penalty cost for being in a nonprimary ward: k=1Ki=1Ink,i×ck,i(p)superscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼subscript𝑛𝑘𝑖subscriptsuperscript𝑐𝑝𝑘𝑖\sum_{k=1}^{K}\sum_{i=1}^{I}n_{k,i}\times c^{(p)}_{k,i}∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT × italic_c start_POSTSUPERSCRIPT ( italic_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT.

The total immediate cost of decision a=(xk,i,yk,l,i)i;k,=1,K𝑎subscriptsubscript𝑥𝑘𝑖subscript𝑦𝑘𝑙𝑖formulae-sequence𝑖𝑘1𝐾a=(x_{k,i},y_{k,l,i})_{i\in\mathcal{I};k,\ell=1,\ldots K}italic_a = ( italic_x start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_k , italic_l , italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ caligraphic_I ; italic_k , roman_ℓ = 1 , … italic_K end_POSTSUBSCRIPT given state s=([nk,i]𝒦×,[qi]1×)𝑠subscriptdelimited-[]subscript𝑛𝑘𝑖𝒦subscriptdelimited-[]subscript𝑞𝑖1s=\left([n_{k,i}]_{\mathcal{K}\times\mathcal{I}},[q_{i}]_{1\times\mathcal{I}}\right)italic_s = ( [ italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT , [ italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 × caligraphic_I end_POSTSUBSCRIPT ) is then,

C(s,a)𝐶𝑠𝑎\displaystyle C(s,a)italic_C ( italic_s , italic_a ) =\displaystyle== k=1Ki=1I{xk,i×ck,i(σ)+=1kKyk,,i×ck,,i(t)+nk,i(a)×ck,i(p)}superscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼subscript𝑥𝑘𝑖subscriptsuperscript𝑐𝜎𝑘𝑖superscriptsubscript1𝑘𝐾subscript𝑦𝑘𝑖subscriptsuperscript𝑐𝑡𝑘𝑖subscriptsuperscript𝑛𝑎𝑘𝑖subscriptsuperscript𝑐𝑝𝑘𝑖\displaystyle\sum_{k=1}^{K}\sum_{i=1}^{I}\bigg{\{}x_{k,i}\times c^{(\sigma)}_{% k,i}+\sum_{\begin{subarray}{c}\ell=1\\ \ell\neq k\end{subarray}}^{K}y_{k,\ell,i}\times c^{(t)}_{k,\ell,i}+n^{(a)}_{k,% i}\times c^{(p)}_{k,i}\bigg{\}}∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT { italic_x start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT × italic_c start_POSTSUPERSCRIPT ( italic_σ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL roman_ℓ = 1 end_CELL end_ROW start_ROW start_CELL roman_ℓ ≠ italic_k end_CELL end_ROW end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_k , roman_ℓ , italic_i end_POSTSUBSCRIPT × italic_c start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , roman_ℓ , italic_i end_POSTSUBSCRIPT + italic_n start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT × italic_c start_POSTSUPERSCRIPT ( italic_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT } (18)
=\displaystyle== k=1Ki=1I{xk,i×ck,i(σ)+=1kKyk,,i×ck,,i(t)+(nk,i+xk,i+=1kK(y,k,iyk,,i))×ck,i(p)}.superscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼subscript𝑥𝑘𝑖subscriptsuperscript𝑐𝜎𝑘𝑖superscriptsubscript1𝑘𝐾subscript𝑦𝑘𝑖subscriptsuperscript𝑐𝑡𝑘𝑖subscript𝑛𝑘𝑖subscript𝑥𝑘𝑖superscriptsubscript1𝑘𝐾subscript𝑦𝑘𝑖subscript𝑦𝑘𝑖subscriptsuperscript𝑐𝑝𝑘𝑖\displaystyle\sum_{k=1}^{K}\sum_{i=1}^{I}\bigg{\{}x_{k,i}\times c^{(\sigma)}_{% k,i}+\sum_{\begin{subarray}{c}\ell=1\\ \ell\neq k\end{subarray}}^{K}y_{k,\ell,i}\times c^{(t)}_{k,\ell,i}+\Big{(}n_{k% ,i}+x_{k,i}+\sum_{\begin{subarray}{c}\ell=1\\ \ell\neq k\end{subarray}}^{K}(y_{\ell,k,i}-y_{k,\ell,i})\Big{)}\times c^{(p)}_% {k,i}\bigg{\}}.∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT { italic_x start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT × italic_c start_POSTSUPERSCRIPT ( italic_σ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL roman_ℓ = 1 end_CELL end_ROW start_ROW start_CELL roman_ℓ ≠ italic_k end_CELL end_ROW end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_k , roman_ℓ , italic_i end_POSTSUBSCRIPT × italic_c start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , roman_ℓ , italic_i end_POSTSUBSCRIPT + ( italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT + italic_x start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT start_ARG start_ROW start_CELL roman_ℓ = 1 end_CELL end_ROW start_ROW start_CELL roman_ℓ ≠ italic_k end_CELL end_ROW end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT roman_ℓ , italic_k , italic_i end_POSTSUBSCRIPT - italic_y start_POSTSUBSCRIPT italic_k , roman_ℓ , italic_i end_POSTSUBSCRIPT ) ) × italic_c start_POSTSUPERSCRIPT ( italic_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT } .

By above, for all s=([nk,i]𝒦×,[qi]1×)𝑠subscriptdelimited-[]subscript𝑛𝑘𝑖𝒦subscriptdelimited-[]subscript𝑞𝑖1s=\left([n_{k,i}]_{\mathcal{K}\times\mathcal{I}},[q_{i}]_{1\times\mathcal{I}}\right)italic_s = ( [ italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT , [ italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 × caligraphic_I end_POSTSUBSCRIPT ), Eq. (2) can be written as,

E+V(s)superscript𝐸𝑉𝑠\displaystyle E^{*}+V\left(s\right)italic_E start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + italic_V ( italic_s ) =\displaystyle== mina𝒜(s){C(s,a)+s=([nk,i]𝒦×,[qi]1×)𝒮(s|(s,a))V(s)},subscript𝑎𝒜𝑠𝐶𝑠𝑎subscriptsuperscript𝑠subscriptdelimited-[]subscriptsuperscript𝑛𝑘𝑖𝒦subscriptdelimited-[]subscriptsuperscript𝑞𝑖1𝒮conditionalsuperscript𝑠𝑠𝑎𝑉superscript𝑠\displaystyle\min_{a\in\mathcal{A}(s)}\Bigg{\{}C(s,a)+\sum_{s^{\prime}=\left([% n^{{}^{\prime}}_{k,i}]_{\mathcal{K}\times\mathcal{I}},[q^{\prime}_{i}]_{1% \times\mathcal{I}}\right)\in\mathcal{S}}\mathbb{P}\left(s^{\prime}\ |\ (s,a)% \right)V\left(s^{\prime}\right)\Bigg{\}},roman_min start_POSTSUBSCRIPT italic_a ∈ caligraphic_A ( italic_s ) end_POSTSUBSCRIPT { italic_C ( italic_s , italic_a ) + ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = ( [ italic_n start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT , [ italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 × caligraphic_I end_POSTSUBSCRIPT ) ∈ caligraphic_S end_POSTSUBSCRIPT blackboard_P ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | ( italic_s , italic_a ) ) italic_V ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) } ,

where (s|(s,a))conditionalsuperscript𝑠𝑠𝑎\mathbb{P}\left(s^{\prime}\ |\ (s,a)\right)blackboard_P ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | ( italic_s , italic_a ) ) and C(s,a)𝐶𝑠𝑎C\left(s,a\right)italic_C ( italic_s , italic_a ) are given by Eqs. (10) and (18), respectively.

3 Approximate Dynamic Programming approach

Policy Iteration is a standard method of solving Equation (2) when the size of the state space 𝒮𝒮\mathcal{S}caligraphic_S is not too large, see Algorithm 1 in Appendix A, also refer to Puterman [25] for further details.

However, Policy Iteration is not suitable here, and so we apply Approximate Dynamic Programming methods introduced by Powell in [21, 22, 24] to address the curses of dimensionality in Eq. (2), as follows.

  1. 1.

    First, similar to Heydar et al. [15], we apply basis functions ϕf(s)subscriptitalic-ϕ𝑓𝑠\phi_{f}(s)italic_ϕ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( italic_s ), f𝑓f\in\mathcal{F}italic_f ∈ caligraphic_F, to record some suitable information about states s𝑠sitalic_s, referred to as state features, and then the approximation,

    V(s)fϕf(s)θn(f)=ϕ(s)T𝜽n,𝑉𝑠subscript𝑓subscriptitalic-ϕ𝑓𝑠subscriptsuperscript𝜃𝑓𝑛bold-italic-ϕsuperscript𝑠𝑇subscript𝜽𝑛\displaystyle V(s)\approx\sum_{f\in\mathcal{F}}\phi_{f}(s)\theta^{(f)}_{n}=% \mbox{\boldmath$\phi$}(s)^{T}\mbox{\boldmath$\theta$}_{n},italic_V ( italic_s ) ≈ ∑ start_POSTSUBSCRIPT italic_f ∈ caligraphic_F end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( italic_s ) italic_θ start_POSTSUPERSCRIPT ( italic_f ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = bold_italic_ϕ ( italic_s ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT bold_italic_θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , (20)

    where ϕ(s)=[ϕf(s)]fbold-italic-ϕ𝑠subscriptdelimited-[]subscriptitalic-ϕ𝑓𝑠𝑓\mbox{\boldmath$\phi$}(s)=[\phi_{f}(s)]_{f\in\mathcal{F}}bold_italic_ϕ ( italic_s ) = [ italic_ϕ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( italic_s ) ] start_POSTSUBSCRIPT italic_f ∈ caligraphic_F end_POSTSUBSCRIPT is the vector of features, and 𝜽n=[θn(f)]fsubscript𝜽𝑛subscriptdelimited-[]subscriptsuperscript𝜃𝑓𝑛𝑓\mbox{\boldmath$\theta$}_{n}=[\theta^{(f)}_{n}]_{f\in\mathcal{F}}bold_italic_θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = [ italic_θ start_POSTSUPERSCRIPT ( italic_f ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_f ∈ caligraphic_F end_POSTSUBSCRIPT is the vector of corresponding weights θn(f)subscriptsuperscript𝜃𝑓𝑛\theta^{(f)}_{n}italic_θ start_POSTSUPERSCRIPT ( italic_f ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, evaluated at the n𝑛nitalic_n-th iteration of an algorithm.

  2. 2.

    Next, we apply the Approximate Policy Iteration presented by Dai and Shi in [10], in which the weights θn(f)subscriptsuperscript𝜃𝑓𝑛\theta^{(f)}_{n}italic_θ start_POSTSUPERSCRIPT ( italic_f ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT are recursively updated in an iteration that is repeated N𝑁Nitalic_N times. Each iteration n𝑛nitalic_n involves a simulation of M𝑀Mitalic_M states for some large M𝑀Mitalic_M. We summarise this approach in Algorithms 23 in Appendix A. We note that the Markov chains applied in our models are irreducible and positive recurrent, and so the algorithms are guaranteed to converge as M𝑀M\to\inftyitalic_M → ∞  [10].

Consider our Model in Section 2.1. Suppose that the vector of features of state s=([nk,i]𝒦×,[qi]1×)𝑠subscriptdelimited-[]subscript𝑛𝑘𝑖𝒦subscriptdelimited-[]subscript𝑞𝑖1s=\left([n_{k,i}]_{\mathcal{K}\times\mathcal{I}},[q_{i}]_{1\times\mathcal{I}}\right)italic_s = ( [ italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT , [ italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 × caligraphic_I end_POSTSUBSCRIPT ) is the vector s𝑠\vec{s}over→ start_ARG italic_s end_ARG, and so a vector containing all information about state s𝑠sitalic_s. We note that to compute the expression in Line 6 of Algorithms 2 in Appendix A and in Line 4 in Algorithm 3 in Appendix A, it is then convenient to apply the following equivalence. Assuming s=([nk,i]𝒦×,[qi]1×)𝑠subscriptdelimited-[]subscript𝑛𝑘𝑖𝒦subscriptdelimited-[]subscript𝑞𝑖1s=\left([n_{k,i}]_{\mathcal{K}\times\mathcal{I}},[q_{i}]_{1\times\mathcal{I}}\right)italic_s = ( [ italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT , [ italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 × caligraphic_I end_POSTSUBSCRIPT ), and with b=k=1Kmkk=1Ki=1Ink,i(a)𝑏superscriptsubscript𝑘1𝐾subscript𝑚𝑘superscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼subscriptsuperscript𝑛𝑎𝑘𝑖b=\sum_{k=1}^{K}m_{k}-\sum_{k=1}^{K}\sum_{i=1}^{I}n^{(a)}_{k,i}italic_b = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT, we have

s𝒮(s|(s,a))ϕ(s)𝜽subscriptsuperscript𝑠𝒮conditionalsuperscript𝑠𝑠𝑎bold-italic-ϕsuperscript𝑠𝜽\displaystyle\sum_{s^{{}^{\prime}}\in\mathcal{S}}\mathbb{P}\left(s^{\prime}\ |% \ (s,a)\right)\mbox{\boldmath$\phi$}(s^{{}^{\prime}})\mbox{\boldmath$\theta$}∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ∈ caligraphic_S end_POSTSUBSCRIPT blackboard_P ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | ( italic_s , italic_a ) ) bold_italic_ϕ ( italic_s start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) bold_italic_θ =\displaystyle== 𝔼(ϕ(s)𝜽|(s,a))𝔼conditionalbold-italic-ϕsuperscript𝑠𝜽𝑠𝑎\displaystyle\mathbb{E}(\mbox{\boldmath$\phi$}(s^{{}^{\prime}})\mbox{\boldmath% $\theta$}\ |\ (s,a))blackboard_E ( bold_italic_ϕ ( italic_s start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) bold_italic_θ | ( italic_s , italic_a ) ) (21)
=\displaystyle== k=1Ki=1I𝔼(Nk,i)θk,i+i=1I𝔼(Qi)θisuperscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼𝔼superscriptsubscript𝑁𝑘𝑖subscript𝜃𝑘𝑖superscriptsubscript𝑖1𝐼𝔼superscriptsubscript𝑄𝑖subscript𝜃𝑖\displaystyle\sum_{k=1}^{K}\sum_{i=1}^{I}\mathbb{E}(N_{k,i}^{{}^{\prime}})% \theta_{k,i}+\sum_{i=1}^{I}\mathbb{E}(Q_{i}^{{}^{\prime}})\theta_{i}∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT blackboard_E ( italic_N start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) italic_θ start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT blackboard_E ( italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (22)
=\displaystyle== k=1Ki=1I(nk,i(a)𝔼(Zk,i))θk,i+i=1I𝔼(𝔼(Qi|Q))θisuperscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼subscriptsuperscript𝑛𝑎𝑘𝑖𝔼subscript𝑍𝑘𝑖subscript𝜃𝑘𝑖superscriptsubscript𝑖1𝐼𝔼𝔼conditionalsuperscriptsubscript𝑄𝑖𝑄subscript𝜃𝑖\displaystyle\sum_{k=1}^{K}\sum_{i=1}^{I}(n^{(a)}_{k,i}-\mathbb{E}(Z_{k,i}))% \theta_{k,i}+\sum_{i=1}^{I}\mathbb{E}\left(\mathbb{E}(Q_{i}^{{}^{\prime}}\ |\ % Q\ )\right)\theta_{i}∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT ( italic_n start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT - blackboard_E ( italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) ) italic_θ start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT blackboard_E ( blackboard_E ( italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT | italic_Q ) ) italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (23)
=\displaystyle== k=1Ki=1Ink,i(a)(1pk,i)θk,i+i=1Iλiλ×𝔼(Q)θi,superscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼subscriptsuperscript𝑛𝑎𝑘𝑖1subscript𝑝𝑘𝑖subscript𝜃𝑘𝑖superscriptsubscript𝑖1𝐼subscript𝜆𝑖𝜆𝔼𝑄subscript𝜃𝑖\displaystyle\sum_{k=1}^{K}\sum_{i=1}^{I}n^{(a)}_{k,i}(1-p_{k,i})\theta_{k,i}+% \sum_{i=1}^{I}\frac{\lambda_{i}}{\lambda}\times\mathbb{E}(Q)\theta_{i},∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ( 1 - italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) italic_θ start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT divide start_ARG italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_λ end_ARG × blackboard_E ( italic_Q ) italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , (24)

where nk,i(a)subscriptsuperscript𝑛𝑎𝑘𝑖n^{(a)}_{k,i}italic_n start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT is given by Eq. (9), 𝔼(Zk,i)=nk,i(a)pk,i𝔼subscript𝑍𝑘𝑖subscriptsuperscript𝑛𝑎𝑘𝑖subscript𝑝𝑘𝑖\mathbb{E}(Z_{k,i})=n^{(a)}_{k,i}p_{k,i}blackboard_E ( italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) = italic_n start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT (binomial mean), Nk,isuperscriptsubscript𝑁𝑘𝑖N_{k,i}^{{}^{\prime}}italic_N start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT is a random variable recording the number of type-i𝑖iitalic_i patients in ward k𝑘kitalic_k and Qisuperscriptsubscript𝑄𝑖Q_{i}^{{}^{\prime}}italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT a random variable recording the number of type-i𝑖iitalic_i patients waiting to be assigned when state ssuperscript𝑠s^{{}^{\prime}}italic_s start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT is observed, and Q=i=1IQi𝑄superscriptsubscript𝑖1𝐼superscriptsubscript𝑄𝑖Q=\sum_{i=1}^{I}Q_{i}^{{}^{\prime}}italic_Q = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_Q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT is the total number of arrivals.

If the number of arrivals is unrestricted with mK+1=subscript𝑚𝐾1m_{K+1}=\inftyitalic_m start_POSTSUBSCRIPT italic_K + 1 end_POSTSUBSCRIPT = ∞, then 𝔼(Q)=λ𝔼𝑄𝜆\mathbb{E}(Q)=\lambdablackboard_E ( italic_Q ) = italic_λ. Alternatively, if the number of arrivals is restricted so that we only allow arrivals that can be assigned, which can be written as,

k=1Ki=1Ink,i+i=1Iqik=1Kmk=m,superscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼superscriptsubscript𝑛𝑘𝑖superscriptsubscript𝑖1𝐼superscriptsubscript𝑞𝑖superscriptsubscript𝑘1𝐾subscript𝑚𝑘𝑚\sum_{k=1}^{K}\sum_{i=1}^{I}n_{k,i}^{{}^{\prime}}+\sum_{i=1}^{I}q_{i}^{{}^{% \prime}}\leq\sum_{k=1}^{K}m_{k}=m,∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ≤ ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_m , (25)

then we have the following approach for evaluating 𝔼(Q)𝔼𝑄\mathbb{E}(Q)blackboard_E ( italic_Q ).

Let Z𝑍Zitalic_Z be the random variable recording the total number of departures,

Z𝑍\displaystyle Zitalic_Z =\displaystyle== k=1Ki=1IZk,i,superscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼subscript𝑍𝑘𝑖\displaystyle\sum_{k=1}^{K}\sum_{i=1}^{I}Z_{k,i},∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT , (26)

taking values z=0,,zmax𝑧0subscript𝑧𝑚𝑎𝑥z=0,\ldots,z_{max}italic_z = 0 , … , italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT, with zmax=k=1Ki=1Ink,i(a)subscript𝑧𝑚𝑎𝑥superscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼superscriptsubscript𝑛𝑘𝑖𝑎z_{max}=\sum_{k=1}^{K}\sum_{i=1}^{I}n_{k,i}^{(a)}italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT. Let N𝑁Nitalic_N be the number of available beds for the new arrivals, given by

N𝑁\displaystyle Nitalic_N =\displaystyle== mk=1Ki=1INk,i=mzmax+Z,𝑚superscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼superscriptsubscript𝑁𝑘𝑖𝑚subscript𝑧𝑚𝑎𝑥𝑍\displaystyle m-\sum_{k=1}^{K}\sum_{i=1}^{I}N_{k,i}^{{}^{\prime}}=m-z_{max}+Z,italic_m - ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT = italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_Z , (27)

taking values n=mzmax,,m𝑛𝑚subscript𝑧𝑚𝑎𝑥𝑚n=m-z_{max},\ldots,mitalic_n = italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT , … , italic_m. Therefore,

𝔼(Q|Z=z)𝔼conditional𝑄𝑍𝑧\displaystyle\mathbb{E}\left(Q\ |\ Z=z\right)blackboard_E ( italic_Q | italic_Z = italic_z ) =\displaystyle== 𝔼(Q|N=mzmax+z)𝔼conditional𝑄𝑁𝑚subscript𝑧𝑚𝑎𝑥𝑧\displaystyle\mathbb{E}\left(Q\ |\ N=m-z_{max}+z\right)blackboard_E ( italic_Q | italic_N = italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z )
=\displaystyle== k=0mzmax+zk×λkk!eλ+k=mzmax+z+1(mzmax+z)×λkk!eλsuperscriptsubscript𝑘0𝑚subscript𝑧𝑚𝑎𝑥𝑧𝑘superscript𝜆𝑘𝑘superscript𝑒𝜆superscriptsubscript𝑘𝑚subscript𝑧𝑚𝑎𝑥𝑧1𝑚subscript𝑧𝑚𝑎𝑥𝑧superscript𝜆𝑘𝑘superscript𝑒𝜆\displaystyle\sum_{k=0}^{m-z_{max}+z}k\times\frac{\lambda^{k}}{k!}e^{-\lambda}% +\sum_{k=m-z_{max}+z+1}^{\infty}(m-z_{max}+z)\times\frac{\lambda^{k}}{k!}e^{-\lambda}∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z end_POSTSUPERSCRIPT italic_k × divide start_ARG italic_λ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_k = italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z ) × divide start_ARG italic_λ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT
=\displaystyle== k=0mzmax+zk×λkk!eλ+(mzmax+z)(1k=0mzmax+zλkk!eλ)superscriptsubscript𝑘0𝑚subscript𝑧𝑚𝑎𝑥𝑧𝑘superscript𝜆𝑘𝑘superscript𝑒𝜆𝑚subscript𝑧𝑚𝑎𝑥𝑧1superscriptsubscript𝑘0𝑚subscript𝑧𝑚𝑎𝑥𝑧superscript𝜆𝑘𝑘superscript𝑒𝜆\displaystyle\sum_{k=0}^{m-z_{max}+z}k\times\frac{\lambda^{k}}{k!}e^{-\lambda}% +(m-z_{max}+z)\left(1-\sum_{k=0}^{m-z_{max}+z}\frac{\lambda^{k}}{k!}e^{-% \lambda}\right)∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z end_POSTSUPERSCRIPT italic_k × divide start_ARG italic_λ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT + ( italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z ) ( 1 - ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z end_POSTSUPERSCRIPT divide start_ARG italic_λ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT )
=\displaystyle== k=0mzmax+z1λ×λkk!eλ+(mzmax+z)(1k=0mzmax+zλkk!eλ)superscriptsubscript𝑘0𝑚subscript𝑧𝑚𝑎𝑥𝑧1𝜆superscript𝜆𝑘𝑘superscript𝑒𝜆𝑚subscript𝑧𝑚𝑎𝑥𝑧1superscriptsubscript𝑘0𝑚subscript𝑧𝑚𝑎𝑥𝑧superscript𝜆𝑘𝑘superscript𝑒𝜆\displaystyle\sum_{k=0}^{m-z_{max}+z-1}\lambda\times\frac{\lambda^{k}}{k!}e^{-% \lambda}+(m-z_{max}+z)\left(1-\sum_{k=0}^{m-z_{max}+z}\frac{\lambda^{k}}{k!}e^% {-\lambda}\right)∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z - 1 end_POSTSUPERSCRIPT italic_λ × divide start_ARG italic_λ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT + ( italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z ) ( 1 - ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z end_POSTSUPERSCRIPT divide start_ARG italic_λ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT )

and so

𝔼(Q)𝔼𝑄\displaystyle\mathbb{E}(Q)blackboard_E ( italic_Q ) =\displaystyle== z=0zmax𝔼(Q|Z=z)(Z=z)superscriptsubscript𝑧0subscript𝑧𝑚𝑎𝑥𝔼conditional𝑄𝑍𝑧𝑍𝑧\displaystyle\sum_{z=0}^{z_{max}}\mathbb{E}\left(Q\ |\ Z=z\right)\mathbb{P}(Z=z)∑ start_POSTSUBSCRIPT italic_z = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT end_POSTSUPERSCRIPT blackboard_E ( italic_Q | italic_Z = italic_z ) blackboard_P ( italic_Z = italic_z ) (28)
=\displaystyle== z=0zmax(k=0mzmax+zk×λkk!eλ+(mzmax+z)(1k=0mzmax+zλkk!eλ))(Z=z)superscriptsubscript𝑧0subscript𝑧𝑚𝑎𝑥superscriptsubscript𝑘0𝑚subscript𝑧𝑚𝑎𝑥𝑧𝑘superscript𝜆𝑘𝑘superscript𝑒𝜆𝑚subscript𝑧𝑚𝑎𝑥𝑧1superscriptsubscript𝑘0𝑚subscript𝑧𝑚𝑎𝑥𝑧superscript𝜆𝑘𝑘superscript𝑒𝜆𝑍𝑧\displaystyle\sum_{z=0}^{z_{max}}\left(\sum_{k=0}^{m-z_{max}+z}k\times\frac{% \lambda^{k}}{k!}e^{-\lambda}+(m-z_{max}+z)\left(1-\sum_{k=0}^{m-z_{max}+z}% \frac{\lambda^{k}}{k!}e^{-\lambda}\right)\right)\mathbb{P}(Z=z)∑ start_POSTSUBSCRIPT italic_z = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z end_POSTSUPERSCRIPT italic_k × divide start_ARG italic_λ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT + ( italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z ) ( 1 - ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z end_POSTSUPERSCRIPT divide start_ARG italic_λ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT ) ) blackboard_P ( italic_Z = italic_z )
=\displaystyle== z=0zmax(Z=z)(mzmax+z)superscriptsubscript𝑧0subscript𝑧𝑚𝑎𝑥𝑍𝑧𝑚subscript𝑧𝑚𝑎𝑥𝑧\displaystyle\sum_{z=0}^{z_{max}}\mathbb{P}(Z=z)(m-z_{max}+z)∑ start_POSTSUBSCRIPT italic_z = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT end_POSTSUPERSCRIPT blackboard_P ( italic_Z = italic_z ) ( italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z )
+z=0zmax(k=0mzmax+z1λ×λkk!eλ(mzmax+z)(k=0mzmax+zλkk!eλ))(Z=z)superscriptsubscript𝑧0subscript𝑧𝑚𝑎𝑥superscriptsubscript𝑘0𝑚subscript𝑧𝑚𝑎𝑥𝑧1𝜆superscript𝜆𝑘𝑘superscript𝑒𝜆𝑚subscript𝑧𝑚𝑎𝑥𝑧superscriptsubscript𝑘0𝑚subscript𝑧𝑚𝑎𝑥𝑧superscript𝜆𝑘𝑘superscript𝑒𝜆𝑍𝑧\displaystyle+\sum_{z=0}^{z_{max}}\left(\sum_{k=0}^{m-z_{max}+z-1}\lambda% \times\frac{\lambda^{k}}{k!}e^{-\lambda}-(m-z_{max}+z)\left(\sum_{k=0}^{m-z_{% max}+z}\frac{\lambda^{k}}{k!}e^{-\lambda}\right)\right)\mathbb{P}(Z=z)+ ∑ start_POSTSUBSCRIPT italic_z = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z - 1 end_POSTSUPERSCRIPT italic_λ × divide start_ARG italic_λ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT - ( italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z ) ( ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z end_POSTSUPERSCRIPT divide start_ARG italic_λ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT ) ) blackboard_P ( italic_Z = italic_z )
=\displaystyle== mzmax+𝔼(Z)𝑚subscript𝑧𝑚𝑎𝑥𝔼𝑍\displaystyle m-z_{max}+\mathbb{E}(Z)italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + blackboard_E ( italic_Z )
+z=0zmax(λ(mzmax+z))(k=0mzmax+z1λkk!eλ)(Z=z)superscriptsubscript𝑧0subscript𝑧𝑚𝑎𝑥𝜆𝑚subscript𝑧𝑚𝑎𝑥𝑧superscriptsubscript𝑘0𝑚subscript𝑧𝑚𝑎𝑥𝑧1superscript𝜆𝑘𝑘superscript𝑒𝜆𝑍𝑧\displaystyle+\sum_{z=0}^{z_{max}}\left(\lambda-(m-z_{max}+z)\right)\left(\sum% _{k=0}^{m-z_{max}+z-1}\frac{\lambda^{k}}{k!}e^{-\lambda}\right)\mathbb{P}(Z=z)+ ∑ start_POSTSUBSCRIPT italic_z = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_λ - ( italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z ) ) ( ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z - 1 end_POSTSUPERSCRIPT divide start_ARG italic_λ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT ) blackboard_P ( italic_Z = italic_z )
z=0zmax(mzmax+z)(λmzmax+z(mzmax+z)!eλ)(Z=z),superscriptsubscript𝑧0subscript𝑧𝑚𝑎𝑥𝑚subscript𝑧𝑚𝑎𝑥𝑧superscript𝜆𝑚subscript𝑧𝑚𝑎𝑥𝑧𝑚subscript𝑧𝑚𝑎𝑥𝑧superscript𝑒𝜆𝑍𝑧\displaystyle-\sum_{z=0}^{z_{max}}(m-z_{max}+z)\left(\frac{\lambda^{m-z_{max}+% z}}{(m-z_{max}+z)!}e^{-\lambda}\right)\mathbb{P}(Z=z),- ∑ start_POSTSUBSCRIPT italic_z = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z ) ( divide start_ARG italic_λ start_POSTSUPERSCRIPT italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z end_POSTSUPERSCRIPT end_ARG start_ARG ( italic_m - italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT + italic_z ) ! end_ARG italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT ) blackboard_P ( italic_Z = italic_z ) ,

where 𝔼(Z)=k=1Ki=1Ink,i(a)pk,i𝔼𝑍superscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼subscriptsuperscript𝑛𝑎𝑘𝑖subscript𝑝𝑘𝑖\mathbb{E}(Z)=\sum_{k=1}^{K}\sum_{i=1}^{I}n^{(a)}_{k,i}p_{k,i}blackboard_E ( italic_Z ) = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT.

Further, in order to compute (Z=z)𝑍𝑧\mathbb{P}(Z=z)blackboard_P ( italic_Z = italic_z ) in (28), we note that Z=k=1Ki=1IZk,iI(nk,i(a)0)𝑍superscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼subscript𝑍𝑘𝑖𝐼superscriptsubscript𝑛𝑘𝑖𝑎0Z=\sum_{k=1}^{K}\sum_{i=1}^{I}Z_{k,i}I(n_{k,i}^{(a)}\not=0)italic_Z = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT italic_I ( italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ≠ 0 ) is a sum of independent binomial random variables Zk,iBin(nk,i(a),pk,i)similar-tosubscript𝑍𝑘𝑖𝐵𝑖𝑛superscriptsubscript𝑛𝑘𝑖𝑎subscript𝑝𝑘𝑖Z_{k,i}\sim Bin(n_{k,i}^{(a)},p_{k,i})italic_Z start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ∼ italic_B italic_i italic_n ( italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT , italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) such that nk,i(a)0superscriptsubscript𝑛𝑘𝑖𝑎0n_{k,i}^{(a)}\not=0italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ≠ 0.

A method for computing the distribution of a sum of independent binomial random variables was discussed by Butler and Stephens in [7]. Below, we suggest an alternative method, which relies on simple matrix multiplications and standard theory of Markov chains.

Without loss of generality, suppose that Z=i=1MZi𝑍superscriptsubscript𝑖1𝑀subscript𝑍𝑖Z=\sum_{i=1}^{M}Z_{i}italic_Z = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is a random variable such that ZiBin(ni,pi)similar-tosubscript𝑍𝑖𝐵𝑖𝑛subscript𝑛𝑖subscript𝑝𝑖Z_{i}\sim Bin(n_{i},p_{i})italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∼ italic_B italic_i italic_n ( italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) are independent Binomial random variables with some parameters ni>0subscript𝑛𝑖0n_{i}>0italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0 and 0<pi<10subscript𝑝𝑖10<p_{i}<10 < italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < 1, for i=1,M𝑖1𝑀i=1,\ldots Mitalic_i = 1 , … italic_M.

To compute (Z=z)𝑍𝑧\mathbb{P}(Z=z)blackboard_P ( italic_Z = italic_z ), it is convenient to construct a discrete-time Markov chain corresponding to the Bernoulli trials n1,n2,,nMsubscript𝑛1subscript𝑛2subscript𝑛𝑀n_{1},n_{2},\ldots,n_{M}italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_n start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT such that a success at each trial results in the chain moving one step to the right, and failure results in the chain remaining at the original state. Then, the distribution of the chain after all n1++nMsubscript𝑛1subscript𝑛𝑀n_{1}+\ldots+n_{M}italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + … + italic_n start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT trials will give the distribution of Z𝑍Zitalic_Z.

So, we consider a discrete-time Markov chain {(J(t)):t=0,1,2,,zmax}conditional-set𝐽𝑡𝑡012subscript𝑧𝑚𝑎𝑥\{(J(t)):t=0,1,2,\ldots,z_{max}\}{ ( italic_J ( italic_t ) ) : italic_t = 0 , 1 , 2 , … , italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT } terminating at time zmax=i=1Mnisubscript𝑧𝑚𝑎𝑥superscriptsubscript𝑖1𝑀subscript𝑛𝑖z_{max}=\sum_{i=1}^{M}n_{i}italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, with state space 𝒮={0,,zmax}𝒮0subscript𝑧𝑚𝑎𝑥\mathcal{S}=\{0,\ldots,z_{max}\}caligraphic_S = { 0 , … , italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT } and an initial state J(0)=0𝐽00J(0)=0italic_J ( 0 ) = 0, which evolves as follows. We perform the first n1subscript𝑛1n_{1}italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT Bernoulli trials at times t=1,,n1𝑡1subscript𝑛1t=1,\ldots,n_{1}italic_t = 1 , … , italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and let (J(t)=J(t1)+1)=p1𝐽𝑡𝐽𝑡11subscript𝑝1\mathbb{P}(J(t)=J(t-1)+1)=p_{1}blackboard_P ( italic_J ( italic_t ) = italic_J ( italic_t - 1 ) + 1 ) = italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, (J(t)=J(t1))=1p1𝐽𝑡𝐽𝑡11subscript𝑝1\mathbb{P}(J(t)=J(t-1))=1-p_{1}blackboard_P ( italic_J ( italic_t ) = italic_J ( italic_t - 1 ) ) = 1 - italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Then, the distribution of the chain after the first n1subscript𝑛1n_{1}italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT trials, and so at time t=n1𝑡subscript𝑛1t=n_{1}italic_t = italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, is given by

𝜶(n1)𝜶subscript𝑛1\displaystyle\mbox{\boldmath$\alpha$}(n_{1})bold_italic_α ( italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) =\displaystyle== [𝜶(0)𝟎1×n1]𝐏(1),𝜶0subscript01subscript𝑛1superscript𝐏1\displaystyle\left[\mbox{\boldmath$\alpha$}(0)\quad{\bf 0}_{1\times n_{1}}% \right]{\bf P}^{(1)},[ bold_italic_α ( 0 ) bold_0 start_POSTSUBSCRIPT 1 × italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT , (29)

where 𝜶(0)=[α(0)j]j=0𝜶0subscriptdelimited-[]𝛼subscript0𝑗𝑗0\mbox{\boldmath$\alpha$}(0)=[\alpha(0)_{j}]_{j=0}bold_italic_α ( 0 ) = [ italic_α ( 0 ) start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_j = 0 end_POSTSUBSCRIPT with α(0)0=1𝛼subscript001\alpha(0)_{0}=1italic_α ( 0 ) start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 1, and 𝐏[1]=[Pj,j[1]]j,j=0,1,,n1superscript𝐏delimited-[]1subscriptdelimited-[]superscriptsubscript𝑃𝑗superscript𝑗delimited-[]1formulae-sequence𝑗superscript𝑗01subscript𝑛1{\bf P}^{[1]}=[P_{j,j^{\prime}}^{[1]}]_{j,j^{\prime}=0,1,\ldots,n_{1}}bold_P start_POSTSUPERSCRIPT [ 1 ] end_POSTSUPERSCRIPT = [ italic_P start_POSTSUBSCRIPT italic_j , italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT [ 1 ] end_POSTSUPERSCRIPT ] start_POSTSUBSCRIPT italic_j , italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 0 , 1 , … , italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT such that Pj,j[1]=(Z1=jj)superscriptsubscript𝑃𝑗superscript𝑗delimited-[]1subscript𝑍1superscript𝑗𝑗P_{j,j^{\prime}}^{[1]}=\mathbb{P}(Z_{1}=j^{\prime}-j)italic_P start_POSTSUBSCRIPT italic_j , italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT [ 1 ] end_POSTSUPERSCRIPT = blackboard_P ( italic_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - italic_j ).

Next, we repeat this and perform nisubscript𝑛𝑖n_{i}italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT Bernoulli trials at times t=n1++ni1+1,,n1++ni𝑡subscript𝑛1subscript𝑛𝑖11subscript𝑛1subscript𝑛𝑖t=n_{1}+\ldots+n_{i-1}+1,\ldots,n_{1}+\ldots+n_{i}italic_t = italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + … + italic_n start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT + 1 , … , italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + … + italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and let (J(t)=J(t1)+1)=pi𝐽𝑡𝐽𝑡11subscript𝑝𝑖\mathbb{P}(J(t)=J(t-1)+1)=p_{i}blackboard_P ( italic_J ( italic_t ) = italic_J ( italic_t - 1 ) + 1 ) = italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, (J(t)=J(t1))=1pi𝐽𝑡𝐽𝑡11subscript𝑝𝑖\mathbb{P}(J(t)=J(t-1))=1-p_{i}blackboard_P ( italic_J ( italic_t ) = italic_J ( italic_t - 1 ) ) = 1 - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, for each i=2,,M𝑖2𝑀i=2,\ldots,Mitalic_i = 2 , … , italic_M. Then, the distribution after additional nisubscript𝑛𝑖n_{i}italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT trials is given by the recursion

𝜶(n1++ni)𝜶subscript𝑛1subscript𝑛𝑖\displaystyle\mbox{\boldmath$\alpha$}(n_{1}+\ldots+n_{i})bold_italic_α ( italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + … + italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) =\displaystyle== [𝜶(n1++ni1)𝟎1×ni]𝐏(i),𝜶subscript𝑛1subscript𝑛𝑖1subscript01subscript𝑛𝑖superscript𝐏𝑖\displaystyle\left[\mbox{\boldmath$\alpha$}(n_{1}+\ldots+n_{i-1})\quad{\bf 0}_% {1\times n_{i}}\right]{\bf P}^{(i)},[ bold_italic_α ( italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + … + italic_n start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT ) bold_0 start_POSTSUBSCRIPT 1 × italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] bold_P start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , (30)

where 𝐏[i]=[Pj,j[i]]j,j=0,1,,n1++nisuperscript𝐏delimited-[]𝑖subscriptdelimited-[]superscriptsubscript𝑃𝑗superscript𝑗delimited-[]𝑖formulae-sequence𝑗superscript𝑗01subscript𝑛1subscript𝑛𝑖{\bf P}^{[i]}=[P_{j,j^{\prime}}^{[i]}]_{j,j^{\prime}=0,1,\ldots,n_{1}+\ldots+n% _{i}}bold_P start_POSTSUPERSCRIPT [ italic_i ] end_POSTSUPERSCRIPT = [ italic_P start_POSTSUBSCRIPT italic_j , italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT [ italic_i ] end_POSTSUPERSCRIPT ] start_POSTSUBSCRIPT italic_j , italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 0 , 1 , … , italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + … + italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT, Pj,j[i]=(Z1=jj)superscriptsubscript𝑃𝑗superscript𝑗delimited-[]𝑖subscript𝑍1superscript𝑗𝑗P_{j,j^{\prime}}^{[i]}=\mathbb{P}(Z_{1}=j^{\prime}-j)italic_P start_POSTSUBSCRIPT italic_j , italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT [ italic_i ] end_POSTSUPERSCRIPT = blackboard_P ( italic_Z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - italic_j ). To compute 𝐏(i)superscript𝐏𝑖{\bf P}^{(i)}bold_P start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT we apply the formula

𝐏(i)superscript𝐏𝑖\displaystyle{\bf P}^{(i)}bold_P start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT =\displaystyle== (𝐀(i))ni,superscriptsuperscript𝐀𝑖subscript𝑛𝑖\displaystyle\left({\bf A}^{(i)}\right)^{n_{i}},( bold_A start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , (31)

where

𝐀(i)superscript𝐀𝑖\displaystyle{\bf A}^{(i)}bold_A start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT =\displaystyle== [A(i)]j,j=0,1,,n1++ni=[1pipi0001pipi0001pipi].subscriptdelimited-[]superscript𝐴𝑖formulae-sequence𝑗superscript𝑗01subscript𝑛1subscript𝑛𝑖delimited-[]1subscript𝑝𝑖subscript𝑝𝑖0001subscript𝑝𝑖subscript𝑝𝑖0001subscript𝑝𝑖subscript𝑝𝑖\displaystyle[A^{(i)}]_{j,j^{\prime}=0,1,\ldots,n_{1}+\ldots+n_{i}}=\left[% \begin{array}[]{llllll}1-p_{i}&p_{i}&0&\ldots&\ldots&0\\ 0&1-p_{i}&p_{i}&\ldots&\ldots&0\\ \vdots&\vdots&\vdots&\vdots&\vdots&\vdots\\ 0&\ldots&0&\ldots&1-p_{i}&p_{i}\end{array}\right].[ italic_A start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ] start_POSTSUBSCRIPT italic_j , italic_j start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 0 , 1 , … , italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + … + italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT = [ start_ARRAY start_ROW start_CELL 1 - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL start_CELL italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL start_CELL 0 end_CELL start_CELL … end_CELL start_CELL … end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 1 - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL start_CELL italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL … end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL … end_CELL start_CELL 0 end_CELL start_CELL … end_CELL start_CELL 1 - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL start_CELL italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] .

It follows that, for j=0,1,,zmax𝑗01subscript𝑧𝑚𝑎𝑥j=0,1,\ldots,z_{max}italic_j = 0 , 1 , … , italic_z start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT,

(Z=j)𝑍𝑗\displaystyle\mathbb{P}(Z=j)blackboard_P ( italic_Z = italic_j ) =\displaystyle== [𝜶(n1++nM)]j=[𝜶(n1++nM1)𝟎1×nM]𝐏(M).subscriptdelimited-[]𝜶subscript𝑛1subscript𝑛𝑀𝑗𝜶subscript𝑛1subscript𝑛𝑀1subscript01subscript𝑛𝑀superscript𝐏𝑀\displaystyle[\mbox{\boldmath$\alpha$}(n_{1}+\ldots+n_{M})]_{j}=\left[\mbox{% \boldmath$\alpha$}(n_{1}+\ldots+n_{M-1})\quad{\bf 0}_{1\times n_{M}}\right]{% \bf P}^{(M)}.[ bold_italic_α ( italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + … + italic_n start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ) ] start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = [ bold_italic_α ( italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + … + italic_n start_POSTSUBSCRIPT italic_M - 1 end_POSTSUBSCRIPT ) bold_0 start_POSTSUBSCRIPT 1 × italic_n start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] bold_P start_POSTSUPERSCRIPT ( italic_M ) end_POSTSUPERSCRIPT . (33)

This method for computing the probabilities (Z=z)𝑍𝑧\mathbb{P}(Z=z)blackboard_P ( italic_Z = italic_z ) for a sum Z=i=1MZi𝑍superscriptsubscript𝑖1𝑀subscript𝑍𝑖Z=\sum_{i=1}^{M}Z_{i}italic_Z = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT of independent nonnegative random variables Zisubscript𝑍𝑖Z_{i}italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT that take values in finite sets, can be applied for any desired discrete distributions of Zisubscript𝑍𝑖Z_{i}italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, by replacing (31) with a suitable formula for 𝐏(i)superscript𝐏𝑖{\bf P}^{(i)}bold_P start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT.

Alternatively, (Z=z)𝑍𝑧\mathbb{P}(Z=z)blackboard_P ( italic_Z = italic_z ) can be computed by numerically inverting the probability generating function GZ(s)subscript𝐺𝑍𝑠G_{Z}(s)italic_G start_POSTSUBSCRIPT italic_Z end_POSTSUBSCRIPT ( italic_s ) of the random variable Z𝑍Zitalic_Z, which here is given by

GZ(s)subscript𝐺𝑍𝑠\displaystyle G_{Z}(s)italic_G start_POSTSUBSCRIPT italic_Z end_POSTSUBSCRIPT ( italic_s ) =\displaystyle== i=1MGZi(s)=i=1M(1pi+spi)ni,superscriptsubscriptproduct𝑖1𝑀subscript𝐺subscript𝑍𝑖𝑠superscriptsubscriptproduct𝑖1𝑀superscript1subscript𝑝𝑖𝑠subscript𝑝𝑖subscript𝑛𝑖\displaystyle\prod_{i=1}^{M}G_{Z_{i}}(s)=\prod_{i=1}^{M}(1-p_{i}+sp_{i})^{n_{i% }},∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT italic_G start_POSTSUBSCRIPT italic_Z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s ) = ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT ( 1 - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_s italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , (34)

using a suitable inversion algorithm, for example see Abate and Whitt [1].

4 Numerical examples

Here, we construct examples to illustrate the theory and the application of algorithms discussed above. First, in Example 1, we construct a Markov model with a small state space, and demonstrate that the approximate solution converges to the exact optimal solution (which we obtained by applying standard dynamic programming methods). Next, in Example 2, we construct a realistically-sized Markov model with assumptions driven by practical considerations driven by the conditions of real-world hospitals, and demonstrate the application potential of our methodology.

4.1 Small-sized example

We consider the following simple example to illustrate the application of our Model in Section 2.1. As the size of the state space 𝒮𝒮\mathcal{S}caligraphic_S in this example is small enough to allow for the application of standard dynamic programming methods, we apply Algorithm 1 in Appendix A to find the exact solution and then compare it with the approximation obtained using Algorithms 23.

Example 1

Consider a healthcare facility with K=2𝐾2K=2italic_K = 2 wards, with capacity mk=1subscript𝑚𝑘1m_{k}=1italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 1 for each ward k𝑘kitalic_k, and I=2𝐼2I=2italic_I = 2 patient types. Assuming that the maximum capacity of the waiting area is m3=2subscript𝑚32m_{3}=2italic_m start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = 2, that is i=1Iqim3superscriptsubscript𝑖1𝐼subscript𝑞𝑖subscript𝑚3\sum_{i=1}^{I}q_{i}\leq m_{3}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_m start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, and that the arrivals are only accepted if there is available capacity in the system according to

k=1Ki=1Ink,i+i=1Iqik=1Kmk=m,superscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼subscript𝑛𝑘𝑖superscriptsubscript𝑖1𝐼subscript𝑞𝑖superscriptsubscript𝑘1𝐾subscript𝑚𝑘𝑚\sum_{k=1}^{K}\sum_{i=1}^{I}n_{k,i}+\sum_{i=1}^{I}q_{i}\leq\sum_{k=1}^{K}m_{k}% =m,∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_m , (35)

we define the state space of the system as follows:

𝒮𝒮\displaystyle\mathcal{S}caligraphic_S =\displaystyle== {([n1,1n1,2n2,1n2,2],[q1q2]):j=12n1,j1,j=12n2,j1,i=12+j=12ni,j+i=12qi2}conditional-setmatrixsubscript𝑛11subscript𝑛12subscript𝑛21subscript𝑛22matrixsubscript𝑞1subscript𝑞2formulae-sequencesuperscriptsubscript𝑗12subscript𝑛1𝑗1formulae-sequencesuperscriptsubscript𝑗12subscript𝑛2𝑗1superscriptsubscript𝑖12superscriptsubscript𝑗12subscript𝑛𝑖𝑗superscriptsubscript𝑖12subscript𝑞𝑖2\displaystyle\left\{\left(\begin{bmatrix}n_{1,1}\ n_{1,2}\\ n_{2,1}\ n_{2,2}\end{bmatrix},\begin{bmatrix}q_{1}\ q_{2}\end{bmatrix}\right):% \sum_{j=1}^{2}n_{1,j}\leq 1,\sum_{j=1}^{2}n_{2,j}\leq 1,\sum_{i=1}^{2}+\sum_{j% =1}^{2}n_{i,j}+\sum_{i=1}^{2}q_{i}\leq 2\right\}{ ( [ start_ARG start_ROW start_CELL italic_n start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 1 , 2 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_n start_POSTSUBSCRIPT 2 , 1 end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL italic_q start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] ) : ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT 1 , italic_j end_POSTSUBSCRIPT ≤ 1 , ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT 2 , italic_j end_POSTSUBSCRIPT ≤ 1 , ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ 2 }
=\displaystyle== {1,2,3,,22},12322\displaystyle\left\{1,2,3,\ldots,22\right\},{ 1 , 2 , 3 , … , 22 } ,

with

1([0 00 0],[0 0]),2([0 00 0],[1 0]),3([0 00 0],[0 1]),4([0 00 0],[2 0]),formulae-sequence1matrix0000matrix00formulae-sequence2matrix0000matrix10formulae-sequence3matrix0000matrix014matrix0000matrix20\displaystyle 1\equiv\left(\begin{bmatrix}0\ 0\\ 0\ 0\end{bmatrix},\begin{bmatrix}0\ 0\end{bmatrix}\right),2\equiv\left(\begin{% bmatrix}0\ 0\\ 0\ 0\end{bmatrix},\begin{bmatrix}1\ 0\end{bmatrix}\right),3\equiv\left(\begin{% bmatrix}0\ 0\\ 0\ 0\end{bmatrix},\begin{bmatrix}0\ 1\end{bmatrix}\right),4\equiv\left(\begin{% bmatrix}0\ 0\\ 0\ 0\end{bmatrix},\begin{bmatrix}2\ 0\end{bmatrix}\right),1 ≡ ( [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] ) , 2 ≡ ( [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 1 0 end_CELL end_ROW end_ARG ] ) , 3 ≡ ( [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 1 end_CELL end_ROW end_ARG ] ) , 4 ≡ ( [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 2 0 end_CELL end_ROW end_ARG ] ) ,
5([0 00 0],[1 1]),6([0 00 0],[0 2]),7([1 00 0],[0 0]),8([1 00 0],[1 0]),formulae-sequence5matrix0000matrix11formulae-sequence6matrix0000matrix02formulae-sequence7matrix1000matrix008matrix1000matrix10\displaystyle 5\equiv\left(\begin{bmatrix}0\ 0\\ 0\ 0\end{bmatrix},\begin{bmatrix}1\ 1\end{bmatrix}\right),6\equiv\left(\begin{% bmatrix}0\ 0\\ 0\ 0\end{bmatrix},\begin{bmatrix}0\ 2\end{bmatrix}\right),7\equiv\left(\begin{% bmatrix}1\ 0\\ 0\ 0\end{bmatrix},\begin{bmatrix}0\ 0\end{bmatrix}\right),8\equiv\left(\begin{% bmatrix}1\ 0\\ 0\ 0\end{bmatrix},\begin{bmatrix}1\ 0\end{bmatrix}\right),5 ≡ ( [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 1 1 end_CELL end_ROW end_ARG ] ) , 6 ≡ ( [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 2 end_CELL end_ROW end_ARG ] ) , 7 ≡ ( [ start_ARG start_ROW start_CELL 1 0 end_CELL end_ROW start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] ) , 8 ≡ ( [ start_ARG start_ROW start_CELL 1 0 end_CELL end_ROW start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 1 0 end_CELL end_ROW end_ARG ] ) ,
9([1 00 0],[0 1]),10([0 10 0],[0 0]),11([0 10 0],[1 0]),12([0 10 0],[0 1]),formulae-sequence9matrix1000matrix01formulae-sequence10matrix0100matrix00formulae-sequence11matrix0100matrix1012matrix0100matrix01\displaystyle 9\equiv\left(\begin{bmatrix}1\ 0\\ 0\ 0\end{bmatrix},\begin{bmatrix}0\ 1\end{bmatrix}\right),10\equiv\left(\begin% {bmatrix}0\ 1\\ 0\ 0\end{bmatrix},\begin{bmatrix}0\ 0\end{bmatrix}\right),11\equiv\left(\begin% {bmatrix}0\ 1\\ 0\ 0\end{bmatrix},\begin{bmatrix}1\ 0\end{bmatrix}\right),12\equiv\left(\begin% {bmatrix}0\ 1\\ 0\ 0\end{bmatrix},\begin{bmatrix}0\ 1\end{bmatrix}\right),9 ≡ ( [ start_ARG start_ROW start_CELL 1 0 end_CELL end_ROW start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 1 end_CELL end_ROW end_ARG ] ) , 10 ≡ ( [ start_ARG start_ROW start_CELL 0 1 end_CELL end_ROW start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] ) , 11 ≡ ( [ start_ARG start_ROW start_CELL 0 1 end_CELL end_ROW start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 1 0 end_CELL end_ROW end_ARG ] ) , 12 ≡ ( [ start_ARG start_ROW start_CELL 0 1 end_CELL end_ROW start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 1 end_CELL end_ROW end_ARG ] ) ,
13([0 01 0],[0 0]),14([0 01 0],[1 0]),15([0 01 0],[0 1]),16([0 00 1],[0 0]),formulae-sequence13matrix0010matrix00formulae-sequence14matrix0010matrix10formulae-sequence15matrix0010matrix0116matrix0001matrix00\displaystyle 13\equiv\left(\begin{bmatrix}0\ 0\\ 1\ 0\end{bmatrix},\begin{bmatrix}0\ 0\end{bmatrix}\right),14\equiv\left(\begin% {bmatrix}0\ 0\\ 1\ 0\end{bmatrix},\begin{bmatrix}1\ 0\end{bmatrix}\right),15\equiv\left(\begin% {bmatrix}0\ 0\\ 1\ 0\end{bmatrix},\begin{bmatrix}0\ 1\end{bmatrix}\right),16\equiv\left(\begin% {bmatrix}0\ 0\\ 0\ 1\end{bmatrix},\begin{bmatrix}0\ 0\end{bmatrix}\right),13 ≡ ( [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW start_ROW start_CELL 1 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] ) , 14 ≡ ( [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW start_ROW start_CELL 1 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 1 0 end_CELL end_ROW end_ARG ] ) , 15 ≡ ( [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW start_ROW start_CELL 1 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 1 end_CELL end_ROW end_ARG ] ) , 16 ≡ ( [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW start_ROW start_CELL 0 1 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] ) ,
17([0 00 1],[1 0]),18([0 00 1],[0 1]),19([1 01 0],[0 0]),20([0 11 0],[0 0]),formulae-sequence17matrix0001matrix10formulae-sequence18matrix0001matrix01formulae-sequence19matrix1010matrix0020matrix0110matrix00\displaystyle 17\equiv\left(\begin{bmatrix}0\ 0\\ 0\ 1\end{bmatrix},\begin{bmatrix}1\ 0\end{bmatrix}\right),18\equiv\left(\begin% {bmatrix}0\ 0\\ 0\ 1\end{bmatrix},\begin{bmatrix}0\ 1\end{bmatrix}\right),19\equiv\left(\begin% {bmatrix}1\ 0\\ 1\ 0\end{bmatrix},\begin{bmatrix}0\ 0\end{bmatrix}\right),20\equiv\left(\begin% {bmatrix}0\ 1\\ 1\ 0\end{bmatrix},\begin{bmatrix}0\ 0\end{bmatrix}\right),17 ≡ ( [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW start_ROW start_CELL 0 1 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 1 0 end_CELL end_ROW end_ARG ] ) , 18 ≡ ( [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW start_ROW start_CELL 0 1 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 1 end_CELL end_ROW end_ARG ] ) , 19 ≡ ( [ start_ARG start_ROW start_CELL 1 0 end_CELL end_ROW start_ROW start_CELL 1 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] ) , 20 ≡ ( [ start_ARG start_ROW start_CELL 0 1 end_CELL end_ROW start_ROW start_CELL 1 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] ) ,
21([1 00 1],[0 0]),22([0 10 1],[0 0]).formulae-sequence21matrix1001matrix0022matrix0101matrix00\displaystyle 21\equiv\left(\begin{bmatrix}1\ 0\\ 0\ 1\end{bmatrix},\begin{bmatrix}0\ 0\end{bmatrix}\right),22\equiv\left(\begin% {bmatrix}0\ 1\\ 0\ 1\end{bmatrix},\begin{bmatrix}0\ 0\end{bmatrix}\right).21 ≡ ( [ start_ARG start_ROW start_CELL 1 0 end_CELL end_ROW start_ROW start_CELL 0 1 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] ) , 22 ≡ ( [ start_ARG start_ROW start_CELL 0 1 end_CELL end_ROW start_ROW start_CELL 0 1 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] ) .

Further, assume that ward i𝑖iitalic_i is the preferred ward for patient type i𝑖iitalic_i, for i=1,2𝑖12i=1,2italic_i = 1 , 2, which is reflected by the expected values of the LoS, which are lower when patient type i𝑖iitalic_i stays in ward i𝑖iitalic_i for the whole duration of their stay. To model this, we assume that probabilities pk,i=(RLOSk,i1)subscript𝑝𝑘𝑖𝑅𝐿𝑂subscript𝑆𝑘𝑖1p_{k,i}=\mathbb{P}(RLOS_{k,i}\leq 1)italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = blackboard_P ( italic_R italic_L italic_O italic_S start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ≤ 1 ) that patient type i𝑖iitalic_i that is in ward k𝑘kitalic_k today, leaves the hospital the next day, are given by,

𝐩=[pk,i]𝒦×𝐩subscriptdelimited-[]subscript𝑝𝑘𝑖𝒦\displaystyle{\bf p}=[p_{k,i}]_{\mathcal{K}\times\mathcal{I}}bold_p = [ italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT =\displaystyle== [1/51/41/101/3],delimited-[]151411013\displaystyle\left[\begin{array}[]{cc}1/5&1/4\\ 1/10&1/3\end{array}\right],[ start_ARRAY start_ROW start_CELL 1 / 5 end_CELL start_CELL 1 / 4 end_CELL end_ROW start_ROW start_CELL 1 / 10 end_CELL start_CELL 1 / 3 end_CELL end_ROW end_ARRAY ] , (38)

and so the expected LoS for patient type i=1𝑖1i=1italic_i = 1 is E(LoSk,i)=5𝐸𝐿𝑜subscript𝑆𝑘𝑖5E(LoS_{k,i})=5italic_E ( italic_L italic_o italic_S start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) = 5 days if they stay in ward k=1𝑘1k=1italic_k = 1, or E(LoSk,i)=10𝐸𝐿𝑜subscript𝑆𝑘𝑖10E(LoS_{k,i})=10italic_E ( italic_L italic_o italic_S start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) = 10 if they stay in ward k=2𝑘2k=2italic_k = 2, and some number between these two if the patient is transferred between the wards during their stay at the hospital. For patient type i=2𝑖2i=2italic_i = 2 we have E(LoSk,i)=3𝐸𝐿𝑜subscript𝑆𝑘𝑖3E(LoS_{k,i})=3italic_E ( italic_L italic_o italic_S start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) = 3 days if they stay in ward k=2𝑘2k=2italic_k = 2, or E(LoSk,i)=4𝐸𝐿𝑜subscript𝑆𝑘𝑖4E(LoS_{k,i})=4italic_E ( italic_L italic_o italic_S start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) = 4 if they stay in ward k=1𝑘1k=1italic_k = 1, and some number between these two if the patient is transferred between the wards during their stay at the hospital.

We consider two decisions,

  • a=1𝑎1a=1italic_a = 1, assign arrived patients to their best available wards without transferring the patients between the wards, and

  • a=2𝑎2a=2italic_a = 2, assign arrived patients to their best available wards and allow transferring patients if required.

As example, given state s=15([0 01 0],[0 1]),𝑠15matrix0010matrix01s=15\equiv\left(\begin{bmatrix}0\ 0\\ 1\ 0\end{bmatrix},\begin{bmatrix}0\ 1\end{bmatrix}\right),italic_s = 15 ≡ ( [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW start_ROW start_CELL 1 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 1 end_CELL end_ROW end_ARG ] ) , decision a=1𝑎1a=1italic_a = 1 with transform it into post decision state s(a)=([0 11 0]),superscript𝑠𝑎matrix0110s^{(a)}=\left(\begin{bmatrix}0\ 1\\ 1\ 0\end{bmatrix}\right),italic_s start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT = ( [ start_ARG start_ROW start_CELL 0 1 end_CELL end_ROW start_ROW start_CELL 1 0 end_CELL end_ROW end_ARG ] ) , and then the process will move to state s=10([0 10 0],[0 0])𝑠10matrix0100matrix00s=10\equiv\left(\begin{bmatrix}0\ 1\\ 0\ 0\end{bmatrix},\begin{bmatrix}0\ 0\end{bmatrix}\right)italic_s = 10 ≡ ( [ start_ARG start_ROW start_CELL 0 1 end_CELL end_ROW start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] ) if patient type 1111 departs from ward 2222, or to state s=13([0 01 0],[0 0])𝑠13matrix0010matrix00s=13\equiv\left(\begin{bmatrix}0\ 0\\ 1\ 0\end{bmatrix},\begin{bmatrix}0\ 0\end{bmatrix}\right)italic_s = 13 ≡ ( [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW start_ROW start_CELL 1 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] ) if patient type 2222 departs from ward 1111. Alternatively, decision a=2𝑎2a=2italic_a = 2 will transform s=15𝑠15s=15italic_s = 15 into post-decision state s(a)=([1 00 1]),superscript𝑠𝑎matrix1001s^{(a)}=\left(\begin{bmatrix}1\ 0\\ 0\ 1\end{bmatrix}\right),italic_s start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT = ( [ start_ARG start_ROW start_CELL 1 0 end_CELL end_ROW start_ROW start_CELL 0 1 end_CELL end_ROW end_ARG ] ) , and then the process will move to state s=16([0 00 1],[0 0]),𝑠16matrix0001matrix00s=16\equiv\left(\begin{bmatrix}0\ 0\\ 0\ 1\end{bmatrix},\begin{bmatrix}0\ 0\end{bmatrix}\right),italic_s = 16 ≡ ( [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW start_ROW start_CELL 0 1 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] ) , if patient type 1111 departs from ward 1111, or to state s=7([1 00 0],[0 0]),𝑠7matrix1000matrix00s=7\equiv\left(\begin{bmatrix}1\ 0\\ 0\ 0\end{bmatrix},\begin{bmatrix}0\ 0\end{bmatrix}\right),italic_s = 7 ≡ ( [ start_ARG start_ROW start_CELL 1 0 end_CELL end_ROW start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] , [ start_ARG start_ROW start_CELL 0 0 end_CELL end_ROW end_ARG ] ) , if patient type 2222 departs from ward 2222. No arrivals to waiting area are permitted if they cannot be assigned, that is, when the system is already full.

We assume the following cost parameters:

  • Assignment cost to ward k𝑘kitalic_k per type-i𝑖iitalic_i patient: ck,i(σ)=c(σ)=1subscriptsuperscript𝑐𝜎𝑘𝑖superscript𝑐𝜎1c^{(\sigma)}_{k,i}=c^{(\sigma)}=1italic_c start_POSTSUPERSCRIPT ( italic_σ ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = italic_c start_POSTSUPERSCRIPT ( italic_σ ) end_POSTSUPERSCRIPT = 1 for all k,i𝑘𝑖k,iitalic_k , italic_i.

  • Transfer cost from ward k𝑘kitalic_k to ward \ellroman_ℓ per type-i𝑖iitalic_i patient: ck,,i(t)=c(t)=1.1subscriptsuperscript𝑐𝑡𝑘𝑖superscript𝑐𝑡1.1c^{(t)}_{k,\ell,i}=c^{(t)}=1.1italic_c start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , roman_ℓ , italic_i end_POSTSUBSCRIPT = italic_c start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = 1.1 for all k,,i𝑘𝑖k,\ell,iitalic_k , roman_ℓ , italic_i.

  • Penalty cost for being in a nonprimary ward k𝑘kitalic_k per type-i𝑖iitalic_i patient: ck,i(p)=c(p)=0.2subscriptsuperscript𝑐𝑝𝑘𝑖superscript𝑐𝑝0.2c^{(p)}_{k,i}=c^{(p)}=0.2italic_c start_POSTSUPERSCRIPT ( italic_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = italic_c start_POSTSUPERSCRIPT ( italic_p ) end_POSTSUPERSCRIPT = 0.2 for all k,i𝑘𝑖k,iitalic_k , italic_i.

We apply policy iteration summarised in Algorithm 1 in Appendix A to find the exact solution. To do so, we first write explicit expressions for the probability matrices 𝐏(a)superscript𝐏𝑎{\bf P}^{(a)}bold_P start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT and cost vectors 𝐂(a)superscript𝐂𝑎{\bf C}^{(a)}bold_C start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT for a=1,2𝑎12a=1,2italic_a = 1 , 2. The details of this derivation are given in Appendix B.

From policy iteration we identify the optimal policy in our states of interest to be a=2𝑎2a=2italic_a = 2, ‘assign the waiting patients to their most preferred ward by transferring patients currently in the ward’. We determine that this optimal policy invokes a long run average cost per stage of E=0.4098superscript𝐸0.4098E^{*}=0.4098italic_E start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = 0.4098 units.

We now present the results of the approximate policy iteration algorithm, summarised in Algorithms 2 and 3. For simulations, we assumed the starting value of each weight 𝜽0=𝟏×104subscript𝜽01superscript104\mbox{\boldmath$\theta$}_{0}=\boldsymbol{1}\times 10^{-4}bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = bold_1 × 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT.

Refer to caption Refer to caption
Refer to caption Refer to caption
Figure 2: Values of θn(f)subscriptsuperscript𝜃𝑓𝑛\theta^{(f)}_{n}italic_θ start_POSTSUPERSCRIPT ( italic_f ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT in iteration n𝑛nitalic_n of Algorithm 2 in in Example 1. Simulations were run for M=103,104,105𝑀superscript103superscript104superscript105M=10^{3},10^{4},10^{5}italic_M = 10 start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , 10 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT , 10 start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT and 106superscript10610^{6}10 start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT states, respectively.
Refer to caption Refer to caption
Refer to caption Refer to caption
Figure 3: Values of Ensubscript𝐸𝑛E_{n}italic_E start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT (blue) compared to the optimal value E=0.4098superscript𝐸0.4098E^{*}=0.4098italic_E start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = 0.4098 (red) in iteration n𝑛nitalic_n of Algorithm 2 in Example 1. Simulations were run for M=103,104,105𝑀superscript103superscript104superscript105M=10^{3},10^{4},10^{5}italic_M = 10 start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , 10 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT , 10 start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT and 106superscript10610^{6}10 start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT states, respectively.

We find that the estimated value of E𝐸Eitalic_E converges to the optimal value E=0.4098superscript𝐸0.4098E^{*}=0.4098italic_E start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = 0.4098 for large values of M𝑀Mitalic_M, after a small number of iterations. As expected, the values of θ𝜃\thetaitalic_θ and E𝐸Eitalic_E converge more tightly as M𝑀Mitalic_M, the number of states simulated, increases, as shown in Figure 3. Second, we observe that in this example, each value of 𝜽𝜽\thetabold_italic_θ converges to a distinct value, as shown in Figure 2. This is because the parameters 𝒑𝒑\boldsymbol{p}bold_italic_p and 𝝀𝝀\boldsymbol{\lambda}bold_italic_λ are asymmetric, so we do not expect patients of different types i𝑖iitalic_i to be weighted equally.

4.2 Realistically-sized example

We consider a realistically-sized example of our Model in Section 2.1. The structure of the model here is influenced by the practical considerations within real-world hospitals, in which patients are allocated to their primary (preferred) wards based on their types (medical needs), and when these wards are full, suitable policies are applied to decide which nonprimary ward a patient should be allocated to instead. Furthermore, we assume that patients arriving to a hospital when the system is full (all wards are full), are redirected to another facility. Therefore, the number of daily arrivals is unrestricted, however the number of arrivals admitted to the hospital is bounded by the system capacity. We fitted the parameters of the arrival and length of stay distributions to data, and focused on the application of our model to the analysis of how decision making affects the number of patients in nonprimary wards.

Example 2

Similary to Dai and Shi [10], assume five types of patients, i=1,,5𝑖15i=1,\ldots,5italic_i = 1 , … , 5 (I=5𝐼5I=5italic_I = 5): Orthopedic (Orth), Cardio (Card), Surgery (Surg), General Medicine (GenMed), and Other Medicine (OthMed); and corresponding wards, k=i=1,,5formulae-sequence𝑘𝑖15k=i=1,\ldots,5italic_k = italic_i = 1 , … , 5 (K=5𝐾5K=5italic_K = 5), most suitable to these patient types, respectively.

To estimate the key parameters of our model, we applied Matlab and performed the analysis of five-year patient flow data obtained from an Australian tertiary referral hospital, see Table 1.

i𝑖iitalic_i 1111 (Ortho) 2222 (Card) 3333 (Surg) 4444 (GenMed) 5555 (OthMed)
λisubscript𝜆𝑖\lambda_{i}italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT 2.02522.02522.02522.0252 3.35653.35653.35653.3565 10.015910.015910.015910.0159 11.744211.744211.744211.7442 38.785338.785338.785338.7853
𝔼(LoSi)𝔼𝐿𝑜subscript𝑆𝑖\mathbb{E}(LoS_{i})blackboard_E ( italic_L italic_o italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) 5.14735.14735.14735.1473 4.14144.14144.14144.1414 3.93733.93733.93733.9373 4.52094.52094.52094.5209 2.85052.85052.85052.8505
pk,isubscript𝑝𝑘𝑖p_{k,i}italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT, k=i𝑘𝑖k=iitalic_k = italic_i 0.19430.19430.19430.1943 0.24150.24150.24150.2415 0.25400.25400.25400.2540 0.22120.22120.22120.2212 0.35080.35080.35080.3508
mksubscript𝑚𝑘m_{k}italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, k=i𝑘𝑖k=iitalic_k = italic_i 12121212 15151515 38383838 50505050 99999999
Table 1: Model parameters in Example 2 (λisubscript𝜆𝑖\lambda_{i}italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and 𝔼(LoSi)𝔼𝐿𝑜subscript𝑆𝑖\mathbb{E}(LoS_{i})blackboard_E ( italic_L italic_o italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) were fitted to data, and the remaining parameters were estimated): λisubscript𝜆𝑖\lambda_{i}italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (daily arrival rate of patients type i𝑖iitalic_i), pk,isubscript𝑝𝑘𝑖p_{k,i}italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT (probability of patient of type i𝑖iitalic_i departing from ward k𝑘kitalic_k within a day), and mksubscript𝑚𝑘m_{k}italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT (capacity of ward k𝑘kitalic_k). The total capacity of the system is m=kmk=214𝑚subscript𝑘subscript𝑚𝑘214m=\sum_{k}m_{k}=214italic_m = ∑ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 214. We apply pi,i=1/𝔼(LoSi)subscript𝑝𝑖𝑖1𝔼𝐿𝑜subscript𝑆𝑖p_{i,i}=1/\mathbb{E}(LoS_{i})italic_p start_POSTSUBSCRIPT italic_i , italic_i end_POSTSUBSCRIPT = 1 / blackboard_E ( italic_L italic_o italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) (probability of patient of type i𝑖iitalic_i departing within a day, assuming they are in the most suitable ward). When ki𝑘𝑖k\not=iitalic_k ≠ italic_i, we let pk,i=β×pi,isubscript𝑝𝑘𝑖𝛽subscript𝑝𝑖𝑖p_{k,i}=\beta\times p_{i,i}italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = italic_β × italic_p start_POSTSUBSCRIPT italic_i , italic_i end_POSTSUBSCRIPT, for some β<1𝛽1\beta<1italic_β < 1 (so that the LoS of patients that are in less suitable wards, is increased). Here, we set β=1/1.25𝛽11.25\beta=1/1.25italic_β = 1 / 1.25 (and so the LoS of patients in less suitable wards increases by 25%percent2525\%25 % on the average). Further, to estimate mksubscript𝑚𝑘m_{k}italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, for each k=i𝑘𝑖k=iitalic_k = italic_i, we applied M/M/N/N queueing model with arrival rate λisubscript𝜆𝑖\lambda_{i}italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and service rate μi=1/𝔼(LoSi)subscript𝜇𝑖1𝔼𝐿𝑜subscript𝑆𝑖\mu_{i}=1/\mathbb{E}(LoS_{i})italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 / blackboard_E ( italic_L italic_o italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) to find N=mk𝑁subscript𝑚𝑘N=m_{k}italic_N = italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT such that the proportion of time the queue is full is πN<0.15subscript𝜋𝑁0.15\pi_{N}<0.15italic_π start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT < 0.15.

In order to focus on the impact of decision making on the number of patients observed in nonprimary wards, we applied the following cost parameters:

  • Transfer cost from ward k𝑘kitalic_k to ward \ellroman_ℓ per type-i𝑖iitalic_i patient: ck,,i(t)=c(t)=1.1subscriptsuperscript𝑐𝑡𝑘𝑖superscript𝑐𝑡1.1c^{(t)}_{k,\ell,i}=c^{(t)}=1.1italic_c start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , roman_ℓ , italic_i end_POSTSUBSCRIPT = italic_c start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT = 1.1 for all k,k,iformulae-sequence𝑘𝑘𝑖k,\ell\not=k,iitalic_k , roman_ℓ ≠ italic_k , italic_i and assume large value at k=𝑘k=\ellitalic_k = roman_ℓ (to avoid such decision).

  • Penalty cost for being in a nonprimary ward k𝑘kitalic_k per type-i𝑖iitalic_i patient: ck,i(p)=c(p)=0.2subscriptsuperscript𝑐𝑝𝑘𝑖superscript𝑐𝑝0.2c^{(p)}_{k,i}=c^{(p)}=0.2italic_c start_POSTSUPERSCRIPT ( italic_p ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = italic_c start_POSTSUPERSCRIPT ( italic_p ) end_POSTSUPERSCRIPT = 0.2 for all k,i𝑘𝑖k,iitalic_k , italic_i.

(Here, we assumed the assignment costs c(σ)=1superscript𝑐𝜎1c^{(\sigma)}=1italic_c start_POSTSUPERSCRIPT ( italic_σ ) end_POSTSUPERSCRIPT = 1 for each admitted or redirected patient, but have not included these costs in the optimisation, since all patients need to be admitted to some facility, and our focus is on the number of patients in nonprimary wards.)

We assume that the capacity of the waiting area is unlimited with mk+1=subscript𝑚𝑘1m_{k+1}=\inftyitalic_m start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT = ∞ and so the total number of arrivals i=1Iqimk+1superscriptsubscript𝑖1𝐼subscript𝑞𝑖subscript𝑚𝑘1\sum_{i=1}^{I}q_{i}\leq m_{k+1}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_m start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT is unrestricted. However, the arrivals are only accepted if there is available capacity in the system according to i=1Ink,imksuperscriptsubscript𝑖1𝐼subscriptsuperscript𝑛𝑘𝑖subscript𝑚𝑘\sum_{i=1}^{I}n^{\prime}_{k,i}\leq m_{k}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ≤ italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, where ([nk,i]𝒦×)subscriptdelimited-[]subscriptsuperscript𝑛𝑘𝑖𝒦\left([n^{\prime}_{k,i}]_{\mathcal{K}\times\mathcal{I}}\right)( [ italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT ) is post-decision state. Patients that may not be admitted, are redirected to another facility. The state space of the system is then

𝒮𝒮\displaystyle\mathcal{S}caligraphic_S =\displaystyle== {([nk,i]𝒦×,[qi]1×):nk,i0,qi0,i=1Ink,imk},conditional-setsubscriptdelimited-[]subscript𝑛𝑘𝑖𝒦subscriptdelimited-[]subscript𝑞𝑖1formulae-sequencesubscript𝑛𝑘𝑖0formulae-sequencesubscript𝑞𝑖0superscriptsubscript𝑖1𝐼subscript𝑛𝑘𝑖subscript𝑚𝑘\displaystyle\{\left([n_{k,i}]_{\mathcal{K}\times\mathcal{I}},[q_{i}]_{1\times% \mathcal{I}}\right):n_{k,i}\geq 0,q_{i}\geq 0,\sum_{i=1}^{I}n_{k,i}\leq m_{k}\},{ ( [ italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT , [ italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 × caligraphic_I end_POSTSUBSCRIPT ) : italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ≥ 0 , italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ 0 , ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ≤ italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } ,

while the set of post-decision states is

𝒮superscript𝒮\displaystyle\mathcal{S}^{\prime}caligraphic_S start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT =\displaystyle== {([nk,i]𝒦×):i=1Ink,imk},conditional-setsubscriptdelimited-[]subscriptsuperscript𝑛𝑘𝑖𝒦superscriptsubscript𝑖1𝐼subscriptsuperscript𝑛𝑘𝑖subscript𝑚𝑘\displaystyle\left\{\left([n^{\prime}_{k,i}]_{\mathcal{K}\times\mathcal{I}}% \right):\sum_{i=1}^{I}n^{\prime}_{k,i}\leq m_{k}\right\}\ ,{ ( [ italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT ) : ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ≤ italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } ,

with |𝒮|=k=1Kg=0mk(g+I1I1)=k=1K(mk+II)superscript𝒮superscriptsubscriptproduct𝑘1𝐾superscriptsubscript𝑔0subscript𝑚𝑘binomial𝑔𝐼1𝐼1superscriptsubscriptproduct𝑘1𝐾binomialsubscript𝑚𝑘𝐼𝐼|\mathcal{S}^{\prime}|=\prod_{k=1}^{K}\sum_{g=0}^{m_{k}}\binom{g+I-1}{I-1}=% \prod_{k=1}^{K}\binom{m_{k}+I}{I}| caligraphic_S start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | = ∏ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_g = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( FRACOP start_ARG italic_g + italic_I - 1 end_ARG start_ARG italic_I - 1 end_ARG ) = ∏ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ( FRACOP start_ARG italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_I end_ARG start_ARG italic_I end_ARG ) by Heydar et al. [15]. Here, |𝒮|=2.9544e+28superscript𝒮2.9544𝑒28|\mathcal{S}^{\prime}|=2.9544e+28| caligraphic_S start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | = 2.9544 italic_e + 28.

Next, we assume the following order of assignment, represented as a matrix 𝐎=[Oi,k]𝐎delimited-[]subscript𝑂𝑖𝑘{\bf O}=[O_{i,k}]bold_O = [ italic_O start_POSTSUBSCRIPT italic_i , italic_k end_POSTSUBSCRIPT ], where Oi,k=1subscript𝑂𝑖𝑘1O_{i,k}=1italic_O start_POSTSUBSCRIPT italic_i , italic_k end_POSTSUBSCRIPT = 1 means that ward k𝑘kitalic_k is the first choice for type i𝑖iitalic_i, Oi,k=2subscript𝑂𝑖𝑘2O_{i,k}=2italic_O start_POSTSUBSCRIPT italic_i , italic_k end_POSTSUBSCRIPT = 2 is the second choice for type i𝑖iitalic_i, and so on, with

𝐎𝐎\displaystyle{\bf O}bold_O =\displaystyle== [k=Orthok=Cardk=Surgk=GenMedk=OthMedi=Ortho15234i=Card31245i=Surg23154i=GenMed35412i=OthMed35421].delimited-[]missing-subexpression𝑘Ortho𝑘Card𝑘Surg𝑘GenMed𝑘OthMedmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression𝑖Ortho15234missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression𝑖Card31245missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression𝑖Surg23154missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression𝑖GenMed35412missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpression𝑖OthMed35421\displaystyle\left[\begin{array}[]{l||ccccc}&k=\mbox{Ortho}&k=\mbox{Card}&k=% \mbox{Surg}&k=\mbox{GenMed}&k=\mbox{OthMed}\\ \hline\cr\hline\cr i=\mbox{Ortho}&1&5&2&3&4\\ \hline\cr i=\mbox{Card}&3&1&2&4&5\\ \hline\cr i=\mbox{Surg}&2&3&1&5&4\\ \hline\cr i=\mbox{GenMed}&3&5&4&1&2\\ \hline\cr i=\mbox{OthMed}&3&5&4&2&1\end{array}\right].[ start_ARRAY start_ROW start_CELL end_CELL start_CELL italic_k = Ortho end_CELL start_CELL italic_k = Card end_CELL start_CELL italic_k = Surg end_CELL start_CELL italic_k = GenMed end_CELL start_CELL italic_k = OthMed end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_i = Ortho end_CELL start_CELL 1 end_CELL start_CELL 5 end_CELL start_CELL 2 end_CELL start_CELL 3 end_CELL start_CELL 4 end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_i = Card end_CELL start_CELL 3 end_CELL start_CELL 1 end_CELL start_CELL 2 end_CELL start_CELL 4 end_CELL start_CELL 5 end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_i = Surg end_CELL start_CELL 2 end_CELL start_CELL 3 end_CELL start_CELL 1 end_CELL start_CELL 5 end_CELL start_CELL 4 end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_i = GenMed end_CELL start_CELL 3 end_CELL start_CELL 5 end_CELL start_CELL 4 end_CELL start_CELL 1 end_CELL start_CELL 2 end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL italic_i = OthMed end_CELL start_CELL 3 end_CELL start_CELL 5 end_CELL start_CELL 4 end_CELL start_CELL 2 end_CELL start_CELL 1 end_CELL end_ROW end_ARRAY ] . (45)

As example, for the assignment of ’Ortho’ patients, the considered order of wards is ‘Ortho’, then ‘Surg’, then ‘GenMed’, then ‘OthMed’, and ‘Card’ is the last choice (used when there are no other choices). This order is similar to the order of assignment in Dai and Shi [10, Figure 1].

Suppose that patient type represents the order of assignment, and so we assign patients in the order of their type, assigning type 1111 patients first, then type 2222 patients, and so on. We consider two decisions,

  • a=1𝑎1a=1italic_a = 1, assign arrived patients to their best available wards, in the order of their types, without transferring patients between the wards, and

  • a=2,3𝑎23a=2,3italic_a = 2 , 3, assign arrived patients to their best available wards, in the order of their types, and allow transferring patients if required, with the constraint of no more than y=4,10𝑦410y=4,10italic_y = 4 , 10 transfers in total, respectively.

As example, if

  • q1=7subscript𝑞17q_{1}=7italic_q start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 7, q2=3subscript𝑞23q_{2}=3italic_q start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 3, qi=0subscript𝑞𝑖0q_{i}=0italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 0 for i=3,4,5𝑖345i=3,4,5italic_i = 3 , 4 , 5 (there are 7777 type-1111 patients and 3333 type-2222 patients in the queue);

  • and n1,1=11subscript𝑛1111n_{1,1}=11italic_n start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT = 11, n1,2=4subscript𝑛124n_{1,2}=4italic_n start_POSTSUBSCRIPT 1 , 2 end_POSTSUBSCRIPT = 4, n1,i=0subscript𝑛1𝑖0n_{1,i}=0italic_n start_POSTSUBSCRIPT 1 , italic_i end_POSTSUBSCRIPT = 0 for i=3,4,5𝑖345i=3,4,5italic_i = 3 , 4 , 5 (there are 22222222 patients in ward 1111, so it is full),

then applying a=2𝑎2a=2italic_a = 2 means that

  • 4444 type-1111 patients will be assigned to ward 1111, while 3333 type-1111 patients will be assigned to their best available ward other than ward 1111,

  • while 4444 type-2222 patients will need to be transferred from ward 1111 to their best available ward (to make space for type-1111 patients),

  • and finally, 3333 type-2222 patients will be assigned from the queue to their best available ward.

More precisely, the assignments of patients corresponding to decisions a=1𝑎1a=1italic_a = 1 and a=2,3𝑎23a=2,3italic_a = 2 , 3, are described in Algorithms 4-5 in Appendix A, respectively.

Next, we present the simulation output in Figures 4-6, focusing on the impact of these decisions on the number of patients observed that are in nonprimary wards (and ignoring the costs of these decisions for now). Simulation output suggests that under the model parameters assumed in Table 1, a policy in which we apply decision a=3𝑎3a=3italic_a = 3 all the time, results in a better performance of the system in the sense that it reduces the number of patients in nonprimary wards when compared with applying decision a=1𝑎1a=1italic_a = 1 or a=2𝑎2a=2italic_a = 2 all the time. The performance of the system under decision a=3𝑎3a=3italic_a = 3 is only slightly better than when compared with decision a=2𝑎2a=2italic_a = 2 however.

In order to evalute the performance of the system under decisions a=1,2,3𝑎123a=1,2,3italic_a = 1 , 2 , 3, we require to also consider the costs. Since the size of the state space is very large, we apply the Approximate Policy Iteration summarised in Algorithms 23 in Appendix A.

We consider the vector of features ϕ(s)bold-italic-ϕ𝑠\mbox{\boldmath$\phi$}(s)bold_italic_ϕ ( italic_s ) of state s=([nk,i]𝒦×,[qi]1×)𝑠subscriptdelimited-[]subscript𝑛𝑘𝑖𝒦subscriptdelimited-[]subscript𝑞𝑖1s=\left([n_{k,i}]_{\mathcal{K}\times\mathcal{I}},[q_{i}]_{1\times\mathcal{I}}\right)italic_s = ( [ italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT , [ italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 × caligraphic_I end_POSTSUBSCRIPT ) defined as

ϕ(s)bold-italic-ϕ𝑠\displaystyle\mbox{\boldmath$\phi$}(s)bold_italic_ϕ ( italic_s ) =\displaystyle== [n1,1,i1n1,i,,n5,5,i5n5,i,q1,,q5],delimited-[]subscript𝑛11subscript𝑖1subscript𝑛1𝑖subscript𝑛55subscript𝑖5subscript𝑛5𝑖subscript𝑞1subscript𝑞5missing-subexpressionmissing-subexpression\displaystyle\left[\begin{array}[]{cccccccccc}n_{1,1},&\sum_{i\not=1}n_{1,i},&% \ldots,&n_{5,5},&\sum_{i\not=5}n_{5,i},&q_{1},&\ldots,&q_{5}\end{array}\right],[ start_ARRAY start_ROW start_CELL italic_n start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT , end_CELL start_CELL ∑ start_POSTSUBSCRIPT italic_i ≠ 1 end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 1 , italic_i end_POSTSUBSCRIPT , end_CELL start_CELL … , end_CELL start_CELL italic_n start_POSTSUBSCRIPT 5 , 5 end_POSTSUBSCRIPT , end_CELL start_CELL ∑ start_POSTSUBSCRIPT italic_i ≠ 5 end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT 5 , italic_i end_POSTSUBSCRIPT , end_CELL start_CELL italic_q start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , end_CELL start_CELL … , end_CELL start_CELL italic_q start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW end_ARRAY ] , (47)

that is, for each ward k=1,,5𝑘15k=1,\ldots,5italic_k = 1 , … , 5, nk,k=subscript𝑛𝑘𝑘absentn_{k,k}=italic_n start_POSTSUBSCRIPT italic_k , italic_k end_POSTSUBSCRIPT = the total number of type-i𝑖iitalic_i patients in the primary ward i=k𝑖𝑘i=kitalic_i = italic_k; iknk,isubscript𝑖𝑘subscript𝑛𝑘𝑖\sum_{i\not=k}n_{k,i}∑ start_POSTSUBSCRIPT italic_i ≠ italic_k end_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT, the total number of other-type patients in that ward; and for each patient type i=1,,5𝑖15i=1,\ldots,5italic_i = 1 , … , 5, qi=subscript𝑞𝑖absentq_{i}=italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = the number of type-i𝑖iitalic_i patients in the queue waiting to be assigned.

Then, by (24), 𝔼(Q)=λ𝔼𝑄𝜆\mathbb{E}(Q)=\lambdablackboard_E ( italic_Q ) = italic_λ, and given state s𝑠sitalic_s and decision a𝑎aitalic_a, we have

s𝒮(s|(s,a))ϕ(s)𝜽subscriptsuperscript𝑠𝒮conditionalsuperscript𝑠𝑠𝑎bold-italic-ϕsuperscript𝑠𝜽\displaystyle\sum_{s^{{}^{\prime}}\in\mathcal{S}}\mathbb{P}\left(s^{\prime}\ |% \ (s,a)\right)\mbox{\boldmath$\phi$}(s^{{}^{\prime}})\mbox{\boldmath$\theta$}∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ∈ caligraphic_S end_POSTSUBSCRIPT blackboard_P ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | ( italic_s , italic_a ) ) bold_italic_ϕ ( italic_s start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) bold_italic_θ =\displaystyle== 𝔼(ϕ(s)𝜽|(s,a))=k=1Ki=1Ink,i(a)(1pk,i)θk,i+i=1Iλiθi,𝔼conditionalbold-italic-ϕsuperscript𝑠𝜽𝑠𝑎superscriptsubscript𝑘1𝐾superscriptsubscript𝑖1𝐼subscriptsuperscript𝑛𝑎𝑘𝑖1subscript𝑝𝑘𝑖subscript𝜃𝑘𝑖superscriptsubscript𝑖1𝐼subscript𝜆𝑖subscript𝜃𝑖\displaystyle\mathbb{E}(\mbox{\boldmath$\phi$}(s^{{}^{\prime}})\mbox{\boldmath% $\theta$}\ |\ (s,a))=\sum_{k=1}^{K}\sum_{i=1}^{I}n^{(a)}_{k,i}(1-p_{k,i})% \theta_{k,i}+\sum_{i=1}^{I}\lambda_{i}\theta_{i},blackboard_E ( bold_italic_ϕ ( italic_s start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) bold_italic_θ | ( italic_s , italic_a ) ) = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ( 1 - italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) italic_θ start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ,

which is the expression we use in our computatations in Algorithms 23. The output presented in Figures 7-8 demonstrates that the algorithm converged as expected.

To analyse the impact of the algorithm on the performance of the system, we have then also simulated the evolution of a system in which the decision a(s)𝑎𝑠a(s)italic_a ( italic_s ) given current state s𝑠sitalic_s is chosen according to

a(s)=argmina𝒜(s){C(s,a)+k=1Ki=1Ink,i(a)(1pk,i)θk,i+i=1Iλiθi,\displaystyle a(s)=\operatorname*{arg\,min}_{a\in\mathcal{A}(s)}\{C(s,a)+\sum_% {k=1}^{K}\sum_{i=1}^{I}n^{(a)}_{k,i}(1-p_{k,i})\theta_{k,i}+\sum_{i=1}^{I}% \lambda_{i}\theta_{i},italic_a ( italic_s ) = start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT italic_a ∈ caligraphic_A ( italic_s ) end_POSTSUBSCRIPT { italic_C ( italic_s , italic_a ) + ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ( 1 - italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ) italic_θ start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , (48)

using the parameters 𝜽𝜽\thetabold_italic_θ obtained from the output of the algorithm, and compared the output with the performance of the system under decisions a=1,2,3𝑎123a=1,2,3italic_a = 1 , 2 , 3. Each simulation M=1,,1000𝑀11000M=1,\ldots,1000italic_M = 1 , … , 1000, was executed over a 5-year period. The output is presented in Figures 9-10 and Table 2 below.

Policy Mean cost Mean nonprimary Mean redirected
Apply a=1 always 6.8177 34.0887 6.6719
Apply a=2 always 5.7643 20.1370 5.9918
Apply a=3 always 5.6983 18.9862 5.9408
Near optimal solution 5.6449 20.2566 5.9720
Table 2: The output from simulations in Example 2.

We observe (Figure 9, Table 2) that under the near-optimal policy obtained by the application of this approach, the costs are minimised (costs are the lowest under near-optimal policy, and are the highest under a=1𝑎1a=1italic_a = 1), with mean costs: 6.81776.81776.81776.8177 (a=1𝑎1a=1italic_a = 1), 5.76435.76435.76435.7643 (a=2𝑎2a=2italic_a = 2), 5.69835.69835.69835.6983 (a=3𝑎3a=3italic_a = 3), and 5.64495.6449{\bf 5.6449}bold_5.6449 (our near optimal solution). Simultaneously, the performance of the system in the sense of reducing the number of patients observed in the nonprimary wards and the number of redirected patients per day, is also improved. The number of patients in nonprimary wards is significantly lower than under a=1𝑎1a=1italic_a = 1, and similar to a=2𝑎2a=2italic_a = 2 but with lower costs than under a=2,3𝑎23a=2,3italic_a = 2 , 3.

Next (Figure 10), under near-optimal solution, 82.1918%percent82.191882.1918\%82.1918 % of the time no additional transfers are required. Also, under near-optimal solution, decisions a=1,2,3𝑎123a=1,2,3italic_a = 1 , 2 , 3 are chosen approximately 50%percent5050\%50 %, 40%percent4040\%40 % and 10%percent1010\%10 % of the time, respectively.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 4: Simulation of a system with limited capacity under policy that applies a=1𝑎1a=1italic_a = 1 (no transfers): Nk,i(a)(t)subscriptsuperscript𝑁𝑎𝑘𝑖𝑡N^{(a)}_{k,i}(t)italic_N start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ( italic_t ) is the number of type i𝑖iitalic_i patients in ward k𝑘kitalic_k, N^(a)(t)=kiNk,i(a)(t)superscript^𝑁𝑎𝑡subscript𝑘𝑖subscriptsuperscript𝑁𝑎𝑘𝑖𝑡\widehat{N}^{(a)}(t)=\sum_{k\not=i}N^{(a)}_{k,i}(t)over^ start_ARG italic_N end_ARG start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ( italic_t ) = ∑ start_POSTSUBSCRIPT italic_k ≠ italic_i end_POSTSUBSCRIPT italic_N start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ( italic_t ) is the number of patients in nonprimary wards on day t𝑡titalic_t, post decision. We apply (45) and the parameters from Table 1.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 5: Simulation of a system with limited capacity under policy that applies a=2𝑎2a=2italic_a = 2 (with no more than y=4𝑦4y=4italic_y = 4 transfers): We note the reduction of the total number of patients in nonprimary wards in comparison to the output for a=1𝑎1a=1italic_a = 1 in Figure 4.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 6: Simulation of a system with limited capacity under policy that applies a=3𝑎3a=3italic_a = 3 (with no more than y=10𝑦10y=10italic_y = 10 transfers): We note the further reduction of the total number of patients in nonprimary wards in comparison to the output for a=2𝑎2a=2italic_a = 2 in Figure 5.
Refer to caption Refer to caption
Refer to caption Refer to caption
Figure 7: Values of θn(f)subscriptsuperscript𝜃𝑓𝑛\theta^{(f)}_{n}italic_θ start_POSTSUPERSCRIPT ( italic_f ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT in iteration n𝑛nitalic_n of Algorithm 2 in Example 2, for M=103,104,105𝑀superscript103superscript104superscript105M=10^{3},10^{4},10^{5}italic_M = 10 start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , 10 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT , 10 start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT and 106superscript10610^{6}10 start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT.
Refer to caption Refer to caption
Refer to caption Refer to caption
Figure 8: Values of Ensubscript𝐸𝑛E_{n}italic_E start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT in iteration n𝑛nitalic_n of Algorithm 2 in Example 2, for M=103,104,105𝑀superscript103superscript104superscript105M=10^{3},10^{4},10^{5}italic_M = 10 start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , 10 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT , 10 start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT and 106superscript10610^{6}10 start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT.

a=1𝑎1a=1italic_a = 1:
Refer to caption  Refer to caption
a=3𝑎3a=3italic_a = 3:
Refer to caption  Refer to caption
near-optimal solution obtained using (48):
Refer to caption  Refer to caption

Figure 9: Simulations under decisions a=1𝑎1a=1italic_a = 1 (top row), a=3𝑎3a=3italic_a = 3 (middle row), and near-optimal solution.
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 10: Percentage of decisions a=1,2,3𝑎123a=1,2,3italic_a = 1 , 2 , 3 made under near-optimal solution (top left); and the number of additional transfers needed (if they were permitted without constraints).

5 Conclusion

We proposed a Markov Decision Process for a Patient Assignment Scheduling problem where, at the start of each time period (e.g. day, 8-hour block etc), patients are allocated to suitable wards, following relevant hospital policy. Due to the large size of the state space required to record the information about the hospital system, we developed methodology based on Approximate Dynamic Programming, for the computation of near-optimal solutions. We demonstrated the application potential of our model through examples with parameters fitted to data from a tertiary referral hospital in Australia.

Our model can be applied to a range of scenarios that are of interest in practical contexts in hospitals. We will analyse scheduling problems in data-driven examples, and consider additional costs, such as waiting costs or redirecting costs, and decisions such as transfers of inpatients within the hospitals that are not triggerred by arrivals, and take into account the duration of stay at the time of decision making. We will also consider scenarios with time varying parameters, in particular with seasonal components. These analyses will be reported in our forthcoming and future work.

6 Statements and declarations


Data

Data used in the paper was obtained following ethical approval from the Tasmanian Health and Medical Human Research Ethics Committee (HREC No 23633) and site-specific approval from the Research Governance Office of the Tasmanian Health Service.


Authorship contribution statement

This research has contributed to the Honours thesis by Krasnicki [19]. The following are the contributions of the authors, Małgorzata M. O’Reilly (MMO), Sebastian Krasnicki (SK), James Montgomery (JM), Mojtaba Heydar (MH), Richard Turner (RT), Pieter Van Dam (PVD), Peter Maree (PM):

  • Conceptualisation, Mathematical background: MMO, SK, MH, and JM;

  • Conceptualisation, Clinical background: RT, PVD, PM;

  • Sections 2-4, Appendix A-B, Methodology development: MMO, SK, and JM;

  • Sections 2-4, Derivation of the probability formulae: MMO;

  • Section 4, Example 1, Coding and analysis: SK, MMO, and MH;

  • Section 4, Example 2, Coding and analysis: MMO and JM.

Declaration of competing interests

The authors have no competing interests to declare that are relevant to the content of this article.

References

  • [1] Numerical inversion of probability generating functions. Operations Research Letters, 12(4):245–251, 1992.
  • [2] Australia’s hospitals at a glance 2018-19. Cat. no. HSE 247. Canberra: AIHW. Australian Institute of Health and Welfare, 2020.
  • [3] A. K. Abera, M. M. O’Reilly, M. Fackrell, B. R. Holland, and M. Heydar. On the decision support model for the patient admission scheduling problem with random arrivals and departures: A solution approach. Stochastic Models, 36(2):312–336, 2020.
  • [4] A. Bagust, M. Place, and J. W. Posnett. Dynamics of bed use in accommodating emergency admissions: Stochastic simulation model. British medical journal, 318(7203):155–158, 1999.
  • [5] C. A. Bain, P. G. Taylor, G. McDonnell, and A. Georgiou. Myths of ideal hospital occupancy. Medical Journal of Australia, 192(1):42–43, 2010.
  • [6] B. Bilgin, P. Demeester, M. Misir, W. Vancroonenburg, and G. Vanden Berghe. One hyperheuristic approach to two timetabling problems in health care. Journal of Heuristics, 18(3):401–434, 2012.
  • [7] K. Butler and M. A. Stephens. The distribution of a sum of independent binomial random variables. Methodology and Computing in Applied Probability, 19(2):557–571, 2017.
  • [8] E. J. Carter, S. M. Pouch, and E. L. Larson. The relationship between emergency department crowding and patient outcomes: A systematic review. Journal of Nursing Scholarship, 46(2):106–115, 2014.
  • [9] E. Cummings, L. Ellis, A. Georgiou, K. E., C. Showell, and P. Turner. An evidence-based review and training resource on smooth patient flow. Ministry of Health, New South Wales Government, Australia, pages 1–34, 2012.
  • [10] J. G. Dai and P. Shi. Inpatient overflow: An approximate dynamic programming approach. Manufacturing and Service Operations Management, 21(4):894–911, 2019.
  • [11] P. Demeester, W. Souffriau, P. De Causmaecker, and G. Vanden Berghe. A hybrid tabu search algorithm for automatically assigning patients to beds. Artificial Intelligence in Medicine, 48(1):61–70, 2010.
  • [12] Y. Gocgun. Simulation-based approximate policy iteration for dynamic patient scheduling for radiation therapy. Health Care Management Science, 21(3):317–325, 2018.
  • [13] R. Grant, A. K. J. Smith, L. Newett, M. Nash, R. Turner, and L. Owen. Tasmanian healthcare professionals’ & students’ capacity for LGBTI + inclusive care: A qualitative inquiry. Health and Social Care in the Community, 29(4):957–966, 2021.
  • [14] W. B. Hall, L. E. Willis, S. Medvedev, and S. S. Carson. The implications of long-term acute care hospital transfer practices for measures of in-hospital mortality and length of stay. American Journal of Respiratory and Critical Care Medicine, 185(1):53–57, 2012.
  • [15] M. Heydar, M. M. O‘Reilly, E. Trainer, M. Fackrell, P. G. Taylor, and A. Tirdad. A stochastic model for the patient-bed assignment problem with random arrivals and departures. Annals of Operations Research, pages 1–33, 2021.
  • [16] R. Howard. Dynamic Programming and Markov Processes. The MIT Press, Cambridge, 1960.
  • [17] E. Howell, E. Bessman, S. Kravet, K. Kolodner, R. Marshall, and S. Wright. Active bed management by hospitalists and emergency department throughput. Annals of Internal Medicine, 149(11):804–810, 2008.
  • [18] P. J. H. Hulshof, M. R. K. Mes, R. J. Boucherie, and E. W. Hans. Patient admission planning using approximate dynamic programming. Flexible Services and Manufacturing Journal, 28(1-2):30–61, 2016.
  • [19] S. Krasnicki. Modelling of optimal decision making in healthcare systems, Honours Thesis, The University of Tasmania, 2021.
  • [20] C. Morley, M. Unwin, G. M. Peterson, J. Stankovich, and L. Kinsman. Emergency department crowding: A systematic review of causes, consequences and solutions. PLoS ONE, 13(8), 2018.
  • [21] W. Powell. What you should know about approximate dynamic programming. Naval Research Logistics, 56(3):239–249, 2009.
  • [22] W. Powell. Approximate Dynamic Programming: Solving the Curses of Dimensionality: Second Edition. Wiley, 2011.
  • [23] W. B. Powell. What you should know about approximate dynamic programming. Naval Research Logistics, 56(3):239–249, 2009.
  • [24] W. B. Powell. Perspectives of approximate dynamic programming. Annals of Operations Research, 241(1-2):319–356, 2016.
  • [25] M. L. Puterman. Markov Decision Processes: Discrete Stochastic Dynamic Programming. John Wiley, New York, 1994.
  • [26] W. Vancroonenburg, P. De Causmaecker, and G. Vanden Berghe. A study of decision support models for online patient-to-room assignment planning. Annals of Operations Research, 239:253–271, 2016.

Appendix A Appendix: Algorithms

Algorithm 1 Policy Iteration (Howard [16])
1:Initialise an aribitrary policy ^𝝅=(a^𝝅(s))s𝒮^absent𝝅subscriptsuperscript𝑎^absent𝝅𝑠𝑠𝒮\widehat{}\mbox{\boldmath$\pi$}=(a^{\widehat{}\mbox{\boldmath$\pi$}}(s))_{s\in% \mathcal{S}}over^ start_ARG end_ARG bold_italic_π = ( italic_a start_POSTSUPERSCRIPT over^ start_ARG end_ARG bold_italic_π end_POSTSUPERSCRIPT ( italic_s ) ) start_POSTSUBSCRIPT italic_s ∈ caligraphic_S end_POSTSUBSCRIPT.
2:Let v(m)=0𝑣𝑚0v(m)=0italic_v ( italic_m ) = 0, where 𝒮={1,,m}𝒮1𝑚\mathcal{S}=\{1,\ldots,m\}caligraphic_S = { 1 , … , italic_m }.
3:Solve E+v(s)s𝒮(s|(s,a^𝝅(s)))v(s)=C(s,a^𝝅(s))𝐸𝑣𝑠subscriptsuperscript𝑠𝒮conditionalsuperscript𝑠𝑠superscript𝑎^absent𝝅𝑠𝑣superscript𝑠𝐶𝑠superscript𝑎^absent𝝅𝑠E+v(s)-\sum_{s^{\prime}\in\mathcal{S}}\mathbb{P}\left(s^{\prime}\ |\ (s,a^{% \widehat{}\mbox{\boldmath$\pi$}}(s))\right)v(s^{\prime})=C(s,a^{\widehat{}% \mbox{\boldmath$\pi$}}(s))italic_E + italic_v ( italic_s ) - ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_S end_POSTSUBSCRIPT blackboard_P ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | ( italic_s , italic_a start_POSTSUPERSCRIPT over^ start_ARG end_ARG bold_italic_π end_POSTSUPERSCRIPT ( italic_s ) ) ) italic_v ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = italic_C ( italic_s , italic_a start_POSTSUPERSCRIPT over^ start_ARG end_ARG bold_italic_π end_POSTSUPERSCRIPT ( italic_s ) ) for all s=1,,m1𝑠1𝑚1s=1,\ldots,m-1italic_s = 1 , … , italic_m - 1.
4:Solve 𝝅=argmina𝒜(s){C(s,a)+s𝒮(s|(s,a))v(s)}𝝅subscript𝑎𝒜𝑠𝐶𝑠𝑎subscriptsuperscript𝑠𝒮conditionalsuperscript𝑠𝑠𝑎𝑣superscript𝑠\mbox{\boldmath$\pi$}=\arg\min_{a\in\mathcal{A}(s)}\Bigg{\{}C(s,a)+\sum_{s^{% \prime}\in\mathcal{S}}\mathbb{P}\left(s^{\prime}\ |\ (s,a)\right)v(s^{\prime})% \Bigg{\}}bold_italic_π = roman_arg roman_min start_POSTSUBSCRIPT italic_a ∈ caligraphic_A ( italic_s ) end_POSTSUBSCRIPT { italic_C ( italic_s , italic_a ) + ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_S end_POSTSUBSCRIPT blackboard_P ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | ( italic_s , italic_a ) ) italic_v ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) }.
5:If 𝝅=^𝝅𝝅^absent𝝅\mbox{\boldmath$\pi$}=\widehat{}\mbox{\boldmath$\pi$}bold_italic_π = over^ start_ARG end_ARG bold_italic_π, then go to Step 6. Else, let ^𝝅=𝝅^absent𝝅𝝅\widehat{}\mbox{\boldmath$\pi$}=\mbox{\boldmath$\pi$}over^ start_ARG end_ARG bold_italic_π = bold_italic_π and go to Step 2.
6:The optimal policy is 𝝅=𝝅superscript𝝅𝝅\mbox{\boldmath$\pi$}^{*}=\mbox{\boldmath$\pi$}bold_italic_π start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = bold_italic_π.
Algorithm 2 (Adapted from Dai and Shi [10])
1:Initialise 𝜽=𝜽0𝜽subscript𝜽0\mbox{\boldmath$\theta$}=\mbox{\boldmath$\theta$}_{0}bold_italic_θ = bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, n=0𝑛0n=0italic_n = 0. Choose large N𝑁Nitalic_N.
2:Initialise E0subscript𝐸0E_{0}italic_E start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT using Algorithm 3 for E𝐸Eitalic_E with 𝜽=𝜽0𝜽subscript𝜽0\mbox{\boldmath$\theta$}=\mbox{\boldmath$\theta$}_{0}bold_italic_θ = bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT.
3:for n=0,1,2,,N1𝑛012𝑁1n=0,1,2,\ldots,N-1italic_n = 0 , 1 , 2 , … , italic_N - 1 do
4:     Randomly initialise starting state s0subscript𝑠0s_{0}italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. Let 𝑨=𝐎𝑨𝐎\boldsymbol{A}={\bf O}bold_italic_A = bold_O, 𝒃=𝟎𝒃0\boldsymbol{b}=\boldsymbol{0}bold_italic_b = bold_0.
5:     for m=0,1,2,,M1𝑚012𝑀1m=0,1,2,\ldots,M-1italic_m = 0 , 1 , 2 , … , italic_M - 1 do
6:         Given s=sm𝑠subscript𝑠𝑚s=s_{m}italic_s = italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT, take decision a(s)=argmina𝒜(s){C(s,a)+s𝒮(s|(s,a))ϕ(s)𝜽}𝑎𝑠subscriptargmin𝑎𝒜𝑠𝐶𝑠𝑎subscriptsuperscript𝑠𝒮conditionalsuperscript𝑠𝑠𝑎bold-italic-ϕsuperscript𝑠𝜽a(s)=\operatorname*{arg\,min}_{a\in\mathcal{A}(s)}\{C(s,a)+\sum_{s^{{}^{\prime% }}\in\mathcal{S}}\mathbb{P}\left(s^{\prime}\ |\ (s,a)\right)\mbox{\boldmath$% \phi$}(s^{{}^{\prime}})\mbox{\boldmath$\theta$}\}italic_a ( italic_s ) = start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT italic_a ∈ caligraphic_A ( italic_s ) end_POSTSUBSCRIPT { italic_C ( italic_s , italic_a ) + ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ∈ caligraphic_S end_POSTSUBSCRIPT blackboard_P ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | ( italic_s , italic_a ) ) bold_italic_ϕ ( italic_s start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) bold_italic_θ }.
7:         Randomly generate state sm+1subscript𝑠𝑚1s_{m+1}italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT using model parameters and post-decision state sm(a)superscriptsubscript𝑠𝑚𝑎s_{m}^{(a)}italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT.
8:         Compute 𝑨m=1Mϕ(sm)T(ϕ(sm)ϕ(sm+1))subscript𝑨𝑚1𝑀bold-italic-ϕsuperscriptsubscript𝑠𝑚𝑇bold-italic-ϕsubscript𝑠𝑚bold-italic-ϕsubscript𝑠𝑚1\boldsymbol{A}_{m}=\frac{1}{M}\mbox{\boldmath$\phi$}(s_{m})^{T}(\mbox{% \boldmath$\phi$}(s_{m})-\mbox{\boldmath$\phi$}(s_{m+1}))bold_italic_A start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_M end_ARG bold_italic_ϕ ( italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( bold_italic_ϕ ( italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) - bold_italic_ϕ ( italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT ) ).
9:         Compute 𝒃m=1Mϕ(sm)T(C(sm,a(sm))En)subscript𝒃𝑚1𝑀bold-italic-ϕsuperscriptsubscript𝑠𝑚𝑇𝐶subscript𝑠𝑚𝑎subscript𝑠𝑚subscript𝐸𝑛\boldsymbol{b}_{m}=\frac{1}{M}\mbox{\boldmath$\phi$}(s_{m})^{T}(C(s_{m},a(s_{m% }))-E_{n})bold_italic_b start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_M end_ARG bold_italic_ϕ ( italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ( italic_C ( italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , italic_a ( italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ) - italic_E start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ).
10:         Let 𝑨=𝑨+𝑨m𝑨𝑨subscript𝑨𝑚\boldsymbol{A}=\boldsymbol{A}+\boldsymbol{A}_{m}bold_italic_A = bold_italic_A + bold_italic_A start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT.
11:         Let 𝒃=𝒃+𝒃m𝒃𝒃subscript𝒃𝑚\boldsymbol{b}=\boldsymbol{b}+\boldsymbol{b}_{m}bold_italic_b = bold_italic_b + bold_italic_b start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT.
12:     end for
13:     Solve the set of linear equations 𝑨𝜽n+1=𝒃𝑨subscript𝜽𝑛1𝒃\boldsymbol{A}\mbox{\boldmath$\theta$}_{n+1}=\boldsymbol{b}bold_italic_A bold_italic_θ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT = bold_italic_b for 𝜽n+1subscript𝜽𝑛1\mbox{\boldmath$\theta$}_{n+1}bold_italic_θ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT.
14:     Update 𝜽=𝜽n+1𝜽subscript𝜽𝑛1\mbox{\boldmath$\theta$}=\mbox{\boldmath$\theta$}_{n+1}bold_italic_θ = bold_italic_θ start_POSTSUBSCRIPT italic_n + 1 end_POSTSUBSCRIPT.
15:     Let n=n+1𝑛𝑛1n=n+1italic_n = italic_n + 1.
16:     Compute Ensubscript𝐸𝑛E_{n}italic_E start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT using Algorithm 3 for E𝐸Eitalic_E with 𝜽=𝜽n𝜽subscript𝜽𝑛\mbox{\boldmath$\theta$}=\mbox{\boldmath$\theta$}_{n}bold_italic_θ = bold_italic_θ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT.
17:end for
18:E𝝅ENsuperscript𝐸superscript𝝅subscript𝐸𝑁E^{\mbox{\boldmath$\pi$}^{*}}\approx E_{N}italic_E start_POSTSUPERSCRIPT bold_italic_π start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ≈ italic_E start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT.
Algorithm 3 (Adapted from Dai and Shi [10])
1:Input 𝜽𝜽\thetabold_italic_θ. Let E=0𝐸0E=0italic_E = 0.
2:Randomly initialise starting state s0subscript𝑠0s_{0}italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT.
3:for m=0,1,2,,M1𝑚012𝑀1m=0,1,2,\ldots,M-1italic_m = 0 , 1 , 2 , … , italic_M - 1 do
4:     Given state s=sm𝑠subscript𝑠𝑚s=s_{m}italic_s = italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT, take decision a(s)=argmina𝒜(s){C(s,a)+s𝒮(s|(s,a))ϕ(s)𝜽}𝑎𝑠subscriptargmin𝑎𝒜𝑠𝐶𝑠𝑎subscriptsuperscript𝑠𝒮conditionalsuperscript𝑠𝑠𝑎bold-italic-ϕsuperscript𝑠𝜽a(s)=\operatorname*{arg\,min}_{a\in\mathcal{A}(s)}\{C(s,a)+\sum_{s^{{}^{\prime% }}\in\mathcal{S}}\mathbb{P}\left(s^{\prime}\ |\ (s,a)\right)\mbox{\boldmath$% \phi$}(s^{{}^{\prime}})\mbox{\boldmath$\theta$}\}italic_a ( italic_s ) = start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT italic_a ∈ caligraphic_A ( italic_s ) end_POSTSUBSCRIPT { italic_C ( italic_s , italic_a ) + ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ∈ caligraphic_S end_POSTSUBSCRIPT blackboard_P ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | ( italic_s , italic_a ) ) bold_italic_ϕ ( italic_s start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ) bold_italic_θ }.
5:     Randomly generate state sm+1subscript𝑠𝑚1s_{m+1}italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT using model parameters and post-decision state sma(s)superscriptsubscript𝑠𝑚𝑎𝑠s_{m}^{a(s)}italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a ( italic_s ) end_POSTSUPERSCRIPT.
6:     Let E=E+C(s,a(s))𝐸𝐸𝐶𝑠𝑎𝑠E=E+C(s,a(s))italic_E = italic_E + italic_C ( italic_s , italic_a ( italic_s ) ).
7:end for
8:Compute E=E/M𝐸𝐸𝑀E=E/Mitalic_E = italic_E / italic_M.
Algorithm 4 Decision a=1𝑎1a=1italic_a = 1. We make a decision based on priorities so as to guarantee the best possible bed for higher priority patients, without allowing transfers. Recursively, we find the highest priority of patients to allocate yet and transfer them to their best available wards.
1:Input s=([nk,i]𝒦×,[qi]1×)𝑠subscriptdelimited-[]subscript𝑛𝑘𝑖𝒦subscriptdelimited-[]subscript𝑞𝑖1s=\left([n_{k,i}]_{\mathcal{K}\times\mathcal{I}},[q_{i}]_{1\times\mathcal{I}}\right)italic_s = ( [ italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT , [ italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 × caligraphic_I end_POSTSUBSCRIPT ).
2:Initialise s(a)=[nk,i(a)]𝒦×=[nk,i]𝒦×superscript𝑠𝑎subscriptdelimited-[]superscriptsubscript𝑛𝑘𝑖𝑎𝒦subscriptdelimited-[]subscript𝑛𝑘𝑖𝒦s^{(a)}=[n_{k,i}^{(a)}]_{\mathcal{K}\times\mathcal{I}}=[n_{k,i}]_{\mathcal{K}% \times\mathcal{I}}italic_s start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT = [ italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT = [ italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT and [xk,i]𝒦×=𝟎subscriptdelimited-[]subscript𝑥𝑘𝑖𝒦0[x_{k,i}]_{\mathcal{K}\times\mathcal{I}}={\bf 0}[ italic_x start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT = bold_0.
3:while i=1Iqi>0superscriptsubscript𝑖1𝐼subscript𝑞𝑖0\sum_{i=1}^{I}q_{i}>0∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0 do
4:     Let i=min{j:qj>0}𝑖:𝑗subscript𝑞𝑗0i=\min\{j:q_{j}>0\}italic_i = roman_min { italic_j : italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT > 0 }, current highest priority to allocate.
5:     while qi>0subscript𝑞𝑖0q_{i}>0italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0 do (allocate type-i𝑖iitalic_i patients)
6:         Find =min{:j=1Inw(,i),j(a)<mw(,i)}superscript:superscriptsubscript𝑗1𝐼superscriptsubscript𝑛𝑤𝑖𝑗𝑎subscript𝑚𝑤𝑖\ell^{*}=\min\{\ell:\sum_{j=1}^{I}n_{w(\ell,i),j}^{(a)}<m_{w(\ell,i)}\}roman_ℓ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_min { roman_ℓ : ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_w ( roman_ℓ , italic_i ) , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT < italic_m start_POSTSUBSCRIPT italic_w ( roman_ℓ , italic_i ) end_POSTSUBSCRIPT }.
7:         Let k=w(,i)superscript𝑘𝑤superscript𝑖k^{*}=w(\ell^{*},i)italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = italic_w ( roman_ℓ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i ), best ward for type-i𝑖iitalic_i patient with an available bed.
8:         Let xk,i=xk,i+1subscript𝑥superscript𝑘𝑖subscript𝑥superscript𝑘𝑖1x_{k^{*},i}=x_{k^{*},i}+1italic_x start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT = italic_x start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT + 1, and nk,i(a)=nk,i(a)+1superscriptsubscript𝑛superscript𝑘𝑖𝑎superscriptsubscript𝑛superscript𝑘𝑖𝑎1n_{k^{*},i}^{(a)}=n_{k^{*},i}^{(a)}+1italic_n start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT = italic_n start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT + 1.
9:         Let qi=qi1subscript𝑞𝑖subscript𝑞𝑖1q_{i}=q_{i}-1italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - 1.
10:     end while
11:end while
Algorithm 5 Decisions a=2,3𝑎23a=2,3italic_a = 2 , 3. We make a decision based on priorities so as to guarantee the best possible bed for higher priority patients, and allow up to y𝑦yitalic_y transfers. Recursively, we find the highest priority of patients to allocate/transfer yet (Line 6). Next, we transfer type-i𝑖iitalic_i patients (Lines 7-13) before allocating newly arrived type-i𝑖iitalic_i patients (Lines 14-30). When transferring patients, we find a type-i𝑖iitalic_i patient that is in the worst ward and transfer them to a ward with the lowest transfer cost.
1:Input s=([nk,i]𝒦×,[qi]1×)𝑠subscriptdelimited-[]subscript𝑛𝑘𝑖𝒦subscriptdelimited-[]subscript𝑞𝑖1s=\left([n_{k,i}]_{\mathcal{K}\times\mathcal{I}},[q_{i}]_{1\times\mathcal{I}}\right)italic_s = ( [ italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT , [ italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 × caligraphic_I end_POSTSUBSCRIPT ) and the maximum number of transfers y𝑦yitalic_y.
2:Initialise y=ysuperscript𝑦𝑦y^{*}=yitalic_y start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = italic_y, current number of available transfers.
3:Initialise [yk,i]𝒦×=𝟎subscriptdelimited-[]subscript𝑦𝑘𝑖𝒦0[y_{k,i}]_{\mathcal{K}\times\mathcal{I}}={\bf 0}[ italic_y start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT = bold_0, patients to be transferred.
4:Initialise s(a)=[nk,i(a)]𝒦×=[nk,i]𝒦×superscript𝑠𝑎subscriptdelimited-[]superscriptsubscript𝑛𝑘𝑖𝑎𝒦subscriptdelimited-[]subscript𝑛𝑘𝑖𝒦s^{(a)}=[n_{k,i}^{(a)}]_{\mathcal{K}\times\mathcal{I}}=[n_{k,i}]_{\mathcal{K}% \times\mathcal{I}}italic_s start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT = [ italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT = [ italic_n start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT caligraphic_K × caligraphic_I end_POSTSUBSCRIPT and (xk,i,yk,,i)i;k,=1,K=𝟎subscriptsubscript𝑥𝑘𝑖subscript𝑦𝑘𝑖formulae-sequence𝑖𝑘1𝐾0(x_{k,i},y_{k,\ell,i})_{i\in\mathcal{I};k,\ell=1,\ldots K}={\bf 0}( italic_x start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_k , roman_ℓ , italic_i end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_i ∈ caligraphic_I ; italic_k , roman_ℓ = 1 , … italic_K end_POSTSUBSCRIPT = bold_0.
5:while i=1Iqi>0superscriptsubscript𝑖1𝐼subscript𝑞𝑖0\sum_{i=1}^{I}q_{i}>0∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0 or i=1Ik=1Kyk,i>0superscriptsubscript𝑖1𝐼superscriptsubscript𝑘1𝐾subscript𝑦𝑘𝑖0\sum_{i=1}^{I}\sum_{k=1}^{K}y_{k,i}>0∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT > 0 do
6:     Let i=min{j:qj>0 or k=1Kyk,j>0}𝑖:𝑗subscript𝑞𝑗0 or superscriptsubscript𝑘1𝐾subscript𝑦𝑘𝑗0i=\min\{j:q_{j}>0\mbox{ or }\sum_{k=1}^{K}y_{k,j}>0\}italic_i = roman_min { italic_j : italic_q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT > 0 or ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_k , italic_j end_POSTSUBSCRIPT > 0 }, current highest priority to allocate/transfer.
7:     while k=1Kyk,i>0superscriptsubscript𝑘1𝐾subscript𝑦𝑘𝑖0\sum_{k=1}^{K}y_{k,i}>0∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_y start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT > 0 do (transfer type-i𝑖iitalic_i patient from ward ksuperscript𝑘k^{*}italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT to superscript\ell^{*}roman_ℓ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT)
8:         h=max{:yw(,i),i>0}superscript:subscript𝑦𝑤𝑖𝑖0h^{*}=\max\{\ell:y_{w(\ell,i),i}>0\}italic_h start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_max { roman_ℓ : italic_y start_POSTSUBSCRIPT italic_w ( roman_ℓ , italic_i ) , italic_i end_POSTSUBSCRIPT > 0 };
9:         k=w(h,i)superscript𝑘𝑤superscript𝑖k^{*}=w(h^{*},i)italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = italic_w ( italic_h start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i ) is the worst ward with a type-i𝑖iitalic_i patient;
10:         =argmin=1,,K{ck,,i(t):j=1In,j(a)<m}superscriptsubscript1𝐾:subscriptsuperscript𝑐𝑡superscript𝑘𝑖superscriptsubscript𝑗1𝐼superscriptsubscript𝑛𝑗𝑎subscript𝑚\ell^{*}=\arg\min_{\ell=1,\ldots,K}\{c^{(t)}_{k^{*},\ell,i}:\sum_{j=1}^{I}n_{% \ell,j}^{(a)}<m_{\ell}\}roman_ℓ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_arg roman_min start_POSTSUBSCRIPT roman_ℓ = 1 , … , italic_K end_POSTSUBSCRIPT { italic_c start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , roman_ℓ , italic_i end_POSTSUBSCRIPT : ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT roman_ℓ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT < italic_m start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT } has the lowest transfer cost.
11:         Let n,i(a)=n,i(a)+1superscriptsubscript𝑛superscript𝑖𝑎superscriptsubscript𝑛superscript𝑖𝑎1n_{\ell^{*},i}^{(a)}=n_{\ell^{*},i}^{(a)}+1italic_n start_POSTSUBSCRIPT roman_ℓ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT = italic_n start_POSTSUBSCRIPT roman_ℓ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT + 1.
12:         Let yk,,i=yk,,i+1subscript𝑦superscript𝑘superscript𝑖subscript𝑦superscript𝑘superscript𝑖1y_{k^{*},\ell^{*},i}=y_{k^{*},\ell^{*},i}+1italic_y start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , roman_ℓ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT = italic_y start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , roman_ℓ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT + 1, yk,i=yk,i1subscript𝑦superscript𝑘𝑖subscript𝑦superscript𝑘𝑖1y_{k^{*},i}=y_{k^{*},i}-1italic_y start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT = italic_y start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT - 1.
13:     end while
14:     while qi>0subscript𝑞𝑖0q_{i}>0italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0 do (allocate type-i𝑖iitalic_i patients)
15:         if y=0superscript𝑦0y^{*}=0italic_y start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = 0 (no more transfers allowed) then (allocate type-i𝑖iitalic_i patient)
16:              =min{:j=1Inw(,i),j(a)<mw(,i)}superscript:superscriptsubscript𝑗1𝐼superscriptsubscript𝑛𝑤𝑖𝑗𝑎subscript𝑚𝑤𝑖\ell^{*}=\min\{\ell:\sum_{j=1}^{I}n_{w(\ell,i),j}^{(a)}<m_{w(\ell,i)}\}roman_ℓ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_min { roman_ℓ : ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_w ( roman_ℓ , italic_i ) , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT < italic_m start_POSTSUBSCRIPT italic_w ( roman_ℓ , italic_i ) end_POSTSUBSCRIPT };
17:              k=w(,i)superscript𝑘𝑤superscript𝑖k^{*}=w(\ell^{*},i)italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = italic_w ( roman_ℓ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i ) is the best ward for patient type i𝑖iitalic_i with an available bed.
18:              Let xk,i=xk,i+1subscript𝑥superscript𝑘𝑖subscript𝑥superscript𝑘𝑖1x_{k^{*},i}=x_{k^{*},i}+1italic_x start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT = italic_x start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT + 1, and nk,i(a)=nk,i(a)+1superscriptsubscript𝑛superscript𝑘𝑖𝑎superscriptsubscript𝑛superscript𝑘𝑖𝑎1n_{k^{*},i}^{(a)}=n_{k^{*},i}^{(a)}+1italic_n start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT = italic_n start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT + 1.
19:         end if
20:         if y>0superscript𝑦0y^{*}>0italic_y start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT > 0 (transfers allowed) then (allocate type-i𝑖iitalic_i patient to ward ksuperscript𝑘k^{*}italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT)
21:              =min{:j=1inw(,i),j(a)<mw(,i)}superscript:superscriptsubscript𝑗1𝑖superscriptsubscript𝑛𝑤𝑖𝑗𝑎subscript𝑚𝑤𝑖\ell^{*}=\min\{\ell:\sum_{j=1}^{i}n_{w(\ell,i),j}^{(a)}<m_{w(\ell,i)}\}roman_ℓ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_min { roman_ℓ : ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_w ( roman_ℓ , italic_i ) , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT < italic_m start_POSTSUBSCRIPT italic_w ( roman_ℓ , italic_i ) end_POSTSUBSCRIPT };
22:              k=w(,i)superscript𝑘𝑤superscript𝑖k^{*}=w(\ell^{*},i)italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = italic_w ( roman_ℓ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i ) is the best ward for patient type i𝑖iitalic_i with a potential bed.
23:              Let xk,i=xk,i+1subscript𝑥superscript𝑘𝑖subscript𝑥superscript𝑘𝑖1x_{k^{*},i}=x_{k^{*},i}+1italic_x start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT = italic_x start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT + 1, and nk,i(a)=nk,i(a)+1superscriptsubscript𝑛superscript𝑘𝑖𝑎superscriptsubscript𝑛superscript𝑘𝑖𝑎1n_{k^{*},i}^{(a)}=n_{k^{*},i}^{(a)}+1italic_n start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT = italic_n start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT + 1.
24:              if j=1Ink,j(a)=mk+1superscriptsubscript𝑗1𝐼superscriptsubscript𝑛superscript𝑘𝑗𝑎subscript𝑚superscript𝑘1\sum_{j=1}^{I}n_{k^{*},j}^{(a)}=m_{k^{*}}+1∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT = italic_m start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT + 1 then (must transfer patient isuperscript𝑖i^{*}italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT out of ward ksuperscript𝑘k^{*}italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT)
25:                  i=max{j:j=i+1,,I;nk,j(a)0}superscript𝑖:𝑗formulae-sequence𝑗𝑖1𝐼superscriptsubscript𝑛superscript𝑘𝑗𝑎0i^{*}=\max\{j:j=i+1,\ldots,I;n_{k^{*},j}^{(a)}\not=0\}italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_max { italic_j : italic_j = italic_i + 1 , … , italic_I ; italic_n start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT ≠ 0 } is the lowest priority patient in ward ksuperscript𝑘k^{*}italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT.
26:                  Let yk,i=yk,i+1subscript𝑦superscript𝑘superscript𝑖subscript𝑦superscript𝑘superscript𝑖1y_{k^{*},i^{*}}=y_{k^{*},i^{*}}+1italic_y start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = italic_y start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , italic_i start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT + 1, y=y1superscript𝑦superscript𝑦1y^{*}=y^{*}-1italic_y start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = italic_y start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - 1.
27:              end if
28:         end if
29:         Let qi=qi1subscript𝑞𝑖subscript𝑞𝑖1q_{i}=q_{i}-1italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_q start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - 1.
30:     end while
31:end while

Appendix B Appendix: Example 1 – matrices 𝐏(a)superscript𝐏𝑎{\bf P}^{(a)}bold_P start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT and vectors 𝐂(a)superscript𝐂𝑎{\bf C}^{(a)}bold_C start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT

We note that the set of all possible post-decision states s(a)superscript𝑠𝑎s^{(a)}italic_s start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT is {1,7,10,13,16,19,20,21,22}1710131619202122\{1,7,10,13,16,19,20,21,22\}{ 1 , 7 , 10 , 13 , 16 , 19 , 20 , 21 , 22 } for both a=1,2𝑎12a=1,2italic_a = 1 , 2, with

s(1)={1 for s=17 for s=2,710 for s=1013 for s=1316 for s=3,1619 for s=4,8,14,1920 for s=11,15,2021 for s=5,9,17,2122 for s=6,12,18,22,s(2)={s(1) for s11,1521 for s=11,15,\displaystyle s^{(1)}=\left\{\begin{array}[]{lll}1&\mbox{ for }&s=1\\ 7&\mbox{ for }&s=2,7\\ 10&\mbox{ for }&s=10\\ 13&\mbox{ for }&s=13\\ 16&\mbox{ for }&s=3,16\\ 19&\mbox{ for }&s=4,8,14,19\\ 20&\mbox{ for }&s=11,15,20\\ 21&\mbox{ for }&s=5,9,17,21\\ 22&\mbox{ for }&s=6,12,18,22\end{array}\right.\quad,\quad s^{(2)}=\left\{% \begin{array}[]{lll}s^{(1)}&\mbox{ for }&s\not=11,15\\ 21&\mbox{ for }&s=11,15\end{array}\right.\quad,italic_s start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT = { start_ARRAY start_ROW start_CELL 1 end_CELL start_CELL for end_CELL start_CELL italic_s = 1 end_CELL end_ROW start_ROW start_CELL 7 end_CELL start_CELL for end_CELL start_CELL italic_s = 2 , 7 end_CELL end_ROW start_ROW start_CELL 10 end_CELL start_CELL for end_CELL start_CELL italic_s = 10 end_CELL end_ROW start_ROW start_CELL 13 end_CELL start_CELL for end_CELL start_CELL italic_s = 13 end_CELL end_ROW start_ROW start_CELL 16 end_CELL start_CELL for end_CELL start_CELL italic_s = 3 , 16 end_CELL end_ROW start_ROW start_CELL 19 end_CELL start_CELL for end_CELL start_CELL italic_s = 4 , 8 , 14 , 19 end_CELL end_ROW start_ROW start_CELL 20 end_CELL start_CELL for end_CELL start_CELL italic_s = 11 , 15 , 20 end_CELL end_ROW start_ROW start_CELL 21 end_CELL start_CELL for end_CELL start_CELL italic_s = 5 , 9 , 17 , 21 end_CELL end_ROW start_ROW start_CELL 22 end_CELL start_CELL for end_CELL start_CELL italic_s = 6 , 12 , 18 , 22 end_CELL end_ROW end_ARRAY , italic_s start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT = { start_ARRAY start_ROW start_CELL italic_s start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT end_CELL start_CELL for end_CELL start_CELL italic_s ≠ 11 , 15 end_CELL end_ROW start_ROW start_CELL 21 end_CELL start_CELL for end_CELL start_CELL italic_s = 11 , 15 end_CELL end_ROW end_ARRAY , (60)

where the difference between s(1)superscript𝑠1s^{(1)}italic_s start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT and s(2)superscript𝑠2s^{(2)}italic_s start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT is in places corresponding to s=11,15𝑠1115s=11,15italic_s = 11 , 15, which are the only two states where choosing between a=1𝑎1a=1italic_a = 1 or a=2𝑎2a=2italic_a = 2 can make a difference.

The probability matrices 𝐏(a)superscript𝐏𝑎{\bf P}^{(a)}bold_P start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT of the discrete-time Markov chains corresponding to decisions a=1,2𝑎12a=1,2italic_a = 1 , 2 are evaluated as follows. First, denote qk,i=1pk,isubscript𝑞𝑘𝑖1subscript𝑝𝑘𝑖q_{k,i}=1-p_{k,i}italic_q start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT = 1 - italic_p start_POSTSUBSCRIPT italic_k , italic_i end_POSTSUBSCRIPT, the probability that type-i𝑖iitalic_i patient does not depart from ward k𝑘kitalic_k. Next, denote p(k)=λkk!eλ𝑝𝑘superscript𝜆𝑘𝑘superscript𝑒𝜆p(k)=\frac{\lambda^{k}}{k!}e^{-\lambda}italic_p ( italic_k ) = divide start_ARG italic_λ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT, k=0,1,2,𝑘012k=0,1,2,\ldotsitalic_k = 0 , 1 , 2 , …, the probability of observing k𝑘kitalic_k arrivals in a Poisson process with rate λ𝜆\lambdaitalic_λ. Let

p(n~)𝑝~𝑛\displaystyle p(\widetilde{n})italic_p ( over~ start_ARG italic_n end_ARG ) =\displaystyle== 1k=0n1p(k)=1k=0n1λkk!eλ,n=1,2,,formulae-sequence1superscriptsubscript𝑘0𝑛1𝑝𝑘1superscriptsubscript𝑘0𝑛1superscript𝜆𝑘𝑘superscript𝑒𝜆𝑛12\displaystyle 1-\sum_{k=0}^{n-1}p(k)=1-\sum_{k=0}^{n-1}\frac{\lambda^{k}}{k!}e% ^{-\lambda},\quad n=1,2,\ldots,1 - ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT italic_p ( italic_k ) = 1 - ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT divide start_ARG italic_λ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT , italic_n = 1 , 2 , … , (61)

with p(0~)=1𝑝~01p(\widetilde{0})=1italic_p ( over~ start_ARG 0 end_ARG ) = 1, interpreted as the probability that at least n𝑛nitalic_n arrivals were observed; and

p(1,,I|n)𝑝subscript1conditionalsubscript𝐼𝑛\displaystyle p(\ell_{1},\ldots,\ell_{I}\ |\ n)italic_p ( roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , roman_ℓ start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT | italic_n ) =\displaystyle== n!i=1Ii!×i=1I(λiλ)i=n!λn×i=1Iλiii!n=1,2,,formulae-sequence𝑛superscriptsubscriptproduct𝑖1𝐼subscript𝑖superscriptsubscriptproduct𝑖1𝐼superscriptsubscript𝜆𝑖𝜆subscript𝑖𝑛superscript𝜆𝑛superscriptsubscriptproduct𝑖1𝐼superscriptsubscript𝜆𝑖subscript𝑖subscript𝑖𝑛12\displaystyle\frac{n!}{\prod_{i=1}^{I}\ell_{i}!}\times\prod_{i=1}^{I}\left(% \frac{\lambda_{i}}{\lambda}\right)^{\ell_{i}}=\frac{n!}{\lambda^{n}}\times% \prod_{i=1}^{I}\frac{\lambda_{i}^{\ell_{i}}}{\ell_{i}!}\quad n=1,2,\ldots,divide start_ARG italic_n ! end_ARG start_ARG ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ! end_ARG × ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT ( divide start_ARG italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_λ end_ARG ) start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT = divide start_ARG italic_n ! end_ARG start_ARG italic_λ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_ARG × ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT divide start_ARG italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG roman_ℓ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ! end_ARG italic_n = 1 , 2 , … , (62)

with 1++I=nsubscript1subscript𝐼𝑛\ell_{1}+\ldots+\ell_{I}=nroman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + … + roman_ℓ start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT = italic_n, interpreted as the conditional probability that, given n𝑛nitalic_n patients have been admitted, there were 1,,Isubscript1subscript𝐼\ell_{1},\ldots,\ell_{I}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , roman_ℓ start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT patients of type 1,,I1𝐼1,\ldots,I1 , … , italic_I, respectively.

Further, let p(1,,I)(n)subscriptsuperscript𝑝𝑛subscript1subscript𝐼p^{(n)}_{(\ell_{1},\ldots,\ell_{I})}italic_p start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , roman_ℓ start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT be the probability that n𝑛nitalic_n patients have been admitted and there were 1,,Isubscript1subscript𝐼\ell_{1},\ldots,\ell_{I}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , roman_ℓ start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT patients of type 1,,I1𝐼1,\ldots,I1 , … , italic_I, respectively, with 1++I=nsubscript1subscript𝐼𝑛\ell_{1}+\ldots+\ell_{I}=nroman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + … + roman_ℓ start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT = italic_n. Then, if the number of admissions is restricted by 1++INsubscript1subscript𝐼𝑁\ell_{1}+\ldots+\ell_{I}\leq Nroman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + … + roman_ℓ start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ≤ italic_N, where N𝑁Nitalic_N is the number of available beds, we have,

p(1,,I)(n)subscriptsuperscript𝑝𝑛subscript1subscript𝐼\displaystyle p^{(n)}_{(\ell_{1},\ldots,\ell_{I})}italic_p start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , roman_ℓ start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT =\displaystyle== p(1,,I|n)p(n)=eλ×i=1Iλiii! for n<N,formulae-sequence𝑝subscript1conditionalsubscript𝐼𝑛𝑝𝑛superscript𝑒𝜆superscriptsubscriptproduct𝑖1𝐼superscriptsubscript𝜆𝑖subscript𝑖subscript𝑖 for 𝑛𝑁\displaystyle p(\ell_{1},\ldots,\ell_{I}\ |\ n)p(n)=e^{-\lambda}\times\prod_{i% =1}^{I}\frac{\lambda_{i}^{\ell_{i}}}{\ell_{i}!}\quad\mbox{ for }n<N,italic_p ( roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , roman_ℓ start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT | italic_n ) italic_p ( italic_n ) = italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT × ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT divide start_ARG italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG roman_ℓ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ! end_ARG for italic_n < italic_N , (63)
p(1,,I)(N~)subscriptsuperscript𝑝~𝑁subscript1subscript𝐼\displaystyle p^{(\widetilde{N})}_{(\ell_{1},\ldots,\ell_{I})}italic_p start_POSTSUPERSCRIPT ( over~ start_ARG italic_N end_ARG ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , roman_ℓ start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT =\displaystyle== p(1,,I|N)p(N~)=(1k=0N1λkk!eλ)×N!λN×i=1Iλiii!,𝑝subscript1conditionalsubscript𝐼𝑁𝑝~𝑁1superscriptsubscript𝑘0𝑁1superscript𝜆𝑘𝑘superscript𝑒𝜆𝑁superscript𝜆𝑁superscriptsubscriptproduct𝑖1𝐼superscriptsubscript𝜆𝑖subscript𝑖subscript𝑖\displaystyle p(\ell_{1},\ldots,\ell_{I}\ |\ N)p(\widetilde{N})=\left(1-\sum_{% k=0}^{N-1}\frac{\lambda^{k}}{k!}e^{-\lambda}\right)\times\frac{N!}{\lambda^{N}% }\times\prod_{i=1}^{I}\frac{\lambda_{i}^{\ell_{i}}}{\ell_{i}!},italic_p ( roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , roman_ℓ start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT | italic_N ) italic_p ( over~ start_ARG italic_N end_ARG ) = ( 1 - ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N - 1 end_POSTSUPERSCRIPT divide start_ARG italic_λ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG italic_k ! end_ARG italic_e start_POSTSUPERSCRIPT - italic_λ end_POSTSUPERSCRIPT ) × divide start_ARG italic_N ! end_ARG start_ARG italic_λ start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT end_ARG × ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT divide start_ARG italic_λ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG start_ARG roman_ℓ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ! end_ARG , (64)

and p(1,,I)(n)=0subscriptsuperscript𝑝𝑛subscript1subscript𝐼0p^{(n)}_{(\ell_{1},\ldots,\ell_{I})}=0italic_p start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , roman_ℓ start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT = 0 otherwise, with p(0,0)(0~)=1subscriptsuperscript𝑝~0001p^{(\widetilde{0})}_{(0,0)}=1italic_p start_POSTSUPERSCRIPT ( over~ start_ARG 0 end_ARG ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( 0 , 0 ) end_POSTSUBSCRIPT = 1.

Also, denote,

𝐩𝐩\displaystyle{\bf p}bold_p =\displaystyle== [p(0,0)(0)p(1,0)(1)p(0,1)(1)p(2,0)(2~)p(1,1)(2~)p(0,2)(2~)],delimited-[]subscriptsuperscript𝑝000subscriptsuperscript𝑝110subscriptsuperscript𝑝101subscriptsuperscript𝑝~220subscriptsuperscript𝑝~211subscriptsuperscript𝑝~202\displaystyle\left[\begin{array}[]{cccccc}p^{(0)}_{(0,0)}&p^{(1)}_{(1,0)}&p^{(% 1)}_{(0,1)}&p^{(\widetilde{2})}_{(2,0)}&p^{(\widetilde{2})}_{(1,1)}&p^{(% \widetilde{2})}_{(0,2)}\end{array}\right],[ start_ARRAY start_ROW start_CELL italic_p start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( 0 , 0 ) end_POSTSUBSCRIPT end_CELL start_CELL italic_p start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( 1 , 0 ) end_POSTSUBSCRIPT end_CELL start_CELL italic_p start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( 0 , 1 ) end_POSTSUBSCRIPT end_CELL start_CELL italic_p start_POSTSUPERSCRIPT ( over~ start_ARG 2 end_ARG ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( 2 , 0 ) end_POSTSUBSCRIPT end_CELL start_CELL italic_p start_POSTSUPERSCRIPT ( over~ start_ARG 2 end_ARG ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( 1 , 1 ) end_POSTSUBSCRIPT end_CELL start_CELL italic_p start_POSTSUPERSCRIPT ( over~ start_ARG 2 end_ARG ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( 0 , 2 ) end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] , (66)
𝐰𝐰\displaystyle{\bf w}bold_w =\displaystyle== [p(0,0)(0)p(1,0)(1~)p(0,1)(1~)].delimited-[]subscriptsuperscript𝑝000subscriptsuperscript𝑝~110subscriptsuperscript𝑝~101\displaystyle\left[\begin{array}[]{ccc}p^{(0)}_{(0,0)}&p^{(\widetilde{1})}_{(1% ,0)}&p^{(\widetilde{1})}_{(0,1)}\end{array}\right].[ start_ARRAY start_ROW start_CELL italic_p start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( 0 , 0 ) end_POSTSUBSCRIPT end_CELL start_CELL italic_p start_POSTSUPERSCRIPT ( over~ start_ARG 1 end_ARG ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( 1 , 0 ) end_POSTSUBSCRIPT end_CELL start_CELL italic_p start_POSTSUPERSCRIPT ( over~ start_ARG 1 end_ARG ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( 0 , 1 ) end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] . (68)

Then, for a=1𝑎1a=1italic_a = 1, the nonzero entries of 𝐏(1)superscript𝐏1{\bf P}^{(1)}bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT are,

[𝐏1,1(1)𝐏1,6(1)]delimited-[]subscriptsuperscript𝐏111subscriptsuperscript𝐏116\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{1,1}&\ldots&{\bf P}^{(1)% }_{1,6}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 , 6 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== 𝐩,𝐩\displaystyle{\bf p},bold_p , (70)

for s=2,7𝑠27s=2,7italic_s = 2 , 7,

[𝐏s,1(1)𝐏s,6(1)]delimited-[]subscriptsuperscript𝐏1𝑠1subscriptsuperscript𝐏1𝑠6\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,1}&\ldots&{\bf P}^{(1)% }_{s,6}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 1 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 6 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== p1,1×𝐩,subscript𝑝11𝐩\displaystyle p_{1,1}\times{\bf p},italic_p start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT × bold_p , (72)
[𝐏s,7(1)𝐏s,9(1)]delimited-[]subscriptsuperscript𝐏1𝑠7subscriptsuperscript𝐏1𝑠9\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,7}&\ldots&{\bf P}^{(1)% }_{s,9}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 7 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 9 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== q1,1×𝐰,subscript𝑞11𝐰\displaystyle q_{1,1}\times{\bf w},italic_q start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT × bold_w , (74)

for s=3,16𝑠316s=3,16italic_s = 3 , 16,

[𝐏s,1(1)𝐏s,6(1)]delimited-[]subscriptsuperscript𝐏1𝑠1subscriptsuperscript𝐏1𝑠6\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,1}&\ldots&{\bf P}^{(1)% }_{s,6}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 1 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 6 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== p2,2×𝐩,subscript𝑝22𝐩\displaystyle p_{2,2}\times{\bf p},italic_p start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT × bold_p , (76)
[𝐏s,16(1)𝐏s,18(1)]delimited-[]subscriptsuperscript𝐏1𝑠16subscriptsuperscript𝐏1𝑠18\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,16}&\ldots&{\bf P}^{(1% )}_{s,18}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 16 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 18 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== q2,2×𝐰,subscript𝑞22𝐰\displaystyle q_{2,2}\times{\bf w},italic_q start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT × bold_w , (78)

for s=4,8,14,19𝑠481419s=4,8,14,19italic_s = 4 , 8 , 14 , 19,

[𝐏s,1(1)𝐏s,6(1)]delimited-[]subscriptsuperscript𝐏1𝑠1subscriptsuperscript𝐏1𝑠6\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,1}&\ldots&{\bf P}^{(1)% }_{s,6}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 1 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 6 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== p1,1p2,1×𝐩,subscript𝑝11subscript𝑝21𝐩\displaystyle p_{1,1}p_{2,1}\times{\bf p},italic_p start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 2 , 1 end_POSTSUBSCRIPT × bold_p , (80)
[𝐏s,7(1)𝐏s,9(1)]delimited-[]subscriptsuperscript𝐏1𝑠7subscriptsuperscript𝐏1𝑠9\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,7}&\ldots&{\bf P}^{(1)% }_{s,9}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 7 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 9 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== q1,1p2,1×𝐰subscript𝑞11subscript𝑝21𝐰\displaystyle q_{1,1}p_{2,1}\times{\bf w}italic_q start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 2 , 1 end_POSTSUBSCRIPT × bold_w (82)
[𝐏s,13(1)𝐏s,15(1)]delimited-[]subscriptsuperscript𝐏1𝑠13subscriptsuperscript𝐏1𝑠15\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,13}&\ldots&{\bf P}^{(1% )}_{s,15}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 13 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 15 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== p1,1q2,1×𝐰,subscript𝑝11subscript𝑞21𝐰\displaystyle p_{1,1}q_{2,1}\times{\bf w},italic_p start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT 2 , 1 end_POSTSUBSCRIPT × bold_w , (84)
𝐏s,19(1)subscriptsuperscript𝐏1𝑠19\displaystyle{\bf P}^{(1)}_{s,19}bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 19 end_POSTSUBSCRIPT =\displaystyle== q1,1q2,1,subscript𝑞11subscript𝑞21\displaystyle q_{1,1}q_{2,1},italic_q start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT 2 , 1 end_POSTSUBSCRIPT , (85)

for s=5,9,17,21𝑠591721s=5,9,17,21italic_s = 5 , 9 , 17 , 21,

[𝐏s,1(1)𝐏s,6(1)]delimited-[]subscriptsuperscript𝐏1𝑠1subscriptsuperscript𝐏1𝑠6\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,1}&\ldots&{\bf P}^{(1)% }_{s,6}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 1 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 6 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== p1,1p2,2×𝐩,subscript𝑝11subscript𝑝22𝐩\displaystyle p_{1,1}p_{2,2}\times{\bf p},italic_p start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT × bold_p , (87)
[𝐏s,7(1)𝐏s,9(1)]delimited-[]subscriptsuperscript𝐏1𝑠7subscriptsuperscript𝐏1𝑠9\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,7}&\ldots&{\bf P}^{(1)% }_{s,9}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 7 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 9 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== q1,1p2,2×𝐰,subscript𝑞11subscript𝑝22𝐰\displaystyle q_{1,1}p_{2,2}\times{\bf w},italic_q start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT × bold_w , (89)
[𝐏s,16(1)𝐏s,18(1)]delimited-[]subscriptsuperscript𝐏1𝑠16subscriptsuperscript𝐏1𝑠18\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,16}&\ldots&{\bf P}^{(1% )}_{s,18}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 16 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 18 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== p1,1q2,2×𝐰,subscript𝑝11subscript𝑞22𝐰\displaystyle p_{1,1}q_{2,2}\times{\bf w},italic_p start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT × bold_w , (91)
𝐏s,21(1)subscriptsuperscript𝐏1𝑠21\displaystyle{\bf P}^{(1)}_{s,21}bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 21 end_POSTSUBSCRIPT =\displaystyle== q1,1q2,2,subscript𝑞11subscript𝑞22\displaystyle q_{1,1}q_{2,2},italic_q start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT , (92)

for s=6,12,18,22𝑠6121822s=6,12,18,22italic_s = 6 , 12 , 18 , 22,

[𝐏s,1(1)𝐏s,6(1)]delimited-[]subscriptsuperscript𝐏1𝑠1subscriptsuperscript𝐏1𝑠6\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,1}&\ldots&{\bf P}^{(1)% }_{s,6}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 1 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 6 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== p1,2p2,2×𝐩,subscript𝑝12subscript𝑝22𝐩\displaystyle p_{1,2}p_{2,2}\times{\bf p},italic_p start_POSTSUBSCRIPT 1 , 2 end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT × bold_p , (95)
[𝐏s,10(1)𝐏s,12(1)]delimited-[]subscriptsuperscript𝐏1𝑠10subscriptsuperscript𝐏1𝑠12\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,10}&\ldots&{\bf P}^{(1% )}_{s,12}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 10 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 12 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== q1,2p2,2×𝐰,subscript𝑞12subscript𝑝22𝐰\displaystyle q_{1,2}p_{2,2}\times{\bf w},italic_q start_POSTSUBSCRIPT 1 , 2 end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT × bold_w , (97)
[𝐏s,16(1)𝐏s,18(1)]delimited-[]subscriptsuperscript𝐏1𝑠16subscriptsuperscript𝐏1𝑠18\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,16}&\ldots&{\bf P}^{(1% )}_{s,18}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 16 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 18 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== p1,2q2,2×𝐰,subscript𝑝12subscript𝑞22𝐰\displaystyle p_{1,2}q_{2,2}\times{\bf w},italic_p start_POSTSUBSCRIPT 1 , 2 end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT × bold_w , (99)
𝐏s,22(1)subscriptsuperscript𝐏1𝑠22\displaystyle{\bf P}^{(1)}_{s,22}bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 22 end_POSTSUBSCRIPT =\displaystyle== q1,2q2,2,subscript𝑞12subscript𝑞22\displaystyle q_{1,2}q_{2,2},italic_q start_POSTSUBSCRIPT 1 , 2 end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT , (100)

for s=10𝑠10s=10italic_s = 10,

[𝐏10,1(1)𝐏10,6(1)]delimited-[]subscriptsuperscript𝐏1101subscriptsuperscript𝐏1106\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{10,1}&\ldots&{\bf P}^{(1% )}_{10,6}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 10 , 1 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 10 , 6 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== p1,2×𝐩,subscript𝑝12𝐩\displaystyle p_{1,2}\times{\bf p},italic_p start_POSTSUBSCRIPT 1 , 2 end_POSTSUBSCRIPT × bold_p , (102)
[𝐏10,10(1)𝐏10,12(1)]delimited-[]subscriptsuperscript𝐏11010subscriptsuperscript𝐏11012\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{10,10}&\ldots&{\bf P}^{(% 1)}_{10,12}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 10 , 10 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 10 , 12 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== q1,2×𝐰,subscript𝑞12𝐰\displaystyle q_{1,2}\times{\bf w},italic_q start_POSTSUBSCRIPT 1 , 2 end_POSTSUBSCRIPT × bold_w , (104)

for s=11,15,20𝑠111520s=11,15,20italic_s = 11 , 15 , 20,

[𝐏s,1(1)𝐏s,6(1)]delimited-[]subscriptsuperscript𝐏1𝑠1subscriptsuperscript𝐏1𝑠6\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,1}&\ldots&{\bf P}^{(1)% }_{s,6}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 1 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 6 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== p1,2p2,1×𝐩,subscript𝑝12subscript𝑝21𝐩\displaystyle p_{1,2}p_{2,1}\times{\bf p},italic_p start_POSTSUBSCRIPT 1 , 2 end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 2 , 1 end_POSTSUBSCRIPT × bold_p , (107)
[𝐏s,10(1)𝐏s,12(1)]delimited-[]subscriptsuperscript𝐏1𝑠10subscriptsuperscript𝐏1𝑠12\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,10}&\ldots&{\bf P}^{(1% )}_{s,12}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 10 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 12 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== q1,2p2,1×𝐰,subscript𝑞12subscript𝑝21𝐰\displaystyle q_{1,2}p_{2,1}\times{\bf w},italic_q start_POSTSUBSCRIPT 1 , 2 end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 2 , 1 end_POSTSUBSCRIPT × bold_w , (109)
[𝐏s,13(1)𝐏s,15(1)]delimited-[]subscriptsuperscript𝐏1𝑠13subscriptsuperscript𝐏1𝑠15\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,13}&\ldots&{\bf P}^{(1% )}_{s,15}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 13 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 15 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== p1,2q2,1×𝐰,subscript𝑝12subscript𝑞21𝐰\displaystyle p_{1,2}q_{2,1}\times{\bf w},italic_p start_POSTSUBSCRIPT 1 , 2 end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT 2 , 1 end_POSTSUBSCRIPT × bold_w , (111)
𝐏s,20(1)subscriptsuperscript𝐏1𝑠20\displaystyle{\bf P}^{(1)}_{s,20}bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 20 end_POSTSUBSCRIPT =\displaystyle== q1,2q2,1,subscript𝑞12subscript𝑞21\displaystyle q_{1,2}q_{2,1},italic_q start_POSTSUBSCRIPT 1 , 2 end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT 2 , 1 end_POSTSUBSCRIPT , (112)

and for s=13𝑠13s=13italic_s = 13,

[𝐏13,1(1)𝐏13,6(1)]delimited-[]subscriptsuperscript𝐏1131subscriptsuperscript𝐏1136\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{13,1}&\ldots&{\bf P}^{(1% )}_{13,6}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 13 , 1 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 13 , 6 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== p2,1×𝐩,subscript𝑝21𝐩\displaystyle p_{2,1}\times{\bf p},italic_p start_POSTSUBSCRIPT 2 , 1 end_POSTSUBSCRIPT × bold_p , (114)
[𝐏13,13(1)𝐏13,15(1)]delimited-[]subscriptsuperscript𝐏11313subscriptsuperscript𝐏11315\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{13,13}&\ldots&{\bf P}^{(% 1)}_{13,15}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 13 , 13 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 13 , 15 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== q2,1×𝐰.subscript𝑞21𝐰\displaystyle q_{2,1}\times{\bf w}.italic_q start_POSTSUBSCRIPT 2 , 1 end_POSTSUBSCRIPT × bold_w . (116)

For a=2𝑎2a=2italic_a = 2, rows s=11,15𝑠1115s=11,15italic_s = 11 , 15 are the only rows in 𝐏(2)superscript𝐏2{\bf P}^{(2)}bold_P start_POSTSUPERSCRIPT ( 2 ) end_POSTSUPERSCRIPT that differ from the rows of 𝐏(1)superscript𝐏1{\bf P}^{(1)}bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT, with their nonzero entries given by,

[𝐏s,1(1)𝐏s,6(1)]delimited-[]subscriptsuperscript𝐏1𝑠1subscriptsuperscript𝐏1𝑠6\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,1}&\ldots&{\bf P}^{(1)% }_{s,6}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 1 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 6 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== p1,1p2,2×𝐩,subscript𝑝11subscript𝑝22𝐩\displaystyle p_{1,1}p_{2,2}\times{\bf p},italic_p start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT × bold_p , (118)
[𝐏s,7(1)𝐏s,9(1)]delimited-[]subscriptsuperscript𝐏1𝑠7subscriptsuperscript𝐏1𝑠9\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,7}&\ldots&{\bf P}^{(1)% }_{s,9}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 7 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 9 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== q1,1p2,2×𝐰,subscript𝑞11subscript𝑝22𝐰\displaystyle q_{1,1}p_{2,2}\times{\bf w},italic_q start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT × bold_w , (120)
[𝐏s,16(1)𝐏s,18(1)]delimited-[]subscriptsuperscript𝐏1𝑠16subscriptsuperscript𝐏1𝑠18\displaystyle\left[\begin{array}[]{ccc}{\bf P}^{(1)}_{s,16}&\ldots&{\bf P}^{(1% )}_{s,18}\end{array}\right][ start_ARRAY start_ROW start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 16 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 18 end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] =\displaystyle== p1,1q2,2×𝐰,subscript𝑝11subscript𝑞22𝐰\displaystyle p_{1,1}q_{2,2}\times{\bf w},italic_p start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT × bold_w , (122)
𝐏s,21(1)subscriptsuperscript𝐏1𝑠21\displaystyle{\bf P}^{(1)}_{s,21}bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s , 21 end_POSTSUBSCRIPT =\displaystyle== q1,1q2,2,subscript𝑞11subscript𝑞22\displaystyle q_{1,1}q_{2,2},italic_q start_POSTSUBSCRIPT 1 , 1 end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT 2 , 2 end_POSTSUBSCRIPT , (123)

and the remaining rows are the same as in 𝐏(1)superscript𝐏1{\bf P}^{(1)}bold_P start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT.

Finally, costs C(s,a)𝐶𝑠𝑎C(s,a)italic_C ( italic_s , italic_a ) are recorded as vectors 𝐂(a)=[C(s,a)]s=1,,22superscript𝐂𝑎subscriptdelimited-[]𝐶𝑠𝑎𝑠122{\bf C}^{(a)}=[C(s,a)]_{s=1,\ldots,22}bold_C start_POSTSUPERSCRIPT ( italic_a ) end_POSTSUPERSCRIPT = [ italic_C ( italic_s , italic_a ) ] start_POSTSUBSCRIPT italic_s = 1 , … , 22 end_POSTSUBSCRIPT corresponding to decisions a=1,2𝑎12a=1,2italic_a = 1 , 2, such that,

C(s,1)𝐶𝑠1\displaystyle C(s,1)italic_C ( italic_s , 1 ) =\displaystyle== 2c(σ)I{s=4,5,6}+c(σ)I{s=2,3,8,9,11,12,14,15,17,18}2superscript𝑐𝜎𝐼𝑠456superscript𝑐𝜎𝐼𝑠2389111214151718\displaystyle 2c^{(\sigma)}I\{s=4,5,6\}+c^{(\sigma)}I\{s=2,3,8,9,11,12,14,15,1% 7,18\}2 italic_c start_POSTSUPERSCRIPT ( italic_σ ) end_POSTSUPERSCRIPT italic_I { italic_s = 4 , 5 , 6 } + italic_c start_POSTSUPERSCRIPT ( italic_σ ) end_POSTSUPERSCRIPT italic_I { italic_s = 2 , 3 , 8 , 9 , 11 , 12 , 14 , 15 , 17 , 18 }
+c(p)I{s=4,6,8,10,12,13,14,18,19,22}+2c(p)I{s=11,15,20}superscript𝑐𝑝𝐼𝑠468101213141819222superscript𝑐𝑝𝐼𝑠111520\displaystyle+c^{(p)}I\{s=4,6,8,10,12,13,14,18,19,22\}+2c^{(p)}I\{s=11,15,20\}+ italic_c start_POSTSUPERSCRIPT ( italic_p ) end_POSTSUPERSCRIPT italic_I { italic_s = 4 , 6 , 8 , 10 , 12 , 13 , 14 , 18 , 19 , 22 } + 2 italic_c start_POSTSUPERSCRIPT ( italic_p ) end_POSTSUPERSCRIPT italic_I { italic_s = 11 , 15 , 20 }
C(s,2)𝐶𝑠2\displaystyle C(s,2)italic_C ( italic_s , 2 ) =\displaystyle== 2c(σ)I{s=4,5,6}+c(σ)I{s=2,3,8,9,11,12,14,15,17,18}2superscript𝑐𝜎𝐼𝑠456superscript𝑐𝜎𝐼𝑠2389111214151718\displaystyle 2c^{(\sigma)}I\{s=4,5,6\}+c^{(\sigma)}I\{s=2,3,8,9,11,12,14,15,1% 7,18\}2 italic_c start_POSTSUPERSCRIPT ( italic_σ ) end_POSTSUPERSCRIPT italic_I { italic_s = 4 , 5 , 6 } + italic_c start_POSTSUPERSCRIPT ( italic_σ ) end_POSTSUPERSCRIPT italic_I { italic_s = 2 , 3 , 8 , 9 , 11 , 12 , 14 , 15 , 17 , 18 }
+c(p)I{s=4,6,8,10,12,13,14,18,19,22}+2c(p)I{s=20}+c(t)I{s=11,15},superscript𝑐𝑝𝐼𝑠468101213141819222superscript𝑐𝑝𝐼𝑠20superscript𝑐𝑡𝐼𝑠1115\displaystyle+c^{(p)}I\{s=4,6,8,10,12,13,14,18,19,22\}+2c^{(p)}I\{s=20\}+c^{(t% )}I\{s=11,15\},+ italic_c start_POSTSUPERSCRIPT ( italic_p ) end_POSTSUPERSCRIPT italic_I { italic_s = 4 , 6 , 8 , 10 , 12 , 13 , 14 , 18 , 19 , 22 } + 2 italic_c start_POSTSUPERSCRIPT ( italic_p ) end_POSTSUPERSCRIPT italic_I { italic_s = 20 } + italic_c start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT italic_I { italic_s = 11 , 15 } ,

since the costs do not depend on the next observed state.