CVXSADes: a stochastic algorithm for constructing optimal exact regression designs with single or multiple objectives

Chi-Kuang Yeh \XeTeXLinkBox Corresponding author: [email protected] Department of Statistics and Actuarial Science, University of Waterloo Julie Zhou Department of Mathematics and Statistics, University of Victoria

May 5, 2024


ABSTRACT

We propose an algorithm to construct optimal exact designs (EDs). Most of the work in the optimal regression design literature focuses on the approximate design (AD) paradigm due to its desired properties, including the optimality verification conditions derived by Kiefer, (1959, 1974). ADs may have unbalanced weights, and practitioners may have difficulty implementing them with a designated run size n𝑛nitalic_n. Some EDs are constructed using rounding methods to get an integer number of runs at each support point of an AD, but this approach may not yield optimal results. To construct EDs, one may need to perform new combinatorial constructions for each n𝑛nitalic_n, and there is no unified approach to construct them. Therefore, we develop a systematic way to construct EDs for any given n𝑛nitalic_n. Our method can transform ADs into EDs while retaining high statistical efficiency in two steps. The first step involves constructing an AD by utilizing the convex nature of many design criteria. The second step employs a simulated annealing algorithm to search for the ED stochastically. Through several applications, we demonstrate the utility of our method for various design problems. Additionally, we show that the design efficiency approaches unity as the number of design points increases.


Keywords: design of experiment, optimal approximate design, exact design, multiple-objective design, maximin design, stochastic optimization, annealing algorithm, CVX solver


MSC 2020: 62K05, 62K20.

1 Introduction

Consider a general regression model,

yi=η(𝐱i,𝜽)+ϵi,i=1,,n,formulae-sequencesubscript𝑦𝑖𝜂subscript𝐱𝑖𝜽subscriptitalic-ϵ𝑖𝑖1𝑛y_{i}=\eta(\boldsymbol{\mathrm{x}}_{i},\boldsymbol{\theta})+\epsilon_{i},\quad i% =1,\ldots,n,italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_η ( bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_θ ) + italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_i = 1 , … , italic_n , (1)

where yisubscript𝑦𝑖y_{i}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the i𝑖iitalic_i-th observation of a response variable y𝑦yitalic_y at design point 𝐱iSpsubscript𝐱𝑖𝑆superscript𝑝\boldsymbol{\mathrm{x}}_{i}\in S\subset\mathbb{R}^{p}bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_S ⊂ blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT, S𝑆Sitalic_S is a design space, 𝜽q𝜽superscript𝑞\boldsymbol{\theta}\in\mathbb{R}^{q}bold_italic_θ ∈ blackboard_R start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT is the unknown regression parameter vector, response function η(𝐱i,𝜽)𝜂subscript𝐱𝑖𝜽\eta(\boldsymbol{\mathrm{x}}_{i},\boldsymbol{\theta})italic_η ( bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_θ ) can be a linear or nonlinear function of 𝜽𝜽\boldsymbol{\theta}bold_italic_θ, and the errors ϵisubscriptitalic-ϵ𝑖\epsilon_{i}italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are assumed to be uncorrelated with mean zero and finite variance σ2superscript𝜎2\sigma^{2}italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Let 𝜽^^𝜽\hat{\boldsymbol{\theta}}over^ start_ARG bold_italic_θ end_ARG be an estimator of 𝜽𝜽\boldsymbol{\theta}bold_italic_θ, such as the least squares estimator. Various optimal designs are defined by minimizing ϕ{𝒞ov(𝜽^)}italic-ϕ𝒞𝑜𝑣^𝜽\phi\left\{\mathcal{C}ov(\hat{\boldsymbol{\theta}})\right\}italic_ϕ { caligraphic_C italic_o italic_v ( over^ start_ARG bold_italic_θ end_ARG ) } over the design points 𝐱1,,𝐱nsubscript𝐱1subscript𝐱𝑛\boldsymbol{\mathrm{x}}_{1},\ldots,\boldsymbol{\mathrm{x}}_{n}bold_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , bold_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, where function ϕ()italic-ϕ\phi(\cdot)italic_ϕ ( ⋅ ) can be determinant, trace, or other scalar functions. The resulting designs are called optimal exact designs (OEDs), which depend on the response function η(,)𝜂\eta(\cdot,\cdot)italic_η ( ⋅ , ⋅ ), the design space S𝑆Sitalic_S, the estimator 𝜽^^𝜽\hat{\boldsymbol{\theta}}over^ start_ARG bold_italic_θ end_ARG, the scalar function ϕ()italic-ϕ\phi(\cdot)italic_ϕ ( ⋅ ), and the number of points n𝑛nitalic_n. As for searching for the OEDs, coordinate exchange and simulated annealing (SA) algorithms have been developed and used; see Meyer and Nachtsheim, (1988, 1995), Wilmut and Zhou, (2011), Smucker et al., (2012), Rempel and Zhou, (2014) and Palhazi Cuervo et al., (2016), for a small sample of recent contributions to this problem. It is well known that it is difficult to construct OEDs, even for relatively simple problems; see Section 1.7 in Berger and Wong, (2009) for more details. Note that other than the model and the parameter values, the exact designs also depend on the number of the run size n𝑛nitalic_n, and the practitioner needs to recalculate the design for each n𝑛nitalic_n, which makes it challenging to construct in practice.

To avoid calculating a huge number (near-infinite) of different exact designs for each given n𝑛nitalic_n, Kiefer, (1959, 1974) proposed and developed the general equivalence theory for optimal approximate designs (OADs). With the approximate design, one does not need to recalculate the design for each n𝑛nitalic_n. Instead, it calculates the proportion of how many resources should be allocated at different support points. The equivalence theorem is useful for constructing OADs analytically and numerically. After obtaining OADs, we may use rounding to convert them to exact designs. This method is usually suggested in research papers; e.g. Pukelsheim and Rieder, (1992) and its follow-up works. Before explaining the details of the conversion, we first give a short review of OADs. Let ξ(𝐱)𝜉𝐱\xi(\boldsymbol{\mathrm{x}})italic_ξ ( bold_x ) be a discrete distribution (design) with k𝑘kitalic_k support points in S𝑆Sitalic_S, say, 𝐯1,,𝐯ksubscript𝐯1subscript𝐯𝑘{\bf v}_{1},\ldots,{\bf v}_{k}bold_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , bold_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, and their corresponding weights are denoted by, w1,,wksubscript𝑤1subscript𝑤𝑘w_{1},\ldots,w_{k}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, respectively. Note that k𝑘kitalic_k is not fixed and can be any positive integer. Denote the set of all discrete distributions on S𝑆Sitalic_S as ΞSsubscriptΞ𝑆\Xi_{S}roman_Ξ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT. The information matrix of a design ξ(𝐱)ΞS𝜉𝐱subscriptΞ𝑆\xi(\boldsymbol{\mathrm{x}})\in\Xi_{S}italic_ξ ( bold_x ) ∈ roman_Ξ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT for model (1) is given by

𝐈(ξ,𝜽)=i=1kwi𝐟(𝐯i,𝜽)𝐟(𝐯i,𝜽),𝐈𝜉superscript𝜽superscriptsubscript𝑖1𝑘subscript𝑤𝑖𝐟subscript𝐯𝑖superscript𝜽superscript𝐟topsubscript𝐯𝑖superscript𝜽\displaystyle\boldsymbol{\mathrm{I}}(\xi,\boldsymbol{\theta}^{*})=\sum_{i=1}^{% k}w_{i}\boldsymbol{\mathrm{f}}({\bf v}_{i},\boldsymbol{\theta}^{*})\boldsymbol% {\mathrm{f}}^{\top}({\bf v}_{i},\boldsymbol{\theta}^{*}),bold_I ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT bold_f ( bold_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) bold_f start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( bold_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) , (2)

where vector 𝐟(𝐱,𝜽)=η(𝐱,𝜽)𝜽𝐟𝐱𝜽𝜂𝐱𝜽𝜽\boldsymbol{\mathrm{f}}(\boldsymbol{\mathrm{x}},\boldsymbol{\theta})=\frac{% \partial\eta(\boldsymbol{\mathrm{x}},\boldsymbol{\theta})}{\partial\boldsymbol% {\theta}}bold_f ( bold_x , bold_italic_θ ) = divide start_ARG ∂ italic_η ( bold_x , bold_italic_θ ) end_ARG start_ARG ∂ bold_italic_θ end_ARG, and 𝜽superscript𝜽\boldsymbol{\theta}^{*}bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is the true value of 𝜽𝜽\boldsymbol{\theta}bold_italic_θ. The covariance matrix of 𝜽^^𝜽\hat{\boldsymbol{\theta}}over^ start_ARG bold_italic_θ end_ARG, 𝒞ov(𝜽^)𝒞𝑜𝑣^𝜽\mathcal{C}ov(\hat{\boldsymbol{\theta}})caligraphic_C italic_o italic_v ( over^ start_ARG bold_italic_θ end_ARG ), is proportional to 𝐈1(ξ,𝜽)superscript𝐈1𝜉superscript𝜽\boldsymbol{\mathrm{I}}^{-1}(\xi,\boldsymbol{\theta}^{*})bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ). An OAD is defined as the minimizer of ϕ{𝐈1(ξ,𝜽)}italic-ϕsuperscript𝐈1𝜉superscript𝜽\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi,\boldsymbol{\theta}^{*})\right\}italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } over all possible designs ξΞS𝜉subscriptΞ𝑆\xi\in\Xi_{S}italic_ξ ∈ roman_Ξ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT for a given function ϕ()italic-ϕ\phi(\cdot)italic_ϕ ( ⋅ ). If 𝐈(ξ,𝜽)𝐈𝜉superscript𝜽\boldsymbol{\mathrm{I}}(\xi,\boldsymbol{\theta}^{*})bold_I ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) depends on 𝜽superscript𝜽\boldsymbol{\theta}^{*}bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, then the OAD is called a locally OAD or simply OAD in this paper. Notice that, for linear response functions, 𝐈(ξ,𝜽)𝐈𝜉superscript𝜽\boldsymbol{\mathrm{I}}(\xi,\boldsymbol{\theta}^{*})bold_I ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) does not depend on 𝜽superscript𝜽\boldsymbol{\theta}^{*}bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. In practice, we do not know 𝜽superscript𝜽\boldsymbol{\theta}^{*}bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, and replace 𝜽superscript𝜽\boldsymbol{\theta}^{*}bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT in (2) by an estimate, which may be available from pilot studies or the domain knowledge.

Since it is easier to construct OADs than OEDs, OADs have been obtained for various models, design spaces, and optimality criteria. Often numerical methods are used for finding OADs, and the methods include, for example, multiplicative algorithm (Zhang and Mukerjee, , 2013; Bose and Mukerjee, , 2015), cocktail algorithm (Yu, , 2011), genetic algorithm (Broudiscou et al., , 1996; Hamada et al., , 2001), semi-definite programming method (Papp, , 2012; Duarte and Wong, , 2015; Ye et al., , 2017), semi-infinite programming tools in Duarte and Wong, (2014) and Duarte et al., (2015), particle swarm method (Chen et al., , 2015), convex optimization method via CVX toolbox (Grant and Boyd, , 2020) in Gao and Zhou, (2017), a general method in Yang et al., (2013), and an efficient method in Duan et al., (2022). Mandal et al., (2015) provides a comprehensive review of the algorithmic approaches utilized in most of the methods mentioned above. Recently, Wong and Zhou, (2019, 2023) gave detailed discussions and comments on several algorithms, and they also developed effective numerical algorithms for finding OADs with multiple objective functions.

As the advantages of OADs are mentioned, tremendous efforts have been put into them, but how to effectively construct n𝑛nitalic_n design points efficiently from those OADs is still uncertain. Suppose ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is an OAD with m𝑚mitalic_m support points in S𝑆Sitalic_S, say 𝐯1,,𝐯msuperscriptsubscript𝐯1superscriptsubscript𝐯𝑚{\bf v}_{1}^{*},\ldots,{\bf v}_{m}^{*}bold_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , … , bold_v start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, and their weights are w1,,wmsuperscriptsubscript𝑤1superscriptsubscript𝑤𝑚w_{1}^{*},\ldots,w_{m}^{*}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , … , italic_w start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, respectively. It is clear that ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT does not depend on n𝑛nitalic_n. However, it may not be easy to implement ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT with n𝑛nitalic_n runs in practice, as nwi𝑛superscriptsubscript𝑤𝑖nw_{i}^{*}italic_n italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, i=1,,m𝑖1𝑚i=1,\ldots,mitalic_i = 1 , … , italic_m, are usually not integers. A general suggestion is to round each nwi𝑛superscriptsubscript𝑤𝑖nw_{i}^{*}italic_n italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT to the nearest positive integer subject to the total number of design points being n𝑛nitalic_n (Wong and Zhou, , 2019). In some situations, how best to round nwi𝑛superscriptsubscript𝑤𝑖nw_{i}^{*}italic_n italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT may not be clear. For instance, if an approximate design has weights as (w1,,w5)=(0.2493,0.2465,0.1033,0.1517,0.2492)superscriptsubscript𝑤1superscriptsubscript𝑤50.24930.24650.10330.15170.2492(w_{1}^{*},\ldots,w_{5}^{*})=(0.2493,0.2465,0.1033,0.1517,0.2492)( italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , … , italic_w start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = ( 0.2493 , 0.2465 , 0.1033 , 0.1517 , 0.2492 ), from a design in Haines et al., (2018, Example 4.2), then it is not clear how to choose n=10𝑛10n=10italic_n = 10 design points from the approximate design. Are there good strategies for selecting n𝑛nitalic_n design points from OADs?

In addition, the debate surrounding this rounding method has persisted for a while. In a review paper by John and Draper, (1975), the author stated, “the rounding-off procedure may eliminate points with small measure, thereby changing the nature of the design.” López-Fidalgo, (2023) recalls in his recent book, stating that the controversy regarding exact design is discussed in Section 1.4.4, where he mentions that the rounding approach from approximate design to exact design is highly controversial, with some, including Box, dissenting. Particularly, on page 7, López-Fidalgo, (2023) states, “Box has never accepted the use of the approximate designs introduced by Kiefer, (1959),” and on page 11, he further remarks, “This idea came from Kiefer, (1974) and it used to be a controversial topic that George Box and others never have liked.” In particular, when the number of available runs subject to the available resources is small, it is unclear which support points to keep from the approximate design or whether to keep any of them at all. Mukerjee and Huda, (2016) also raised issues related to rounding for fractional factorial designs. They studied procedures for finding highly efficient exact designs from approximate design, however their procedure can only be applied for designs with a finite number of points.

In this paper, we propose a stochastic algorithm to construct OEDs to address these issues and systematically construct exact designs. Our proposed method utilizes a meta-heuristic algorithm, which does not rely on restrictive assumptions and does not require computing the gradient of the loss (objective) function in optimal design problems, which may be difficult to obtain in complicated design problems. Our algorithm first constructs an OAD and uses it as a starting point to search for exact designs. Additionally, OADs are used to compute the design efficiency of OEDs. Various optimality criteria are studied, including single-objective and multiple-objective criteria. To summarize, this paper makes three important contributions to optimal design of experiments as follows.

(i) Fast and gradient-free algorithm: Our first contribution is the development of a general algorithm for finding highly efficient OEDs via a CVX solver and an SA algorithm. Highly efficient OEDs can be found easily from the algorithm. In particular, our method does not require the derivative/gradient and Hessian matrix of the objective function of the design problems.

(ii) Importance of finding both exact and approximate designs: Our second contribution is to use our algorithm for finding both the OAD and OED. We show the importance of searching for both OAD and OED together. If we just use a SA algorithm to search for exact designs, we do not know if the resulting designs are optimal or efficient. Computing both OED and OAD allows us to compute the design efficiency of an OED. If we just search for OADs and use a rounding method to obtain OEDs, then the resulting designs may not be efficient, or we do not know how to do the rounding. We can find highly efficient exact designs by computing an OAD and then searching for an exact design. The efficiency is computed by using the OAD.

(iii) Exact design on complex setup: Our third contribution is to construct highly efficient exact designs in complicated settings. The conventional methods to compute the exact designs in the literature mainly centre around the low dimensional (small p𝑝pitalic_p) problems and one objective function; for instance, Duarte et al., (2020) only considers the settings in at most three dimensions and only one objective function. There is a need to fill the gap between constructing the exact design in more complex settings, including design in high-dimension, and for more than one objective function. Our method is demonstrated to be able to find exact designs in higher dimensions, seven-dimensional space, and exact designs with multiple competing objectives. We present four applications with various optimality criteria and design spaces, including multiple-objective criteria and high-dimensional design spaces.

The rest of the paper is organized as follows. In Section 2, we review the optimal design criteria with single and multiple objectives, related equivalence theory for OADs, and convex optimal design problems on discrete design spaces. In Section 3, we develop an effective algorithm to find OADs and OEDs for single and multiple objectives. We also give several properties about the OADs and OEDs and many remarks on the properties of the proposed algorithm. Section 4 presents several applications and their OEDs, which are difficult to find by conventional methods. Finally, we close the paper with the concluding remarks in Section 5. All proofs and derivations are in the Appendix. The implementation is available on the author’s GitHub page https://github.com/chikuang/CVXSADes.

2 Optimality criteria with single or multiple objectives

Various optimality criteria have been proposed and studied in the literature; see, for example, Fedorov, (1972), Pukelsheim, (1993), Berger and Wong, (2009), and Dean et al., (2015). Here we recall several criteria to illustrate optimal design problems and equivalence theory, and present convex optimization problems on discrete design spaces.

2.1 Optimal design problems

A-, c-, D-, I-, and E-optimality criteria are commonly used to construct optimal designs with a single objective function. The optimal design problems for finding OADs can be written as

minξΞSϕ{𝐈1(ξ,𝜽)},subscript𝜉subscriptΞ𝑆italic-ϕsuperscript𝐈1𝜉superscript𝜽\displaystyle\min_{\xi\in\Xi_{S}}\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi,% \boldsymbol{\theta}^{*})\right\},roman_min start_POSTSUBSCRIPT italic_ξ ∈ roman_Ξ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } , (3)

where 𝐈(ξ,𝜽)𝐈𝜉superscript𝜽\boldsymbol{\mathrm{I}}(\xi,\boldsymbol{\theta}^{*})bold_I ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) is defined in (2), and ϕitalic-ϕ\phiitalic_ϕ is a scalar function. For example, ϕitalic-ϕ\phiitalic_ϕ is the determinant function for D-optimality, and trace for A-optimality. We denote trace and determinant functions as tr()tr\operatorname{tr}(\cdot)roman_tr ( ⋅ ) and det()\det(\cdot)roman_det ( ⋅ ), respectively. Let ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT be the solution to problem (3), which depends on ϕitalic-ϕ\phiitalic_ϕ, and ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is called an OAD.

The equivalence theory states the necessary and sufficient condition that an OAD satisfies. A general form for the condition is

dϕ(𝐱,ξϕ)0,for all𝐱S,formulae-sequencesubscript𝑑italic-ϕ𝐱superscriptsubscript𝜉italic-ϕ0for all𝐱𝑆\displaystyle d_{\phi}(\boldsymbol{\mathrm{x}},\xi_{\phi}^{*})\leq 0,~{}~{}% \mbox{for all}~{}\boldsymbol{\mathrm{x}}\in S,italic_d start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( bold_x , italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ 0 , for all bold_x ∈ italic_S , (4)

where function dϕ(𝐱,ξϕ)subscript𝑑italic-ϕ𝐱superscriptsubscript𝜉italic-ϕd_{\phi}(\boldsymbol{\mathrm{x}},\xi_{\phi}^{*})italic_d start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( bold_x , italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) depends on the optimality criterion and is the negative of the directional derivative of ϕitalic-ϕ\phiitalic_ϕ. The equality in (4) holds at the support points of ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. For a D-optimal design ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT,

dϕ(𝐱,ξϕ)=𝐟(𝐱,𝜽)𝐈1(ξϕ,𝜽)𝐟(𝐱,𝜽)q.subscript𝑑italic-ϕ𝐱superscriptsubscript𝜉italic-ϕsuperscript𝐟top𝐱superscript𝜽superscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽𝐟𝐱superscript𝜽𝑞\displaystyle d_{\phi}(\boldsymbol{\mathrm{x}},\xi_{\phi}^{*})=\boldsymbol{% \mathrm{f}}^{\top}(\boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})\boldsymbol% {\mathrm{I}}^{-1}(\xi_{\phi}^{*},\boldsymbol{\theta}^{*})\boldsymbol{\mathrm{f% }}(\boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})-q.italic_d start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( bold_x , italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = bold_f start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) bold_f ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) - italic_q .

If ϕ{𝐈1(ξ,𝜽)}=tr{𝐂𝐈1(ξ,𝜽)𝐂}italic-ϕsuperscript𝐈1𝜉superscript𝜽trsuperscript𝐂topsuperscript𝐈1𝜉superscript𝜽𝐂\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi,\boldsymbol{\theta}^{*})\right\}=% \operatorname{tr}\left\{{\bf C}^{\top}\boldsymbol{\mathrm{I}}^{-1}(\xi,% \boldsymbol{\theta}^{*}){\bf C}\right\}italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } = roman_tr { bold_C start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) bold_C } with a constant matrix 𝐂𝐂{\bf C}bold_C (q×r𝑞𝑟q\times ritalic_q × italic_r; rq𝑟𝑞r\leq qitalic_r ≤ italic_q), which includes A-, c-, and I-optimality criteria, then

dϕ(𝐱,ξϕ)=𝐟(𝐱,𝜽)𝐈1(ξϕ,𝜽)𝐂𝐂𝐈1(ξϕ,𝜽)𝐟(𝐱,𝜽)tr{𝐂𝐈1(ξϕ,𝜽)𝐂}.subscript𝑑italic-ϕ𝐱superscriptsubscript𝜉italic-ϕsuperscript𝐟top𝐱superscript𝜽superscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽superscript𝐂𝐂topsuperscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽𝐟𝐱superscript𝜽trsuperscript𝐂topsuperscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽𝐂\displaystyle d_{\phi}(\boldsymbol{\mathrm{x}},\xi_{\phi}^{*})=\boldsymbol{% \mathrm{f}}^{\top}(\boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})\boldsymbol% {\mathrm{I}}^{-1}(\xi_{\phi}^{*},\boldsymbol{\theta}^{*}){\bf C}{\bf C}^{\top}% \boldsymbol{\mathrm{I}}^{-1}(\xi_{\phi}^{*},\boldsymbol{\theta}^{*})% \boldsymbol{\mathrm{f}}(\boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})-% \operatorname{tr}\left\{{\bf C}^{\top}\boldsymbol{\mathrm{I}}^{-1}(\xi_{\phi}^% {*},\boldsymbol{\theta}^{*}){\bf C}\right\}.italic_d start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( bold_x , italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = bold_f start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) bold_CC start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) bold_f ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) - roman_tr { bold_C start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) bold_C } .

For a generalized linear model (GLM), the maximum likelihood estimator is often used to estimate parameter 𝜽𝜽\boldsymbol{\theta}bold_italic_θ. The information matrix of design ξ(𝐱)𝜉𝐱\xi(\boldsymbol{\mathrm{x}})italic_ξ ( bold_x ) is different from that in (2) and can be written as

𝐈(ξ,𝜽)=i=1kwiλ(𝐯i,𝜽)𝐟(𝐯i)𝐟(𝐯i),𝐈𝜉superscript𝜽superscriptsubscript𝑖1𝑘subscript𝑤𝑖𝜆subscript𝐯𝑖superscript𝜽𝐟subscript𝐯𝑖superscript𝐟topsubscript𝐯𝑖\boldsymbol{\mathrm{I}}(\xi,\boldsymbol{\theta}^{*})=\sum_{i=1}^{k}w_{i}% \lambda({\bf v}_{i},\boldsymbol{\theta}^{*})\boldsymbol{\mathrm{f}}({\bf v}_{i% })\boldsymbol{\mathrm{f}}^{\top}({\bf v}_{i}),bold_I ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_λ ( bold_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) bold_f ( bold_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) bold_f start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( bold_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , (5)

where λ(𝐱,𝜽)𝜆𝐱superscript𝜽\lambda(\boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})italic_λ ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) and 𝐟(𝐱)𝐟𝐱\boldsymbol{\mathrm{f}}(\boldsymbol{\mathrm{x}})bold_f ( bold_x ) depend on the link and predictor functions of the GLM. In function dϕ(𝐱,ξϕ)subscript𝑑italic-ϕ𝐱superscriptsubscript𝜉italic-ϕd_{\phi}(\boldsymbol{\mathrm{x}},\xi_{\phi}^{*})italic_d start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( bold_x , italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ), we replace 𝐟(𝐱,𝜽)𝐟𝐱superscript𝜽\boldsymbol{\mathrm{f}}(\boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})bold_f ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) by λ(𝐱,𝜽)𝐟(𝐱)𝜆𝐱superscript𝜽𝐟𝐱\sqrt{\lambda({\boldsymbol{\mathrm{x}}},\boldsymbol{\theta}^{*})}\boldsymbol{% \mathrm{f}}(\boldsymbol{\mathrm{x}})square-root start_ARG italic_λ ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_ARG bold_f ( bold_x ), and several examples are given in Section 4.

2.2 Optimal design problems on discrete design spaces

The necessary and sufficient condition in (4) enables us to find analytical and numerical solutions of ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT for various models, including polynomial, second-order, and nonlinear models. However, it is still challenging to find ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT for complicated models or design spaces. Wong and Zhou, (2019) discussed and investigated OADs on discrete design spaces. They used the fact that optimal design problems are convex optimization problems for commonly used optimality criteria, and applied CVX solver for finding OADs. Here we recall some details of the optimal design problems on discrete design spaces.

Let SN={𝐮1,,𝐮N}Ssubscript𝑆𝑁subscript𝐮1subscript𝐮𝑁𝑆S_{N}=\{{\bf u}_{1},\ldots,{\bf u}_{N}\}\subset Sitalic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT = { bold_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , bold_u start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT } ⊂ italic_S be a discrete design space with N𝑁Nitalic_N points, where 𝐮1,,𝐮Nsubscript𝐮1subscript𝐮𝑁{\bf u}_{1},\ldots,{\bf u}_{N}bold_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , bold_u start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT are user selected points. One possible choice is to use equally spaced grid points in S𝑆Sitalic_S. Denote any distribution on SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT by

ξN=(𝐮1𝐮2𝐮Nw1w2wN),subscript𝜉𝑁subscript𝐮1subscript𝐮2subscript𝐮𝑁subscript𝑤1subscript𝑤2subscript𝑤𝑁\xi_{N}=\left(\begin{array}[]{cccc}{\bf u}_{1}&{\bf u}_{2}&\ldots&{\bf u}_{N}% \\ w_{1}&w_{2}&\ldots&w_{N}\\ \end{array}\right),italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT = ( start_ARRAY start_ROW start_CELL bold_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL bold_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL bold_u start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL italic_w start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL start_CELL … end_CELL start_CELL italic_w start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ) ,

where the weights satisfy wj0subscript𝑤𝑗0w_{j}\geq 0italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≥ 0 for j=1,,N𝑗1𝑁j=1,\ldots,Nitalic_j = 1 , … , italic_N and j=1Nwj=1superscriptsubscript𝑗1𝑁subscript𝑤𝑗1\sum_{j=1}^{N}w_{j}=1∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 1, and a point 𝐮jsubscript𝐮𝑗{\bf u}_{j}bold_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is a support point of ξNsubscript𝜉𝑁\xi_{N}italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT if the corresponding weight wj>0subscript𝑤𝑗0w_{j}>0italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT > 0. Let ΞSNsubscriptΞsubscript𝑆𝑁\Xi_{S_{N}}roman_Ξ start_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT end_POSTSUBSCRIPT be the set of all distributions on SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT. The information matrix of design ξNsubscript𝜉𝑁\xi_{N}italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT become 𝐈(ξN,𝜽)𝐈subscript𝜉𝑁superscript𝜽\boldsymbol{\mathrm{I}}(\xi_{N},\boldsymbol{\theta}^{*})bold_I ( italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ), which can be computed from (2) or (5), replacing 𝐯isubscript𝐯𝑖{\bf v}_{i}bold_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT by 𝐮isubscript𝐮𝑖{\bf u}_{i}bold_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and replacing i=1ksuperscriptsubscript𝑖1𝑘\sum_{i=1}^{k}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT by i=1Nsuperscriptsubscript𝑖1𝑁\sum_{i=1}^{N}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT. It is important to notice that in ξNsubscript𝜉𝑁\xi_{N}italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT, 𝐮1,,𝐮Nsubscript𝐮1subscript𝐮𝑁{\bf u}_{1},\ldots,{\bf u}_{N}bold_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , bold_u start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT are fixed points, but w1,,wNsubscript𝑤1subscript𝑤𝑁w_{1},\ldots,w_{N}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_w start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT are unknown weights. In addition, 𝐈(ξN,𝜽)𝐈subscript𝜉𝑁superscript𝜽\boldsymbol{\mathrm{I}}(\xi_{N},\boldsymbol{\theta}^{*})bold_I ( italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) is linear in weights w1,,wNsubscript𝑤1subscript𝑤𝑁w_{1},\ldots,w_{N}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_w start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT. Let weight vector 𝐰=(w1,,wN)𝐰superscriptsubscript𝑤1subscript𝑤𝑁top{\bf w}=(w_{1},\ldots,w_{N})^{\top}bold_w = ( italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_w start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. Then ϕ{𝐈1(ξN,𝜽)}italic-ϕsuperscript𝐈1subscript𝜉𝑁superscript𝜽\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{N},\boldsymbol{\theta}^{*})\right\}italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } is a convex function of 𝐰𝐰{\bf w}bold_w for commonly used optimality criteria; see Boyd and Vandenberghe, (2004), and Wong and Zhou, (2019). An OAD on SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT, denoted by ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, is a solution to the following optimization problem,

minξNΞSNϕ{𝐈1(ξN,𝜽)},subscriptsubscript𝜉𝑁subscriptΞsubscript𝑆𝑁italic-ϕsuperscript𝐈1subscript𝜉𝑁superscript𝜽\min_{\xi_{N}\in\Xi_{S_{N}}}\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{N},% \boldsymbol{\theta}^{*})\right\},roman_min start_POSTSUBSCRIPT italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ∈ roman_Ξ start_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } ,

or

{min𝐰ϕ{𝐈1(ξN,𝜽)}subject to:wj0,j=1,,N,j=1Nwj=1.casessubscript𝐰italic-ϕsuperscript𝐈1subscript𝜉𝑁superscript𝜽formulae-sequencesubject to:subscript𝑤𝑗0formulae-sequence𝑗1𝑁superscriptsubscript𝑗1𝑁subscript𝑤𝑗1\displaystyle\left\{\begin{array}[]{l}\min_{\bf w}\phi\left\{\boldsymbol{% \mathrm{I}}^{-1}(\xi_{N},\boldsymbol{\theta}^{*})\right\}\\ \mbox{subject to:}~{}~{}w_{j}\geq 0,~{}j=1,\ldots,N,~{}~{}\sum_{j=1}^{N}w_{j}=% 1.\\ \end{array}\right.{ start_ARRAY start_ROW start_CELL roman_min start_POSTSUBSCRIPT bold_w end_POSTSUBSCRIPT italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } end_CELL end_ROW start_ROW start_CELL subject to: italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≥ 0 , italic_j = 1 , … , italic_N , ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 1 . end_CELL end_ROW end_ARRAY (8)

Problem (8) is a constraint convex optimization problem if ϕ{𝐈1(ξN,𝜽)}italic-ϕsuperscript𝐈1subscript𝜉𝑁superscript𝜽\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{N},\boldsymbol{\theta}^{*})\right\}italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } is a convex function of 𝐰𝐰{\bf w}bold_w. It can be solved by CVX solver, and detailed procedures of using CVX for finding ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT are given in Wong and Zhou, (2019).

2.3 Multiple-objective optimal designs

For multiple-objective optimal designs, there are mainly three optimality criteria, which are often used. They are compound, multiple efficiency constraint, and maximin efficiency criteria (Wong and Zhou, , 2023). Since compound and multiple efficiency constraint criteria need extra information to form the design problems, we only focus on maximin efficiency criterion to discuss OEDs. Let ϕ1(ξ),,ϕ(ξ)subscriptitalic-ϕ1𝜉subscriptitalic-ϕ𝜉\phi_{1}(\xi),\ldots,\phi_{\ell}(\xi)italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ξ ) , … , italic_ϕ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ( italic_ξ ) be \ellroman_ℓ objective functions. For instance, ϕ1(ξ)=det{𝐈1(ξ,𝜽)},,ϕ(ξ)=tr{𝐈1(ξ,𝜽)}formulae-sequencesubscriptitalic-ϕ1𝜉superscript𝐈1𝜉superscript𝜽subscriptitalic-ϕ𝜉trsuperscript𝐈1𝜉superscript𝜽\phi_{1}(\xi)=\det\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi,\boldsymbol{\theta}^% {*})\right\},\ldots,\phi_{\ell}(\xi)=\operatorname{tr}\left\{\boldsymbol{% \mathrm{I}}^{-1}(\xi,\boldsymbol{\theta}^{*})\right\}italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ξ ) = roman_det { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } , … , italic_ϕ start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ( italic_ξ ) = roman_tr { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) }, which are defined as different scalar functions of the same information matrix of a model. They can also be defined as the same scalar function of information matrices of several models. Suppose there are three competing models for an experiment, which leads to three different information matrices, say, 𝐈1(ξ,𝜽1)subscript𝐈1𝜉superscriptsubscript𝜽1\boldsymbol{\mathrm{I}}_{1}(\xi,\boldsymbol{\theta}_{1}^{*})bold_I start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ), 𝐈2(ξ,𝜽2)subscript𝐈2𝜉superscriptsubscript𝜽2\boldsymbol{\mathrm{I}}_{2}(\xi,\boldsymbol{\theta}_{2}^{*})bold_I start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ), and 𝐈3(ξ,𝜽3)subscript𝐈3𝜉superscriptsubscript𝜽3\boldsymbol{\mathrm{I}}_{3}(\xi,\boldsymbol{\theta}_{3}^{*})bold_I start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ), where 𝜽1,𝜽2superscriptsubscript𝜽1superscriptsubscript𝜽2\boldsymbol{\theta}_{1}^{*},\boldsymbol{\theta}_{2}^{*}bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, and 𝜽3superscriptsubscript𝜽3\boldsymbol{\theta}_{3}^{*}bold_italic_θ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT are the true parameters for the three models, respectively. We may then define ϕ1(ξ)=det{𝐈1(ξ,𝜽1)}subscriptitalic-ϕ1𝜉subscript𝐈1𝜉superscriptsubscript𝜽1\phi_{1}(\xi)=\det\left\{\boldsymbol{\mathrm{I}}_{1}(\xi,\boldsymbol{\theta}_{% 1}^{*})\right\}italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ξ ) = roman_det { bold_I start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) }, ϕ2(ξ)=det{𝐈2(ξ,𝜽2)}subscriptitalic-ϕ2𝜉subscript𝐈2𝜉superscriptsubscript𝜽2\phi_{2}(\xi)=\det\left\{\boldsymbol{\mathrm{I}}_{2}(\xi,\boldsymbol{\theta}_{% 2}^{*})\right\}italic_ϕ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_ξ ) = roman_det { bold_I start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) }, and ϕ3(ξ)=det{𝐈3(ξ,𝜽3)}subscriptitalic-ϕ3𝜉subscript𝐈3𝜉superscriptsubscript𝜽3\phi_{3}(\xi)=\det\left\{\boldsymbol{\mathrm{I}}_{3}(\xi,\boldsymbol{\theta}_{% 3}^{*})\right\}italic_ϕ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_ξ ) = roman_det { bold_I start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) }. Alternatively, we can use different scalar functions of those information matrices.

A maximin efficiency design is defined below. First, we minimize each ϕi(ξ)subscriptitalic-ϕ𝑖𝜉\phi_{i}(\xi)italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_ξ ) over ξ𝜉\xiitalic_ξ to obtain an OAD ξϕisuperscriptsubscript𝜉subscriptitalic-ϕ𝑖\xi_{\phi_{i}}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, i=1,,𝑖1i=1,\ldots,\ellitalic_i = 1 , … , roman_ℓ. Second, we define design efficiency for a design ξ𝜉\xiitalic_ξ and a given criterion ϕitalic-ϕ\phiitalic_ϕ as

𝔼ffϕ(ξ)=ϕ{𝐈1(ξϕ,𝜽)}ϕ{𝐈1(ξ,𝜽)}.subscript𝔼ffitalic-ϕ𝜉italic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽italic-ϕsuperscript𝐈1𝜉superscript𝜽\displaystyle\operatorname{\mathbb{E}ff}_{\phi}(\xi)=\frac{\phi\left\{% \boldsymbol{\mathrm{I}}^{-1}(\xi_{\phi}^{*},\boldsymbol{\theta}^{*})\right\}}{% \phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi,\boldsymbol{\theta}^{*})\right\}}.start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ ) = divide start_ARG italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } end_ARG start_ARG italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } end_ARG . (9)

For the \ellroman_ℓ objective functions, there are \ellroman_ℓ efficiencies for a design ξ𝜉\xiitalic_ξ, which can be written as

𝔼ffϕi(ξ)=ϕi(ξϕi)ϕi(ξ),i=1,,.formulae-sequencesubscript𝔼ffsubscriptitalic-ϕ𝑖𝜉subscriptitalic-ϕ𝑖superscriptsubscript𝜉subscriptitalic-ϕ𝑖subscriptitalic-ϕ𝑖𝜉𝑖1\operatorname{\mathbb{E}ff}_{\phi_{i}}(\xi)=\frac{\phi_{i}(\xi_{\phi_{i}}^{*})% }{\phi_{i}(\xi)},~{}~{}i=1,\ldots,\ell.start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ξ ) = divide start_ARG italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_ξ ) end_ARG , italic_i = 1 , … , roman_ℓ .

Third, we find a solution to the maximin problem given as

maxξΞSmin1il𝔼ffϕi(ξ),subscript𝜉subscriptΞ𝑆subscript1𝑖𝑙subscript𝔼ffsubscriptitalic-ϕ𝑖𝜉\max_{\xi\in\Xi_{S}}\min_{1\leq i\leq l}~{}\operatorname{\mathbb{E}ff}_{\phi_{% i}}(\xi),roman_max start_POSTSUBSCRIPT italic_ξ ∈ roman_Ξ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT 1 ≤ italic_i ≤ italic_l end_POSTSUBSCRIPT start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ξ ) ,

and the solution is called a maximin efficiency design. The maximin problem is hard to solve. However, this problem can be transformed into a convex optimization problem on a discrete design space SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT, which can be solved easily via CVX solver. On SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT, in the maximin problem we replace ξ𝜉\xiitalic_ξ and ΞSsubscriptΞ𝑆\Xi_{S}roman_Ξ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT by ξNsubscript𝜉𝑁\xi_{N}italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT and ΞSNsubscriptΞsubscript𝑆𝑁\Xi_{S_{N}}roman_Ξ start_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT end_POSTSUBSCRIPT, respectively, and use ξϕi,Nsuperscriptsubscript𝜉subscriptitalic-ϕ𝑖𝑁\xi_{\phi_{i},N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT to compute the efficiencies. Wong and Zhou, (2023) developed an effective algorithm for finding maximin optimal designs with various kinds of objective functions. Gao et al., (2024) also derived the necessary and sufficient conditions for multiple-objective optimal designs on SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT.

3 Algorithms for OEDs

In this section, we develop an algorithm to construct OEDs which depends on n𝑛nitalic_n. Let ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT be the OED for an optimality criterion ϕitalic-ϕ\phiitalic_ϕ, and its support points are denoted by 𝐱~1,,𝐱~msubscript~𝐱1subscript~𝐱𝑚{\tilde{\boldsymbol{\mathrm{x}}}}_{1},\ldots,{\tilde{\boldsymbol{\mathrm{x}}}}% _{m}over~ start_ARG bold_x end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , over~ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT with corresponding weights w~1,,w~msubscript~𝑤1subscript~𝑤𝑚{\tilde{w}}_{1},\ldots,{\tilde{w}}_{m}over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT. In contrast to u1,,uNsubscript𝑢1subscript𝑢𝑁u_{1},\cdots,u_{N}italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_u start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT, these 𝐱~1subscript~𝐱1{\tilde{\boldsymbol{\mathrm{x}}}}_{1}over~ start_ARG bold_x end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT are not fixed/user specified. The weights must satisfy (i) nw~i𝑛subscript~𝑤𝑖n{\tilde{w}}_{i}italic_n over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is a positive integer for each i=1,,m𝑖1𝑚i=1,\ldots,mitalic_i = 1 , … , italic_m, (ii) i=1mw~i=1superscriptsubscript𝑖1𝑚subscript~𝑤𝑖1\sum_{i=1}^{m}{\tilde{w}}_{i}=1∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT over~ start_ARG italic_w end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1. Define ΞnsubscriptΞ𝑛\Xi_{n}roman_Ξ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT to be the set of all exact designs on S𝑆Sitalic_S with size n𝑛nitalic_n. It is clear that ΞnΞssubscriptΞ𝑛subscriptΞ𝑠\Xi_{n}\subset\Xi_{s}roman_Ξ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⊂ roman_Ξ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT. Since ξn,ϕΞnΞSsuperscriptsubscript𝜉𝑛italic-ϕsubscriptΞ𝑛subscriptΞ𝑆\xi_{n,\phi}^{*}\in\Xi_{n}\subset\Xi_{S}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∈ roman_Ξ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⊂ roman_Ξ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT and the OAD ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is a solution to (3), it is obvious that

ϕ{𝐈1(ξn,ϕ,𝜽)}ϕ{𝐈1(ξϕ,𝜽)},italic-ϕsuperscript𝐈1superscriptsubscript𝜉𝑛italic-ϕsuperscript𝜽italic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{n,\phi}^{*},\boldsymbol{\theta}^{% *})\right\}\geq\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{\phi}^{*},% \boldsymbol{\theta}^{*})\right\},italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } ≥ italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } ,

which implies that 𝔼ffϕ(ξn,ϕ)1subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ1\operatorname{\mathbb{E}ff}_{\phi}(\xi_{n,\phi}^{*})\leq 1start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≤ 1. A design ξ𝜉\xiitalic_ξ is said to be highly efficient if 𝔼ffϕ(ξ)subscript𝔼ffitalic-ϕ𝜉\operatorname{\mathbb{E}ff}_{\phi}(\xi)start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ ) is close to 1111. In practice we may use 𝔼ffϕ(ξ)0.95subscript𝔼ffitalic-ϕ𝜉0.95\operatorname{\mathbb{E}ff}_{\phi}(\xi)\geq 0.95start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ ) ≥ 0.95 to define highly efficient design ξ𝜉\xiitalic_ξ.

For D-optimality, we may minimize det{𝐈1(ξ,𝜽)}superscript𝐈1𝜉superscript𝜽\det\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi,\boldsymbol{\theta}^{*})\right\}roman_det { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } or log(det{𝐈1(ξ,𝜽)})superscript𝐈1𝜉superscript𝜽\log\left(\det\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi,\boldsymbol{\theta}^{*})% \right\}\right)roman_log ( roman_det { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } ) to find D-optimal designs, but the D-efficiency is given by

𝔼ffD(ξ)=(det{𝐈1(ξϕ,𝜽)})1/q(det{𝐈1(ξ,𝜽)})1/q,subscript𝔼ff𝐷𝜉superscriptsuperscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽1𝑞superscriptsuperscript𝐈1𝜉superscript𝜽1𝑞\operatorname{\mathbb{E}ff}_{D}(\xi)=\frac{\left(\det\left\{\boldsymbol{% \mathrm{I}}^{-1}(\xi_{\phi}^{*},\boldsymbol{\theta}^{*})\right\}\right)^{1/q}}% {\left(\det\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi,\boldsymbol{\theta}^{*})% \right\}\right)^{1/q}},start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_D end_POSTSUBSCRIPT ( italic_ξ ) = divide start_ARG ( roman_det { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } ) start_POSTSUPERSCRIPT 1 / italic_q end_POSTSUPERSCRIPT end_ARG start_ARG ( roman_det { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } ) start_POSTSUPERSCRIPT 1 / italic_q end_POSTSUPERSCRIPT end_ARG ,

where ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is a D-optimal design. For other optimality criteria, the design efficiency is usually given by (9).

Notice that ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT may not always be available. In those cases, the OAD on Snsubscript𝑆𝑛S_{n}italic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT,ξϕ,N,\xi_{\phi,N}^{*}, italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, can be used to compute a modified design efficiency as

𝔼ff~ϕ(ξn,ϕ)=ϕ{𝐈1(ξϕ,N,𝜽)}ϕ{𝐈1(ξn,ϕ,𝜽)}.subscript~𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕitalic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕ𝑁superscript𝜽italic-ϕsuperscript𝐈1superscriptsubscript𝜉𝑛italic-ϕsuperscript𝜽\widetilde{\operatorname{\mathbb{E}ff}}_{\phi}(\xi_{n,\phi}^{*})=\frac{\phi% \left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{\phi,N}^{*},\boldsymbol{\theta}^{*})% \right\}}{\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{n,\phi}^{*},\boldsymbol% {\theta}^{*})\right\}}.over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = divide start_ARG italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } end_ARG start_ARG italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } end_ARG .

Since ϕ{𝐈1(ξϕ,N,𝜽)}ϕ{𝐈1(ξϕ,𝜽)},italic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕ𝑁superscript𝜽italic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{\phi,N}^{*},\boldsymbol{\theta}^{% *})\right\}\geq\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{\phi}^{*},% \boldsymbol{\theta}^{*})\right\},italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } ≥ italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } , it is possible that 𝔼ff~ϕ(ξn,ϕ)>1subscript~𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ1\widetilde{\operatorname{\mathbb{E}ff}}_{\phi}(\xi_{n,\phi}^{*})>1over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) > 1. Nevertheless, we can still say that ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is highly efficient when 𝔼ff~ϕ(ξn,ϕ)0.95subscript~𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ0.95\widetilde{\operatorname{\mathbb{E}ff}}_{\phi}(\xi_{n,\phi}^{*})\geq 0.95over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ≥ 0.95. Additional comments are given in Section 3.2.

We will introduce our proposal for an effective algorithm to find highly efficient ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT for small n𝑛nitalic_n and ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT in Section 3.1. Then we discuss and explore various properties of the algorithm and ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT in Section 3.2.

3.1 Algorithms

We develop a general algorithm for finding an approximate design ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT for a given design problem, and then construct an exact design ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT with high efficiency. In Algorithm 1, we first compute the OAD, ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT via CVX, and then ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is obtained through a SA with a starting design generated from ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. To describe the algorithm clearly, we use a general design problem below to explain the detailed steps in the algorithm. The objective function is Φ(ξ)=ϕ{𝐈1(ξ,𝜽))\Phi(\xi)=\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi,\boldsymbol{\theta}^{*})\right)roman_Φ ( italic_ξ ) = italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ), where ξ𝜉\xiitalic_ξ is a design on design space Sp𝑆superscript𝑝S\subset\mathbb{R}^{p}italic_S ⊂ blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT.

Algorithm 1: CVXSADes for computing OADs and OEDs.
Input:
* SN={𝐮1,,𝐮N}subscript𝑆𝑁subscript𝐮1subscript𝐮𝑁S_{N}=\{{\bf u}_{1},\ldots,{\bf u}_{N}\}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT = { bold_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , bold_u start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT }: a set of discrete design space
* Function 𝒇(𝐱,𝜽)𝒇𝐱superscript𝜽\boldsymbol{f}(\boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})bold_italic_f ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) or functions λ(𝐱,𝜽)𝜆𝐱superscript𝜽\lambda(\boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})italic_λ ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) and 𝒇(𝐱)𝒇𝐱\boldsymbol{f}(\boldsymbol{\mathrm{x}})bold_italic_f ( bold_x ) to compute the information matrix 𝐈(ξ,𝜽)𝐈𝜉superscript𝜽\boldsymbol{\mathrm{I}}(\xi,\boldsymbol{\theta}^{*})bold_I ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) in (2) or (5)
* Φ(ξ)=ϕ{𝐈1(ξ,𝜽)}Φ𝜉italic-ϕsuperscript𝐈1𝜉superscript𝜽\Phi(\xi)=\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi,\boldsymbol{\theta}^{*})\right\}roman_Φ ( italic_ξ ) = italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) }: a loss/objective function
* n𝑛nitalic_n: the number of points in the exact design
* M𝑀Mitalic_M: the number of times to run the annealing algorithm
* T0,Tmax,α,K,δ=105subscript𝑇0subscript𝑇𝑚𝑎𝑥𝛼𝐾𝛿superscript105T_{0},~{}T_{max},\alpha,K,\delta=10^{-5}italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_T start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT , italic_α , italic_K , italic_δ = 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT: parameters in the annealing algorithm for initial temperature, minimum temperature, cooling factor, number of iterations for each temperature, and tolerance for a stop** criterion, respectively
—————— Main steps of the algorithm ————————
———————————————————————–
Step 1: Compute an OAD ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT on SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT as a solution to problem (8) via CVX solver;
Step 2: For j=1,,M𝑗1𝑀j=1,\dots,Mitalic_j = 1 , … , italic_M do
2.1: Get an initial exact design from ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and denote it as ξn,ϕ,j(0)superscriptsubscript𝜉𝑛italic-ϕ𝑗0\xi_{n,\phi,j}^{(0)}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT. The support points of ξn,ϕ,j(0)superscriptsubscript𝜉𝑛italic-ϕ𝑗0\xi_{n,\phi,j}^{(0)}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT are denoted by 𝐱1(0),,𝐱m(0)superscriptsubscript𝐱10superscriptsubscript𝐱𝑚0\boldsymbol{\mathrm{x}}_{1}^{(0)},\ldots,\boldsymbol{\mathrm{x}}_{m}^{(0)}bold_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT , … , bold_x start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT, which are the same as those of ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. The weights w1(0),,wm(0)superscriptsubscript𝑤10superscriptsubscript𝑤𝑚0w_{1}^{(0)},\ldots,w_{m}^{(0)}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT , … , italic_w start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT, at 𝐱1(0),,𝐱m(0)superscriptsubscript𝐱10superscriptsubscript𝐱𝑚0\boldsymbol{\mathrm{x}}_{1}^{(0)},\ldots,\boldsymbol{\mathrm{x}}_{m}^{(0)}bold_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT , … , bold_x start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT, respectively, are obtained as follows. Let ni(0)superscriptsubscript𝑛𝑖0n_{i}^{(0)}italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT be the rounded integer from nwi𝑛superscriptsubscript𝑤𝑖nw_{i}^{*}italic_n italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, satisfying i=1mni(0)=nsuperscriptsubscript𝑖1𝑚superscriptsubscript𝑛𝑖0𝑛\sum_{i=1}^{m}n_{i}^{(0)}=n∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = italic_n, and let wi(0)=ni(0)/nsuperscriptsubscript𝑤𝑖0superscriptsubscript𝑛𝑖0𝑛w_{i}^{(0)}=n_{i}^{(0)}/nitalic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT / italic_n for i=1,,m𝑖1𝑚i=1,\ldots,mitalic_i = 1 , … , italic_m. Note that w1,,wmsuperscriptsubscript𝑤1superscriptsubscript𝑤𝑚w_{1}^{*},\ldots,w_{m}^{*}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , … , italic_w start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT are the weights at the support points of ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT;
2.2: Let t1𝑡1t\leftarrow 1italic_t ← 1, TT0𝑇subscript𝑇0T\leftarrow T_{0}italic_T ← italic_T start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, 11subscript11\ell_{1}\leftarrow 1roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ← 1 and 20subscript20\ell_{2}\leftarrow 0roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ← 0;
2.3: Simulated Annealing: while T>Tmin𝑇subscript𝑇𝑚𝑖𝑛T>T_{min}italic_T > italic_T start_POSTSUBSCRIPT italic_m italic_i italic_n end_POSTSUBSCRIPT and |21|>δsubscript2subscript1𝛿\lvert\ell_{2}-\ell_{1}\rvert>\delta| roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | > italic_δ do
2.3.1: Make a small change in ξn,ϕ,j(t1)superscriptsubscript𝜉𝑛italic-ϕ𝑗𝑡1\xi_{n,\phi,j}^{(t-1)}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT to get ξn,ϕ,j(t)superscriptsubscript𝜉𝑛italic-ϕ𝑗𝑡\xi_{n,\phi,j}^{(t)}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT. It is done by moving a randomly selected design point in ξn,ϕ,j(t1)superscriptsubscript𝜉𝑛italic-ϕ𝑗𝑡1\xi_{n,\phi,j}^{(t-1)}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT, say 𝐱~~𝐱\tilde{\boldsymbol{\mathrm{x}}}over~ start_ARG bold_x end_ARG, to a new location in S𝑆Sitalic_S, and the new location is randomly generated in a small hyper-cube centred at 𝐱~~𝐱\tilde{\boldsymbol{\mathrm{x}}}over~ start_ARG bold_x end_ARG;
2.3.2: ξn,ϕ,j(t)superscriptsubscript𝜉𝑛italic-ϕ𝑗𝑡\xi_{n,\phi,j}^{(t)}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT is accepted if exp{(Φ(ξn,ϕ,j(t))Φ(ξn,ϕ,j(t1)))/T}>u(t)Φsuperscriptsubscript𝜉𝑛italic-ϕ𝑗𝑡Φsuperscriptsubscript𝜉𝑛italic-ϕ𝑗𝑡1𝑇superscript𝑢𝑡\exp\left\{-(\Phi(\xi_{n,\phi,j}^{(t)})-\Phi(\xi_{n,\phi,j}^{(t-1)}))/T\right% \}>u^{(t)}roman_exp { - ( roman_Φ ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) - roman_Φ ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT ) ) / italic_T } > italic_u start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT, where u(t)unif(0,1)similar-tosuperscript𝑢𝑡unif01u^{(t)}\sim\mbox{unif}(0,1)italic_u start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ∼ unif ( 0 , 1 ). If it is accepted, then 1Φ(ξn,ϕ,j(t1))subscript1Φsuperscriptsubscript𝜉𝑛italic-ϕ𝑗𝑡1\ell_{1}\leftarrow\Phi(\xi_{n,\phi,j}^{(t-1)})roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ← roman_Φ ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t - 1 ) end_POSTSUPERSCRIPT ), 2Φ(ξn,ϕ,j(t))subscript2Φsuperscriptsubscript𝜉𝑛italic-ϕ𝑗𝑡\ell_{2}\leftarrow\Phi(\xi_{n,\phi,j}^{(t)})roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ← roman_Φ ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) and tt+1𝑡𝑡1t\leftarrow t+1italic_t ← italic_t + 1;
2.3.3: Let TαT𝑇𝛼𝑇T\leftarrow\alpha\cdot Titalic_T ← italic_α ⋅ italic_T after using the same temperature for K𝐾Kitalic_K times;
2.4: Let ξn,ϕ,jsuperscriptsubscript𝜉𝑛italic-ϕ𝑗\xi_{n,\phi,j}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT be the last accepted design. Compute Φ(ξn,ϕ,j)Φsuperscriptsubscript𝜉𝑛italic-ϕ𝑗\Phi(\xi_{n,\phi,j}^{*})roman_Φ ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) and the modified design efficiency 𝔼ff~ϕ(ξn,ϕ,j)Φ(ξϕ,N)Φ(ξn,ϕ,j)subscript~𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ𝑗Φsuperscriptsubscript𝜉italic-ϕ𝑁Φsuperscriptsubscript𝜉𝑛italic-ϕ𝑗\widetilde{\operatorname{\mathbb{E}ff}}_{\phi}(\xi_{n,\phi,j}^{*})\leftarrow% \frac{\Phi(\xi_{\phi,N}^{*})}{\Phi(\xi_{n,\phi,j}^{*})}over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ← divide start_ARG roman_Φ ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_ARG start_ARG roman_Φ ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_ARG;
Step 3: Select best the design from {ξn,ϕ,j}j=1Msuperscriptsubscriptsuperscriptsubscript𝜉𝑛italic-ϕ𝑗𝑗1𝑀\{\xi_{n,\phi,j}^{*}\}_{j=1}^{M}{ italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT with the highest efficiency 𝔼ff~ϕsubscript~𝔼ffitalic-ϕ\widetilde{\operatorname{\mathbb{E}ff}}_{\phi}over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT and denote it as an OED ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT.
Refer to caption
Figure 1: Plot of loss function values Φ(ξn,ϕ,j(t))Φsuperscriptsubscript𝜉𝑛italic-ϕ𝑗𝑡\Phi(\xi_{n,\phi,j}^{(t)})roman_Φ ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) in an annealing algorithm, where the red line is the value of the initial value after rounding Φ(ξn,ϕ,j(0))Φsuperscriptsubscript𝜉𝑛italic-ϕ𝑗0\Phi(\xi_{n,\phi,j}^{(0)})roman_Φ ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT ).
Remark 1.

Wong and Zhou, (2019, 2023) have provided details for finding OADs via CVX with both single and multiple-objective OADs on SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT. CVX solver can solve problem (8) easily with N𝑁Nitalic_N as large as 10,000. From our numerical results, N𝑁Nitalic_N does not have to be very large to obtain a highly efficient OAD ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. Often we can choose N<10,000𝑁10000N<10,000italic_N < 10 , 000. In Application 1 in Section 4, a highly efficient OAD ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is obtained with N=212=441𝑁superscript212441N=21^{2}=441italic_N = 21 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 441, and additional comments are provided there.

Remark 2.

To get an initial design ξn,ϕ(0)superscriptsubscript𝜉𝑛italic-ϕ0\xi_{n,\phi}^{(0)}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT, we need to get the rounded integers ni(0)superscriptsubscript𝑛𝑖0n_{i}^{(0)}italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT. When it is not clear how to round nwi𝑛superscriptsubscript𝑤𝑖nw_{i}^{*}italic_n italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT for some applications, we can just take floor or ceiling of nwi𝑛superscriptsubscript𝑤𝑖nw_{i}^{*}italic_n italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT such that i=1mni(0)=nsuperscriptsubscript𝑖1𝑚superscriptsubscript𝑛𝑖0𝑛\sum_{i=1}^{m}n_{i}^{(0)}=n∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = italic_n. Since ξn,ϕ(0)superscriptsubscript𝜉𝑛italic-ϕ0\xi_{n,\phi}^{(0)}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT is just a starting point in the annealing algorithm, the rounding effect is not crucial for the substantial task. As usual, running the annealing algorithm several times is helpful to find an exact design with a high design efficiency value.

Remark 3.

SA algorithm has been used to find optimal designs in the literature, where there are several parameters in the algorithm that need to be adjusted for each optimization problem. For instance, Wilmut and Zhou, (2011) discussed some strategies for adjusting those parameters. Here, it is helpful to use a plot of loss function values Φ(ξn,ϕ,j(t))Φsuperscriptsubscript𝜉𝑛italic-ϕ𝑗𝑡\Phi(\xi_{n,\phi,j}^{(t)})roman_Φ ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) versus t𝑡titalic_t (iteration) for checking the annealing parameters, for each fixed j𝑗jitalic_j. Figure 1 (from Application 2 in Section 4) shows a plot from the annealing algorithm when the parameters are set appropriately. At the beginning of the search Φ(ξn,ϕ,j(t))Φsuperscriptsubscript𝜉𝑛italic-ϕ𝑗𝑡\Phi(\xi_{n,\phi,j}^{(t)})roman_Φ ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) fluctuates, and eventually as t𝑡titalic_t increases Φ(ξn,ϕ.j(t))Φsuperscriptsubscript𝜉formulae-sequence𝑛italic-ϕ𝑗𝑡\Phi(\xi_{n,\phi.j}^{(t)})roman_Φ ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ . italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_t ) end_POSTSUPERSCRIPT ) decreases and converges to a limit.

Remark 4.

If an OAD ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT on S𝑆Sitalic_S is available, skip Step 1 and replace ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT by it in Step 2.1.

Remark 5.

Since a SA algorithm is used in Step 2.3, it does not guarantee to find the best exact design. However, we can set a required efficiency value, say 0.950.950.950.95, and run the algorithm M𝑀Mitalic_M times to search for a highly efficient exact design.

Remark 6.

Algorithm 1 also works for the maximin optimal design by setting

Φ(ξ)=min1il𝔼ffϕi(ξ).Φ𝜉subscript1𝑖𝑙subscript𝔼ffsubscriptitalic-ϕ𝑖𝜉\Phi(\xi)=-\min_{1\leq i\leq l}~{}\operatorname{\mathbb{E}ff}_{\phi_{i}}(\xi).roman_Φ ( italic_ξ ) = - roman_min start_POSTSUBSCRIPT 1 ≤ italic_i ≤ italic_l end_POSTSUBSCRIPT start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ξ ) .

In Section 4, we provide four representative applications with various models and optimality criteria to show that Algorithm 1 is effective and works well.

3.2 Properties of optimal designs

Figure 2 shows the relationship among the three sets of distributions, where ΞnsubscriptΞ𝑛\Xi_{n}roman_Ξ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is the set of exact designs with run size n𝑛nitalic_n on S𝑆Sitalic_S. It is obvious that the optimal designs, ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, and ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT for any criterion on Ξn,ΞSN,subscriptΞ𝑛subscriptΞsubscript𝑆𝑁\Xi_{n},\Xi_{S_{N}},roman_Ξ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , roman_Ξ start_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT end_POSTSUBSCRIPT , and ΞSsubscriptΞ𝑆\Xi_{S}roman_Ξ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT, respectively, satisfy

ϕ{𝐈1(ξn,ϕ,𝜽)}ϕ{𝐈1(ξϕ,𝜽)},ϕ{𝐈1(ξϕ,N,𝜽)}ϕ{𝐈1(ξϕ,𝜽)},formulae-sequenceitalic-ϕsuperscript𝐈1superscriptsubscript𝜉𝑛italic-ϕsuperscript𝜽italic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽italic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕ𝑁superscript𝜽italic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{n,\phi}^{*},\boldsymbol{\theta}^{% *})\right\}\geq\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{\phi}^{*},% \boldsymbol{\theta}^{*})\right\},~{}~{}\phi\left\{\boldsymbol{\mathrm{I}}^{-1}% (\xi_{\phi,N}^{*},\boldsymbol{\theta}^{*})\right\}\geq\phi\left\{\boldsymbol{% \mathrm{I}}^{-1}(\xi_{\phi}^{*},\boldsymbol{\theta}^{*})\right\},italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } ≥ italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } , italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } ≥ italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } ,

for any n𝑛nitalic_n and N𝑁Nitalic_N. Theorem 1 below shows some asymptotic results of ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT under some mild conditions.

ΞSsubscriptΞ𝑆\mathrm{\Xi}_{S}roman_Ξ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPTΞSNsubscriptΞsubscript𝑆𝑁\mathrm{\Xi}_{S_{N}}roman_Ξ start_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT end_POSTSUBSCRIPTΞnsubscriptΞ𝑛\mathrm{\Xi}_{n}roman_Ξ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT
Figure 2: Illustration of the three sets of distributions, ΞSsubscriptΞ𝑆\Xi_{S}roman_Ξ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT, ΞSNsubscriptΞsubscript𝑆𝑁\Xi_{S_{N}}roman_Ξ start_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT end_POSTSUBSCRIPT and ΞnsubscriptΞ𝑛\Xi_{n}roman_Ξ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT for the optimal designs, OADs and OEDs, respectively.
Theorem 1.

Consider a regression model and a design space S𝑆Sitalic_S. Let ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT be the OED for a given n𝑛nitalic_n, ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT be the OAD on SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT, and ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT be the OAD design on S𝑆Sitalic_S. Assume all entries of I(𝐱,𝛉)𝐼𝐱superscript𝛉I(\boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})italic_I ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) are continuous functions of 𝐱𝐱\boldsymbol{\mathrm{x}}bold_x and S𝑆Sitalic_S is a bounded region, where I(𝐱,𝛉)=𝐟(𝐱,𝛉)𝐟(𝐱,𝛉)𝐼𝐱superscript𝛉𝐟𝐱superscript𝛉superscript𝐟top𝐱superscript𝛉I(\boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})={\bf f}(\boldsymbol{\mathrm% {x}},\boldsymbol{\theta}^{*}){\bf f}^{\top}(\boldsymbol{\mathrm{x}},% \boldsymbol{\theta}^{*})italic_I ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = bold_f ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) bold_f start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) for model (1), or I(𝐱,𝛉)=λ(𝐱,𝛉)𝐟(𝐱)𝐟(𝐱)𝐼𝐱superscript𝛉𝜆𝐱superscript𝛉𝐟𝐱superscript𝐟top𝐱I(\boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})=\lambda(\boldsymbol{\mathrm% {x}},\boldsymbol{\theta}^{*}){\bf f}(\boldsymbol{\mathrm{x}}){\bf f}^{\top}(% \boldsymbol{\mathrm{x}})italic_I ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = italic_λ ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) bold_f ( bold_x ) bold_f start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( bold_x ) for a GLM. In addition, SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT is formed by the Cartesian product of the equally spaced points for each design variable. We have the following results:

  1. (i)

    limn𝔼ffϕ(ξn,ϕ)=1subscript𝑛subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ1\lim_{n\to\infty}\operatorname{\mathbb{E}ff}_{\phi}(\xi_{n,\phi}^{*})=1roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = 1.

  2. (ii)

    limN𝔼ffϕ(ξϕ,N)=1subscript𝑁subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉italic-ϕ𝑁1\lim_{N\to\infty}\operatorname{\mathbb{E}ff}_{\phi}(\xi_{\phi,N}^{*})=1roman_lim start_POSTSUBSCRIPT italic_N → ∞ end_POSTSUBSCRIPT start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = 1.

The detailed proof is given in the Appendix. The asymptotic results indicate that ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT can be highly efficient for large n𝑛nitalic_n and N𝑁Nitalic_N. However, in practice we are often interested in exact designs with small n𝑛nitalic_n, and Algorithm 1 is very helpful for finding those exact designs.

From Theorem 1, limn𝔼ffϕ(ξn,ϕ)=1subscript𝑛subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ1\lim_{n\to\infty}\operatorname{\mathbb{E}ff}_{\phi}(\xi_{n,\phi}^{*})=1roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = 1. However, 𝔼ffϕ(ξn,ϕ)subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ\operatorname{\mathbb{E}ff}_{\phi}(\xi_{n,\phi}^{*})start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) may not be an increasing function of n𝑛nitalic_n. For example, if an OAD has 4 support points with equal weights, 1/4,1/4,1/4,1/4141414141/4,1/4,1/4,1/41 / 4 , 1 / 4 , 1 / 4 , 1 / 4, then it is easy to construct OEDs for n𝑛nitalic_n being multiples of 4 and those OEDs have 𝔼ffϕ(ξn,ϕ)=1subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ1\operatorname{\mathbb{E}ff}_{\phi}(\xi_{n,\phi}^{*})=1start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = 1. However, when n𝑛nitalic_n is not a multiple of 4, it is clear that 𝔼ffϕ(ξn,ϕ)<1subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ1\operatorname{\mathbb{E}ff}_{\phi}(\xi_{n,\phi}^{*})<1start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) < 1.

When the design space S𝑆Sitalic_S is discrete, we take SN=Ssubscript𝑆𝑁𝑆S_{N}=Sitalic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT = italic_S. Then we have ξϕ=ξϕ,Nsuperscriptsubscript𝜉italic-ϕsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi}^{*}=\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. In this situation, efficiency measures 𝔼ffϕ(ξn,ϕ)=𝔼ff~ϕ(ξn,ϕ)subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕsubscript~𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ\operatorname{\mathbb{E}ff}_{\phi}(\xi_{n,\phi}^{*})=\widetilde{\operatorname{% \mathbb{E}ff}}_{\phi}(\xi_{n,\phi}^{*})start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ). In general, for any S𝑆Sitalic_S and SNSsubscript𝑆𝑁𝑆S_{N}\subset Sitalic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ⊂ italic_S, design efficiency measures satisfy

𝔼ffϕ(ξn,ϕ)=𝔼ffϕ(ξϕ,N)𝔼ff~ϕ(ξn,ϕ),subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕsubscript𝔼ffitalic-ϕsuperscriptsubscript𝜉italic-ϕ𝑁subscript~𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ\operatorname{\mathbb{E}ff}_{\phi}(\xi_{n,\phi}^{*})=\operatorname{\mathbb{E}% ff}_{\phi}(\xi_{\phi,N}^{*})\cdot\widetilde{\operatorname{\mathbb{E}ff}}_{\phi% }(\xi_{n,\phi}^{*}),start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ⋅ over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ,

by their definitions in Sections 2.3 and 3. For instance, if 𝔼ffϕ(ξϕ,N)=0.97subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉italic-ϕ𝑁0.97\operatorname{\mathbb{E}ff}_{\phi}(\xi_{\phi,N}^{*})=0.97start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = 0.97 and 𝔼ff~ϕ(ξn,ϕ)=0.98subscript~𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ0.98\widetilde{\operatorname{\mathbb{E}ff}}_{\phi}(\xi_{n,\phi}^{*})=0.98over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = 0.98, then 𝔼ffϕ(ξn,ϕ)=0.9506subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ0.9506\operatorname{\mathbb{E}ff}_{\phi}(\xi_{n,\phi}^{*})=0.9506start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = 0.9506. Since 𝔼ff~ϕ(ξn,ϕ)subscript~𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ\widetilde{\operatorname{\mathbb{E}ff}}_{\phi}(\xi_{n,\phi}^{*})over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) may be larger than 1, 𝔼ffϕ(ξn,ϕ)subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ\operatorname{\mathbb{E}ff}_{\phi}(\xi_{n,\phi}^{*})start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) can be larger than 𝔼ffϕ(ξϕ,N)subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉italic-ϕ𝑁\operatorname{\mathbb{E}ff}_{\phi}(\xi_{\phi,N}^{*})start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) for some models and in particular for small N𝑁Nitalic_N. 𝔼ffϕ(ξϕ,N)subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉italic-ϕ𝑁\operatorname{\mathbb{E}ff}_{\phi}(\xi_{\phi,N}^{*})start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) may not be an increasing function of N𝑁Nitalic_N, but limN𝔼ffϕ(ξϕ,N)=1subscript𝑁subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉italic-ϕ𝑁1\lim_{N\to\infty}\operatorname{\mathbb{E}ff}_{\phi}(\xi_{\phi,N}^{*})=1roman_lim start_POSTSUBSCRIPT italic_N → ∞ end_POSTSUBSCRIPT start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = 1 from Theorem 1.

Algorithm 1 works well and is effective to find highly efficient ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT for small or moderate n𝑛nitalic_n. The number of distinct support points in ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT may not be the same as that in ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT or ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. When n𝑛nitalic_n is very large, the SA algorithm in Algorithm 1 can be slow. This is true for any SA algorithms. In this situation, we can use a rounding method to construct OEDs, as illustrated in the proof of Theorem 1 in the Appendix, and we can replace ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT by ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT when ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is not available. In that case the number of distinct support points in ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT are the same as that in ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT or ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT.

4 Applications

We apply our proposed algorithm, CVXSADes, to construct OEDs for various models and various values of n𝑛nitalic_n. Representative results are given and discussed in four applications below. Application 1 is for a D-optimality criterion for a logistic model with two design variables, where it is not clear how to round nwi𝑛subscript𝑤𝑖nw_{i}italic_n italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT from OADs. In addition, we discuss the choice of N𝑁Nitalic_N and the efficiency of OADs ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT on SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT as N𝑁Nitalic_N varies. Application 2 is about constructing optimal group testing designs over a discrete design space S𝑆Sitalic_S. The OAD ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT from Algorithm 1 is more appropriate than the optimal designs in a paper. Application 3 concerns high-dimensional designs, where the design space comprises seven design variables. The high-dimensionality presents a challenge in finding OADs and OEDs. Algorithm 1 provides an alternative method to find optimal designs and gives better approximate designs than those in a couple of examples in Xu et al., (2019). Application 4 shows results for maximin optimal designs in which multiple objectives compete against each other. In each of the applications, we demonstrate the usefulness and effectiveness of our proposed algorithm.

Application 1.

(Two-variable logit model) Consider a two-variable binary logistic regression model with interaction, as discussed in Haines et al., (2018), where it was used to study the effectiveness of different combinations of the concentration of two insecticides. The model is given as

logit(p)=log(p1p)=θ0+θ1x1+θ2x2+θ12x1x2,logit𝑝𝑝1𝑝subscript𝜃0subscript𝜃1subscript𝑥1subscript𝜃2subscript𝑥2subscript𝜃12subscript𝑥1subscript𝑥2\mbox{logit}(p)=\log\left(\frac{p}{1-p}\right)=\theta_{0}+\theta_{1}x_{1}+% \theta_{2}x_{2}+\theta_{12}x_{1}x_{2},logit ( italic_p ) = roman_log ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_p end_ARG ) = italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + italic_θ start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ,

where p𝑝pitalic_p is the probability of success, i.e., p=P(Y=1)𝑝𝑃𝑌1p=P(Y=1)italic_p = italic_P ( italic_Y = 1 ), Y𝑌Yitalic_Y is a binary response variable, x1subscript𝑥1x_{1}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and x2subscript𝑥2x_{2}italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT are two design variables, such as the doses of two drugs. D-OEDs were studied and derived analytically in Haines et al., (2018) for various parameter values 𝜽superscript𝜽\boldsymbol{\theta}^{*}bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and design spaces. For this GLM, the information matrix of design ξ(𝐱)𝜉𝐱\xi(\boldsymbol{\mathrm{x}})italic_ξ ( bold_x ) in (5) becomes

𝐈(ξ,𝜽)=i=1kwiexp(𝐟(𝐯i)𝜽)(1+exp(𝐟(𝐯i)𝜽))2𝐟(𝐯i)𝐟(𝐯i),𝐈𝜉superscript𝜽superscriptsubscript𝑖1𝑘subscript𝑤𝑖superscript𝐟topsubscript𝐯𝑖superscript𝜽superscript1superscript𝐟topsubscript𝐯𝑖superscript𝜽2𝐟subscript𝐯𝑖superscript𝐟topsubscript𝐯𝑖\displaystyle\boldsymbol{\mathrm{I}}(\xi,\boldsymbol{\theta}^{*})=\sum_{i=1}^{% k}w_{i}\frac{\exp(\boldsymbol{\mathrm{f}}^{\top}({\bf v}_{i})\boldsymbol{% \theta}^{*})}{\left(1+\exp(\boldsymbol{\mathrm{f}}^{\top}({\bf v}_{i})% \boldsymbol{\theta}^{*})\right)^{2}}\boldsymbol{\mathrm{f}}({\bf v}_{i})% \boldsymbol{\mathrm{f}}^{\top}({\bf v}_{i}),bold_I ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT divide start_ARG roman_exp ( bold_f start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( bold_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) end_ARG start_ARG ( 1 + roman_exp ( bold_f start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( bold_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG bold_f ( bold_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) bold_f start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( bold_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , (10)

where 𝐟(𝐱)=(1,x1,x2,x1x2)𝐟𝐱superscript1subscript𝑥1subscript𝑥2subscript𝑥1subscript𝑥2top\boldsymbol{\mathrm{f}}(\boldsymbol{\mathrm{x}})=(1,x_{1},x_{2},x_{1}x_{2})^{\top}bold_f ( bold_x ) = ( 1 , italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT.

Refer to caption
Figure 3: Plots of support points of D-OAD and D-OED in Application 1, case (i) with parameters 𝜽=(3,4,6,1)𝜽superscript3461top\boldsymbol{\theta}=(-3,4,6,1)^{\top}bold_italic_θ = ( - 3 , 4 , 6 , 1 ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, design space S=[0,1]2𝑆superscript012S=[0,1]^{2}italic_S = [ 0 , 1 ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and N=512𝑁superscript512N=51^{2}italic_N = 51 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT grid, where the numbers in blue-colour are the weights or numbers of observations. (a) Design 1 (D-OAD on SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT) and Design 2 from Haines et al., (2018, Example 4.2(b)), (b) D-OED for n=10𝑛10n=10italic_n = 10, (c) D-OED for n=15𝑛15n=15italic_n = 15, (d) D-OED for n=20𝑛20n=20italic_n = 20.

Since the D-optimality is used as a loss function ϕitalic-ϕ\phiitalic_ϕ, corresponding OADs and OEDs are denoted as D-OADs and D-OEDs. Using Algorithm 1 we compute D-OADs ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT on SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT and construct D-OEDs for various values of N𝑁Nitalic_N and n𝑛nitalic_n. Then we compare ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT with ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT in Haines et al., (2018) and comment on the D-OEDs. We have worked on several cases of 𝜽superscript𝜽\boldsymbol{\theta}^{*}bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and design spaces. Here we give representative results for two cases:
(i) 𝜽=(3,4,6,1)superscript𝜽superscript3461top\boldsymbol{\theta}^{*}=(-3,4,6,1)^{\top}bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = ( - 3 , 4 , 6 , 1 ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT and S=[0,1]2𝑆superscript012S=[0,1]^{2}italic_S = [ 0 , 1 ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT as in Haines et al., (2018, Example 4.2(b)),
(ii) 𝜽=(2.2054,13.5803,2.2547,1.6262)superscript𝜽superscript2.205413.58032.25471.6262top\boldsymbol{\theta}^{*}=(-2.2054,13.5803,2.2547,1.6262)^{\top}bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = ( - 2.2054 , 13.5803 , 2.2547 , 1.6262 ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT and S=[0,2]2𝑆superscript022S=[0,2]^{2}italic_S = [ 0 , 2 ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT as in Haines et al., (2018, Section 5).

For case (i), D-OADs and D-OEDs are plotted in Figure 3. In the approximate design in Figure 3(a), SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT includes N=512𝑁superscript512N=51^{2}italic_N = 51 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT grid points in S𝑆Sitalic_S, which is formed by Cartesian product of 51 equally spaced points in [0,1] in each dimension. Note that ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT have 5 support points that are almost the same, but ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT has one extra support point with a very small weight (0.0033). The weights of ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT are displayed there, and the weights of ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT are slightly different and not shown in the plot for clear presentation. In addition, the loss function values of the two designs are almost the same, with 𝔼ffD(ξN,ϕ)=0.9998subscript𝔼ff𝐷superscriptsubscript𝜉𝑁italic-ϕ0.9998\operatorname{\mathbb{E}ff}_{D}(\xi_{N,\phi}^{*})=0.9998start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_D end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_N , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = 0.9998. Three D-OEDs with n=10,15,𝑛1015n=10,15,italic_n = 10 , 15 , and 20 are plotted in Figure 3(b), (c) and (d), respectively, with efficiency 𝔼ffϕ(ξn,ϕ)=0.9836,0.9785formulae-sequencesubscriptsuperscript𝔼ffsimilar-toitalic-ϕabsentsuperscriptsubscript𝜉𝑛italic-ϕ0.98360.9785\stackrel{{\scriptstyle\sim}}{{\smash{\operatorname{\mathbb{E}ff}}\rule{0.0pt}% {4.73611pt}}}_{\phi}(\xi_{n,\phi}^{*})=0.9836,0.9785start_RELOP SUPERSCRIPTOP start_ARG start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION end_ARG start_ARG ∼ end_ARG end_RELOP start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = 0.9836 , 0.9785 and 1.00011.00011.00011.0001. This indicates that these D-OEDs are highly efficient. The support points of each D-OED are clustered around 5 points.

Refer to caption
Figure 4: Plots of support points of D-OADs and D-OEDs, where the numbers in blue-colour are the weights or numbers of observations. (a) Design I (D-OAD on SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT with N=412𝑁superscript412N=41^{2}italic_N = 41 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT) and Design II from Haines et al., (2018, the 5-support-point design in Section 5), (b) Design III (D-OAD on SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT with N=512𝑁superscript512N=51^{2}italic_N = 51 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT) and Design IV from Haines et al., (2018, the 6-support-point design in Section 5), (c) Design V (D-OED for n=10𝑛10n=10italic_n = 10 started from Design I), Design VI (D-OED for n=10𝑛10n=10italic_n = 10 started from Design III), (d) D-OED for n=15𝑛15n=15italic_n = 15 started from Design III.

For case (ii), D-OADs and D-OEDs are plotted in Figure 4. There are three D-OADs in Haines et al., (2018, Section 5), which have 4, 5, and 6 support points, respectively. The one with 6 support points has the smallest value of ϕ(ξϕ)italic-ϕsuperscriptsubscript𝜉italic-ϕ{\phi}(\xi_{\phi}^{*})italic_ϕ ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ), and we use it for the efficiency calculation of ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. Several values of N𝑁Nitalic_N are used to compute ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. Representative results are given in Table 1, which indicate that 𝔼ffϕ(ξϕ,N)subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉italic-ϕ𝑁{\operatorname{\mathbb{E}ff}}_{\phi}(\xi_{\phi,N}^{*})start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) increases as N𝑁Nitalic_N increases, 𝔼ffϕ(ξn,ϕ)subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ{\operatorname{\mathbb{E}ff}}_{\phi}(\xi_{n,\phi}^{*})start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) seems to be fluctuated a bit, and but both ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT have a high efficiency with the N𝑁Nitalic_N in the latter being as small as 212superscript21221^{2}21 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. The interpretation of having a constant 𝔼ffϕ(ξn,ϕ)subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ{\operatorname{\mathbb{E}ff}}_{\phi}(\xi_{n,\phi}^{*})start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) is that the annealing algorithm can effectively find highly efficient exact designs starting from different ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. This implies that the choice of N𝑁Nitalic_N does not affect the OED much, and in practice, we can choose a moderate N𝑁Nitalic_N. In Figure 4 (a) and (b), ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT are plotted. For clear presentation, the plots only display the weights in ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, which can also have 5 and 6 support points with different values of N𝑁Nitalic_N. The locations of the support points in ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT are similar to those in ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. In Figure 4(c) and (d), D-OEDs are plotted. Design V and VI are the same, but the annealing algorithm used different starting designs. In Haines et al., (2018, Section 5), it is not clear how to get D-OEDs from ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT with any given value of n𝑛nitalic_n. Algorithm 1 provides an effective way to construct D-OEDs for any n𝑛nitalic_n. The exact design in Figure 4(d) has 𝔼ffϕ(ξn,ϕ)=0.9935subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ0.9935{\operatorname{\mathbb{E}ff}}_{\phi}(\xi_{n,\phi}^{*})=0.9935start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = 0.9935 for n=15𝑛15n=15italic_n = 15.

Table 1: Efficiency of ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT (with n=10𝑛10n=10italic_n = 10) for case (ii) in Application 1.
N𝑁Nitalic_N    𝔼ffϕ(ξϕ,N)subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉italic-ϕ𝑁{\operatorname{\mathbb{E}ff}}_{\phi}(\xi_{\phi,N}^{*})start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT )    𝔼ffϕ(ξn,ϕ)subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ{\operatorname{\mathbb{E}ff}}_{\phi}(\xi_{n,\phi}^{*})start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT )
212superscript21221^{2}21 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT 0.9716 0.9822
312superscript31231^{2}31 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT 0.9901 0.9513
412superscript41241^{2}41 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT 0.9961 0.9793
512superscript51251^{2}51 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT 0.9985 0.9794
812superscript81281^{2}81 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT 0.9984 0.9822
Application 2.

(Group testing design for disease prevalence) Group testing is employed to study rare diseases when testing individuals for a trait is costly (Hughes-Oliver and Swallow, , 1994; Hughes-Oliver and Rosenberger, , 2000). Instead of taking samples from each individual and testing them individually, it is more cost-efficient to conduct group testing, wherein samples from individuals are pooled as a group and tested together as a unit. In Huang et al., (2017), optimal group testing designs are studied, where optimal group sizes are selected for group testing experiments. Since the design space S𝑆Sitalic_S only includes integer values, finding OADs on discrete design spaces SN(=S)annotatedsubscript𝑆𝑁absent𝑆S_{N}(=S)italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ( = italic_S ) via CVX is extremely useful. We demonstrate this for D- and c-optimality criteria in Huang et al., (2017), and present OADs and OEDs for various values of n𝑛nitalic_n. In addition, we find and comment on interesting features in OEDs.

To present the optimality criteria clearly, we rewrite the information matrix from Huang et al., (2017) as follows,

𝐈(ξ,𝜽)=i=1kwiλ(xi)𝐟(xi)𝐟(xi),𝐈𝜉𝜽superscriptsubscript𝑖1𝑘subscript𝑤𝑖𝜆subscript𝑥𝑖𝐟subscript𝑥𝑖superscript𝐟topsubscript𝑥𝑖\displaystyle\boldsymbol{\mathrm{I}}(\xi,\boldsymbol{\theta})=\sum_{i=1}^{k}w_% {i}\lambda(x_{i})\boldsymbol{\mathrm{f}}(x_{i})\boldsymbol{\mathrm{f}}^{\top}(% x_{i}),bold_I ( italic_ξ , bold_italic_θ ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_λ ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) bold_f ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) bold_f start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , (11)

where 𝜽=(p0,p1,p2)𝜽superscriptsubscript𝑝0subscript𝑝1subscript𝑝2top\boldsymbol{\theta}=(p_{0},p_{1},p_{2})^{\top}bold_italic_θ = ( italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, λ(x)=1/(π(x)(1π(x)))𝜆𝑥1𝜋𝑥1𝜋𝑥\lambda(x)=1/(\pi(x)(1-\pi(x)))italic_λ ( italic_x ) = 1 / ( italic_π ( italic_x ) ( 1 - italic_π ( italic_x ) ) ) with π(x)=p1(p1+p21)(1p0)x𝜋𝑥subscript𝑝1subscript𝑝1subscript𝑝21superscript1subscript𝑝0𝑥\pi(x)=p_{1}-(p_{1}+p_{2}-1)(1-p_{0})^{x}italic_π ( italic_x ) = italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - ( italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 ) ( 1 - italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT, and

𝐟(x)=(x(p1+p21)(1p0)x11(1p0)x(1p0)x).𝐟𝑥𝑥subscript𝑝1subscript𝑝21superscript1subscript𝑝0𝑥11superscript1subscript𝑝0𝑥superscript1subscript𝑝0𝑥\boldsymbol{\mathrm{f}}(x)=\left(\begin{array}[]{c}x(p_{1}+p_{2}-1)(1-p_{0})^{% x-1}\\ 1-(1-p_{0})^{x}\\ -(1-p_{0})^{x}\end{array}\right).bold_f ( italic_x ) = ( start_ARRAY start_ROW start_CELL italic_x ( italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - 1 ) ( 1 - italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_x - 1 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL 1 - ( 1 - italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL - ( 1 - italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT end_CELL end_ROW end_ARRAY ) .

See the details of the group testing model and information matrix in Huang et al., (2017). We consider D-optimality and c-optimality with vector 𝐜1=(1,0,0)subscript𝐜1superscript100top{\bf c}_{1}=(1,0,0)^{\top}bold_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ( 1 , 0 , 0 ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. We minimize ϕ{𝐈1(ξ,𝜽)}=𝐜1𝐈1(ξ,𝜽)𝐜1italic-ϕsuperscript𝐈1𝜉superscript𝜽superscriptsubscript𝐜1topsuperscript𝐈1𝜉superscript𝜽subscript𝐜1\phi\{\boldsymbol{\mathrm{I}}^{-1}(\xi,\boldsymbol{\theta}^{*})\}={\bf c}_{1}^% {\top}\boldsymbol{\mathrm{I}}^{-1}(\xi,\boldsymbol{\theta}^{*}){\bf c}_{1}italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } = bold_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) bold_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT to obtain c-optimal designs. Representative results are given in Table 2 for 𝜽=(0.07,0.93,0.96)superscript𝜽superscript0.070.930.96top\boldsymbol{\theta}^{*}=(0.07,0.93,0.96)^{\top}bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = ( 0.07 , 0.93 , 0.96 ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT and SN=S={1,2,3,,61}subscript𝑆𝑁𝑆12361S_{N}=S=\{1,2,3,\ldots,61\}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT = italic_S = { 1 , 2 , 3 , … , 61 }. From Huang et al., (2017), they found that the D-OAD has three support points: 1, 16.79, 61, which are similar to those of ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT in Table 2. However, one of their support points is not an integer, which is not the design space. Thus, ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is more appropriate for this experiment. Since S𝑆Sitalic_S only includes integers, we modify the annealing algorithm by adding 1 or 11-1- 1 to the selected design point to obtain a new design point in S𝑆Sitalic_S. As usual, we make sure that all new points are in S𝑆Sitalic_S. The D-OEDs for various values of n𝑛nitalic_n from Algorithm 1 are the same as those by rounding nwi𝑛subscript𝑤𝑖nw_{i}italic_n italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT to integers, and they are highly efficient.

For c-optimal designs, ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT in Table 2 is also more appropriate for this experiment, since the c-optimal design in Huang et al., (2017) also includes a non-integer support point (15.68). It is interesting to notice that the c-OEDs do not have the sample support points as in ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, and the design efficiencies are very high. One of the support points in ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is 16161616 with a large weight 0.6279, but some of the exact designs include a support point 15151515 or 17171717 and do not include 16161616 as a support point. See the results for n=10𝑛10n=10italic_n = 10, 11111111 and 14141414. Figure 1 gives a plot of loss function value versus iteration number in the annealing algorithm for finding c-OED with n=12𝑛12n=12italic_n = 12 where the y-axis is in log 10101010 scale. Note that it is easy to find OADs and OEDs using Algorithm 1 for any 𝜽superscript𝜽\boldsymbol{\theta}^{*}bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT value, any integer design space, and any optimality criterion.

Table 2: c- and D-OADs, and c-, D-OEDs, ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT and ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT in Application 2, where Per(i,j,k𝑖𝑗𝑘i,j,kitalic_i , italic_j , italic_k) means any permutation of i,j,k𝑖𝑗𝑘i,j,kitalic_i , italic_j , italic_k.
D-optimality support points weights/observations ϕ(𝐈1)italic-ϕsuperscript𝐈1{\phi}(\boldsymbol{\mathrm{I}}^{-1})italic_ϕ ( bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) 𝔼ff~(ξn,ϕ)~𝔼ffsuperscriptsubscript𝜉𝑛italic-ϕ\widetilde{\operatorname{\mathbb{E}ff}}(\xi_{n,\phi}^{*})over~ start_ARG blackboard_E roman_ff end_ARG ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT )
approximate ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT 1, 17, 61 1/3, 1/3, 1/3 0.1448 1.0000
n=10𝑛10n=10italic_n = 10 1, 17, 61 Per(4,3,3) 0.1462 0.9906
n=11𝑛11n=11italic_n = 11 1, 17, 61 Per(4,4,3) 0.1461 0.9912
n=12𝑛12n=12italic_n = 12 1, 17, 61 4, 4, 4 0.1448 1.0000
n=13𝑛13n=13italic_n = 13 1, 17, 61 Per(5,4,4) 0.1457 0.9944
n=14𝑛14n=14italic_n = 14 1, 17, 61 Per(5,5,4) 0.1456 0.9946
c-optimality support points weights/observations ϕ(𝐈1)italic-ϕsuperscript𝐈1{\phi}(\boldsymbol{\mathrm{I}}^{-1})italic_ϕ ( bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) 𝔼ff~(ξn,ϕ)~𝔼ffsuperscriptsubscript𝜉𝑛italic-ϕ\widetilde{\operatorname{\mathbb{E}ff}}(\xi_{n,\phi}^{*})over~ start_ARG blackboard_E roman_ff end_ARG ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT )
approximate ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT 1, 16, 61 0.1310, 0.6279, 0.2411 0.0354 1.0000
n=10𝑛10n=10italic_n = 10 1, 17, 61 1, 6, 3 0.0361 0.9799
n=11𝑛11n=11italic_n = 11 1, 17, 61 1, 7, 3 0.0361 0.9808
n=12𝑛12n=12italic_n = 12 1, 15, 16, 61 2, 4, 3, 3 0.0358 0.9891
n=13𝑛13n=13italic_n = 13 1, 15, 16, 61 2, 7, 1, 3 0.0355 0.9968
n=14𝑛14n=14italic_n = 14 1, 15, 61 2, 9, 3 0.0355 0.9970
Application 3.

(Seven-dimensional design space) In real-world applications, experiments often involve many design variables, and efficient designs can help save resources and prevent wasted time. Consider a logistic model with seven design variables and its information matrix is similar to that in (10) with 𝐟(𝐱)=(1,x1,x2,x3,x4,x5,x6,x7)𝐟𝐱superscript1subscript𝑥1subscript𝑥2subscript𝑥3subscript𝑥4subscript𝑥5subscript𝑥6subscript𝑥7top\boldsymbol{\mathrm{f}}(\boldsymbol{\mathrm{x}})=(1,x_{1},x_{2},x_{3},x_{4},x_% {5},x_{6},x_{7})^{\top}bold_f ( bold_x ) = ( 1 , italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. Xu et al., (2019) proposed an innovative algorithm using differential evolution for finding D-OADs with several design variables, and used this model as an example. When the design space S=[1,1]7𝑆superscript117S=[-1,1]^{7}italic_S = [ - 1 , 1 ] start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT, the D-OAD in Xu et al., (2019, Table 4) with

𝜽=(0.4926,0.6280,0.3283,0.4378,0.5283,0.6120,0.6837,0.2061)superscript𝜽superscript0.49260.62800.32830.43780.52830.61200.68370.2061top\boldsymbol{\theta}^{*}=(-0.4926,-0.6280,-0.3283,0.4378,0.5283,-0.6120,-0.6837% ,-0.2061)^{\top}bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = ( - 0.4926 , - 0.6280 , - 0.3283 , 0.4378 , 0.5283 , - 0.6120 , - 0.6837 , - 0.2061 ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT

has 48 support points. The corresponding weights for the 48 support points are, 0.0230, 0.0160, 0.0255, …, 0.0269. However, it is challenging to implement the D-OAD in practice. For instance, given a run size, let us say n=30𝑛30n=30italic_n = 30, how we construct the exact design remains unclear. We cannot have all the 48 support points in the exact design. In addition, it is not easy to round nwi𝑛subscript𝑤𝑖nw_{i}italic_n italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as nwi<1𝑛subscript𝑤𝑖1nw_{i}<1italic_n italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT < 1 for many of the support points in the D-OAD.

Algorithm 1 can be used construct D-OADs and D-OEDs for various design spaces and n𝑛nitalic_n. Representative results are given in Tables 3 and 4. For each design variable, we take 4 equally spaced points on [1,1]11[-1,1][ - 1 , 1 ], i.e., 1,1/3,1/3,1113131-1,-1/3,1/3,1- 1 , - 1 / 3 , 1 / 3 , 1, and construct SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT by including the Cartesian product of the equally spaced points for the 7 variables, which gives N=47=16,384formulae-sequence𝑁superscript4716384N=4^{7}=16,384italic_N = 4 start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT = 16 , 384. Table 3 presents the D-OAD ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT via CVX. It is clear that ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT has 29 support points, which is much less than 48, the number of support points in Xu et al., (2019). In addition, ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT has a loss function value (det{𝐈1(ξϕ,N,𝜽)})1/q=4.9485superscriptsuperscript𝐈1superscriptsubscript𝜉italic-ϕ𝑁superscript𝜽1𝑞4.9485\left(\det\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{\phi,N}^{*},\boldsymbol{% \theta}^{*})\right\}\right)^{1/q}=4.9485( roman_det { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } ) start_POSTSUPERSCRIPT 1 / italic_q end_POSTSUPERSCRIPT = 4.9485, which is smaller than 4.95734.95734.95734.9573, the loss function value of the approximate design in Xu et al., (2019). Thus, CVX solver finds a better D-OAD with a smaller loss function and a smaller number of support points. Table 4 gives an exact design ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT with n=30𝑛30n=30italic_n = 30, which has 22 support points. Some of the support points are not at the corners of the hyper-cube [1,1]7superscript117[-1,1]^{7}[ - 1 , 1 ] start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT, since the annealing algorithm allows us to make small changes of design points in S𝑆Sitalic_S. This ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT has a loss function value (det{𝐈1(ξn,ϕ,𝜽)})1/q=5.1231superscriptsuperscript𝐈1superscriptsubscript𝜉𝑛italic-ϕsuperscript𝜽1𝑞5.1231\left(\det\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{n,\phi}^{*},\boldsymbol{% \theta}^{*})\right\}\right)^{1/q}=5.1231( roman_det { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } ) start_POSTSUPERSCRIPT 1 / italic_q end_POSTSUPERSCRIPT = 5.1231, which yields an efficiency 𝔼ff~(ξn,ϕ)=0.9659~𝔼ffsuperscriptsubscript𝜉𝑛italic-ϕ0.9659\widetilde{\operatorname{\mathbb{E}ff}}(\xi_{n,\phi}^{*})=0.9659over~ start_ARG blackboard_E roman_ff end_ARG ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = 0.9659.

Table 3: D-OAD ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT via CVX with N=47𝑁superscript47N=4^{7}italic_N = 4 start_POSTSUPERSCRIPT 7 end_POSTSUPERSCRIPT.
Support point x1subscript𝑥1x_{1}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT x2subscript𝑥2x_{2}italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT x3subscript𝑥3x_{3}italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT x4subscript𝑥4x_{4}italic_x start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT x5subscript𝑥5x_{5}italic_x start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT x6subscript𝑥6x_{6}italic_x start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT x7subscript𝑥7x_{7}italic_x start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT weight
1 -1 -1 -1 -1 -1 1 1 0.0627
2 -1 -1 -1 1 -1 1 -1 0.0732
3 -1 -1 -1 1 1 -1 1 0.0487
4 -1 -1 1 -1 1 -1 -1 0.0499
5 -1 -1 1 -1 1 1 -1 0.0460
6 -1 1 -1 -1 -1 -1 -1 0.0088
7 -1 1 -1 -1 1 -1 -1 0.0561
8 -1 1 -1 1 -1 1 1 0.0212
9 -1 1 -1 1 1 -1 -1 0.0226
10 -1 1 1 -1 -1 -1 1 0.0840
11 -1 1 1 1 -1 1 -1 0.0306
12 -1 1 1 1 1 -1 1 0.0023
13 -1 1 1 1 1 1 1 0.0730
14 1 -1 -1 -1 -1 -1 1 0.0135
15 1 -1 -1 1 -1 -1 1 0.0217
16 1 -1 -1 1 1 -1 1 0.0415
17 1 -1 1 -1 -1 -1 1 0.0409
18 1 -1 1 -1 -1 1 -1 0.0375
19 1 -1 1 -1 1 -1 -1 0.0073
20 1 -1 1 1 -1 1 1 0.0255
21 1 -1 1 1 1 -1 -1 0.0489
22 1 -1 1 1 1 1 -1 0.0100
23 1 1 -1 -1 -1 -1 -1 0.0491
24 1 1 -1 1 -1 -1 -1 0.0404
25 1 1 1 -1 -1 -1 1 0.0042
26 1 1 1 1 -1 -1 1 0.0058
27 1 1 1 1 -1 1 -1 0.0420
28 1 1 1 1 -1 1 1 0.0033
29 1 1 1 1 1 -1 1 0.0295
Table 4: A D-OED ξn,ϕsuperscriptsubscript𝜉𝑛italic-ϕ\xi_{n,\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT with n=30𝑛30n=30italic_n = 30, where nisubscript𝑛𝑖n_{i}italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT denotes the number of observations at each support point.
Support point x1subscript𝑥1x_{1}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT x2subscript𝑥2x_{2}italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT x3subscript𝑥3x_{3}italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT x4subscript𝑥4x_{4}italic_x start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT x5subscript𝑥5x_{5}italic_x start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT x6subscript𝑥6x_{6}italic_x start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT x7subscript𝑥7x_{7}italic_x start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT nisubscript𝑛𝑖n_{i}italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT
1 -1 -1 -1 -1 -1 1 1 1
2 -1 -1 -1 1 -1 1 -1 2
3 -1 -1 -1 1 -1 1 -0.9999 1
4 -1 -1 -1 1 1 -1 1 2
5 -1 -1 1 -1 1 -1 -1 2
6 -1 -1 1 -1 1 1 -1 2
7 -1 1 -1 -1 -1 -1 -1 1
8 -1 1 -1 -1 1 -1 -1 2
9 -1 1 -1 1 -1 1 1 1
10 -1 1 -1 1 1 -1 -1 1
11 -1 1 1 -1 -1 -1 1 3
12 -1 1 1 1 -1 1 -1 1
13 -1 1 1 1 1 1 1 2
14 -0.9996 1 1 1 1 -1 1 1
15 -0.9995 -1 -1 -1 -1 0.9995 1 2
16 1 -1 -1 1 1 -1 1 1
17 1 -1 1 -1 -1 -1 1 1
18 1 -1 1 -1 -1 1 -1 1
19 1 -1 1 1 1 -1 -1 1
20 1 0.9998 -1 -1 -1 -1 -1 1
21 1 1 -1 1 -1 -1 -1 1
22 1 1 1 1 -1 1 -0.9978 1
Application 4.

(Maximin design for competing criteria in dose finding study) In the dose-finding study, the aim is to find a model that characterizes the dose-response relationship effectively (Dette et al., , 2008). However, a complex decision-making process is required due to its complexity and other external considerations, such as efficacy and ethics tradeoffs. Often, more than one response model is necessary, and determining how to maximize the efficiency of the design for a model becomes a practical question. Here, we consider maximin optimality criteria for multiple objectives and construct maximin exact designs. We use one application with four response models in Wong and Zhou, (2023), which is also studied in Bretz et al., (2010). There are one linear response model, two Emax models with different true parameter values, and one logistic model. Let 𝐈i(ξ,𝜽i)subscript𝐈𝑖𝜉superscriptsubscript𝜽𝑖\boldsymbol{\mathrm{I}}_{i}(\xi,\boldsymbol{\theta}_{i}^{*})bold_I start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ), i=1,,4𝑖14i=1,\ldots,4italic_i = 1 , … , 4, be the information matrices for the four models, respectively, where 𝜽isuperscriptsubscript𝜽𝑖\boldsymbol{\theta}_{i}^{*}bold_italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is the true parameter for model i𝑖iitalic_i, the same as in Wong and Zhou, (2023).

We define maximin optimal designs in the same way as in Section 2.3. If ϕi(ξ)=tr{𝐈i1(ξ,𝜽i)}subscriptitalic-ϕ𝑖𝜉trsuperscriptsubscript𝐈𝑖1𝜉superscriptsubscript𝜽𝑖\phi_{i}(\xi)=\operatorname{tr}\left\{\boldsymbol{\mathrm{I}}_{i}^{-1}(\xi,% \boldsymbol{\theta}_{i}^{*})\right\}italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_ξ ) = roman_tr { bold_I start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } for i=1,,4𝑖14i=1,\ldots,4italic_i = 1 , … , 4, we call it as maximin A-optimal design (maximin A-OAD/A-OED). If ϕi(ξ)=det{𝐈i1(ξ,𝜽i)}subscriptitalic-ϕ𝑖𝜉superscriptsubscript𝐈𝑖1𝜉superscriptsubscript𝜽𝑖\phi_{i}(\xi)=\det\left\{\boldsymbol{\mathrm{I}}_{i}^{-1}(\xi,\boldsymbol{% \theta}_{i}^{*})\right\}italic_ϕ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_ξ ) = roman_det { bold_I start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ , bold_italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } for i=1,,4𝑖14i=1,\ldots,4italic_i = 1 , … , 4, we call it as maximin D-optimal design (maximin D-OAD/D-OED).

Refer to caption
Figure 5: Plots of distribution functions of maximin A-optimal designs. (a) approximate design, (b) exact design for n=10𝑛10n=10italic_n = 10, (c) exact design for n=20𝑛20n=20italic_n = 20, (d) exact design for n=30𝑛30n=30italic_n = 30 and approximate design.

Wong and Zhou, (2023) discussed an algorithm to compute maximin OADs on SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT, and the main idea is to transform the maximin problem into a convex optimization problem and use CVX to find solutions. Applying the algorithm in Wong and Zhou, (2023), we can easily obtain maximin A- and D-OADs ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT via CVX in Step 1 of Algorithm 1. Then we can compute maximin A- and D-OEDs for various values of n𝑛nitalic_n. Representative results are plotted in Figures 5 and 6, where S=[0,500]𝑆0500S=[0,500]italic_S = [ 0 , 500 ], N=201𝑁201N=201italic_N = 201, and n=10𝑛10n=10italic_n = 10, 20, and 30 are used. In Figure 5, the distribution functions of maximin A-OADs and maximin A-OEDs are plotted. From Figure 5(d), the maximin A-OAD and maximin A-OED with n=30𝑛30n=30italic_n = 30 are very similar. The maximin A-OAD has min{𝔼ff~ϕ1(ξ),,𝔼ff~ϕ4(ξ)}=0.7155subscript~𝔼ffsubscriptitalic-ϕ1𝜉subscript~𝔼ffsubscriptitalic-ϕ4𝜉0.7155\min\{\widetilde{\operatorname{\mathbb{E}ff}}_{\phi_{1}}(\xi),\ldots,% \widetilde{\operatorname{\mathbb{E}ff}}_{\phi_{4}}(\xi)\}=0.7155roman_min { over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ξ ) , … , over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ξ ) } = 0.7155, and the maximin A-OEDs have min{𝔼ff~ϕ1(ξ),,𝔼ff~ϕ4(ξ)}=0.6813subscript~𝔼ffsubscriptitalic-ϕ1𝜉subscript~𝔼ffsubscriptitalic-ϕ4𝜉0.6813\min\{\widetilde{\operatorname{\mathbb{E}ff}}_{\phi_{1}}(\xi),\ldots,% \widetilde{\operatorname{\mathbb{E}ff}}_{\phi_{4}}(\xi)\}=0.6813roman_min { over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ξ ) , … , over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ξ ) } = 0.6813, 0.69830.69830.69830.6983 and 0.71210.71210.71210.7121 for n=10𝑛10n=10italic_n = 10, 20202020 and 30303030, respectively.

In Figure 6, the distribution functions of maximin D-OAD and D-OED are plotted. The maximin D-OED has 5 support points, while the exact ones have more than 5 support points. The maximin D-OAD has min{𝔼ff~ϕ1(ξ),,𝔼ff~ϕ4(ξ)}=0.8538subscript~𝔼ffsubscriptitalic-ϕ1𝜉subscript~𝔼ffsubscriptitalic-ϕ4𝜉0.8538\min\{\widetilde{\operatorname{\mathbb{E}ff}}_{\phi_{1}}(\xi),\ldots,% \widetilde{\operatorname{\mathbb{E}ff}}_{\phi_{4}}(\xi)\}=0.8538roman_min { over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ξ ) , … , over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ξ ) } = 0.8538, and the maximin D-OEDs have min{𝔼ff~ϕ1(ξ),,𝔼ff~ϕ4(ξ)}=subscript~𝔼ffsubscriptitalic-ϕ1𝜉subscript~𝔼ffsubscriptitalic-ϕ4𝜉absent\min\{\widetilde{\operatorname{\mathbb{E}ff}}_{\phi_{1}}(\xi),\ldots,% \widetilde{\operatorname{\mathbb{E}ff}}_{\phi_{4}}(\xi)\}=roman_min { over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ξ ) , … , over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ξ ) } = 0.83710.83710.83710.8371, 0.84200.84200.84200.8420 and 0.84590.84590.84590.8459 for n=10𝑛10n=10italic_n = 10, 20 and 30, respectively. From our numerical results, min{𝔼ff~ϕ1(ξ),,𝔼ff~ϕ4(ξ)}subscript~𝔼ffsubscriptitalic-ϕ1𝜉subscript~𝔼ffsubscriptitalic-ϕ4𝜉\min\{\widetilde{\operatorname{\mathbb{E}ff}}_{\phi_{1}}(\xi),\ldots,% \widetilde{\operatorname{\mathbb{E}ff}}_{\phi_{4}}(\xi)\}roman_min { over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ξ ) , … , over~ start_ARG blackboard_E roman_ff end_ARG start_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ξ ) } of the maximin D-OED or A-OED increases as n𝑛nitalic_n increases.

Refer to caption
Figure 6: Plots of distribution functions of maximin D-optimal designs. (a) approximate design, (b) exact design for n=10𝑛10n=10italic_n = 10, (c) exact design for n=20𝑛20n=20italic_n = 20, (d) exact design for n=30𝑛30n=30italic_n = 30 and approximate design.

The gap between the distribution functions of the approximate and exact designs reflects the differences between their weights and slightly different support points. For an exact design with n𝑛nitalic_n runs, the weights have to be multiples of 1/n1𝑛1/n1 / italic_n. There are usually more support points in exact designs than those in approximate designs. Table 5 presents a maximin A-OED with n=20𝑛20n=20italic_n = 20. There are 11 support points. Algorithm 1 is flexible to find highly efficient maximin OEDs with various values of n𝑛nitalic_n.

Table 5: A maximin A-OED with n=20𝑛20n=20italic_n = 20, where nisubscript𝑛𝑖n_{i}italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT denotes the number of observations at each support point.
support point 0 23.07 28.47 36.48 45.77 80.95 83.54 91.30 173.32 217.51 500
nisubscript𝑛𝑖n_{i}italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT 8 1 1 1 1 1 1 1 1 1 3

5 Conclusion

Constructing an OED is challenging, as it involves solving an integer programming problem, which is NP-complete. The conventional methods for obtaining the exact design either involve deriving analytical solutions for low-dimensional problems or rounding approximate designs. However, closed-form solutions usually do not exist for designs involving more than two variables, and rounding may not yield exact designs with high efficiency. We have developed a general algorithm to search for OAEs and highly efficient OEDs. This algorithm is applicable to any criterion with a convex loss function, any design spaces, and any sample sizes n𝑛nitalic_n, although it is particularly useful when n𝑛nitalic_n is small or moderate. For very large n𝑛nitalic_n, a rounding method still performs well in obtaining highly efficient OEDs. Notably, our algorithm also computes the OADs concurrently with OEDs. While there are other numerical algorithms for finding exact designs, they do not compute approximate designs, so it is difficult to access the efficiency of the resulting exact designs; some numerical methods are provided in Section 1. By computing the OADs, we can assess the efficiency of the OEDs relative to the OADs through the modified design efficiency measure. Our method offers a new and alternative approach to finding highly efficient exact designs.

We have provided four applications to demonstrate the effectiveness of our algorithm, which include single and multiple-objective optimality criteria, discrete and high-dimensional design spaces, linear, nonlinear and GLMs. Our proposed algorithm can find highly efficient exact designs for small n𝑛nitalic_n quickly, which is very useful for practical applications. Full implementation of the proposed algorithm is available online for practitioners to use in other real-world applications.

Appendix: Proofs and derivations

Proof of Theorem 1: Write the information matrices of design ξ(𝐱)𝜉𝐱\xi(\boldsymbol{\mathrm{x}})italic_ξ ( bold_x ) in (2) and (5) using a general form as

𝐈(ξ,𝜽)=i=1kwiI(𝐯i,𝜽),𝐈𝜉superscript𝜽superscriptsubscript𝑖1𝑘subscript𝑤𝑖𝐼subscript𝐯𝑖superscript𝜽\displaystyle\boldsymbol{\mathrm{I}}(\xi,\boldsymbol{\theta}^{*})=\sum_{i=1}^{% k}w_{i}I({\bf v}_{i},\boldsymbol{\theta}^{*}),bold_I ( italic_ξ , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_I ( bold_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) , (12)

where I(𝐱,𝜽)=𝐟(𝐱,𝜽)𝐟(𝐱,𝜽)𝐼𝐱superscript𝜽𝐟𝐱superscript𝜽superscript𝐟top𝐱superscript𝜽I(\boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})=\boldsymbol{\mathrm{f}}(% \boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})\boldsymbol{\mathrm{f}}^{\top}% (\boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})italic_I ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = bold_f ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) bold_f start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) in (2), or λ(𝐱,𝜽)𝐟(𝐱)𝐟(𝐱)𝜆𝐱superscript𝜽𝐟𝐱superscript𝐟top𝐱\lambda(\boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})\boldsymbol{\mathrm{f}% }(\boldsymbol{\mathrm{x}})\boldsymbol{\mathrm{f}}^{\top}(\boldsymbol{\mathrm{x% }})italic_λ ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) bold_f ( bold_x ) bold_f start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( bold_x ) in (5). Suppose the OAD ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT has support points 𝐱1,,𝐱mSsuperscriptsubscript𝐱1superscriptsubscript𝐱𝑚𝑆\boldsymbol{\mathrm{x}}_{1}^{*},\ldots,\boldsymbol{\mathrm{x}}_{m}^{*}\in Sbold_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , … , bold_x start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∈ italic_S with corresponding weights, w1,,wmsuperscriptsubscript𝑤1superscriptsubscript𝑤𝑚w_{1}^{*},\ldots,w_{m}^{*}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , … , italic_w start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, respectively.

For part (i), we want to show that limn𝔼ffϕ(ξn,ϕ)=1subscript𝑛subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ1\lim_{n\to\infty}\operatorname{\mathbb{E}ff}_{\phi}(\xi_{n,\phi}^{*})=1roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = 1.

For each n𝑛nitalic_n, compute ai=nwisubscript𝑎𝑖𝑛superscriptsubscript𝑤𝑖a_{i}=n\cdot w_{i}^{*}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_n ⋅ italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT for i=1,,m𝑖1𝑚i=1,\ldots,mitalic_i = 1 , … , italic_m. For large n𝑛nitalic_n such that ai1subscript𝑎𝑖1a_{i}\geq 1italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ 1 for all i=1,,m𝑖1𝑚i=1,\ldots,mitalic_i = 1 , … , italic_m, we construct an exact design ξ(n)superscript𝜉𝑛\xi^{(n)}italic_ξ start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT as follows. The support points of ξ(n)superscript𝜉𝑛\xi^{(n)}italic_ξ start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT are the same as those of ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. We choose integer nisubscript𝑛𝑖n_{i}italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT to be either the floor or the ceiling of aisubscript𝑎𝑖a_{i}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and satisfy i=1mni=nsuperscriptsubscript𝑖1𝑚subscript𝑛𝑖𝑛\sum_{i=1}^{m}n_{i}=n∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_n. Since ai1subscript𝑎𝑖1a_{i}\geq 1italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ 1 for all i=1,,m𝑖1𝑚i=1,\ldots,mitalic_i = 1 , … , italic_m, it is clear that ni1subscript𝑛𝑖1n_{i}\geq 1italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ 1 and |niai|1subscript𝑛𝑖subscript𝑎𝑖1|n_{i}-a_{i}|\leq 1| italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ≤ 1 for all i=1,,m𝑖1𝑚i=1,\ldots,mitalic_i = 1 , … , italic_m. Define the weights of exact design ξ(n)superscript𝜉𝑛\xi^{(n)}italic_ξ start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT to be n1/n,,nm/nsubscript𝑛1𝑛subscript𝑛𝑚𝑛n_{1}/n,\ldots,n_{m}/nitalic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT / italic_n , … , italic_n start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT / italic_n for support points 𝐱1,,𝐱msuperscriptsubscript𝐱1superscriptsubscript𝐱𝑚\boldsymbol{\mathrm{x}}_{1}^{*},\ldots,\boldsymbol{\mathrm{x}}_{m}^{*}bold_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , … , bold_x start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, respectively.

Let δi=ainisubscript𝛿𝑖subscript𝑎𝑖subscript𝑛𝑖\delta_{i}=a_{i}-n_{i}italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for i=1,,m𝑖1𝑚i=1,\ldots,mitalic_i = 1 , … , italic_m. Then wini/n=wiai/n+δi/n=δi/nsuperscriptsubscript𝑤𝑖subscript𝑛𝑖𝑛superscriptsubscript𝑤𝑖subscript𝑎𝑖𝑛subscript𝛿𝑖𝑛subscript𝛿𝑖𝑛w_{i}^{*}-n_{i}/n=w_{i}^{*}-a_{i}/n+\delta_{i}/n=\delta_{i}/nitalic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT / italic_n = italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT / italic_n + italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT / italic_n = italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT / italic_n. Note that |δi|1subscript𝛿𝑖1|\delta_{i}|\leq 1| italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ≤ 1. Evaluate the information matrix of the exact design ξ(n)superscript𝜉𝑛\xi^{(n)}italic_ξ start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT using (12),

𝐈(ξ(n),𝜽)𝐈superscript𝜉𝑛superscript𝜽\displaystyle\boldsymbol{\mathrm{I}}(\xi^{(n)},\boldsymbol{\theta}^{*})bold_I ( italic_ξ start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) =\displaystyle== i=1mninI(𝐱i,𝜽)superscriptsubscript𝑖1𝑚subscript𝑛𝑖𝑛𝐼superscriptsubscript𝐱𝑖superscript𝜽\displaystyle\sum_{i=1}^{m}\frac{n_{i}}{n}~{}I(\boldsymbol{\mathrm{x}}_{i}^{*}% ,\boldsymbol{\theta}^{*})∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT divide start_ARG italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG italic_I ( bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT )
=\displaystyle== i=1m(wiδi/n)I(𝐱i,𝜽)superscriptsubscript𝑖1𝑚superscriptsubscript𝑤𝑖subscript𝛿𝑖𝑛𝐼superscriptsubscript𝐱𝑖superscript𝜽\displaystyle\sum_{i=1}^{m}\left(w_{i}^{*}-\delta_{i}/n\right)I(\boldsymbol{% \mathrm{x}}_{i}^{*},\boldsymbol{\theta}^{*})∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT / italic_n ) italic_I ( bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT )
=\displaystyle== 𝐈(ξϕ,𝜽)1ni=1mδiI(𝐱i,𝜽)𝐈superscriptsubscript𝜉italic-ϕsuperscript𝜽1𝑛superscriptsubscript𝑖1𝑚subscript𝛿𝑖𝐼superscriptsubscript𝐱𝑖superscript𝜽\displaystyle\boldsymbol{\mathrm{I}}(\xi_{\phi}^{*},\boldsymbol{\theta}^{*})-% \frac{1}{n}\sum_{i=1}^{m}\delta_{i}I(\boldsymbol{\mathrm{x}}_{i}^{*},% \boldsymbol{\theta}^{*})bold_I ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) - divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_I ( bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT )
=\displaystyle== 𝐈(ξϕ,𝜽)1n𝐁,𝐈superscriptsubscript𝜉italic-ϕsuperscript𝜽1𝑛𝐁\displaystyle\boldsymbol{\mathrm{I}}(\xi_{\phi}^{*},\boldsymbol{\theta}^{*})-% \frac{1}{n}{\bf B},bold_I ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) - divide start_ARG 1 end_ARG start_ARG italic_n end_ARG bold_B ,

where 𝐁=i=1mδiI(𝐱i,𝜽)𝐁superscriptsubscript𝑖1𝑚subscript𝛿𝑖𝐼superscriptsubscript𝐱𝑖superscript𝜽{\bf B}=\sum_{i=1}^{m}\delta_{i}I(\boldsymbol{\mathrm{x}}_{i}^{*},\boldsymbol{% \theta}^{*})bold_B = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_I ( bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ). Since all entries of I(𝐱,𝜽)𝐼𝐱superscript𝜽I(\boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})italic_I ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) are continuous functions of 𝐱𝐱\boldsymbol{\mathrm{x}}bold_x, S𝑆Sitalic_S is a bounded region, m𝑚mitalic_m is a fixed number, and |δi|1subscript𝛿𝑖1|\delta_{i}|\leq 1| italic_δ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | ≤ 1, all entries of 𝐁𝐁{\bf B}bold_B are bounded. Thus, 1n𝐁𝟎1𝑛𝐁0\frac{1}{n}{\bf B}\to{\bf 0}divide start_ARG 1 end_ARG start_ARG italic_n end_ARG bold_B → bold_0 (a zero matrix), as n𝑛n\to\inftyitalic_n → ∞. This implies that 𝐈(ξ(n),𝜽)𝐈(ξϕ,𝜽)𝐈superscript𝜉𝑛superscript𝜽𝐈superscriptsubscript𝜉italic-ϕsuperscript𝜽\boldsymbol{\mathrm{I}}(\xi^{(n)},\boldsymbol{\theta}^{*})\to\boldsymbol{% \mathrm{I}}(\xi_{\phi}^{*},\boldsymbol{\theta}^{*})bold_I ( italic_ξ start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) → bold_I ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) as n𝑛n\to\inftyitalic_n → ∞, which leads to ϕ{𝐈1(ξ(n),𝜽)}ϕ{𝐈1(ξϕ,𝜽)}italic-ϕsuperscript𝐈1superscript𝜉𝑛superscript𝜽italic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi^{(n)},\boldsymbol{\theta}^{*})% \right\}\to\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{\phi}^{*},\boldsymbol{% \theta}^{*})\right\}italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } → italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } for commonly used optimality criterion.

Notice that ξ(n)superscript𝜉𝑛\xi^{(n)}italic_ξ start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT is an OED with n𝑛nitalic_n points. It is clear that for any n𝑛nitalic_n

ϕ{𝐈1(ξ(n),𝜽)}ϕ{𝐈1(ξn,ϕ,𝜽)}ϕ{𝐈1(ξϕ,𝜽)}.italic-ϕsuperscript𝐈1superscript𝜉𝑛superscript𝜽italic-ϕsuperscript𝐈1superscriptsubscript𝜉𝑛italic-ϕsuperscript𝜽italic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi^{(n)},\boldsymbol{\theta}^{*})% \right\}\geq\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{n,\phi}^{*},% \boldsymbol{\theta}^{*})\right\}\geq\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(% \xi_{\phi}^{*},\boldsymbol{\theta}^{*})\right\}.italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } ≥ italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } ≥ italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } .

From above analysis, ϕ{𝐈1(ξ(n),𝜽)}ϕ{𝐈1(ξϕ,𝜽)}italic-ϕsuperscript𝐈1superscript𝜉𝑛superscript𝜽italic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi^{(n)},\boldsymbol{\theta}^{*})% \right\}\to\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{\phi}^{*},\boldsymbol{% \theta}^{*})\right\}italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUPERSCRIPT ( italic_n ) end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } → italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } as n𝑛n\to\inftyitalic_n → ∞. Therefore, we must have ϕ{𝐈1(ξn,ϕ,𝜽)}ϕ{𝐈1(ξϕ,𝜽)}italic-ϕsuperscript𝐈1superscriptsubscript𝜉𝑛italic-ϕsuperscript𝜽italic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{n,\phi}^{*},\boldsymbol{\theta}^{% *})\right\}\to\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{\phi}^{*},% \boldsymbol{\theta}^{*})\right\}italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } → italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } as n𝑛n\to\inftyitalic_n → ∞, which gives limn𝔼ffϕ(ξn,ϕ)=1subscript𝑛subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉𝑛italic-ϕ1\lim_{n\to\infty}\operatorname{\mathbb{E}ff}_{\phi}(\xi_{n,\phi}^{*})=1roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_n , italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = 1.


For part (ii), we want to show that limN𝔼ffϕ(ξϕ,N)=1subscript𝑁subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉italic-ϕ𝑁1\lim_{N\to\infty}\operatorname{\mathbb{E}ff}_{\phi}(\xi_{\phi,N}^{*})=1roman_lim start_POSTSUBSCRIPT italic_N → ∞ end_POSTSUBSCRIPT start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = 1.

For each SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT, we construct an approximate design ξN(0)superscriptsubscript𝜉𝑁0\xi_{N}^{(0)}italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT with m𝑚mitalic_m support points, which are selected as follows. By the construction of SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT, we can find a sequence of points, 𝐮i,NSNsubscript𝐮𝑖𝑁subscript𝑆𝑁{\bf u}_{i,N}\in S_{N}bold_u start_POSTSUBSCRIPT italic_i , italic_N end_POSTSUBSCRIPT ∈ italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT, i=1,,m𝑖1𝑚i=1,\ldots,mitalic_i = 1 , … , italic_m, such that limN𝐮i,N=𝐱isubscript𝑁subscript𝐮𝑖𝑁superscriptsubscript𝐱𝑖\lim_{N\to\infty}{\bf u}_{i,N}=\boldsymbol{\mathrm{x}}_{i}^{*}roman_lim start_POSTSUBSCRIPT italic_N → ∞ end_POSTSUBSCRIPT bold_u start_POSTSUBSCRIPT italic_i , italic_N end_POSTSUBSCRIPT = bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, for all i=1,,m𝑖1𝑚i=1,\ldots,mitalic_i = 1 , … , italic_m. Then we choose 𝐮i,NSNsubscript𝐮𝑖𝑁subscript𝑆𝑁{\bf u}_{i,N}\in S_{N}bold_u start_POSTSUBSCRIPT italic_i , italic_N end_POSTSUBSCRIPT ∈ italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT, i=1,,m𝑖1𝑚i=1,\ldots,mitalic_i = 1 , … , italic_m, as the support points of ξN(0)superscriptsubscript𝜉𝑁0\xi_{N}^{(0)}italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT, and their weights are w1,,wmsuperscriptsubscript𝑤1superscriptsubscript𝑤𝑚w_{1}^{*},\ldots,w_{m}^{*}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , … , italic_w start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT (from ξϕsuperscriptsubscript𝜉italic-ϕ\xi_{\phi}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT), respectively. The information matrix of the approximate design ξN(0)superscriptsubscript𝜉𝑁0\xi_{N}^{(0)}italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT using (12) is,

𝐈(ξN(0),𝜽)𝐈superscriptsubscript𝜉𝑁0superscript𝜽\displaystyle\boldsymbol{\mathrm{I}}(\xi_{N}^{(0)},\boldsymbol{\theta}^{*})bold_I ( italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) =\displaystyle== i=1mwiI(𝐮i,N,𝜽)superscriptsubscript𝑖1𝑚superscriptsubscript𝑤𝑖𝐼subscript𝐮𝑖𝑁superscript𝜽\displaystyle\sum_{i=1}^{m}w_{i}^{*}~{}I({\bf u}_{i,N},\boldsymbol{\theta}^{*})∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT italic_I ( bold_u start_POSTSUBSCRIPT italic_i , italic_N end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT )
\displaystyle\to i=1mwiI(𝐱i,𝜽),asNsuperscriptsubscript𝑖1𝑚superscriptsubscript𝑤𝑖𝐼superscriptsubscript𝐱𝑖superscript𝜽as𝑁\displaystyle\sum_{i=1}^{m}w_{i}^{*}~{}I(\boldsymbol{\mathrm{x}}_{i}^{*},% \boldsymbol{\theta}^{*}),~{}\mbox{as}~{}N\to\infty∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_w start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT italic_I ( bold_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) , as italic_N → ∞
=\displaystyle== 𝐈(ξϕ,𝜽),𝐈superscriptsubscript𝜉italic-ϕsuperscript𝜽\displaystyle\boldsymbol{\mathrm{I}}(\xi_{\phi}^{*},\boldsymbol{\theta}^{*}),bold_I ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) ,

since all entries of I(𝐱,𝜽)𝐼𝐱superscript𝜽I(\boldsymbol{\mathrm{x}},\boldsymbol{\theta}^{*})italic_I ( bold_x , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) are continuous functions of 𝐱𝐱\boldsymbol{\mathrm{x}}bold_x. Similar to the proof of part (i), it is clear that ϕ{𝐈1(ξN(0),𝜽)}ϕ{𝐈1(ξϕ,𝜽)}italic-ϕsuperscript𝐈1superscriptsubscript𝜉𝑁0superscript𝜽italic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{N}^{(0)},\boldsymbol{\theta}^{*})% \right\}\to\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{\phi}^{*},\boldsymbol{% \theta}^{*})\right\}italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } → italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } as N𝑁N\to\inftyitalic_N → ∞. Since ξN(0)superscriptsubscript𝜉𝑁0\xi_{N}^{(0)}italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT is a design on SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT and ξϕ,Nsuperscriptsubscript𝜉italic-ϕ𝑁\xi_{\phi,N}^{*}italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT is an OAD on SNsubscript𝑆𝑁S_{N}italic_S start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT, it follows that

ϕ{𝐈1(ξN(0),𝜽)}ϕ{𝐈1(ξϕ,N,𝜽)}ϕ{𝐈1(ξϕ,𝜽)}.italic-ϕsuperscript𝐈1superscriptsubscript𝜉𝑁0superscript𝜽italic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕ𝑁superscript𝜽italic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{N}^{(0)},\boldsymbol{\theta}^{*})% \right\}\geq\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{\phi,N}^{*},% \boldsymbol{\theta}^{*})\right\}\geq\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(% \xi_{\phi}^{*},\boldsymbol{\theta}^{*})\right\}.italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } ≥ italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } ≥ italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } .

Thus, ϕ{𝐈1(ξϕ,N,𝜽)}ϕ{𝐈1(ξϕ,𝜽)}italic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕ𝑁superscript𝜽italic-ϕsuperscript𝐈1superscriptsubscript𝜉italic-ϕsuperscript𝜽\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{\phi,N}^{*},\boldsymbol{\theta}^{% *})\right\}\to\phi\left\{\boldsymbol{\mathrm{I}}^{-1}(\xi_{\phi}^{*},% \boldsymbol{\theta}^{*})\right\}italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } → italic_ϕ { bold_I start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) } as N𝑁N\to\inftyitalic_N → ∞, which implies that limN𝔼ffϕ(ξϕ,N)=1subscript𝑁subscript𝔼ffitalic-ϕsuperscriptsubscript𝜉italic-ϕ𝑁1\lim_{N\to\infty}\operatorname{\mathbb{E}ff}_{\phi}(\xi_{\phi,N}^{*})=1roman_lim start_POSTSUBSCRIPT italic_N → ∞ end_POSTSUBSCRIPT start_OPFUNCTION blackboard_E roman_ff end_OPFUNCTION start_POSTSUBSCRIPT italic_ϕ end_POSTSUBSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_ϕ , italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) = 1. \Box


Acknowledgements

This research was partially supported by Discovery Grants from the Natural Sciences and Engineering Research Council of Canada. The first author is partially supported by the CANSSI Distinguished Postdoctoral Fellowship from the Canadian Statistical Sciences Institute.

Declarations

The authors have no conflicts of interest to declare.

References

  • Berger and Wong, (2009) Berger, M. P. and Wong, W.-K. (2009). An Introduction to Optimal Designs for Social and Biomedical Research. Wiley, New York.
  • Bose and Mukerjee, (2015) Bose, M. and Mukerjee, R. (2015). Optimal design measures under asymmetric errors, with application to binary design points. Journal of Statistical Planning and Inference, 159:28–36.
  • Boyd and Vandenberghe, (2004) Boyd, S. P. and Vandenberghe, L. (2004). Convex Optimization. Cambridge University Press, New York.
  • Bretz et al., (2010) Bretz, F., Dette, H., and Pinheiro, J. C. (2010). Practical considerations for optimal designs in clinical dose finding studies. Statistics in Medicine, 29:731–742.
  • Broudiscou et al., (1996) Broudiscou, A., Leardi, R., and Phan-Tan-Luu, R. (1996). Genetic algorithm as a tool for selection of D-optimal design. Chemometrics and Intelligent Laboratory Systems, 35:105–116.
  • Chen et al., (2015) Chen, R.-B., Chang, S.-P., Wang, W., Tung, H.-C., and Wong, W. K. (2015). Minimax optimal designs via particle swarm optimization methods. Statistics and Computing, 25:975–988.
  • Dean et al., (2015) Dean, A. M., Morris, M., Stufken, J., and Bingham, D. (2015). Handbook of Design and Analysis of Experiments. CRC Press, Boca Raton.
  • Dette et al., (2008) Dette, H., Bretz, F., Pepelyshev, A., and Pinheiro, J. (2008). Optimal designs for dose-finding studies. Journal of the American Statistical Association, 103:1225–1237.
  • Duan et al., (2022) Duan, J., Gao, W., Ma, Y., and Ng, H. K. T. (2022). Efficient computational algorithms for approximate optimal designs. Journal of Statistical Computation and Simulation, 92:764–793.
  • Duarte et al., (2020) Duarte, B. P., Granjo, J. F., and Wong, W. K. (2020). Optimal exact designs of experiments via mixed integer nonlinear programming. Statistics and Computing, 30:93–112.
  • Duarte and Wong, (2014) Duarte, B. P. and Wong, W. K. (2014). A semi-infinite programming based algorithm for finding minimax optimal designs for nonlinear models. Statistics and Computing, 24:1063–1080.
  • Duarte and Wong, (2015) Duarte, B. P. and Wong, W. K. (2015). Finding bayesian optimal designs for nonlinear models: a semidefinite programming-based approach. International Statistical Review, 83:239–262.
  • Duarte et al., (2015) Duarte, B. P., Wong, W. K., and Atkinson, A. C. (2015). A semi-infinite programming based algorithm for determining T-optimum designs for model discrimination. Journal of Multivariate Analysis, 135:11–24.
  • Fedorov, (1972) Fedorov, V. V. (1972). Theory of Optimal Experiments. Academic Press, New York.
  • Gao et al., (2024) Gao, L. L., Ye, J. J., Zeng, S., and Zhou, J. (2024+). Necessary and sufficient conditions for multiple objective optimal regression designs. Statistica Sinica, to appear.
  • Gao and Zhou, (2017) Gao, L. L. and Zhou, J. (2017). D-optimal designs based on the second-order least squares estimator. Statistical Papers, 58:77–94.
  • Grant and Boyd, (2020) Grant, M. C. and Boyd, S. P. (2020). The CVX users’ guide, release 2.2. https://cvxr.com/cvx/doc/CVX.pdf. Online; accessed 02 April 2024.
  • Haines et al., (2018) Haines, L. M., Kabera, G. M., and Ndlovu, P. (2018). D-optimal designs for the two-variable binary logistic regression model with interaction. Journal of Statistical Planning and Inference, 193:136–150.
  • Hamada et al., (2001) Hamada, M., Martz, H., Reese, C., and Wilson, A. (2001). Finding near-optimal bayesian experimental designs via genetic algorithms. The American Statistician, 55:175–181.
  • Huang et al., (2017) Huang, S.-H., Huang, M.-N. L., Shedden, K., and Wong, W. K. (2017). Optimal group testing designs for estimating prevalence with uncertain testing errors. Journal of the Royal Statistical Society, Series B, 79:1547–1563.
  • Hughes-Oliver and Rosenberger, (2000) Hughes-Oliver, J. M. and Rosenberger, W. F. (2000). Efficient estimation of the prevalence of multiple rare traits. Biometrika, 87:315–327.
  • Hughes-Oliver and Swallow, (1994) Hughes-Oliver, J. M. and Swallow, W. H. (1994). A two-stage adaptive group-testing procedure for estimating small proportions. Journal of the American Statistical Association, 89:982–993.
  • John and Draper, (1975) John, R. S. and Draper, N. R. (1975). D-optimality for regression designs: a review. Technometrics, 17:15–23.
  • Kiefer, (1959) Kiefer, J. (1959). Optimum experimental designs. Journal of the Royal Statistical Society, Series B, 21:272–304.
  • Kiefer, (1974) Kiefer, J. (1974). General equivalence theory for optimum designs (approximate theory). The Annals of Statistics, 2:849–879.
  • López-Fidalgo, (2023) López-Fidalgo, J. (2023). Optimal Experimental Design: A Concise Introduction for Researchers. Springer, New York.
  • Mandal et al., (2015) Mandal, A., Wong, W. K., and Yu, Y. (2015). Algorithmic searches for optimal designs. In Angela Dean, Max Morris, J. S. and Bingham, D., editors, Handbook of Design and Analysis of Experiments, chapter 21, pages 755–783. CRC Press, Roca Raton.
  • Meyer and Nachtsheim, (1988) Meyer, R. K. and Nachtsheim, C. J. (1988). Constructing exact d-optimal experimental designs by simulated annealing. American Journal of Mathematical and Management Sciences, 8:329–359.
  • Meyer and Nachtsheim, (1995) Meyer, R. K. and Nachtsheim, C. J. (1995). The coordinate-exchange algorithm for constructing exact optimal experimental designs. Technometrics, 37:60–69.
  • Mukerjee and Huda, (2016) Mukerjee, R. and Huda, S. (2016). Approximate theory-aided robust efficient factorial fractions under baseline parametrization. Annals of the Institute of Statistical Mathematics, 68:787–803.
  • Palhazi Cuervo et al., (2016) Palhazi Cuervo, D., Goos, P., and Sörensen, K. (2016). Optimal design of large-scale screening experiments: a critical look at the coordinate-exchange algorithm. Statistics and Computing, 26:15–28.
  • Papp, (2012) Papp, D. (2012). Optimal designs for rational function regression. Journal of the American Statistical Association, 107:400–411.
  • Pukelsheim, (1993) Pukelsheim, F. (1993). Optimal Design of Experiments. Wiley, New York.
  • Pukelsheim and Rieder, (1992) Pukelsheim, F. and Rieder, S. (1992). Efficient rounding of approximate designs. Biometrika, 79:763–770.
  • Rempel and Zhou, (2014) Rempel, M. F. and Zhou, J. (2014). On exact k-optimal designs minimizing the condition number. Communications in Statistics-Theory and Methods, 43:1114–1131.
  • Smucker et al., (2012) Smucker, B. J., del Castillo, E., and Rosenberger, J. L. (2012). Model-robust two-level designs using coordinate exchange algorithms and a maximin criterion. Technometrics, 54:367–375.
  • Wilmut and Zhou, (2011) Wilmut, M. and Zhou, J. (2011). D-optimal minimax design criterion for two-level fractional factorial designs. Journal of Statistical Planning and Inference, 141:576–587.
  • Wong and Zhou, (2019) Wong, W. K. and Zhou, J. (2019). CVX-based algorithms for constructing various optimal regression designs. Canadian Journal of Statistics, 47:374–391.
  • Wong and Zhou, (2023) Wong, W. K. and Zhou, J. (2023). Using CVX to construct optimal designs for biomedical studies with multiple objectives. Journal of Computational and Graphical Statistics, 32:744–753.
  • Xu et al., (2019) Xu, W., Wong, W. K., Tan, K. C., and Xu, J.-X. (2019). Finding high-dimensional D-optimal designs for logistic models via differential evolution. IEEE Access, 7:7133–7146.
  • Yang et al., (2013) Yang, M., Biedermann, S., and Tang, E. (2013). On optimal designs for nonlinear models: a general and efficient algorithm. Journal of the American Statistical Association, 108:1411–1420.
  • Ye et al., (2017) Ye, J. J., Zhou, J., and Zhou, W. (2017). Computing A-optimal and E-optimal designs for regression models via semidefinite programming. Communications in Statistics-Simulation and Computation, 46:2011–2024.
  • Yu, (2011) Yu, Y. (2011). D-optimal designs via a cocktail algorithm. Statistics and Computing, 21:475–481.
  • Zhang and Mukerjee, (2013) Zhang, R. and Mukerjee, R. (2013). Highly efficient factorial designs for cdna microarray experiments: use of approximate theory together with a step-up step-down procedure. Statistical Applications in Genetics and Molecular Biology, 12:489–503.