Statistical inference on partially shape-constrained function-on-scalar linear regression models

Kyunghee Han University of Illinois at Chicago Yeonjoo Park Corresponding author ([email protected]) University of Texas at San Antonio Soo-Young Kim Fred Hutchinson Cancer Center
Abstract

We consider functional linear regression models where functional outcomes are associated with scalar predictors by coefficient functions with shape constraints, such as monotonicity and convexity, that apply to sub-domains of interest. To validate the partial shape constraints, we propose testing a composite hypothesis of linear functional constraints on regression coefficients. Our approach employs kernel- and spline-based methods within a unified inferential framework, evaluating the statistical significance of the hypothesis by measuring an L2superscript𝐿2L^{2}italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT-distance between constrained and unconstrained model fits. In the theoretical study of large-sample analysis under mild conditions, we show that both methods achieve the standard rate of convergence observed in the nonparametric estimation literature. Through numerical experiments of finite-sample analysis, we demonstrate that the type I error rate keeps the significance level as specified across various scenarios and that the power increases with sample size, confirming the consistency of the test procedure under both estimation methods. Our theoretical and numerical results provide researchers the flexibility to choose a method based on computational preference. The practicality of partial shape-constrained inference is illustrated by two data applications: one involving clinical trials of NeuroBloc in type A-resistant cervical dystonia and the other with the National Institute of Mental Health Schizophrenia Study.

Keywords: Nonparametric estimation, partial shape constraints, shape-constrained kernel least squares, shape-constrained regression spline, testing

1 Introduction

As function-valued data acquisition becomes increasingly common, there has been a significant amount of work devoted to functional regression models to address coefficient function estimation and inferential capabilities [21, 32]. Among them, inference for the function-on-scalar regression (FoSR) model has been making implications in many fields [24] by identifying a dynamic association between response and scalar covariates, varying over the domain. Besides finding the statistical evidence of a non-null covariate effect on response trajectories over the domain, validating the shape of functional regression coefficients, such as monotonicity, convexity, or concavity, over a specific subset of the domain is crucial for domain experts to allow tangible interpretation in practice. Once the functional shape is validated, one may incorporate the corresponding conditions to the model estimates to avoid potential biases. Or, in practical estimation problems, shape restrictions on functional regression coefficients over specific sub-intervals of the domain can be known as prior knowledge.

Our motivating example is the functional data analysis approach to demonstrating the treatment efficacy of drug use during the period of clinical trials in the National Institute of Mental Health Schizophrenia Collaborative Study. The previous studies, including [1] and [12], showed a significant continual drop in disease severity for the treatment group, i.e., improving drug efficacy, over weeks through the shape-constraint test on the drug effect coefficient function. While the study design and models are provided in Section 4.2, the left panel of Figure 1 displays the collected longitudinal disease severity measures surveyed from the initial treatment (week 0) to week 6 from randomly selected 30 patients of placebo and treatment cohorts. However, refined conclusions over specific periods can be of practical interest to practitioners, such as when the maximum drug efficacy is achieved or whether we can conclude significant improvement in drug effectiveness even during the later weeks of trials. As we shall see in Section 4.2, our proposed partial inference method concludes the significant monotone decrease in disease severity owing to the drug treatment over weeks 0–3, followed by the consistent duration of such efficacy for the remainder weeks, illustrated with the blue solid line in the right panel of Figure 1. This is somewhat distinct from constrained estimates under the condition of monotone decrease over the entire domain, the conclusion of [12], and unconstrained smooth estimates, which are displayed with the black dotted and dashed lines, respectively.

Refer to caption
Figure 1: The left panel illustrates the individual trajectories of disease severity for a subset of patients in the treatment and placebo groups. The average disease severity for each group (solid line) shows an overall decrease, with the treatment group experiencing a rapid drop in the first few weeks, followed by a continual reduction at a seemingly similar rate for both groups. As shown in the right panel, [12] previously validated this observation using globally monotone shape-constrained estimates (black dashed line), which provide more explicit and decision-making interpretations than unconstrained estimates (black dotted line). However, the global shape-constrained inference may be misleading by ignoring the rebound after Week 5, as indicated by the unconstrained estimates. Our proposed method (blue solid line) with partial shape constraints refines the conclusion, suggesting that the duration of drug action may effectively extend up to Week 3, with sustained efficacy thereafter.

The recent literature on functional data analysis and its applications also pays attention to shape-restricted estimation and inference in functional regression models. For the inference on shape-constrained regression coefficients, [6] proposed the goodness-of-fit test based on empirical processes projected to the space with shape constraints, while [22] addresses a similar problem under the null hypothesis with linear operator constraints. For estimation, [4] extended the shape-constrained kernel smoothing to the functional and longitudinal data framework, and [12] employed the Bernstein polynomials for shape-constrained estimation in functional regression with one of the data application results from [12] being illustrated in Figure 1. To name of few other applications, in economics, motivated by the pioneering work from Slutsky [31] recognizing the necessity of shape-restricted estimation and inference, [5] elaborated theoretical and application works in econometric research. In public health research, the analysis of growth charts under the monotone increasing shape restrictions has provided a crucial clinical tool for growth screening during infancy, childhood, and adolescence [15] or such growth chart helps health providers assess monotone increasing growth patterns against age-specific percentile curves [9]. The reliability engineering society also paid attention to this topic for bathtub-shaped function estimation in assessing the degradation of system reliability. However, to our best knowledge, inference for partial shape constraints has not been recognized, although such interests can be plausible in practice.

In this study, we propose the inferential tool to validate the partial shape constraints on functional regression coefficients in FoSR, along with two corresponding estimation approaches using kernel-smoothing and spline techniques. Suppose one is interested in the shape constraints on a sub-interval =[a,b][0,1]𝑎𝑏01\mathcal{I}=[a,b]\subset[0,1]caligraphic_I = [ italic_a , italic_b ] ⊂ [ 0 , 1 ], where collected response trajectories span [0,1]01[0,1][ 0 , 1 ]. Under the existing tools for shape-constrained inference, one might consider using a part of functional observations restricted on \mathcal{I}caligraphic_I and apply the standard testing procedure. However, such an approach will cause a significant drop in sample sizes in the case of a bounded number of measurements on response trajectories, which is common in longitudinal studies, including our motivating example illustrated in Figure 1. The situation becomes much worse depending on the length of \mathcal{I}caligraphic_I or the sparsity of evaluation grids on it. Consequently, this naive approach would result in a severe deterioration in the power. To prevent this, we propose to borrow information from observations over the entire domain to perform the inference even for testing focused on specific subsets of the domain. It indeed prevents the significant drop in the power of the test while kee** the desirable level of type-I error, as demonstrated in the simulation studies of Section 3. The other contribution of our paper is in develo** two partial shape-constrained regression estimators in parallel using the most widely used nonparametric smoothing techniques, kernel-smoothing and smoothing splines. The proposed unified testing tool applicable to both estimates would provide practical flexibility in a real application. We additionally derive asymptotic behaviors of the regression coefficient estimators with embedded partial shape constraints.

The article is organized as follows. Section 2 presents the proposed method and results of the study. We introduce the partially shape-constrained FoSR models in Section 2.1. The estimation methods and theoretical findings are provided in Sections 2.2 and 2.3 that cover the kernel and spline estimators, respectively. In Section 2.4, we establish a unified inferential procedure for testing the partial shape constraints, encompassing kernel and spline approaches. Simulation results are reported in Section 3, and data applications are illustrated with two examples in Section 4. Technical details and proofs are provided in Supplementary Material.

2 Methodology and Theory

2.1 Testing partial shape constraints in FoSR models

Let Yisubscript𝑌𝑖Y_{i}italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT be the functional response coupled with a vector covariate 𝐗i=(Xi,1,,Xi,p)subscript𝐗𝑖superscriptsubscript𝑋𝑖1subscript𝑋𝑖𝑝top\mathbf{X}_{i}=(X_{i,1},\ldots,X_{i,p})^{\top}bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ( italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_i , italic_p end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT as

Yi(t)=j=1pβj(t)Xi,j+εi(t)(t[0,1])subscript𝑌𝑖𝑡superscriptsubscript𝑗1𝑝subscript𝛽𝑗𝑡subscript𝑋𝑖𝑗subscript𝜀𝑖𝑡𝑡01\displaystyle Y_{i}(t)=\sum_{j=1}^{p}\beta_{j}(t)X_{i,j}+\varepsilon_{i}(t)% \quad(t\in[0,1])italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) = ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) italic_X start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT + italic_ε start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) ( italic_t ∈ [ 0 , 1 ] ) (2.1)

independently for i=1,,n𝑖1𝑛i=1,\ldots,nitalic_i = 1 , … , italic_n, where 𝜷(t)=(β1(t),,βp(t))𝜷𝑡superscriptsubscript𝛽1𝑡subscript𝛽𝑝𝑡top\boldsymbol{\beta}(t)=\big{(}\beta_{1}(t),\ldots,\beta_{p}(t)\big{)}^{\top}bold_italic_β ( italic_t ) = ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) , … , italic_β start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT is a vector coefficient function and εi(t)subscript𝜀𝑖𝑡\varepsilon_{i}(t)italic_ε start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) is a mean-zero stochastic process that models the regression error uncorrelated with 𝐗isubscript𝐗𝑖\mathbf{X}_{i}bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, i.e., 𝔼(εi(t)|𝐗i)=0𝔼conditionalsubscript𝜀𝑖𝑡subscript𝐗𝑖0\mathbb{E}(\varepsilon_{i}(t)|\mathbf{X}_{i})=0blackboard_E ( italic_ε start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) | bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = 0. We assume that Xi,1,,Xi,psubscript𝑋𝑖1subscript𝑋𝑖𝑝X_{i,1},\ldots,X_{i,p}italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_i , italic_p end_POSTSUBSCRIPT are linearly independent with positive probability, which allows the design matrix to include the intercept, e.g., Xi,1=1subscript𝑋𝑖11X_{i,1}=1italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT = 1 for all i=1,,n𝑖1𝑛i=1,\ldots,nitalic_i = 1 , … , italic_n, depending on the inferential interest. Despite the generality of the functional model (2.1), it is practically infeasible to observe the functional outcomes as the infinite-dimensional objects, and we assume that discrete evaluations 𝐘i=(Yi,1,,Yi,Li)subscript𝐘𝑖superscriptsubscript𝑌𝑖1subscript𝑌𝑖subscript𝐿𝑖top\mathbf{Y}_{i}=(Y_{i,1},\ldots,Y_{i,L_{i}})^{\top}bold_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ( italic_Y start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT , … , italic_Y start_POSTSUBSCRIPT italic_i , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT with Yi,=Yi(Ti,)subscript𝑌𝑖subscript𝑌𝑖subscript𝑇𝑖Y_{i,\ell}=Y_{i}(T_{i,\ell})italic_Y start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT = italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ) are only accessible for =1,,Li1subscript𝐿𝑖\ell=1,\ldots,L_{i}roman_ℓ = 1 , … , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, where Ti,1,,Ti,Lisubscript𝑇𝑖1subscript𝑇𝑖subscript𝐿𝑖T_{i,1},\ldots,T_{i,L_{i}}italic_T start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT , … , italic_T start_POSTSUBSCRIPT italic_i , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT is a random sample of T𝑇Titalic_T with a probability density function π𝜋\piitalic_π on [0,1]01[0,1][ 0 , 1 ]. Here, Lisubscript𝐿𝑖L_{i}italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is an independent random integer that may or may not depend on the sample size n𝑛nitalic_n. More detailed conditions on Lisubscript𝐿𝑖L_{i}italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT required for kernel and regression spline methods are provided in the Supplementary Material S.1 and S.2. We denote the finite random sample as 𝒳n={(𝐘i,𝐗i):i=1,,n}subscript𝒳𝑛conditional-setsubscript𝐘𝑖subscript𝐗𝑖𝑖1𝑛\mathcal{X}_{n}=\{(\mathbf{Y}_{i},\mathbf{X}_{i}):i=1,\ldots,n\}caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = { ( bold_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) : italic_i = 1 , … , italic_n }.

Suppose we test the functional shape of βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT on a pre-specified sub-interval j[0,1]subscript𝑗01\mathcal{I}_{j}\subset[0,1]caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ⊂ [ 0 , 1 ] for jJ𝑗𝐽j\in Jitalic_j ∈ italic_J with J{1,,p}𝐽1𝑝J\subset\{1,\ldots,p\}italic_J ⊂ { 1 , … , italic_p }. The null hypothesis “H0:H0,j is true for all jJ:subscript𝐻0H0,j is true for all jJH_{0}:\textrm{$H_{0,j}$ is true for all $j\in J$}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : italic_H start_POSTSUBSCRIPT 0 , italic_j end_POSTSUBSCRIPT is true for all italic_j ∈ italic_J” consists of individual hypotheses

H0,j:βj is 𝒜j-shaped on j:subscript𝐻0𝑗βj is 𝒜j-shaped on j\displaystyle H_{0,j}:\textrm{$\beta_{j}$ is $\mathcal{A}_{j}$-shaped on $% \mathcal{I}_{j}$}italic_H start_POSTSUBSCRIPT 0 , italic_j end_POSTSUBSCRIPT : italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is caligraphic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT -shaped on caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT (2.2)

for jJ𝑗𝐽j\in Jitalic_j ∈ italic_J. For example, if we hypothesize that βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is monotone increasing on jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, we read (2.2) as “H0,j:βj is monotone increasing on j:subscript𝐻0𝑗βj is monotone increasing on jH_{0,j}:\textrm{$\beta_{j}$ is monotone increasing on $\mathcal{I}_{j}$}italic_H start_POSTSUBSCRIPT 0 , italic_j end_POSTSUBSCRIPT : italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is monotone increasing on caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT.” Similarly, if one tests a partial convexity hypothesis, (2.2) can also be written as “H0,j:βj is convex on j:subscript𝐻0𝑗βj is convex on jH_{0,j}:\textrm{$\beta_{j}$ is convex on $\mathcal{I}_{j}$}italic_H start_POSTSUBSCRIPT 0 , italic_j end_POSTSUBSCRIPT : italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is convex on caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT.” In this paper, monotone and convex hypotheses will mainly be exemplified, yet one can extend our framework to general shape constraints, similarly as covered by many others in the literature [17, 18, 27, 28, 20, 25].

We then propose to reject the null hypothesis H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT if the test statistic

Dn=1ni=1n01{𝐗i(𝜷^(t)𝜷~(t))}2dtsubscript𝐷𝑛1𝑛superscriptsubscript𝑖1𝑛superscriptsubscript01superscriptsuperscriptsubscript𝐗𝑖top^𝜷𝑡~𝜷𝑡2differential-d𝑡\displaystyle\begin{split}D_{n}=\frac{1}{n}\sum_{i=1}^{n}\int_{0}^{1}\left\{% \mathbf{X}_{i}^{\top}\big{(}\widehat{\boldsymbol{\beta}}(t)-\widetilde{% \boldsymbol{\beta}}(t)\big{)}\right\}^{2}\,\mathrm{d}t\end{split}start_ROW start_CELL italic_D start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT { bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( over^ start_ARG bold_italic_β end_ARG ( italic_t ) - over~ start_ARG bold_italic_β end_ARG ( italic_t ) ) } start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_d italic_t end_CELL end_ROW (2.3)

is significantly large, where 𝜷^(t)=(β^1(t),,β^p(t))^𝜷𝑡superscriptsubscript^𝛽1𝑡subscript^𝛽𝑝𝑡top\widehat{\boldsymbol{\beta}}(t)=\big{(}\hat{\beta}_{1}(t),\ldots,\hat{\beta}_{% p}(t)\big{)}^{\top}over^ start_ARG bold_italic_β end_ARG ( italic_t ) = ( over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) , … , over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT and 𝜷~(t)=(β~1(t),,β~p(t))~𝜷𝑡superscriptsubscript~𝛽1𝑡subscript~𝛽𝑝𝑡top\widetilde{\boldsymbol{\beta}}(t)=\big{(}\tilde{\beta}_{1}(t),\ldots,\tilde{% \beta}_{p}(t)\big{)}^{\top}over~ start_ARG bold_italic_β end_ARG ( italic_t ) = ( over~ start_ARG italic_β end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) , … , over~ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT denote the estimates of coefficient function 𝜷(t)=(β1(t),,βp(t))𝜷𝑡superscriptsubscript𝛽1𝑡subscript𝛽𝑝𝑡top\boldsymbol{\beta}(t)=\big{(}\beta_{1}(t),\ldots,\beta_{p}(t)\big{)}^{\top}bold_italic_β ( italic_t ) = ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) , … , italic_β start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT under the null hypothesis H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and its general alternative H1subscript𝐻1H_{1}italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, respectively. While this Dnsubscript𝐷𝑛D_{n}italic_D start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is partly motivated by [22], a goodness-of-fit based test statistic for validating functional constraints on βj(t)subscript𝛽𝑗𝑡\beta_{j}(t)italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) via linear operator, similar types of the L2superscript𝐿2L^{2}italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT-based test statistics were frequently employed in the literature for testing the nullity of functional difference for general scope [30, 36, 35]. Indeed, our proposed method corresponds to the global shape constraints if one sets j=[0,1]subscript𝑗01\mathcal{I}_{j}=[0,1]caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = [ 0 , 1 ]. The following Sections 2.2 and 2.3 elaborate on how to obtain 𝜷^(t)^𝜷𝑡\widehat{\boldsymbol{\beta}}(t)over^ start_ARG bold_italic_β end_ARG ( italic_t ) under kernel-smoothing and spline-smoothing approaches, respectively. Based on those estimators, we present a bootstrap test procedure in Section 2.4.

2.2 Estimation: Shape-constrained kernel-weighted least squares

We assume that the vector coefficient 𝜷(t)𝜷𝑡\boldsymbol{\beta}(t)bold_italic_β ( italic_t ) is as smooth as it allows local linear approximation 𝜷(u)𝜷(t)+𝜷˙(t)(ut)𝜷𝑢𝜷𝑡˙𝜷𝑡𝑢𝑡\boldsymbol{\beta}(u)\approx\boldsymbol{\beta}(t)+\dot{\boldsymbol{\beta}}(t)(% u-t)bold_italic_β ( italic_u ) ≈ bold_italic_β ( italic_t ) + over˙ start_ARG bold_italic_β end_ARG ( italic_t ) ( italic_u - italic_t ) for u𝑢uitalic_u near t𝑡titalic_t, where 𝜷˙(t)=(β1(t),,βp(t))˙𝜷𝑡superscriptsuperscriptsubscript𝛽1𝑡superscriptsubscript𝛽𝑝𝑡top\dot{\boldsymbol{\beta}}(t)=\big{(}\beta_{1}^{\prime}(t),\ldots,\beta_{p}^{% \prime}(t)\big{)}^{\top}over˙ start_ARG bold_italic_β end_ARG ( italic_t ) = ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_t ) , … , italic_β start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT denotes the gradient of 𝜷(t)𝜷𝑡\boldsymbol{\beta}(t)bold_italic_β ( italic_t ). The local linear kernel estimator of 𝔹(t)=[𝜷(t),𝜷˙(t)]p×2𝔹𝑡𝜷𝑡˙𝜷𝑡superscript𝑝2\mathbb{B}(t)=\big{[}\boldsymbol{\beta}(t),\dot{\boldsymbol{\beta}}(t)\big{]}% \in\mathbb{R}^{p\times 2}blackboard_B ( italic_t ) = [ bold_italic_β ( italic_t ) , over˙ start_ARG bold_italic_β end_ARG ( italic_t ) ] ∈ blackboard_R start_POSTSUPERSCRIPT italic_p × 2 end_POSTSUPERSCRIPT is given by the unconstrained minimizer 𝔹~(t)~𝔹𝑡\widetilde{\mathbb{B}}(t)over~ start_ARG blackboard_B end_ARG ( italic_t ) of the following objective function.

n(𝔹(t))=i=1n1Li=1Li[Yi,𝐗i𝔹(t)𝐙i,(t)]2Kh(Ti,t)subscript𝑛𝔹𝑡superscriptsubscript𝑖1𝑛1subscript𝐿𝑖superscriptsubscript1subscript𝐿𝑖superscriptdelimited-[]subscript𝑌𝑖superscriptsubscript𝐗𝑖top𝔹𝑡subscript𝐙𝑖𝑡2subscript𝐾subscript𝑇𝑖𝑡\displaystyle\begin{split}\mathcal{L}_{n}\big{(}\mathbb{B}(t)\big{)}&=\sum_{i=% 1}^{n}\frac{1}{L_{i}}\sum_{\ell=1}^{L_{i}}\Big{[}Y_{i,\ell}-\mathbf{X}_{i}^{% \top}\mathbb{B}(t)\mathbf{Z}_{i,\ell}(t)\Big{]}^{2}K_{h}(T_{i,\ell}-t)\end{split}start_ROW start_CELL caligraphic_L start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( blackboard_B ( italic_t ) ) end_CELL start_CELL = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT [ italic_Y start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT blackboard_B ( italic_t ) bold_Z start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t ) end_CELL end_ROW (2.4)

where Kh(u)=K(u/h)/hsubscript𝐾𝑢𝐾𝑢K_{h}(u)=K(u/h)/hitalic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_u ) = italic_K ( italic_u / italic_h ) / italic_h with a probability density function K𝐾Kitalic_K and a bandwidth h>00h>0italic_h > 0, 𝐙i,(t)=(1,Ti,t)subscript𝐙𝑖𝑡superscript1subscript𝑇𝑖𝑡top\mathbf{Z}_{i,\ell}(t)=\big{(}1,T_{i,\ell}-t\big{)}^{\top}bold_Z start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) = ( 1 , italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT is the local smoothing design, and 𝐗i𝔹(t)𝐙i,(t)=𝕎i,(t)vec(𝔹(t))superscriptsubscript𝐗𝑖top𝔹𝑡subscript𝐙𝑖𝑡subscript𝕎𝑖superscript𝑡topvec𝔹𝑡\mathbf{X}_{i}^{\top}\mathbb{B}(t)\mathbf{Z}_{i,\ell}(t)=\mathbb{W}_{i,\ell}(t% )^{\top}\mathrm{vec}\big{(}\mathbb{B}(t)\big{)}bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT blackboard_B ( italic_t ) bold_Z start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) = blackboard_W start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_vec ( blackboard_B ( italic_t ) ) with vec(𝔹(t))=(𝜷(t),𝜷˙(t))2pvec𝔹𝑡superscript𝜷superscript𝑡top˙𝜷superscript𝑡toptopsuperscript2𝑝\mathrm{vec}\big{(}\mathbb{B}(t)\big{)}=\big{(}\boldsymbol{\beta}(t)^{\top},% \dot{\boldsymbol{\beta}}(t)^{\top}\big{)}^{\top}\in\mathbb{R}^{2p}roman_vec ( blackboard_B ( italic_t ) ) = ( bold_italic_β ( italic_t ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , over˙ start_ARG bold_italic_β end_ARG ( italic_t ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT 2 italic_p end_POSTSUPERSCRIPT of 𝔹(t)𝔹𝑡\mathbb{B}(t)blackboard_B ( italic_t ) and the Kronecker product 𝕎i,(t)=𝐙i,(t)𝐗isubscript𝕎𝑖𝑡tensor-productsubscript𝐙𝑖𝑡subscript𝐗𝑖\mathbb{W}_{i,\ell}(t)=\mathbf{Z}_{i,\ell}(t)\otimes\mathbf{X}_{i}blackboard_W start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) = bold_Z start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) ⊗ bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT of the covarite design for local linear smoothing.

Proposition 2.2.1.

For each t[0,1]𝑡01t\in[0,1]italic_t ∈ [ 0 , 1 ], let 𝔹0(t)=[𝛃0(t),𝛃˙0(t)]subscript𝔹0𝑡subscript𝛃0𝑡subscript˙𝛃0𝑡\mathbb{B}_{0}(t)=\big{[}\boldsymbol{\beta}_{0}(t),\dot{\boldsymbol{\beta}}_{0% }(t)\big{]}blackboard_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) = [ bold_italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) , over˙ start_ARG bold_italic_β end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) ] be the p×2𝑝2p\times 2italic_p × 2 matrix, where 𝛃0(t)subscript𝛃0𝑡\boldsymbol{\beta}_{0}(t)bold_italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) is the true vector coefficient function such that 𝛃¨0(t)=(β1′′(t),,βp′′(t))subscript¨𝛃0𝑡superscriptsuperscriptsubscript𝛽1′′𝑡superscriptsubscript𝛽𝑝′′𝑡top\ddot{\boldsymbol{\beta}}_{0}(t)=\big{(}\beta_{1}^{\prime\prime}(t),\ldots,% \beta_{p}^{\prime\prime}(t)\big{)}^{\top}over¨ start_ARG bold_italic_β end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) = ( italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_t ) , … , italic_β start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT exists and is continuous. Suppose conditions (K0)𝐾0(K0)( italic_K 0 )(K4)𝐾4(K4)( italic_K 4 ) in Supplementray Material S.1 hold. Then, we have

supt[0,1]𝜷~(t)𝜷0(t)=OP(h2+lognnh).subscriptsupremum𝑡01norm~𝜷𝑡subscript𝜷0𝑡subscript𝑂𝑃superscript2𝑛𝑛\displaystyle\sup_{t\in[0,1]}\big{\|}\widetilde{\boldsymbol{\beta}}(t)-% \boldsymbol{\beta}_{0}(t)\big{\|}=O_{P}\bigg{(}h^{2}+\sqrt{\frac{\log n}{nh}}% \bigg{)}.roman_sup start_POSTSUBSCRIPT italic_t ∈ [ 0 , 1 ] end_POSTSUBSCRIPT ∥ over~ start_ARG bold_italic_β end_ARG ( italic_t ) - bold_italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) ∥ = italic_O start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + square-root start_ARG divide start_ARG roman_log italic_n end_ARG start_ARG italic_n italic_h end_ARG end_ARG ) . (2.5)

The implication of conditions (K0)𝐾0(K0)( italic_K 0 )(K4)𝐾4(K4)( italic_K 4 ) are provided in Supplementary Material S.1. It is worth mentioning that Proposition 2.2.1 is valid even if only a few longitudinal observations are available for some subjects. For example, it is common in observational studies that dense observations of functional responses for some subjects may not be available regardless of the sample size n𝑛nitalic_n, i.e., the minimum number of longitudinal observations λn=min1inLisubscript𝜆𝑛subscript1𝑖𝑛subscript𝐿𝑖\lambda_{n}=\min_{1\leq i\leq n}L_{i}italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = roman_min start_POSTSUBSCRIPT 1 ≤ italic_i ≤ italic_n end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is bounded. Therefore, the technical condition on λnsubscript𝜆𝑛\lambda_{n}italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT in (K4)𝐾4(K4)( italic_K 4 ) is not a restriction because we do not require λnsubscript𝜆𝑛\lambda_{n}\to\inftyitalic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT → ∞ as n𝑛n\to\inftyitalic_n → ∞.

Remark 1.

For technical conditions (K0)𝐾0(K0)( italic_K 0 )(K4)𝐾4(K4)( italic_K 4 ), k>2.5𝑘2.5k>2.5italic_k > 2.5 and a=1/5𝑎15a=1/5italic_a = 1 / 5 are commonly adopted in the standard theory of kernel smoothing [10]. Then, Theorem 2.2.1 gives supt[0,1]𝜷~(t)𝜷0(t)=OP(n2/5logn)subscriptsupremum𝑡01norm~𝜷𝑡subscript𝜷0𝑡subscript𝑂𝑃superscript𝑛25𝑛\sup_{t\in[0,1]}\|\tilde{\boldsymbol{\beta}}(t)-\boldsymbol{\beta}_{0}(t)\|=O_% {P}(n^{-2/5}\sqrt{\log n})roman_sup start_POSTSUBSCRIPT italic_t ∈ [ 0 , 1 ] end_POSTSUBSCRIPT ∥ over~ start_ARG bold_italic_β end_ARG ( italic_t ) - bold_italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) ∥ = italic_O start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_n start_POSTSUPERSCRIPT - 2 / 5 end_POSTSUPERSCRIPT square-root start_ARG roman_log italic_n end_ARG ). It follows that 01𝜷~(t)𝜷0(t)2dt=OP(n2/5logn)superscriptsubscript01superscriptnorm~𝜷𝑡subscript𝜷0𝑡2differential-d𝑡subscript𝑂𝑃superscript𝑛25𝑛\int_{0}^{1}\big{\|}\widetilde{\boldsymbol{\beta}}(t)-\boldsymbol{\beta}_{0}(t% )\big{\|}^{2}\,\mathrm{d}t=O_{P}(n^{-2/5}\sqrt{\log n})∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∥ over~ start_ARG bold_italic_β end_ARG ( italic_t ) - bold_italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_d italic_t = italic_O start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_n start_POSTSUPERSCRIPT - 2 / 5 end_POSTSUPERSCRIPT square-root start_ARG roman_log italic_n end_ARG ).

For the shape-constrained estimation of 𝜷^^𝜷\widehat{\boldsymbol{\beta}}over^ start_ARG bold_italic_β end_ARG under H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, we extend the kernel-weighted least squares for nonparametric regression with scalar responses [34], where side conditions are empirically examined over a fine grid 𝒢M={tm[0,1]:m=0,1,,M}subscript𝒢𝑀conditional-setsubscript𝑡𝑚01𝑚01𝑀\mathcal{G}_{M}=\{t_{m}\in[0,1]:m=0,1,\ldots,M\}caligraphic_G start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT = { italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∈ [ 0 , 1 ] : italic_m = 0 , 1 , … , italic_M } with 0=t0<t1<<tM1<tM=10subscript𝑡0subscript𝑡1subscript𝑡𝑀1subscript𝑡𝑀10=t_{0}<t_{1}<\cdots<t_{M-1}<t_{M}=10 = italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < italic_t start_POSTSUBSCRIPT italic_M - 1 end_POSTSUBSCRIPT < italic_t start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT = 1 so that β^jsubscript^𝛽𝑗\hat{\beta}_{j}over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT satisfies the null hypothesis (2.2) over 𝒢j=𝒢|j={tjmj:m=0,1,,Mj}subscript𝒢𝑗evaluated-at𝒢subscript𝑗conditional-setsubscript𝑡subscript𝑗𝑚subscript𝑗𝑚01subscript𝑀𝑗\mathcal{G}_{j}=\mathcal{G}|_{\mathcal{I}_{j}}=\{t_{j_{m}}\in\mathcal{I}_{j}:m% =0,1,\ldots,M_{j}\}caligraphic_G start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = caligraphic_G | start_POSTSUBSCRIPT caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT = { italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT : italic_m = 0 , 1 , … , italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } for each jJ𝑗𝐽j\in Jitalic_j ∈ italic_J. Specifically, if we consider “H0,j:βj is monotone increasing on j:subscript𝐻0𝑗βj is monotone increasing on jH_{0,j}:\textrm{$\beta_{j}$ is monotone increasing on $\mathcal{I}_{j}$}italic_H start_POSTSUBSCRIPT 0 , italic_j end_POSTSUBSCRIPT : italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is monotone increasing on caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT,” we propose to substitute

H0,j:βj(tjm)βj(tjm1)0for allm=1,,Mj.\displaystyle H_{0,j}^{\prime}:\beta_{j}(t_{j_{m}})-\beta_{j}(t_{j_{m-1}})\geq 0% \,\,\,\textrm{for all}\,\,\,m=1,\ldots,M_{j}.italic_H start_POSTSUBSCRIPT 0 , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT : italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) - italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_m - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ≥ 0 for all italic_m = 1 , … , italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT . (2.6)

Similarly, we empirically validate the partial hypothesis “H0,j:βk is convex on j:subscript𝐻0𝑗βk is convex on jH_{0,j}:\textrm{$\beta_{k}$ is convex on $\mathcal{I}_{j}$}italic_H start_POSTSUBSCRIPT 0 , italic_j end_POSTSUBSCRIPT : italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is convex on caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT” as

H0,j:βj(tjm)βj(tjm1)tjmtjm1βj(tjm1)for allm=1,,Mj.\displaystyle H_{0,j}^{\prime}:\frac{\beta_{j}(t_{j_{m}})-\beta_{j}(t_{j_{m-1}% })}{t_{j_{m}}-t_{j_{m-1}}}\geq\beta_{j}^{\prime}(t_{j_{m-1}})\,\,\,\textrm{for% all}\,\,\,m=1,\ldots,M_{j}.italic_H start_POSTSUBSCRIPT 0 , italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT : divide start_ARG italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) - italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_m - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG start_ARG italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_m - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG ≥ italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_m - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) for all italic_m = 1 , … , italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT . (2.7)

To define the shape-constrained kernel-weighted least squares for the function-on-scalar regression model (2.1) under the empirical null hypothesis on the grid 𝒢Msubscript𝒢𝑀\mathcal{G}_{M}caligraphic_G start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT, we note that

n(𝔹(t))=i=1n1Li=1Li(𝕎i,(t)[vec(𝔹(t))vec(𝔹~(t))]e~i,(t))2Kh(Ti,t)=𝒬n(𝔹(t),𝔹~(t))+1ni=1n1Li=1LiKh(Ti,t)e~i,(t)2,subscript𝑛𝔹𝑡superscriptsubscript𝑖1𝑛1subscript𝐿𝑖superscriptsubscript1subscript𝐿𝑖superscriptsubscript𝕎𝑖superscript𝑡topdelimited-[]vec𝔹𝑡vec~𝔹𝑡subscript~𝑒𝑖𝑡2subscript𝐾subscript𝑇𝑖𝑡subscript𝒬𝑛𝔹𝑡~𝔹𝑡1𝑛superscriptsubscript𝑖1𝑛1subscript𝐿𝑖superscriptsubscript1subscript𝐿𝑖subscript𝐾subscript𝑇𝑖𝑡subscript~𝑒𝑖superscript𝑡2\displaystyle\begin{split}\mathcal{L}_{n}\big{(}\mathbb{B}(t)\big{)}&=\sum_{i=% 1}^{n}\frac{1}{L_{i}}\sum_{\ell=1}^{L_{i}}\Big{(}\mathbb{W}_{i,\ell}(t)^{\top}% \big{[}\mathrm{vec}\big{(}\mathbb{B}(t)\big{)}-\mathrm{vec}\big{(}\widetilde{% \mathbb{B}}(t)\big{)}\big{]}-\tilde{e}_{i,\ell}(t)\Big{)}^{2}K_{h}(T_{i,\ell}-% t)\\ &=\mathcal{Q}_{n}\big{(}\mathbb{B}(t),\widetilde{\mathbb{B}}(t)\big{)}+\frac{1% }{n}\sum_{i=1}^{n}\frac{1}{L_{i}}\sum_{\ell=1}^{L_{i}}K_{h}(T_{i,\ell}-t)% \tilde{e}_{i,\ell}(t)^{2},\end{split}start_ROW start_CELL caligraphic_L start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( blackboard_B ( italic_t ) ) end_CELL start_CELL = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( blackboard_W start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT [ roman_vec ( blackboard_B ( italic_t ) ) - roman_vec ( over~ start_ARG blackboard_B end_ARG ( italic_t ) ) ] - over~ start_ARG italic_e end_ARG start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( blackboard_B ( italic_t ) , over~ start_ARG blackboard_B end_ARG ( italic_t ) ) + divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t ) over~ start_ARG italic_e end_ARG start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , end_CELL end_ROW (2.8)

where e~i,=Yi,𝕎i,vec(𝔹~(t))subscript~𝑒𝑖subscript𝑌𝑖superscriptsubscript𝕎𝑖topvec~𝔹𝑡\tilde{e}_{i,\ell}=Y_{i,\ell}-\mathbb{W}_{i,\ell}^{\top}\mathrm{vec}\big{(}% \widetilde{\mathbb{B}}(t)\big{)}over~ start_ARG italic_e end_ARG start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT = italic_Y start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - blackboard_W start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_vec ( over~ start_ARG blackboard_B end_ARG ( italic_t ) ) is the unconstrained residual of fitting Yi,subscript𝑌𝑖Y_{i,\ell}italic_Y start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT and 𝒬n(𝔹(t),𝔹~(t))=vec(𝔹(t)𝔹~(t))Ψ^(t)vec(𝔹(t)𝔹~(t))subscript𝒬𝑛𝔹𝑡~𝔹𝑡vecsuperscript𝔹𝑡~𝔹𝑡top^Ψ𝑡vec𝔹𝑡~𝔹𝑡\mathcal{Q}_{n}\big{(}\mathbb{B}(t),\widetilde{\mathbb{B}}(t)\big{)}=\mathrm{% vec}\big{(}\mathbb{B}(t)-\widetilde{\mathbb{B}}(t)\big{)}^{\top}\widehat{\Psi}% (t)\mathrm{vec}\big{(}\mathbb{B}(t)-\widetilde{\mathbb{B}}(t)\big{)}caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( blackboard_B ( italic_t ) , over~ start_ARG blackboard_B end_ARG ( italic_t ) ) = roman_vec ( blackboard_B ( italic_t ) - over~ start_ARG blackboard_B end_ARG ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over^ start_ARG roman_Ψ end_ARG ( italic_t ) roman_vec ( blackboard_B ( italic_t ) - over~ start_ARG blackboard_B end_ARG ( italic_t ) ) with Ψ^(t)=n1i=1nLi1=1Li𝕎i,(t)𝕎i,(t)Kh(Ti,t)^Ψ𝑡superscript𝑛1superscriptsubscript𝑖1𝑛superscriptsubscript𝐿𝑖1superscriptsubscript1subscript𝐿𝑖subscript𝕎𝑖𝑡subscript𝕎𝑖superscript𝑡topsubscript𝐾subscript𝑇𝑖𝑡\widehat{\Psi}(t)=n^{-1}\sum_{i=1}^{n}L_{i}^{-1}\sum_{\ell=1}^{L_{i}}\mathbb{W% }_{i,\ell}(t)\mathbb{W}_{i,\ell}(t)^{\top}K_{h}(T_{i,\ell}-t)over^ start_ARG roman_Ψ end_ARG ( italic_t ) = italic_n start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT blackboard_W start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) blackboard_W start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t ). To get (2.8), we used the fact that the unconstrained estimator 𝔹~(t)~𝔹𝑡\widetilde{\mathbb{B}}(t)over~ start_ARG blackboard_B end_ARG ( italic_t ) solves the estimating equation n(𝔹(t))/vec(𝔹(t))=02psubscript𝑛𝔹𝑡vec𝔹𝑡subscript02𝑝\partial\mathcal{L}_{n}\big{(}\mathbb{B}(t)\big{)}/\partial\mathrm{vec}\big{(}% \mathbb{B}(t)\big{)}=0_{2p}∂ caligraphic_L start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( blackboard_B ( italic_t ) ) / ∂ roman_vec ( blackboard_B ( italic_t ) ) = 0 start_POSTSUBSCRIPT 2 italic_p end_POSTSUBSCRIPT, or equivalently n1i=1nLi1=1Li𝕎i,(t)e~i,Kh(Ti,t)=02psuperscript𝑛1superscriptsubscript𝑖1𝑛superscriptsubscript𝐿𝑖1superscriptsubscript1subscript𝐿𝑖subscript𝕎𝑖𝑡subscript~𝑒𝑖subscript𝐾subscript𝑇𝑖𝑡subscript02𝑝n^{-1}\sum_{i=1}^{n}L_{i}^{-1}\sum_{\ell=1}^{L_{i}}\mathbb{W}_{i,\ell}(t)% \tilde{e}_{i,\ell}K_{h}(T_{i,\ell}-t)=0_{2p}italic_n start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT blackboard_W start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) over~ start_ARG italic_e end_ARG start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t ) = 0 start_POSTSUBSCRIPT 2 italic_p end_POSTSUBSCRIPT, where 02p=(0,,0)subscript02𝑝superscript00top0_{2p}=(0,\ldots,0)^{\top}0 start_POSTSUBSCRIPT 2 italic_p end_POSTSUBSCRIPT = ( 0 , … , 0 ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT denotes the zero vector in 2psuperscript2𝑝\mathbb{R}^{2p}blackboard_R start_POSTSUPERSCRIPT 2 italic_p end_POSTSUPERSCRIPT. Noting that the minimization of n(𝔹(t))subscript𝑛𝔹𝑡\mathcal{L}_{n}\big{(}\mathbb{B}(t)\big{)}caligraphic_L start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( blackboard_B ( italic_t ) ) only depends on the first term of (2.8) given the unconstrained estimator B~(t)~𝐵𝑡\widetilde{B}(t)over~ start_ARG italic_B end_ARG ( italic_t ), we propose the constrained optimization problem as follows.

Minimizem=0Mvec(𝔹(tm)𝔹~(tm))Ψ^(tm)vec(𝔹(tm)𝔹~(tm))subject toAj[vec(𝔹(tj0))vec(𝔹(tjM))]0Mjfor alljJ,Minimizesuperscriptsubscript𝑚0𝑀vecsuperscript𝔹subscript𝑡𝑚~𝔹subscript𝑡𝑚top^Ψsubscript𝑡𝑚vec𝔹subscript𝑡𝑚~𝔹subscript𝑡𝑚subject tosubscript𝐴𝑗delimited-[]vec𝔹subscript𝑡subscript𝑗0vec𝔹subscript𝑡subscript𝑗𝑀subscript0subscript𝑀𝑗for all𝑗𝐽\displaystyle\begin{split}\textrm{Minimize}&\quad\sum_{m=0}^{M}\mathrm{vec}% \big{(}\mathbb{B}(t_{m})-\widetilde{\mathbb{B}}(t_{m})\big{)}^{\top}\widehat{% \Psi}(t_{m})\mathrm{vec}\big{(}\mathbb{B}(t_{m})-\widetilde{\mathbb{B}}(t_{m})% \big{)}\\ \quad\textrm{subject to}&\quad A_{j}\left[\begin{array}[]{c}\mathrm{vec}\big{(% }\mathbb{B}(t_{j_{0}})\big{)}\\ \vdots\\ \mathrm{vec}\big{(}\mathbb{B}(t_{j_{M}})\big{)}\end{array}\right]\geq 0_{M_{j}% }\,\,\,\textrm{for all}\,\,\,j\in J,\end{split}start_ROW start_CELL Minimize end_CELL start_CELL ∑ start_POSTSUBSCRIPT italic_m = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT roman_vec ( blackboard_B ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) - over~ start_ARG blackboard_B end_ARG ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over^ start_ARG roman_Ψ end_ARG ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) roman_vec ( blackboard_B ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) - over~ start_ARG blackboard_B end_ARG ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ) end_CELL end_ROW start_ROW start_CELL subject to end_CELL start_CELL italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT [ start_ARRAY start_ROW start_CELL roman_vec ( blackboard_B ( italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL roman_vec ( blackboard_B ( italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) end_CELL end_ROW end_ARRAY ] ≥ 0 start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT for all italic_j ∈ italic_J , end_CELL end_ROW (2.9)

where 0kksubscript0𝑘superscript𝑘0_{k}\in\mathbb{R}^{k}0 start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT and Ok×k×subscript𝑂𝑘superscript𝑘O_{k\times\ell}\in\mathbb{R}^{k\times\ell}italic_O start_POSTSUBSCRIPT italic_k × roman_ℓ end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_k × roman_ℓ end_POSTSUPERSCRIPT are the k𝑘kitalic_k-vector and (k,)𝑘(k,\ell)( italic_k , roman_ℓ )-matrix of zeros, respectively, and AjMj×2p(M+1)subscript𝐴𝑗superscriptsubscript𝑀𝑗2𝑝𝑀1A_{j}\in\mathbb{R}^{M_{j}\times 2p(M+1)}italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT × 2 italic_p ( italic_M + 1 ) end_POSTSUPERSCRIPT is the linear constraint matrix associated with the individual hypothesis H0,jsubscript𝐻0𝑗H_{0,j}italic_H start_POSTSUBSCRIPT 0 , italic_j end_POSTSUBSCRIPT. For the monotone increasing hypothesis (2.6), we set Aj=Amono(j)[ej,0p]subscript𝐴𝑗tensor-productsubscript𝐴monosubscript𝑗superscriptsubscripte𝑗topsuperscriptsubscript0𝑝topA_{j}=A_{\mathrm{mono}}(\mathcal{I}_{j})\otimes\big{[}\mathrm{e}_{j}^{\top},0_% {p}^{\top}\big{]}italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_A start_POSTSUBSCRIPT roman_mono end_POSTSUBSCRIPT ( caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ⊗ [ roman_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , 0 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ], where ejpsubscripte𝑗superscript𝑝\textrm{e}_{j}\in\mathbb{R}^{p}e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT is the unit vector with 1111 only at its j𝑗jitalic_j-th coordinate and Amono(j)Mj×(Mj+1)subscript𝐴monosubscript𝑗superscriptsubscript𝑀𝑗subscript𝑀𝑗1A_{\mathrm{mono}}(\mathcal{I}_{j})\in\mathbb{R}^{M_{j}\times(M_{j}+1)}italic_A start_POSTSUBSCRIPT roman_mono end_POSTSUBSCRIPT ( caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT × ( italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) end_POSTSUPERSCRIPT is defined as

Amono(j)subscript𝐴monosubscript𝑗\displaystyle A_{\mathrm{mono}}(\mathcal{I}_{j})italic_A start_POSTSUBSCRIPT roman_mono end_POSTSUBSCRIPT ( caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) =[110000110000011].absentdelimited-[]110000110000011\displaystyle=\left[\begin{array}[]{ccc c cc}-1&1&0&\cdots&0&0\\ 0&-1&1&\cdots&0&0\\ \vdots&\vdots&\vdots&\ddots&\vdots&\vdots\\ 0&0&0&\cdots&-1&1\end{array}\right].= [ start_ARRAY start_ROW start_CELL - 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL - 1 end_CELL start_CELL 1 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL - 1 end_CELL start_CELL 1 end_CELL end_ROW end_ARRAY ] . (2.14)

Similarly, for the convexity hypothesis (2.7), we set we set Aj=Aconv,0(j)[ej,0p]Aconv,1(j)[0p,ej]subscript𝐴𝑗tensor-productsubscript𝐴conv0subscript𝑗superscriptsubscripte𝑗topsuperscriptsubscript0𝑝toptensor-productsubscript𝐴conv1subscript𝑗superscriptsubscript0𝑝topsuperscriptsubscripte𝑗topA_{j}=A_{\mathrm{conv},0}(\mathcal{I}_{j})\otimes\big{[}\mathrm{e}_{j}^{\top},% 0_{p}^{\top}\big{]}-A_{\mathrm{conv},1}(\mathcal{I}_{j})\otimes\big{[}0_{p}^{% \top},\mathrm{e}_{j}^{\top}\big{]}italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_A start_POSTSUBSCRIPT roman_conv , 0 end_POSTSUBSCRIPT ( caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ⊗ [ roman_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , 0 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] - italic_A start_POSTSUBSCRIPT roman_conv , 1 end_POSTSUBSCRIPT ( caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ⊗ [ 0 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , roman_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ], where Aconv,0(k),Aconv,1(j)Mj×(Mj+1)subscript𝐴conv0subscript𝑘subscript𝐴conv1subscript𝑗superscriptsubscript𝑀𝑗subscript𝑀𝑗1A_{\mathrm{conv},0}(\mathcal{I}_{k}),A_{\mathrm{conv},1}(\mathcal{I}_{j})\in% \mathbb{R}^{M_{j}\times(M_{j}+1)}italic_A start_POSTSUBSCRIPT roman_conv , 0 end_POSTSUBSCRIPT ( caligraphic_I start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , italic_A start_POSTSUBSCRIPT roman_conv , 1 end_POSTSUBSCRIPT ( caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT × ( italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + 1 ) end_POSTSUPERSCRIPT are defined as

Aconv,0(j)=[1tj1tj01tj1tj000001tj2tj11tj2tj1000001tjMjtjMj11tjMjtjMj1]subscript𝐴conv0subscript𝑗delimited-[]1subscript𝑡subscript𝑗1subscript𝑡subscript𝑗01subscript𝑡subscript𝑗1subscript𝑡subscript𝑗000001subscript𝑡subscript𝑗2subscript𝑡subscript𝑗11subscript𝑡subscript𝑗2subscript𝑡subscript𝑗1000001subscript𝑡subscript𝑗subscript𝑀𝑗subscript𝑡subscript𝑗subscript𝑀𝑗11subscript𝑡subscript𝑗subscript𝑀𝑗subscript𝑡subscript𝑗subscript𝑀𝑗1\displaystyle\begin{split}A_{\mathrm{conv},0}(\mathcal{I}_{j})&=\left[\begin{% array}[]{ccc c cc}\frac{-1}{t_{j_{1}}-t_{j_{0}}}&\frac{1}{t_{j_{1}}-t_{j_{0}}}% &0&\cdots&0&0\\ 0&\frac{-1}{t_{j_{2}}-t_{j_{1}}}&\frac{1}{t_{j_{2}}-t_{j_{1}}}&\cdots&0&0\\ \vdots&\vdots&\vdots&\ddots&\vdots&\vdots\\ 0&0&0&\cdots&\frac{-1}{t_{j_{M_{j}}}-t_{j_{M_{j}-1}}}&\frac{1}{t_{j_{M_{j}}}-t% _{j_{M_{j}-1}}}\end{array}\right]\end{split}start_ROW start_CELL italic_A start_POSTSUBSCRIPT roman_conv , 0 end_POSTSUBSCRIPT ( caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) end_CELL start_CELL = [ start_ARRAY start_ROW start_CELL divide start_ARG - 1 end_ARG start_ARG italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL divide start_ARG - 1 end_ARG start_ARG italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL divide start_ARG - 1 end_ARG start_ARG italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_t start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG end_CELL end_ROW end_ARRAY ] end_CELL end_ROW (2.15)

and Aconv,1(j)=[IMj,0Mj]subscript𝐴conv1subscript𝑗subscript𝐼subscript𝑀𝑗subscript0subscript𝑀𝑗A_{\mathrm{conv},1}(\mathcal{I}_{j})=\big{[}I_{M_{j}},0_{M_{j}}\big{]}italic_A start_POSTSUBSCRIPT roman_conv , 1 end_POSTSUBSCRIPT ( caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = [ italic_I start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT , 0 start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ]. This illustrates that the constrained estimates 𝔹^(t0),,𝔹^(tM)^𝔹subscript𝑡0^𝔹subscript𝑡𝑀\widehat{\mathbb{B}}(t_{0}),\ldots,\widehat{\mathbb{B}}(t_{M})over^ start_ARG blackboard_B end_ARG ( italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) , … , over^ start_ARG blackboard_B end_ARG ( italic_t start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ) can be obtained by the standard quadratic programming with jJMjsubscript𝑗𝐽subscript𝑀𝑗\sum_{j\in J}M_{j}∑ start_POSTSUBSCRIPT italic_j ∈ italic_J end_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT linear constraints.

Remark 2.

Suppose we hypothesize “H0,j:βj is monotone increasing on j,1 and convex on j,2:subscript𝐻0𝑗βj is monotone increasing on j,1 and convex on j,2H_{0,j}:\textrm{$\beta_{j}$ is monotone increasing on $\mathcal{I}_{j,1}$ and % convex on $\mathcal{I}_{j,2}$}italic_H start_POSTSUBSCRIPT 0 , italic_j end_POSTSUBSCRIPT : italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is monotone increasing on caligraphic_I start_POSTSUBSCRIPT italic_j , 1 end_POSTSUBSCRIPT and convex on caligraphic_I start_POSTSUBSCRIPT italic_j , 2 end_POSTSUBSCRIPT,” where j,1subscript𝑗1\mathcal{I}_{j,1}caligraphic_I start_POSTSUBSCRIPT italic_j , 1 end_POSTSUBSCRIPT and j,2subscript𝑗2\mathcal{I}_{j,2}caligraphic_I start_POSTSUBSCRIPT italic_j , 2 end_POSTSUBSCRIPT may or may not overlap. Then, the side conditions corresponding to the null hypothesis can also be examined by imposing both linear constraint matrices Amono(j,1)[ej,0p]tensor-productsubscript𝐴monosubscript𝑗1superscriptsubscripte𝑗topsuperscriptsubscript0𝑝topA_{\mathrm{mono}}(\mathcal{I}_{j,1})\otimes\big{[}\mathrm{e}_{j}^{\top},0_{p}^% {\top}\big{]}italic_A start_POSTSUBSCRIPT roman_mono end_POSTSUBSCRIPT ( caligraphic_I start_POSTSUBSCRIPT italic_j , 1 end_POSTSUBSCRIPT ) ⊗ [ roman_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , 0 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] and Aconv,0(j,2)[ej,0p]Aconv,1(j,2)[0p,ej]tensor-productsubscript𝐴conv0subscript𝑗2superscriptsubscripte𝑗topsuperscriptsubscript0𝑝toptensor-productsubscript𝐴conv1subscript𝑗2superscriptsubscript0𝑝topsuperscriptsubscripte𝑗topA_{\mathrm{conv},0}(\mathcal{I}_{j,2})\otimes\big{[}\mathrm{e}_{j}^{\top},0_{p% }^{\top}\big{]}-A_{\mathrm{conv},1}(\mathcal{I}_{j,2})\otimes\big{[}0_{p}^{% \top},\mathrm{e}_{j}^{\top}\big{]}italic_A start_POSTSUBSCRIPT roman_conv , 0 end_POSTSUBSCRIPT ( caligraphic_I start_POSTSUBSCRIPT italic_j , 2 end_POSTSUBSCRIPT ) ⊗ [ roman_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , 0 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] - italic_A start_POSTSUBSCRIPT roman_conv , 1 end_POSTSUBSCRIPT ( caligraphic_I start_POSTSUBSCRIPT italic_j , 2 end_POSTSUBSCRIPT ) ⊗ [ 0 start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , roman_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ].

Theorem 2.2.2.

Suppose the conditions in Proposition 2.2.1 hold. For any M1𝑀1M\geq 1italic_M ≥ 1, let 𝒢M={tm[0,1]:m=0,1,,M}subscript𝒢𝑀conditional-setsubscript𝑡𝑚01𝑚01𝑀\mathcal{G}_{M}=\{t_{m}\in[0,1]:m=0,1,\ldots,M\}caligraphic_G start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT = { italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∈ [ 0 , 1 ] : italic_m = 0 , 1 , … , italic_M } be the finite grid used in (2.9) with 0=t0<t1<<tM=10subscript𝑡0subscript𝑡1subscript𝑡𝑀10=t_{0}<t_{1}<\cdots<t_{M}=10 = italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < ⋯ < italic_t start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT = 1. Under the null hypothesis H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT in (2.2), we have

maxt𝒢M𝜷^(t)𝜷0(t)=OP(h2+lognnh).subscript𝑡subscript𝒢𝑀norm^𝜷𝑡subscript𝜷0𝑡subscript𝑂𝑃superscript2𝑛𝑛\displaystyle\max_{t\in\mathcal{G}_{M}}\big{\|}\widehat{\boldsymbol{\beta}}(t)% -\boldsymbol{\beta}_{0}(t)\big{\|}=O_{P}\bigg{(}h^{2}+\sqrt{\frac{\log n}{nh}}% \bigg{)}.roman_max start_POSTSUBSCRIPT italic_t ∈ caligraphic_G start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ over^ start_ARG bold_italic_β end_ARG ( italic_t ) - bold_italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) ∥ = italic_O start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + square-root start_ARG divide start_ARG roman_log italic_n end_ARG start_ARG italic_n italic_h end_ARG end_ARG ) . (2.16)

Theorem 2.2.2 indicates that the shape-constrained kernel least square estimator achieves the same rate of convergence as the unconstrained estimator under the null hypothesis. Moreover, suppose the empirical distribution of 𝒢Msubscript𝒢𝑀\mathcal{G}_{M}caligraphic_G start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT converges to the uniform distribution on [0,1]01[0,1][ 0 , 1 ]. Then, it can be verified that 01𝜷^(t)𝜷0(t)dt=OP(n2/5logn)superscriptsubscript01norm^𝜷𝑡subscript𝜷0𝑡differential-d𝑡subscript𝑂𝑃superscript𝑛25𝑛\int_{0}^{1}\big{\|}\widehat{\boldsymbol{\beta}}(t)-\boldsymbol{\beta}_{0}(t)% \big{\|}\,\mathrm{d}t=O_{P}(n^{-2/5}\sqrt{\log n})∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ∥ over^ start_ARG bold_italic_β end_ARG ( italic_t ) - bold_italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) ∥ roman_d italic_t = italic_O start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_n start_POSTSUPERSCRIPT - 2 / 5 end_POSTSUPERSCRIPT square-root start_ARG roman_log italic_n end_ARG ) similarly to Remark 1.

2.3 Estimation: Shape-constrained regression spline

We now consider fitting the functional regression model by means of spline functions for smooth, flexible, and parsimonious estimation of coefficient functions. Let ϕk(t)subscriptitalic-ϕ𝑘𝑡\phi_{k}(t)italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_t ), k=1,,d𝑘1𝑑k=1,\ldots,ditalic_k = 1 , … , italic_d denote spline basis functions defined over [0,1]01[0,1][ 0 , 1 ] for a given sequence of knots, then we express βj(t)=k=1dcjkϕk(t)subscript𝛽𝑗𝑡superscriptsubscript𝑘1𝑑subscript𝑐𝑗𝑘subscriptitalic-ϕ𝑘𝑡\beta_{j}(t)=\sum_{k=1}^{d}c_{jk}\phi_{k}(t)italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_j italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_t ), where cjksubscript𝑐𝑗𝑘c_{jk}italic_c start_POSTSUBSCRIPT italic_j italic_k end_POSTSUBSCRIPT denotes the basis coefficients. Under the shape constraints on βj(t)subscript𝛽𝑗𝑡\beta_{j}(t)italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) over jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, we can still employ the regression spline approach based on chosen basis functions, such as I𝐼Iitalic_I-splines [23], C𝐶Citalic_C-splines [18] or general B𝐵Bitalic_B-splines, with corresponding constraints assigned on a subset of basis coefficients cjksubscript𝑐𝑗𝑘c_{jk}italic_c start_POSTSUBSCRIPT italic_j italic_k end_POSTSUBSCRIPT. While we focus on partially constrained estimation based on I𝐼Iitalic_I- or C𝐶Citalic_C- splines in this section, estimation via B𝐵Bitalic_B-splines is also discussed in the Supplementary Material S.6.

[23] first introduced I𝐼Iitalic_I-splines for curve estimation under the monotonicity restriction over the entire domain. As illustrated in the left panel of Figure 2, I𝐼Iitalic_I-spline basis functions are piece-wise quadratic, and at each of the knots indicated by dotted vertical lines, there is exactly one basis function with a non-zero slope. Thus, a monotone increasing (decreasing) spline function can be constructed as a non-negative (non-positive) linear combination of I𝐼Iitalic_I-spline basis functions. For readers interested in generating process of I𝐼Iitalic_I-splines, we refer [23] on how piece-wise quadratic I𝐼Iitalic_I-splines are formed from nonnegative M𝑀Mitalic_M-spline family. Then, if the specific aim is on assigning, for example, monotone increasing restriction over a subset of the domain jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, non-negative coefficients condition should apply to a subset of cjksubscript𝑐𝑗𝑘c_{jk}italic_c start_POSTSUBSCRIPT italic_j italic_k end_POSTSUBSCRIPT for which corresponding I𝐼Iitalic_I-spline ϕk(t)subscriptitalic-ϕ𝑘𝑡\phi_{k}(t)italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_t ) display positive slopes within the range of jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT.

Refer to caption
Figure 2: The piece-wise quadratic I𝐼Iitalic_I-spline basis functions and the piece-wise cubic C𝐶Citalic_C-spline basis functions under the location of equally spaced knots indicated by the symbol ‘×\times×’ along with the dotted vertical lines.

Next, in terms of convexity constraints, [18] proposed C𝐶Citalic_C-splines by integrating I𝐼Iitalic_I-splines. As shown in the right panel of Figure 2, C𝐶Citalic_C-splines form convex and piece-wise cubic functions with non-zero second derivatives at each knot, implying that a linear combination of C𝐶Citalic_C-splines with non-negative (non-positive) basis coefficients ensures the convexity (concavity) of the function estimator. See Section 2 of [18] for details. Similarly, for the convexity constraints over sub-interval(s) jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, we can impose non-negative restrictions to basis coefficients corresponding to splines showing positive second derivates features within jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT.

We now apply the shape-restricted spline function technique to the functional regression model framework to estimate functional coefficients under (2.2). Although methodological or theoretical developments are general to other shape constraints, such as convexity or monotone convexity, we elaborate on the estimation under the monotonicity restriction below and refer to remarks for other restrictions. Let ϕkj(t)superscriptsubscriptitalic-ϕ𝑘𝑗𝑡\phi_{k}^{j}(t)italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ( italic_t ), k=1,,dj𝑘1subscript𝑑𝑗k=1,\ldots,d_{j}italic_k = 1 , … , italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, denote piece-wise quadratic I𝐼Iitalic_I-spline basis functions under a given knot sequence, employed to fit βj(t)subscript𝛽𝑗𝑡\beta_{j}(t)italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ), j=1,,p𝑗1𝑝j=1,\ldots,pitalic_j = 1 , … , italic_p. Especially for partially constrained coefficients jJ𝑗𝐽j\in Jitalic_j ∈ italic_J, the knot sequence should include lower and upper boundaries of jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. For the case of multiple disjoint sub-intervals form of jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, all boundaries should be a part of the knot sequence. We then estimate the functional regression coefficients by minimizing the following objective function with respect to non-negative conditions assigning on a subset of basis coefficients cjksubscript𝑐𝑗𝑘c_{jk}italic_c start_POSTSUBSCRIPT italic_j italic_k end_POSTSUBSCRIPT depending on the set J𝐽Jitalic_J and Ajsubscript𝐴𝑗A_{j}italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT as,

Minimizei=1n1Li=1Li[Yi,j=1pXi,j{k=0djcjkϕkj(Ti,)}]2subject tocjk0for{(j,k):jJandkAj},Minimizesuperscriptsubscript𝑖1𝑛1subscript𝐿𝑖superscriptsubscript1subscript𝐿𝑖superscriptdelimited-[]subscript𝑌𝑖superscriptsubscript𝑗1𝑝subscript𝑋𝑖𝑗superscriptsubscript𝑘0subscript𝑑𝑗subscript𝑐𝑗𝑘superscriptsubscriptitalic-ϕ𝑘𝑗subscript𝑇𝑖2subject tosubscript𝑐𝑗𝑘0forconditional-set𝑗𝑘𝑗𝐽and𝑘subscript𝐴𝑗\displaystyle\begin{split}\textrm{Minimize}~{}~{}~{}&\sum_{i=1}^{n}\frac{1}{L_% {i}}\sum_{\ell=1}^{L_{i}}\Big{[}Y_{i,\ell}-\sum_{j=1}^{p}{X}_{i,j}\big{\{}\sum% _{k=0}^{d_{j}}c_{jk}\phi_{k}^{j}(T_{i,\ell})\big{\}}\Big{]}^{2}\\ \textrm{subject to}~{}~{}~{}&c_{jk}\geq 0~{}~{}~{}\textrm{for}~{}~{}\{(j,k):j% \in J~{}\textrm{and}~{}k\in A_{j}\},~{}~{}\end{split}start_ROW start_CELL Minimize end_CELL start_CELL ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT [ italic_Y start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT italic_X start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT { ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_j italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ) } ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL subject to end_CELL start_CELL italic_c start_POSTSUBSCRIPT italic_j italic_k end_POSTSUBSCRIPT ≥ 0 for { ( italic_j , italic_k ) : italic_j ∈ italic_J and italic_k ∈ italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } , end_CELL end_ROW (2.17)

where ϕ0j(t)superscriptsubscriptitalic-ϕ0𝑗𝑡\phi_{0}^{j}(t)italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ( italic_t ) indicates the intercept function, i.e., ϕ0j(t)=1superscriptsubscriptitalic-ϕ0𝑗𝑡1\phi_{0}^{j}(t)=1italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ( italic_t ) = 1, so that cj0subscript𝑐𝑗0c_{j0}italic_c start_POSTSUBSCRIPT italic_j 0 end_POSTSUBSCRIPT represents the location of monotonicity functions of βj(t)subscript𝛽𝑗𝑡\beta_{j}(t)italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ), the dimensionality of splines djsubscript𝑑𝑗d_{j}italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is determined by the number of knots chosen to approximate βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, the set J𝐽Jitalic_J from (2.2) includes indices which coefficients are constrained in H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, and the set Ajsubscript𝐴𝑗A_{j}italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is defined as {k{1,,dj}:ϕkj(t)>0,fortj}\big{\{}k\in\{1,\ldots,d_{j}\}:{\phi_{k}^{j}}{{}^{\prime}}(t)>0,~{}\textrm{for% }~{}t\in\mathcal{I}_{j}\big{\}}{ italic_k ∈ { 1 , … , italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } : italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t ) > 0 , for italic_t ∈ caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT }.

To obtain shape-restricted β^j(t)subscript^𝛽𝑗𝑡\hat{\beta}_{j}(t)over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ), we first write our regression problem associated with (2.17) as below. We elaborate for the case of regular evaluation grids, T1,,TLsubscript𝑇1subscript𝑇𝐿T_{1},\ldots,T_{L}italic_T start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_T start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT, for i=1,,n𝑖1𝑛i=1,\ldots,nitalic_i = 1 , … , italic_n, and dj=dsubscript𝑑𝑗𝑑d_{j}=ditalic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_d for representation simplicity, but having different Ti,subscript𝑇𝑖T_{i,\ell}italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT or djsubscript𝑑𝑗d_{j}italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT does not affect methodological developments. Let vec([𝒄j]j=1p)=(𝒄1,,𝒄p)vecsuperscriptsubscriptdelimited-[]subscript𝒄𝑗𝑗1𝑝superscriptsuperscriptsubscript𝒄1topsuperscriptsubscript𝒄𝑝toptop\textrm{vec}\big{(}[\boldsymbol{c}_{j}]_{j=1}^{p}\big{)}=(\boldsymbol{c}_{1}^{% \top},\ldots,\boldsymbol{c}_{p}^{\top})^{\top}vec ( [ bold_italic_c start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ) = ( bold_italic_c start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , … , bold_italic_c start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT for 𝒄j=(cj0,cj1,,cjd)subscript𝒄𝑗superscriptsubscript𝑐𝑗0subscript𝑐𝑗1subscript𝑐𝑗𝑑top\boldsymbol{c}_{j}=(c_{j0},c_{j1},\ldots,c_{jd})^{\top}bold_italic_c start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( italic_c start_POSTSUBSCRIPT italic_j 0 end_POSTSUBSCRIPT , italic_c start_POSTSUBSCRIPT italic_j 1 end_POSTSUBSCRIPT , … , italic_c start_POSTSUBSCRIPT italic_j italic_d end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, then

𝐘i=(𝐗iΦ)vec([𝒄j]j=1p)+ϵi,i=1,,n,formulae-sequencesubscript𝐘𝑖tensor-productsuperscriptsubscript𝐗𝑖topΦvecsuperscriptsubscriptdelimited-[]subscript𝒄𝑗𝑗1𝑝subscriptbold-italic-ϵ𝑖𝑖1𝑛\mathbf{Y}_{i}=(\mathbf{X}_{i}^{\top}\otimes\Phi)\cdot\textrm{vec}\big{(}[% \boldsymbol{c}_{j}]_{j=1}^{p}\big{)}+\boldsymbol{\epsilon}_{i},\quad i=1,% \ldots,n,bold_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ( bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⊗ roman_Φ ) ⋅ vec ( [ bold_italic_c start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT ) + bold_italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_i = 1 , … , italic_n , (2.18)

under cjk0subscript𝑐𝑗𝑘0c_{jk}\geq 0italic_c start_POSTSUBSCRIPT italic_j italic_k end_POSTSUBSCRIPT ≥ 0 for certain (j,k)𝑗𝑘(j,k)( italic_j , italic_k ) specified in (2.17), where ΦΦ\Phiroman_Φ denotes a L×(d+1)𝐿𝑑1L\times(d+1)italic_L × ( italic_d + 1 ) basis matrix having its \ellroman_ℓ-th row consisting of (1,ϕ1(T),\big{(}1,\phi_{1}(T_{\ell}),( 1 , italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) , ,\ldots,… , ϕd(T))\phi_{d}(T_{\ell})\big{)}italic_ϕ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT ) ) for =1,,L1𝐿\ell=1,\ldots,Lroman_ℓ = 1 , … , italic_L. Then, the coefficient estimation is associated with finding the projection of nL𝑛𝐿nLitalic_n italic_L-dimensional (𝐘1,𝐘n)superscriptsuperscriptsubscript𝐘1topsuperscriptsubscript𝐘𝑛toptop(\mathbf{Y}_{1}^{\top}\ldots,\mathbf{Y}_{n}^{\top})^{\top}( bold_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT … , bold_Y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT onto the space spanned by columns of (𝐗Φ)tensor-product𝐗Φ(\mathbf{X}\otimes\Phi)( bold_X ⊗ roman_Φ ) with corresponding constraints on 𝒄jsubscript𝒄𝑗\boldsymbol{c}_{j}bold_italic_c start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, where 𝐗𝐗\mathbf{X}bold_X denotes the (n×p)𝑛𝑝(n\times p)( italic_n × italic_p ) design matrix. By letting 𝝈hnLsuperscript𝝈superscript𝑛𝐿\boldsymbol{\sigma}^{h}\in\mathbb{R}^{nL}bold_italic_σ start_POSTSUPERSCRIPT italic_h end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n italic_L end_POSTSUPERSCRIPT, for h=1,,p(d+1)1𝑝𝑑1h=1,\ldots,p(d+1)italic_h = 1 , … , italic_p ( italic_d + 1 ), denote the set of column vectors on (𝐗Φ)tensor-product𝐗Φ(\mathbf{X}\otimes\Phi)( bold_X ⊗ roman_Φ ) and ()\mathcal{L}(\cdot)caligraphic_L ( ⋅ ) represent the linear space spanned by a given set of vectors, we separate 𝝈hsuperscript𝝈\boldsymbol{\sigma}^{h}bold_italic_σ start_POSTSUPERSCRIPT italic_h end_POSTSUPERSCRIPT, hH1subscript𝐻1h\in H_{1}italic_h ∈ italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, a set of column vectors associated with basis coefficients with no non-negativity constraint, and define 𝒱=({𝝈h;hH1})𝒱superscript𝝈subscript𝐻1\mathcal{V}=\mathcal{L}\big{(}\{\boldsymbol{\sigma}^{h};~{}h\in H_{1}\}\big{)}caligraphic_V = caligraphic_L ( { bold_italic_σ start_POSTSUPERSCRIPT italic_h end_POSTSUPERSCRIPT ; italic_h ∈ italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT } ) indicating the linear space spanned by them. We then further obtain a set of generating vectors, denoted as 𝜹hsuperscript𝜹\boldsymbol{\delta}^{h}bold_italic_δ start_POSTSUPERSCRIPT italic_h end_POSTSUPERSCRIPT for hH2subscript𝐻2h\in H_{2}italic_h ∈ italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT satisfying H2={1,,p(d+1)}\H1subscript𝐻2\1𝑝𝑑1subscript𝐻1H_{2}=\{1,\ldots,p(d+1)\}\backslash H_{1}italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = { 1 , … , italic_p ( italic_d + 1 ) } \ italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, that are orthogonal to 𝒱𝒱\mathcal{V}caligraphic_V. That is, 𝜹h=𝝈hΠ(𝝈h|𝒱)superscript𝜹superscript𝝈Πconditionalsuperscript𝝈𝒱\boldsymbol{\delta}^{h}=\boldsymbol{\sigma}^{h}-\Pi(\boldsymbol{\sigma}^{h}|% \mathcal{V})bold_italic_δ start_POSTSUPERSCRIPT italic_h end_POSTSUPERSCRIPT = bold_italic_σ start_POSTSUPERSCRIPT italic_h end_POSTSUPERSCRIPT - roman_Π ( bold_italic_σ start_POSTSUPERSCRIPT italic_h end_POSTSUPERSCRIPT | caligraphic_V ), for hH2subscript𝐻2h\in H_{2}italic_h ∈ italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, where ΠΠ\Piroman_Π indicates a projection operator, such as the Gram-Schmidt process. By extending [18], we can characterize the constrained set as

𝒞={𝝁nL:𝝁=hH1bh𝝈h+hH2bh𝜹h,wherebh0,forhH2}.𝒞conditional-set𝝁superscript𝑛𝐿formulae-sequence𝝁subscriptsubscript𝐻1subscript𝑏superscript𝝈subscriptsubscript𝐻2subscript𝑏superscript𝜹formulae-sequencewheresubscript𝑏0forsubscript𝐻2\mathcal{C}=\big{\{}\boldsymbol{\mu}\in\mathbb{R}^{nL}:~{}~{}\boldsymbol{\mu}=% \sum_{h\in H_{1}}b_{h}\boldsymbol{\sigma}^{h}+\sum_{h\in H_{2}}b_{h}% \boldsymbol{\delta}^{h},~{}~{}\textrm{where}~{}b_{h}\geq 0,~{}\textrm{for}~{}h% \in H_{2}\}.caligraphic_C = { bold_italic_μ ∈ blackboard_R start_POSTSUPERSCRIPT italic_n italic_L end_POSTSUPERSCRIPT : bold_italic_μ = ∑ start_POSTSUBSCRIPT italic_h ∈ italic_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT bold_italic_σ start_POSTSUPERSCRIPT italic_h end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_h ∈ italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT bold_italic_δ start_POSTSUPERSCRIPT italic_h end_POSTSUPERSCRIPT , where italic_b start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ≥ 0 , for italic_h ∈ italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT } . (2.19)

Then the projection of (𝐘1,𝐘n)superscriptsuperscriptsubscript𝐘1topsuperscriptsubscript𝐘𝑛toptop(\mathbf{Y}_{1}^{\top}\ldots,\mathbf{Y}_{n}^{\top})^{\top}( bold_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT … , bold_Y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT to 𝒞𝒞\mathcal{C}caligraphic_C can be fulfilled by projecting it onto the set of nonnegative linear combinations of 𝜹hsuperscript𝜹\boldsymbol{\delta}^{h}bold_italic_δ start_POSTSUPERSCRIPT italic_h end_POSTSUPERSCRIPT, hH2subscript𝐻2h\in H_{2}italic_h ∈ italic_H start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, called as constraint cone, and onto the 𝒱𝒱\mathcal{V}caligraphic_V, separately. We refer to Section 2 of [18] for readers interested in more comprehensive descriptions under a nonparametric regression setting. Owing to the fact that the constraint set 𝒞𝒞\mathcal{C}caligraphic_C is a closed convex polyhedral cone in nLsuperscript𝑛𝐿\mathbb{R}^{nL}blackboard_R start_POSTSUPERSCRIPT italic_n italic_L end_POSTSUPERSCRIPT, the projection is unique, and we employ the cone projection algorithm of [19] to find a solution, available as the function qprog or coneA in the R package coneproj. We note that the unconstrained estimates β~j(t)subscript~𝛽𝑗𝑡\tilde{\beta}_{j}(t)over~ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) can be calculated by the least square estimates of 𝒄jsubscript𝒄𝑗\boldsymbol{c}_{j}bold_italic_c start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT under (2.18).

Next, we investigate rates of convergence for estimated functional coefficients under the constraints. Let Kjnsubscript𝐾𝑗𝑛K_{jn}italic_K start_POSTSUBSCRIPT italic_j italic_n end_POSTSUBSCRIPT denote the number of knots to fit βj(t)subscript𝛽𝑗𝑡\beta_{j}(t)italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) growing with n𝑛nitalic_n, q𝑞qitalic_q is the order of the spline, and aL2subscriptnorm𝑎subscript𝐿2\|a\|_{L_{2}}∥ italic_a ∥ start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT be the L2subscript𝐿2L_{2}italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT norm of a square-integrable function a(t)𝑎𝑡a(t)italic_a ( italic_t ). Under Conditions (S1)𝑆1(S1)( italic_S 1 )(S6)𝑆6(S6)( italic_S 6 ), deferred in Supplementary Material S.2, and limnKnlogKn/n=0subscript𝑛subscript𝐾𝑛subscript𝐾𝑛𝑛0\lim_{n\rightarrow\infty}K_{n}\log K_{n}/n=0roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT roman_log italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT / italic_n = 0 for Kn=max1jpKjnsubscript𝐾𝑛subscriptmax1𝑗𝑝subscript𝐾𝑗𝑛K_{n}=\text{max}_{1\leq j\leq p}K_{jn}italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = max start_POSTSUBSCRIPT 1 ≤ italic_j ≤ italic_p end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_j italic_n end_POSTSUBSCRIPT, Theorem 2 of [14] and Theorem 6.25 of [26] imply the consistency of unconstrained 𝜷~(t)=(β~1(t),,β~p(t))~𝜷𝑡superscriptsubscript~𝛽1𝑡subscript~𝛽𝑝𝑡top\tilde{\boldsymbol{\beta}}(t)=\big{(}\tilde{\beta}_{1}(t),\ldots,\tilde{\beta}% _{p}(t)\big{)}^{\top}over~ start_ARG bold_italic_β end_ARG ( italic_t ) = ( over~ start_ARG italic_β end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) , … , over~ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT written as, β~jβjL22=Op(Knn1+Kn2q)subscriptsuperscriptnormsubscript~𝛽𝑗subscript𝛽𝑗2subscript𝐿2subscript𝑂𝑝subscript𝐾𝑛superscript𝑛1superscriptsubscript𝐾𝑛2𝑞\|\tilde{\beta}_{j}-\beta_{j}\|^{2}_{L_{2}}=O_{p}(K_{n}n^{-1}+K_{n}^{-2q})∥ over~ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_O start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_n start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 2 italic_q end_POSTSUPERSCRIPT ), j=1,,p.𝑗1𝑝j=1,\ldots,p.italic_j = 1 , … , italic_p . Next, we show the consistency of the constrained estimators by adding the condition (S7) specified in Supplementary Material.

Theorem 2.3.1.

When shape constraints assigned to βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is true in H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, Knj=O(n1/(2q+1))subscript𝐾𝑛𝑗𝑂superscript𝑛12𝑞1K_{nj}=O(n^{1/(2q+1)})italic_K start_POSTSUBSCRIPT italic_n italic_j end_POSTSUBSCRIPT = italic_O ( italic_n start_POSTSUPERSCRIPT 1 / ( 2 italic_q + 1 ) end_POSTSUPERSCRIPT ), and conditions (S1)𝑆1(S1)( italic_S 1 )(S7)𝑆7(S7)( italic_S 7 ) are satisfied, the constrained estimator 𝛃^(t)=(β^1(t),,β^p(t))^𝛃𝑡superscriptsubscript^𝛽1𝑡subscript^𝛽𝑝𝑡top\widehat{\boldsymbol{\beta}}(t)=\big{(}\hat{\beta}_{1}(t),\ldots,\hat{\beta}_{% p}(t)\big{)}^{\top}over^ start_ARG bold_italic_β end_ARG ( italic_t ) = ( over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) , … , over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT attains the same rate as the unconstrained estimator.

β^jβjL22=Op(Knn1+Kn2q),j=1,,p,formulae-sequencesubscriptsuperscriptnormsubscript^𝛽𝑗subscript𝛽𝑗2subscript𝐿2subscript𝑂𝑝subscript𝐾𝑛superscript𝑛1superscriptsubscript𝐾𝑛2𝑞𝑗1𝑝\|\hat{\beta}_{j}-\beta_{j}\|^{2}_{L_{2}}=O_{p}(K_{n}n^{-1}+K_{n}^{-2q}),\quad j% =1,\ldots,p,∥ over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_O start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_n start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT + italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 2 italic_q end_POSTSUPERSCRIPT ) , italic_j = 1 , … , italic_p ,

where its minimum rate Op(n2q/(2q+1))subscript𝑂𝑝superscript𝑛2𝑞2𝑞1O_{p}(n^{-2q/(2q+1)})italic_O start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_n start_POSTSUPERSCRIPT - 2 italic_q / ( 2 italic_q + 1 ) end_POSTSUPERSCRIPT ) is achieved when Kn=O(n1/(2q+1))subscript𝐾𝑛𝑂superscript𝑛12𝑞1K_{n}=O(n^{1/(2q+1)})italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_O ( italic_n start_POSTSUPERSCRIPT 1 / ( 2 italic_q + 1 ) end_POSTSUPERSCRIPT ).

Remark 3 (Convexity shape constraints).

When the convexity constraint is considered in (2.2) instead of monotonicity, we employ C𝐶Citalic_C-splines and express the coefficient function in the objective function (2.17) as βj(t)=cj0,1ϕ0,1(Ti,)+cj0,2ϕ0,2(Ti,)+k=1djcjkϕkj(Ti,),subscript𝛽𝑗𝑡subscript𝑐𝑗01subscriptitalic-ϕ01subscript𝑇𝑖subscript𝑐𝑗02subscriptitalic-ϕ02subscript𝑇𝑖superscriptsubscript𝑘1subscript𝑑𝑗subscript𝑐𝑗𝑘superscriptsubscriptitalic-ϕ𝑘𝑗subscript𝑇𝑖\beta_{j}(t)=c_{j0,1}\phi_{0,1}(T_{i,\ell})+c_{j0,2}\phi_{0,2}(T_{i,\ell})+% \sum_{k=1}^{d_{j}}c_{jk}\phi_{k}^{j}(T_{i,\ell}),italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) = italic_c start_POSTSUBSCRIPT italic_j 0 , 1 end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 , 1 end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ) + italic_c start_POSTSUBSCRIPT italic_j 0 , 2 end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT 0 , 2 end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ) + ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_j italic_k end_POSTSUBSCRIPT italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ) , where ϕ0,1(t)subscriptitalic-ϕ01𝑡\phi_{0,1}(t)italic_ϕ start_POSTSUBSCRIPT 0 , 1 end_POSTSUBSCRIPT ( italic_t ) denotes the intercept function, ϕ0,2(t)subscriptitalic-ϕ02𝑡\phi_{0,2}(t)italic_ϕ start_POSTSUBSCRIPT 0 , 2 end_POSTSUBSCRIPT ( italic_t ) is the identity function, i.e., ϕ0,2(Ti,)=Ti,subscriptitalic-ϕ02subscript𝑇𝑖subscript𝑇𝑖\phi_{0,2}(T_{i,\ell})=T_{i,\ell}italic_ϕ start_POSTSUBSCRIPT 0 , 2 end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ) = italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT, and ϕkj(t)superscriptsubscriptitalic-ϕ𝑘𝑗𝑡\phi_{k}^{j}(t)italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ( italic_t ) denotes C𝐶Citalic_C-splines under a given sequence of knots. The first two terms without constraints on cj0,1subscript𝑐𝑗01c_{j0,1}italic_c start_POSTSUBSCRIPT italic_j 0 , 1 end_POSTSUBSCRIPT and cj0,2subscript𝑐𝑗02c_{j0,2}italic_c start_POSTSUBSCRIPT italic_j 0 , 2 end_POSTSUBSCRIPT determine the first-order behaviors of functional coefficients. Then, we assign constraints cjk0subscript𝑐𝑗𝑘0c_{jk}\geq 0italic_c start_POSTSUBSCRIPT italic_j italic_k end_POSTSUBSCRIPT ≥ 0, for (j,k)𝑗𝑘(j,k)( italic_j , italic_k ), where jJ𝑗𝐽j\in Jitalic_j ∈ italic_J and kAj𝑘subscript𝐴𝑗k\in A_{j}italic_k ∈ italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT with the set Ajsubscript𝐴𝑗A_{j}italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT defined as {k{1,,dj}:ϕkj(t)′′>0,fortj}\{k\in\{1,\ldots,d_{j}\}:{\phi_{k}^{j}}{{}^{\prime\prime}}(t)>0,~{}\textrm{for% }~{}t\in\mathcal{I}_{j}\}{ italic_k ∈ { 1 , … , italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } : italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ ′ end_FLOATSUPERSCRIPT ( italic_t ) > 0 , for italic_t ∈ caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT }. The non-negativity constraints on basis coefficients associated with ϕkj(t)superscriptsubscriptitalic-ϕ𝑘𝑗𝑡\phi_{k}^{j}(t)italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ( italic_t ) having positive second derivatives within jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ensures the convexity of functional estimates over jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. For concavity constraint, we apply the same framework but with non-positivity constraints on a chosen set of cjksubscript𝑐𝑗𝑘c_{jk}italic_c start_POSTSUBSCRIPT italic_j italic_k end_POSTSUBSCRIPT. In this way, we can consider general H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT with various types of shape constraints on βj(t)subscript𝛽𝑗𝑡\beta_{j}(t)italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) over jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, for jJ𝑗𝐽j\in Jitalic_j ∈ italic_J.

2.4 Test procedure

We employ a resampling method to assess the statistical significance of the observed Dnsubscript𝐷𝑛D_{n}italic_D start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT given a random sample 𝒳nsubscript𝒳𝑛\mathcal{X}_{n}caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. We note that resampling functional observations , say 𝐘~isubscript~𝐘𝑖\widetilde{\mathbf{Y}}_{i}over~ start_ARG bold_Y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT === (Y~i(Ti,1),\big{(}\widetilde{Y}_{i}(T_{i,1}),( over~ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT ) , ,\ldots,… , Y~i(Ti,Li))\widetilde{Y}_{i}(T_{i,L_{i}})\big{)}^{\top}over~ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, is not straightforward because Y~i(Ti,1),,Y~i(Ti,Li)subscript~𝑌𝑖subscript𝑇𝑖1subscript~𝑌𝑖subscript𝑇𝑖subscript𝐿𝑖\widetilde{Y}_{i}(T_{i,1}),\ldots,\widetilde{Y}_{i}(T_{i,L_{i}})over~ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT ) , … , over~ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) should inherit the joint distribution of Yi(Ti,1),,Yi(Ti,Li)subscript𝑌𝑖subscript𝑇𝑖1subscript𝑌𝑖subscript𝑇𝑖subscript𝐿𝑖{Y}_{i}(T_{i,1}),\ldots,{Y}_{i}(T_{i,L_{i}})italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT ) , … , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) with within-subject heteroscedasticity. In literature, the wild bootstrap method [33, 13, 16] has been extensively investigated that can effectively handle the heteroscedasticity, and there have been many advances for handling dependent data [3, 7, 29, 11]. In this study, we use the wild bootstrap procedure as described in Algorithm 1, which can be regarded as the wild bootstrap for clustered data [3, 7].

Algorithm 1 Bootstrap procedure under the partial shape constraints
1:For a given a random sample 𝒳n={(𝐗i,𝐘i):i=1,,n}subscript𝒳𝑛conditional-setsubscript𝐗𝑖subscript𝐘𝑖𝑖1𝑛\mathcal{X}_{n}=\{(\mathbf{X}_{i},\mathbf{Y}_{i}):i=1,\ldots,n\}caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = { ( bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) : italic_i = 1 , … , italic_n }, find the unconstrained and shape-constrained estimators 𝜷~~𝜷\widetilde{\boldsymbol{\beta}}over~ start_ARG bold_italic_β end_ARG and 𝜷^^𝜷\widehat{\boldsymbol{\beta}}over^ start_ARG bold_italic_β end_ARG, respectively.
2:Set a bootstrap size B1𝐵1B\geq 1italic_B ≥ 1.
3:Compute functional residuals;
4:for i=1𝑖1i=1italic_i = 1 to n𝑛nitalic_n do
5:     for =11\ell=1roman_ℓ = 1 to Lisubscript𝐿𝑖L_{i}italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT do
6:         e~i,Yi,𝐗i𝜷~(Ti,)subscript~𝑒𝑖subscript𝑌𝑖superscriptsubscript𝐗𝑖top~𝜷subscript𝑇𝑖\widetilde{e}_{i,\ell}\leftarrow Y_{i,\ell}-\mathbf{X}_{i}^{\top}\widetilde{% \boldsymbol{\beta}}(T_{i,\ell})over~ start_ARG italic_e end_ARG start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ← italic_Y start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over~ start_ARG bold_italic_β end_ARG ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ).
7:     end for
8:end for
9:Generate B𝐵Bitalic_B bootstrap samples under the null hypothesis;
10:for b=1𝑏1b=1italic_b = 1 to B𝐵Bitalic_B do
11:     for i=1𝑖1i=1italic_i = 1 to n𝑛nitalic_n do
12:         for =11\ell=1roman_ℓ = 1 to Lisubscript𝐿𝑖L_{i}italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT do
13:              Sample vi,(b){512,5+12}superscriptsubscript𝑣𝑖𝑏512512v_{i,\ell}^{(b)}\in\big{\{}-\frac{\sqrt{5}-1}{2},\frac{\sqrt{5}+1}{2}\big{\}}italic_v start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_b ) end_POSTSUPERSCRIPT ∈ { - divide start_ARG square-root start_ARG 5 end_ARG - 1 end_ARG start_ARG 2 end_ARG , divide start_ARG square-root start_ARG 5 end_ARG + 1 end_ARG start_ARG 2 end_ARG } with probability (5+125,5125)51255125\Big{(}\frac{\sqrt{5}+1}{2\sqrt{5}},\frac{\sqrt{5}-1}{2\sqrt{5}}\Big{)}( divide start_ARG square-root start_ARG 5 end_ARG + 1 end_ARG start_ARG 2 square-root start_ARG 5 end_ARG end_ARG , divide start_ARG square-root start_ARG 5 end_ARG - 1 end_ARG start_ARG 2 square-root start_ARG 5 end_ARG end_ARG ).
14:              Y~i,(b)𝐗i𝜷^(Ti,)+vi,(b)e~i,superscriptsubscript~𝑌𝑖𝑏superscriptsubscript𝐗𝑖top^𝜷subscript𝑇𝑖superscriptsubscript𝑣𝑖𝑏subscript~𝑒𝑖\widetilde{Y}_{i,\ell}^{(b)}\leftarrow\mathbf{X}_{i}^{\top}\widehat{% \boldsymbol{\beta}}(T_{i,\ell})+v_{i,\ell}^{(b)}\widetilde{e}_{i,\ell}over~ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_b ) end_POSTSUPERSCRIPT ← bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over^ start_ARG bold_italic_β end_ARG ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ) + italic_v start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_b ) end_POSTSUPERSCRIPT over~ start_ARG italic_e end_ARG start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT.
15:         end for
16:     end for
17:     𝒳n(b){(𝐘~i(b),𝐗i):i=1,,n}superscriptsubscript𝒳𝑛𝑏conditional-setsuperscriptsubscript~𝐘𝑖𝑏subscript𝐗𝑖𝑖1𝑛\mathcal{X}_{n}^{(b)}\leftarrow\{(\widetilde{\mathbf{Y}}_{i}^{(b)},\mathbf{X}_% {i}):i=1,\ldots,n\}caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_b ) end_POSTSUPERSCRIPT ← { ( over~ start_ARG bold_Y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_b ) end_POSTSUPERSCRIPT , bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) : italic_i = 1 , … , italic_n }.
18:     Find the shape-constrained 𝜷^(b)superscript^𝜷𝑏\widehat{\boldsymbol{\beta}}^{(b)}over^ start_ARG bold_italic_β end_ARG start_POSTSUPERSCRIPT ( italic_b ) end_POSTSUPERSCRIPT and unconstrained 𝜷~(b)superscript~𝜷𝑏\widetilde{\boldsymbol{\beta}}^{(b)}over~ start_ARG bold_italic_β end_ARG start_POSTSUPERSCRIPT ( italic_b ) end_POSTSUPERSCRIPT based on 𝒳n(b)superscriptsubscript𝒳𝑛𝑏\mathcal{X}_{n}^{(b)}caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_b ) end_POSTSUPERSCRIPT.
19:     Compute Dn(b)superscriptsubscript𝐷𝑛𝑏D_{n}^{(b)}italic_D start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_b ) end_POSTSUPERSCRIPT based on 𝜷^(b)superscript^𝜷𝑏\widehat{\boldsymbol{\beta}}^{(b)}over^ start_ARG bold_italic_β end_ARG start_POSTSUPERSCRIPT ( italic_b ) end_POSTSUPERSCRIPT, 𝜷~(b)superscript~𝜷𝑏\widetilde{\boldsymbol{\beta}}^{(b)}over~ start_ARG bold_italic_β end_ARG start_POSTSUPERSCRIPT ( italic_b ) end_POSTSUPERSCRIPT and 𝒳n(b)superscriptsubscript𝒳𝑛𝑏\mathcal{X}_{n}^{(b)}caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_b ) end_POSTSUPERSCRIPT.
20:end for

Once B𝐵Bitalic_B bootstrap samples 𝒳n(1),,𝒳n(B)superscriptsubscript𝒳𝑛1superscriptsubscript𝒳𝑛𝐵\mathcal{X}_{n}^{(1)},\ldots,\mathcal{X}_{n}^{(B)}caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 1 ) end_POSTSUPERSCRIPT , … , caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_B ) end_POSTSUPERSCRIPT are independently generated, we calculate the shape-constrained and unconstrained estimates 𝜷^(b)superscript^𝜷𝑏\widehat{\boldsymbol{\beta}}^{(b)}over^ start_ARG bold_italic_β end_ARG start_POSTSUPERSCRIPT ( italic_b ) end_POSTSUPERSCRIPT and 𝜷~(b)superscript~𝜷𝑏\widetilde{\boldsymbol{\beta}}^{(b)}over~ start_ARG bold_italic_β end_ARG start_POSTSUPERSCRIPT ( italic_b ) end_POSTSUPERSCRIPT, and calculate the test statistic Dn(b)superscriptsubscript𝐷𝑛𝑏D_{n}^{(b)}italic_D start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_b ) end_POSTSUPERSCRIPT, for each b=1,,B𝑏1𝐵b=1,\ldots,Bitalic_b = 1 , … , italic_B. These bootstrap test statistics are compared to the original test statistic Dnsubscript𝐷𝑛D_{n}italic_D start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT to determine the bootstrap p𝑝pitalic_p-value, pn=B1b=1B𝕀(Dn(b)>Dn)subscript𝑝𝑛superscript𝐵1superscriptsubscript𝑏1𝐵𝕀superscriptsubscript𝐷𝑛𝑏subscript𝐷𝑛p_{n}=B^{-1}\sum_{b=1}^{B}\mathbb{I}(D_{n}^{(b)}>D_{n})italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_B start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_b = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT blackboard_I ( italic_D start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_b ) end_POSTSUPERSCRIPT > italic_D start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ). Then, we reject the null hypothesis H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT if pn<αsubscript𝑝𝑛𝛼p_{n}<\alphaitalic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT < italic_α, and retain H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT otherwise at the significance level α(0,1)𝛼01\alpha\in(0,1)italic_α ∈ ( 0 , 1 ). Our numerical study (Section 3) demonstrates that the proposed test procedure fulfills consistency in the sense that the type I error rate meets the significance level as specified and that the power of the test increases as sample size increases.

3 Simulation Study

We demonstrate the finite-sample performance of the proposed methods with numerical simulations. As described in Section 2, functional observations are assumed to be only partially available in our simulation study. Specifically, we first generate subject-specific evaluation points 𝐓i=(Ti,1,,Ti,Li)subscript𝐓𝑖subscript𝑇𝑖1subscript𝑇𝑖subscript𝐿𝑖\mathbf{T}_{i}=(T_{i,1},\ldots,T_{i,L_{i}})bold_T start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ( italic_T start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT , … , italic_T start_POSTSUBSCRIPT italic_i , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) independently drawn from Beta(1,1.25)Beta11.25\mathrm{Beta}(1,1.25)roman_Beta ( 1 , 1.25 ), which is a right-skewed distribution on [0,1]01[0,1][ 0 , 1 ]. Here, Lisubscript𝐿𝑖L_{i}italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is a uniform random integer on [5,15]515[5,15][ 5 , 15 ], representing the bounded number of sparse and irregular observations. Then, longitudinal observations 𝐘i=(Yi(Ti,1),,Yi(Ti,Li))subscript𝐘𝑖superscriptsubscript𝑌𝑖subscript𝑇𝑖1subscript𝑌𝑖subscript𝑇𝑖subscript𝐿𝑖top\mathbf{Y}_{i}=\big{(}Y_{i}(T_{i,1}),\ldots,Y_{i}(T_{i,L_{i}})\big{)}^{\top}bold_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ( italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT ) , … , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT of each functional response Yisubscript𝑌𝑖Y_{i}italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are generated by the model (2.1) associated with four functional coefficients {βj:j=1,4}conditional-setsubscript𝛽𝑗𝑗14\{\beta_{j}:j=1,\ldots 4\}{ italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT : italic_j = 1 , … 4 } depicted in Figure 3(a), where the random vector 𝐗i=(Xi,1,,Xi,4)subscript𝐗𝑖superscriptsubscript𝑋𝑖1subscript𝑋𝑖4top\mathbf{X}_{i}=(X_{i,1},\ldots,X_{i,4})^{\top}bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ( italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_i , 4 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT of scalar covariates is given by Xi,j=(Ui,j+0.5Ui,5)/(1+0.5)subscript𝑋𝑖𝑗subscript𝑈𝑖𝑗0.5subscript𝑈𝑖510.5X_{i,j}=(U_{i,j}+0.5U_{i,5})/(1+0.5)italic_X start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT = ( italic_U start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT + 0.5 italic_U start_POSTSUBSCRIPT italic_i , 5 end_POSTSUBSCRIPT ) / ( 1 + 0.5 ) with independent uniform random variables Ui,1,,Ui,5subscript𝑈𝑖1subscript𝑈𝑖5U_{i,1},\ldots,U_{i,5}italic_U start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT , … , italic_U start_POSTSUBSCRIPT italic_i , 5 end_POSTSUBSCRIPT on [0,1]01[0,1][ 0 , 1 ]. This makes the four components Xi,1,,Xi,4subscript𝑋𝑖1subscript𝑋𝑖4X_{i,1},\ldots,X_{i,4}italic_X start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_i , 4 end_POSTSUBSCRIPT correlated. Moreover, to introduce the within-subject dependency of longitudinal observations, we set the functional noise as εi(t)=k=150(ξi,k/k)2sin(2kπt)+ϵi(t)subscript𝜀𝑖𝑡superscriptsubscript𝑘150subscript𝜉𝑖𝑘𝑘22𝑘𝜋𝑡subscriptitalic-ϵ𝑖𝑡\varepsilon_{i}(t)=\sum_{k=1}^{50}(\xi_{i,k}/\sqrt{k})\sqrt{2}\sin(2k\pi t)+% \epsilon_{i}(t)italic_ε start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 50 end_POSTSUPERSCRIPT ( italic_ξ start_POSTSUBSCRIPT italic_i , italic_k end_POSTSUBSCRIPT / square-root start_ARG italic_k end_ARG ) square-root start_ARG 2 end_ARG roman_sin ( 2 italic_k italic_π italic_t ) + italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ), where 𝝃i=(ξi,1,,ξi,50)subscript𝝃𝑖superscriptsubscript𝜉𝑖1subscript𝜉𝑖50top\boldsymbol{\xi}_{i}=(\xi_{i,1},\ldots,\xi_{i,50})^{\top}bold_italic_ξ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ( italic_ξ start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT , … , italic_ξ start_POSTSUBSCRIPT italic_i , 50 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT has a multivariate normal distribution with mean zero and Cov(ξi,k,ξi,k)=0.5|kk|Covsubscript𝜉𝑖𝑘subscript𝜉𝑖superscript𝑘superscript0.5𝑘superscript𝑘\mathrm{Cov}(\xi_{i,k},\xi_{i,k^{\prime}})=0.5^{|k-k^{\prime}|}roman_Cov ( italic_ξ start_POSTSUBSCRIPT italic_i , italic_k end_POSTSUBSCRIPT , italic_ξ start_POSTSUBSCRIPT italic_i , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) = 0.5 start_POSTSUPERSCRIPT | italic_k - italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | end_POSTSUPERSCRIPT and {ϵi(t):t[0,1]}conditional-setsubscriptitalic-ϵ𝑖𝑡𝑡01\{\epsilon_{i}(t):t\in[0,1]\}{ italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) : italic_t ∈ [ 0 , 1 ] } is a Gaussian white noise process with mean zero and Var(ϵi(t))=0.12Varsubscriptitalic-ϵ𝑖𝑡superscript0.12\mathrm{Var}(\epsilon_{i}(t))=0.1^{2}roman_Var ( italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) ) = 0.1 start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Then, we have a random sample 𝒳n={(𝐗i,𝐘i):i=1,,n}subscript𝒳𝑛conditional-setsubscript𝐗𝑖subscript𝐘𝑖𝑖1𝑛\mathcal{X}_{n}=\{(\mathbf{X}_{i},\mathbf{Y}_{i}):i=1,\ldots,n\}caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = { ( bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) : italic_i = 1 , … , italic_n } of size n{100,500}𝑛100500n\in\{100,500\}italic_n ∈ { 100 , 500 }.

Refer to caption
Refer to caption
Refer to caption
Refer to caption
(a)
Refer to caption
(b)
Figure 3: True regression coefficient functions associated with the function-on-scalar regression model (2.1) are depicted in the left panel. The right panel illustrates empirical sample paths of 50505050 functional responses 𝐘isubscript𝐘𝑖\mathbf{Y}_{i}bold_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT generated with right-skewed evaluation points 𝐓isubscript𝐓𝑖\mathbf{T}_{i}bold_T start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. We consider testing monotone-increasing hypotheses of β1subscript𝛽1\beta_{1}italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and β3subscript𝛽3\beta_{3}italic_β start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT on sub-intervals of inferential interests.

We consider testing a composite null hypothesis

H0:β1 and β3 are monotone increasing on ,:subscript𝐻0β1 and β3 are monotone increasing on H_{0}:\textrm{$\beta_{1}$ and $\beta_{3}$ are monotone increasing on $\mathcal% {I}$},italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and italic_β start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT are monotone increasing on caligraphic_I , (3.1)

where [0,1]01\mathcal{I}\subset[0,1]caligraphic_I ⊂ [ 0 , 1 ] is a pre-specified sub-interval to which the partial shape constraints of interest apply. This setup is intended to mock up one of the common situations in longitudinal studies such that patients drop out of the study or transfer to other clinics. For example, as the vertical rugs show that evaluation points are right-skewed in Figure 3(b), fewer observations are available near t=1𝑡1t=1italic_t = 1. Still, verifying the partial shape constraints on regression coefficient functions near t=1𝑡1t=1italic_t = 1 can be of inferential interest. For a systematic assessment of the numerical experiments, we consider ten simulation scenarios of the sub-interval =[0,0.5]00.5\mathcal{I}=[0,0.5]caligraphic_I = [ 0 , 0.5 ], [0.1,0.5]0.10.5[0.1,0.5][ 0.1 , 0.5 ], [0.2,0.5]0.20.5[0.2,0.5][ 0.2 , 0.5 ], [0.3,0.5]0.30.5[0.3,0.5][ 0.3 , 0.5 ], [0.4,0.5]0.40.5[0.4,0.5][ 0.4 , 0.5 ], [0.5,1]0.51[0.5,1][ 0.5 , 1 ], [0.6,1]0.61[0.6,1][ 0.6 , 1 ], [0.7,1]0.71[0.7,1][ 0.7 , 1 ], [0.8,1]0.81[0.8,1][ 0.8 , 1 ], and [0.9,1]0.91[0.9,1][ 0.9 , 1 ]. With the graphical illustration of the true regression coefficient functions in Figure 3(a), we note that the null hypothesis (3.1) is true if [0,0.5]00.5\mathcal{I}\subset[0,0.5]caligraphic_I ⊂ [ 0 , 0.5 ], and false otherwise.

To demonstrate the consistency of the proposed method subject to the different scenarios of \mathcal{I}caligraphic_I, we evaluate the power of the test by the Monte Carlo approximation as

γα()=1Rr=1R𝕀(pn(r)<α)subscript𝛾𝛼1𝑅superscriptsubscript𝑟1𝑅𝕀superscriptsubscript𝑝𝑛𝑟𝛼\displaystyle\gamma_{\alpha}(\mathcal{I})=\frac{1}{R}\sum_{r=1}^{R}\mathbb{I}% \big{(}p_{n}^{(r)}<\alpha\big{)}italic_γ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( caligraphic_I ) = divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_r = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT blackboard_I ( italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_r ) end_POSTSUPERSCRIPT < italic_α ) (3.2)

at the significance level α(0,1)𝛼01\alpha\in(0,1)italic_α ∈ ( 0 , 1 ), where pn(r)superscriptsubscript𝑝𝑛𝑟p_{n}^{(r)}italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_r ) end_POSTSUPERSCRIPT is the p𝑝pitalic_p-value obtained in each r𝑟ritalic_r-th Monte Carlo repetition under the null hypothesis (2.2). Since γα()subscript𝛾𝛼\gamma_{\alpha}(\mathcal{I})italic_γ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( caligraphic_I ) gives the empirical level of the test, we can validate how much the proposed test keeps the test level as specified with different significance levels. Under the general alternative, γα()subscript𝛾𝛼\gamma_{\alpha}(\mathcal{I})italic_γ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( caligraphic_I ) corresponds to the empirical power of the test, and we can examine how fast γα()subscript𝛾𝛼\gamma_{\alpha}(\mathcal{I})italic_γ start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( caligraphic_I ) approaches 1 as the sample size increases.

Besides the consistency of the partially shape-constrained inference, we evaluate the consistency of estimation with the integrated squared bias (ISB) and the integrated variance (IVar),

ISB()=1||𝜷^¯(t)𝜷(t)2dt,IVar()=1Rr=1R1||𝜷^(r)(t)𝜷^¯(t)2dt,formulae-sequenceISB1subscriptsuperscriptnorm¯^𝜷𝑡𝜷𝑡2differential-d𝑡IVar1𝑅superscriptsubscript𝑟1𝑅1subscriptsuperscriptnormsuperscript^𝜷𝑟𝑡¯^𝜷𝑡2differential-d𝑡\displaystyle\textrm{ISB}(\mathcal{I})=\frac{1}{|\mathcal{I}|}\int_{\mathcal{I% }}\Big{\|}\overline{\widehat{\boldsymbol{\beta}}}(t)-\boldsymbol{\beta}(t)\Big% {\|}^{2}\,\mathrm{d}t,\quad\textrm{IVar}(\mathcal{I})=\frac{1}{R}\sum_{r=1}^{R% }\frac{1}{|\mathcal{I}|}\int_{\mathcal{I}}\Big{\|}\widehat{\boldsymbol{\beta}}% ^{(r)}(t)-\overline{\widehat{\boldsymbol{\beta}}}(t)\Big{\|}^{2}\,\mathrm{d}t,ISB ( caligraphic_I ) = divide start_ARG 1 end_ARG start_ARG | caligraphic_I | end_ARG ∫ start_POSTSUBSCRIPT caligraphic_I end_POSTSUBSCRIPT ∥ over¯ start_ARG over^ start_ARG bold_italic_β end_ARG end_ARG ( italic_t ) - bold_italic_β ( italic_t ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_d italic_t , IVar ( caligraphic_I ) = divide start_ARG 1 end_ARG start_ARG italic_R end_ARG ∑ start_POSTSUBSCRIPT italic_r = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG | caligraphic_I | end_ARG ∫ start_POSTSUBSCRIPT caligraphic_I end_POSTSUBSCRIPT ∥ over^ start_ARG bold_italic_β end_ARG start_POSTSUPERSCRIPT ( italic_r ) end_POSTSUPERSCRIPT ( italic_t ) - over¯ start_ARG over^ start_ARG bold_italic_β end_ARG end_ARG ( italic_t ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_d italic_t , (3.3)

where 𝜷^¯(t)=R1r=1R𝜷^(r)(t)¯^𝜷𝑡superscript𝑅1superscriptsubscript𝑟1𝑅superscript^𝜷𝑟𝑡\overline{\widehat{\boldsymbol{\beta}}}(t)=R^{-1}\sum_{r=1}^{R}\widehat{% \boldsymbol{\beta}}^{(r)}(t)over¯ start_ARG over^ start_ARG bold_italic_β end_ARG end_ARG ( italic_t ) = italic_R start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_r = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT over^ start_ARG bold_italic_β end_ARG start_POSTSUPERSCRIPT ( italic_r ) end_POSTSUPERSCRIPT ( italic_t ) is the average of the shape-constrained estimates obtained from the repeated Monte Carlo experiments. In (3.3), we normalize the ISB and IVar by |||\mathcal{I}|| caligraphic_I | to easily compare the trend of numerical performances obtained from the different lengths of sub-intervals. Combining the above (3.2) and (3.3), we can assess the sensitivity and specificity of the proposed test procedure subject to the location and length of the sub-interval. We mainly report the simulation results obtained with B=200𝐵200B=200italic_B = 200 and R=200𝑅200R=200italic_R = 200, but our background simulation study gave the same lessons with larger B𝐵Bitalic_B and R𝑅Ritalic_R. We refer the readers to Section S.3 of the Supplementary Materials for the implementation details, including the bandwidth and knots selection for the shape-constrained kernel smoothing and regression spline, respectively.

We also consider a sub-cohort analysis for a comparative study, shedding light on another essential feature of the proposed method. As mentioned in the Introduction section, one may argue that it is more appropriate to simply treat (3.1) as global shape constraints “β1|evaluated-atsubscript𝛽1\beta_{1}|_{\mathcal{I}}italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | start_POSTSUBSCRIPT caligraphic_I end_POSTSUBSCRIPT and β3|evaluated-atsubscript𝛽3\beta_{3}|_{\mathcal{I}}italic_β start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | start_POSTSUBSCRIPT caligraphic_I end_POSTSUBSCRIPT are globally monotone increasing” under the model (2.1) restricted to \mathcal{I}caligraphic_I. In this concern, we conduct the sub-cohort analysis using the same inferential procedure as the proposed method but only utilizing the partial data 𝒳n|={(𝐘i|,𝐗i):i=1,,n}\mathcal{X}_{n}|_{\mathcal{I}}=\{(\mathbf{Y}_{i}|_{\mathcal{I}},\mathbf{X}_{i}% ):i=1,\ldots,n\}caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | start_POSTSUBSCRIPT caligraphic_I end_POSTSUBSCRIPT = { ( bold_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | start_POSTSUBSCRIPT caligraphic_I end_POSTSUBSCRIPT , bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) : italic_i = 1 , … , italic_n }, where 𝐘i|={Yi,:Ti,,=1,,Li}evaluated-atsubscript𝐘𝑖conditional-setsubscript𝑌𝑖formulae-sequencesubscript𝑇𝑖1subscript𝐿𝑖\mathbf{Y}_{i}|_{\mathcal{I}}=\{Y_{i,\ell}:T_{i,\ell}\in\mathcal{I},\,\ell=1,% \ldots,L_{i}\}bold_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | start_POSTSUBSCRIPT caligraphic_I end_POSTSUBSCRIPT = { italic_Y start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT : italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ∈ caligraphic_I , roman_ℓ = 1 , … , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT }. In contrast to the sub-cohort analysis, we call our proposed method the “full cohort” analysis since it uses all available observations in 𝒳nsubscript𝒳𝑛\mathcal{X}_{n}caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT for estimation regardless of \mathcal{I}caligraphic_I. As seen below, our simulation result indicates that the sub-cohort analysis suffers from a significant loss of information. The more \mathcal{I}caligraphic_I is located in the second half of the domain, the smaller the statistical power one may expect. The situation becomes much worse depending on the length of \mathcal{I}caligraphic_I and the sparsity of functional evaluations on it. Through our simulation study, we also demonstrate that the proposed test procedure is robust against the location of sub-intervals.

Table 1: Simulation results for the partially shape-constrained estimates corresponding to the kernel and spline methods. The estimation and test performance are evaluated with the integrated squared bias (ISB), the integrated variance (IVar), the type I error rate, and the power of the test, defined as (3.2) and (3.3).
Sample size Dataset Criterion H0:β1:subscript𝐻0subscript𝛽1H_{0}:\beta_{1}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and β3subscript𝛽3\beta_{3}italic_β start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT are monotone increasing on \mathcal{I}caligraphic_I.
Kernel least squares Spline regression
=[0,0.5]00.5\mathcal{I}=\mathcal{[}0,0.5]caligraphic_I = [ 0 , 0.5 ] =[0.4,0.5]0.40.5\mathcal{I}=\mathcal{[}0.4,0.5]caligraphic_I = [ 0.4 , 0.5 ] =[0,0.5]00.5\mathcal{I}=\mathcal{[}0,0.5]caligraphic_I = [ 0 , 0.5 ] =[0.4,0.5]0.40.5\mathcal{I}=\mathcal{[}0.4,0.5]caligraphic_I = [ 0.4 , 0.5 ]
n=100𝑛100n=100italic_n = 100 Full data (γ0.05,γ0.1)subscript𝛾0.05subscript𝛾0.1(\gamma_{0.05},\gamma_{0.1})( italic_γ start_POSTSUBSCRIPT 0.05 end_POSTSUBSCRIPT , italic_γ start_POSTSUBSCRIPT 0.1 end_POSTSUBSCRIPT ) (0.06,0.11)0.060.11(0.06,0.11)( 0.06 , 0.11 ) (0.05,0.11)0.050.11(0.05,0.11)( 0.05 , 0.11 ) (0.04,0.07)0.040.07(0.04,0.07)( 0.04 , 0.07 ) (0.08,0.11)0.080.11(0.08,0.11)( 0.08 , 0.11 )
ISBISB\mathrm{ISB}roman_ISB 0.03020.03020.03020.0302 0.00730.00730.00730.0073 0.01010.01010.01010.0101 0.00330.00330.00330.0033
IVarIVar\mathrm{IVar}roman_IVar 0.40340.40340.40340.4034 0.53460.53460.53460.5346 0.37370.37370.37370.3737 0.20540.20540.20540.2054
Partial data (γ0.05,γ0.1)subscript𝛾0.05subscript𝛾0.1(\gamma_{0.05},\gamma_{0.1})( italic_γ start_POSTSUBSCRIPT 0.05 end_POSTSUBSCRIPT , italic_γ start_POSTSUBSCRIPT 0.1 end_POSTSUBSCRIPT ) (0.06,0.11)0.060.11(0.06,0.11)( 0.06 , 0.11 ) (0.09,0.19)0.090.19(0.09,0.19)( 0.09 , 0.19 ) (0.05,0.11)0.050.11(0.05,0.11)( 0.05 , 0.11 ) (0.06,0.12)0.060.12(0.06,0.12)( 0.06 , 0.12 )
restricted to \mathcal{I}caligraphic_I ISBISB\mathrm{ISB}roman_ISB 0.05170.05170.05170.0517 0.07680.07680.07680.0768 0.01760.01760.01760.0176 0.11560.11560.11560.1156
IVarIVar\mathrm{IVar}roman_IVar 0.53350.53350.53350.5335 1.50281.50281.50281.5028 0.57580.57580.57580.5758 1.68961.68961.68961.6896
n=500𝑛500n=500italic_n = 500 Full data (γ0.05,γ0.1)subscript𝛾0.05subscript𝛾0.1(\gamma_{0.05},\gamma_{0.1})( italic_γ start_POSTSUBSCRIPT 0.05 end_POSTSUBSCRIPT , italic_γ start_POSTSUBSCRIPT 0.1 end_POSTSUBSCRIPT ) (0.05,0.10)0.050.10(0.05,0.10)( 0.05 , 0.10 ) (0.05,0.09)0.050.09(0.05,0.09)( 0.05 , 0.09 ) (0.04,0.08)0.040.08(0.04,0.08)( 0.04 , 0.08 ) (0.05,0.08)0.050.08(0.05,0.08)( 0.05 , 0.08 )
ISBISB\mathrm{ISB}roman_ISB 0.01020.01020.01020.0102 0.00180.00180.00180.0018 0.00170.00170.00170.0017 0.00160.00160.00160.0016
IVarIVar\mathrm{IVar}roman_IVar 0.10100.10100.10100.1010 0.11350.11350.11350.1135 0.09460.09460.09460.0946 0.05720.05720.05720.0572
Partial data (γ0.05,γ0.1)subscript𝛾0.05subscript𝛾0.1(\gamma_{0.05},\gamma_{0.1})( italic_γ start_POSTSUBSCRIPT 0.05 end_POSTSUBSCRIPT , italic_γ start_POSTSUBSCRIPT 0.1 end_POSTSUBSCRIPT ) (0.04,0.10)0.040.10(0.04,0.10)( 0.04 , 0.10 ) (0.07,0.13)0.070.13(0.07,0.13)( 0.07 , 0.13 ) (0.05,0.11)0.050.11(0.05,0.11)( 0.05 , 0.11 ) (0.04,0.08)0.040.08(0.04,0.08)( 0.04 , 0.08 )
restricted to \mathcal{I}caligraphic_I ISBISB\mathrm{ISB}roman_ISB 0.01290.01290.01290.0129 0.01100.01100.01100.0110 0.00220.00220.00220.0022 0.01090.01090.01090.0109
IVarIVar\mathrm{IVar}roman_IVar 0.14970.14970.14970.1497 0.32290.32290.32290.3229 0.11680.11680.11680.1168 0.27980.27980.27980.2798

We report the simulation results in Table 1 and Figure 4, highlighting the main lessons we obtained in the power analysis and estimation performance. The additional simulation results are provided in Section S.7 of the Supplementary Materials. The proposed partially shape-constrained inference (full cohort analysis) outperforms the sub-cohort analysis with the global shape constraints. Specifically, Table 1 shows that both the kernel and spline methods yield consistent estimates in the sense that ISB and IVar decrease as the sample size increases, which is well-aligned with the large sample properties we investigated in Section 2. Figure 4 shows that the proposed test procedure consistently meets the level of the test as specified (α=0.05,0.1𝛼0.050.1\alpha=0.05,0.1italic_α = 0.05 , 0.1) across different scenarios varying with the length of sub-intervals from 0.10.10.10.1 to 0.50.50.50.5.

We note that the proposed method keeps the significance level under the null hypothesis and generally attains a greater power under the general alternative, while the sub-cohort analysis often violates the pre-specified level of the test under the null hypothesis. For the sub-interval =[0.9,1]0.91\mathcal{I}=[0.9,1]caligraphic_I = [ 0.9 , 1 ], the sub-cohort analysis appears more sensitive to the general alternative than the full cohort analysis. However, as shown in Figure 1(b) in the Supplementary Materials, the shape-constrained estimates of the sub-cohort analysis suffer from significant bias due to the low sample size with 𝒳n|=[0.9,1]evaluated-atsubscript𝒳𝑛0.91\mathcal{X}_{n}|_{\mathcal{I}=[0.9,1]}caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | start_POSTSUBSCRIPT caligraphic_I = [ 0.9 , 1 ] end_POSTSUBSCRIPT. A similar issue arises with the sub-interval =[0.4,0.5]0.40.5\mathcal{I}=[0.4,0.5]caligraphic_I = [ 0.4 , 0.5 ], where the sub-cohort analysis exhibits a higher type I error rate compared to the full cohort analysis. Therefore, we do not recommend using the sub-cohort analysis for verifying partial shape constraints over short sub-intervals.

Refer to caption
Refer to caption
Figure 4: The power analysis of the partially shape-constrained kernel method (2.9) subject to shape constraints (3.1). The sensitivity and specificity are evaluated at 5%percent55\%5 % significance level under different scenarios of locations and lengths of the sub-interval as =[0,0.5]00.5\mathcal{I}=[0,0.5]caligraphic_I = [ 0 , 0.5 ], [0.1,0.5]0.10.5[0.1,0.5][ 0.1 , 0.5 ], [0.2,0.5]0.20.5[0.2,0.5][ 0.2 , 0.5 ], [0.3,0.5]0.30.5[0.3,0.5][ 0.3 , 0.5 ], [0.4,0.5]0.40.5[0.4,0.5][ 0.4 , 0.5 ], [0.5,1]0.51[0.5,1][ 0.5 , 1 ], [0.6,1]0.61[0.6,1][ 0.6 , 1 ], [0.7,1]0.71[0.7,1][ 0.7 , 1 ], [0.8,1]0.81[0.8,1][ 0.8 , 1 ], and [0.9,1]0.91[0.9,1][ 0.9 , 1 ].

4 Real Data Applications

We illustrate the application of shape-constrained inference to two datasets from clinical trials and demonstrate how we figure out treatment efficacy over time. The proposed tools help summarize dynamic efficacy patterns so that practitioners better understand when the maximum treatment effect is achieved and decide the appropriate treatment frequencies. Compared to existing studies making such conclusions relying on visualization of estimated regression coefficient functions or based on inference over the entire domain, our method enables providing statistical evidence on refined conclusions on any sub-intervals of interest. Although we illustrate examples from clinical trials, the proposed tool can be applied to any field with similar interests.

4.1 Application 1: Cervical Dystonia Dataset

Cervical dystonia is a painful neurological condition that causes the head to twist or turn to one side because of involuntary muscle contractions in the neck. Although this rare disorder can occur at any age, it most often occurs in middle-aged people, women more than men. The dataset is collected from a randomized placebo-controlled trial of botulinum toxin (botox) B, where 109 participants across 9 sites were assigned to placebo (36 subjects), 5000 units of botox B (36 subjects), or 10,000 units of botox B (37 subjects), injected into the affected muscle to partially paralyze it and make it relax. The response variable is the score on the Toronto Western Spasmodic Torticollis Rating Scale (TWSTRS), measuring the severity, pain, and disability of cervical dystonia, where higher scores mean more impairment on a 0-87 scale. The TWSTRS is administered at baseline (week 0) and at weeks 2, 4, 8, 12, and 16 thereafter. This study was originally conducted by [2], and 96.5% of subjects were followed up at designated weeks, on average. Besides, the dataset contains information on the age and sex of the subjects. The data is available in R through the package ‘medicaldata’ (https://github.com/higgi13425/medicaldata). While [2] or [8] focus on demonstrating the efficacy of botox B in cervical dystonia during weeks of clinical trial under longitudinal models, our application aims to assess the pattern of drug efficacy over weeks by applying shape-constrained inference on various sub-intervals and investigate when the maximum efficacy is achieved. To do this, we introduce the indicator variables Disubscript𝐷𝑖D_{i}italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, where Di=1subscript𝐷𝑖1D_{i}=1italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 if the drug is assigned to i𝑖iitalic_ith subject, and Di=0subscript𝐷𝑖0D_{i}=0italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 0, otherwise. We then consider the following regression model

Yi(t)=β0(t)+β1(t)Di+β2(t)Agei+β3(t)Sexi+ϵi(t)(t[0,16]),subscript𝑌𝑖𝑡subscript𝛽0𝑡subscript𝛽1𝑡subscript𝐷𝑖subscript𝛽2𝑡subscriptAge𝑖subscript𝛽3𝑡subscriptSex𝑖subscriptitalic-ϵ𝑖𝑡𝑡016Y_{i}(t)=\beta_{0}(t)+\beta_{1}(t)D_{i}+\beta_{2}(t)*\texttt{Age}_{i}+\beta_{3% }(t)*\texttt{Sex}_{i}+\epsilon_{i}(t)\quad(t\in[0,16]),italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) = italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) + italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_t ) ∗ Age start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_t ) ∗ Sex start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) ( italic_t ∈ [ 0 , 16 ] ) , (4.1)
Table 2: p𝑝pitalic_p-values from Cervical Dystonia data under corresponding shape-constrained null hypotheses with given sub-intervals using kernel-based and spline-based estimation.
Hypothesis p𝑝pitalic_p-values
Kernel-based Spline-based
β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) is monotone decreasing on [0,4]04[0,4][ 0 , 4 ] 0.36 0.45
β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) is monotone increasing on [0,4]04[0,4][ 0 , 4 ] 0.03 0.02
β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) is monotone decreasing on [4,16]416[4,16][ 4 , 16 ] 0.07 0.01
β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) is monotone increasing on [4,16]416[4,16][ 4 , 16 ] 0.84 0.48

Table 2 contains the results of the monotonicity tests, H0:β1(t):subscript𝐻0subscript𝛽1𝑡H_{0}:\beta_{1}(t)italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) is monotone decreasing (increasing) over two sub-intervals, =[0,4]04\mathcal{I}=[0,4]caligraphic_I = [ 0 , 4 ] and [4,16]416[4,16][ 4 , 16 ], from the kernel- and spline-based tests using bootstrap size B=500𝐵500B=500italic_B = 500. The spline-based approach specifically adopts piece-wise quadratic I𝐼Iitalic_I-spline basis functions with one internal knot located at t=8𝑡8t=8italic_t = 8, determined by cross-validation as detailed in Supplementary Material S.3. To estimate β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) with shape restriction over =[0,4]04\mathcal{I}=[0,4]caligraphic_I = [ 0 , 4 ] or [4,16]416[4,16][ 4 , 16 ], we locate one additional knot at t=4𝑡4t=4italic_t = 4 only for β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) to present desired shape under given boundaries. The kernel-based results are obtained from estimates based on optimal bandwidth h3.2(=0.2×16)annotated3.2absent0.216h\approx 3.2(=0.2\times 16)italic_h ≈ 3.2 ( = 0.2 × 16 ) with the detailed selection criteria discussed in Supplementary Material. Then, we observe that both kernel- and spline-based tests conclude the significant decreasing β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) from weeks 0 to 4 by rejecting H0:β1(t):subscript𝐻0subscript𝛽1𝑡H_{0}:\beta_{1}(t)italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) is monotone increasing on [0,4]04[0,4][ 0 , 4 ], but declare an increasing pattern observed from weeks 4 to 16 by rejecting H0:β1(t):subscript𝐻0subscript𝛽1𝑡H_{0}:\beta_{1}(t)italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) is monotone decreasing on [4,16]416[4,16][ 4 , 16 ] under the significant level 0.1. This conclusion implies the improving efficacy of botox B during the first 4 weeks, but diminishing effectiveness after then. This finding becomes more convincing through consistent conclusions regardless of the choice of shape constraints on null hypotheses, either monotone decreasing or increasing, or choice of estimation methods.

Refer to caption
Figure 5: Estimated coefficient functions from Cervical Dystonia data. Solid lines are constrained estimation functions with decreasing restriction over weeks 0 to 4 and increasing restriction over weeks 4 to 16 on treatment effect, and dotted lines are unconstrained spline estimation, along with the location of knots indicated by the symbol ‘×’

Figure 5 displays the estimated functional coefficients under spline estimates with the chosen optimal selection of knots in each panel. Following the inferential conclusion on treatment effect, β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) is estimated under the decreasing constraint over weeks from 0 to 4 and under the increasing constraint during weeks from 4 to 16. The unconstrained and constrained models are fitted using the same knots, but one extra knot at week 4 is added to fit β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) due to shape constraint changing at week 4. We observe the positive coefficients at week 0, presumably due to a slight bias from the random assignment in the trial, with the mean of response scores from placebo and treatment groups calculated as 43.5 and 46.7, respectively, at baseline (week 0). Although patients in the treatment group show slightly higher scores at the beginning, the decreasing trend in the first four weeks, i.e., improving efficacy, is clear from the estimated function. After 4 weeks, the degree of effectiveness weakens, and at week 16, there is no more treatment effect by returning to the status that we observed from week 0. While we find a precise fit for β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) under shape constraints, remainder regression coefficient estimates for age and sex, β2(t)subscript𝛽2𝑡\beta_{2}(t)italic_β start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_t ) and β3(t)subscript𝛽3𝑡\beta_{3}(t)italic_β start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_t ), show similar fits with or without constraints on β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ). In addition to the conclusion on overall treatment efficacy as in [2], we could make a comprehensive summary of treatment efficacy based on refined conclusions over sub-intervals of interest, and it further helps precise estimation.

4.2 Application 2: Mental Health Schizophrenia Collaborative Study

We next implement the proposed method on the data from the National Institute of Mental Health Schizophrenia Collaborative Study, where 437 patients were randomized to receive either a placebo or anti-psychotic drug, followed by longitudinal monitoring of individuals. [1] and [12] already analyzed this data to assess the efficacy of the drug using Item 79 ‘Severity of Illness,’ the Inpatient Multidimensional Psychiatric Scale (IMPS) ranging from 1 (normal, not ill at all) to 7 (among the most extremely ill), evaluated at weeks 0, 1, 3, 6 under the protocol along with additional measurements for some patients made at weeks 2, 4, 5. There were a total of 108 and 329 patients in the placebo and treatment groups, respectively, with an average of 89.8%percent89.889.8\%89.8 % among them collected during weeks of protocol and 2.62.62.62.6% collected during other weeks. The data is available in R (Package ‘mixor’). As in [1] and [12], we consider the regression model Yi(t)=β0(t)+β1(t)Di+ϵi(t)subscript𝑌𝑖𝑡subscript𝛽0𝑡subscript𝛽1𝑡subscript𝐷𝑖subscriptitalic-ϵ𝑖𝑡Y_{i}(t)=\beta_{0}(t)+\beta_{1}(t)D_{i}+\epsilon_{i}(t)italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) = italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) + italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ), t[0,6]𝑡06t\in[0,6]italic_t ∈ [ 0 , 6 ], where Yi(t)subscript𝑌𝑖𝑡Y_{i}(t)italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) denotes the illness severity measurement of i𝑖iitalic_ith subject at week t𝑡titalic_t, and β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) represents the effect of drug treatment over weeks with the indicator dummy variable Disubscript𝐷𝑖D_{i}italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT; Di=1subscript𝐷𝑖1D_{i}=1italic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 for individuals assigned in treatment group. As mentioned in the Introduction, although [1] and [12] demonstrated its improving effectiveness over the entire domain, from week 0 to 6, through conclusion on significant decreasing βi(t)subscript𝛽𝑖𝑡\beta_{i}(t)italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) using their inferential tools under shape-constrained estimation, respectively, we are furthermore interested in the efficacy dynamics in later weeks by applying sub-interval tests, similar to what we conducted in Section 4.1. This is motivated by Figure 4 of [1], displaying a clear decreasing trend on estimated β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) with narrow confidence over weeks 0 - 3, but flattening out phase afterward with widening confidence intervals. By applying sub-interval tests over weeks 0 to 3 and weeks 3 to 6, we examine patterns in efficacy beyond the overall drug effectiveness.

Table 3: p𝑝pitalic_p-values from Schizophrenia data under corresponding shape-constrained null hypotheses with given sub-intervals using kernel-based and spline-based estimation.
Hypothesis p𝑝pitalic_p-values
Kernel-based Spline-based
β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) is monotone decreasing on [0,3]03[0,3][ 0 , 3 ] 0.71 0.26
β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) is monotone increasing on [0,3]03[0,3][ 0 , 3 ] 0.05 0.00
β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) is monotone decreasing on [3,6]36[3,6][ 3 , 6 ] 0.65 0.15
β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) is monotone increasing on [3,6]36[3,6][ 3 , 6 ] 0.62 0.11

Table 3 provides p𝑝pitalic_p-values from H0:β1(t):subscript𝐻0subscript𝛽1𝑡H_{0}:\beta_{1}(t)italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) is monotone increasing (decreasing) over \mathcal{I}caligraphic_I with bootstrap size B=500𝐵500B=500italic_B = 500. The optimal bandwidth for kernel estimates is set as h1.8(=0.3×6)annotated1.8absent0.36h\approx 1.8(=0.3\times 6)italic_h ≈ 1.8 ( = 0.3 × 6 ), and for spline estimates, we adopt piece-wise I𝐼Iitalic_I-splines with two internal knots located at t=1𝑡1t=1italic_t = 1 and 4. Both kernel- and spline-based tests conclude the significant decreasing β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) during weeks 0 - 3, supported by p𝑝pitalic_p-values less than or equal to significance level α=0.05𝛼0.05\alpha=0.05italic_α = 0.05 from the null hypothesis of monotone increasing. Consistently, the null hypothesis for monotone decrasing over weeks 0 -3 is not rejected from both tests. However, sub-interval tests over weeks 3 - 6 conclude that there is not enough statistical evidence to reject both the monotone increasing and decreasing trends. Thus, we conclude the constant level of β1(t)subscript𝛽1𝑡\beta_{1}(t)italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) maintained over this period, implying the maximum drug efficacy achieved at week 3 and the duration of such degree of effectiveness until the end of the experiment. The estimated drug effect is illustrated in Figure 1 with the estimates with no constraints and with decreasing constraints over the entire domain. As discussed in the Section 1, the latter fit is the inferential finding from [1] and [12].

References

  • [1] M. Ahkim, G. I., and V. A. Shape testing in varying coefficient models. Test, 26:429–450, 2017.
  • [2] A. Brashear, M. Lew, D. Dykstra, C. Comella, S. Factor, R. Rodnitzky, R. Trosch, C. Singer, M. Brin, and J. Murray. Safety and efficacy of NeuroBloc (botulinum toxin type B) in type A–responsive cervical dystonia. Neurology, 53(7):1439–1439, 1999.
  • [3] A. C. Cameron, J. B. Gelbach, and D. L. Miller. Bootstrap-based improvements for inference with clustered errors. The review of economics and statistics, 90(3):414–427, 2008.
  • [4] Z. Chen, Q. Gao, B. Fu, and H. Zhu. Monotone nonparametric regression for functional/longitudinal data. Statistica Sinica, 29(4):2229–2249, 2019.
  • [5] D. Chetverikov, A. Santos, and A. M. Shaikh. The econometrics of shape restrictions. Annual Review of Economics, 10(1):31–63, 2018.
  • [6] J. A. Cuesta-Albertos, E. García-Portugués, M. Febrero-Bande, and W. González-Manteiga. Goodness-of-fit tests for the functional linear model based on randomly projected empirical processes. The Annals of Statistics, 47(1):439–467, 2019.
  • [7] R. Davidson and E. Flachaire. The wild bootstrap, tamed at last. Journal of Econometrics, 146(1):162–169, 2008.
  • [8] C. S. Davis. Statistical Methods for the Analysis of Repeated Measurements. Springer Texts in Statistics. New York, NY: Springer Nature, 1 edition, 2003.
  • [9] L. Dümbgen. Shape-constrained statistical inference. Annual Review of Statistics and Its Application, 11, 2024.
  • [10] Y. Fan and E. Guerre. Multivariate local polynomial estimators: Uniform boundary properties and asymptotic linear representation. In Essays in Honor of Aman Ullah, volume 36, pages 489–537. Emerald Group Publishing Limited, 2016.
  • [11] S. Friedrich, F. Konietschke, and M. Pauly. A wild bootstrap approach for nonparametric repeated measurements. Computational Statistics & Data Analysis, 113:38–52, 2017.
  • [12] R. Ghosal, S. Ghosh, J. Urbanek, J. A. Schrack, and V. Zipunnikov. Shape-constrained estimation in functional regression with bernstein polynomials. Computational Statistics & Data Analysis, 178:107614, 2023.
  • [13] W. Härdle and E. Mammen. Bootstrap methods in nonparametric regression. In Nonparametric functional estimation and related topics, pages 111–123. Springer, 1991.
  • [14] J. Z. Huang, C. . Wu, and L. Zhou. Polynomial spline estimation and inference for varying coefficient models with longitudinal data. Statistica Sinica, 14(3):763–788, 2004.
  • [15] R. J. Kuczmarski. 2000 CDC growth charts for the United States: Methods and development. Number 246. Department of Health and Human Services, Centers for Disease Control and Prevention, National Center for Health Statistics, 2002.
  • [16] E. Mammen. Bootstrap and wild bootstrap for high dimensional linear models. The annals of statistics, 21(1):255–285, 1993.
  • [17] E. Mammen, J. S. Marron, B. A. Turlach, and M. P. Wand. A General Projection Framework for Constrained Smoothing. Statistical Science, 16(3):232 – 248, 2001.
  • [18] M. C. Meyer. Inference using shape-restricted regression splines. The Annals of Applied Statistics, 2(3):1013–1033, 2008.
  • [19] M. C. Meyer. A simple new algorithm for quadratic programming with applications in statistics. Communications in Statistics - Simulation and Computation, 42(5):1126–1139, 2013.
  • [20] M. C. Meyer. A framework for estimation and inference in generalized additive models with shape and order restrictions. Statistical Science, 33(4):595–614, 2018.
  • [21] J. S. Morris. Functional regression. Annual Review of Statistics and Its Application, 2(2):321–359, 2015.
  • [22] Y. Park, K. Han, and D. G. Simpson. Testing linear operator constraints in functional response regression with incomplete response functions. Electronic Journal of Statistics, 17(2):3143–3180, 2023.
  • [23] J. O. Ramsay. Monotone regression splines in action. Statistical Science, 3(4):425–441, 1988.
  • [24] P. T. Reiss, J. Goldsmith, H. L. Shang, and R. T. Ogden. Methods for scalar-on-function regression. International Statistical Review, 85(2):228–249, 2017.
  • [25] R. J. Samworth and B. Sen. Special issue on “nonparametric inference under shape constraints”. Statistical science, 33(4):469–472, 2018.
  • [26] L. Schumaker. Spline Functions: Basic Theory. Cambridge Mathematical Library. Cambridge University Press, 3 edition, 2007.
  • [27] E. Seijo and B. Sen. Nonparametric least squares estimation of a multivariate convex regression function. The Annals of Statistics, 39(3):1633–1657, 2011.
  • [28] B. Sen and M. Meyer. Testing against a linear regression model using ideas from shape-restricted estimation. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 79(2):423–448, 2017.
  • [29] X. Shao. The dependent wild bootstrap. Journal of the American Statistical Association, 105(489):218–235, 2010.
  • [30] Q. Shen and J. Faraway. An F𝐹Fitalic_F test for linear models with functional responses. Statistica Sinica, 14:1239–1257, 2004.
  • [31] E. Slutsky. Sulla teoria del bilancio del consumatore. Giornale degli economisti e rivista di statistica, pages 1–26, 1915.
  • [32] J.-L. Wang, J.-M. Chiou, and H.-G. Müller. Functional data analysis. Annual Review of Statistics and Its Application, 3:257–295, 2016.
  • [33] C.-F. J. Wu. Jackknife, bootstrap and other resampling methods in regression analysis. The Annals of Statistics, 14(4):1261–1295, 1986.
  • [34] D. Yagi, Y. Chen, A. L. Johnson, and T. Kuosmanen. Shape-constrained kernel-weighted least squares: Estimating production functions for chilean manufacturing industries. Journal of Business & Economic Statistics, 38(1):43–54, 2020.
  • [35] J.-T. Zhang. Statistical inferences for linear models with functional responses. Statistica Sinica, 21:1431–1451, 2011.
  • [36] J.-T. Zhang and J. Chen. Statistical inferences for functional data. The Annals of Statistics, 35(3):1052–1079, 2007.

Supplementary materials:
Statistical inference on partially shape-constrained
function-on-scalar linear regression models

S.1 Technical conditions for kernel estimates

For the minimum and maximum measurement frequencies λn=min1inLisubscript𝜆𝑛subscript1𝑖𝑛subscript𝐿𝑖\lambda_{n}=\min_{1\leq i\leq n}L_{i}italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = roman_min start_POSTSUBSCRIPT 1 ≤ italic_i ≤ italic_n end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and Λn=max1inLisubscriptΛ𝑛subscript1𝑖𝑛subscript𝐿𝑖\Lambda_{n}=\max_{1\leq i\leq n}L_{i}roman_Λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = roman_max start_POSTSUBSCRIPT 1 ≤ italic_i ≤ italic_n end_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, let N=nλn𝑁𝑛subscript𝜆𝑛N=n\lambda_{n}italic_N = italic_n italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. We list the technical conditions for Proposition 2.2.1 and Theorem 2.2.2.

  1. (K0)𝐾0(K0)( italic_K 0 )

    π𝜋\piitalic_π of T𝑇Titalic_T is continuously differentiable and strictly positive.

  2. (K1)𝐾1(K1)( italic_K 1 )

    K𝐾Kitalic_K is a probability density function symmetric and Lipschitz continuous on [1,1]11[-1,1][ - 1 , 1 ].

  3. (K2)𝐾2(K2)( italic_K 2 )

    Eεk<𝐸superscriptsubscriptnorm𝜀𝑘E\|\varepsilon\|_{\infty}^{k}<\inftyitalic_E ∥ italic_ε ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT < ∞ for some k>2𝑘2k>2italic_k > 2.

  4. (K3)𝐾3(K3)( italic_K 3 )

    hNaasymptotically-equalssuperscript𝑁𝑎h\asymp N^{-a}italic_h ≍ italic_N start_POSTSUPERSCRIPT - italic_a end_POSTSUPERSCRIPT for some 0<a<12/k0𝑎12𝑘0<a<1-2/k0 < italic_a < 1 - 2 / italic_k.

  5. (K4)𝐾4(K4)( italic_K 4 )

    λn=O(1)subscript𝜆𝑛𝑂1\lambda_{n}=O(1)italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_O ( 1 ) and Λn=O(Nb/logN)k2subscriptΛ𝑛𝑂superscriptsuperscript𝑁𝑏𝑁𝑘2\Lambda_{n}=O\big{(}N^{b}/\log N\big{)}^{\frac{k}{2}}roman_Λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_O ( italic_N start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT / roman_log italic_N ) start_POSTSUPERSCRIPT divide start_ARG italic_k end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT for some 0<b<1(2/k)a0𝑏12𝑘𝑎0<b<1-(2/k)-a0 < italic_b < 1 - ( 2 / italic_k ) - italic_a.

(K1)𝐾1(K1)( italic_K 1 )(K3)𝐾3(K3)( italic_K 3 ) are standard conditions to analyze the large sample property of the kernel estimator with an exponential inequality. (K4)𝐾4(K4)( italic_K 4 ) is a technical condition that ensures the uniform convergence of the unconstrained kernel least square estimator even if the minimum number λnsubscript𝜆𝑛\lambda_{n}italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT of longitudinal observations for some subjects is bounded. Still, we require ΛnsubscriptΛ𝑛\Lambda_{n}roman_Λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT to increase so that the collection {Ti,:1in,=1,,Li}conditional-setsubscript𝑇𝑖formulae-sequence1𝑖𝑛1subscript𝐿𝑖\{T_{i,\ell}:1\leq i\leq n,\,\ell=1,\ldots,L_{i}\}{ italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT : 1 ≤ italic_i ≤ italic_n , roman_ℓ = 1 , … , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } of sampling points from all subjects densely covers the entire domain [0,1]01[0,1][ 0 , 1 ].

S.2 Technical conditions for spline estimates

  1. (S1)𝑆1(S1)( italic_S 1 )

    Assume βj(t)Cq[0,1]subscript𝛽𝑗𝑡superscript𝐶𝑞01\beta_{j}(t)\in C^{q}[0,1]italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) ∈ italic_C start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT [ 0 , 1 ]. For monotone βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, we use quadratic splines (q=3)𝑞3(q=3)( italic_q = 3 ) and for convex βjsubscript𝛽𝑗\beta_{j}italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, cubic splines are used (q=4)𝑞4(q=4)( italic_q = 4 ).

  2. (S2)𝑆2(S2)( italic_S 2 )

    There is a positive constant M1subscript𝑀1M_{1}italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT such that |Xj|M1subscript𝑋𝑗subscript𝑀1|X_{j}|\leq M_{1}| italic_X start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | ≤ italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, for j=1,,p.𝑗1𝑝j=1,\ldots,p.italic_j = 1 , … , italic_p .

  3. (S3)𝑆3(S3)( italic_S 3 )

    There is a constant M3subscript𝑀3M_{3}italic_M start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT such that E[ϵ(t)2]M3<𝐸delimited-[]italic-ϵsuperscript𝑡2subscript𝑀3E[\epsilon(t)^{2}]\leq M_{3}<\inftyitalic_E [ italic_ϵ ( italic_t ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] ≤ italic_M start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT < ∞ for t[0,1].𝑡01t\in[0,1].italic_t ∈ [ 0 , 1 ] .

  4. (S4)𝑆4(S4)( italic_S 4 )

    The knots 0=ξj,1<ξj,2<<ξj,Kjn=10subscript𝜉𝑗1subscript𝜉𝑗2subscript𝜉𝑗subscript𝐾𝑗𝑛10=\xi_{j,1}<\xi_{j,2}<\cdots<\xi_{j,K_{jn}}=10 = italic_ξ start_POSTSUBSCRIPT italic_j , 1 end_POSTSUBSCRIPT < italic_ξ start_POSTSUBSCRIPT italic_j , 2 end_POSTSUBSCRIPT < ⋯ < italic_ξ start_POSTSUBSCRIPT italic_j , italic_K start_POSTSUBSCRIPT italic_j italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT = 1 have bounded mesh ratio; that is, there is a constant M2(1,)subscript𝑀21M_{2}\in(1,\infty)italic_M start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∈ ( 1 , ∞ ) not depending on n𝑛nitalic_n or j𝑗jitalic_j such that Kjn1M21ξj,k+1ξj,kKjn1M2superscriptsubscript𝐾𝑗𝑛1superscriptsubscript𝑀21subscript𝜉𝑗𝑘1subscript𝜉𝑗𝑘superscriptsubscript𝐾𝑗𝑛1subscript𝑀2K_{jn}^{-1}M_{2}^{-1}\leq\xi_{j,k+1}-\xi_{j,k}\leq K_{jn}^{-1}M_{2}italic_K start_POSTSUBSCRIPT italic_j italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ≤ italic_ξ start_POSTSUBSCRIPT italic_j , italic_k + 1 end_POSTSUBSCRIPT - italic_ξ start_POSTSUBSCRIPT italic_j , italic_k end_POSTSUBSCRIPT ≤ italic_K start_POSTSUBSCRIPT italic_j italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, for k=2,Kjn𝑘2subscript𝐾𝑗𝑛k=2,\ldots K_{jn}italic_k = 2 , … italic_K start_POSTSUBSCRIPT italic_j italic_n end_POSTSUBSCRIPT.

  5. (S5)𝑆5(S5)( italic_S 5 )

    lim supn(maxjKjn/minjKjn)subscriptlimit-supremum𝑛subscript𝑗subscript𝐾𝑗𝑛subscript𝑗subscript𝐾𝑗𝑛\limsup_{n}(\max_{j}K_{jn}/\min_{j}K_{jn})\leq\inftylim sup start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( roman_max start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_j italic_n end_POSTSUBSCRIPT / roman_min start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_j italic_n end_POSTSUBSCRIPT ) ≤ ∞.

  6. (S6)𝑆6(S6)( italic_S 6 )

    inLi1nasymptotically-equalssuperscriptsubscript𝑖𝑛superscriptsubscript𝐿𝑖1𝑛\sum_{i}^{n}L_{i}^{-1}\asymp n∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ≍ italic_n.

  7. (S7)𝑆7(S7)( italic_S 7 )

    For the monotone assumption, we have βj(t)ε>0superscriptsubscript𝛽𝑗𝑡𝜀0\beta_{j}^{{}^{\prime}}(t)\geq\varepsilon>0italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ( italic_t ) ≥ italic_ε > 0 on [0,1]01[0,1][ 0 , 1 ], and for the convex assumption, βj′′(t)ε>0superscriptsubscript𝛽𝑗′′𝑡𝜀0\beta_{j}^{{}^{\prime\prime}}(t)\geq\varepsilon>0italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ( italic_t ) ≥ italic_ε > 0 on [0,1]01[0,1][ 0 , 1 ]. For the increasing and convex assumption, constrained hold strictly if βj′′ε>0superscriptsubscript𝛽𝑗′′𝜀0\beta_{j}^{{}^{\prime\prime}}\geq\varepsilon>0italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ≥ italic_ε > 0 on [0,1]01[0,1][ 0 , 1 ] and βj(t)ε>0superscriptsubscript𝛽𝑗𝑡𝜀0\beta_{j}^{{}^{\prime}}(t)\geq\varepsilon>0italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT end_POSTSUPERSCRIPT ( italic_t ) ≥ italic_ε > 0.

The condition (S4)𝑆4(S4)( italic_S 4 ) requires fairly distributed knots over [0,1] without bias in location, where equidistantly located knots satisfy this condition. Then similar rates of growth on the number of knots Kjnsubscript𝐾𝑗𝑛K_{jn}italic_K start_POSTSUBSCRIPT italic_j italic_n end_POSTSUBSCRIPT across j𝑗jitalic_j are conditioned through (S5)𝑆5(S5)( italic_S 5 ). Lastly, Condition (S6)𝑆6(S6)( italic_S 6 ) indicates the number of observations per subject increases slower than the rate of increase for sample size n𝑛nitalic_n. In practice, (S6)𝑆6(S6)( italic_S 6 ) is rather common with a bounded number of measurements per subject in clinical experiments instead of its divergence.

S.3 Implementation Details

Bandwidth selection for kernel estimation

To select bandwidths in a data-adaptive manner, we propose a K𝐾Kitalic_K-fold cross-validation (CV) procedure that minimizes empirical prediction errors for individual trajectories. Specifically, let 𝒫=k=1K𝒫k𝒫superscriptsubscript𝑘1𝐾subscript𝒫𝑘\mathcal{P}=\bigcup_{k=1}^{K}\mathcal{P}_{k}caligraphic_P = ⋃ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT be an arbitrary disjoint partition of the index set {1,,n}1𝑛\{1,\ldots,n\}{ 1 , … , italic_n }, where each partition set approximately has the same size. Also, let 𝜷~(t;h,𝒳n,(k))~𝜷𝑡subscript𝒳𝑛𝑘\widetilde{\boldsymbol{\beta}}\big{(}t;h,\mathcal{X}_{n,(-k)}\big{)}over~ start_ARG bold_italic_β end_ARG ( italic_t ; italic_h , caligraphic_X start_POSTSUBSCRIPT italic_n , ( - italic_k ) end_POSTSUBSCRIPT ) be the unconstrained estimates of 𝜷(t)𝜷𝑡\boldsymbol{\beta}(t)bold_italic_β ( italic_t ) defined with a bandwidth h>00h>0italic_h > 0 and the training sample 𝒳n,(k)={(𝐘i,𝐗i):i𝒫k}subscript𝒳𝑛𝑘conditional-setsubscript𝐘𝑖subscript𝐗𝑖𝑖subscript𝒫𝑘\mathcal{X}_{n,(-k)}=\{(\mathbf{Y}_{i},\mathbf{X}_{i}):i\not\in\mathcal{P}_{k}\}caligraphic_X start_POSTSUBSCRIPT italic_n , ( - italic_k ) end_POSTSUBSCRIPT = { ( bold_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) : italic_i ∉ caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } in (2.4). We choose

h~=argminhk=1Ki𝒫k1Li=1Li(Yi,𝐗i𝜷~(Ti,;h,𝒳n,(k)))2~subscriptargminsuperscriptsubscript𝑘1𝐾subscript𝑖subscript𝒫𝑘1subscript𝐿𝑖superscriptsubscript1subscript𝐿𝑖superscriptsubscript𝑌𝑖superscriptsubscript𝐗𝑖top~𝜷subscript𝑇𝑖subscript𝒳𝑛𝑘2\displaystyle\widetilde{h}=\operatorname*{arg\,min}_{h\in\mathcal{H}}\sum_{k=1% }^{K}\sum_{i\in\mathcal{P}_{k}}\frac{1}{L_{i}}\sum_{\ell=1}^{L_{i}}\Big{(}Y_{i% ,\ell}-\mathbf{X}_{i}^{\top}\widetilde{\boldsymbol{\beta}}\big{(}T_{i,\ell};h,% \mathcal{X}_{n,(-k)}\big{)}\Big{)}^{2}over~ start_ARG italic_h end_ARG = start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT italic_h ∈ caligraphic_H end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_P start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_Y start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over~ start_ARG bold_italic_β end_ARG ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ; italic_h , caligraphic_X start_POSTSUBSCRIPT italic_n , ( - italic_k ) end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (S.3.1)

and use 𝜷~(t)=𝜷~(t;h~,𝒳n)~𝜷𝑡~𝜷𝑡~subscript𝒳𝑛\widetilde{\boldsymbol{\beta}}(t)=\widetilde{\boldsymbol{\beta}}\big{(}t;% \widetilde{h},\mathcal{X}_{n}\big{)}over~ start_ARG bold_italic_β end_ARG ( italic_t ) = over~ start_ARG bold_italic_β end_ARG ( italic_t ; over~ start_ARG italic_h end_ARG , caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) as the final estimates, where \mathcal{H}caligraphic_H is a grid of the domain. We also similarly implement the constrained estimates 𝜷^(t)=𝜷^(t;h^,𝒳n)^𝜷𝑡^𝜷𝑡^subscript𝒳𝑛\widehat{\boldsymbol{\beta}}(t)=\widehat{\boldsymbol{\beta}}\big{(}t;\widehat{% h},\mathcal{X}_{n}\big{)}over^ start_ARG bold_italic_β end_ARG ( italic_t ) = over^ start_ARG bold_italic_β end_ARG ( italic_t ; over^ start_ARG italic_h end_ARG , caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ). In our numerical studies, we used the 5555-fold CV procedure with \mathcal{H}caligraphic_H that consists of equally-spaced ten points.

Knot selection for regression spline functions

In spline regression, the number and location of knots can significantly impact the fit. Thus, in simulation studies and real data examples, we try a K𝐾Kitalic_K-fold CV error to choose the optimal number as well as the location of internal knots for unconstrained estimates, where the practical partition of samples and evaluation details are discussed in the above paragraph for bandwidth selection in kernel least squares. Although the equally spaced knots are preferred in some literature, considering the right-skewed or unbalanced evaluation points Ti,subscript𝑇𝑖T_{i,\ell}italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT in our numerical examples, we further consider a set of different knot sequences spanning the entire domain under the given candidate number of internal knots, which are set as 1,2, or 3, in our study. We assign the same optimal knot sequence for all β~j(t)subscript~𝛽𝑗𝑡\tilde{\beta}_{j}(t)over~ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ), although it can vary over j𝑗jitalic_j. Once the optimal knot sequence for unconstrained fit is determined, we add lower and upper boundaries of sub-interval jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT as additional internal knots for the estimation of partially shape-constrained βj(t)subscript𝛽𝑗𝑡\beta_{j}(t)italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ), for jJ𝑗𝐽j\in Jitalic_j ∈ italic_J, to present the desired shape explicitly over jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. For simulation experiments, we perform the 5-fold CV-based knot selection at each iteration.

S.4 Proofs of Proposition 2.2.1 and Theorem 2.2.2

Lemma S.4.1.

For r{0,1,2}𝑟012r\in\{0,1,2\}italic_r ∈ { 0 , 1 , 2 }, let

Hr(t)=1ni=1n1Li=1LiKh(Ti,t)(Ti,th)rηi(Ti,)(t[0,1]),subscript𝐻𝑟𝑡1𝑛superscriptsubscript𝑖1𝑛1subscript𝐿𝑖superscriptsubscript1subscript𝐿𝑖subscript𝐾subscript𝑇𝑖𝑡superscriptsubscript𝑇𝑖𝑡𝑟subscript𝜂𝑖subscript𝑇𝑖𝑡01\displaystyle H_{r}(t)=\frac{1}{n}\sum_{i=1}^{n}\frac{1}{L_{i}}\sum_{\ell=1}^{% L_{i}}K_{h}(T_{i,\ell}-t)\Big{(}\frac{T_{i,\ell}-t}{h}\Big{)}^{r}\eta_{i}(T_{i% ,\ell})\quad(t\in[0,1]),italic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) = divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t ) ( divide start_ARG italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t end_ARG start_ARG italic_h end_ARG ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT italic_η start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ) ( italic_t ∈ [ 0 , 1 ] ) , (S.4.1)

where η1,,ηnsubscript𝜂1subscript𝜂𝑛\eta_{1},\ldots,\eta_{n}italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_η start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT are random copies of a stochastic process η𝜂\etaitalic_η on [0,1]01[0,1][ 0 , 1 ]. Assume the following conditions hold.

  1. (L0)𝐿0(L0)( italic_L 0 )

    π𝜋\piitalic_π is continuously differentiable and strictly positive,

  2. (L1)𝐿1(L1)( italic_L 1 )

    K𝐾Kitalic_K is Lipschitz continuous on [0,1]01[0,1][ 0 , 1 ], i.e., |K(s)K(t)|Lip(K)|st|𝐾𝑠𝐾𝑡Lip𝐾𝑠𝑡|K(s)-K(t)|\leq\mathrm{Lip}(K)|s-t|| italic_K ( italic_s ) - italic_K ( italic_t ) | ≤ roman_Lip ( italic_K ) | italic_s - italic_t | for every s,t,[0,1]s,t,\in[0,1]italic_s , italic_t , ∈ [ 0 , 1 ], where Lip(K)Lip𝐾\mathrm{Lip}(K)roman_Lip ( italic_K ) denotes the Lipschitz constant of K𝐾Kitalic_K,

  3. (L2)𝐿2(L2)( italic_L 2 )

    Eηk<𝐸superscriptsubscriptnorm𝜂𝑘E\|\eta\|_{\infty}^{k}<\inftyitalic_E ∥ italic_η ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT < ∞ for some k>2𝑘2k>2italic_k > 2,

  4. (L3)𝐿3(L3)( italic_L 3 )

    hNaasymptotically-equalssuperscript𝑁𝑎h\asymp N^{-a}italic_h ≍ italic_N start_POSTSUPERSCRIPT - italic_a end_POSTSUPERSCRIPT for some 0<a<12/k0𝑎12𝑘0<a<1-2/k0 < italic_a < 1 - 2 / italic_k,

  5. (L4)𝐿4(L4)( italic_L 4 )

    λn=O(1)subscript𝜆𝑛𝑂1\lambda_{n}=O(1)italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_O ( 1 ) and Λn=O(Nb/logN)k/2subscriptΛ𝑛𝑂superscriptsuperscript𝑁𝑏𝑁𝑘2\Lambda_{n}=O\big{(}N^{b}/\log N\big{)}^{k/2}roman_Λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_O ( italic_N start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT / roman_log italic_N ) start_POSTSUPERSCRIPT italic_k / 2 end_POSTSUPERSCRIPT for some 0<b<1(2/k)a0𝑏12𝑘𝑎0<b<1-(2/k)-a0 < italic_b < 1 - ( 2 / italic_k ) - italic_a.

Then, letting N=nλn𝑁𝑛subscript𝜆𝑛N=n\lambda_{n}italic_N = italic_n italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, we have

supt[0,1]hr|Hr(t)E(Hr(t))|=OP(logNNh).subscriptsupremum𝑡01superscript𝑟subscript𝐻𝑟𝑡𝐸subscript𝐻𝑟𝑡subscript𝑂𝑃𝑁𝑁\displaystyle\sup_{t\in[0,1]}h^{r}\big{|}H_{r}(t)-E\big{(}H_{r}(t)\big{)}\big{% |}=O_{P}\bigg{(}\sqrt{\frac{\log N}{Nh}}\bigg{)}.roman_sup start_POSTSUBSCRIPT italic_t ∈ [ 0 , 1 ] end_POSTSUBSCRIPT italic_h start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT | italic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) - italic_E ( italic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) ) | = italic_O start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( square-root start_ARG divide start_ARG roman_log italic_N end_ARG start_ARG italic_N italic_h end_ARG end_ARG ) . (S.4.2)
Proof of Lemma S.4.1.

Define H~r(t)subscript~𝐻𝑟𝑡\widetilde{H}_{r}(t)over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) similarly as Hr(t)subscript𝐻𝑟𝑡H_{r}(t)italic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) by substituting η~i(Ti,)=ηi(Ti,)𝕀(|ηi(Ti,)|δn1)subscript~𝜂𝑖subscript𝑇𝑖subscript𝜂𝑖subscript𝑇𝑖𝕀subscript𝜂𝑖subscript𝑇𝑖superscriptsubscript𝛿𝑛1\tilde{\eta}_{i}(T_{i,\ell})=\eta_{i}(T_{i,\ell})\mathbb{I}\big{(}|\eta_{i}(T_% {i,\ell})|\leq\delta_{n}^{-1}\big{)}over~ start_ARG italic_η end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ) = italic_η start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ) blackboard_I ( | italic_η start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ) | ≤ italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) for ηi(Ti,)subscript𝜂𝑖subscript𝑇𝑖\eta_{i}(T_{i,\ell})italic_η start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ) in (S.4.1), where δn=N1/2h1/2logNsubscript𝛿𝑛superscript𝑁12superscript12𝑁\delta_{n}=N^{-1/2}h^{-1/2}\sqrt{\log N}italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_N start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT italic_h start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT square-root start_ARG roman_log italic_N end_ARG. It follows that

Hr(t)E(Hr(t))={H~r(t)E(H~r(t))}+1ni=1n1Li=1LiKh(Ti,t)(Ti,th)rηi(Ti,)𝕀(|ηi(t)|>δn1)E[Kh(Tt)(Tth)rη(T)𝕀(|η(T)|>δn1)].subscript𝐻𝑟𝑡𝐸subscript𝐻𝑟𝑡subscript~𝐻𝑟𝑡𝐸subscript~𝐻𝑟𝑡1𝑛superscriptsubscript𝑖1𝑛1subscript𝐿𝑖superscriptsubscript1subscript𝐿𝑖subscript𝐾subscript𝑇𝑖𝑡superscriptsubscript𝑇𝑖𝑡𝑟subscript𝜂𝑖subscript𝑇𝑖𝕀subscript𝜂𝑖𝑡superscriptsubscript𝛿𝑛1𝐸delimited-[]subscript𝐾𝑇𝑡superscript𝑇𝑡𝑟𝜂𝑇𝕀𝜂𝑇superscriptsubscript𝛿𝑛1\displaystyle\begin{split}H_{r}(t)-E\big{(}H_{r}(t)\big{)}&=\Big{\{}\widetilde% {H}_{r}(t)-E\big{(}\widetilde{H}_{r}(t)\big{)}\Big{\}}\\ &\quad+\frac{1}{n}\sum_{i=1}^{n}\frac{1}{L_{i}}\sum_{\ell=1}^{L_{i}}K_{h}(T_{i% ,\ell}-t)\Big{(}\frac{T_{i,\ell}-t}{h}\Big{)}^{r}\eta_{i}(T_{i,\ell})\mathbb{I% }\big{(}|\eta_{i}(t)|>\delta_{n}^{-1}\big{)}\\ &\quad-E\Big{[}K_{h}(T-t)\Big{(}\frac{T-t}{h}\Big{)}^{r}\eta(T)\mathbb{I}\big{% (}|\eta(T)|>\delta_{n}^{-1}\big{)}\Big{]}.\end{split}start_ROW start_CELL italic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) - italic_E ( italic_H start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) ) end_CELL start_CELL = { over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) - italic_E ( over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) ) } end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t ) ( divide start_ARG italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t end_ARG start_ARG italic_h end_ARG ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT italic_η start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ) blackboard_I ( | italic_η start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_t ) | > italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL - italic_E [ italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T - italic_t ) ( divide start_ARG italic_T - italic_t end_ARG start_ARG italic_h end_ARG ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT italic_η ( italic_T ) blackboard_I ( | italic_η ( italic_T ) | > italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) ] . end_CELL end_ROW (S.4.3)

First, we claim that the last two terms on the right side are negligible up to the order of δnsubscript𝛿𝑛\delta_{n}italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT so that it suffices to show the uniform convergence of H~r(t)E(H~r(t))subscript~𝐻𝑟𝑡𝐸subscript~𝐻𝑟𝑡\widetilde{H}_{r}(t)-E\big{(}\widetilde{H}_{r}(t)\big{)}over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) - italic_E ( over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) ). To claim the approximation, define a sequence of event sets as

n=(|ηi(Ti,)|δn1 for all 1in and 1Li)subscript𝑛subscript𝜂𝑖subscript𝑇𝑖superscriptsubscript𝛿𝑛1 for all 1in and 1Li\displaystyle\mathcal{E}_{n}=\big{(}|\eta_{i}(T_{i,\ell})|\leq\delta_{n}^{-1}% \textrm{ for all $1\leq i\leq n$ and $1\leq\ell\leq L_{i}$}\big{)}caligraphic_E start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = ( | italic_η start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ) | ≤ italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT for all 1 ≤ italic_i ≤ italic_n and 1 ≤ roman_ℓ ≤ italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) (S.4.4)

for n=1,2,𝑛12n=1,2,\ldotsitalic_n = 1 , 2 , …. Recalling N=nλn𝑁𝑛subscript𝜆𝑛N=n\lambda_{n}italic_N = italic_n italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, we obtain from Markov’s inequality that

P(n)1i=1n=1LiP(|ηi(Ti,)|>δn1)1nΛnδnkEηk1λn1(Nb{1(2/k)a}logN)k/2Eηk=1o(1)𝑃subscript𝑛1superscriptsubscript𝑖1𝑛superscriptsubscript1subscript𝐿𝑖𝑃subscript𝜂𝑖subscript𝑇𝑖superscriptsubscript𝛿𝑛11𝑛subscriptΛ𝑛superscriptsubscript𝛿𝑛𝑘𝐸superscriptsubscriptdelimited-∥∥𝜂𝑘1superscriptsubscript𝜆𝑛1superscriptsuperscript𝑁𝑏12𝑘𝑎𝑁𝑘2𝐸superscriptsubscriptdelimited-∥∥𝜂𝑘1𝑜1\displaystyle\begin{split}P(\mathcal{E}_{n})&\geq 1-\sum_{i=1}^{n}\sum_{\ell=1% }^{L_{i}}P\big{(}|\eta_{i}(T_{i,\ell})|>\delta_{n}^{-1}\big{)}\\ &\geq 1-n\Lambda_{n}\delta_{n}^{k}E\|\eta\|_{\infty}^{k}\\ &\geq 1-\lambda_{n}^{-1}\Big{(}N^{b-\{1-(2/k)-a\}}\log N\Big{)}^{k/2}E\|\eta\|% _{\infty}^{k}\\ &=1-o(1)\end{split}start_ROW start_CELL italic_P ( caligraphic_E start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_CELL start_CELL ≥ 1 - ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_P ( | italic_η start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ) | > italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ≥ 1 - italic_n roman_Λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_E ∥ italic_η ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ≥ 1 - italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_N start_POSTSUPERSCRIPT italic_b - { 1 - ( 2 / italic_k ) - italic_a } end_POSTSUPERSCRIPT roman_log italic_N ) start_POSTSUPERSCRIPT italic_k / 2 end_POSTSUPERSCRIPT italic_E ∥ italic_η ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = 1 - italic_o ( 1 ) end_CELL end_ROW (S.4.5)

where λn11superscriptsubscript𝜆𝑛11\lambda_{n}^{-1}\leq 1italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ≤ 1. This implies that the second term on the right side of (S.4.3) is zero with probability tending to 1. Similarly, we also get

|E(Kh(Tt)(Tth)rη(T)𝕀(|η(T)|>δn1))|h1νr(t;h)E[supt|η(t)|𝕀(|η(t)|>δn1)]h1νr(t;h)Eηkδnk1δn(Nk22hk221(logN)k22)=δnO(Nk2{1(2/k)a}(logN)k22)=o(δn),𝐸subscript𝐾𝑇𝑡superscript𝑇𝑡𝑟𝜂𝑇𝕀𝜂𝑇superscriptsubscript𝛿𝑛1superscript1subscript𝜈𝑟𝑡𝐸delimited-[]subscriptsupremum𝑡𝜂𝑡𝕀𝜂𝑡superscriptsubscript𝛿𝑛1superscript1subscript𝜈𝑟𝑡𝐸superscriptsubscriptdelimited-∥∥𝜂𝑘superscriptsubscript𝛿𝑛𝑘1less-than-or-similar-tosubscript𝛿𝑛superscript𝑁𝑘22superscript𝑘221superscript𝑁𝑘22subscript𝛿𝑛𝑂superscript𝑁𝑘212𝑘𝑎superscript𝑁𝑘22𝑜subscript𝛿𝑛\displaystyle\begin{split}&\bigg{|}E\Big{(}K_{h}(T-t)\Big{(}\frac{T-t}{h}\Big{% )}^{r}\eta(T)\mathbb{I}\big{(}|\eta(T)|>\delta_{n}^{-1}\big{)}\Big{)}\bigg{|}% \\ &\quad\leq h^{-1}\nu_{r}(t;h)E\big{[}\sup_{t}|\eta(t)|\,\mathbb{I}\big{(}|\eta% (t)|>\delta_{n}^{-1}\big{)}\big{]}\\ &\quad\leq h^{-1}\nu_{r}(t;h)E\|\eta\|_{\infty}^{k}\delta_{n}^{k-1}\\ &\quad\lesssim\delta_{n}\Big{(}N^{-\frac{k-2}{2}}h^{-\frac{k-2}{2}-1}(\log N)^% {\frac{k-2}{2}}\Big{)}\\ &\quad=\delta_{n}O\Big{(}N^{-\frac{k}{2}\{1-(2/k)-a\}}(\log N)^{\frac{k-2}{2}}% \Big{)}\\ &\quad=o(\delta_{n}),\end{split}start_ROW start_CELL end_CELL start_CELL | italic_E ( italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T - italic_t ) ( divide start_ARG italic_T - italic_t end_ARG start_ARG italic_h end_ARG ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT italic_η ( italic_T ) blackboard_I ( | italic_η ( italic_T ) | > italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) ) | end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ≤ italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_ν start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ; italic_h ) italic_E [ roman_sup start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT | italic_η ( italic_t ) | blackboard_I ( | italic_η ( italic_t ) | > italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) ] end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ≤ italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_ν start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ; italic_h ) italic_E ∥ italic_η ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ≲ italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_N start_POSTSUPERSCRIPT - divide start_ARG italic_k - 2 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_h start_POSTSUPERSCRIPT - divide start_ARG italic_k - 2 end_ARG start_ARG 2 end_ARG - 1 end_POSTSUPERSCRIPT ( roman_log italic_N ) start_POSTSUPERSCRIPT divide start_ARG italic_k - 2 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_O ( italic_N start_POSTSUPERSCRIPT - divide start_ARG italic_k end_ARG start_ARG 2 end_ARG { 1 - ( 2 / italic_k ) - italic_a } end_POSTSUPERSCRIPT ( roman_log italic_N ) start_POSTSUPERSCRIPT divide start_ARG italic_k - 2 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_o ( italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) , end_CELL end_ROW (S.4.6)

where νr(t;h)=01(uth)rKh(ut)π(u)dusubscript𝜈𝑟𝑡superscriptsubscript01superscript𝑢𝑡𝑟subscript𝐾𝑢𝑡𝜋𝑢differential-d𝑢\nu_{r}(t;h)=\int_{0}^{1}\big{(}\frac{u-t}{h}\big{)}^{r}K_{h}(u-t)\pi(u)\,% \mathrm{d}uitalic_ν start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ; italic_h ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( divide start_ARG italic_u - italic_t end_ARG start_ARG italic_h end_ARG ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_u - italic_t ) italic_π ( italic_u ) roman_d italic_u.

Now, we prove the uniform convergence of H~r(t)E(H~r(t))subscript~𝐻𝑟𝑡𝐸subscript~𝐻𝑟𝑡\widetilde{H}_{r}(t)-E\big{(}\widetilde{H}_{r}(t)\big{)}over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) - italic_E ( over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) ). For τ>0𝜏0\tau>0italic_τ > 0, let 𝒯Nsubscript𝒯𝑁\mathcal{T}_{N}caligraphic_T start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT be the (Nτ)superscript𝑁𝜏(N^{-\tau})( italic_N start_POSTSUPERSCRIPT - italic_τ end_POSTSUPERSCRIPT )-net over [0,1]01[0,1][ 0 , 1 ] with the cardinality |𝒯N|=O(Nτ)subscript𝒯𝑁𝑂superscript𝑁𝜏|\mathcal{T}_{N}|=O(N^{\tau})| caligraphic_T start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT | = italic_O ( italic_N start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT ) such that, for any s[0,1]𝑠01s\in[0,1]italic_s ∈ [ 0 , 1 ], there exists t𝒯N𝑡subscript𝒯𝑁t\in\mathcal{T}_{N}italic_t ∈ caligraphic_T start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT such that |st|Nτ𝑠𝑡superscript𝑁𝜏|s-t|\leq N^{-\tau}| italic_s - italic_t | ≤ italic_N start_POSTSUPERSCRIPT - italic_τ end_POSTSUPERSCRIPT. Then, we have

supt[0,1]|H~r(t)E(H~r(t))|supt𝒯n|H~r(t)E(H~r(t))|+sup|st|nτ|(H~r(s)H~r(t))E(H~r(s)H~r(t))|.subscriptsupremum𝑡01subscript~𝐻𝑟𝑡𝐸subscript~𝐻𝑟𝑡subscriptsupremum𝑡subscript𝒯𝑛subscript~𝐻𝑟𝑡𝐸subscript~𝐻𝑟𝑡subscriptsupremum𝑠𝑡superscript𝑛𝜏subscript~𝐻𝑟𝑠subscript~𝐻𝑟𝑡𝐸subscript~𝐻𝑟𝑠subscript~𝐻𝑟𝑡\displaystyle\begin{split}&\sup_{t\in[0,1]}\big{|}\widetilde{H}_{r}(t)-E\big{(% }\widetilde{H}_{r}(t)\big{)}\big{|}\\ &\leq\sup_{t\in\mathcal{T}_{n}}\big{|}\widetilde{H}_{r}(t)-E\big{(}\widetilde{% H}_{r}(t)\big{)}\big{|}\,\,\,+\sup_{|s-t|\leq n^{-\tau}}\Big{|}\big{(}% \widetilde{H}_{r}(s)-\widetilde{H}_{r}(t)\big{)}-E\big{(}\widetilde{H}_{r}(s)-% \widetilde{H}_{r}(t)\big{)}\Big{|}.\end{split}start_ROW start_CELL end_CELL start_CELL roman_sup start_POSTSUBSCRIPT italic_t ∈ [ 0 , 1 ] end_POSTSUBSCRIPT | over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) - italic_E ( over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) ) | end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ≤ roman_sup start_POSTSUBSCRIPT italic_t ∈ caligraphic_T start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT | over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) - italic_E ( over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) ) | + roman_sup start_POSTSUBSCRIPT | italic_s - italic_t | ≤ italic_n start_POSTSUPERSCRIPT - italic_τ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | ( over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_s ) - over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) ) - italic_E ( over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_s ) - over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) ) | . end_CELL end_ROW (S.4.7)

The second term on the right side is almost surely negligible up to the order of δnsubscript𝛿𝑛\delta_{n}italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. Specifically, for any |st|Nτ𝑠𝑡superscript𝑁𝜏|s-t|\leq N^{-\tau}| italic_s - italic_t | ≤ italic_N start_POSTSUPERSCRIPT - italic_τ end_POSTSUPERSCRIPT, it follows from the Lipschitz continuity of K𝐾Kitalic_K that

|H~r(s)H~r(t)|1ni=1n1Li=1Li|η~i(Ti,)||Kh(Ti,s)(Ti,sh)rKh(Ti,t)(Ti,th)r|δn1h1{Lip(K)hr+rK}|st|hδn(N1τh1r/logN)=δnO(N1+a(1+r)τ/logN)=o(δn),subscript~𝐻𝑟𝑠subscript~𝐻𝑟𝑡1𝑛superscriptsubscript𝑖1𝑛1subscript𝐿𝑖superscriptsubscript1subscript𝐿𝑖subscript~𝜂𝑖subscript𝑇𝑖subscript𝐾subscript𝑇𝑖𝑠superscriptsubscript𝑇𝑖𝑠𝑟subscript𝐾subscript𝑇𝑖𝑡superscriptsubscript𝑇𝑖𝑡𝑟superscriptsubscript𝛿𝑛1superscript1conditional-setLip𝐾superscript𝑟𝑟evaluated-at𝐾𝑠𝑡less-than-or-similar-tosubscript𝛿𝑛superscript𝑁1𝜏superscript1𝑟𝑁subscript𝛿𝑛𝑂superscript𝑁1𝑎1𝑟𝜏𝑁𝑜subscript𝛿𝑛\displaystyle\begin{split}&\big{|}\widetilde{H}_{r}(s)-\widetilde{H}_{r}(t)% \big{|}\\ &\leq\frac{1}{n}\sum_{i=1}^{n}\frac{1}{L_{i}}\sum_{\ell=1}^{L_{i}}|\tilde{\eta% }_{i}(T_{i,\ell})|\,\Big{|}K_{h}(T_{i,\ell}-s)\Big{(}\frac{T_{i,\ell}-s}{h}% \Big{)}^{r}-K_{h}(T_{i,\ell}-t)\Big{(}\frac{T_{i,\ell}-t}{h}\Big{)}^{r}\Big{|}% \\ &\leq\delta_{n}^{-1}h^{-1}\big{\{}\mathrm{Lip}(K)h^{-r}+r\|K\|_{\infty}\big{\}% }\frac{|s-t|}{h}\\ &\lesssim\delta_{n}\big{(}N^{1-\tau}h^{-1-r}/\log N\big{)}\\ &=\delta_{n}O\big{(}N^{1+a(1+r)-\tau}/\log N\big{)}\\ &=o(\delta_{n}),\end{split}start_ROW start_CELL end_CELL start_CELL | over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_s ) - over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) | end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ≤ divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | over~ start_ARG italic_η end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ) | | italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_s ) ( divide start_ARG italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_s end_ARG start_ARG italic_h end_ARG ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT - italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t ) ( divide start_ARG italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t end_ARG start_ARG italic_h end_ARG ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT | end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ≤ italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT { roman_Lip ( italic_K ) italic_h start_POSTSUPERSCRIPT - italic_r end_POSTSUPERSCRIPT + italic_r ∥ italic_K ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT } divide start_ARG | italic_s - italic_t | end_ARG start_ARG italic_h end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ≲ italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_N start_POSTSUPERSCRIPT 1 - italic_τ end_POSTSUPERSCRIPT italic_h start_POSTSUPERSCRIPT - 1 - italic_r end_POSTSUPERSCRIPT / roman_log italic_N ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_O ( italic_N start_POSTSUPERSCRIPT 1 + italic_a ( 1 + italic_r ) - italic_τ end_POSTSUPERSCRIPT / roman_log italic_N ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_o ( italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) , end_CELL end_ROW (S.4.8)

for a sufficiently large τ>1+a(1+r)𝜏1𝑎1𝑟\tau>1+a(1+r)italic_τ > 1 + italic_a ( 1 + italic_r ). To investigate the stochastic magnitude of supt𝒯nhr|H~r(t)E(H~r(t))|subscriptsupremum𝑡subscript𝒯𝑛superscript𝑟subscript~𝐻𝑟𝑡𝐸subscript~𝐻𝑟𝑡\sup_{t\in\mathcal{T}_{n}}h^{r}\big{|}\widetilde{H}_{r}(t)-E\big{(}\widetilde{% H}_{r}(t)\big{)}\big{|}roman_sup start_POSTSUBSCRIPT italic_t ∈ caligraphic_T start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_h start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT | over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) - italic_E ( over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) ) |, we write

H~r(t)E(H~r(t))=1ni=1n{ζi,r(t)E(ζi,r(t))},subscript~𝐻𝑟𝑡𝐸subscript~𝐻𝑟𝑡1𝑛superscriptsubscript𝑖1𝑛subscript𝜁𝑖𝑟𝑡𝐸subscript𝜁𝑖𝑟𝑡\displaystyle\widetilde{H}_{r}(t)-E\big{(}\widetilde{H}_{r}(t)\big{)}=\frac{1}% {n}\sum_{i=1}^{n}\big{\{}\zeta_{i,r}(t)-E\big{(}\zeta_{i,r}(t)\big{)}\big{\}},over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) - italic_E ( over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) ) = divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT { italic_ζ start_POSTSUBSCRIPT italic_i , italic_r end_POSTSUBSCRIPT ( italic_t ) - italic_E ( italic_ζ start_POSTSUBSCRIPT italic_i , italic_r end_POSTSUBSCRIPT ( italic_t ) ) } , (S.4.9)

where ζi,r(t)=1Li=1LiKh(Ti,t)(Ti,th)rη~i(Ti,)subscript𝜁𝑖𝑟𝑡1subscript𝐿𝑖superscriptsubscript1subscript𝐿𝑖subscript𝐾subscript𝑇𝑖𝑡superscriptsubscript𝑇𝑖𝑡𝑟subscript~𝜂𝑖subscript𝑇𝑖\zeta_{i,r}(t)=\frac{1}{L_{i}}\sum_{\ell=1}^{L_{i}}K_{h}(T_{i,\ell}-t)\big{(}% \frac{T_{i,\ell}-t}{h}\big{)}^{r}\tilde{\eta}_{i}(T_{i,\ell})italic_ζ start_POSTSUBSCRIPT italic_i , italic_r end_POSTSUBSCRIPT ( italic_t ) = divide start_ARG 1 end_ARG start_ARG italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t ) ( divide start_ARG italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t end_ARG start_ARG italic_h end_ARG ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT over~ start_ARG italic_η end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ). Since ζi,rsubscriptnormsubscript𝜁𝑖𝑟\|\zeta_{i,r}\|_{\infty}∥ italic_ζ start_POSTSUBSCRIPT italic_i , italic_r end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT is almost surely bounded for all i=1,,n𝑖1𝑛i=1,\ldots,nitalic_i = 1 , … , italic_n, it follows from Bernstein’s inequality that

P(supt𝒯Nhr|H~r(t)E(H~r(t))|Cδn)t𝒯NP(|i=1nhr{ζi,r(t)E(ζi,r(t))}|Cnδn)t𝒯N2exp((Cnδn)2/2h2ri=1nVar(ζi,r(t))+2hrζi,r(Cnδn)/3)2|𝒯N|exp((C2/2)(λn1nh1logN)01u2rK2(u)duEη2π(nh1+2r)+(C/3)K(nh1))Nτexp(λn1g(C)logN)=O(Nτλn1g(C)),𝑃subscriptsupremum𝑡subscript𝒯𝑁superscript𝑟subscript~𝐻𝑟𝑡𝐸subscript~𝐻𝑟𝑡𝐶subscript𝛿𝑛subscript𝑡subscript𝒯𝑁𝑃superscriptsubscript𝑖1𝑛superscript𝑟subscript𝜁𝑖𝑟𝑡𝐸subscript𝜁𝑖𝑟𝑡𝐶𝑛subscript𝛿𝑛subscript𝑡subscript𝒯𝑁2superscript𝐶𝑛subscript𝛿𝑛22superscript2𝑟superscriptsubscript𝑖1𝑛Varsubscript𝜁𝑖𝑟𝑡2superscript𝑟subscriptnormsubscript𝜁𝑖𝑟𝐶𝑛subscript𝛿𝑛32subscript𝒯𝑁superscript𝐶22superscriptsubscript𝜆𝑛1𝑛superscript1𝑁superscriptsubscript01superscript𝑢2𝑟superscript𝐾2𝑢differential-d𝑢𝐸superscriptsubscriptnorm𝜂2subscriptnorm𝜋𝑛superscript12𝑟𝐶3subscriptnorm𝐾𝑛superscript1less-than-or-similar-tosuperscript𝑁𝜏superscriptsubscript𝜆𝑛1𝑔𝐶𝑁𝑂superscript𝑁𝜏superscriptsubscript𝜆𝑛1𝑔𝐶\displaystyle\begin{split}&P\bigg{(}\sup_{t\in\mathcal{T}_{N}}h^{r}\big{|}% \widetilde{H}_{r}(t)-E\big{(}\widetilde{H}_{r}(t)\big{)}\big{|}\geq C\delta_{n% }\bigg{)}\\ &\leq\sum_{t\in\mathcal{T}_{N}}P\bigg{(}\Big{|}\sum_{i=1}^{n}h^{r}\big{\{}% \zeta_{i,r}(t)-E\big{(}\zeta_{i,r}(t)\big{)}\big{\}}\Big{|}\geq Cn\delta_{n}% \bigg{)}\\ &\leq\sum_{t\in\mathcal{T}_{N}}2\exp\Bigg{(}-\frac{(Cn\delta_{n})^{2}/2}{h^{2r% }\sum_{i=1}^{n}\mathrm{Var}\big{(}\zeta_{i,r}(t)\big{)}+2h^{r}\|\zeta_{i,r}\|_% {\infty}(Cn\delta_{n})/3}\Bigg{)}\\ &\leq 2|\mathcal{T}_{N}|\exp\Bigg{(}-\frac{(C^{2}/2)(\lambda_{n}^{-1}nh^{-1}% \log N)}{\int_{0}^{1}u^{2r}K^{2}(u)\,\mathrm{d}u\,E\|\eta\|_{\infty}^{2}\|\pi% \|_{\infty}(nh^{-1+2r})+(C/3)\|K\|_{\infty}(nh^{-1})}\Bigg{)}\\ &\lesssim N^{\tau}\exp\big{(}-\lambda_{n}^{-1}g(C)\log N\big{)}\\ &=O\big{(}N^{\tau-\lambda_{n}^{-1}g(C)}\big{)},\end{split}start_ROW start_CELL end_CELL start_CELL italic_P ( roman_sup start_POSTSUBSCRIPT italic_t ∈ caligraphic_T start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_h start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT | over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) - italic_E ( over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) ) | ≥ italic_C italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ≤ ∑ start_POSTSUBSCRIPT italic_t ∈ caligraphic_T start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_P ( | ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_h start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT { italic_ζ start_POSTSUBSCRIPT italic_i , italic_r end_POSTSUBSCRIPT ( italic_t ) - italic_E ( italic_ζ start_POSTSUBSCRIPT italic_i , italic_r end_POSTSUBSCRIPT ( italic_t ) ) } | ≥ italic_C italic_n italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ≤ ∑ start_POSTSUBSCRIPT italic_t ∈ caligraphic_T start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT end_POSTSUBSCRIPT 2 roman_exp ( - divide start_ARG ( italic_C italic_n italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 2 end_ARG start_ARG italic_h start_POSTSUPERSCRIPT 2 italic_r end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT roman_Var ( italic_ζ start_POSTSUBSCRIPT italic_i , italic_r end_POSTSUBSCRIPT ( italic_t ) ) + 2 italic_h start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT ∥ italic_ζ start_POSTSUBSCRIPT italic_i , italic_r end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ( italic_C italic_n italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) / 3 end_ARG ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ≤ 2 | caligraphic_T start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT | roman_exp ( - divide start_ARG ( italic_C start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 2 ) ( italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_n italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_log italic_N ) end_ARG start_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_u start_POSTSUPERSCRIPT 2 italic_r end_POSTSUPERSCRIPT italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_u ) roman_d italic_u italic_E ∥ italic_η ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ italic_π ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ( italic_n italic_h start_POSTSUPERSCRIPT - 1 + 2 italic_r end_POSTSUPERSCRIPT ) + ( italic_C / 3 ) ∥ italic_K ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ( italic_n italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) end_ARG ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ≲ italic_N start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT roman_exp ( - italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_g ( italic_C ) roman_log italic_N ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_O ( italic_N start_POSTSUPERSCRIPT italic_τ - italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_g ( italic_C ) end_POSTSUPERSCRIPT ) , end_CELL end_ROW (S.4.10)

where g(C)=(C2/2){01u2rK2(u)duEη2π+(C/3)K}𝑔𝐶superscript𝐶22conditional-setsuperscriptsubscript01superscript𝑢2𝑟superscript𝐾2𝑢differential-d𝑢𝐸evaluated-at𝜂2subscriptnorm𝜋𝐶3subscriptnorm𝐾g(C)=(C^{2}/2)\big{\{}\int_{0}^{1}u^{2r}K^{2}(u)\,\mathrm{d}u\,E\|\eta\|_{% \infty}^{2}\|\pi\|_{\infty}+(C/3)\|K\|_{\infty}\big{\}}italic_g ( italic_C ) = ( italic_C start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 2 ) { ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_u start_POSTSUPERSCRIPT 2 italic_r end_POSTSUPERSCRIPT italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_u ) roman_d italic_u italic_E ∥ italic_η ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ italic_π ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT + ( italic_C / 3 ) ∥ italic_K ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT } is increasing in C>0𝐶0C>0italic_C > 0. Since λn1>0superscriptsubscript𝜆𝑛10\lambda_{n}^{-1}>0italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT > 0 with λn=O(1)subscript𝜆𝑛𝑂1\lambda_{n}=O(1)italic_λ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_O ( 1 ), we conclude that supt𝒯nhr|H~r(t)E(H~r(t))|=OP(δn)subscriptsupremum𝑡subscript𝒯𝑛superscript𝑟subscript~𝐻𝑟𝑡𝐸subscript~𝐻𝑟𝑡subscript𝑂𝑃subscript𝛿𝑛\sup_{t\in\mathcal{T}_{n}}h^{r}\big{|}\widetilde{H}_{r}(t)-E\big{(}\widetilde{% H}_{r}(t)\big{)}\big{|}=O_{P}(\delta_{n})roman_sup start_POSTSUBSCRIPT italic_t ∈ caligraphic_T start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_h start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT | over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) - italic_E ( over~ start_ARG italic_H end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ) ) | = italic_O start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ). ∎

Proof of Proposition 2.2.1.

We note that

vec(𝔹~(t)𝔹0(t))=[i=1n1Li=1LiKh(Ti,t)(𝐙i,(t)𝐗i)(𝐙i,(t)𝐗i)]1×[i=1n1Li=1LiKh(Ti,t)(𝐙i,(t)𝐗i){Yi,𝐗i𝔹0(t)𝐙i,(t)}].vec~𝔹𝑡subscript𝔹0𝑡superscriptdelimited-[]superscriptsubscript𝑖1𝑛1subscript𝐿𝑖superscriptsubscript1subscript𝐿𝑖subscript𝐾subscript𝑇𝑖𝑡tensor-productsubscript𝐙𝑖𝑡subscript𝐗𝑖superscripttensor-productsubscript𝐙𝑖𝑡subscript𝐗𝑖top1delimited-[]superscriptsubscript𝑖1𝑛1subscript𝐿𝑖superscriptsubscript1subscript𝐿𝑖subscript𝐾subscript𝑇𝑖𝑡tensor-productsubscript𝐙𝑖𝑡subscript𝐗𝑖subscript𝑌𝑖superscriptsubscript𝐗𝑖topsubscript𝔹0𝑡subscript𝐙𝑖𝑡\displaystyle\begin{split}\mathrm{vec}\big{(}\widetilde{\mathbb{B}}(t)-\mathbb% {B}_{0}(t)\big{)}&=\Bigg{[}\sum_{i=1}^{n}\frac{1}{L_{i}}\sum_{\ell=1}^{L_{i}}K% _{h}(T_{i,\ell}-t)\big{(}\mathbf{Z}_{i,\ell}(t)\otimes\mathbf{X}_{i}\big{)}% \big{(}\mathbf{Z}_{i,\ell}(t)\otimes\mathbf{X}_{i}\big{)}^{\top}\Bigg{]}^{-1}% \\ &\qquad\times\Bigg{[}\sum_{i=1}^{n}\frac{1}{L_{i}}\sum_{\ell=1}^{L_{i}}K_{h}(T% _{i,\ell}-t)\big{(}\mathbf{Z}_{i,\ell}(t)\otimes\mathbf{X}_{i}\big{)}\Big{\{}Y% _{i,\ell}-\mathbf{X}_{i}^{\top}\mathbb{B}_{0}(t)\mathbf{Z}_{i,\ell}(t)\Big{\}}% \Bigg{]}.\end{split}start_ROW start_CELL roman_vec ( over~ start_ARG blackboard_B end_ARG ( italic_t ) - blackboard_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) ) end_CELL start_CELL = [ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t ) ( bold_Z start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) ⊗ bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ( bold_Z start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) ⊗ bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL × [ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t ) ( bold_Z start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) ⊗ bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) { italic_Y start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT blackboard_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) bold_Z start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) } ] . end_CELL end_ROW (S.4.11)

To analyze the large sample property of 𝜷~(t)𝜷(t)~𝜷𝑡𝜷𝑡\widetilde{\boldsymbol{\beta}}(t)-\boldsymbol{\beta}(t)over~ start_ARG bold_italic_β end_ARG ( italic_t ) - bold_italic_β ( italic_t ), we obtain the asymptotic approximation of the above matrix multiplication by analyzing individual components. For the first factor in (S.4.11), we note that Lemma S.4.1 gives

1ni=1n1Li=1Li(Ti,th)rKh(Ti,t)𝐗i𝐗i=νr(t;h)Γ+OP(logNNh1+r)1𝑛superscriptsubscript𝑖1𝑛1subscript𝐿𝑖superscriptsubscript1subscript𝐿𝑖superscriptsubscript𝑇𝑖𝑡𝑟subscript𝐾subscript𝑇𝑖𝑡subscript𝐗𝑖superscriptsubscript𝐗𝑖topsubscript𝜈𝑟𝑡Γsubscript𝑂𝑃𝑁𝑁superscript1𝑟\displaystyle\begin{split}\frac{1}{n}\sum_{i=1}^{n}\frac{1}{L_{i}}\sum_{\ell=1% }^{L_{i}}\Big{(}\frac{T_{i,\ell}-t}{h}\Big{)}^{r}K_{h}(T_{i,\ell}-t)\mathbf{X}% _{i}\mathbf{X}_{i}^{\top}&=\nu_{r}(t;h)\Gamma+O_{P}\bigg{(}\sqrt{\frac{\log N}% {Nh^{1+r}}}\bigg{)}\end{split}start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( divide start_ARG italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t end_ARG start_ARG italic_h end_ARG ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t ) bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT end_CELL start_CELL = italic_ν start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ; italic_h ) roman_Γ + italic_O start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( square-root start_ARG divide start_ARG roman_log italic_N end_ARG start_ARG italic_N italic_h start_POSTSUPERSCRIPT 1 + italic_r end_POSTSUPERSCRIPT end_ARG end_ARG ) end_CELL end_ROW (S.4.12)

for r{0,1,2}𝑟012r\in\{0,1,2\}italic_r ∈ { 0 , 1 , 2 } and uniformly for t[0,1]𝑡01t\in[0,1]italic_t ∈ [ 0 , 1 ], where νr(t;h)=01(uth)rKh(ut)π(u)dusubscript𝜈𝑟𝑡superscriptsubscript01superscript𝑢𝑡𝑟subscript𝐾𝑢𝑡𝜋𝑢differential-d𝑢\nu_{r}(t;h)=\int_{0}^{1}\big{(}\frac{u-t}{h}\big{)}^{r}K_{h}(u-t)\pi(u)\,% \mathrm{d}uitalic_ν start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ( italic_t ; italic_h ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( divide start_ARG italic_u - italic_t end_ARG start_ARG italic_h end_ARG ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_u - italic_t ) italic_π ( italic_u ) roman_d italic_u and Γ=E(𝐗𝐗)Γ𝐸superscript𝐗𝐗top\Gamma=E(\mathbf{X}\mathbf{X}^{\top})roman_Γ = italic_E ( bold_XX start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ). Similarly, for the second factor in (S.4.11), we have

1ni=1n1Li=1Li(Ti,th)rKh(Ti,t)𝐗i{Yi,𝐗i𝔹(t)𝐙i,(t)}=12h2ν2+r(t;h)(Γ𝜷¨0(t))+o(h2)+OP(h2logNNh1+r)1𝑛superscriptsubscript𝑖1𝑛1subscript𝐿𝑖superscriptsubscript1subscript𝐿𝑖superscriptsubscript𝑇𝑖𝑡𝑟subscript𝐾subscript𝑇𝑖𝑡subscript𝐗𝑖subscript𝑌𝑖superscriptsubscript𝐗𝑖top𝔹𝑡subscript𝐙𝑖𝑡12superscript2subscript𝜈2𝑟𝑡Γsubscript¨𝜷0𝑡𝑜superscript2subscript𝑂𝑃superscript2𝑁𝑁superscript1𝑟\displaystyle\begin{split}&\frac{1}{n}\sum_{i=1}^{n}\frac{1}{L_{i}}\sum_{\ell=% 1}^{L_{i}}\Big{(}\frac{T_{i,\ell}-t}{h}\Big{)}^{r}K_{h}(T_{i,\ell}-t)\mathbf{X% }_{i}\Big{\{}Y_{i,\ell}-\mathbf{X}_{i}^{\top}\mathbb{B}(t)\mathbf{Z}_{i,\ell}(% t)\Big{\}}\\ &=\frac{1}{2}h^{2}\nu_{2+r}(t;h)\big{(}\Gamma\ddot{\boldsymbol{\beta}}_{0}(t)% \big{)}+o(h^{2})+O_{P}\bigg{(}h^{2}\sqrt{\frac{\log N}{Nh^{1+r}}}\bigg{)}\end{split}start_ROW start_CELL end_CELL start_CELL divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( divide start_ARG italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t end_ARG start_ARG italic_h end_ARG ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t ) bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT { italic_Y start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT blackboard_B ( italic_t ) bold_Z start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) } end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_ν start_POSTSUBSCRIPT 2 + italic_r end_POSTSUBSCRIPT ( italic_t ; italic_h ) ( roman_Γ over¨ start_ARG bold_italic_β end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) ) + italic_o ( italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) + italic_O start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT square-root start_ARG divide start_ARG roman_log italic_N end_ARG start_ARG italic_N italic_h start_POSTSUPERSCRIPT 1 + italic_r end_POSTSUPERSCRIPT end_ARG end_ARG ) end_CELL end_ROW (S.4.13)

for r{0,1}𝑟01r\in\{0,1\}italic_r ∈ { 0 , 1 } and uniformly for t[0,1]𝑡01t\in[0,1]italic_t ∈ [ 0 , 1 ], where Yi,𝐗i𝔹(t)𝐙i,(t)=𝐗i𝜷¨(t){(Ti,t)2/2}(1+o(|Ti,t|))+εi,subscript𝑌𝑖superscriptsubscript𝐗𝑖top𝔹𝑡subscript𝐙𝑖𝑡superscriptsubscript𝐗𝑖top¨𝜷𝑡superscriptsubscript𝑇𝑖𝑡221𝑜subscript𝑇𝑖𝑡subscript𝜀𝑖Y_{i,\ell}-\mathbf{X}_{i}^{\top}\mathbb{B}(t)\mathbf{Z}_{i,\ell}(t)=\mathbf{X}% _{i}^{\top}\ddot{\boldsymbol{\beta}}(t)\{(T_{i,\ell}-t)^{2}/2\}\cdot(1+o(|T_{i% ,\ell}-t|))+\varepsilon_{i,\ell}italic_Y start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT blackboard_B ( italic_t ) bold_Z start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) = bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¨ start_ARG bold_italic_β end_ARG ( italic_t ) { ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 2 } ⋅ ( 1 + italic_o ( | italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t | ) ) + italic_ε start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT for Ti,subscript𝑇𝑖T_{i,\ell}italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT near t𝑡titalic_t. Combining (S.4.12) and (S.4.13), we have

𝜷~(t)𝜷0(t)=[IpOp]vec(𝔹~(t)𝔹0(t))=h22[(1,0)Ip][(ν0(t;h)ν1(t;h)ν1(t;h)ν2(t;h))Γ]1[(ν2(t;h)ν3(t;h))(Γ𝜷¨0(t))]+o(h2)+OP(logNNh)=h22𝜷¨0(t)(ν2(t;h)2ν1(t;h)μ3(t;h)ν0(t;h)ν2(t;h)ν1(t;h)2)+o(h2)+OP(logNNh),~𝜷𝑡subscript𝜷0𝑡superscriptdelimited-[]subscript𝐼𝑝subscript𝑂𝑝topvec~𝔹𝑡subscript𝔹0𝑡superscript22delimited-[]tensor-productsuperscript10topsubscript𝐼𝑝superscriptdelimited-[]tensor-productsubscript𝜈0𝑡subscript𝜈1𝑡subscript𝜈1𝑡subscript𝜈2𝑡Γ1delimited-[]tensor-productsubscript𝜈2𝑡subscript𝜈3𝑡Γsubscript¨𝜷0𝑡𝑜superscript2subscript𝑂𝑃𝑁𝑁superscript22subscript¨𝜷0𝑡subscript𝜈2superscript𝑡2subscript𝜈1𝑡subscript𝜇3𝑡subscript𝜈0𝑡subscript𝜈2𝑡subscript𝜈1superscript𝑡2𝑜superscript2subscript𝑂𝑃𝑁𝑁\displaystyle\begin{split}\widetilde{\boldsymbol{\beta}}(t)-\boldsymbol{\beta}% _{0}(t)&=\left[\begin{array}[]{c}I_{p}\\ O_{p}\end{array}\right]^{\top}\mathrm{vec}\big{(}\widetilde{\mathbb{B}}(t)-% \mathbb{B}_{0}(t)\big{)}\\ &=\frac{h^{2}}{2}\Big{[}(1,0)^{\top}\otimes I_{p}\Big{]}\Bigg{[}\left(\begin{% array}[]{cc}\nu_{0}(t;h)&\nu_{1}(t;h)\\ \nu_{1}(t;h)&\nu_{2}(t;h)\end{array}\right)\otimes\Gamma\Bigg{]}^{-1}\Bigg{[}% \left(\begin{array}[]{c}\nu_{2}(t;h)\\ \nu_{3}(t;h)\end{array}\right)\otimes\big{(}\Gamma\ddot{\boldsymbol{\beta}}_{0% }(t)\big{)}\Bigg{]}\\ &\qquad+o(h^{2})+O_{P}\bigg{(}\sqrt{\frac{\log N}{Nh}}\bigg{)}\\ &=\frac{h^{2}}{2}\ddot{\boldsymbol{\beta}}_{0}(t)\bigg{(}\frac{\nu_{2}(t;h)^{2% }-\nu_{1}(t;h)\mu_{3}(t;h)}{\nu_{0}(t;h)\nu_{2}(t;h)-\nu_{1}(t;h)^{2}}\bigg{)}% +o(h^{2})+O_{P}\bigg{(}\sqrt{\frac{\log N}{Nh}}\bigg{)},\end{split}start_ROW start_CELL over~ start_ARG bold_italic_β end_ARG ( italic_t ) - bold_italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) end_CELL start_CELL = [ start_ARRAY start_ROW start_CELL italic_I start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_O start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_CELL end_ROW end_ARRAY ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_vec ( over~ start_ARG blackboard_B end_ARG ( italic_t ) - blackboard_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = divide start_ARG italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG [ ( 1 , 0 ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ⊗ italic_I start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ] [ ( start_ARRAY start_ROW start_CELL italic_ν start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ; italic_h ) end_CELL start_CELL italic_ν start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ; italic_h ) end_CELL end_ROW start_ROW start_CELL italic_ν start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ; italic_h ) end_CELL start_CELL italic_ν start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_t ; italic_h ) end_CELL end_ROW end_ARRAY ) ⊗ roman_Γ ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT [ ( start_ARRAY start_ROW start_CELL italic_ν start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_t ; italic_h ) end_CELL end_ROW start_ROW start_CELL italic_ν start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_t ; italic_h ) end_CELL end_ROW end_ARRAY ) ⊗ ( roman_Γ over¨ start_ARG bold_italic_β end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) ) ] end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL + italic_o ( italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) + italic_O start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( square-root start_ARG divide start_ARG roman_log italic_N end_ARG start_ARG italic_N italic_h end_ARG end_ARG ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = divide start_ARG italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG over¨ start_ARG bold_italic_β end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ) ( divide start_ARG italic_ν start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_t ; italic_h ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_ν start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ; italic_h ) italic_μ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ( italic_t ; italic_h ) end_ARG start_ARG italic_ν start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t ; italic_h ) italic_ν start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_t ; italic_h ) - italic_ν start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ; italic_h ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) + italic_o ( italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) + italic_O start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( square-root start_ARG divide start_ARG roman_log italic_N end_ARG start_ARG italic_N italic_h end_ARG end_ARG ) , end_CELL end_ROW (S.4.14)

where it can be verified that the first term is O(h2)𝑂superscript2O(h^{2})italic_O ( italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) uniformly for t[0,1]𝑡01t\in[0,1]italic_t ∈ [ 0 , 1 ]. ∎

Remark 4.

Using the similar argument in (S.4.14), one can further verify that

supt[0,1]h𝜷˙~(t)𝜷˙(t)=OP(h2+logNNh).subscriptsupremum𝑡01delimited-∥∥~˙𝜷𝑡˙𝜷𝑡subscript𝑂𝑃superscript2𝑁𝑁\displaystyle\begin{split}\sup_{t\in[0,1]}h\Big{\|}\widetilde{\dot{\boldsymbol% {\beta}}}(t)-\dot{\boldsymbol{\beta}}(t)\Big{\|}&=O_{P}\bigg{(}h^{2}+\sqrt{% \frac{\log N}{Nh}}\bigg{)}.\end{split}start_ROW start_CELL roman_sup start_POSTSUBSCRIPT italic_t ∈ [ 0 , 1 ] end_POSTSUBSCRIPT italic_h ∥ over~ start_ARG over˙ start_ARG bold_italic_β end_ARG end_ARG ( italic_t ) - over˙ start_ARG bold_italic_β end_ARG ( italic_t ) ∥ end_CELL start_CELL = italic_O start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + square-root start_ARG divide start_ARG roman_log italic_N end_ARG start_ARG italic_N italic_h end_ARG end_ARG ) . end_CELL end_ROW (S.4.15)
Proof of Theorem 2.2.2.

We write

𝒬n(𝔹(t),𝔹~(t))=vec(𝔹(t)𝔹~(t))Ψ^(t)vec(𝔹(t)𝔹~(t)),subscript𝒬𝑛𝔹𝑡~𝔹𝑡vecsuperscript𝔹𝑡~𝔹𝑡top^Ψ𝑡vec𝔹𝑡~𝔹𝑡\displaystyle\begin{split}\mathcal{Q}_{n}\big{(}{\mathbb{B}}(t),\widetilde{% \mathbb{B}}(t)\big{)}&=\mathrm{vec}\big{(}\mathbb{B}(t)-\widetilde{\mathbb{B}}% (t)\big{)}^{\top}\widehat{\Psi}(t)\mathrm{vec}\big{(}\mathbb{B}(t)-\widetilde{% \mathbb{B}}(t)\big{)},\end{split}start_ROW start_CELL caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( blackboard_B ( italic_t ) , over~ start_ARG blackboard_B end_ARG ( italic_t ) ) end_CELL start_CELL = roman_vec ( blackboard_B ( italic_t ) - over~ start_ARG blackboard_B end_ARG ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over^ start_ARG roman_Ψ end_ARG ( italic_t ) roman_vec ( blackboard_B ( italic_t ) - over~ start_ARG blackboard_B end_ARG ( italic_t ) ) , end_CELL end_ROW (S.4.16)

where Ψ^(t)=1ni=1n1Li=1LiKh(Ti,t)𝕎i,(t)𝕎i,(t)^Ψ𝑡1𝑛superscriptsubscript𝑖1𝑛1subscript𝐿𝑖superscriptsubscript1subscript𝐿𝑖subscript𝐾subscript𝑇𝑖𝑡subscript𝕎𝑖𝑡subscript𝕎𝑖superscript𝑡top\widehat{\Psi}(t)=\frac{1}{n}\sum_{i=1}^{n}\frac{1}{L_{i}}\sum_{\ell=1}^{L_{i}% }K_{h}(T_{i,\ell}-t)\mathbb{W}_{i,\ell}(t)\mathbb{W}_{i,\ell}(t)^{\top}over^ start_ARG roman_Ψ end_ARG ( italic_t ) = divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT roman_ℓ = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT - italic_t ) blackboard_W start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) blackboard_W start_POSTSUBSCRIPT italic_i , roman_ℓ end_POSTSUBSCRIPT ( italic_t ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. It follows from Lemma S.4.1 that

Ψ^(t)=𝒩(t;h)Γ+oP(1)^Ψ𝑡tensor-product𝒩𝑡Γsubscript𝑜𝑃1\displaystyle\begin{split}\widehat{\Psi}(t)&=\mathcal{N}(t;h)\otimes\Gamma+o_{% P}(1)\end{split}start_ROW start_CELL over^ start_ARG roman_Ψ end_ARG ( italic_t ) end_CELL start_CELL = caligraphic_N ( italic_t ; italic_h ) ⊗ roman_Γ + italic_o start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( 1 ) end_CELL end_ROW (S.4.17)

uniformly for t[0,1]𝑡01t\in[0,1]italic_t ∈ [ 0 , 1 ], where Γ=E(𝐗𝐗)Γ𝐸superscript𝐗𝐗top\Gamma=E(\mathbf{X}\mathbf{X}^{\top})roman_Γ = italic_E ( bold_XX start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) and 𝒩(t;h)𝒩𝑡\mathcal{N}(t;h)caligraphic_N ( italic_t ; italic_h ) is the 2×2222\times 22 × 2 matrix whose (r+1,r+1)𝑟1superscript𝑟1(r+1,r^{\prime}+1)( italic_r + 1 , italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + 1 )-component is given by νr+r(t;h)=01(uth)r+rKh(ut)π(u)dusubscript𝜈𝑟superscript𝑟𝑡superscriptsubscript01superscript𝑢𝑡𝑟superscript𝑟subscript𝐾𝑢𝑡𝜋𝑢differential-d𝑢\nu_{r+r^{\prime}}(t;h)=\int_{0}^{1}\big{(}\frac{u-t}{h}\big{)}^{r+r^{\prime}}% K_{h}(u-t)\pi(u)\,\mathrm{d}uitalic_ν start_POSTSUBSCRIPT italic_r + italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_t ; italic_h ) = ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ( divide start_ARG italic_u - italic_t end_ARG start_ARG italic_h end_ARG ) start_POSTSUPERSCRIPT italic_r + italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_u - italic_t ) italic_π ( italic_u ) roman_d italic_u for 0r,r1formulae-sequence0𝑟superscript𝑟10\leq r,r^{\prime}\leq 10 ≤ italic_r , italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≤ 1. For t(0,1)𝑡01t\in(0,1)italic_t ∈ ( 0 , 1 ), we note that 𝒩(t;h)=π(t)𝒩𝒩𝑡𝜋𝑡𝒩\mathcal{N}(t;h)=\pi(t)\mathcal{N}caligraphic_N ( italic_t ; italic_h ) = italic_π ( italic_t ) caligraphic_N is strictly positive definite, where 𝒩=diag(ν0,ν2)𝒩diagsubscript𝜈0subscript𝜈2\mathcal{N}=\mathrm{diag}(\nu_{0},\nu_{2})caligraphic_N = roman_diag ( italic_ν start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ν start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) is a diagonal matrix with νr=11urK(u)dusubscript𝜈𝑟superscriptsubscript11superscript𝑢𝑟𝐾𝑢differential-d𝑢\nu_{r}=\int_{-1}^{1}u^{r}K(u)\,\mathrm{d}uitalic_ν start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT = ∫ start_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT italic_u start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT italic_K ( italic_u ) roman_d italic_u. Therefore, with probability tending to one, we have

1Mm=1M𝒬n(𝔹0(tm),𝔹^(tm))1Mm=1M(𝒬n(𝔹0(tm),𝔹~(tm))+𝒬n(𝔹^(tm),𝔹~(tm)))2Mm=1M𝒬n(𝔹0(tm),𝔹~(tm))2σmax(𝒩Γ)supt[0,1]vec(𝔹(t)𝔹~(t))2=OP(h4+logNNh),1𝑀superscriptsubscript𝑚1𝑀subscript𝒬𝑛subscript𝔹0subscript𝑡𝑚^𝔹subscript𝑡𝑚1𝑀superscriptsubscript𝑚1𝑀subscript𝒬𝑛subscript𝔹0subscript𝑡𝑚~𝔹subscript𝑡𝑚subscript𝒬𝑛^𝔹subscript𝑡𝑚~𝔹subscript𝑡𝑚2𝑀superscriptsubscript𝑚1𝑀subscript𝒬𝑛subscript𝔹0subscript𝑡𝑚~𝔹subscript𝑡𝑚2subscript𝜎maxtensor-product𝒩Γsubscriptsupremum𝑡01superscriptdelimited-∥∥vec𝔹𝑡~𝔹𝑡2subscript𝑂𝑃superscript4𝑁𝑁\displaystyle\begin{split}\frac{1}{M}\sum_{m=1}^{M}\mathcal{Q}_{n}\big{(}% \mathbb{B}_{0}(t_{m}),\widehat{\mathbb{B}}(t_{m})\big{)}&\leq\frac{1}{M}\sum_{% m=1}^{M}\Big{(}\mathcal{Q}_{n}\big{(}\mathbb{B}_{0}(t_{m}),\widetilde{\mathbb{% B}}(t_{m})\big{)}+\mathcal{Q}_{n}\big{(}\widehat{\mathbb{B}}(t_{m}),\widetilde% {\mathbb{B}}(t_{m})\big{)}\Big{)}\\ &\leq\frac{2}{M}\sum_{m=1}^{M}\mathcal{Q}_{n}\big{(}\mathbb{B}_{0}(t_{m}),% \widetilde{\mathbb{B}}(t_{m})\big{)}\\ &\leq 2\sigma_{\mathrm{max}}(\mathcal{N}\otimes\Gamma)\sup_{t\in[0,1]}\big{\|}% \mathrm{vec}\big{(}\mathbb{B}(t)-\widetilde{\mathbb{B}}(t)\big{)}\big{\|}^{2}% \\ &=O_{P}\bigg{(}h^{4}+\frac{\log N}{Nh}\bigg{)},\end{split}start_ROW start_CELL divide start_ARG 1 end_ARG start_ARG italic_M end_ARG ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( blackboard_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , over^ start_ARG blackboard_B end_ARG ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ) end_CELL start_CELL ≤ divide start_ARG 1 end_ARG start_ARG italic_M end_ARG ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT ( caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( blackboard_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , over~ start_ARG blackboard_B end_ARG ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ) + caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( over^ start_ARG blackboard_B end_ARG ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , over~ start_ARG blackboard_B end_ARG ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ) ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ≤ divide start_ARG 2 end_ARG start_ARG italic_M end_ARG ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( blackboard_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , over~ start_ARG blackboard_B end_ARG ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ≤ 2 italic_σ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( caligraphic_N ⊗ roman_Γ ) roman_sup start_POSTSUBSCRIPT italic_t ∈ [ 0 , 1 ] end_POSTSUBSCRIPT ∥ roman_vec ( blackboard_B ( italic_t ) - over~ start_ARG blackboard_B end_ARG ( italic_t ) ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_O start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_h start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT + divide start_ARG roman_log italic_N end_ARG start_ARG italic_N italic_h end_ARG ) , end_CELL end_ROW (S.4.18)

where σmax(𝒩Γ)subscript𝜎maxtensor-product𝒩Γ\sigma_{\mathrm{max}}(\mathcal{N}\otimes\Gamma)italic_σ start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ( caligraphic_N ⊗ roman_Γ ) denotes the maximum eigenvalue of 𝒩Γtensor-product𝒩Γ\mathcal{N}\otimes\Gammacaligraphic_N ⊗ roman_Γ. To get the second inequality in (S.4.18), we used the fact that m=1M𝒬n(𝔹^(tm),𝔹~(tm))m=1M𝒬n(𝔹0(tm),𝔹~(tm))superscriptsubscript𝑚1𝑀subscript𝒬𝑛^𝔹subscript𝑡𝑚~𝔹subscript𝑡𝑚superscriptsubscript𝑚1𝑀subscript𝒬𝑛subscript𝔹0subscript𝑡𝑚~𝔹subscript𝑡𝑚\sum_{m=1}^{M}\mathcal{Q}_{n}\big{(}\widehat{\mathbb{B}}(t_{m}),\widetilde{% \mathbb{B}}(t_{m})\big{)}\leq\sum_{m=1}^{M}\mathcal{Q}_{n}\big{(}\mathbb{B}_{0% }(t_{m}),\widetilde{\mathbb{B}}(t_{m})\big{)}∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( over^ start_ARG blackboard_B end_ARG ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , over~ start_ARG blackboard_B end_ARG ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ) ≤ ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( blackboard_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , over~ start_ARG blackboard_B end_ARG ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ) holds because 𝔹0subscript𝔹0\mathbb{B}_{0}blackboard_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT satisfies the side conditions (2.9) under which 𝔹^^𝔹\widehat{\mathbb{B}}over^ start_ARG blackboard_B end_ARG minimizes m=1M𝒬n(𝔹(tm),𝔹~(tm))superscriptsubscript𝑚1𝑀subscript𝒬𝑛𝔹subscript𝑡𝑚~𝔹subscript𝑡𝑚\sum_{m=1}^{M}\mathcal{Q}_{n}\big{(}\mathbb{B}(t_{m}),\widetilde{\mathbb{B}}(t% _{m})\big{)}∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( blackboard_B ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , over~ start_ARG blackboard_B end_ARG ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ). The last equality in (S.4.18) follows from Proposition 2.2.1 and (S.4.15). Finally, we conclude from (S.4.18) that

max1mM𝜷^(tm)𝜷0(tm)2max1mMvec(𝔹0(tm)𝔹^(tm))2σmin(𝒩Γ)1m=1M𝒬n(𝔹0(tm),𝔹^(tm))=OP(h4+logNNh),subscript1𝑚𝑀superscriptdelimited-∥∥^𝜷subscript𝑡𝑚subscript𝜷0subscript𝑡𝑚2subscript1𝑚𝑀superscriptdelimited-∥∥vecsubscript𝔹0subscript𝑡𝑚^𝔹subscript𝑡𝑚2subscript𝜎minsuperscripttensor-product𝒩Γ1superscriptsubscript𝑚1𝑀subscript𝒬𝑛subscript𝔹0subscript𝑡𝑚^𝔹subscript𝑡𝑚subscript𝑂𝑃superscript4𝑁𝑁\displaystyle\begin{split}\max_{1\leq m\leq M}\big{\|}\widehat{\boldsymbol{% \beta}}(t_{m})-\boldsymbol{\beta}_{0}(t_{m})\big{\|}^{2}&\leq\max_{1\leq m\leq M% }\big{\|}\mathrm{vec}\big{(}\mathbb{B}_{0}(t_{m})-\widehat{\mathbb{B}}(t_{m})% \big{)}\big{\|}^{2}\\ &\leq\sigma_{\mathrm{min}}(\mathcal{N}\otimes\Gamma)^{-1}\sum_{m=1}^{M}% \mathcal{Q}_{n}\big{(}\mathbb{B}_{0}(t_{m}),\widehat{\mathbb{B}}(t_{m})\big{)}% \\ &=O_{P}\bigg{(}h^{4}+\frac{\log N}{Nh}\bigg{)},\end{split}start_ROW start_CELL roman_max start_POSTSUBSCRIPT 1 ≤ italic_m ≤ italic_M end_POSTSUBSCRIPT ∥ over^ start_ARG bold_italic_β end_ARG ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) - bold_italic_β start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL ≤ roman_max start_POSTSUBSCRIPT 1 ≤ italic_m ≤ italic_M end_POSTSUBSCRIPT ∥ roman_vec ( blackboard_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) - over^ start_ARG blackboard_B end_ARG ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ≤ italic_σ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( caligraphic_N ⊗ roman_Γ ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT caligraphic_Q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( blackboard_B start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) , over^ start_ARG blackboard_B end_ARG ( italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_O start_POSTSUBSCRIPT italic_P end_POSTSUBSCRIPT ( italic_h start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT + divide start_ARG roman_log italic_N end_ARG start_ARG italic_N italic_h end_ARG ) , end_CELL end_ROW (S.4.19)

where σmin(𝒩Γ)subscript𝜎mintensor-product𝒩Γ\sigma_{\mathrm{min}}(\mathcal{N}\otimes\Gamma)italic_σ start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ( caligraphic_N ⊗ roman_Γ ) denotes the minimum eigenvalues of 𝒩Γtensor-product𝒩Γ\mathcal{N}\otimes\Gammacaligraphic_N ⊗ roman_Γ. ∎

S.5 Proof of Theorem 2.3.1

Proof.

Let 𝝁isubscript𝝁𝑖\boldsymbol{\mu}_{i}bold_italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT denote the true mean vector of responses of the i𝑖iitalic_ith trajectory when (2.2) is true. We define 𝝁~isubscript~𝝁𝑖\tilde{\boldsymbol{\mu}}_{i}over~ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and 𝝁^isubscript^𝝁𝑖\hat{\boldsymbol{\mu}}_{i}over^ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT be the estimated responses calculated from the unconstrained 𝜷~(t)=(β~1(t),,β~p(t))~𝜷𝑡superscriptsubscript~𝛽1𝑡subscript~𝛽𝑝𝑡top\tilde{\boldsymbol{\beta}}(t)=\big{(}\tilde{\beta}_{1}(t),\ldots,\tilde{\beta}% _{p}(t)\big{)}^{\top}over~ start_ARG bold_italic_β end_ARG ( italic_t ) = ( over~ start_ARG italic_β end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) , … , over~ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT and constrained 𝜷^(t)=(β^1(t),,β^p(t))^𝜷𝑡superscriptsubscript^𝛽1𝑡subscript^𝛽𝑝𝑡top\hat{\boldsymbol{\beta}}(t)=\big{(}\hat{\beta}_{1}(t),\ldots,\hat{\beta}_{p}(t% )\big{)}^{\top}over^ start_ARG bold_italic_β end_ARG ( italic_t ) = ( over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_t ) , … , over^ start_ARG italic_β end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_t ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, respectively. Let 𝒂2=𝒂𝒂superscriptnorm𝒂2superscript𝒂top𝒂\|\boldsymbol{a}\|^{2}=\boldsymbol{a}^{\top}\boldsymbol{a}∥ bold_italic_a ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = bold_italic_a start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT bold_italic_a and define vec[𝝁i]=(𝝁1,,𝝁n)vecdelimited-[]subscript𝝁𝑖superscriptsuperscriptsubscript𝝁1topsuperscriptsubscript𝝁𝑛toptop\textrm{vec}[\boldsymbol{\mu}_{i}]=(\boldsymbol{\mu}_{1}^{\top},\ldots,% \boldsymbol{\mu}_{n}^{\top})^{\top}vec [ bold_italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] = ( bold_italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , … , bold_italic_μ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT. Then we have

vec[𝝁~i]vec[𝝁i]2vec[𝝁^i]vec[𝝁i]2superscriptnormvecdelimited-[]subscript~𝝁𝑖vecdelimited-[]subscript𝝁𝑖2superscriptnormvecdelimited-[]subscript^𝝁𝑖vecdelimited-[]subscript𝝁𝑖2\displaystyle||\text{vec}[\tilde{\boldsymbol{\mu}}_{i}]-\text{vec}[\boldsymbol% {\mu}_{i}]||^{2}-||\text{vec}[\hat{\boldsymbol{\mu}}_{i}]-\text{vec}[% \boldsymbol{\mu}_{i}]||^{2}| | vec [ over~ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] - vec [ bold_italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - | | vec [ over^ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] - vec [ bold_italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
=vec[𝝁~i]vec[𝝁^i]2+2(vec[𝝁~i]vec[𝝁^i])(vec[𝝁^i]vec[𝝁i])absentsuperscriptnormvecdelimited-[]subscript~𝝁𝑖vecdelimited-[]subscript^𝝁𝑖22superscriptvecdelimited-[]subscript~𝝁𝑖vecdelimited-[]subscript^𝝁𝑖topvecdelimited-[]subscript^𝝁𝑖vecdelimited-[]subscript𝝁𝑖\displaystyle=||\text{vec}[\tilde{\boldsymbol{\mu}}_{i}]-\text{vec}[\hat{% \boldsymbol{\mu}}_{i}]||^{2}+2(\text{vec}[\tilde{\boldsymbol{\mu}}_{i}]-\text{% vec}[\hat{\boldsymbol{\mu}}_{i}])^{\top}(\text{vec}[\hat{\boldsymbol{\mu}}_{i}% ]-\text{vec}[\boldsymbol{\mu}_{i}])= | | vec [ over~ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] - vec [ over^ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 2 ( vec [ over~ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] - vec [ over^ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( vec [ over^ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] - vec [ bold_italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] )
=vec[𝝁~i]vec[𝝁^i]22(vec[𝐘i]vec[𝝁~i])(vec[𝝁^i]vec[𝝁i])absentsuperscriptnormvecdelimited-[]subscript~𝝁𝑖vecdelimited-[]subscript^𝝁𝑖22superscriptvecdelimited-[]subscript𝐘𝑖vecdelimited-[]subscript~𝝁𝑖topvecdelimited-[]subscript^𝝁𝑖vecdelimited-[]subscript𝝁𝑖\displaystyle=||\text{vec}[\tilde{\boldsymbol{\mu}}_{i}]-\text{vec}[\hat{% \boldsymbol{\mu}}_{i}]||^{2}-2(\text{vec}[\mathbf{Y}_{i}]-\text{vec}[\tilde{% \boldsymbol{\mu}}_{i}])^{\top}(\text{vec}[\hat{\boldsymbol{\mu}}_{i}]-\text{% vec}[\boldsymbol{\mu}_{i}])= | | vec [ over~ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] - vec [ over^ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - 2 ( vec [ bold_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] - vec [ over~ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( vec [ over^ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] - vec [ bold_italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] ) (S.5.1)
+2(vec[𝐘i]vec[𝝁^i])(vec[𝝁^i]vec[𝝁i]).2superscriptvecdelimited-[]subscript𝐘𝑖vecdelimited-[]subscript^𝝁𝑖topvecdelimited-[]subscript^𝝁𝑖vecdelimited-[]subscript𝝁𝑖\displaystyle~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{% }~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}+2(\text{vec}[% \mathbf{Y}_{i}]-\text{vec}[\hat{\boldsymbol{\mu}}_{i}])^{\top}(\text{vec}[\hat% {\boldsymbol{\mu}}_{i}]-\text{vec}[\boldsymbol{\mu}_{i}]).+ 2 ( vec [ bold_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] - vec [ over^ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( vec [ over^ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] - vec [ bold_italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] ) .

The second term on the last equation is zero because of the orthogonality of (𝐘i𝝁~i)subscript𝐘𝑖subscript~𝝁𝑖(\mathbf{Y}_{i}-\tilde{\boldsymbol{\mu}}_{i})( bold_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - over~ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) and (𝝁^i𝝁i)subscript^𝝁𝑖subscript𝝁𝑖(\hat{\boldsymbol{\mu}}_{i}-\boldsymbol{\mu}_{i})( over^ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - bold_italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), for i=1,,n𝑖1𝑛i=1,\ldots,nitalic_i = 1 , … , italic_n. Based on Proposition 3.12.3 of Silvapulle and Sen (2005), the last term is non-negative by Karush-Kuhn-Tucker conditions as follows,

𝐘i𝝁^i,𝝁^i=0 and𝐘i𝝁^i,𝝁i0,subscript𝐘𝑖subscript^𝝁𝑖subscript^𝝁𝑖0 andsubscript𝐘𝑖subscript^𝝁𝑖subscript𝝁𝑖0\langle\mathbf{Y}_{i}-\hat{\boldsymbol{\mu}}_{i},~{}\hat{\boldsymbol{\mu}}_{i}% \rangle=0~{}\text{~{}and}~{}\langle\mathbf{Y}_{i}-\hat{\boldsymbol{\mu}}_{i},% \boldsymbol{\mu}_{i}\rangle\leq 0,⟨ bold_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - over^ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , over^ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⟩ = 0 and ⟨ bold_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - over^ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⟩ ≤ 0 ,

for i=1,,n𝑖1𝑛i=1,\ldots,nitalic_i = 1 , … , italic_n. Thus, we have vec[𝝁^i]vec[𝝁i]2vec[𝝁~i]vec[𝝁i]2.superscriptnormvecdelimited-[]subscript^𝝁𝑖vecdelimited-[]subscript𝝁𝑖2superscriptnormvecdelimited-[]subscript~𝝁𝑖vecdelimited-[]subscript𝝁𝑖2||\text{vec}[\hat{\boldsymbol{\mu}}_{i}]-\text{vec}[\boldsymbol{\mu}_{i}]||^{2% }\leq||\text{vec}[\tilde{\boldsymbol{\mu}}_{i}]-\text{vec}[\boldsymbol{\mu}_{i% }]||^{2}.| | vec [ over^ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] - vec [ bold_italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ | | vec [ over~ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] - vec [ bold_italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . Then, from Condition (S2), 𝝁~i𝝁i2𝐗i𝜷(t)𝐗i𝜷~(t)L22=𝐗i{𝜷~(t)𝜷(t)}L22M1𝜷(t)𝜷~(t)L22.similar-to-or-equalssuperscriptnormsubscript~𝝁𝑖subscript𝝁𝑖2subscriptsuperscriptnormsubscript𝐗𝑖𝜷𝑡subscript𝐗𝑖~𝜷𝑡2subscript𝐿2subscriptsuperscriptnormsubscript𝐗𝑖~𝜷𝑡𝜷𝑡2subscript𝐿2subscript𝑀1subscriptsuperscriptnorm𝜷𝑡~𝜷𝑡2subscript𝐿2||\tilde{\boldsymbol{\mu}}_{i}-{\boldsymbol{\mu}}_{i}||^{2}\simeq||\mathbf{X}_% {i}\boldsymbol{\beta}(t)-\mathbf{X}_{i}\tilde{\boldsymbol{\beta}}(t)||^{2}_{L_% {2}}=||\mathbf{X}_{i}\{\tilde{\boldsymbol{\beta}}(t)-{\boldsymbol{\beta}}(t)\}% ||^{2}_{L_{2}}\leq M_{1}||\boldsymbol{\beta}(t)-\tilde{\boldsymbol{\beta}}(t)|% |^{2}_{L_{2}}.| | over~ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - bold_italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≃ | | bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT bold_italic_β ( italic_t ) - bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT over~ start_ARG bold_italic_β end_ARG ( italic_t ) | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = | | bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT { over~ start_ARG bold_italic_β end_ARG ( italic_t ) - bold_italic_β ( italic_t ) } | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≤ italic_M start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | | bold_italic_β ( italic_t ) - over~ start_ARG bold_italic_β end_ARG ( italic_t ) | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT . For non-trivial 𝐗isubscript𝐗𝑖\mathbf{X}_{i}bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, we can always find M4>0subscript𝑀40M_{4}>0italic_M start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT > 0 satisfying 𝝁^i𝝁i2𝐗i{𝜷^(t)𝜷(t)}L22M4𝜷^(t)𝜷(t)L22.similar-to-or-equalssuperscriptnormsubscript^𝝁𝑖subscript𝝁𝑖2subscriptsuperscriptnormsubscript𝐗𝑖^𝜷𝑡𝜷𝑡2subscript𝐿2subscript𝑀4subscriptsuperscriptnorm^𝜷𝑡𝜷𝑡2subscript𝐿2||\hat{\boldsymbol{\mu}}_{i}-{\boldsymbol{\mu}}_{i}||^{2}\simeq||\mathbf{X}_{i% }\{{\hat{\boldsymbol{\beta}}}(t)-{\boldsymbol{\beta}}(t)\}||^{2}_{L_{2}}\geq M% _{4}||\hat{\boldsymbol{\beta}}(t)-{\boldsymbol{\beta}}(t)||^{2}_{L_{2}}.| | over^ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - bold_italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≃ | | bold_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT { over^ start_ARG bold_italic_β end_ARG ( italic_t ) - bold_italic_β ( italic_t ) } | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≥ italic_M start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | | over^ start_ARG bold_italic_β end_ARG ( italic_t ) - bold_italic_β ( italic_t ) | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT . Hence, altogether, we have M4𝜷^(t)𝜷(t)L22𝝁^i𝝁i2𝝁~i𝝁i2M3𝜷~(t)𝜷~(t)L22subscript𝑀4subscriptsuperscriptnorm^𝜷𝑡𝜷𝑡2subscript𝐿2superscriptnormsubscript^𝝁𝑖subscript𝝁𝑖2superscriptnormsubscript~𝝁𝑖subscript𝝁𝑖2subscript𝑀3subscriptsuperscriptnorm~𝜷𝑡~𝜷𝑡2subscript𝐿2M_{4}||\hat{\boldsymbol{\beta}}(t)-{\boldsymbol{\beta}}(t)||^{2}_{L_{2}}\leq||% \hat{\boldsymbol{\mu}}_{i}-{\boldsymbol{\mu}}_{i}||^{2}\leq||\tilde{% \boldsymbol{\mu}}_{i}-{\boldsymbol{\mu}}_{i}||^{2}\leq M_{3}||\tilde{% \boldsymbol{\beta}}(t)-\tilde{\boldsymbol{\beta}}(t)||^{2}_{L_{2}}italic_M start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT | | over^ start_ARG bold_italic_β end_ARG ( italic_t ) - bold_italic_β ( italic_t ) | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≤ | | over^ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - bold_italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ | | over~ start_ARG bold_italic_μ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - bold_italic_μ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ italic_M start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT | | over~ start_ARG bold_italic_β end_ARG ( italic_t ) - over~ start_ARG bold_italic_β end_ARG ( italic_t ) | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, which leads to 𝜷^j(t)𝜷j(t)L22𝜷~j(t)𝜷j(t)L22subscriptsuperscriptnormsubscript^𝜷𝑗𝑡subscript𝜷𝑗𝑡2subscript𝐿2subscriptsuperscriptnormsubscript~𝜷𝑗𝑡subscript𝜷𝑗𝑡2subscript𝐿2||\hat{\boldsymbol{\beta}}_{j}(t)-{\boldsymbol{\beta}_{j}}(t)||^{2}_{L_{2}}% \leq||\tilde{\boldsymbol{\beta}}_{j}(t)-{\boldsymbol{\beta}_{j}}(t)||^{2}_{L_{% 2}}| | over^ start_ARG bold_italic_β end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) - bold_italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≤ | | over~ start_ARG bold_italic_β end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) - bold_italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) | | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, for j=1,,p𝑗1𝑝j=1,\ldots,pitalic_j = 1 , … , italic_p. ∎

S.6 Partially shape-constrained regression spline via B𝐵Bitalic_B-splines

We consider the monotone increasing constraints on βj(t)subscript𝛽𝑗𝑡\beta_{j}(t)italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) over sub-interval jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT for jJ𝑗𝐽j\in Jitalic_j ∈ italic_J, as in (2.2). Borrowing the notation from Section 2.3, now let {ϕkj(t)superscriptsubscriptitalic-ϕ𝑘𝑗𝑡\phi_{k}^{j}(t)italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ( italic_t ): k=1,,dj𝑘1subscript𝑑𝑗k=1,\ldots,d_{j}italic_k = 1 , … , italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT}, denote a set of B𝐵Bitalic_B-spline basis functions of degree 2 under the given knot sequence to express βj(t)subscript𝛽𝑗𝑡\beta_{j}(t)italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) and 𝐜j=(cj1,,cjdj)subscript𝐜𝑗superscriptsubscript𝑐𝑗1subscript𝑐𝑗subscript𝑑𝑗top\mathbf{c}_{j}=(c_{j1},\ldots,c_{jd_{j}})^{\top}bold_c start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( italic_c start_POSTSUBSCRIPT italic_j 1 end_POSTSUBSCRIPT , … , italic_c start_POSTSUBSCRIPT italic_j italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT represents the corresponding basis coefficients. For i=1,,n𝑖1𝑛i=1,\ldots,nitalic_i = 1 , … , italic_n, we define (Li×dj)subscript𝐿𝑖subscript𝑑𝑗(L_{i}\times d_{j})( italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT × italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) matrix of discretized basis functions as,

Φij=[ϕ1j(Ti,1)ϕ2j(Ti,1)ϕdjj(Ti,1)ϕ1j(Ti,2)ϕ2j(Ti,2)ϕdjj(Ti,2)ϕ1j(Ti,Li)ϕ2j(Ti,Li)ϕdjj(Ti,Li)].subscriptΦ𝑖𝑗delimited-[]subscriptsuperscriptitalic-ϕ𝑗1subscript𝑇𝑖1subscriptsuperscriptitalic-ϕ𝑗2subscript𝑇𝑖1subscriptsuperscriptitalic-ϕ𝑗subscript𝑑𝑗subscript𝑇𝑖1subscriptsuperscriptitalic-ϕ𝑗1subscript𝑇𝑖2subscriptsuperscriptitalic-ϕ𝑗2subscript𝑇𝑖2subscriptsuperscriptitalic-ϕ𝑗subscript𝑑𝑗subscript𝑇𝑖2subscriptsuperscriptitalic-ϕ𝑗1subscript𝑇𝑖subscript𝐿𝑖subscriptsuperscriptitalic-ϕ𝑗2subscript𝑇𝑖subscript𝐿𝑖subscriptsuperscriptitalic-ϕ𝑗subscript𝑑𝑗subscript𝑇𝑖subscript𝐿𝑖\Phi_{ij}=\left[{\begin{array}[]{cccc}\phi^{j}_{1}{(T_{i,1})}&\phi^{j}_{2}{(T_% {i,1})}&\cdots&\phi^{j}_{d_{j}}{(T_{i,1})}\\ \phi^{j}_{1}{({T_{i,2}})}&\phi^{j}_{2}{({T_{i,2}})}&\cdots&\phi^{j}_{d_{j}}{({% T_{i,2}})}\\ \vdots&\vdots&\ddots&\vdots\\ \phi^{j}_{1}{(T_{i,L_{i}})}&\phi^{j}_{2}(T_{i,L_{i}})&\cdots&\phi^{j}_{d_{j}}(% T_{i,L_{i}})\\ \end{array}}\right].roman_Φ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = [ start_ARRAY start_ROW start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT ) end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT ) end_CELL start_CELL ⋯ end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , 1 end_POSTSUBSCRIPT ) end_CELL end_ROW start_ROW start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ) end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ) end_CELL start_CELL ⋯ end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , 2 end_POSTSUBSCRIPT ) end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL start_CELL ⋯ end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_T start_POSTSUBSCRIPT italic_i , italic_L start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL end_ROW end_ARRAY ] .

We then notate a set of given knot sequence generating a set of ϕkj(t)superscriptsubscriptitalic-ϕ𝑘𝑗𝑡\phi_{k}^{j}(t)italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ( italic_t ) by 𝒦j={tjm[0,1]:m=1,,Mj}subscript𝒦𝑗conditional-setsubscript𝑡𝑗𝑚01𝑚1subscript𝑀𝑗\mathcal{K}_{j}=\{t_{jm}\in[0,1]:m=1,\ldots,M_{j}\}caligraphic_K start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = { italic_t start_POSTSUBSCRIPT italic_j italic_m end_POSTSUBSCRIPT ∈ [ 0 , 1 ] : italic_m = 1 , … , italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } with 0=tj1<<tjMj=10subscript𝑡𝑗1subscript𝑡𝑗subscript𝑀𝑗10=t_{j1}<\cdots<t_{jM_{j}}=10 = italic_t start_POSTSUBSCRIPT italic_j 1 end_POSTSUBSCRIPT < ⋯ < italic_t start_POSTSUBSCRIPT italic_j italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT = 1. For j=1,,p𝑗1𝑝j=1,\ldots,pitalic_j = 1 , … , italic_p, let 𝐀jsubscript𝐀𝑗\mathbf{A}_{j}bold_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT denote the (Mj×dj)subscript𝑀𝑗subscript𝑑𝑗(M_{j}\times d_{j})( italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT × italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) matrix of discretized first derivatives of basis functions evaluated at 𝒦jsubscript𝒦𝑗\mathcal{K}_{j}caligraphic_K start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT as,

𝐀j=[ϕ1j(tj1)ϕ2j(tj1)ϕdjj(tj1)ϕ1j(tj2)ϕ2j(tj2)ϕdjj(tj2)ϕ1j(tjMj)ϕ2j(tjMj)ϕdjj(tjMj)].\mathbf{A}_{j}=\left[{\begin{array}[]{cccc}{\phi^{j}_{1}}{{}^{\prime}}{(t_{j1}% )}&{\phi^{j}_{2}}{{}^{\prime}}{(t_{j1})}&\cdots&{\phi^{j}_{d_{j}}}{{}^{\prime}% }{(t_{j1})}\\ {\phi^{j}_{1}}{{}^{\prime}}{({t_{j2}})}&{\phi^{j}_{2}}{{}^{\prime}}{({t_{j2}})% }&\cdots&{\phi^{j}_{d_{j}}}{{}^{\prime}}{({t_{j2}})}\\ \vdots&\vdots&\ddots&\vdots\\ {\phi^{j}_{1}}{{}^{\prime}}{(t_{jM_{j}})}&{\phi^{j}_{2}}{{}^{\prime}}(t_{jM_{j% }})&\cdots&{\phi^{j}_{d_{j}}}{{}^{\prime}}(t_{jM_{j}})\\ \end{array}}\right].bold_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = [ start_ARRAY start_ROW start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j 1 end_POSTSUBSCRIPT ) end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j 1 end_POSTSUBSCRIPT ) end_CELL start_CELL ⋯ end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j 1 end_POSTSUBSCRIPT ) end_CELL end_ROW start_ROW start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j 2 end_POSTSUBSCRIPT ) end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j 2 end_POSTSUBSCRIPT ) end_CELL start_CELL ⋯ end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j 2 end_POSTSUBSCRIPT ) end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL start_CELL ⋯ end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL end_ROW end_ARRAY ] .

For the special case of j=[0,1]subscript𝑗01\mathcal{I}_{j}=[0,1]caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = [ 0 , 1 ], for all jJ𝑗𝐽j\in Jitalic_j ∈ italic_J, i.e., when the monotone increasing constraint is assigned to the entire domain, the objective function to find shape constrained βj(t)subscript𝛽𝑗𝑡\beta_{j}(t)italic_β start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( italic_t ) is written as

Minimizei=1n𝐘ij=1pXi,j{Φij𝒄j}2subject to𝐀j𝐜j0Mj,jJ,formulae-sequenceMinimizesuperscriptsubscript𝑖1𝑛superscriptnormsubscript𝐘𝑖superscriptsubscript𝑗1𝑝subscript𝑋𝑖𝑗subscriptΦ𝑖𝑗subscript𝒄𝑗2subject tosubscript𝐀𝑗subscript𝐜𝑗subscript0subscript𝑀𝑗𝑗𝐽\textrm{Minimize}~{}~{}~{}\sum_{i=1}^{n}\Big{\|}\mathbf{Y}_{i}-\sum_{j=1}^{p}{% X}_{i,j}\big{\{}\Phi_{ij}\boldsymbol{c}_{j}\big{\}}\Big{\|}^{2}\quad\textrm{% subject to}~{}~{}~{}\mathbf{A}_{j}\mathbf{c}_{j}\geq 0_{M_{j}},~{}~{}j\in J,Minimize ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ∥ bold_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - ∑ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT italic_X start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT { roman_Φ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT bold_italic_c start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT subject to bold_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT bold_c start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≥ 0 start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_j ∈ italic_J , (S.6.1)

where 0Mjsubscript0subscript𝑀𝑗0_{M_{j}}0 start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT represents a length-Mjsubscript𝑀𝑗M_{j}italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT vector of 0’s. Now, under the general partial constraint case, suppose that lower and upper boundaries of sub-interval jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, say tjlsubscript𝑡𝑗𝑙t_{jl}italic_t start_POSTSUBSCRIPT italic_j italic_l end_POSTSUBSCRIPT and tjusubscript𝑡𝑗𝑢t_{ju}italic_t start_POSTSUBSCRIPT italic_j italic_u end_POSTSUBSCRIPT, belong to 𝒦jsubscript𝒦𝑗\mathcal{K}_{j}caligraphic_K start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT without loss of generality. Since 𝒦jsubscript𝒦𝑗\mathcal{K}_{j}caligraphic_K start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is user-defined, we can always define such 𝒦jsubscript𝒦𝑗\mathcal{K}_{j}caligraphic_K start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT by simply adding additional internal knots. Then, the objective function can be written similarly to (S.6.1) aiming to minimize the least squares but now under different 𝐀jsubscript𝐀𝑗\mathbf{A}_{j}bold_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, which assigns the non-negativity restriction on the first derivative function evaluated on knots belonging to jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. Specifically,(v,w)𝑣𝑤(v,w)( italic_v , italic_w )th element of Ajsubscript𝐴𝑗A_{j}italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is defined as (Aj)[v,w]=ϕwj(tv)(A_{j})_{[v,w]}={{\phi}_{w}^{j}}{{}^{\prime}}{(t_{v})}( italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT [ italic_v , italic_w ] end_POSTSUBSCRIPT = italic_ϕ start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ), for v𝒱j𝑣subscript𝒱𝑗v\in\mathcal{V}_{j}italic_v ∈ caligraphic_V start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT and w𝒲j𝑤subscript𝒲𝑗w\in\mathcal{W}_{j}italic_w ∈ caligraphic_W start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, where 𝒱j={m{1,,Mj}:tjltmtju}subscript𝒱𝑗conditional-set𝑚1subscript𝑀𝑗subscript𝑡𝑗𝑙subscript𝑡𝑚subscript𝑡𝑗𝑢\mathcal{V}_{j}=\big{\{}m\in\{1,\ldots,M_{j}\}:t_{jl}\leq t_{m}\leq t_{ju}\big% {\}}caligraphic_V start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = { italic_m ∈ { 1 , … , italic_M start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } : italic_t start_POSTSUBSCRIPT italic_j italic_l end_POSTSUBSCRIPT ≤ italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ≤ italic_t start_POSTSUBSCRIPT italic_j italic_u end_POSTSUBSCRIPT } and 𝒲j={k{1,,dj}:ϕkj(t)0 over tj}\mathcal{W}_{j}=\big{\{}k\in\{1,\ldots,d_{j}\}:{\phi_{k}^{j}}{{}^{\prime}}(t)% \neq 0\textrm{~{}over~{}}t\in\mathcal{I}_{j}\big{\}}caligraphic_W start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = { italic_k ∈ { 1 , … , italic_d start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } : italic_ϕ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t ) ≠ 0 over italic_t ∈ caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT }. Otherwise, (Aj)[v,w]=0subscriptsubscript𝐴𝑗𝑣𝑤0(A_{j})_{[v,w]}=0( italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT [ italic_v , italic_w ] end_POSTSUBSCRIPT = 0. In a matrix form, if we denote integer elements in 𝒱jsubscript𝒱𝑗\mathcal{V}_{j}caligraphic_V start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT and 𝒲jsubscript𝒲𝑗\mathcal{W}_{j}caligraphic_W start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT as mlsubscript𝑚𝑙m_{l}italic_m start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT, ml+1subscript𝑚𝑙1m_{l+1}italic_m start_POSTSUBSCRIPT italic_l + 1 end_POSTSUBSCRIPT,…, musubscript𝑚𝑢m_{u}italic_m start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT, and as kl,kl+1,kusubscript𝑘𝑙subscript𝑘𝑙1subscript𝑘𝑢k_{l},k_{l+1}\ldots,k_{u}italic_k start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT italic_l + 1 end_POSTSUBSCRIPT … , italic_k start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT, respectively, 𝐀jsubscript𝐀𝑗\mathbf{A}_{j}bold_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT can be written as,

𝐀j=[00000000000000\hdashline00ϕklj(tj,ml)ϕkl+1j(tj,ml)ϕkuj(tj,ml)0000ϕklj(tj,ml+1)ϕkl+1j(tj,ml+1)ϕkuj(tj,ml+1)0000ϕklj(tj,mu)ϕkl+1j(tj,mu)ϕkuj(tj,mu)00\hdashline00000000000000].\mathbf{A}_{j}=\left[{\begin{array}[]{ccc:cc cc:ccc}0&\cdots&0&0&0&\cdots&0&0&% \cdots&0\\ &\vdots&&&&\vdots&&&\vdots&\\ 0&\cdots&0&0&0&\cdots&0&0&\cdots&0\\ \hdashline 0&\cdots&0&{\phi^{j}_{k_{l}}}{{}^{\prime}}{(t_{j,m_{l}})}&{\phi^{j}% _{k_{l+1}}}{{}^{\prime}}{(t_{j,m_{l}})}&\cdots&{\phi^{j}_{k_{u}}}{{}^{\prime}}% {(t_{j,m_{l}})}&0&\cdots&0\\ 0&\cdots&0&{\phi^{j}_{k_{l}}}{{}^{\prime}}{(t_{j,m_{l+1}})}&{\phi^{j}_{k_{l+1}% }}{{}^{\prime}}{(t_{j,m_{l+1}})}&\cdots&{\phi^{j}_{k_{u}}}{{}^{\prime}}{(t_{j,% m_{l+1}})}&0&\cdots&0\\ \vdots&&\vdots&\vdots&\ddots&\vdots&\vdots&&\vdots\\ 0&\cdots&0&{\phi^{j}_{k_{l}}}{{}^{\prime}}{(t_{j,m_{u}})}&{\phi^{j}_{k_{l+1}}}% {{}^{\prime}}{(t_{j,m_{u}})}&\cdots&{\phi^{j}_{k_{u}}}{{}^{\prime}}{(t_{j,m_{u% }})}&0&\cdots&0\\ \hdashline 0&\cdots&0&0&0&\cdots&0&0&\cdots&0\\ &\vdots&&&&\vdots&&&\vdots&\\ 0&\cdots&0&0&0&\cdots&0&0&\cdots&0\\ \end{array}}\right].bold_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = [ start_ARRAY start_ROW start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL ⋮ end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j , italic_m start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_l + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j , italic_m start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL start_CELL ⋯ end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j , italic_m start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j , italic_m start_POSTSUBSCRIPT italic_l + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_l + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j , italic_m start_POSTSUBSCRIPT italic_l + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL start_CELL ⋯ end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j , italic_m start_POSTSUBSCRIPT italic_l + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL ⋮ end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j , italic_m start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_l + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j , italic_m start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL start_CELL ⋯ end_CELL start_CELL italic_ϕ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_FLOATSUPERSCRIPT ′ end_FLOATSUPERSCRIPT ( italic_t start_POSTSUBSCRIPT italic_j , italic_m start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL end_CELL start_CELL ⋮ end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL start_CELL 0 end_CELL start_CELL ⋯ end_CELL start_CELL 0 end_CELL end_ROW end_ARRAY ] .

Although here we elaborate on the case of one sub-interval jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, the same approach applies to the case of multiple disjoint intervals of jsubscript𝑗\mathcal{I}_{j}caligraphic_I start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. This optimization can be fulfilled using quadratic programming through qprog function in an R package coneporj. Indeed, this approach applies to shape-constrained estimation under any kind of basis functions, not necessarily B𝐵Bitalic_B-splines.

S.7 Additional simulation results

Refer to caption
Refer to caption
Refer to caption
Refer to caption
(a) =[0,0.5]00.5\mathcal{I}=[0,0.5]caligraphic_I = [ 0 , 0.5 ]

 

Refer to caption
Refer to caption
Refer to caption
Refer to caption
(b) =[0.3,0.5]0.30.5\mathcal{I}=[0.3,0.5]caligraphic_I = [ 0.3 , 0.5 ]
Refer to caption
Refer to caption
Refer to caption
Refer to caption
(c) =[0.4,0.5]0.40.5\mathcal{I}=[0.4,0.5]caligraphic_I = [ 0.4 , 0.5 ]
Refer to caption
Refer to caption
Refer to caption
Refer to caption
(e) =[0.8,1]0.81\mathcal{I}=[0.8,1]caligraphic_I = [ 0.8 , 1 ]
Figure S.7.1: Regression coefficient function estimates with kernel method (2.9) under shape constraints (3.1). The shaded area indicates the sub-interval \mathcal{I}caligraphic_I of inferential interest. The blue dash-dotted lines depict the Monte Carlo average estimate in the full cohort analysis. The red long-dashed lines represent the Monte Carlo average estimate from the sub-cohort analysis. The solid black lines represent the true functions.

             

Refer to caption
Refer to caption
Refer to caption
Refer to caption
(d) =[0.5,1]0.51\mathcal{I}=[0.5,1]caligraphic_I = [ 0.5 , 1 ]
Refer to caption
Refer to caption
Refer to caption
Refer to caption
(f) =[0.9,1]0.91\mathcal{I}=[0.9,1]caligraphic_I = [ 0.9 , 1 ]
Refer to caption
Refer to caption
Figure S.7.2: The power analysis of the partially shape-constrained spline method (2.17) subject to shape constraints H0:β1:subscript𝐻0subscript𝛽1H_{0}:\beta_{1}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and β3subscript𝛽3\beta_{3}italic_β start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT are monotone increasing on \mathcal{I}caligraphic_I. The sensitivity and specificity are evaluated at 5%percent55\%5 % significance level under different scenarios of locations and lengths of the sub-interval as =[0,0.5]00.5\mathcal{I}=[0,0.5]caligraphic_I = [ 0 , 0.5 ], [0.1,0.5]0.10.5[0.1,0.5][ 0.1 , 0.5 ], [0.2,0.5]0.20.5[0.2,0.5][ 0.2 , 0.5 ], [0.3,0.5]0.30.5[0.3,0.5][ 0.3 , 0.5 ], [0.4,0.5]0.40.5[0.4,0.5][ 0.4 , 0.5 ], [0.5,1]0.51[0.5,1][ 0.5 , 1 ], [0.6,1]0.61[0.6,1][ 0.6 , 1 ], [0.7,1]0.71[0.7,1][ 0.7 , 1 ], [0.8,1]0.81[0.8,1][ 0.8 , 1 ], and [0.9,1]0.91[0.9,1][ 0.9 , 1 ].
Table S.7.1: Simulation results for the shape-constrained kernel-weighted least squares approach (2.9). The estimation and test performance are evaluated with the integrated squared bias (ISB), the integrated variance (IVar), the type I error rate, and the power of the test, defined as (3.2) and (3.3).
Sample size Dataset Criterion H0:β1:subscript𝐻0subscript𝛽1H_{0}:\beta_{1}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and β3subscript𝛽3\beta_{3}italic_β start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT are monotone increasing on \mathcal{I}caligraphic_I.
H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is true. H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is false.
=[0,0.5]00.5\mathcal{I}=\mathcal{[}0,0.5]caligraphic_I = [ 0 , 0.5 ] =[0.4,0.5]0.40.5\mathcal{I}=\mathcal{[}0.4,0.5]caligraphic_I = [ 0.4 , 0.5 ] =[0.5,1]0.51\mathcal{I}=\mathcal{[}0.5,1]caligraphic_I = [ 0.5 , 1 ] =[0.9,1]0.91\mathcal{I}=\mathcal{[}0.9,1]caligraphic_I = [ 0.9 , 1 ]
n=100𝑛100n=100italic_n = 100 Full data (γ0.05,γ0.1)subscript𝛾0.05subscript𝛾0.1(\gamma_{0.05},\gamma_{0.1})( italic_γ start_POSTSUBSCRIPT 0.05 end_POSTSUBSCRIPT , italic_γ start_POSTSUBSCRIPT 0.1 end_POSTSUBSCRIPT ) (0.06,0.11)0.060.11(0.06,0.11)( 0.06 , 0.11 ) (0.05,0.11)0.050.11(0.05,0.11)( 0.05 , 0.11 ) (0.40,0.50)0.400.50(0.40,0.50)( 0.40 , 0.50 ) (0.12,0.20)0.120.20(0.12,0.20)( 0.12 , 0.20 )
ISBISB\mathrm{ISB}roman_ISB 0.03020.03020.03020.0302 0.00730.00730.00730.0073 0.16000.16000.16000.1600 0.02300.02300.02300.0230
IVarIVar\mathrm{IVar}roman_IVar 0.40340.40340.40340.4034 0.53460.53460.53460.5346 0.45370.45370.45370.4537 1.09011.09011.09011.0901
Partial data (γ0.05,γ0.1)subscript𝛾0.05subscript𝛾0.1(\gamma_{0.05},\gamma_{0.1})( italic_γ start_POSTSUBSCRIPT 0.05 end_POSTSUBSCRIPT , italic_γ start_POSTSUBSCRIPT 0.1 end_POSTSUBSCRIPT ) (0.06,0.11)0.060.11(0.06,0.11)( 0.06 , 0.11 ) (0.09,0.19)0.090.19(0.09,0.19)( 0.09 , 0.19 ) (0.30,0.40)0.300.40(0.30,0.40)( 0.30 , 0.40 ) (0.21,0.30)0.210.30(0.21,0.30)( 0.21 , 0.30 )
restricted to \mathcal{I}caligraphic_I ISBISB\mathrm{ISB}roman_ISB 0.05170.05170.05170.0517 0.07680.07680.07680.0768 0.22990.22990.22990.2299 0.22000.22000.22000.2200
IVarIVar\mathrm{IVar}roman_IVar 0.53350.53350.53350.5335 1.50281.50281.50281.5028 0.59870.59870.59870.5987 3.78443.78443.78443.7844
n=500𝑛500n=500italic_n = 500 Full data (γ0.05,γ0.1)subscript𝛾0.05subscript𝛾0.1(\gamma_{0.05},\gamma_{0.1})( italic_γ start_POSTSUBSCRIPT 0.05 end_POSTSUBSCRIPT , italic_γ start_POSTSUBSCRIPT 0.1 end_POSTSUBSCRIPT ) (0.05,0.10)0.050.10(0.05,0.10)( 0.05 , 0.10 ) (0.05,0.09)0.050.09(0.05,0.09)( 0.05 , 0.09 ) (0.86,0.92)0.860.92(0.86,0.92)( 0.86 , 0.92 ) (0.11,0.16)0.110.16(0.11,0.16)( 0.11 , 0.16 )
ISBISB\mathrm{ISB}roman_ISB 0.01020.01020.01020.0102 0.00180.00180.00180.0018 0.12110.12110.12110.1211 0.00970.00970.00970.0097
IVarIVar\mathrm{IVar}roman_IVar 0.10100.10100.10100.1010 0.11350.11350.11350.1135 0.10590.10590.10590.1059 0.26810.26810.26810.2681
Partial data (γ0.05,γ0.1)subscript𝛾0.05subscript𝛾0.1(\gamma_{0.05},\gamma_{0.1})( italic_γ start_POSTSUBSCRIPT 0.05 end_POSTSUBSCRIPT , italic_γ start_POSTSUBSCRIPT 0.1 end_POSTSUBSCRIPT ) (0.04,0.10)0.040.10(0.04,0.10)( 0.04 , 0.10 ) (0.07,0.13)0.070.13(0.07,0.13)( 0.07 , 0.13 ) (0.78,0.88)0.780.88(0.78,0.88)( 0.78 , 0.88 ) (0.16,0.25)0.160.25(0.16,0.25)( 0.16 , 0.25 )
restricted to \mathcal{I}caligraphic_I ISBISB\mathrm{ISB}roman_ISB 0.01290.01290.01290.0129 0.01100.01100.01100.0110 0.16990.16990.16990.1699 0.10450.10450.10450.1045
IVarIVar\mathrm{IVar}roman_IVar 0.14970.14970.14970.1497 0.32290.32290.32290.3229 0.14300.14300.14300.1430 0.73960.73960.73960.7396
Table S.7.2: Simulation results for the shape-constrained regression spline approach (2.17). The estimation and test performance are evaluated with the integrated squared bias (ISB), the integrated variance (IVar), the type I error rate, and the power of the test, defined as (3.2) and (3.3).
Sample size Dataset Criterion H0:β1:subscript𝐻0subscript𝛽1H_{0}:\beta_{1}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : italic_β start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and β3subscript𝛽3\beta_{3}italic_β start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT are monotone increasing on \mathcal{I}caligraphic_I.
H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is true. H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is false.
=[0,0.5]00.5\mathcal{I}=\mathcal{[}0,0.5]caligraphic_I = [ 0 , 0.5 ] =[0.4,0.5]0.40.5\mathcal{I}=\mathcal{[}0.4,0.5]caligraphic_I = [ 0.4 , 0.5 ] =[0.5,1]0.51\mathcal{I}=\mathcal{[}0.5,1]caligraphic_I = [ 0.5 , 1 ] =[0.9,1]0.91\mathcal{I}=\mathcal{[}0.9,1]caligraphic_I = [ 0.9 , 1 ]
n=100𝑛100n=100italic_n = 100 Full data (γ0.05,γ0.1)subscript𝛾0.05subscript𝛾0.1(\gamma_{0.05},\gamma_{0.1})( italic_γ start_POSTSUBSCRIPT 0.05 end_POSTSUBSCRIPT , italic_γ start_POSTSUBSCRIPT 0.1 end_POSTSUBSCRIPT ) (0.04,0.07)0.040.07(0.04,0.07)( 0.04 , 0.07 ) (0.08,0.11)0.080.11(0.08,0.11)( 0.08 , 0.11 ) (0.29,0.40)0.290.40(0.29,0.40)( 0.29 , 0.40 ) (0.09,0.11)0.090.11(0.09,0.11)( 0.09 , 0.11 )
ISBISB\mathrm{ISB}roman_ISB 0.01010.01010.01010.0101 0.00330.00330.00330.0033 0.18500.18500.18500.1850 0.23040.23040.23040.2304
IVarIVar\mathrm{IVar}roman_IVar 0.37370.37370.37370.3737 0.20540.20540.20540.2054 0.40460.40460.40460.4046 0.92820.92820.92820.9282
Partial data (γ0.05,γ0.1)subscript𝛾0.05subscript𝛾0.1(\gamma_{0.05},\gamma_{0.1})( italic_γ start_POSTSUBSCRIPT 0.05 end_POSTSUBSCRIPT , italic_γ start_POSTSUBSCRIPT 0.1 end_POSTSUBSCRIPT ) (0.05,0.11)0.050.11(0.05,0.11)( 0.05 , 0.11 ) (0.06,0.12)0.060.12(0.06,0.12)( 0.06 , 0.12 ) (0.18,0.24)0.180.24(0.18,0.24)( 0.18 , 0.24 ) (0.05,0.11)0.050.11(0.05,0.11)( 0.05 , 0.11 )
restricted to \mathcal{I}caligraphic_I ISBISB\mathrm{ISB}roman_ISB 0.01760.01760.01760.0176 0.11560.11560.11560.1156 0.17830.17830.17830.1783 0.42000.42000.42000.4200
IVarIVar\mathrm{IVar}roman_IVar 0.57580.57580.57580.5758 1.68961.68961.68961.6896 0.61510.61510.61510.6151 4.21234.21234.21234.2123
n=500𝑛500n=500italic_n = 500 Full data (γ0.05,γ0.1)subscript𝛾0.05subscript𝛾0.1(\gamma_{0.05},\gamma_{0.1})( italic_γ start_POSTSUBSCRIPT 0.05 end_POSTSUBSCRIPT , italic_γ start_POSTSUBSCRIPT 0.1 end_POSTSUBSCRIPT ) (0.04,0.08)0.040.08(0.04,0.08)( 0.04 , 0.08 ) (0.05,0.08)0.050.08(0.05,0.08)( 0.05 , 0.08 ) (0.81,0.89)0.810.89(0.81,0.89)( 0.81 , 0.89 ) (0.10,0.16)0.100.16(0.10,0.16)( 0.10 , 0.16 )
ISBISB\mathrm{ISB}roman_ISB 0.00170.00170.00170.0017 0.00160.00160.00160.0016 0.14610.14610.14610.1461 0.09130.09130.09130.0913
IVarIVar\mathrm{IVar}roman_IVar 0.09460.09460.09460.0946 0.05720.05720.05720.0572 0.09740.09740.09740.0974 0.20280.20280.20280.2028
Partial data (γ0.05,γ0.1)subscript𝛾0.05subscript𝛾0.1(\gamma_{0.05},\gamma_{0.1})( italic_γ start_POSTSUBSCRIPT 0.05 end_POSTSUBSCRIPT , italic_γ start_POSTSUBSCRIPT 0.1 end_POSTSUBSCRIPT ) (0.05,0.11)0.050.11(0.05,0.11)( 0.05 , 0.11 ) (0.04,0.08)0.040.08(0.04,0.08)( 0.04 , 0.08 ) (0.44,0.56)0.440.56(0.44,0.56)( 0.44 , 0.56 ) (0.04,0.09)0.040.09(0.04,0.09)( 0.04 , 0.09 )
restricted to \mathcal{I}caligraphic_I ISBISB\mathrm{ISB}roman_ISB 0.00220.00220.00220.0022 0.01090.01090.01090.0109 0.14060.14060.14060.1406 0.07310.07310.07310.0731
IVarIVar\mathrm{IVar}roman_IVar 0.11680.11680.11680.1168 0.27980.27980.27980.2798 0.14630.14630.14630.1463 0.70050.70050.70050.7005

References

M. J. Silvapulle and P. K. Sen. Constrained Statistical Inference: Inequality, Order, and Shape Restrictions. Wiley Series in Probability and Statistics. John Wiley & Sons, Inc, 1 edition, 2005.