HTML conversions sometimes display errors due to content that did not convert correctly from the source. This paper uses the following packages that are not yet supported by the HTML conversion tool. Feedback on these issues are not necessary; they are known and are being worked on.

  • failed: asymptote

Authors: achieve the best HTML results from your LaTeX submissions by following these best practices.

License: arXiv.org perpetual non-exclusive license
arXiv:2403.03540v1 [math.ST] 06 Mar 2024

Contraction rates and projection subspace estimation with Gaussian process priors in high dimension

Elie Odin Institut de Mathématiques de Toulouse; UMR5219. Université de Toulouse; CNRS. UT3, F-31062 Toulouse, France.
[email protected]
François Bachoc Institut de Mathématiques de Toulouse; UMR5219. Université de Toulouse; CNRS. UT3, F-31062 Toulouse, France.
[email protected]
Agnès Lagnoux Institut de Mathématiques de Toulouse; UMR5219. Université de Toulouse; CNRS. UT2J, F-31058 Toulouse, France.
[email protected]
(02 2023)
Abstract

This work explores the dimension reduction problem for Bayesian nonparametric regression and density estimation. More precisely, we are interested in estimating a functional parameter f𝑓fitalic_f over the unit ball in dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, which depends only on a d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-dimensional subspace of dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, with d0<dsubscript𝑑0𝑑d_{0}<ditalic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < italic_d. It is well-known that rescaled Gaussian process priors over the function space achieve smoothness adaptation and posterior contraction with near minimax-optimal rates. Moreover, hierarchical extensions of this approach, equipped with subspace projection, can also adapt to the intrinsic dimension d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ([Tok11]). When the ambient dimension d𝑑ditalic_d does not vary with n𝑛nitalic_n, the minimax rate remains of the order nβ/(2β+d0)superscript𝑛𝛽2𝛽subscript𝑑0n^{-\beta/(2\beta+d_{0})}italic_n start_POSTSUPERSCRIPT - italic_β / ( 2 italic_β + italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT. However, this is up to multiplicative constants that can become prohibitively large when d𝑑ditalic_d grows. The dependences between the contraction rate and the ambient dimension have not been fully explored yet and this work provides a first insight: we let the dimension d𝑑ditalic_d grow with n𝑛nitalic_n and, by combining the arguments of [Tok11] and [JT21], we derive a growth rate for d𝑑ditalic_d that still leads to posterior consistency with minimax rate. The optimality of this growth rate is then discussed. Additionally, we provide a set of assumptions under which consistent estimation of f𝑓fitalic_f leads to a correct estimation of the subspace projection, assuming that d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is known.



1 Introduction

With the ever-increasing availability of high-dimensional data in various fields of science and technology, dimension reduction methods have become more and more important, especially in non-parametric estimation, to counteract the curse of dimensionality. Suppose we want to estimate an unknown function f:d:𝑓superscript𝑑f:\mathbb{R}^{d}\to\mathbb{R}italic_f : blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT → blackboard_R that depends only on a d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-dimensional linear subspace 𝒮d𝒮superscript𝑑\mathcal{S}\subset\mathbb{R}^{d}caligraphic_S ⊂ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, with d0dmuch-less-thansubscript𝑑0𝑑d_{0}\ll ditalic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≪ italic_d. For regression and density estimation problems, minimax rates without sparsity assumptions are both of the order nβ/(2β+d)superscript𝑛𝛽2𝛽𝑑n^{-\beta/(2\beta+d)}italic_n start_POSTSUPERSCRIPT - italic_β / ( 2 italic_β + italic_d ) end_POSTSUPERSCRIPT where β𝛽\betaitalic_β is the smoothness of f𝑓fitalic_f and n𝑛nitalic_n is the sample size ([Bir86], [Sto82]). The aim of dimension reduction is to convert this d𝑑ditalic_d-dimensional problem into a d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-dimensional one in order to obtain the way more attractive rate nβ/(2β+d0)superscript𝑛𝛽2𝛽subscript𝑑0n^{-\beta/(2\beta+d_{0})}italic_n start_POSTSUPERSCRIPT - italic_β / ( 2 italic_β + italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT.

As the above rates are given up to a multiplicative constant, which may itself depend on the ambient dimension d𝑑ditalic_d, another problem arises: determining if the number of available data is sufficient in regard to the problem’s dimension. This is generally done by allowing the ambient dimension d𝑑ditalic_d to grow with n𝑛nitalic_n, letting d=dn𝑑subscript𝑑𝑛d=d_{n}italic_d = italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, and then observing which growth rate still permits minimax estimation at rate nβ/(2β+d0)superscript𝑛𝛽2𝛽subscript𝑑0n^{-\beta/(2\beta+d_{0})}italic_n start_POSTSUPERSCRIPT - italic_β / ( 2 italic_β + italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT. Note that the subspace 𝒮𝒮\mathcal{S}caligraphic_S also depends on n𝑛nitalic_n, thus we write 𝒮=𝒮n𝒮subscript𝒮𝑛\mathcal{S}=\mathcal{S}_{n}caligraphic_S = caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT.

For fixed intrinsic dimension d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, we distinguish two cases, whether the subspace 𝒮nsubscript𝒮𝑛\mathcal{S}_{n}caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is parallel to the axes or not. In the first case (when 𝒮nsubscript𝒮𝑛\mathcal{S}_{n}caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is parallel to the axes), the dimension-reduction problem is referred to as variable selection. In this context, it is known that for non-parametric regression, the sparsity pattern can be consistently recovered when dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT grows exponentially with the sample size ([CD12], [YT15]). More precisely, [CD12] show that there exist two constants c*<c*subscript𝑐superscript𝑐c_{*}<c^{*}italic_c start_POSTSUBSCRIPT * end_POSTSUBSCRIPT < italic_c start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT such that

  • if logdnn<c*subscript𝑑𝑛𝑛subscript𝑐\frac{\log d_{n}}{n}<c_{*}divide start_ARG roman_log italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG < italic_c start_POSTSUBSCRIPT * end_POSTSUBSCRIPT, there exists a consistent estimator of the sparsity pattern,

  • if logdnn>c*subscript𝑑𝑛𝑛superscript𝑐\frac{\log d_{n}}{n}>c^{*}divide start_ARG roman_log italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG > italic_c start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, no such estimator exists.

This phase transition phenomenon seems to be similar in the linear regression framework (see [Ver12] and [Wai09]).

In the second case (when nothing is assumed on 𝒮nsubscript𝒮𝑛\mathcal{S}_{n}caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT), the estimation of a minimal subspace which contains all the information on f𝑓fitalic_f is sometimes referred to as sufficient dimension reduction ([Coo98]). Among the various methods proposed for estimating 𝒮nsubscript𝒮𝑛\mathcal{S}_{n}caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, sliced inverse regression (SIR) ([Li91]) is one of the most studied. The first article including the framework of growing ambient dimension dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT shows the consistency of SIR only under dn=O(n1/2)subscript𝑑𝑛𝑂superscript𝑛12d_{n}=O(n^{1/2})italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_O ( italic_n start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ) ([ZMP06]). Later, [LZL18] show that the phase transition phenomenon occurs at a growth rate dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT in o(n)𝑜𝑛o(n)italic_o ( italic_n ). In other words, SIR-based estimators are consistent only if dn/nn+0subscript𝑑𝑛𝑛subscript𝑛0d_{n}/n\mathop{\longrightarrow}\limits_{n\to+\infty}0italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT / italic_n ⟶ start_POSTSUBSCRIPT italic_n → + ∞ end_POSTSUBSCRIPT 0 and this growth rate appears to be optimal ([Lin+21]).

The difference between growth rates encountered in variable selection and in sufficient dimension reduction has led recently to the emergence of methods combining both approaches. If f𝑓fitalic_f depends on a d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-dimensional subspace 𝒮nsubscript𝒮𝑛\mathcal{S}_{n}caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT which can be described by linear combination of only a small number of variables, then we can perform both variable selection and sufficient dimension reduction over the selected variables. This method is studied for example in [Lin+21], [LZL19], [TSY20], and [ZMZ22] and allows a return to the exponential growth of the dimension dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT.

The aim of this article is to perform both function and subspace estimation in the case where no hypotheses are made on 𝒮nsubscript𝒮𝑛\mathcal{S}_{n}caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT and to derive the maximum dimension growth rate. Our analysis is done in the nonparametric Bayesian framework introduced by [GGV00]. Among the advantages of this approach, the use of very versatile priors, such as Gaussian processes [VV08], allows to perform smoothness and dimension adaptability at near minimax rates ([VV09], [TZG10], [JT21]) with a single Bayesian procedure, and avoids the complications associated with kernel methods (see for example the introduction of [STG13]).

The work of Tokdar, Zhu, and Ghosh [TZG10] is one of the first to include a hierarchical prior with a parameter on the subspace.They use a uniform prior on the Grassmannian of dimension d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and a logistic Gaussian process prior for the conditional density function. The authors are able to derive posterior consistency for both the conditional density function and the subspace but they do not provide contraction rates. Near minimax contraction rates are then derived in [Tok11] by extending the framework introduced by [VV09]. Finally, [JT21] show that for variable selection, the estimation of the regression function and that of the sparsity pattern can be realized simultaneously at near minimax rates even with dimension dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT growing exponentially with the sample size. The growth rate is linked to the smoothness β𝛽\betaitalic_β of f𝑓fitalic_f via log(dn)=O(nd0/(2β+d0))subscript𝑑𝑛𝑂superscript𝑛subscript𝑑02𝛽subscript𝑑0\log(d_{n})=O(n^{d_{0}/(2\beta+d_{0})})roman_log ( italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) = italic_O ( italic_n start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / ( 2 italic_β + italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ).

The paper is organized as follows. In Section 2, we introduce a hierarchical Gaussian process-based prior for both regression and density estimation models. This prior consists of a dimension parameter for d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, an invariant prior over linear d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-dimensional subspaces of dnsuperscriptsubscript𝑑𝑛\mathbb{R}^{d_{n}}blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, a d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-dimensional Gaussian process, and a rescaling parameter to ensure smoothness adaptability. Our first result (Theorem 3.1 in Section 3) shows that, for the estimation problem of f𝑓fitalic_f, near minimax contraction rates can be achieved for dimensions dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT growing not faster than nd0/(2β+d0)superscript𝑛subscript𝑑02𝛽subscript𝑑0n^{d_{0}/(2\beta+d_{0})}italic_n start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / ( 2 italic_β + italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT which is interestingly the already mentioned growth rate where we drop out the exponential. We are not able to prove the optimality of this result but some clues are given below (see Remark 5.2); notably, this growth rate is equivalent to n𝑛nitalic_n when β0𝛽0\beta\to 0italic_β → 0, which is known to be the breakpoint of the consistency of the SIR estimator. In Section 4, we show that for fixed ambient dimension d𝑑ditalic_d, the hierarchical Bayes procedure contracts to a subspace that contains 𝒮𝒮\mathcal{S}caligraphic_S and we conjecture that this subspace is exactly 𝒮𝒮\mathcal{S}caligraphic_S. Our estimation result of f𝑓fitalic_f combines the standard arguments used in [Tok11] and [JT21], which are based on [VV09]. To prove the contraction around the central subspace 𝒮𝒮\mathcal{S}caligraphic_S, we show that an error on the estimation of 𝒮𝒮\mathcal{S}caligraphic_S leads to an error on the estimation of f𝑓fitalic_f from which we obtain a contradiction on the previously established minimax estimation of f𝑓fitalic_f. The proofs of the main results (Theorems 3.1 and 4.1) are postponed to Appendices 5.1 and 5.2 while Appendix 5.3 is dedicated to useful lemmas.

2 Problem formulation

2.1 Notation and definitions

The abundant technical notation used throughout this article make this section very useful. We begin with the definition of standard functional spaces. Let K𝐾Kitalic_K be a bounded convex subset of dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, with d*𝑑superscriptd\in\mathds{N}^{*}italic_d ∈ blackboard_N start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT. For α>0𝛼0\alpha>0italic_α > 0, write α=k+r𝛼𝑘𝑟\alpha=k+ritalic_α = italic_k + italic_r with k𝑘kitalic_k a nonnegative integer and r(0,1]𝑟01r\in(0,1]italic_r ∈ ( 0 , 1 ]. The Hölder space α(K)superscript𝛼𝐾\mathfrak{C}^{\alpha}(K)fraktur_C start_POSTSUPERSCRIPT italic_α end_POSTSUPERSCRIPT ( italic_K ) is the space of all functions f:K:𝑓𝐾f:K\to\mathbb{R}italic_f : italic_K → blackboard_R that are k𝑘kitalic_k-times differentiable and whose partial derivatives of order (k1,,kd)subscript𝑘1subscript𝑘𝑑(k_{1},\ldots,k_{d})( italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_k start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ), with k1,,kdsubscript𝑘1subscript𝑘𝑑k_{1},\ldots,k_{d}italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_k start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT nonnegative integers such that k1++kd=ksubscript𝑘1subscript𝑘𝑑𝑘k_{1}+\cdots+k_{d}=kitalic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + ⋯ + italic_k start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT = italic_k, are Lipshitz functions of order r𝑟ritalic_r, that is, there exists a constant D𝐷Ditalic_D such that

|k1k1dkd(f(x)f(y))|Dxyr,superscript𝑘superscriptsubscript1subscript𝑘1superscriptsubscript𝑑subscript𝑘𝑑𝑓𝑥𝑓𝑦𝐷superscriptnorm𝑥𝑦𝑟\left|\frac{\partial^{k}}{\partial_{1}^{k_{1}}\cdots\partial_{d}^{k_{d}}}(f(x)% -f(y))\right|\ \leq\ D\left\|{x-y}\right\|^{r},| divide start_ARG ∂ start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT end_ARG start_ARG ∂ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ⋯ ∂ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_ARG ( italic_f ( italic_x ) - italic_f ( italic_y ) ) | ≤ italic_D ∥ italic_x - italic_y ∥ start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT ,

for all pairs x,yK2𝑥𝑦superscript𝐾2x,y\in K^{2}italic_x , italic_y ∈ italic_K start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and where \left\|{\cdot}\right\|∥ ⋅ ∥ is the Euclidean norm. We use the following asymptotic notation: if f𝑓fitalic_f and g𝑔gitalic_g are two real functions over an arbitrary set S𝑆Sitalic_S, then we write fgless-than-or-similar-to𝑓𝑔f\lesssim gitalic_f ≲ italic_g if there exists a constant c𝑐citalic_c such that |f(s)|c|g(s)|𝑓𝑠𝑐𝑔𝑠|f(s)|\leq c\cdot|g(s)|| italic_f ( italic_s ) | ≤ italic_c ⋅ | italic_g ( italic_s ) | for all sS𝑠𝑆s\in Sitalic_s ∈ italic_S. The notation greater-than-or-equivalent-to\gtrsim is defined in the same way and we write fgasymptotically-equals𝑓𝑔f\asymp gitalic_f ≍ italic_g when both fgless-than-or-similar-to𝑓𝑔f\lesssim gitalic_f ≲ italic_g and fggreater-than-or-equivalent-to𝑓𝑔f\gtrsim gitalic_f ≳ italic_g hold. To model the central subspace 𝒮𝒮\mathcal{S}caligraphic_S, we will use isometries instead of the Grassmannian. For d*𝑑superscriptd\in\mathds{N}^{*}italic_d ∈ blackboard_N start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, we denote by 𝒪dsubscript𝒪𝑑\mathcal{O}_{d}caligraphic_O start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT the space of linear isometries over dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT. In addition, the introduction of canonical subspaces and of “component filters” notation will be very convenient when dealing with the sparsity. For xd𝑥superscript𝑑x\in\mathbb{R}^{d}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT and 𝕧{0,1}d𝕧superscript01𝑑\mathbb{v}\in\{0,1\}^{d}blackboard_v ∈ { 0 , 1 } start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, we denote by |𝕧|𝕧|\mathbb{v}|| blackboard_v | the number of ones in 𝕧𝕧\mathbb{v}blackboard_v, by x𝕧:=(xj:vj=1, 1jd)|𝕧|x_{\mathbb{v}}:=(x_{j}:v_{j}=1,\ 1\leq j\leq d)\in\mathbb{R}^{|\mathbb{v}|}italic_x start_POSTSUBSCRIPT blackboard_v end_POSTSUBSCRIPT := ( italic_x start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT : italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 1 , 1 ≤ italic_j ≤ italic_d ) ∈ blackboard_R start_POSTSUPERSCRIPT | blackboard_v | end_POSTSUPERSCRIPT the sub-vector with components selected according to 𝕧𝕧\mathbb{v}blackboard_v, and for y|𝕧|𝑦superscript𝕧y\in\mathbb{R}^{|\mathbb{v}|}italic_y ∈ blackboard_R start_POSTSUPERSCRIPT | blackboard_v | end_POSTSUPERSCRIPT, by y𝕧:=(y~j)1jdassignsuperscript𝑦𝕧subscriptsubscript~𝑦𝑗1𝑗𝑑y^{\mathbb{v}}:=(\tilde{y}_{j})_{1\leq j\leq d}italic_y start_POSTSUPERSCRIPT blackboard_v end_POSTSUPERSCRIPT := ( over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT 1 ≤ italic_j ≤ italic_d end_POSTSUBSCRIPT the vector in dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT with y~j=0subscript~𝑦𝑗0\tilde{y}_{j}=0over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 if vj=0subscript𝑣𝑗0v_{j}=0italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = 0 and y~j=yisubscript~𝑦𝑗subscript𝑦𝑖\tilde{y}_{j}=y_{i}over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT if vjsubscript𝑣𝑗v_{j}italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT is the i𝑖iitalic_i-th one in 𝕧𝕧\mathbb{v}blackboard_v. Moreover, for any integer b1,d𝑏1𝑑b\in\llbracket 1,d\rrbracketitalic_b ∈ ⟦ 1 , italic_d ⟧, we denote by 𝕓𝕓\mathbb{b}blackboard_b the vector i=1beisuperscriptsubscript𝑖1𝑏subscript𝑒𝑖\sum_{i=1}^{b}e_{i}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, where {ei:1id}conditional-setsubscript𝑒𝑖1𝑖𝑑\{e_{i}:1\leq i\leq d\}{ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : 1 ≤ italic_i ≤ italic_d } is the canonical basis on dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT. The dimension d𝑑ditalic_d of the ambient space is implicit in this notation. Finally, for 𝕧{0,1}d𝕧superscript01𝑑\mathbb{v}\in\{0,1\}^{d}blackboard_v ∈ { 0 , 1 } start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, we denote by E𝕧subscript𝐸𝕧E_{\mathbb{v}}italic_E start_POSTSUBSCRIPT blackboard_v end_POSTSUBSCRIPT the linear span of {ei:vi=1}conditional-setsubscript𝑒𝑖subscript𝑣𝑖1\{e_{i}:v_{i}=1\}{ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 } and by E1𝕧subscript𝐸1𝕧E_{1-\mathbb{v}}italic_E start_POSTSUBSCRIPT 1 - blackboard_v end_POSTSUBSCRIPT the linear span of {ei:vi=0}conditional-setsubscript𝑒𝑖subscript𝑣𝑖0\{e_{i}:v_{i}=0\}{ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : italic_v start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 0 }. Clearly, E1𝕧subscript𝐸1𝕧E_{1-\mathbb{v}}italic_E start_POSTSUBSCRIPT 1 - blackboard_v end_POSTSUBSCRIPT is the orthogonal complement of E𝕧subscript𝐸𝕧E_{\mathbb{v}}italic_E start_POSTSUBSCRIPT blackboard_v end_POSTSUBSCRIPT.

The proof of Theorem 3.1 involves measuring the complexity of the space where the prior puts its mass. This measure is carried out via metric entropy. Given a subset B𝐵Bitalic_B of a metric space (E,d)𝐸𝑑(E,d)( italic_E , italic_d ) and a radius ε>0𝜀0\varepsilon>0italic_ε > 0, we can define the following numbers:

  • the ε𝜀\varepsilonitalic_ε-packing number D(ε,B,d)𝐷𝜀𝐵𝑑D(\varepsilon,B,d)italic_D ( italic_ε , italic_B , italic_d ) is the maximum number of points in B𝐵Bitalic_B such that the distance between every pair is at least ε𝜀\varepsilonitalic_ε,

  • the ε𝜀\varepsilonitalic_ε-covering number N(ε,B,d)𝑁𝜀𝐵𝑑N(\varepsilon,B,d)italic_N ( italic_ε , italic_B , italic_d ) is the minimum number of balls of radius ε𝜀\varepsilonitalic_ε needed to cover B𝐵Bitalic_B.

The logarithms of the packing and the covering number are called the entropy and the metric entropy respectively.

2.2 Bayesian framework for density estimation and regression

Our main result will be stated for two statistical settings: density estimation and fixed or random design regression with Gaussian error. As we will work with subspaces that are not orthogonal with the axes, the usual support [0,1]dsuperscript01𝑑[0,1]^{d}[ 0 , 1 ] start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT for the density or the regression function will be replaced by the unit ball 𝕌d:={xd,x1}assignsubscript𝕌𝑑formulae-sequence𝑥superscript𝑑norm𝑥1\mathbb{U}_{d}:=\{x\in\mathbb{R}^{d},\left\|{x}\right\|\leq 1\}blackboard_U start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT := { italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT , ∥ italic_x ∥ ≤ 1 }. For a given number of observations n𝑛nitalic_n, the density or the regression function will be characterized by a functional parameter fn*:𝕌dn:superscriptsubscript𝑓𝑛subscript𝕌subscript𝑑𝑛f_{n}^{*}:\mathbb{U}_{d_{n}}\to\mathbb{R}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT : blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT → blackboard_R. The ambient dimension dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is allowed to grow with n𝑛nitalic_n but fn*superscriptsubscript𝑓𝑛f_{n}^{*}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is supposed to depend only on a subspace 𝒮nsubscript𝒮𝑛\mathcal{S}_{n}caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT with fixed dimension d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. A prior on d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and on the subspace itself will be later introduced to ensure the dimension adaptability. The prior on the true parameter fn*superscriptsubscript𝑓𝑛f_{n}^{*}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT will consist of a projected Gaussian random variable Wnsubscript𝑊𝑛W_{n}italic_W start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT with values in the Banach space (𝒞(𝕌dn),)(\mathcal{C}(\mathbb{U}_{d_{n}}),\left\|{\cdot}\right\|_{\infty})( caligraphic_C ( blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ). Now let us describe the two previously introduced statistical settings.

Density estimation

Suppose we observe an i.i.d. sample X1,,Xnsubscript𝑋1subscript𝑋𝑛X_{1},\ldots,X_{n}italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT from a law Pn*superscriptsubscript𝑃𝑛P_{n}^{*}italic_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT over 𝕌dnsubscript𝕌subscript𝑑𝑛\mathbb{U}_{d_{n}}blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT, which admits a continuous density pn*superscriptsubscript𝑝𝑛p_{n}^{*}italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT relative to the Lebesgue measure on dnsuperscriptsubscript𝑑𝑛\mathbb{R}^{d_{n}}blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. The prior Wnsubscript𝑊𝑛W_{n}italic_W start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT puts its mass on a space that is far too large compared to the space of continuous densities. So to correctly retrieve pn*superscriptsubscript𝑝𝑛p_{n}^{*}italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, we will work with the parametrized density pn,Wnsubscript𝑝𝑛subscript𝑊𝑛p_{n,W_{n}}italic_p start_POSTSUBSCRIPT italic_n , italic_W start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT where, for w𝒞(𝕌dn)𝑤𝒞subscript𝕌subscript𝑑𝑛w\in\mathcal{C}(\mathbb{U}_{d_{n}})italic_w ∈ caligraphic_C ( blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ),

(2.1) pn,w(x):=ew(x)𝕌dnew(x)𝑑x.assignsubscript𝑝𝑛𝑤𝑥superscript𝑒𝑤𝑥subscriptsubscript𝕌subscript𝑑𝑛superscript𝑒𝑤𝑥differential-d𝑥p_{n,w}(x)\ :=\ \frac{e^{w(x)}}{\int_{\mathbb{U}_{d_{n}}}e^{w(x)}dx}.italic_p start_POSTSUBSCRIPT italic_n , italic_w end_POSTSUBSCRIPT ( italic_x ) := divide start_ARG italic_e start_POSTSUPERSCRIPT italic_w ( italic_x ) end_POSTSUPERSCRIPT end_ARG start_ARG ∫ start_POSTSUBSCRIPT blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT italic_w ( italic_x ) end_POSTSUPERSCRIPT italic_d italic_x end_ARG .

Here the exponential forces the prior to charge only nonnegative functions while the renormalization ensures that pn,wsubscript𝑝𝑛𝑤p_{n,w}italic_p start_POSTSUBSCRIPT italic_n , italic_w end_POSTSUBSCRIPT integrates to one. The true density pn*superscriptsubscript𝑝𝑛p_{n}^{*}italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT will then be encoded by the parameter fn*𝒞(𝕌dn)superscriptsubscript𝑓𝑛𝒞subscript𝕌subscript𝑑𝑛f_{n}^{*}\in\mathcal{C}(\mathbb{U}_{d_{n}})italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∈ caligraphic_C ( blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) such that pn*=pn,fn*superscriptsubscript𝑝𝑛subscript𝑝𝑛superscriptsubscript𝑓𝑛p_{n}^{*}=p_{n,f_{n}^{*}}italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = italic_p start_POSTSUBSCRIPT italic_n , italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT. In this way, all the assumptions on the true parameter fn*superscriptsubscript𝑓𝑛f_{n}^{*}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT can be transferred to the density pn*superscriptsubscript𝑝𝑛p_{n}^{*}italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT. That is, pn*superscriptsubscript𝑝𝑛p_{n}^{*}italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is supposed to depend only on the d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-dimensional subspace 𝒮nsubscript𝒮𝑛\mathcal{S}_{n}caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT of dnsuperscriptsubscript𝑑𝑛\mathbb{R}^{d_{n}}blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT.

The natural metric between two densities p𝑝pitalic_p and psuperscript𝑝p^{\prime}italic_p start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is the Hellinger distance defined by h(p,p)=pp2𝑝superscript𝑝subscriptnorm𝑝superscript𝑝2h(p,p^{\prime})=\left\|{\sqrt{p}-\sqrt{p^{\prime}}}\right\|_{2}italic_h ( italic_p , italic_p start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = ∥ square-root start_ARG italic_p end_ARG - square-root start_ARG italic_p start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, where 2\left\|{\cdot}\right\|_{2}∥ ⋅ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT is the L2superscript𝐿2L^{2}italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT-norm with respect to the Lebesgue measure. Consequently, if the parameter space is embedded with a prior ΠnsubscriptΠ𝑛\Pi_{n}roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, we will say that the posterior contracts to pn*superscriptsubscript𝑝𝑛p_{n}^{*}italic_p start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT at rate (εn)nsubscriptsubscript𝜀𝑛𝑛(\varepsilon_{n})_{n\in\mathds{N}}( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N end_POSTSUBSCRIPT if, for any sufficiently large constant M𝑀Mitalic_M,

(2.2) n*[Πn(f𝒞(𝕌dn):h(pn,f,pn*)>Mεn|X1,,Xn)]n+0,\operatorname{\mathds{P}}_{n}^{*}\left[\Pi_{n}\left(f\in\mathcal{C}(\mathbb{U}% _{d_{n}}):h(p_{n,f},p^{*}_{n})>M\varepsilon_{n}\ |\ X_{1},\ldots,X_{n}\right)% \right]\mathop{\longrightarrow}\limits_{n\to+\infty}0,blackboard_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT [ roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_f ∈ caligraphic_C ( blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) : italic_h ( italic_p start_POSTSUBSCRIPT italic_n , italic_f end_POSTSUBSCRIPT , italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) > italic_M italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ] ⟶ start_POSTSUBSCRIPT italic_n → + ∞ end_POSTSUBSCRIPT 0 ,

where n*superscriptsubscript𝑛\operatorname{\mathds{P}}_{n}^{*}blackboard_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is the joint law of (X1,,Xn)subscript𝑋1subscript𝑋𝑛(X_{1},\ldots,X_{n})( italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ).

Regression with Gaussian error

In a regression problem, the covariates can be either predetermined for each observation, this is the fixed design case, or can be part of the observation themselves. In the later case, the covariates can be considered as random; this corresponds to the random design case. The notion of posterior contraction differs slightly between these two situations and some clarifications are in order.

Fixed design

In this setting, we consider a sample of n𝑛nitalic_n real observations Y1,,Ynsubscript𝑌1subscript𝑌𝑛Y_{1},\ldots,Y_{n}italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_Y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT satisfying the model Yi=fn*(xi)+ϵisubscript𝑌𝑖superscriptsubscript𝑓𝑛subscript𝑥𝑖subscriptitalic-ϵ𝑖Y_{i}=f_{n}^{*}(x_{i})+\epsilon_{i}italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) + italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, with ϵi𝒩(0,σ2)similar-tosubscriptitalic-ϵ𝑖𝒩0superscript𝜎2\epsilon_{i}\sim\mathcal{N}(0,\sigma^{2})italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∼ caligraphic_N ( 0 , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) where the xi𝕌dnsubscript𝑥𝑖subscript𝕌subscript𝑑𝑛x_{i}\in\mathbb{U}_{d_{n}}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT for i1,n𝑖1𝑛i\in\llbracket 1,n\rrbracketitalic_i ∈ ⟦ 1 , italic_n ⟧ are n𝑛nitalic_n fixed covariates and where the ϵisubscriptitalic-ϵ𝑖\epsilon_{i}italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are n𝑛nitalic_n i.i.d. univariate Gaussian random variables with zero mean and standard deviation σ𝜎\sigmaitalic_σ. As previously, the regression function fn*:{xi:i1,n}:superscriptsubscript𝑓𝑛conditional-setsubscript𝑥𝑖𝑖1𝑛f_{n}^{*}:\{x_{i}:i\in\llbracket 1,n\rrbracket\}\to\mathbb{R}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT : { italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : italic_i ∈ ⟦ 1 , italic_n ⟧ } → blackboard_R is supposed to depend only on a d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-dimensional subspace of dnsuperscriptsubscript𝑑𝑛\mathbb{R}^{d_{n}}blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT.

We will use Wnsubscript𝑊𝑛W_{n}italic_W start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT directly as a prior for the regression function because Wnsubscript𝑊𝑛W_{n}italic_W start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT can be viewed by restriction as a Gaussian process over the space 𝒳n:={xi:i1,n}assignsubscript𝒳𝑛conditional-setsubscript𝑥𝑖𝑖1𝑛\mathcal{X}_{n}:=\{x_{i}:i\in\llbracket 1,n\rrbracket\}caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT := { italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : italic_i ∈ ⟦ 1 , italic_n ⟧ } of design points. To quantify the posterior contraction, we introduce the design dependent semi-metric n\left\|{\cdot}\right\|_{n}∥ ⋅ ∥ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT defined as the L2(nx)superscript𝐿2superscriptsubscript𝑛𝑥L^{2}(\operatorname{\mathds{P}}_{n}^{x})italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( blackboard_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT )-norm for the empirical measure nx=n1i=1nδxisuperscriptsubscript𝑛𝑥superscript𝑛1superscriptsubscript𝑖1𝑛subscript𝛿subscript𝑥𝑖\operatorname{\mathds{P}}_{n}^{x}=n^{-1}\sum_{i=1}^{n}\delta_{x_{i}}blackboard_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT = italic_n start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT of the design points. If the space of regression functions over 𝒳nsubscript𝒳𝑛\mathcal{X}_{n}caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is embedded with a prior ΠnsubscriptΠ𝑛\Pi_{n}roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, we will say that the posterior contracts to fn*superscriptsubscript𝑓𝑛f_{n}^{*}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT at rate (εn)nsubscriptsubscript𝜀𝑛𝑛(\varepsilon_{n})_{n\in\mathds{N}}( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N end_POSTSUBSCRIPT if, for any sufficiently large constant M𝑀Mitalic_M,

(2.3) n*[Πn(f𝒞(𝒳n):ffn*n>Mεn|Y1,,Yn)]n+0,\operatorname{\mathds{P}}_{n}^{*}\left[\Pi_{n}\left(f\in\mathcal{C}(\mathcal{X% }_{n}):\left\|{f-f_{n}^{*}}\right\|_{n}>M\varepsilon_{n}\ |\ Y_{1},\ldots,Y_{n% }\right)\right]\mathop{\longrightarrow}\limits_{n\to+\infty}0,blackboard_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT [ roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_f ∈ caligraphic_C ( caligraphic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) : ∥ italic_f - italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT > italic_M italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_Y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ] ⟶ start_POSTSUBSCRIPT italic_n → + ∞ end_POSTSUBSCRIPT 0 ,

where n*superscriptsubscript𝑛\operatorname{\mathds{P}}_{n}^{*}blackboard_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is the joint law of (Y1,,Yn)subscript𝑌1subscript𝑌𝑛(Y_{1},\ldots,Y_{n})( italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_Y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ).

Random design

Here, we observe n𝑛nitalic_n i.i.d. pairs (X1,Y1),,(Xn,Yn)subscript𝑋1subscript𝑌1subscript𝑋𝑛subscript𝑌𝑛(X_{1},Y_{1}),\ldots,(X_{n},Y_{n})( italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , ( italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) such that Yi=fn*(Xi)+ϵisubscript𝑌𝑖superscriptsubscript𝑓𝑛subscript𝑋𝑖subscriptitalic-ϵ𝑖Y_{i}=f_{n}^{*}(X_{i})+\epsilon_{i}italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) + italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, with i.i.d. ϵi𝒩(0,σ2),σ[1,2]formulae-sequencesimilar-tosubscriptitalic-ϵ𝑖𝒩0superscript𝜎2𝜎12\epsilon_{i}\sim\mathcal{N}(0,\sigma^{2}),\ \sigma\in[1,2]italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∼ caligraphic_N ( 0 , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) , italic_σ ∈ [ 1 , 2 ], and where the Xisubscript𝑋𝑖X_{i}italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT’s are random covariates over 𝕌dnsubscript𝕌subscript𝑑𝑛\mathbb{U}_{d_{n}}blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT independent of the ϵisubscriptitalic-ϵ𝑖\epsilon_{i}italic_ϵ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT’s and admitting a common density Gnsubscript𝐺𝑛G_{n}italic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT that is bounded away from zero. For the sake of simplicity, the standard deviation σ𝜎\sigmaitalic_σ is restricted to the interval [1,2]12[1,2][ 1 , 2 ] but these bounds can be relaxed, see Remark 5.1.1 for details. Again, the regression function fn*:𝕌dn:superscriptsubscript𝑓𝑛subscript𝕌subscript𝑑𝑛f_{n}^{*}:\mathbb{U}_{d_{n}}\to\mathbb{R}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT : blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT → blackboard_R is supposed to depend only on a d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-dimensional subspace of dnsuperscriptsubscript𝑑𝑛\mathbb{R}^{d_{n}}blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. Moreover, we use Wnsubscript𝑊𝑛W_{n}italic_W start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT directly as a prior for the regression function. The natural metric for this problem is the L2(Gn)superscript𝐿2subscript𝐺𝑛L^{2}(G_{n})italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT )-norm denoted by 2,Gn\left\|{\cdot}\right\|_{2,G_{n}}∥ ⋅ ∥ start_POSTSUBSCRIPT 2 , italic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT where Gnsubscript𝐺𝑛G_{n}italic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is identified with the law of one covariate. This metric is not equivalent to the Hellinger metric, which is used in the proof of Theorem 3.1, unless all regression functions are uniformly bounded by a constant Q>0𝑄0Q>0italic_Q > 0. This condition can be fulfilled by projecting the prior on the space of all functions uniformly bounded by Q𝑄Qitalic_Q, as proposed in [GN11], but this would force us to rewrite the proof of Theorem 3.1 only for this setting. Instead, we directly post-process the posterior to integrate this constraint as in [YD16]. Then, the formulation of posterior consistency becomes as follows. Considering a prior ΠnsubscriptΠ𝑛\Pi_{n}roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT over the regression functions, we will say that the posterior contracts to fn*superscriptsubscript𝑓𝑛f_{n}^{*}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT at rate (εn)nsubscriptsubscript𝜀𝑛𝑛(\varepsilon_{n})_{n\in\mathds{N}}( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N end_POSTSUBSCRIPT if, for Q>0𝑄0Q>0italic_Q > 0 and any sufficiently large constants M𝑀Mitalic_M,

(2.4) n*[Πn(f𝒞(𝕌dn):fQfn*Q2,Gn>Mεn|(X1,Y1),,(Xn,Yn))]n+0,\operatorname{\mathds{P}}_{n}^{*}\left[\Pi_{n}\left(f\in\mathcal{C}(\mathbb{U}% _{d_{n}}):\bigl{\|}{f^{Q}-{f_{n}^{*}}^{Q}}\bigr{\|}_{2,G_{n}}>M\varepsilon_{n}% \ |\ (X_{1},Y_{1}),\ldots,(X_{n},Y_{n})\right)\right]\mathop{\longrightarrow}% \limits_{n\to+\infty}0,blackboard_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT [ roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_f ∈ caligraphic_C ( blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) : ∥ italic_f start_POSTSUPERSCRIPT italic_Q end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT italic_Q end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 , italic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT > italic_M italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | ( italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , ( italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ) ] ⟶ start_POSTSUBSCRIPT italic_n → + ∞ end_POSTSUBSCRIPT 0 ,

where n*superscriptsubscript𝑛\operatorname{\mathds{P}}_{n}^{*}blackboard_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is the joint law of (X1,Y1),,(Xn,Yn)subscript𝑋1subscript𝑌1subscript𝑋𝑛subscript𝑌𝑛(X_{1},Y_{1}),\ldots,(X_{n},Y_{n})( italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , ( italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) and where fQ:=(fQ)Qf^{Q}:=(f\vee-Q)\wedge Qitalic_f start_POSTSUPERSCRIPT italic_Q end_POSTSUPERSCRIPT := ( italic_f ∨ - italic_Q ) ∧ italic_Q is the truncated version of f𝑓fitalic_f.


3 Main result for the functional parameter

In order for the true parameter fn*superscriptsubscript𝑓𝑛f_{n}^{*}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT to be recovered, we suppose that its restriction to the d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-dimensional subspace 𝒮nsubscript𝒮𝑛\mathcal{S}_{n}caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT does not depend on the ambient dimension dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT.

Assumption 3.1 (Sparsity of the true parameter).

There exist n0,d0subscript𝑛0subscript𝑑0n_{0},d_{0}\in\mathds{N}italic_n start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ blackboard_N, f0𝒞(𝕌d0)subscript𝑓0𝒞subscript𝕌subscript𝑑0f_{0}\in\mathcal{C}(\mathbb{U}_{d_{0}})italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ caligraphic_C ( blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ), and a sequence of linear isometries qn*𝒪dnsuperscriptsubscript𝑞𝑛subscript𝒪subscript𝑑𝑛q_{n}^{*}\in\mathcal{O}_{d_{n}}italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT such that for all nn0𝑛subscript𝑛0n\geq n_{0}italic_n ≥ italic_n start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, we have dnd0subscript𝑑𝑛subscript𝑑0d_{n}\geq d_{0}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≥ italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, and fn*(x)=f0((qn*(x))𝐝𝟎)superscriptsubscript𝑓𝑛𝑥subscript𝑓0subscriptsuperscriptsubscript𝑞𝑛𝑥subscript𝐝0f_{n}^{*}(x)=f_{0}\left((q_{n}^{*}(x))_{\mathbf{d_{0}}}\right)italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) = italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ( italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) ) start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ), for all x𝕌dn𝑥subscript𝕌subscript𝑑𝑛x\in\mathbb{U}_{d_{n}}italic_x ∈ blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT.

In this way, each fn*superscriptsubscript𝑓𝑛f_{n}^{*}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT can be viewed as a sparse continuation in dimension dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT of an underlying fixed function f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT called the core function. The use of isometries instead of vector subspaces permits us to avoid the manipulation of the Grassmannian. We will use instead the more convenient orthogonal group 𝒪dnsubscript𝒪subscript𝑑𝑛\mathcal{O}_{d_{n}}caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT. The next property is straightforward.

Property 3.1.

For nn0𝑛subscript𝑛0n\geq n_{0}italic_n ≥ italic_n start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, fn*superscriptsubscript𝑓𝑛f_{n}^{*}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is constant on the intersection between 𝕌dnsubscript𝕌subscript𝑑𝑛\mathbb{U}_{d_{n}}blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT and the affine subspaces (qn*)1(E1𝐝𝟎)+xsuperscriptsuperscriptsubscript𝑞𝑛1subscript𝐸1subscript𝐝0𝑥(q_{n}^{*})^{-1}(E_{1-\mathbf{d_{0}}})+x( italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT 1 - bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) + italic_x, for xdn𝑥superscriptsubscript𝑑𝑛x\in\mathbb{R}^{d_{n}}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT.

In parallel to the dimension adaptability, the present setting allows the core function f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT to be arbitrarily smooth (in a Hölder sense) while maintaining near-minimax contraction rates.

Assumption 3.2 (Smoothness of f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT).

There exists β>0𝛽0\beta>0italic_β > 0 such that f0β(𝕌d0)subscript𝑓0superscript𝛽subscript𝕌subscript𝑑0f_{0}\in\mathfrak{C}^{\beta}(\mathbb{U}_{d_{0}})italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ fraktur_C start_POSTSUPERSCRIPT italic_β end_POSTSUPERSCRIPT ( blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ).

3.1 Prior specification

Here we specify the hierarchical prior on the parameter space. The true parameter fn*superscriptsubscript𝑓𝑛f_{n}^{*}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is characterized by a sparsity pattern (d0,qn*)subscript𝑑0superscriptsubscript𝑞𝑛(d_{0},q_{n}^{*})( italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ), where the intrinsic dimension d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is the one of the relevant subspace and qn*𝒪dnsuperscriptsubscript𝑞𝑛subscript𝒪subscript𝑑𝑛q_{n}^{*}\in\mathcal{O}_{d_{n}}italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT is an isometry for the orientation; its smoothness is modeled by a rescaling parameter, and the core function f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is modeled by a standard squared exponential Gaussian process which has infinitely smooth sample paths. Indeed, this process has proven to be fruitful in combination with a scale parameter and allows smoothness adaptation (see [VV09]).

For n>0𝑛0n>0italic_n > 0, let W=(W(x):xdn)W=(W(x):x\in\mathbb{R}^{d_{n}})italic_W = ( italic_W ( italic_x ) : italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) be a standard squared exponential Gaussian process on dnsuperscriptsubscript𝑑𝑛\mathbb{R}^{d_{n}}blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT; that is, a centered Gaussian process with covariance kernel

𝔼[W(s)W(t)]=exp(st2),for all s,tdn,formulae-sequence𝔼𝑊𝑠𝑊𝑡superscriptnorm𝑠𝑡2for all 𝑠𝑡superscriptsubscript𝑑𝑛\operatorname{\mathbb{E}}[W(s)W(t)]\ =\ \exp\bigl{(}-\left\|{s-t}\right\|^{2}% \bigr{)},\qquad\text{for all }s,t\in\mathbb{R}^{d_{n}},blackboard_E [ italic_W ( italic_s ) italic_W ( italic_t ) ] = roman_exp ( - ∥ italic_s - italic_t ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) , for all italic_s , italic_t ∈ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ,

where \left\|{\cdot}\right\|∥ ⋅ ∥ is the Euclidean norm.

Let a>0𝑎0a>0italic_a > 0, b1,dn𝑏1subscript𝑑𝑛b\in\llbracket 1,d_{n}\rrbracketitalic_b ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⟧, and q𝒪dn𝑞subscript𝒪subscript𝑑𝑛q\in\mathcal{O}_{d_{n}}italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT. We define Wxa,b,q:=W(aDiag(𝕓)q(x))assignsuperscriptsubscript𝑊𝑥𝑎𝑏𝑞𝑊𝑎Diag𝕓𝑞𝑥W_{x}^{a,b,q}:=W(a\operatorname{Diag}(\mathbb{b})\cdot q(x))italic_W start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT := italic_W ( italic_a roman_Diag ( blackboard_b ) ⋅ italic_q ( italic_x ) ) and Wa,b,q:=(Wxa,b,q:xdn)W^{a,b,q}:=(W_{x}^{a,b,q}:x\in\mathbb{R}^{d_{n}})italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT := ( italic_W start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT : italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) a rescaled Gaussian process with sparsity pattern (b,q)𝑏𝑞(b,q)( italic_b , italic_q ), where Diag(𝕓)Diag𝕓\operatorname{Diag}(\mathbb{b})roman_Diag ( blackboard_b ) is the diagonal matrix with diagonal vector 𝕓𝕓\mathbb{b}blackboard_b. Then, the process Wa,b,qsuperscript𝑊𝑎𝑏𝑞W^{a,b,q}italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT is constant on affine subspaces q1(E1𝕓)+xsuperscript𝑞1subscript𝐸1𝕓𝑥q^{-1}(E_{1-\mathbb{b}})+xitalic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT 1 - blackboard_b end_POSTSUBSCRIPT ) + italic_x, for xdn𝑥superscriptsubscript𝑑𝑛x\in\mathbb{R}^{d_{n}}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT (as in Property 3.1) and if R:=q1Diag(𝕓)qassign𝑅superscript𝑞1Diag𝕓𝑞R:=q^{-1}\operatorname{Diag}(\mathbb{b})qitalic_R := italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Diag ( blackboard_b ) italic_q is the orthogonal projection onto q1(E𝕓)superscript𝑞1subscript𝐸𝕓q^{-1}(E_{\mathbb{b}})italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT blackboard_b end_POSTSUBSCRIPT ), then Wxa,b,q=WRxa,b,qsubscriptsuperscript𝑊𝑎𝑏𝑞𝑥subscriptsuperscript𝑊𝑎𝑏𝑞𝑅𝑥W^{a,b,q}_{x}=W^{a,b,q}_{Rx}italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT = italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_R italic_x end_POSTSUBSCRIPT, for all xdn𝑥superscriptsubscript𝑑𝑛x\in\mathbb{R}^{d_{n}}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT.

To work properly with Wa,b,qsuperscript𝑊𝑎𝑏𝑞W^{a,b,q}italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT, we have to verify that its law identifies with the law of a b𝑏bitalic_b-dimensional standard squared exponential process. To do so, define

ϕ:b:italic-ϕsuperscript𝑏\displaystyle\phi:\ \mathbb{R}^{b}italic_ϕ : blackboard_R start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT q1(E𝕓)absentsuperscript𝑞1subscript𝐸𝕓\displaystyle\ \to\ q^{-1}(E_{\mathbb{b}})→ italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT blackboard_b end_POSTSUBSCRIPT )
x𝑥\displaystyle xitalic_x 1aq1(x𝕓),maps-toabsent1𝑎superscript𝑞1superscript𝑥𝕓\displaystyle\ \mapsto\ \frac{1}{a}q^{-1}(x^{\mathbb{b}}),↦ divide start_ARG 1 end_ARG start_ARG italic_a end_ARG italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT blackboard_b end_POSTSUPERSCRIPT ) ,

a bijection with inverse ϕ1(t)=a(qt)𝕓superscriptitalic-ϕ1𝑡𝑎subscript𝑞𝑡𝕓\phi^{-1}(t)=a(qt)_{\mathbb{b}}italic_ϕ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_t ) = italic_a ( italic_q italic_t ) start_POSTSUBSCRIPT blackboard_b end_POSTSUBSCRIPT for tq1(E𝕓)𝑡superscript𝑞1subscript𝐸𝕓t\in q^{-1}(E_{\mathbb{b}})italic_t ∈ italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT blackboard_b end_POSTSUBSCRIPT ). Then, Wϕ(x)a,b,q=W(x𝕓)subscriptsuperscript𝑊𝑎𝑏𝑞italic-ϕ𝑥𝑊superscript𝑥𝕓W^{a,b,q}_{\phi(x)}=W(x^{\mathbb{b}})italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ϕ ( italic_x ) end_POSTSUBSCRIPT = italic_W ( italic_x start_POSTSUPERSCRIPT blackboard_b end_POSTSUPERSCRIPT ) for all xb𝑥superscript𝑏x\in\mathbb{R}^{b}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT.

Let us introduce ~W:=(Wϕ(x)a,b,q,xb)assign~absent𝑊subscriptsuperscript𝑊𝑎𝑏𝑞italic-ϕ𝑥𝑥superscript𝑏\mathrlap{\,\widetilde{\phantom{A}}}W:=(W^{a,b,q}_{\phi(x)},x\in\mathbb{R}^{b})start_ARG over~ start_ARG end_ARG end_ARG italic_W := ( italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_ϕ ( italic_x ) end_POSTSUBSCRIPT , italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT ). Then, for all x,yb×b𝑥𝑦superscript𝑏superscript𝑏x,y\in\mathbb{R}^{b}\times\mathbb{R}^{b}italic_x , italic_y ∈ blackboard_R start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT × blackboard_R start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT, we have

𝔼[~W(x)~W(y)]=𝔼[W(x𝕓)W(y𝕓)]=ex𝕓y𝕓2.𝔼~absent𝑊𝑥~absent𝑊𝑦𝔼𝑊superscript𝑥𝕓𝑊superscript𝑦𝕓superscript𝑒superscriptnormsuperscript𝑥𝕓superscript𝑦𝕓2\operatorname{\mathbb{E}}[\mathrlap{\,\widetilde{\phantom{A}}}W(x)\mathrlap{\,% \widetilde{\phantom{A}}}W(y)]\ =\ \operatorname{\mathbb{E}}[W(x^{\mathbb{b}})W% (y^{\mathbb{b}})]\ =\ e^{-\left\|{x^{\mathbb{b}}-y^{\mathbb{b}}}\right\|^{2}}.blackboard_E [ start_ARG over~ start_ARG end_ARG end_ARG italic_W ( italic_x ) start_ARG over~ start_ARG end_ARG end_ARG italic_W ( italic_y ) ] = blackboard_E [ italic_W ( italic_x start_POSTSUPERSCRIPT blackboard_b end_POSTSUPERSCRIPT ) italic_W ( italic_y start_POSTSUPERSCRIPT blackboard_b end_POSTSUPERSCRIPT ) ] = italic_e start_POSTSUPERSCRIPT - ∥ italic_x start_POSTSUPERSCRIPT blackboard_b end_POSTSUPERSCRIPT - italic_y start_POSTSUPERSCRIPT blackboard_b end_POSTSUPERSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT .

So ~W~absent𝑊\mathrlap{\,\widetilde{\phantom{A}}}Wstart_ARG over~ start_ARG end_ARG end_ARG italic_W is a standard squared exponential Gaussian process in dimension b𝑏bitalic_b that does not depend on a𝑎aitalic_a nor q𝑞qitalic_q. Moreover, we have Wta,b,q=~W(ϕ1(Rt))subscriptsuperscript𝑊𝑎𝑏𝑞𝑡~absent𝑊superscriptitalic-ϕ1𝑅𝑡W^{a,b,q}_{t}=\mathrlap{\,\widetilde{\phantom{A}}}W(\phi^{-1}(Rt))italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = start_ARG over~ start_ARG end_ARG end_ARG italic_W ( italic_ϕ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_R italic_t ) ).

From now on, Wa,b,qsuperscript𝑊𝑎𝑏𝑞W^{a,b,q}italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT will refer to the restriction on 𝕌dnsubscript𝕌subscript𝑑𝑛\mathbb{U}_{d_{n}}blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT of this process. Then, the hierarchical prior on the parameter f𝒞(𝕌dn)𝑓𝒞subscript𝕌subscript𝑑𝑛f\in\mathcal{C}(\mathbb{U}_{d_{n}})italic_f ∈ caligraphic_C ( blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) with stochastic subspace selection is defined as the law ΠnsubscriptΠ𝑛\Pi_{n}roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT of WA,Γ,Θsuperscript𝑊𝐴ΓΘW^{A,\Gamma,\Theta}italic_W start_POSTSUPERSCRIPT italic_A , roman_Γ , roman_Θ end_POSTSUPERSCRIPT, where A𝐴Aitalic_A is the scaling parameter, Γ1,dnΓ1subscript𝑑𝑛\Gamma\in\llbracket 1,d_{n}\rrbracketroman_Γ ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⟧ is the prior on the subspace dimension, and ΘΘ\Thetaroman_Θ is the prior on the orientation.

Assumption 3.3.

The intrinsic dimension d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT of the subspace is assumed to be bounded by a known deterministic number dmaxsubscript𝑑maxd_{\mathrm{max}}italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT.

Consequently, ΓΓ\Gammaroman_Γ is defined by a probability vector (πΓ(d):1ddmax):subscript𝜋Γ𝑑1𝑑subscript𝑑max(\pi_{\Gamma}(d):1\leq d\leq d_{\mathrm{max}})( italic_π start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT ( italic_d ) : 1 ≤ italic_d ≤ italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ) with πΓ(d)>0subscript𝜋Γ𝑑0\pi_{\Gamma}(d)>0italic_π start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT ( italic_d ) > 0 for all d𝑑ditalic_d. Moreover, we define the scaling parameter A𝐴Aitalic_A such that there exists a collection of probability measures πn,dsubscript𝜋𝑛𝑑\pi_{n,d}italic_π start_POSTSUBSCRIPT italic_n , italic_d end_POSTSUBSCRIPT on (0,)0(0,\infty)( 0 , ∞ ), 0ddmaxdn0𝑑subscript𝑑maxsubscript𝑑𝑛0\leq d\leq d_{\mathrm{max}}\wedge d_{n}0 ≤ italic_d ≤ italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ∧ italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, with A|(Γ=d)πn,dsimilar-toconditional𝐴Γ𝑑subscript𝜋𝑛𝑑A\ |\ (\Gamma=d)\sim\pi_{n,d}italic_A | ( roman_Γ = italic_d ) ∼ italic_π start_POSTSUBSCRIPT italic_n , italic_d end_POSTSUBSCRIPT. We require the law of the stochastic isometry ΘΘ\Thetaroman_Θ to be translation invariant. That is, for all subset 𝒬𝒬\mathcal{Q}caligraphic_Q of 𝒪dnsubscript𝒪subscript𝑑𝑛\mathcal{O}_{d_{n}}caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT and for all q𝒪dn𝑞subscript𝒪subscript𝑑𝑛q\in\mathcal{O}_{d_{n}}italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT, we need (Θq𝒬)=(Θ𝒬)Θ𝑞𝒬Θ𝒬\operatorname{\mathds{P}}(\Theta\in q\cdot\mathcal{Q})=\operatorname{\mathds{P% }}(\Theta\in\mathcal{Q})blackboard_P ( roman_Θ ∈ italic_q ⋅ caligraphic_Q ) = blackboard_P ( roman_Θ ∈ caligraphic_Q ). Therefore, the law of ΘΘ\Thetaroman_Θ is taken as the unit Haar measure on 𝒪dnsubscript𝒪subscript𝑑𝑛\mathcal{O}_{d_{n}}caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT, the only probability measure that is translation invariant on 𝒪dnsubscript𝒪subscript𝑑𝑛\mathcal{O}_{d_{n}}caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT. In addition, all A,Γ𝐴ΓA,\Gammaitalic_A , roman_Γ, and ΘΘ\Thetaroman_Θ are supposed to be independent of W𝑊Witalic_W.

For convenience, the notation πn,dsubscript𝜋𝑛𝑑\pi_{n,d}italic_π start_POSTSUBSCRIPT italic_n , italic_d end_POSTSUBSCRIPT will refer to a probability measure as well as its density.

Assumption 3.4 (Rescaling measures).

There exist constants D1,D2,C1subscript𝐷1subscript𝐷2subscript𝐶1D_{1},D_{2},C_{1}italic_D start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_D start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, C2subscript𝐶2C_{2}italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, and c>1𝑐1c>1italic_c > 1 such that for all n*𝑛superscriptn\in\mathds{N}^{*}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT and d<dmaxdn𝑑subscript𝑑maxsubscript𝑑𝑛d<d_{\mathrm{max}}\wedge d_{n}italic_d < italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ∧ italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, the density πn,dsubscript𝜋𝑛𝑑\pi_{n,d}italic_π start_POSTSUBSCRIPT italic_n , italic_d end_POSTSUBSCRIPT satisfies

  1. 1.

    for all sufficiently large a𝑎aitalic_a, πn,d(a)D1eC1ad(loga)d+1subscript𝜋𝑛𝑑𝑎subscript𝐷1superscript𝑒subscript𝐶1superscript𝑎𝑑superscript𝑎𝑑1\pi_{n,d}(a)\geq D_{1}e^{-C_{1}a^{d}(\log a)^{d+1}}italic_π start_POSTSUBSCRIPT italic_n , italic_d end_POSTSUBSCRIPT ( italic_a ) ≥ italic_D start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT ( roman_log italic_a ) start_POSTSUPERSCRIPT italic_d + 1 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ;

  2. 2.

    for all a>c𝑎𝑐a>citalic_a > italic_c, πn,d(a)D2eC2ad(loga)d+1subscript𝜋𝑛𝑑𝑎subscript𝐷2superscript𝑒subscript𝐶2superscript𝑎𝑑superscript𝑎𝑑1\pi_{n,d}(a)\leq D_{2}e^{-C_{2}a^{d}(\log a)^{d+1}}italic_π start_POSTSUBSCRIPT italic_n , italic_d end_POSTSUBSCRIPT ( italic_a ) ≤ italic_D start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT ( roman_log italic_a ) start_POSTSUPERSCRIPT italic_d + 1 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ;

  3. 3.

    πn,d([0,c])=0subscript𝜋𝑛𝑑0𝑐0\pi_{n,d}([0,c])=0italic_π start_POSTSUBSCRIPT italic_n , italic_d end_POSTSUBSCRIPT ( [ 0 , italic_c ] ) = 0.

Assumptions similar to Assumption 3.4 are standard, see for instance Equation (3.4) in [VV09] or Assumption 5 [JT21]. For example, this assumption is satisfied if, for all n*𝑛superscriptn\in\mathds{N}^{*}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT and d<dmaxdn𝑑subscript𝑑maxsubscript𝑑𝑛d<d_{\mathrm{max}}\wedge d_{n}italic_d < italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ∧ italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, Ad(logA)d+1|(Γ=d)conditionalsuperscript𝐴𝑑superscript𝐴𝑑1Γ𝑑A^{d}(\log A)^{d+1}\ |\ (\Gamma=d)italic_A start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT ( roman_log italic_A ) start_POSTSUPERSCRIPT italic_d + 1 end_POSTSUPERSCRIPT | ( roman_Γ = italic_d ) is the restriction to (c,+)𝑐(c,+\infty)( italic_c , + ∞ ) of an exponential law with parameter independent of d𝑑ditalic_d and n𝑛nitalic_n (indeed, if g(A)𝑔𝐴g(A)italic_g ( italic_A ) has density function f𝑓fitalic_f, with g𝑔gitalic_g differentiable and strictly increasing, then A𝐴Aitalic_A has density function (fg)g𝑓𝑔superscript𝑔(f\circ g)\cdot g^{\prime}( italic_f ∘ italic_g ) ⋅ italic_g start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT).

The next section gives some precision about the reproducing kernel Hilbert space (RKHS) of Wa,b,qsuperscript𝑊𝑎𝑏𝑞W^{a,b,q}italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT. The content is a bit technical and can be skipped at first reading.

3.2 Reproducing kernel Hilbert space of Wa,b,qsuperscript𝑊𝑎𝑏𝑞W^{a,b,q}italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT

One of the advantages of choosing a Gaussian process prior is that the contraction rate depends explicitly on the small ball probability and on the relative position of the parameter with respect to the RKHS associated with the process. This section is dedicated to the basic properties of this space. For elementary definitions and for some precision about the link between the contraction rate and the RKHS, we refer the reader to [VV08] and [VV08a].

Notation.

We denote by 𝒞(𝕌d|q1(E𝕓))𝒞conditionalsubscript𝕌𝑑superscript𝑞1subscript𝐸𝕓\mathcal{C}(\mathbb{U}_{d}\ |\ q^{-1}(E_{\mathbb{b}}))caligraphic_C ( blackboard_U start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT | italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT blackboard_b end_POSTSUBSCRIPT ) ) the space of continuous functions on 𝕌dsubscript𝕌𝑑\mathbb{U}_{d}blackboard_U start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT which are constant on affine subspaces q1(E1𝕓)+xsuperscript𝑞1subscript𝐸1𝕓𝑥q^{-1}(E_{1-\mathbb{b}})+xitalic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT 1 - blackboard_b end_POSTSUBSCRIPT ) + italic_x, for x𝕌d𝑥subscript𝕌𝑑x\in\mathbb{U}_{d}italic_x ∈ blackboard_U start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT.

We introduce the operator

Λ:{𝒞(𝕌b)𝒞(𝕌d|q1(E𝕓))fΛf:{𝕌dxf((qx)𝕓),:Λcases𝒞subscript𝕌𝑏absent𝒞conditionalsubscript𝕌𝑑superscript𝑞1subscript𝐸𝕓𝑓:maps-toabsentΛ𝑓casessubscript𝕌𝑑absent𝑥maps-toabsent𝑓subscript𝑞𝑥𝕓\Lambda:\begin{cases}\mathcal{C}(\mathbb{U}_{b})&\kern-6.0pt\to\ \mathcal{C}(% \mathbb{U}_{d}\ |\ q^{-1}(E_{\mathbb{b}}))\\[8.5359pt] \phantom{\mathcal{L}^{2}}f&\kern-6.0pt\mapsto\ \Lambda f:\begin{cases}\mathbb{% U}_{d}&\kern-6.0pt\to\ \mathbb{R}\\ \ x&\kern-6.0pt\mapsto\ f((qx)_{\mathbb{b}}),\end{cases}\end{cases}roman_Λ : { start_ROW start_CELL caligraphic_C ( blackboard_U start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) end_CELL start_CELL → caligraphic_C ( blackboard_U start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT | italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT blackboard_b end_POSTSUBSCRIPT ) ) end_CELL end_ROW start_ROW start_CELL italic_f end_CELL start_CELL ↦ roman_Λ italic_f : { start_ROW start_CELL blackboard_U start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT end_CELL start_CELL → blackboard_R end_CELL end_ROW start_ROW start_CELL italic_x end_CELL start_CELL ↦ italic_f ( ( italic_q italic_x ) start_POSTSUBSCRIPT blackboard_b end_POSTSUBSCRIPT ) , end_CELL end_ROW end_CELL end_ROW

so that Wa,b,q=Λ(~Wa)superscript𝑊𝑎𝑏𝑞Λ~absentsuperscript𝑊𝑎W^{a,b,q}=\Lambda(\mathrlap{\,\widetilde{\phantom{A}}}W^{a})italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT = roman_Λ ( start_ARG over~ start_ARG end_ARG end_ARG italic_W start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT ), where ~Wa=(~Wat,t𝕌b)~absentsuperscript𝑊𝑎~absentsubscript𝑊𝑎𝑡𝑡subscript𝕌𝑏\mathrlap{\,\widetilde{\phantom{A}}}W^{a}=(\mathrlap{\,\widetilde{\phantom{A}}% }W_{at},t\in\mathbb{U}_{b})start_ARG over~ start_ARG end_ARG end_ARG italic_W start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT = ( start_ARG over~ start_ARG end_ARG end_ARG italic_W start_POSTSUBSCRIPT italic_a italic_t end_POSTSUBSCRIPT , italic_t ∈ blackboard_U start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) is the process ~W~absent𝑊\mathrlap{\,\widetilde{\phantom{A}}}Wstart_ARG over~ start_ARG end_ARG end_ARG italic_W introduced above rescaled by a𝑎aitalic_a and restricted to 𝕌bsubscript𝕌𝑏\mathbb{U}_{b}blackboard_U start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT. It is a bijective linear map and also an isometry if the domain and the codomain are endowed with the uniform norm. In particular, the map ΛΛ\Lambdaroman_Λ is continuous. According to Lemma 7.1 in [VV08a], if ~asubscript~𝑎\widetilde{\mathbb{H}}_{a}over~ start_ARG blackboard_H end_ARG start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is the RKHS of ~Wa~absentsuperscript𝑊𝑎\mathrlap{\,\widetilde{\phantom{A}}}W^{a}start_ARG over~ start_ARG end_ARG end_ARG italic_W start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT, then the RKHS a,b,qsubscript𝑎𝑏𝑞\mathbb{H}_{a,b,q}blackboard_H start_POSTSUBSCRIPT italic_a , italic_b , italic_q end_POSTSUBSCRIPT of Wa,b,qsuperscript𝑊𝑎𝑏𝑞W^{a,b,q}italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT is equal to Λ(~a)Λsubscript~𝑎\Lambda(\widetilde{\mathbb{H}}_{a})roman_Λ ( over~ start_ARG blackboard_H end_ARG start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ). Let us detail its elements. The stochastic process RKHS of ~Wa~absentsuperscript𝑊𝑎\mathrlap{\,\widetilde{\phantom{A}}}W^{a}start_ARG over~ start_ARG end_ARG end_ARG italic_W start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT (as defined in [VV08a]) is composed of functions h:𝕌b:subscript𝕌𝑏h:\mathbb{U}_{b}\to\mathbb{R}italic_h : blackboard_U start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT → blackboard_R for which there exists ψL2(μa,bse)𝜓subscriptsuperscript𝐿2subscriptsuperscript𝜇𝑠𝑒𝑎𝑏\psi\in L^{2}_{\mathbb{C}}(\mu^{se}_{a,b})italic_ψ ∈ italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT blackboard_C end_POSTSUBSCRIPT ( italic_μ start_POSTSUPERSCRIPT italic_s italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a , italic_b end_POSTSUBSCRIPT ) such that

(3.1) h(t)=𝔢beiλtψ(λ)𝑑μa,bse(λ),t𝕌b,formulae-sequence𝑡𝔢subscriptsuperscript𝑏superscript𝑒𝑖𝜆𝑡𝜓𝜆differential-dsubscriptsuperscript𝜇𝑠𝑒𝑎𝑏𝜆𝑡subscript𝕌𝑏h(t)\ =\ \operatorname{\mathfrak{Re}}\int_{\mathbb{R}^{b}}e^{-i\lambda\cdot t}% \psi(\lambda)d\mu^{se}_{a,b}(\lambda),\quad t\in\mathbb{U}_{b},italic_h ( italic_t ) = start_OPFUNCTION fraktur_R fraktur_e end_OPFUNCTION ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_i italic_λ ⋅ italic_t end_POSTSUPERSCRIPT italic_ψ ( italic_λ ) italic_d italic_μ start_POSTSUPERSCRIPT italic_s italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a , italic_b end_POSTSUBSCRIPT ( italic_λ ) , italic_t ∈ blackboard_U start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ,

where μa,bsesubscriptsuperscript𝜇𝑠𝑒𝑎𝑏\mu^{se}_{a,b}italic_μ start_POSTSUPERSCRIPT italic_s italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a , italic_b end_POSTSUBSCRIPT is the spectral measure of the a𝑎aitalic_a-rescaled squared exponential process in dimension b𝑏bitalic_b with spectral density fa,bse:t(2aπ)bexp(14t/a2):subscriptsuperscript𝑓𝑠𝑒𝑎𝑏maps-to𝑡superscript2𝑎𝜋𝑏14superscriptnorm𝑡𝑎2f^{se}_{a,b}:t\mapsto(2a\sqrt{\pi})^{-b}\exp(-\frac{1}{4}\|t/a\|^{2})italic_f start_POSTSUPERSCRIPT italic_s italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a , italic_b end_POSTSUBSCRIPT : italic_t ↦ ( 2 italic_a square-root start_ARG italic_π end_ARG ) start_POSTSUPERSCRIPT - italic_b end_POSTSUPERSCRIPT roman_exp ( - divide start_ARG 1 end_ARG start_ARG 4 end_ARG ∥ italic_t / italic_a ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) (see Lemma 4.1 in [VV09], and the following discussion). We can view ~Wa~absentsuperscript𝑊𝑎\mathrlap{\,\widetilde{\phantom{A}}}W^{a}start_ARG over~ start_ARG end_ARG end_ARG italic_W start_POSTSUPERSCRIPT italic_a end_POSTSUPERSCRIPT as a random Gaussian element with values in the Banach space (𝒞(𝕌b),)(\mathcal{C}(\mathbb{U}_{b}),\left\|{\cdot}\right\|_{\infty})( caligraphic_C ( blackboard_U start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ) , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ). Thus, according to Theorem 2.1 in [VV08a], the stochastic process RKHS and the Banach space RKHS coincide and we can apply Lemma 7.1 from the same reference. The space a,b,q=Λ(~a)subscript𝑎𝑏𝑞Λsubscript~𝑎\mathbb{H}_{a,b,q}=\Lambda(\widetilde{\mathbb{H}}_{a})blackboard_H start_POSTSUBSCRIPT italic_a , italic_b , italic_q end_POSTSUBSCRIPT = roman_Λ ( over~ start_ARG blackboard_H end_ARG start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ) is then the set of functions

(3.2) h¯:x𝕌d𝔢beiλ,(qx)𝕓ψ(λ)𝑑μa,bse(λ),:¯𝑥subscript𝕌𝑑maps-to𝔢subscriptsuperscript𝑏superscript𝑒𝑖𝜆subscript𝑞𝑥𝕓𝜓𝜆differential-dsubscriptsuperscript𝜇𝑠𝑒𝑎𝑏𝜆\overline{h}:x\in\mathbb{U}_{d}\ \mapsto\ \operatorname{\mathfrak{Re}}\int_{% \mathbb{R}^{b}}e^{-i\langle\lambda,(qx)_{\mathbb{b}}\rangle}\psi(\lambda)d\mu^% {se}_{a,b}(\lambda),over¯ start_ARG italic_h end_ARG : italic_x ∈ blackboard_U start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ↦ start_OPFUNCTION fraktur_R fraktur_e end_OPFUNCTION ∫ start_POSTSUBSCRIPT blackboard_R start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_e start_POSTSUPERSCRIPT - italic_i ⟨ italic_λ , ( italic_q italic_x ) start_POSTSUBSCRIPT blackboard_b end_POSTSUBSCRIPT ⟩ end_POSTSUPERSCRIPT italic_ψ ( italic_λ ) italic_d italic_μ start_POSTSUPERSCRIPT italic_s italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a , italic_b end_POSTSUBSCRIPT ( italic_λ ) ,

where ψ𝜓\psiitalic_ψ runs through L2(μa,bse)subscriptsuperscript𝐿2subscriptsuperscript𝜇𝑠𝑒𝑎𝑏L^{2}_{\mathbb{C}}(\mu^{se}_{a,b})italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT blackboard_C end_POSTSUBSCRIPT ( italic_μ start_POSTSUPERSCRIPT italic_s italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a , italic_b end_POSTSUBSCRIPT ) and the RKHS norm is h¯a,b,q=ψL2(μa,bse)subscriptnorm¯subscript𝑎𝑏𝑞subscriptnorm𝜓superscript𝐿2subscriptsuperscript𝜇𝑠𝑒𝑎𝑏\left\|{\overline{h}}\right\|_{\mathbb{H}_{a,b,q}}=\left\|{\psi}\right\|_{L^{2% }(\mu^{se}_{a,b})}∥ over¯ start_ARG italic_h end_ARG ∥ start_POSTSUBSCRIPT blackboard_H start_POSTSUBSCRIPT italic_a , italic_b , italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT = ∥ italic_ψ ∥ start_POSTSUBSCRIPT italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_μ start_POSTSUPERSCRIPT italic_s italic_e end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_a , italic_b end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT.

We remark that functions of the RKHS of Wa,b,qsuperscript𝑊𝑎𝑏𝑞W^{a,b,q}italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT have the same sparsity-pattern as the trajectories of Wa,b,qsuperscript𝑊𝑎𝑏𝑞W^{a,b,q}italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT.

Remark 3.1.

Functions h¯a,b,q¯subscript𝑎𝑏𝑞\overline{h}\in\mathbb{H}_{a,b,q}over¯ start_ARG italic_h end_ARG ∈ blackboard_H start_POSTSUBSCRIPT italic_a , italic_b , italic_q end_POSTSUBSCRIPT are constant on affine subspaces q1(E1𝕓)+xsuperscript𝑞1subscript𝐸1𝕓𝑥q^{-1}(E_{1-\mathbb{b}})+xitalic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT 1 - blackboard_b end_POSTSUBSCRIPT ) + italic_x for x𝕌d𝑥subscript𝕌𝑑x\in\mathbb{U}_{d}italic_x ∈ blackboard_U start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT.

As mentioned at the beginning of this section, contraction rates under Gaussian process prior depend on two quantities: the small ball probability and the relative position of the parameter with respect to the RKHS. For a parameter f𝒞(𝕌d|q1(E𝕓))𝑓𝒞conditionalsubscript𝕌𝑑superscript𝑞1subscript𝐸𝕓f\in\mathcal{C}(\mathbb{U}_{d}\ |\ q^{-1}(E_{\mathbb{b}}))italic_f ∈ caligraphic_C ( blackboard_U start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT | italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT blackboard_b end_POSTSUBSCRIPT ) ) and ε>0𝜀0\varepsilon>0italic_ε > 0, these two quantities define the concentration function ϕfa,b,qsuperscriptsubscriptitalic-ϕ𝑓𝑎𝑏𝑞\phi_{f}^{a,b,q}italic_ϕ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT, with

(3.3) ϕfa,b,q(ε):=infha,b,q:hf<εha,b,q2log(Wa,b,q<ε).assignsuperscriptsubscriptitalic-ϕ𝑓𝑎𝑏𝑞𝜀subscriptinfimum:subscript𝑎𝑏𝑞subscriptnorm𝑓𝜀superscriptsubscriptnormsubscript𝑎𝑏𝑞2subscriptnormsuperscript𝑊𝑎𝑏𝑞𝜀\phi_{f}^{a,b,q}(\varepsilon)\ :=\ \inf_{h\in\mathbb{H}_{a,b,q}:\left\|{h-f}% \right\|_{\infty}<\varepsilon}\left\|{h}\right\|_{\mathbb{H}_{a,b,q}}^{2}-\log% \operatorname{\mathds{P}}\left(\left\|{W^{a,b,q}}\right\|_{\infty}<\varepsilon% \right).italic_ϕ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT ( italic_ε ) := roman_inf start_POSTSUBSCRIPT italic_h ∈ blackboard_H start_POSTSUBSCRIPT italic_a , italic_b , italic_q end_POSTSUBSCRIPT : ∥ italic_h - italic_f ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT < italic_ε end_POSTSUBSCRIPT ∥ italic_h ∥ start_POSTSUBSCRIPT blackboard_H start_POSTSUBSCRIPT italic_a , italic_b , italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - roman_log blackboard_P ( ∥ italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT < italic_ε ) .

3.3 Posterior consistency

Before we state the theorem, we need a last assumption, which determines how the ambient dimension dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is allowed to grow with the sample size n𝑛nitalic_n.

Assumption 3.5 (Growth of dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT).

The ambient dimension dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT satisfies

dnCDnd02β+d0(logn)2κ1,subscript𝑑𝑛subscript𝐶𝐷superscript𝑛subscript𝑑02𝛽subscript𝑑0superscript𝑛2𝜅1d_{n}\ \leq\ C_{D}\cdot n^{\tfrac{d_{0}}{2\beta+d_{0}}}\cdot(\log n)^{2\kappa-% 1},italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≤ italic_C start_POSTSUBSCRIPT italic_D end_POSTSUBSCRIPT ⋅ italic_n start_POSTSUPERSCRIPT divide start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_β + italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG end_POSTSUPERSCRIPT ⋅ ( roman_log italic_n ) start_POSTSUPERSCRIPT 2 italic_κ - 1 end_POSTSUPERSCRIPT ,

for some small constant CD>0subscript𝐶𝐷0C_{D}>0italic_C start_POSTSUBSCRIPT italic_D end_POSTSUBSCRIPT > 0 and where κ=(d0+1)β/(2β+d0)𝜅subscript𝑑01𝛽2𝛽subscript𝑑0\kappa=(d_{0}+1)\beta/(2\beta+d_{0})italic_κ = ( italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 ) italic_β / ( 2 italic_β + italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ).

An examination of κ𝜅\kappaitalic_κ shows that κ1/2𝜅12\kappa\geq 1/2italic_κ ≥ 1 / 2 if β1/2𝛽12\beta\geq 1/2italic_β ≥ 1 / 2 and that κ>β𝜅𝛽\kappa>\betaitalic_κ > italic_β otherwise. Thereby, a standard rate of order n1/2superscript𝑛12n^{1/2}italic_n start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT for dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is achieved with parameter β=d0/2𝛽subscript𝑑02\beta=d_{0}/2italic_β = italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / 2. The fastest rate tends to the order n(logn)1𝑛superscript𝑛1n\cdot(\log n)^{-1}italic_n ⋅ ( roman_log italic_n ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT when β𝛽\betaitalic_β tends to zero. Although it is always possible to set β𝛽\betaitalic_β extremely close to zero in order to obtain the best rate for dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, one should keep in mind that the contraction rate may then be suboptimal, as discussed at the end of this section.

Theorem 3.1.

Let εn=Cεε¯n(logn)κsubscript𝜀𝑛normal-⋅subscript𝐶𝜀subscriptnormal-¯𝜀𝑛superscript𝑛𝜅\varepsilon_{n}=C_{\varepsilon}\cdot\underline{\varepsilon}_{n}(\log n)^{\kappa}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_C start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ⋅ under¯ start_ARG italic_ε end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( roman_log italic_n ) start_POSTSUPERSCRIPT italic_κ end_POSTSUPERSCRIPT with ε¯n=nβ/(2β+d0)subscriptnormal-¯𝜀𝑛superscript𝑛𝛽2𝛽subscript𝑑0\underline{\varepsilon}_{n}=n^{-\beta/(2\beta+d_{0})}under¯ start_ARG italic_ε end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_n start_POSTSUPERSCRIPT - italic_β / ( 2 italic_β + italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT, Cεsubscript𝐶𝜀C_{\varepsilon}italic_C start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT a large constant that depends on f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, and κ𝜅\kappaitalic_κ as in Assumption 3.5. Then, if the parameter space is embedded with the prior Πnsubscriptnormal-Π𝑛\Pi_{n}roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT and under Assumptions 3.1-3.5, the posterior contracts at rate (εn)nsubscriptsubscript𝜀𝑛𝑛(\varepsilon_{n})_{n\in\mathds{N}}( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n ∈ blackboard_N end_POSTSUBSCRIPT for density estimation (as defined in (2.2)) as well as for regression with fixed or random design (as defined in (2.3) and (2.4)).

An examination of εnsubscript𝜀𝑛\varepsilon_{n}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT shows that the contraction rate is improved as the smoothness β𝛽\betaitalic_β of f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT grows, unlike dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. This highlights a trade-off between the contraction rate and the growth of the design dimension: fast contraction rates imply slowly increasing dimension and conversely.

The proof of Theorem 3.1, postponed in the Appendix, in Section 5.1, combines the arguments of [Tok11] and [JT21].

4 Subspace recovery for the density estimation problem

In this section, we propose to recover the central subspace for the density estimation problem. To avoid identifiability issues caused by the spherical support, we suppose that the ambient dimension dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT does not depend on n𝑛nitalic_n. Hence, we denote the ambient dimension by d𝑑ditalic_d with dd0𝑑subscript𝑑0d\geq d_{0}italic_d ≥ italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and the central subspace by 𝒮:=(q*)1(E𝐝𝟎)assign𝒮superscriptsuperscript𝑞1subscript𝐸subscript𝐝0\mathcal{S}:=(q^{*})^{-1}(E_{\mathbf{d_{0}}})caligraphic_S := ( italic_q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) where q*superscript𝑞q^{*}italic_q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT corresponds to qn*subscriptsuperscript𝑞𝑛q^{*}_{n}italic_q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT in Assumption 3.1. This assumption is justified by the following considerations. If the ambient dimension grows with n𝑛nitalic_n, the Hellinger metric relative to the Lebesgue measure on 𝕌dnsubscript𝕌subscript𝑑𝑛\mathbb{U}_{d_{n}}blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT tends to give more importance to the center of the support, as n𝑛nitalic_n tends to infinity. For example, consider a parameter f0:𝕌2:subscript𝑓0subscript𝕌2f_{0}:\mathbb{U}_{2}\to\mathbb{R}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : blackboard_U start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT → blackboard_R in dimension two that is everywhere constant except in a small region on the border of 𝕌2subscript𝕌2\mathbb{U}_{2}blackboard_U start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, and such that the central subspace 𝒮nsubscript𝒮𝑛\mathcal{S}_{n}caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is of dimension two. The importance of this small region in the support 𝕌dnsubscript𝕌subscript𝑑𝑛\mathbb{U}_{d_{n}}blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT, in the Hellinger sense, decreases exponentially with n𝑛nitalic_n, way faster than the estimation of the true parameter fn*superscriptsubscript𝑓𝑛f_{n}^{*}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT in Theorem 3.1. Consequently, for sufficiently large n𝑛nitalic_n, a constant function f0:[0,1]:subscript𝑓001f_{0}:[0,1]\to\mathbb{R}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : [ 0 , 1 ] → blackboard_R together with some one-dimensional subspace 𝒮superscript𝒮\mathcal{S}^{\prime}caligraphic_S start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT characterize a density that is in the Hellinger ball of radius εnsubscript𝜀𝑛\varepsilon_{n}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT centered on fn*superscriptsubscript𝑓𝑛f_{n}^{*}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT; so we have no hope of recovering the true subspace by simply using the posterior consistency.

As a consequence, the true density p*superscript𝑝p^{*}italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, the parameter f*superscript𝑓f^{*}italic_f start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, and the central subspace do not depend on n𝑛nitalic_n anymore. The true density p*=pf*superscript𝑝subscript𝑝superscript𝑓p^{*}=p_{f^{*}}italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = italic_p start_POSTSUBSCRIPT italic_f start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT is characterized by f*superscript𝑓f^{*}italic_f start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT via the transformation (2.1). Moreover, f*superscript𝑓f^{*}italic_f start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is supposed to depend only on a d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-dimensional subspace of dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT and can be viewed as the sparse continuation of an underlying function f0𝒞(𝕌d0)subscript𝑓0𝒞subscript𝕌subscript𝑑0f_{0}\in\mathcal{C}(\mathbb{U}_{d_{0}})italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ caligraphic_C ( blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ). In the same way, p*superscript𝑝p^{*}italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT can be viewed as the sparse continuation of a function p0subscript𝑝0p_{0}italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT over 𝕌d0subscript𝕌subscript𝑑0\mathbb{U}_{d_{0}}blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, except that the renormalisation of p*superscript𝑝p^{*}italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT depends on d𝑑ditalic_d. Note that p0subscript𝑝0p_{0}italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is not necessarily a density on 𝕌d0subscript𝕌subscript𝑑0\mathbb{U}_{d_{0}}blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT so the notation h(f,g)𝑓𝑔h(f,g)italic_h ( italic_f , italic_g ) will designate from now on the L2superscript𝐿2L^{2}italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT-distance between the square roots of f𝑓fitalic_f and g𝑔gitalic_g even if f𝑓fitalic_f and g𝑔gitalic_g are not densities.

Let us introduce a few more notation. Let 𝒬*superscript𝒬\mathcal{Q}^{*}caligraphic_Q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT be the set of all optimal isometries:

𝒬*:={q𝒪d:q1(E𝐝𝟎)=𝒮},assignsuperscript𝒬conditional-set𝑞subscript𝒪𝑑superscript𝑞1subscript𝐸subscript𝐝0𝒮\mathcal{Q}^{*}\ :=\ \{q\in\mathcal{O}_{d}:q^{-1}(E_{\mathbf{d_{0}}})=\mathcal% {S}\},caligraphic_Q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT := { italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT : italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = caligraphic_S } ,

and, for d>d0superscript𝑑subscript𝑑0d^{\prime}>d_{0}italic_d start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT > italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, let 𝒬d*subscriptsuperscript𝒬superscript𝑑\mathcal{Q}^{*}_{d^{\prime}}caligraphic_Q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_d start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT be the set of isometries that send the subspace E𝐝subscript𝐸superscript𝐝E_{\mathbf{d^{\prime}}}italic_E start_POSTSUBSCRIPT bold_d start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT to a subspace containing 𝒮𝒮\mathcal{S}caligraphic_S:

𝒬d*:={q𝒪d:q1(E𝐝)𝒮}.assignsubscriptsuperscript𝒬superscript𝑑conditional-set𝑞subscript𝒪𝑑𝒮superscript𝑞1subscript𝐸superscript𝐝\mathcal{Q}^{*}_{d^{\prime}}\ :=\ \{q\in\mathcal{O}_{d}:q^{-1}(E_{\mathbf{d^{% \prime}}})\supset\mathcal{S}\}.caligraphic_Q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_d start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT := { italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT : italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT bold_d start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) ⊃ caligraphic_S } .

Recovering 𝒮𝒮\mathcal{S}caligraphic_S means the following: for some rate δn0subscript𝛿𝑛0\delta_{n}\to 0italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT → 0,

n*[Πn(Γd0 or minq𝒬*|Θq|δn|X1,,Xn)]n+0,superscriptsubscript𝑛subscriptΠ𝑛Γsubscript𝑑0 or subscript𝑞superscript𝒬normΘ𝑞conditionalsubscript𝛿𝑛subscript𝑋1subscript𝑋𝑛subscript𝑛0\operatorname{\mathds{P}}_{n}^{*}\left[\Pi_{n}\left(\Gamma\neq d_{0}\text{ or % }\min_{q\in\mathcal{Q}^{*}}{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|% \Theta-q\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}\geq\delta_{n}\ |% \ X_{1},\ldots,X_{n}\right)\right]\mathop{\longrightarrow}\limits_{n\to+\infty% }0,blackboard_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT [ roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( roman_Γ ≠ italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT or roman_min start_POSTSUBSCRIPT italic_q ∈ caligraphic_Q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | | | roman_Θ - italic_q | | | ≥ italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ] ⟶ start_POSTSUBSCRIPT italic_n → + ∞ end_POSTSUBSCRIPT 0 ,

where ||||||{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\cdot\right|\kern-1.07639pt% \right|\kern-1.07639pt\right|}| | | ⋅ | | | is the operator norm with respect to the Euclidean distance in dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT. However, under the assumptions of Theorem 3.1, the only information we have on the true subspace is posterior consistency to the density p*superscript𝑝p^{*}italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT with rate εnsubscript𝜀𝑛\varepsilon_{n}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. This will only allow us to recover a subspace of dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT containing 𝒮𝒮\mathcal{S}caligraphic_S. A crucial assumption to eliminate the subspaces of dimension smaller than d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and the subspaces that do not contain 𝒮𝒮\mathcal{S}caligraphic_S is to suppose that p0subscript𝑝0p_{0}italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is non-constant in all directions. More precisely, the default of constancy for each direction has to be detectable in Hellinger distance, as formalized in the following assumption.

Assumption 4.1.

There exist a constant D𝐷Ditalic_D and a window size L<1𝐿1L<1italic_L < 1 such that for all vector line ΔΔ\Deltaroman_Δ in d0superscriptsubscript𝑑0\mathbb{R}^{d_{0}}blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT (directed by a unit vector 𝚫𝚫\boldsymbol{\Delta}bold_Δ), there exists od0(1L)𝑜subscriptsubscript𝑑01𝐿o\in\mathcal{B}_{d_{0}}(1-L)italic_o ∈ caligraphic_B start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( 1 - italic_L ) such that for all 0<lL0𝑙𝐿0<l\leq L0 < italic_l ≤ italic_L, for all td0(L/2)+o𝑡subscriptsubscript𝑑0𝐿2𝑜t\in\mathcal{B}_{d_{0}}(L/2)+oitalic_t ∈ caligraphic_B start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_L / 2 ) + italic_o, and for all constant c>0𝑐0c>0italic_c > 0,

h2(p0|I;c)Dl2,h^{2}({p_{0}}_{|I};c)\ \geq\ D\cdot l^{2},italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUBSCRIPT | italic_I end_POSTSUBSCRIPT ; italic_c ) ≥ italic_D ⋅ italic_l start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ,

where I:=]o+tl2𝚫;o+t+l2𝚫[I:=\ ]o+t-\frac{l}{2}\boldsymbol{\Delta};o+t+\frac{l}{2}\boldsymbol{\Delta}[italic_I := ] italic_o + italic_t - divide start_ARG italic_l end_ARG start_ARG 2 end_ARG bold_Δ ; italic_o + italic_t + divide start_ARG italic_l end_ARG start_ARG 2 end_ARG bold_Δ [.

Assumption 4.1 seems a bit technical at first glance but it can be shown that it is satisfied as soon as p0subscript𝑝0p_{0}italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is differentiable over 𝕌d0subscript𝕌subscript𝑑0\mathbb{U}_{d_{0}}blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT with d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT points such that the gradients at these points are linearly independent.

Theorem 4.1.

Under Assumption 4.1 and the assumptions of Theorem 3.1, we have, for some rate (δn)nsubscriptsubscript𝛿𝑛𝑛(\delta_{n})_{n}( italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT tending to zero,

(4.1) Πn(Γ<d0|X1,,Xn)n+0,in n*-probability,subscriptΠ𝑛Γbrasubscript𝑑0subscript𝑋1subscript𝑋𝑛subscript𝑛0in superscriptsubscript𝑛-probability\displaystyle\Pi_{n}\left(\Gamma<d_{0}\ |\ X_{1},\ldots,X_{n}\right)\mathop{% \longrightarrow}\limits_{n\to+\infty}0,\qquad\text{in }\operatorname{\mathds{P% }}_{n}^{*}\text{-probability},roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( roman_Γ < italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ⟶ start_POSTSUBSCRIPT italic_n → + ∞ end_POSTSUBSCRIPT 0 , in blackboard_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT -probability ,
(4.2) Πn(Γ=d0 and minq𝒬*|Θq|δn|X1,,Xn)n+0,in n*-probability,subscriptΠ𝑛Γsubscript𝑑0 and subscript𝑞superscript𝒬normΘ𝑞conditionalsubscript𝛿𝑛subscript𝑋1subscript𝑋𝑛subscript𝑛0in superscriptsubscript𝑛-probability\displaystyle\Pi_{n}\left(\Gamma=d_{0}\text{ and }\min_{q\in\mathcal{Q}^{*}}{% \left|\kern-1.07639pt\left|\kern-1.07639pt\left|\Theta-q\right|\kern-1.07639pt% \right|\kern-1.07639pt\right|}\geq\delta_{n}\ |\ X_{1},\ldots,X_{n}\right)% \mathop{\longrightarrow}\limits_{n\to+\infty}0,\qquad\text{in }\operatorname{% \mathds{P}}_{n}^{*}\text{-probability},roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( roman_Γ = italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and roman_min start_POSTSUBSCRIPT italic_q ∈ caligraphic_Q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | | | roman_Θ - italic_q | | | ≥ italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ⟶ start_POSTSUBSCRIPT italic_n → + ∞ end_POSTSUBSCRIPT 0 , in blackboard_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT -probability ,
(4.3) Πn(Γ>d0 and minq𝒬Γ*|Θq|δn|X1,,Xn)n+0,in n*-probability.subscriptΠ𝑛Γsubscript𝑑0 and subscript𝑞subscriptsuperscript𝒬ΓnormΘ𝑞conditionalsubscript𝛿𝑛subscript𝑋1subscript𝑋𝑛subscript𝑛0in superscriptsubscript𝑛-probability\displaystyle\Pi_{n}\left(\Gamma>d_{0}\text{ and }\min_{q\in\mathcal{Q}^{*}_{% \Gamma}}{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\Theta-q\right|\kern-% 1.07639pt\right|\kern-1.07639pt\right|}\geq\delta_{n}\ |\ X_{1},\ldots,X_{n}% \right)\mathop{\longrightarrow}\limits_{n\to+\infty}0,\qquad\text{in }% \operatorname{\mathds{P}}_{n}^{*}\text{-probability}.roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( roman_Γ > italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and roman_min start_POSTSUBSCRIPT italic_q ∈ caligraphic_Q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT end_POSTSUBSCRIPT | | | roman_Θ - italic_q | | | ≥ italic_δ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ⟶ start_POSTSUBSCRIPT italic_n → + ∞ end_POSTSUBSCRIPT 0 , in blackboard_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT -probability .

Theorem 4.1 ensures that the central subspace 𝒮𝒮\mathcal{S}caligraphic_S can be recovered as soon as the intrinsic dimension d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is known. Subspaces of dimension smaller than d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT are also eliminated but the theorem does not reject those of dimension greater than d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. We conjecture that the prior mass on those spaces tends to vanish, for reasons similar to those exposed in [JT21]. Indeed, introducing a penalization on larger dimensions if necessary, it should be possible to show that the posterior cannot contract as fast as the minimax rate for d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT if a subspace of greater dimension is chosen. As discussed in the introduction of this section, the estimation of the central subspace is made under the assumption that d𝑑ditalic_d is fixed with n𝑛nitalic_n mainly because of the identifiability issue caused by the ellipsoid support. We believe that this restriction can be relaxed by extending the support 𝕌dsubscript𝕌𝑑\mathbb{U}_{d}blackboard_U start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT to the full ambient space dsuperscript𝑑\mathbb{R}^{d}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, as in [JT21]. In this case, the square over which we integrate the Hellinger distance in the proof of Theorem 4.1 can be taken as the product space of a square of side L𝐿Litalic_L in directions 𝚫𝚫\boldsymbol{\Delta}bold_Δ and 𝚲𝚲\boldsymbol{\Lambda}bold_Λ times d2superscript𝑑2\mathbb{R}^{d-2}blackboard_R start_POSTSUPERSCRIPT italic_d - 2 end_POSTSUPERSCRIPT. Then, the integrated error should no longer depend on d𝑑ditalic_d and consistency to the true subspace should follow. Further investigations in this direction might be worthwhile.

The proof of Theorem 4.1 is postponed in Appendix 5.2.

Acknowledgments

We acknowledge the support of the French Agence Nationale de la Recherche (ANR) under reference ANR-21-CE40-0007 (GAP Project).

5 Appendix

5.1 Proof of Theorem 3.1

As a reminder, we first exhibit some facts about the convergence rate:

(5.1) εn=Cεnβ2β+d0(logn)κ,nεn2=Cε2nd02β+d0(logn)2κ.formulae-sequencesubscript𝜀𝑛subscript𝐶𝜀superscript𝑛𝛽2𝛽subscript𝑑0superscript𝑛𝜅𝑛superscriptsubscript𝜀𝑛2superscriptsubscript𝐶𝜀2superscript𝑛subscript𝑑02𝛽subscript𝑑0superscript𝑛2𝜅\varepsilon_{n}\ =\ C_{\varepsilon}\cdot n^{-\tfrac{\beta}{2\beta+d_{0}}}\cdot% (\log n)^{\kappa},\qquad n\varepsilon_{n}^{2}\ =\ C_{\varepsilon}^{2}\cdot n^{% \tfrac{d_{0}}{2\beta+d_{0}}}\cdot(\log n)^{2\kappa}.italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_C start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ⋅ italic_n start_POSTSUPERSCRIPT - divide start_ARG italic_β end_ARG start_ARG 2 italic_β + italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG end_POSTSUPERSCRIPT ⋅ ( roman_log italic_n ) start_POSTSUPERSCRIPT italic_κ end_POSTSUPERSCRIPT , italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_C start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⋅ italic_n start_POSTSUPERSCRIPT divide start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_β + italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG end_POSTSUPERSCRIPT ⋅ ( roman_log italic_n ) start_POSTSUPERSCRIPT 2 italic_κ end_POSTSUPERSCRIPT .

So εnsubscript𝜀𝑛\varepsilon_{n}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is a large multiple of the minimax rate times a logarithm factor. The constant Cεsubscript𝐶𝜀C_{\varepsilon}italic_C start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT is chosen to be arbitrarily large in order to absorb undesired terms in the proof.


The proof of Theorem 3.1 is based on Theorem 2.1 in [GGV00]. The general outline is a combination of the arguments of [Tok11] (itself derived from [VV09]) and [JT21]. Concretely, it suffices to show that there exists a sequence of sets 𝔹n𝒞(𝕌dn)subscript𝔹𝑛𝒞subscript𝕌subscript𝑑𝑛\mathbb{B}_{n}\subset\mathcal{C}(\mathbb{U}_{d_{n}})blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⊂ caligraphic_C ( blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (referred to as a sieve), such that the following three conditions hold for all sufficiently large n𝑛nitalic_n:

(5.2) Πn(WA,Γ,Θfn*2εn)exp(nεn2),subscriptΠ𝑛subscriptnormsuperscript𝑊𝐴ΓΘsuperscriptsubscript𝑓𝑛2subscript𝜀𝑛𝑛superscriptsubscript𝜀𝑛2\displaystyle\Pi_{n}\left(\left\|{W^{A,\Gamma,\Theta}-f_{n}^{*}}\right\|_{% \infty}\leq 2\varepsilon_{n}\right)\ \geq\ \exp(-n\varepsilon_{n}^{2}),roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( ∥ italic_W start_POSTSUPERSCRIPT italic_A , roman_Γ , roman_Θ end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ 2 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ≥ roman_exp ( - italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ,
(5.3) Πn(WA,Γ,Θ𝔹n)exp(5nεn2),subscriptΠ𝑛superscript𝑊𝐴ΓΘsubscript𝔹𝑛5𝑛superscriptsubscript𝜀𝑛2\displaystyle\Pi_{n}\left(W^{A,\Gamma,\Theta}\notin\mathbb{B}_{n}\right)\ \leq% \ \exp(-5n\varepsilon_{n}^{2}),roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_W start_POSTSUPERSCRIPT italic_A , roman_Γ , roman_Θ end_POSTSUPERSCRIPT ∉ blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ≤ roman_exp ( - 5 italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ,
(5.4) logN(3εn,𝔹n,)nεn2.\displaystyle\log N\left(3\varepsilon_{n},\mathbb{B}_{n},\left\|{\cdot}\right% \|_{\infty}\right)\ \leq\ n\varepsilon_{n}^{2}.roman_log italic_N ( 3 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ) ≤ italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT .

This is the purpose of the next sections. The first condition (5.2), referred to as prior mass condition, ensures that the prior puts a sufficient amount of mass around the true parameter. Condition (5.3), called sieve condition, forces the sieve 𝔹nsubscript𝔹𝑛\mathbb{B}_{n}blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT to capture most of the mass of the prior, while the entropy condition (5.4) constrains its size. These three conditions map one to one with the conditions of Theorem 2.1 in [GGV00], as showed in [VV08] for density estimation and regression with fixed design. For regression with random design, we recall in the next section some arguments spread out in Bayesian literature.

5.1.1 Regression with random design

Here, we show that Theorem 2.1 in [GGV00] can be applied in the regression with random design setting, as soon as Conditions (5.2), (5.3), and (5.4) are satisfied. The procedure consists in showing that the posterior contracts to the density of a pair (Xi,Yi)subscript𝑋𝑖subscript𝑌𝑖(X_{i},Y_{i})( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) and then to retrieve fn*subscriptsuperscript𝑓𝑛f^{*}_{n}italic_f start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT from this density. For a function f:𝕌dn:𝑓subscript𝕌subscript𝑑𝑛f:\mathbb{U}_{d_{n}}\to\mathbb{R}italic_f : blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT → blackboard_R, we define Pf:𝕌dn×+,(x,y)Gn(x)Φf(x),σ(y):subscript𝑃𝑓formulae-sequencesubscript𝕌subscript𝑑𝑛subscriptmaps-to𝑥𝑦subscript𝐺𝑛𝑥subscriptΦ𝑓𝑥𝜎𝑦P_{f}:\mathbb{U}_{d_{n}}\times\mathbb{R}\to\mathbb{R}_{+},\ (x,y)\mapsto G_{n}% (x)\cdot\Phi_{f(x),\sigma}(y)italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT : blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT × blackboard_R → blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT , ( italic_x , italic_y ) ↦ italic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_x ) ⋅ roman_Φ start_POSTSUBSCRIPT italic_f ( italic_x ) , italic_σ end_POSTSUBSCRIPT ( italic_y ), where Φμ,σsubscriptΦ𝜇𝜎\Phi_{\mu,\sigma}roman_Φ start_POSTSUBSCRIPT italic_μ , italic_σ end_POSTSUBSCRIPT is the density of a univariate Gaussian variable with mean μ𝜇\muitalic_μ and standard deviation σ𝜎\sigmaitalic_σ and Gnsubscript𝐺𝑛G_{n}italic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is the density of one covariate. Then, the density of one observation (X,Y)𝑋𝑌(X,Y)( italic_X , italic_Y ) under regression with random design is Pfn*subscript𝑃superscriptsubscript𝑓𝑛P_{f_{n}^{*}}italic_P start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT. We first prove that Condition (5.2) implies Condition (2.4) in [GGV00] with C=1𝐶1C=1italic_C = 1. Detailed calculations can be found in [FS23], Section A.2. We have to compare the uniform neighborhood of fn*superscriptsubscript𝑓𝑛f_{n}^{*}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT with the Kullback-Leibler neighborhood

B2(Pfn*;ε):={g:KL(Pfn*,Pg)ε2,V2,0(Pfn*,Pg)ε2},assignsubscript𝐵2subscript𝑃superscriptsubscript𝑓𝑛𝜀conditional-set𝑔formulae-sequenceKLsubscript𝑃superscriptsubscript𝑓𝑛subscript𝑃𝑔superscript𝜀2subscript𝑉20subscript𝑃superscriptsubscript𝑓𝑛subscript𝑃𝑔superscript𝜀2B_{2}(P_{f_{n}^{*}};\varepsilon):=\{g:\mathrm{KL}(P_{f_{n}^{*}},P_{g})\leq% \varepsilon^{2},\ V_{2,0}(P_{f_{n}^{*}},P_{g})\leq\varepsilon^{2}\},italic_B start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ; italic_ε ) := { italic_g : roman_KL ( italic_P start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) ≤ italic_ε start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_V start_POSTSUBSCRIPT 2 , 0 end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) ≤ italic_ε start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT } ,

where KL(Pf,Pg):=Pf[log(dPf/dPg)]assignKLsubscript𝑃𝑓subscript𝑃𝑔subscript𝑃𝑓delimited-[]𝑑subscript𝑃𝑓𝑑subscript𝑃𝑔\mathrm{KL}(P_{f},P_{g}):=P_{f}\left[\log(dP_{f}/dP_{g})\right]roman_KL ( italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) := italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT [ roman_log ( italic_d italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT / italic_d italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) ] is the Kullback-Leibler divergence between Pfsubscript𝑃𝑓P_{f}italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT and Pgsubscript𝑃𝑔P_{g}italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT and V2,0(Pf,Pg):=Pf[log(dPf/dPg)KL(Pf,Pg)]2assignsubscript𝑉20subscript𝑃𝑓subscript𝑃𝑔subscript𝑃𝑓superscriptdelimited-[]𝑑subscript𝑃𝑓𝑑subscript𝑃𝑔KLsubscript𝑃𝑓subscript𝑃𝑔2V_{2,0}(P_{f},P_{g}):=P_{f}\left[\log(dP_{f}/dP_{g})-\mathrm{KL}(P_{f},P_{g})% \right]^{2}italic_V start_POSTSUBSCRIPT 2 , 0 end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) := italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT [ roman_log ( italic_d italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT / italic_d italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) - roman_KL ( italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT is the Kullback-Leibler variation. Using the following identities from [FS23]

KL(Pf,Pg)KLsubscript𝑃𝑓subscript𝑃𝑔\displaystyle\mathrm{KL}(P_{f},P_{g})\ roman_KL ( italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) =12σ2fg2,Gn2,absent12superscript𝜎2superscriptsubscriptnorm𝑓𝑔2subscript𝐺𝑛2\displaystyle=\ \frac{1}{2\sigma^{2}}\left\|{f-g}\right\|_{2,G_{n}}^{2},= divide start_ARG 1 end_ARG start_ARG 2 italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ∥ italic_f - italic_g ∥ start_POSTSUBSCRIPT 2 , italic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ,
V2(Pf,Pg)subscript𝑉2subscript𝑃𝑓subscript𝑃𝑔\displaystyle V_{2}(P_{f},P_{g})\ italic_V start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) :=Pf[log(dPfdPg)2]=1σ2fg2,Gn2+(12σ2(fg)22,Gn)2,\displaystyle:=\ P_{f}\left[\log\left(\frac{dP_{f}}{dP_{g}}\right)^{2}\right]% \ =\ \frac{1}{\sigma^{2}}\left\|{f-g}\right\|_{2,G_{n}}^{2}\ +\ \left(\frac{1}% {2\sigma^{2}}\left\|{(f-g)^{2}}\right\|_{2,G_{n}}\right)^{2},:= italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT [ roman_log ( divide start_ARG italic_d italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_ARG start_ARG italic_d italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] = divide start_ARG 1 end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ∥ italic_f - italic_g ∥ start_POSTSUBSCRIPT 2 , italic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ( divide start_ARG 1 end_ARG start_ARG 2 italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ∥ ( italic_f - italic_g ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 , italic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ,
V2,0(Pf,Pg)subscript𝑉20subscript𝑃𝑓subscript𝑃𝑔\displaystyle V_{2,0}(P_{f},P_{g})\ italic_V start_POSTSUBSCRIPT 2 , 0 end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) =V2(Pf,Pg)KL(Pf,Pg)2,absentsubscript𝑉2subscript𝑃𝑓subscript𝑃𝑔KLsuperscriptsubscript𝑃𝑓subscript𝑃𝑔2\displaystyle=\ V_{2}(P_{f},P_{g})\ -\ \mathrm{KL}(P_{f},P_{g})^{2},= italic_V start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) - roman_KL ( italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ,

we deduce that, if fg2εsubscriptnorm𝑓𝑔2𝜀\left\|{f-g}\right\|_{\infty}\leq 2\varepsilon∥ italic_f - italic_g ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ 2 italic_ε with 2ε<12𝜀12\varepsilon<12 italic_ε < 1, then

KL(Pf,Pg)KLsubscript𝑃𝑓subscript𝑃𝑔\displaystyle\mathrm{KL}(P_{f},P_{g})\ roman_KL ( italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) 12σ2fg22ε2σ2,absent12superscript𝜎2superscriptsubscriptnorm𝑓𝑔22superscript𝜀2superscript𝜎2\displaystyle\leq\ \frac{1}{2\sigma^{2}}\left\|{f-g}\right\|_{\infty}^{2}\ % \leq\ \frac{2\varepsilon^{2}}{\sigma^{2}},≤ divide start_ARG 1 end_ARG start_ARG 2 italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ∥ italic_f - italic_g ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ divide start_ARG 2 italic_ε start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ,
V2,0(Pf,Pg)subscript𝑉20subscript𝑃𝑓subscript𝑃𝑔\displaystyle V_{2,0}(P_{f},P_{g})\ italic_V start_POSTSUBSCRIPT 2 , 0 end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) 4Cσ2ε2,absent4superscriptsubscript𝐶𝜎2superscript𝜀2\displaystyle\leq\ 4C_{\sigma}^{2}\cdot\varepsilon^{2},≤ 4 italic_C start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⋅ italic_ε start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ,

where Cσ:=1/σ2+1/(4σ4)assignsubscript𝐶𝜎1superscript𝜎214superscript𝜎4C_{\sigma}:=\sqrt{1/\sigma^{2}+1/(4\sigma^{4})}italic_C start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT := square-root start_ARG 1 / italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + 1 / ( 4 italic_σ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT ) end_ARG. Consequently, according to (5.2), and multiplying εnsubscript𝜀𝑛\varepsilon_{n}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT by 4Cσ24superscriptsubscript𝐶𝜎24C_{\sigma}^{2}4 italic_C start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT if necessary, we have

Πn(B2(Pfn*;εn))exp(14Cσ2nεn2).subscriptΠ𝑛subscript𝐵2subscript𝑃superscriptsubscript𝑓𝑛subscript𝜀𝑛14superscriptsubscript𝐶𝜎2𝑛superscriptsubscript𝜀𝑛2\Pi_{n}\left(B_{2}(P_{f_{n}^{*}};\varepsilon_{n})\right)\ \geq\ \exp\left(-% \frac{1}{4C_{\sigma}^{2}}n\varepsilon_{n}^{2}\right).roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_B start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_P start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ; italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ) ≥ roman_exp ( - divide start_ARG 1 end_ARG start_ARG 4 italic_C start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) .

One can remark that for Condition (2.4) in [GGV00] to be satisfied, we must have (4Cσ2)11superscript4superscriptsubscript𝐶𝜎211(4C_{\sigma}^{2})^{-1}\leq 1( 4 italic_C start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ≤ 1 which is the case as soon as σ2𝜎2\sigma\leq 2italic_σ ≤ 2.

Condition (2.3) in [GGV00] is immediately deduced from (5.3). For Condition (2.4), we use the inequality

(5.5) h(Pf,Pg)12σfg,subscript𝑃𝑓subscript𝑃𝑔12𝜎subscriptnorm𝑓𝑔h(P_{f},P_{g})\ \leq\ \frac{1}{2\sigma}\left\|{f-g}\right\|_{\infty},italic_h ( italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) ≤ divide start_ARG 1 end_ARG start_ARG 2 italic_σ end_ARG ∥ italic_f - italic_g ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ,

see again [FS23] for details. Then, assuming that σ1𝜎1\sigma\geq 1italic_σ ≥ 1, we have, according to (5.4) and multiplying εnsubscript𝜀𝑛\varepsilon_{n}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT by 3 if necessary,

D(εn,𝔹n,h)N(εn2,𝔹n,h)N(εn2σ,𝔹n,h)N(εn,𝔹n,)exp(nεn2),D\left(\varepsilon_{n},\mathbb{B}_{n},h\right)\ \leq\ N\left(\frac{\varepsilon% _{n}}{2},\mathbb{B}_{n},h\right)\ \leq\ N\left(\frac{\varepsilon_{n}}{2\sigma}% ,\mathbb{B}_{n},h\right)\ \leq\ N\left(\varepsilon_{n},\mathbb{B}_{n},\left\|{% \cdot}\right\|_{\infty}\right)\ \leq\ \exp(n\varepsilon_{n}^{2}),italic_D ( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_h ) ≤ italic_N ( divide start_ARG italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG , blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_h ) ≤ italic_N ( divide start_ARG italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_σ end_ARG , blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_h ) ≤ italic_N ( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ) ≤ roman_exp ( italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ,

where the first inequality comes from the definition of the packing number D𝐷Ditalic_D and the covering number N𝑁Nitalic_N and where the third inequality follows from (5.5). Theorem 2.1 in [GGV00] then ensures posterior consistency to Pfn*subscript𝑃superscriptsubscript𝑓𝑛P_{f_{n}^{*}}italic_P start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT at rate εnsubscript𝜀𝑛\varepsilon_{n}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT in Hellinger distance. Now, because we also have the converse inequality

h2(Pf,Pg)14σ2exp(Q22σ2)fg2,Gn2if fQ and gQformulae-sequencesuperscript2subscript𝑃𝑓subscript𝑃𝑔14superscript𝜎2superscript𝑄22superscript𝜎2superscriptsubscriptnorm𝑓𝑔2subscript𝐺𝑛2if subscriptnorm𝑓𝑄 and subscriptnorm𝑔𝑄h^{2}(P_{f},P_{g})\ \geq\ \frac{1}{4\sigma^{2}}\exp\left(-\frac{Q^{2}}{2\sigma% ^{2}}\right)\cdot\left\|{f-g}\right\|_{2,G_{n}}^{2}\qquad\text{if }\left\|{f}% \right\|_{\infty}\leq Q\text{ and }\left\|{g}\right\|_{\infty}\leq Qitalic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) ≥ divide start_ARG 1 end_ARG start_ARG 4 italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG roman_exp ( - divide start_ARG italic_Q start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ) ⋅ ∥ italic_f - italic_g ∥ start_POSTSUBSCRIPT 2 , italic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT if ∥ italic_f ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_Q and ∥ italic_g ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_Q

and that h(PfQ,PgQ)h(Pf,Pg)subscript𝑃superscript𝑓𝑄subscript𝑃superscript𝑔𝑄subscript𝑃𝑓subscript𝑃𝑔h(P_{f^{Q}},P_{g^{Q}})\leq h(P_{f},P_{g})italic_h ( italic_P start_POSTSUBSCRIPT italic_f start_POSTSUPERSCRIPT italic_Q end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_g start_POSTSUPERSCRIPT italic_Q end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) ≤ italic_h ( italic_P start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ) when nothing is assumed on f𝑓fitalic_f and g𝑔gitalic_g with fQ=(fQ)Qf^{Q}=(f\vee-Q)\wedge Qitalic_f start_POSTSUPERSCRIPT italic_Q end_POSTSUPERSCRIPT = ( italic_f ∨ - italic_Q ) ∧ italic_Q, we obtain posterior contraction to fn*superscriptsubscript𝑓𝑛f_{n}^{*}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT at rate εnsubscript𝜀𝑛\varepsilon_{n}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT in the L2(Gn)superscript𝐿2subscript𝐺𝑛L^{2}(G_{n})italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT )-distance:

n*[Πn(g𝒞(𝕌dn):fn*QgQ2,Gn>DσQεn|(X1,Y1),,(Xn,Yn))]n+0,\operatorname{\mathds{P}}_{n}^{*}\left[\Pi_{n}\left(g\in\mathcal{C}(\mathbb{U}% _{d_{n}}):\bigl{\|}{{f_{n}^{*}}^{Q}-g^{Q}}\bigr{\|}_{2,G_{n}}>D_{\sigma}^{Q}% \cdot\varepsilon_{n}\ |\ (X_{1},Y_{1}),\ldots,(X_{n},Y_{n})\right)\right]% \mathop{\longrightarrow}\limits_{n\to+\infty}0,blackboard_P start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT [ roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_g ∈ caligraphic_C ( blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) : ∥ italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT italic_Q end_POSTSUPERSCRIPT - italic_g start_POSTSUPERSCRIPT italic_Q end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 , italic_G start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT > italic_D start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_Q end_POSTSUPERSCRIPT ⋅ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | ( italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , ( italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ) ] ⟶ start_POSTSUBSCRIPT italic_n → + ∞ end_POSTSUBSCRIPT 0 ,

where DσQ:=M2σexp(Q2/(4σ2))assignsuperscriptsubscript𝐷𝜎𝑄𝑀2𝜎superscript𝑄24superscript𝜎2D_{\sigma}^{Q}:=M\cdot 2\sigma\cdot\exp\left(Q^{2}/(4\sigma^{2})\right)italic_D start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_Q end_POSTSUPERSCRIPT := italic_M ⋅ 2 italic_σ ⋅ roman_exp ( italic_Q start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / ( 4 italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ).

Remark 5.1.

The restriction to [1,2]12[1,2][ 1 , 2 ] for the standard deviation σ𝜎\sigmaitalic_σ can be relaxed. In fact, if σ>2𝜎2\sigma>2italic_σ > 2, then it suffices to consider Theorem 2.1 in [GGV00] with C=(4Cσ2)1𝐶superscript4subscriptsuperscript𝐶2𝜎1C=(4C^{2}_{\sigma})^{-1}italic_C = ( 4 italic_C start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_σ end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. Condition (2.4) in [GGV00] is then immediately satisfied and, for Condition (2.3), the proof of (5.3) can be adapted to replace 5 by 4+C4𝐶4+C4 + italic_C. On the contrary, if 0<σ<10𝜎10<\sigma<10 < italic_σ < 1, Condition (2.2) in [GGV00] can be satisfied by multiplying εnsubscript𝜀𝑛\varepsilon_{n}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT by σ1superscript𝜎1\sigma^{-1}italic_σ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT.


5.1.2 Prior mass condition (5.2)

We verify here that Πn(WA,Γ,Θfn*2εn)exp(nεn2)subscriptΠ𝑛subscriptnormsuperscript𝑊𝐴ΓΘsuperscriptsubscript𝑓𝑛2subscript𝜀𝑛𝑛superscriptsubscript𝜀𝑛2\Pi_{n}\left(\left\|{W^{A,\Gamma,\Theta}-f_{n}^{*}}\right\|_{\infty}\leq 2% \varepsilon_{n}\right)\ \geq\ \exp(-n\varepsilon_{n}^{2})roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( ∥ italic_W start_POSTSUPERSCRIPT italic_A , roman_Γ , roman_Θ end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ 2 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ≥ roman_exp ( - italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ). Let us introduce the following notation.

Notation.

For q𝒪dn𝑞subscript𝒪subscript𝑑𝑛q\in\mathcal{O}_{d_{n}}italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT, we denote by fn,q:𝕌dn:subscript𝑓𝑛𝑞subscript𝕌subscript𝑑𝑛f_{n,q}:\mathbb{U}_{d_{n}}\to\mathbb{R}italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT : blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT → blackboard_R the function such that fn,q(x)=f0((qx)𝐝𝟎)subscript𝑓𝑛𝑞𝑥subscript𝑓0subscript𝑞𝑥subscript𝐝0f_{n,q}(x)=f_{0}\left((qx)_{\mathbf{d_{0}}}\right)italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT ( italic_x ) = italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ( italic_q italic_x ) start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ), for all x𝕌dn𝑥subscript𝕌subscript𝑑𝑛x\in\mathbb{U}_{d_{n}}italic_x ∈ blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Hence, fn*=fn,qn*superscriptsubscript𝑓𝑛subscript𝑓𝑛superscriptsubscript𝑞𝑛f_{n}^{*}=f_{n,q_{n}^{*}}italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = italic_f start_POSTSUBSCRIPT italic_n , italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT.

We first reduce the problem to deterministic dimension and direction by conditioning with Γ=d0Γsubscript𝑑0\Gamma=d_{0}roman_Γ = italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and integrating over 𝒪dnsubscript𝒪subscript𝑑𝑛\mathcal{O}_{d_{n}}caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT:

Πn(WA,Γ,Θfn*2εn)πΓ(d0)𝒪dnΠn(WA,d0,qfn*2εn)𝑑q.subscriptΠ𝑛subscriptdelimited-∥∥superscript𝑊𝐴ΓΘsuperscriptsubscript𝑓𝑛2subscript𝜀𝑛subscript𝜋Γsubscript𝑑0subscriptsubscript𝒪subscript𝑑𝑛subscriptΠ𝑛subscriptdelimited-∥∥superscript𝑊𝐴subscript𝑑0𝑞superscriptsubscript𝑓𝑛2subscript𝜀𝑛differential-d𝑞\Pi_{n}\left(\bigl{\|}{W^{A,\Gamma,\Theta}-f_{n}^{*}}\bigr{\|}_{\infty}\leq 2% \varepsilon_{n}\right)\ \geq\ \pi_{\Gamma}(d_{0})\int_{\mathcal{O}_{d_{n}}}\Pi% _{n}\left(\bigl{\|}{W^{A,d_{0},q}-f_{n}^{*}}\bigr{\|}_{\infty}\leq 2% \varepsilon_{n}\right)dq.roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( ∥ italic_W start_POSTSUPERSCRIPT italic_A , roman_Γ , roman_Θ end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ 2 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ≥ italic_π start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ∫ start_POSTSUBSCRIPT caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( ∥ italic_W start_POSTSUPERSCRIPT italic_A , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_q end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ 2 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) italic_d italic_q .

Now, we want to bound from below the integrand on a significant subset of 𝒪dnsubscript𝒪subscript𝑑𝑛\mathcal{O}_{d_{n}}caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT. We remark that if q𝒪dn𝑞subscript𝒪subscript𝑑𝑛q\in\mathcal{O}_{d_{n}}italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT is such that fn*fn,qεnsubscriptnormsuperscriptsubscript𝑓𝑛subscript𝑓𝑛𝑞subscript𝜀𝑛\left\|{f_{n}^{*}-f_{n,q}}\right\|_{\infty}\leq\varepsilon_{n}∥ italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, then

Πn(WA,d0,qfn*2εn)Πn(WA,d0,qfn,qεn).subscriptΠ𝑛subscriptdelimited-∥∥superscript𝑊𝐴subscript𝑑0𝑞superscriptsubscript𝑓𝑛2subscript𝜀𝑛subscriptΠ𝑛subscriptdelimited-∥∥superscript𝑊𝐴subscript𝑑0𝑞subscript𝑓𝑛𝑞subscript𝜀𝑛\Pi_{n}\left(\bigl{\|}{W^{A,d_{0},q}-f_{n}^{*}}\bigr{\|}_{\infty}\leq 2% \varepsilon_{n}\right)\ \geq\ \Pi_{n}\left(\bigl{\|}{W^{A,d_{0},q}-f_{n,q}}% \bigr{\|}_{\infty}\leq\varepsilon_{n}\right).roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( ∥ italic_W start_POSTSUPERSCRIPT italic_A , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_q end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ 2 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ≥ roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( ∥ italic_W start_POSTSUPERSCRIPT italic_A , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_q end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) .

We show that the right-hand side is bounded from below by exp(12nεn2)12𝑛superscriptsubscript𝜀𝑛2\exp(-\frac{1}{2}n\varepsilon_{n}^{2})roman_exp ( - divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) and then, we bound from below the measure of the set of q𝒪dn𝑞subscript𝒪subscript𝑑𝑛q\in\mathcal{O}_{d_{n}}italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT satisfying fn*fn,qεnsubscriptnormsuperscriptsubscript𝑓𝑛subscript𝑓𝑛𝑞subscript𝜀𝑛\left\|{f_{n}^{*}-f_{n,q}}\right\|_{\infty}\leq\varepsilon_{n}∥ italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT.

From now on, we use without specification the constants of Lemmas 5.5, 5.6, and 5.7 and we fix a0>1subscript𝑎01a_{0}>1italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT > 1. Let a[Kn,2Kn]𝑎subscript𝐾𝑛2subscript𝐾𝑛a\in[K_{n},2K_{n}]italic_a ∈ [ italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , 2 italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ] where Kn=(2Cf0εn)1/βsubscript𝐾𝑛superscript2subscript𝐶subscript𝑓0subscript𝜀𝑛1𝛽K_{n}=\left(\frac{2C_{f_{0}}}{\varepsilon_{n}}\right)^{1/\beta}italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = ( divide start_ARG 2 italic_C start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT 1 / italic_β end_POSTSUPERSCRIPT. We suppose n𝑛nitalic_n large enough so that εn/2<min(ε0a0,d0;Cf0a0β;1/2)subscript𝜀𝑛2superscriptsubscript𝜀0subscript𝑎0subscript𝑑0subscript𝐶subscript𝑓0superscriptsubscript𝑎0𝛽12\varepsilon_{n}/2\ <\ \min(\varepsilon_{0}^{a_{0},d_{0}};C_{f_{0}}\cdot a_{0}^% {-\beta};1/2)italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT / 2 < roman_min ( italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ; italic_C start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⋅ italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_β end_POSTSUPERSCRIPT ; 1 / 2 ). Then,

(5.6) Kn>(Cf0Cf0a0β)1/β=a0,subscript𝐾𝑛superscriptsubscript𝐶subscript𝑓0subscript𝐶subscript𝑓0superscriptsubscript𝑎0𝛽1𝛽subscript𝑎0K_{n}\ >\ \left(\frac{C_{f_{0}}}{C_{f_{0}}\cdot a_{0}^{-\beta}}\right)^{1/% \beta}\ =\ a_{0},italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT > ( divide start_ARG italic_C start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_C start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⋅ italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_β end_POSTSUPERSCRIPT end_ARG ) start_POSTSUPERSCRIPT 1 / italic_β end_POSTSUPERSCRIPT = italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ,

and, because a(2Cf0εn)1/β𝑎superscript2subscript𝐶subscript𝑓0subscript𝜀𝑛1𝛽a\geq\left(\frac{2C_{f_{0}}}{\varepsilon_{n}}\right)^{1/\beta}italic_a ≥ ( divide start_ARG 2 italic_C start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT 1 / italic_β end_POSTSUPERSCRIPT, we have

(5.7) εn2Cf0aβ.subscript𝜀𝑛2subscript𝐶subscript𝑓0superscript𝑎𝛽\frac{\varepsilon_{n}}{2}\ \geq\ C_{f_{0}}\cdot a^{-\beta}.divide start_ARG italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG ≥ italic_C start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⋅ italic_a start_POSTSUPERSCRIPT - italic_β end_POSTSUPERSCRIPT .

According to Lemma 5.3 in [VV08a], for q𝒪dn𝑞subscript𝒪subscript𝑑𝑛q\in\mathcal{O}_{d_{n}}italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT, we can write

Πn(WA,d0,qfn,qεn)subscriptΠ𝑛subscriptdelimited-∥∥superscript𝑊𝐴subscript𝑑0𝑞subscript𝑓𝑛𝑞subscript𝜀𝑛\displaystyle\Pi_{n}\left(\bigl{\|}{W^{A,d_{0},q}-f_{n,q}}\bigr{\|}_{\infty}% \leq\varepsilon_{n}\right)\ roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( ∥ italic_W start_POSTSUPERSCRIPT italic_A , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_q end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) Kn2KnΠn(Wa,d0,qfn,qεn)πn,d0(a)𝑑aabsentsuperscriptsubscriptsubscript𝐾𝑛2subscript𝐾𝑛subscriptΠ𝑛subscriptdelimited-∥∥superscript𝑊𝑎subscript𝑑0𝑞subscript𝑓𝑛𝑞subscript𝜀𝑛subscript𝜋𝑛subscript𝑑0𝑎differential-d𝑎\displaystyle\geq\ \int_{K_{n}}^{2K_{n}}\Pi_{n}\left(\bigl{\|}{W^{a,d_{0},q}-f% _{n,q}}\bigr{\|}_{\infty}\leq\varepsilon_{n}\right)\pi_{n,d_{0}}(a)da≥ ∫ start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( ∥ italic_W start_POSTSUPERSCRIPT italic_a , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_q end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) italic_π start_POSTSUBSCRIPT italic_n , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_a ) italic_d italic_a
Kn2Knexp(ϕfn,qa,d0,q(εn/2))πn,d0(a)𝑑a,absentsuperscriptsubscriptsubscript𝐾𝑛2subscript𝐾𝑛superscriptsubscriptitalic-ϕsubscript𝑓𝑛𝑞𝑎subscript𝑑0𝑞subscript𝜀𝑛2subscript𝜋𝑛subscript𝑑0𝑎differential-d𝑎\displaystyle\geq\ \int_{K_{n}}^{2K_{n}}\exp\left(-\phi_{f_{n,q}}^{a,d_{0},q}(% \varepsilon_{n}/2)\right)\pi_{n,d_{0}}(a)da,≥ ∫ start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT roman_exp ( - italic_ϕ start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_q end_POSTSUPERSCRIPT ( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT / 2 ) ) italic_π start_POSTSUBSCRIPT italic_n , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_a ) italic_d italic_a ,

where ϕfn,qa,d0,qsuperscriptsubscriptitalic-ϕsubscript𝑓𝑛𝑞𝑎subscript𝑑0𝑞\phi_{f_{n,q}}^{a,d_{0},q}italic_ϕ start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_q end_POSTSUPERSCRIPT is the concentration function in (3.3). Now we want to control the concentration function using Lemmas 5.5 and 5.7. The inequality (5.6) and the previous restriction on n𝑛nitalic_n ensure that the conditions of Lemma 5.7 are satisfied with ε=εn/2𝜀subscript𝜀𝑛2\varepsilon=\varepsilon_{n}/2italic_ε = italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT / 2, while (5.7) and Lemma 5.5 give

inf{h¯a,d0,q2:h¯a,d0,q,h¯fn,qεn/2}Df0ad0.\inf\left\{\left\|{\overline{h}}\right\|_{\mathbb{H}_{a,d_{0},q}}^{2}:% \overline{h}\in\mathbb{H}_{a,d_{0},q},\ \left\|{\overline{h}-f_{n,q}}\right\|_% {\infty}\leq\varepsilon_{n}/2\right\}\ \leq\ D_{f_{0}}\cdot a^{d_{0}}.roman_inf { ∥ over¯ start_ARG italic_h end_ARG ∥ start_POSTSUBSCRIPT blackboard_H start_POSTSUBSCRIPT italic_a , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT : over¯ start_ARG italic_h end_ARG ∈ blackboard_H start_POSTSUBSCRIPT italic_a , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_q end_POSTSUBSCRIPT , ∥ over¯ start_ARG italic_h end_ARG - italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT / 2 } ≤ italic_D start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⋅ italic_a start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT .

Using the expression (3.3) of the concentration function, a combination of the two lemmas gives

ϕfn,qa,d0,q(εn/2)superscriptsubscriptitalic-ϕsubscript𝑓𝑛𝑞𝑎subscript𝑑0𝑞subscript𝜀𝑛2\displaystyle\phi_{f_{n,q}}^{a,d_{0},q}(\varepsilon_{n}/2)\ italic_ϕ start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_q end_POSTSUPERSCRIPT ( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT / 2 ) Df0ad0+Ca0,d0ad0log(2a/εn)d0+1\displaystyle\leq\ D_{f_{0}}\cdot a^{d_{0}}+C_{a_{0},d_{0}}\cdot a^{d_{0}}\log% (2a/\varepsilon_{n})^{d_{0}+1}≤ italic_D start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⋅ italic_a start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT + italic_C start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⋅ italic_a start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT roman_log ( 2 italic_a / italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 end_POSTSUPERSCRIPT
=(Df0log(2a/εn)d01+Ca0,d0)ad0log(2a/εn)d0+1\displaystyle=\ \left(D_{f_{0}}\log(2a/\varepsilon_{n})^{-d_{0}-1}+C_{a_{0},d_% {0}}\right)a^{d_{0}}\log(2a/\varepsilon_{n})^{d_{0}+1}= ( italic_D start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_log ( 2 italic_a / italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT + italic_C start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) italic_a start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT roman_log ( 2 italic_a / italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 end_POSTSUPERSCRIPT
(Df0log(a0)d01+Ca0,d0)ad0log(2a/εn)d0+1,\displaystyle\leq\ \left(D_{f_{0}}\log(a_{0})^{-d_{0}-1}+C_{a_{0},d_{0}}\right% )a^{d_{0}}\log(2a/\varepsilon_{n})^{d_{0}+1},≤ ( italic_D start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_log ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT + italic_C start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) italic_a start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT roman_log ( 2 italic_a / italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 end_POSTSUPERSCRIPT ,

where the last inequality holds because aa0𝑎subscript𝑎0a\geq a_{0}italic_a ≥ italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and εn1subscript𝜀𝑛1\varepsilon_{n}\leq 1italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≤ 1 for n𝑛nitalic_n large enough. Let us define the constant Cf0,a0,d0:=Df0log(a0)d01+Ca0,d0C_{f_{0},a_{0},d_{0}}:=D_{f_{0}}\log(a_{0})^{-d_{0}-1}+C_{a_{0},d_{0}}italic_C start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT := italic_D start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_log ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT + italic_C start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT and note that there exists a constant Cf0subscriptsuperscript𝐶subscript𝑓0C^{\prime}_{f_{0}}italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT such that for sufficiently large n𝑛nitalic_n, log(4Kn/εn)=log(4(2Cf0)1/βεn(1+1/β))Cf0log(1/εn)4subscript𝐾𝑛subscript𝜀𝑛4superscript2subscript𝐶subscript𝑓01𝛽superscriptsubscript𝜀𝑛11𝛽subscriptsuperscript𝐶subscript𝑓01subscript𝜀𝑛\log(4K_{n}/\varepsilon_{n})=\log\left(4(2C_{f_{0}})^{1/\beta}\varepsilon_{n}^% {-(1+1/\beta)}\right)\leq C^{\prime}_{f_{0}}\log(1/\varepsilon_{n})roman_log ( 4 italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT / italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) = roman_log ( 4 ( 2 italic_C start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 1 / italic_β end_POSTSUPERSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - ( 1 + 1 / italic_β ) end_POSTSUPERSCRIPT ) ≤ italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_log ( 1 / italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ). Then, there exists a constant Cf0,a0,d0subscriptsuperscript𝐶subscript𝑓0subscript𝑎0subscript𝑑0C^{\prime}_{f_{0},a_{0},d_{0}}italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT such that

Kn2Knexp(ϕfn,qa,d0,q(εn/2))πn,d0(a)𝑑asuperscriptsubscriptsubscript𝐾𝑛2subscript𝐾𝑛superscriptsubscriptitalic-ϕsubscript𝑓𝑛𝑞𝑎subscript𝑑0𝑞subscript𝜀𝑛2subscript𝜋𝑛subscript𝑑0𝑎differential-d𝑎\displaystyle\int_{K_{n}}^{2K_{n}}\exp\left(-\phi_{f_{n,q}}^{a,d_{0},q}(% \varepsilon_{n}/2)\right)\pi_{n,d_{0}}(a)da\ ∫ start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT roman_exp ( - italic_ϕ start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_q end_POSTSUPERSCRIPT ( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT / 2 ) ) italic_π start_POSTSUBSCRIPT italic_n , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_a ) italic_d italic_a Kn2Knexp(Cf0,a0,d0ad0log(2a/εn)d0+1)πn,d0(a)da\displaystyle\geq\ \int_{K_{n}}^{2K_{n}}\exp\left(-C_{f_{0},a_{0},d_{0}}\cdot a% ^{d_{0}}\log(2a/\varepsilon_{n})^{d_{0}+1}\right)\pi_{n,d_{0}}(a)da≥ ∫ start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT roman_exp ( - italic_C start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⋅ italic_a start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT roman_log ( 2 italic_a / italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 end_POSTSUPERSCRIPT ) italic_π start_POSTSUBSCRIPT italic_n , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_a ) italic_d italic_a
exp(Cf0,a0,d0(2Kn)d0log(4Kn/εn)d0+1)πn,d0(2Kn)\displaystyle\geq\ \exp\left(-C_{f_{0},a_{0},d_{0}}(2K_{n})^{d_{0}}\log(4K_{n}% /\varepsilon_{n})^{d_{0}+1}\right)\pi_{n,d_{0}}(2K_{n})≥ roman_exp ( - italic_C start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( 2 italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT roman_log ( 4 italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT / italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 end_POSTSUPERSCRIPT ) italic_π start_POSTSUBSCRIPT italic_n , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( 2 italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT )
(Assumption 3.4)Assumption 3.4\displaystyle(\text{Assumption }\ref{ass:resc})\qquad( Assumption ) exp(Cf0,a0,d0εnd0/βlog(1/εn)d0+1).\displaystyle\geq\ \exp\left(-C^{\prime}_{f_{0},a_{0},d_{0}}\cdot\varepsilon_{% n}^{-d_{0}/\beta}\log(1/\varepsilon_{n})^{d_{0}+1}\right).≥ roman_exp ( - italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⋅ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / italic_β end_POSTSUPERSCRIPT roman_log ( 1 / italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 end_POSTSUPERSCRIPT ) .

With the help of the reminder (5.1), we see that

εnd0β=Cεd0βnd02β+d0(logn)(d0+1)d02β+d0and(log1εn)d0+1<(logn)d0+1,formulae-sequencesuperscriptsubscript𝜀𝑛subscript𝑑0𝛽superscriptsubscript𝐶𝜀subscript𝑑0𝛽superscript𝑛subscript𝑑02𝛽subscript𝑑0superscript𝑛subscript𝑑01subscript𝑑02𝛽subscript𝑑0andsuperscript1subscript𝜀𝑛subscript𝑑01superscript𝑛subscript𝑑01\varepsilon_{n}^{-\tfrac{d_{0}}{\beta}}\ =\ C_{\varepsilon}^{-\tfrac{d_{0}}{% \beta}}\cdot n^{\tfrac{d_{0}}{2\beta+d_{0}}}\cdot(\log n)^{-\tfrac{(d_{0}+1)d_% {0}}{2\beta+d_{0}}}\quad\text{and}\quad\left(\log\frac{1}{\varepsilon_{n}}% \right)^{d_{0}+1}\ <\ (\log n)^{d_{0}+1},italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_β end_ARG end_POSTSUPERSCRIPT = italic_C start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - divide start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_β end_ARG end_POSTSUPERSCRIPT ⋅ italic_n start_POSTSUPERSCRIPT divide start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_β + italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG end_POSTSUPERSCRIPT ⋅ ( roman_log italic_n ) start_POSTSUPERSCRIPT - divide start_ARG ( italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 ) italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG 2 italic_β + italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG end_POSTSUPERSCRIPT and ( roman_log divide start_ARG 1 end_ARG start_ARG italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 end_POSTSUPERSCRIPT < ( roman_log italic_n ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 end_POSTSUPERSCRIPT ,

hence εnd0/βlog(1/εn)d0+1<Cεd0/βnεn2\varepsilon_{n}^{-d_{0}/\beta}\log(1/\varepsilon_{n})^{d_{0}+1}<C_{\varepsilon% }^{-d_{0}/\beta}n\varepsilon_{n}^{2}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / italic_β end_POSTSUPERSCRIPT roman_log ( 1 / italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 end_POSTSUPERSCRIPT < italic_C start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / italic_β end_POSTSUPERSCRIPT italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Then, by choosing Cεsubscript𝐶𝜀C_{\varepsilon}italic_C start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT such that Cεd0/β2Cf0,a0,d0superscriptsubscript𝐶𝜀subscript𝑑0𝛽2subscriptsuperscript𝐶subscript𝑓0subscript𝑎0subscript𝑑0C_{\varepsilon}^{d_{0}/\beta}\geq 2C^{\prime}_{f_{0},a_{0},d_{0}}italic_C start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / italic_β end_POSTSUPERSCRIPT ≥ 2 italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, we can achieve

(5.8) Πn(WA,d0,qfn,qεn)exp(Cf0,a0,d0nεn2Cεd0/β)exp(12nεn2).subscriptΠ𝑛subscriptdelimited-∥∥superscript𝑊𝐴subscript𝑑0𝑞subscript𝑓𝑛𝑞subscript𝜀𝑛subscriptsuperscript𝐶subscript𝑓0subscript𝑎0subscript𝑑0𝑛superscriptsubscript𝜀𝑛2superscriptsubscript𝐶𝜀subscript𝑑0𝛽12𝑛superscriptsubscript𝜀𝑛2\Pi_{n}\left(\bigl{\|}{W^{A,d_{0},q}-f_{n,q}}\bigr{\|}_{\infty}\leq\varepsilon% _{n}\right)\ \geq\ \exp\left(-\frac{C^{\prime}_{f_{0},a_{0},d_{0}}\cdot n% \varepsilon_{n}^{2}}{C_{\varepsilon}^{d_{0}/\beta}}\right)\ \geq\ \exp\left(-% \frac{1}{2}n\varepsilon_{n}^{2}\right).roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( ∥ italic_W start_POSTSUPERSCRIPT italic_A , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_q end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ≥ roman_exp ( - divide start_ARG italic_C start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⋅ italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_C start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / italic_β end_POSTSUPERSCRIPT end_ARG ) ≥ roman_exp ( - divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) .

At this point, the problem amount to bound from below the measure of the set of q𝒪dn𝑞subscript𝒪subscript𝑑𝑛q\in\mathcal{O}_{d_{n}}italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT satisfying fn*fn,qεnsubscriptnormsuperscriptsubscript𝑓𝑛subscript𝑓𝑛𝑞subscript𝜀𝑛\left\|{f_{n}^{*}-f_{n,q}}\right\|_{\infty}\leq\varepsilon_{n}∥ italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT. We denote by 𝒜εnsubscript𝒜subscript𝜀𝑛\mathcal{A}_{\varepsilon_{n}}caligraphic_A start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT this set. The core function f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is continuous on the compact subset 𝕌d0subscript𝕌subscript𝑑0\mathbb{U}_{d_{0}}blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, so there exists a constant D1>0subscript𝐷10D_{1}>0italic_D start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > 0 such that f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is β𝛽\betaitalic_β-Hölder with Hölder constant D1subscript𝐷1D_{1}italic_D start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Then, for all q,q𝒪dn𝑞superscript𝑞subscript𝒪subscript𝑑𝑛q,q^{\prime}\in\mathcal{O}_{d_{n}}italic_q , italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT,

fn,qfn,qsubscriptnormsubscript𝑓𝑛superscript𝑞subscript𝑓𝑛𝑞\displaystyle\left\|{f_{n,q^{\prime}}-f_{n,q}}\right\|_{\infty}\ ∥ italic_f start_POSTSUBSCRIPT italic_n , italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT =supx𝕌dn|f0((qx)𝐝𝟎)f0((qx)𝐝𝟎)|D1|qq|β.absentsubscriptsupremum𝑥subscript𝕌subscript𝑑𝑛subscript𝑓0subscriptsuperscript𝑞𝑥subscript𝐝0subscript𝑓0subscript𝑞𝑥subscript𝐝0subscript𝐷1superscriptnormsuperscript𝑞𝑞𝛽\displaystyle=\ \sup_{x\in\mathbb{U}_{d_{n}}}\left|f_{0}\left((q^{\prime}x)_{% \mathbf{d_{0}}}\right)-f_{0}\left((qx)_{\mathbf{d_{0}}}\right)\right|\ \leq\ D% _{1}\cdot{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|q^{\prime}-q\right|% \kern-1.07639pt\right|\kern-1.07639pt\right|}^{\beta}.= roman_sup start_POSTSUBSCRIPT italic_x ∈ blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT | italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ( italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_x ) start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) - italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( ( italic_q italic_x ) start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) | ≤ italic_D start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⋅ | | | italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - italic_q | | | start_POSTSUPERSCRIPT italic_β end_POSTSUPERSCRIPT .

From now on, it is apparently sufficient to compute the measure of a ball in 𝒪dnsubscript𝒪subscript𝑑𝑛\mathcal{O}_{d_{n}}caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT with radius (εn/D1)1/βsuperscriptsubscript𝜀𝑛subscript𝐷11𝛽(\varepsilon_{n}/D_{1})^{1/\beta}( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT / italic_D start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 1 / italic_β end_POSTSUPERSCRIPT. In fact, B𝒪dn(qn*,(εn/D1)1/β)𝒜εnsubscript𝐵subscript𝒪subscript𝑑𝑛superscriptsubscript𝑞𝑛superscriptsubscript𝜀𝑛subscript𝐷11𝛽subscript𝒜subscript𝜀𝑛B_{\mathcal{O}_{d_{n}}}\left(q_{n}^{*},\ (\varepsilon_{n}/D_{1})^{1/\beta}% \right)\subset\mathcal{A}_{\varepsilon_{n}}italic_B start_POSTSUBSCRIPT caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , ( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT / italic_D start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 1 / italic_β end_POSTSUPERSCRIPT ) ⊂ caligraphic_A start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT. However, this leads to a design dimension dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT not larger than nd0/(4β+2d0)superscript𝑛subscript𝑑04𝛽2subscript𝑑0n^{d_{0}/(4\beta+2d_{0})}italic_n start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / ( 4 italic_β + 2 italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT. To obtain dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT of order nd0/(2β+d0)superscript𝑛subscript𝑑02𝛽subscript𝑑0n^{d_{0}/(2\beta+d_{0})}italic_n start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / ( 2 italic_β + italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT, we have to consider a larger subset.

Notation.

Let Fdn𝐹superscriptsubscript𝑑𝑛F\subset\mathbb{R}^{d_{n}}italic_F ⊂ blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT be a linear subspace of dnsuperscriptsubscript𝑑𝑛\mathbb{R}^{d_{n}}blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. We denote by 𝒪dn(F)subscript𝒪subscript𝑑𝑛𝐹\mathcal{O}_{d_{n}}(F)caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_F ) the set of isometries that fix F𝐹Fitalic_F:

𝒪dn(F):={q𝒪dn:q|F=Id}.\mathcal{O}_{d_{n}}(F)\ :=\ \{q^{\prime}\in\mathcal{O}_{d_{n}}:q^{\prime}_{|F}% =\operatorname{Id}\}.caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_F ) := { italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT : italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT | italic_F end_POSTSUBSCRIPT = roman_Id } .

Then, for all q𝒪dn((qn*)1(E𝐝𝟎))superscript𝑞subscript𝒪subscript𝑑𝑛superscriptsuperscriptsubscript𝑞𝑛1subscript𝐸subscript𝐝0q^{\prime}\in\mathcal{O}_{d_{n}}((q_{n}^{*})^{-1}(E_{\mathbf{d_{0}}}))italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ( italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ), we have

fn,qn*q=fn,qn*q=fn,qn*andfn*fn,q=fn,qn*qfn,qD1|qn*qq|β.formulae-sequencesubscript𝑓𝑛superscriptsubscript𝑞𝑛superscript𝑞subscript𝑓𝑛superscriptsubscript𝑞𝑛superscript𝑞subscript𝑓𝑛superscriptsubscript𝑞𝑛andsubscriptnormsuperscriptsubscript𝑓𝑛subscript𝑓𝑛𝑞subscriptnormsubscript𝑓𝑛superscriptsubscript𝑞𝑛superscript𝑞subscript𝑓𝑛𝑞subscript𝐷1superscriptnormsubscriptsuperscript𝑞𝑛superscript𝑞𝑞𝛽f_{n,q_{n}^{*}q^{\prime}}\ =\ f_{n,q_{n}^{*}}\circ q^{\prime}\ =\ f_{n,q_{n}^{% *}}\qquad\text{and}\qquad\left\|{f_{n}^{*}-f_{n,q}}\right\|_{\infty}\ =\ \left% \|{f_{n,q_{n}^{*}q^{\prime}}-f_{n,q}}\right\|_{\infty}\ \leq\ D_{1}\cdot{\left% |\kern-1.07639pt\left|\kern-1.07639pt\left|q^{*}_{n}q^{\prime}-q\right|\kern-1% .07639pt\right|\kern-1.07639pt\right|}^{\beta}.italic_f start_POSTSUBSCRIPT italic_n , italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = italic_f start_POSTSUBSCRIPT italic_n , italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∘ italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_f start_POSTSUBSCRIPT italic_n , italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT and ∥ italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT = ∥ italic_f start_POSTSUBSCRIPT italic_n , italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT - italic_f start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_D start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⋅ | | | italic_q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - italic_q | | | start_POSTSUPERSCRIPT italic_β end_POSTSUPERSCRIPT .

For ε>0𝜀0\varepsilon>0italic_ε > 0, we define

𝒬qn*,ε:={q𝒪dn:q𝒪dn((qn*)1(E𝐝𝟎)),|qn*qq|ε}.assignsubscript𝒬superscriptsubscript𝑞𝑛𝜀conditional-set𝑞subscript𝒪subscript𝑑𝑛formulae-sequencesuperscript𝑞subscript𝒪subscript𝑑𝑛superscriptsuperscriptsubscript𝑞𝑛1subscript𝐸subscript𝐝0normsuperscriptsubscript𝑞𝑛superscript𝑞𝑞𝜀\mathcal{Q}_{q_{n}^{*},\varepsilon}\ :=\ \{q\in\mathcal{O}_{d_{n}}:\exists q^{% \prime}\in\mathcal{O}_{d_{n}}((q_{n}^{*})^{-1}(E_{\mathbf{d_{0}}})),\ {\left|% \kern-1.07639pt\left|\kern-1.07639pt\left|q_{n}^{*}q^{\prime}-q\right|\kern-1.% 07639pt\right|\kern-1.07639pt\right|}\leq\varepsilon\}.caligraphic_Q start_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_ε end_POSTSUBSCRIPT := { italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT : ∃ italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ( italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) , | | | italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - italic_q | | | ≤ italic_ε } .

Then, 𝒬qn*,(εn/D1)1/β𝒜εnsubscript𝒬subscriptsuperscript𝑞𝑛superscriptsubscript𝜀𝑛subscript𝐷11𝛽subscript𝒜subscript𝜀𝑛\mathcal{Q}_{q^{*}_{n},(\varepsilon_{n}/D_{1})^{1/\beta}}\subset\mathcal{A}_{% \varepsilon_{n}}caligraphic_Q start_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , ( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT / italic_D start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 1 / italic_β end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ⊂ caligraphic_A start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Since the Haar measure is translation invariant, it is sufficient to cover 𝒪dnsubscript𝒪subscript𝑑𝑛\mathcal{O}_{d_{n}}caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT with translations of 𝒬qn*,εsubscript𝒬superscriptsubscript𝑞𝑛𝜀\mathcal{Q}_{q_{n}^{*},\varepsilon}caligraphic_Q start_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_ε end_POSTSUBSCRIPT to obtain a lower bound on the measure of 𝒬qn*,εsubscript𝒬superscriptsubscript𝑞𝑛𝜀\mathcal{Q}_{q_{n}^{*},\varepsilon}caligraphic_Q start_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_ε end_POSTSUBSCRIPT, that is, to cover 𝒪dnsubscript𝒪subscript𝑑𝑛\mathcal{O}_{d_{n}}caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT with sets q¯𝒬qn*,ε¯𝑞subscript𝒬superscriptsubscript𝑞𝑛𝜀\overline{q}\mathcal{Q}_{q_{n}^{*},\varepsilon}over¯ start_ARG italic_q end_ARG caligraphic_Q start_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_ε end_POSTSUBSCRIPT where q¯¯𝑞\overline{q}over¯ start_ARG italic_q end_ARG belongs to some net 𝒪dnsubscript𝒪subscript𝑑𝑛\mathcal{R}\subset\mathcal{O}_{d_{n}}caligraphic_R ⊂ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT and then remark that (Θ𝒬qn*,ε)1/||Θsubscript𝒬superscriptsubscript𝑞𝑛𝜀1\operatorname{\mathds{P}}(\Theta\in\mathcal{Q}_{q_{n}^{*},\varepsilon})\geq 1/% \left|\mathcal{R}\right|blackboard_P ( roman_Θ ∈ caligraphic_Q start_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_ε end_POSTSUBSCRIPT ) ≥ 1 / | caligraphic_R |.

Lemma 5.1.

We have,

(Θ𝒬qn*,ε)(2πdn)d02(ε16d0dn)d0(dn1).Θsubscript𝒬superscriptsubscript𝑞𝑛𝜀superscript2𝜋subscript𝑑𝑛subscript𝑑02superscript𝜀16subscript𝑑0subscript𝑑𝑛subscript𝑑0subscript𝑑𝑛1\operatorname{\mathds{P}}(\Theta\in\mathcal{Q}_{q_{n}^{*},\varepsilon})\ \geq% \ \left(\frac{2}{\pi d_{n}}\right)^{\tfrac{d_{0}}{2}}\cdot\left(\frac{% \varepsilon}{16\sqrt{d_{0}d_{n}}}\right)^{d_{0}(d_{n}-1)}.blackboard_P ( roman_Θ ∈ caligraphic_Q start_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_ε end_POSTSUBSCRIPT ) ≥ ( divide start_ARG 2 end_ARG start_ARG italic_π italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT divide start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ⋅ ( divide start_ARG italic_ε end_ARG start_ARG 16 square-root start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - 1 ) end_POSTSUPERSCRIPT .
Proof of Lemma 5.1.

Let q′′𝒪dnsuperscript𝑞′′subscript𝒪subscript𝑑𝑛q^{\prime\prime}\in\mathcal{O}_{d_{n}}italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT. The first step consists in constructing a net 𝒪dnsubscript𝒪subscript𝑑𝑛\mathcal{R}\subset\mathcal{O}_{d_{n}}caligraphic_R ⊂ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT such that there exist q¯¯𝑞\overline{q}\in\mathcal{R}over¯ start_ARG italic_q end_ARG ∈ caligraphic_R and q𝒬qn*,ε𝑞subscript𝒬superscriptsubscript𝑞𝑛𝜀q\in\mathcal{Q}_{q_{n}^{*},\varepsilon}italic_q ∈ caligraphic_Q start_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_ε end_POSTSUBSCRIPT with q′′=q¯qsuperscript𝑞′′¯𝑞𝑞q^{\prime\prime}=\overline{q}qitalic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT = over¯ start_ARG italic_q end_ARG italic_q. Let (u1,,ud0,ud0+1,,udn)subscript𝑢1subscript𝑢subscript𝑑0subscript𝑢subscript𝑑01subscript𝑢subscript𝑑𝑛(u_{1},\ldots,u_{d_{0}},u_{d_{0}+1},\ldots,u_{d_{n}})( italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) be an orthonormal basis adapted to the direct sum dn=(qn*)1(E𝐝𝟎)(qn*)1(E1𝐝𝟎)superscriptsubscript𝑑𝑛superscriptsuperscriptsubscript𝑞𝑛1subscript𝐸subscript𝐝0perpendicular-todirect-sumsuperscriptsuperscriptsubscript𝑞𝑛1subscript𝐸1subscript𝐝0\mathbb{R}^{d_{n}}=(q_{n}^{*})^{-1}(E_{\mathbf{d_{0}}})\ \overset{\perp}{% \bigoplus}\ (q_{n}^{*})^{-1}(E_{1-\mathbf{d_{0}}})blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT = ( italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) over⟂ start_ARG ⨁ end_ARG ( italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT 1 - bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ). For all d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-tuple of orthonormal vectors g=(g1,,gd0)𝑔subscript𝑔1subscript𝑔subscript𝑑0g=(g_{1},\ldots,g_{d_{0}})italic_g = ( italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_g start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ), we fix rg𝒪dnsubscript𝑟𝑔subscript𝒪subscript𝑑𝑛r_{g}\in\mathcal{O}_{d_{n}}italic_r start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT an isometry such that rg(qn*ui)=gisubscript𝑟𝑔superscriptsubscript𝑞𝑛subscript𝑢𝑖subscript𝑔𝑖r_{g}(q_{n}^{*}u_{i})=g_{i}italic_r start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ( italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for all i1,d0𝑖1subscript𝑑0i\in\llbracket 1,d_{0}\rrbracketitalic_i ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⟧. Moreover, we denote by 𝒢𝒢\mathcal{G}caligraphic_G a set of d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-tuples of orthonormal vectors in dnsuperscriptsubscript𝑑𝑛\mathbb{R}^{d_{n}}blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT such that, for all d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-tuples f=(f1,,fd0)𝑓subscript𝑓1subscript𝑓subscript𝑑0f=(f_{1},\ldots,f_{d_{0}})italic_f = ( italic_f start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_f start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) of orthonormal vectors, there exists g𝒢𝑔𝒢g\in\mathcal{G}italic_g ∈ caligraphic_G satisfying

supi1,d0gifiε2d0dn.subscriptsupremum𝑖1subscript𝑑0normsubscript𝑔𝑖subscript𝑓𝑖𝜀2subscript𝑑0subscript𝑑𝑛\sup_{i\in\llbracket 1,d_{0}\rrbracket}\left\|{g_{i}-f_{i}}\right\|\ \leq\ % \frac{\varepsilon}{2\sqrt{d_{0}d_{n}}}.roman_sup start_POSTSUBSCRIPT italic_i ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⟧ end_POSTSUBSCRIPT ∥ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ ≤ divide start_ARG italic_ε end_ARG start_ARG 2 square-root start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG .

We claim that we can take :={rg:g𝒢}assignconditional-setsubscript𝑟𝑔𝑔𝒢\mathcal{R}:=\{r_{g}:g\in\mathcal{G}\}caligraphic_R := { italic_r start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT : italic_g ∈ caligraphic_G }. Indeed, there exists g𝒢𝑔𝒢g\in\mathcal{G}italic_g ∈ caligraphic_G such that

supi1,d0giq′′(ui)ε2d0dn.subscriptsupremum𝑖1subscript𝑑0normsubscript𝑔𝑖superscript𝑞′′subscript𝑢𝑖𝜀2subscript𝑑0subscript𝑑𝑛\sup_{i\in\llbracket 1,d_{0}\rrbracket}\left\|{g_{i}-q^{\prime\prime}(u_{i})}% \right\|\ \leq\ \frac{\varepsilon}{2\sqrt{d_{0}d_{n}}}.roman_sup start_POSTSUBSCRIPT italic_i ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⟧ end_POSTSUBSCRIPT ∥ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∥ ≤ divide start_ARG italic_ε end_ARG start_ARG 2 square-root start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG .

By Lemma 5.8, we can extend g𝑔gitalic_g in an orthonormal basis of dnsuperscriptsubscript𝑑𝑛\mathbb{R}^{d_{n}}blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT such that

(5.9) supj1,dngjq′′(uj)εdn.subscriptsupremum𝑗1subscript𝑑𝑛normsubscript𝑔𝑗superscript𝑞′′subscript𝑢𝑗𝜀subscript𝑑𝑛\sup_{j\in\llbracket 1,d_{n}\rrbracket}\left\|{g_{j}-q^{\prime\prime}(u_{j})}% \right\|\ \leq\ \frac{\varepsilon}{\sqrt{d_{n}}}.roman_sup start_POSTSUBSCRIPT italic_j ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⟧ end_POSTSUBSCRIPT ∥ italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ∥ ≤ divide start_ARG italic_ε end_ARG start_ARG square-root start_ARG italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG .

Then, writing q¯=rg¯𝑞subscript𝑟𝑔\overline{q}=r_{g}over¯ start_ARG italic_q end_ARG = italic_r start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT and taking q𝑞qitalic_q such that q(uj)=rg1(q′′uj)𝑞subscript𝑢𝑗superscriptsubscript𝑟𝑔1superscript𝑞′′subscript𝑢𝑗q(u_{j})=r_{g}^{-1}(q^{\prime\prime}u_{j})italic_q ( italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = italic_r start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) for all j1,dn𝑗1subscript𝑑𝑛j\in\llbracket 1,d_{n}\rrbracketitalic_j ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⟧, we have q′′=q¯qsuperscript𝑞′′¯𝑞𝑞q^{\prime\prime}=\overline{q}qitalic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT = over¯ start_ARG italic_q end_ARG italic_q. Moreover, because rg1(gj)E1𝐝𝟎superscriptsubscript𝑟𝑔1subscript𝑔𝑗subscript𝐸1subscript𝐝0r_{g}^{-1}(g_{j})\in E_{1-\mathbf{d_{0}}}italic_r start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ∈ italic_E start_POSTSUBSCRIPT 1 - bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT and (qn*)1rg1(gj)(qn*)1(E1𝐝𝟎)superscriptsuperscriptsubscript𝑞𝑛1superscriptsubscript𝑟𝑔1subscript𝑔𝑗superscriptsuperscriptsubscript𝑞𝑛1subscript𝐸1subscript𝐝0(q_{n}^{*})^{-1}r_{g}^{-1}(g_{j})\in(q_{n}^{*})^{-1}(E_{1-\mathbf{d_{0}}})( italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ∈ ( italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT 1 - bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) for jd0+1,dn𝑗subscript𝑑01subscript𝑑𝑛j\in\llbracket d_{0}+1,d_{n}\rrbracketitalic_j ∈ ⟦ italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 , italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⟧, we can define qsuperscript𝑞q^{\prime}italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT such that

{q(ui)=ui,if i1,d0,q(uj)=(qn*)1rg1(gj),if jd0+1,dn.casessuperscript𝑞subscript𝑢𝑖subscript𝑢𝑖if 𝑖1subscript𝑑0superscript𝑞subscript𝑢𝑗superscriptsuperscriptsubscript𝑞𝑛1superscriptsubscript𝑟𝑔1subscript𝑔𝑗if 𝑗subscript𝑑01subscript𝑑𝑛\begin{cases}q^{\prime}(u_{i})=u_{i},&\text{if }i\in\llbracket 1,d_{0}% \rrbracket,\\ q^{\prime}(u_{j})=(q_{n}^{*})^{-1}r_{g}^{-1}(g_{j}),&\text{if }j\in\llbracket d% _{0}+1,d_{n}\rrbracket.\end{cases}{ start_ROW start_CELL italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , end_CELL start_CELL if italic_i ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⟧ , end_CELL end_ROW start_ROW start_CELL italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = ( italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) , end_CELL start_CELL if italic_j ∈ ⟦ italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 , italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⟧ . end_CELL end_ROW

Then, we have q𝒪dn((qn*)1(E𝐝𝟎))superscript𝑞subscript𝒪subscript𝑑𝑛superscriptsuperscriptsubscript𝑞𝑛1subscript𝐸subscript𝐝0q^{\prime}\in\mathcal{O}_{d_{n}}((q_{n}^{*})^{-1}(E_{\mathbf{d_{0}}}))italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ( italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) and according to (5.9),

qn*q(ui)q(ui)=qn*(ui)rg1(q′′ui)=rgqn*(ui)q′′(ui)εdn,for i1,d0,formulae-sequencenormsuperscriptsubscript𝑞𝑛superscript𝑞subscript𝑢𝑖𝑞subscript𝑢𝑖normsuperscriptsubscript𝑞𝑛subscript𝑢𝑖superscriptsubscript𝑟𝑔1superscript𝑞′′subscript𝑢𝑖normsubscript𝑟𝑔superscriptsubscript𝑞𝑛subscript𝑢𝑖superscript𝑞′′subscript𝑢𝑖𝜀subscript𝑑𝑛for 𝑖1subscript𝑑0\left\|{q_{n}^{*}q^{\prime}(u_{i})-q(u_{i})}\right\|\ =\ \left\|{q_{n}^{*}(u_{% i})-r_{g}^{-1}(q^{\prime\prime}u_{i})}\right\|\ =\ \left\|{r_{g}q_{n}^{*}(u_{i% })-q^{\prime\prime}(u_{i})}\right\|\ \leq\ \frac{\varepsilon}{\sqrt{d_{n}}},% \qquad\text{for }i\in\llbracket 1,d_{0}\rrbracket,∥ italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) - italic_q ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∥ = ∥ italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) - italic_r start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∥ = ∥ italic_r start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) - italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∥ ≤ divide start_ARG italic_ε end_ARG start_ARG square-root start_ARG italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG , for italic_i ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⟧ ,

and,

qn*q(uj)q(uj)=rg1(gj)q(uj)=gjrg(quj)εdn,for jd0+1,dn.formulae-sequencenormsuperscriptsubscript𝑞𝑛superscript𝑞subscript𝑢𝑗𝑞subscript𝑢𝑗normsuperscriptsubscript𝑟𝑔1subscript𝑔𝑗𝑞subscript𝑢𝑗normsubscript𝑔𝑗subscript𝑟𝑔𝑞subscript𝑢𝑗𝜀subscript𝑑𝑛for 𝑗subscript𝑑01subscript𝑑𝑛\left\|{q_{n}^{*}q^{\prime}(u_{j})-q(u_{j})}\right\|\ =\ \left\|{r_{g}^{-1}(g_% {j})-q(u_{j})}\right\|\ =\ \left\|{g_{j}-r_{g}(qu_{j})}\right\|\ \leq\ \frac{% \varepsilon}{\sqrt{d_{n}}},\qquad\text{for }j\in\llbracket d_{0}+1,d_{n}\rrbracket.∥ italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) - italic_q ( italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ∥ = ∥ italic_r start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) - italic_q ( italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ∥ = ∥ italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_r start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ( italic_q italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ∥ ≤ divide start_ARG italic_ε end_ARG start_ARG square-root start_ARG italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG , for italic_j ∈ ⟦ italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 , italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⟧ .

So |qn*qq|εnormsuperscriptsubscript𝑞𝑛superscript𝑞𝑞𝜀{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|q_{n}^{*}q^{\prime}-q\right|% \kern-1.07639pt\right|\kern-1.07639pt\right|}\leq\varepsilon| | | italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - italic_q | | | ≤ italic_ε and the net :={rg:g𝒢}assignconditional-setsubscript𝑟𝑔𝑔𝒢\mathcal{R}:=\{r_{g}:g\in\mathcal{G}\}caligraphic_R := { italic_r start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT : italic_g ∈ caligraphic_G } is appropriate. Finally, by taking 𝒢𝒢\mathcal{G}caligraphic_G as in Lemma 5.10, we obtain

||(πdn2)d02(16d0dnε)d0(dn1),superscript𝜋subscript𝑑𝑛2subscript𝑑02superscript16subscript𝑑0subscript𝑑𝑛𝜀subscript𝑑0subscript𝑑𝑛1\left|\mathcal{R}\right|\ \leq\ \left(\frac{\pi d_{n}}{2}\right)^{\tfrac{d_{0}% }{2}}\cdot\left(\frac{16\sqrt{d_{0}d_{n}}}{\varepsilon}\right)^{d_{0}(d_{n}-1)},| caligraphic_R | ≤ ( divide start_ARG italic_π italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG ) start_POSTSUPERSCRIPT divide start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ⋅ ( divide start_ARG 16 square-root start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG start_ARG italic_ε end_ARG ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - 1 ) end_POSTSUPERSCRIPT ,

hence the result. ∎

Consequently, we have established that

(Θ𝒜εn)(2πdn)d02((εnD1)1β116d0dn)d0(dn1).Θsubscript𝒜subscript𝜀𝑛superscript2𝜋subscript𝑑𝑛subscript𝑑02superscriptsuperscriptsubscript𝜀𝑛subscript𝐷11𝛽116subscript𝑑0subscript𝑑𝑛subscript𝑑0subscript𝑑𝑛1\operatorname{\mathds{P}}(\Theta\in\mathcal{A}_{\varepsilon_{n}})\ \geq\ \left% (\frac{2}{\pi d_{n}}\right)^{\tfrac{d_{0}}{2}}\cdot\left(\left(\frac{% \varepsilon_{n}}{D_{1}}\right)^{\frac{1}{\beta}}\frac{1}{16\sqrt{d_{0}d_{n}}}% \right)^{d_{0}(d_{n}-1)}.blackboard_P ( roman_Θ ∈ caligraphic_A start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ≥ ( divide start_ARG 2 end_ARG start_ARG italic_π italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT divide start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ⋅ ( ( divide start_ARG italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG italic_D start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_β end_ARG end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 16 square-root start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - 1 ) end_POSTSUPERSCRIPT .

Recall that we have the following lower bound:

Πn(WA,Γ,Θfn*2εn)πΓ(d0)(Θ𝒜εn)exp(12nεn2).subscriptΠ𝑛subscriptdelimited-∥∥superscript𝑊𝐴ΓΘsuperscriptsubscript𝑓𝑛2subscript𝜀𝑛subscript𝜋Γsubscript𝑑0Θsubscript𝒜subscript𝜀𝑛12𝑛superscriptsubscript𝜀𝑛2\Pi_{n}\left(\bigl{\|}{W^{A,\Gamma,\Theta}-f_{n}^{*}}\bigr{\|}_{\infty}\leq 2% \varepsilon_{n}\right)\ \geq\ \pi_{\Gamma}(d_{0})\cdot\operatorname{\mathds{P}% }(\Theta\in\mathcal{A}_{\varepsilon_{n}})\cdot\exp\left(-\frac{1}{2}n% \varepsilon_{n}^{2}\right).roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( ∥ italic_W start_POSTSUPERSCRIPT italic_A , roman_Γ , roman_Θ end_POSTSUPERSCRIPT - italic_f start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ 2 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ≥ italic_π start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ⋅ blackboard_P ( roman_Θ ∈ caligraphic_A start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ⋅ roman_exp ( - divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) .

In order to establish the prior mass condition, it suffices to derive the greatest design dimension dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT for which we can reach

(Θ𝒜εn)πΓ(d0)1exp(12nεn2).Θsubscript𝒜subscript𝜀𝑛subscript𝜋Γsuperscriptsubscript𝑑0112𝑛superscriptsubscript𝜀𝑛2\operatorname{\mathds{P}}(\Theta\in\mathcal{A}_{\varepsilon_{n}})\ \geq\ \pi_{% \Gamma}(d_{0})^{-1}\exp\left(-\frac{1}{2}n\varepsilon_{n}^{2}\right).blackboard_P ( roman_Θ ∈ caligraphic_A start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ≥ italic_π start_POSTSUBSCRIPT roman_Γ end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_exp ( - divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) .

For n𝑛nitalic_n large enough, a design dimension dnsubscript𝑑𝑛d_{n}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT as specified in Assumption 3.5 is appropriate for sufficiently small constant CDsubscript𝐶𝐷C_{D}italic_C start_POSTSUBSCRIPT italic_D end_POSTSUBSCRIPT.

Remark 5.2.

The exponent d0(dn1)subscript𝑑0subscript𝑑𝑛1d_{0}(d_{n}-1)italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - 1 ) in Lemma 5.1 is probably not far to be optimal. In fact, ignoring the constants, changing this exponent to dnαsuperscriptsubscript𝑑𝑛𝛼d_{n}^{\alpha}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_α end_POSTSUPERSCRIPT with α<1𝛼1\alpha<1italic_α < 1 would lead to a growth rate of nd0/(α(2β+d0))superscript𝑛subscript𝑑0𝛼2𝛽subscript𝑑0n^{d_{0}/(\alpha(2\beta+d_{0}))}italic_n start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT / ( italic_α ( 2 italic_β + italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) end_POSTSUPERSCRIPT which, when β𝛽\betaitalic_β is close to zero, gives a growth rate with an order superior to n𝑛nitalic_n. The breakpoint of some popular subspace estimators, such as SIR, being the order n𝑛nitalic_n, it would be surprising to estimate a function faster than its central subspace.

5.1.3 Sieve condition (5.3)

The second condition can be verified similarly as in [JT21]. As in the previous section, we will first treat the case with deterministic rescaling parameter, dimension, and direction and then integrate according to A𝐴Aitalic_A, ΓΓ\Gammaroman_Γ, and ΘΘ\Thetaroman_Θ.

We suppose that n𝑛nitalic_n is large enough so that dn>dmaxsubscript𝑑𝑛subscript𝑑maxd_{n}>d_{\mathrm{max}}italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT > italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT. We introduce the quantities Mn:=CMnεn2assignsubscript𝑀𝑛subscript𝐶𝑀𝑛superscriptsubscript𝜀𝑛2M_{n}:=C_{M}\sqrt{n\varepsilon_{n}^{2}}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT := italic_C start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT square-root start_ARG italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG for some large constant CMsubscript𝐶𝑀C_{M}italic_C start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT and, for 1bdmax1𝑏subscript𝑑max1\leq b\leq d_{\mathrm{max}}1 ≤ italic_b ≤ italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT, the quantity rn,bsubscript𝑟𝑛𝑏r_{n,b}italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT such that rn,bb(logn)b+1=Crnεn2superscriptsubscript𝑟𝑛𝑏𝑏superscript𝑛𝑏1subscript𝐶𝑟𝑛superscriptsubscript𝜀𝑛2r_{n,b}^{b}(\log n)^{b+1}=C_{r}n\varepsilon_{n}^{2}italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT ( roman_log italic_n ) start_POSTSUPERSCRIPT italic_b + 1 end_POSTSUPERSCRIPT = italic_C start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, for a large constant Crsubscript𝐶𝑟C_{r}italic_C start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT. The sieve 𝔹nsubscript𝔹𝑛\mathbb{B}_{n}blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is defined as follows:

𝔹n:=q𝒪dnn,q,assignsubscript𝔹𝑛subscript𝑞subscript𝒪subscript𝑑𝑛subscript𝑛𝑞\mathbb{B}_{n}\ :=\ \bigcup_{q\in\mathcal{O}_{d_{n}}}\mathcal{B}_{n,q},blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT := ⋃ start_POSTSUBSCRIPT italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT caligraphic_B start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT ,

with

n,q:=b=1dmaxn,b,qandn,b,q:=Mnrn,b1rn,b,b,q+εnB1,formulae-sequenceassignsubscript𝑛𝑞superscriptsubscript𝑏1subscript𝑑maxsubscript𝑛𝑏𝑞andassignsubscript𝑛𝑏𝑞subscript𝑀𝑛subscript𝑟𝑛𝑏superscriptsubscript1subscript𝑟𝑛𝑏𝑏𝑞subscript𝜀𝑛subscript𝐵1\mathcal{B}_{n,q}\ :=\ \bigcup_{b=1}^{d_{\mathrm{max}}}\mathcal{B}_{n,b,q}% \qquad\text{and}\quad\mathcal{B}_{n,b,q}\ :=\ M_{n}\sqrt{r_{n,b}}\cdot\mathbb{% H}_{1}^{r_{n,b},b,q}+\varepsilon_{n}B_{1},caligraphic_B start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT := ⋃ start_POSTSUBSCRIPT italic_b = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_POSTSUPERSCRIPT caligraphic_B start_POSTSUBSCRIPT italic_n , italic_b , italic_q end_POSTSUBSCRIPT and caligraphic_B start_POSTSUBSCRIPT italic_n , italic_b , italic_q end_POSTSUBSCRIPT := italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT square-root start_ARG italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT end_ARG ⋅ blackboard_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT , italic_b , italic_q end_POSTSUPERSCRIPT + italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ,

where B1subscript𝐵1B_{1}italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is the unit ball in the Banach space (𝒞0(𝕌dn),)(\mathcal{C}^{0}(\mathbb{U}_{d_{n}}),\left\|{\cdot}\right\|_{\infty})( caligraphic_C start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ).

The nesting property of Lemma 4.7 in [VV09] remains true in the present setting, that is, for aa𝑎superscript𝑎a\leq a^{\prime}italic_a ≤ italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT,

a1a,b,qa1a,b,q.𝑎superscriptsubscript1𝑎𝑏𝑞superscript𝑎superscriptsubscript1superscript𝑎𝑏𝑞\sqrt{a}\cdot\mathbb{H}_{1}^{a,b,q}\ \subseteq\ \sqrt{a^{\prime}}\cdot\mathbb{% H}_{1}^{a^{\prime},b,q}.square-root start_ARG italic_a end_ARG ⋅ blackboard_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT ⊆ square-root start_ARG italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG ⋅ blackboard_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_b , italic_q end_POSTSUPERSCRIPT .

Consequently, if 1arn,b1𝑎subscript𝑟𝑛𝑏1\leq a\leq r_{n,b}1 ≤ italic_a ≤ italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT, then

Mn1a,b,q+εnB1Mnrn,ba1rn,b,b,q+εnB1n,b,q.subscript𝑀𝑛superscriptsubscript1𝑎𝑏𝑞subscript𝜀𝑛subscript𝐵1subscript𝑀𝑛subscript𝑟𝑛𝑏𝑎superscriptsubscript1subscript𝑟𝑛𝑏𝑏𝑞subscript𝜀𝑛subscript𝐵1subscript𝑛𝑏𝑞M_{n}\mathbb{H}_{1}^{a,b,q}+\varepsilon_{n}B_{1}\ \subseteq\ M_{n}\sqrt{\frac{% r_{n,b}}{a}}\cdot\mathbb{H}_{1}^{r_{n,b},b,q}+\varepsilon_{n}B_{1}\ \subseteq% \ \mathcal{B}_{n,b,q}.italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT blackboard_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT + italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⊆ italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT square-root start_ARG divide start_ARG italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT end_ARG start_ARG italic_a end_ARG end_ARG ⋅ blackboard_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT , italic_b , italic_q end_POSTSUPERSCRIPT + italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⊆ caligraphic_B start_POSTSUBSCRIPT italic_n , italic_b , italic_q end_POSTSUBSCRIPT .

By Borell’s inequality (see [VV08a], Theorem 5.1, or [Bor75]), for every a[1,rn,b]𝑎1subscript𝑟𝑛𝑏a\in[1,r_{n,b}]italic_a ∈ [ 1 , italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT ],

Πn(Wa,b,q𝔹n)subscriptΠ𝑛superscript𝑊𝑎𝑏𝑞subscript𝔹𝑛\displaystyle\Pi_{n}(W^{a,b,q}\notin\mathbb{B}_{n})\ roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT ∉ blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) Πn(Wa,b,qn,b,q)absentsubscriptΠ𝑛superscript𝑊𝑎𝑏𝑞subscript𝑛𝑏𝑞\displaystyle\leq\ \Pi_{n}(W^{a,b,q}\notin\mathcal{B}_{n,b,q})≤ roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT ∉ caligraphic_B start_POSTSUBSCRIPT italic_n , italic_b , italic_q end_POSTSUBSCRIPT )
Πn(Wa,b,qMn1a,b,q+εnB1)absentsubscriptΠ𝑛superscript𝑊𝑎𝑏𝑞subscript𝑀𝑛superscriptsubscript1𝑎𝑏𝑞subscript𝜀𝑛subscript𝐵1\displaystyle\leq\ \Pi_{n}(W^{a,b,q}\notin M_{n}\mathbb{H}_{1}^{a,b,q}+% \varepsilon_{n}B_{1})≤ roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT ∉ italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT blackboard_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT + italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT )
1Φ(Φ1(Πn(Wa,b,qεn))+Mn),absent1ΦsuperscriptΦ1subscriptΠ𝑛subscriptnormsuperscript𝑊𝑎𝑏𝑞subscript𝜀𝑛subscript𝑀𝑛\displaystyle\leq\ 1-\Phi\left(\Phi^{-1}\left(\Pi_{n}\left(\left\|{W^{a,b,q}}% \right\|_{\infty}\leq\varepsilon_{n}\right)\right)+M_{n}\right),≤ 1 - roman_Φ ( roman_Φ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( ∥ italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ) + italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ,

where ΦΦ\Phiroman_Φ is the cumulative distribution function of the standard normal distribution. Now, because

Πn(Wa,b,qεn)Πn(Wrn,b,b,qεn)=exp(ϕ0rn,b,b,q(εn)),subscriptΠ𝑛subscriptnormsuperscript𝑊𝑎𝑏𝑞subscript𝜀𝑛subscriptΠ𝑛subscriptnormsuperscript𝑊subscript𝑟𝑛𝑏𝑏𝑞subscript𝜀𝑛superscriptsubscriptitalic-ϕ0subscript𝑟𝑛𝑏𝑏𝑞subscript𝜀𝑛\Pi_{n}\left(\left\|{W^{a,b,q}}\right\|_{\infty}\leq\varepsilon_{n}\right)\ % \geq\ \Pi_{n}\left(\left\|{W^{r_{n,b},b,q}}\right\|_{\infty}\leq\varepsilon_{n% }\right)\ =\ \exp\left(-\phi_{0}^{r_{n,b},b,q}(\varepsilon_{n})\right),roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( ∥ italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ≥ roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( ∥ italic_W start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT , italic_b , italic_q end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) = roman_exp ( - italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT , italic_b , italic_q end_POSTSUPERSCRIPT ( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ) ,

we have

Πn(Wa,b,q𝔹n) 1Φ(Φ1(eϕ0rn,b,b,q(εn))+Mn).subscriptΠ𝑛superscript𝑊𝑎𝑏𝑞subscript𝔹𝑛1ΦsuperscriptΦ1superscript𝑒superscriptsubscriptitalic-ϕ0subscript𝑟𝑛𝑏𝑏𝑞subscript𝜀𝑛subscript𝑀𝑛\Pi_{n}(W^{a,b,q}\notin\mathbb{B}_{n})\ \leq\ 1-\Phi\left(\Phi^{-1}\left(e^{-% \phi_{0}^{r_{n,b},b,q}(\varepsilon_{n})}\right)+M_{n}\right).roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT ∉ blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ≤ 1 - roman_Φ ( roman_Φ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUPERSCRIPT - italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT , italic_b , italic_q end_POSTSUPERSCRIPT ( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_POSTSUPERSCRIPT ) + italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) .

For n𝑛nitalic_n large enough, we have εnmin{ε0a0,b:b1,dmax}subscript𝜀𝑛:superscriptsubscript𝜀0subscript𝑎0𝑏𝑏1subscript𝑑max\varepsilon_{n}\leq\min\{\varepsilon_{0}^{a_{0},b}:b\in\llbracket 1,d_{\mathrm% {max}}\rrbracket\}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≤ roman_min { italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_b end_POSTSUPERSCRIPT : italic_b ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ⟧ } and rn,ba0subscript𝑟𝑛𝑏subscript𝑎0r_{n,b}\geq a_{0}italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT ≥ italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, so according to Lemma 5.7 and because bdmax𝑏subscript𝑑maxb\leq d_{\mathrm{max}}italic_b ≤ italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT, we have

ϕ0rn,b,b,q(εn)rn,bblog(rn,bεn)b+1rn,bb(logn)b+1nεn2,\phi_{0}^{r_{n,b},b,q}(\varepsilon_{n})\ \lesssim\ r_{n,b}^{b}\log\left(\frac{% r_{n,b}}{\varepsilon_{n}}\right)^{b+1}\ \lesssim\ r_{n,b}^{b}(\log n)^{b+1}\ % \lesssim\ n\varepsilon_{n}^{2},italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT , italic_b , italic_q end_POSTSUPERSCRIPT ( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ≲ italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT roman_log ( divide start_ARG italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT end_ARG start_ARG italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT italic_b + 1 end_POSTSUPERSCRIPT ≲ italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT ( roman_log italic_n ) start_POSTSUPERSCRIPT italic_b + 1 end_POSTSUPERSCRIPT ≲ italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ,

for sufficiently large n𝑛nitalic_n. So by taking Mn2superscriptsubscript𝑀𝑛2M_{n}^{2}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT a very large multiple of nεn2𝑛superscriptsubscript𝜀𝑛2n\varepsilon_{n}^{2}italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, we can reach Mn4ϕ0rn,b,b,q(εn)subscript𝑀𝑛4superscriptsubscriptitalic-ϕ0subscript𝑟𝑛𝑏𝑏𝑞subscript𝜀𝑛M_{n}\geq 4\sqrt{\phi_{0}^{r_{n,b},b,q}(\varepsilon_{n})}italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≥ 4 square-root start_ARG italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT , italic_b , italic_q end_POSTSUPERSCRIPT ( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) end_ARG. The second assertion of Lemma 4.10 in [VV09] gives Mn2Φ1(exp(ϕ0rn,b,b,q(εn)))subscript𝑀𝑛2superscriptΦ1superscriptsubscriptitalic-ϕ0subscript𝑟𝑛𝑏𝑏𝑞subscript𝜀𝑛M_{n}\geq-2\Phi^{-1}\left(\exp\bigl{(}-\phi_{0}^{r_{n,b},b,q}(\varepsilon_{n})% \bigr{)}\right)italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≥ - 2 roman_Φ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( roman_exp ( - italic_ϕ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT , italic_b , italic_q end_POSTSUPERSCRIPT ( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ) ) which leads to the upper bound

Πn(Wa,b,q𝔹n) 1Φ(Mn/2)exp(Mn2/8).subscriptΠ𝑛superscript𝑊𝑎𝑏𝑞subscript𝔹𝑛1Φsubscript𝑀𝑛2superscriptsubscript𝑀𝑛28\Pi_{n}(W^{a,b,q}\notin\mathbb{B}_{n})\ \leq\ 1-\Phi(M_{n}/2)\ \leq\ \exp(-M_{% n}^{2}/8).roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT ∉ blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ≤ 1 - roman_Φ ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT / 2 ) ≤ roman_exp ( - italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 8 ) .

Taking into account the random rescaling parameter A𝐴Aitalic_A, we have, for sufficiently large n𝑛nitalic_n,

Πn(WA,b,q𝔹n)subscriptΠ𝑛superscript𝑊𝐴𝑏𝑞subscript𝔹𝑛\displaystyle\Pi_{n}(W^{A,b,q}\notin\mathbb{B}_{n})\ roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_W start_POSTSUPERSCRIPT italic_A , italic_b , italic_q end_POSTSUPERSCRIPT ∉ blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) crn,bΠn(Wa,b,q𝔹n)πn,b(a)𝑑a+πn,b(Arn,b)absentsuperscriptsubscript𝑐subscript𝑟𝑛𝑏subscriptΠ𝑛superscript𝑊𝑎𝑏𝑞subscript𝔹𝑛subscript𝜋𝑛𝑏𝑎differential-d𝑎subscript𝜋𝑛𝑏𝐴subscript𝑟𝑛𝑏\displaystyle\leq\ \int_{c}^{r_{n,b}}\Pi_{n}(W^{a,b,q}\notin\mathbb{B}_{n})\pi% _{n,b}(a)da\ +\ \pi_{n,b}(A\geq r_{n,b})≤ ∫ start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT end_POSTSUPERSCRIPT roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT ∉ blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) italic_π start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT ( italic_a ) italic_d italic_a + italic_π start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT ( italic_A ≥ italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT )
(Assumption 3.4)Assumption 3.4\displaystyle(\text{Assumption }\ref{ass:resc})\quad( Assumption ) exp(Mn2/8)+D2rn,bexp(C2ab(loga)b+1)𝑑aabsentsuperscriptsubscript𝑀𝑛28subscript𝐷2superscriptsubscriptsubscript𝑟𝑛𝑏subscript𝐶2superscript𝑎𝑏superscript𝑎𝑏1differential-d𝑎\displaystyle\leq\ \exp(-M_{n}^{2}/8)\ +\ D_{2}\int_{r_{n,b}}^{\infty}\exp% \left(-C_{2}a^{b}(\log a)^{b+1}\right)da≤ roman_exp ( - italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 8 ) + italic_D start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT roman_exp ( - italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT ( roman_log italic_a ) start_POSTSUPERSCRIPT italic_b + 1 end_POSTSUPERSCRIPT ) italic_d italic_a
exp(Mn2/8)+D2rn,bC2ab1((b+1)logba+blogb+1a)exp(C2ab(loga)b+1)𝑑aabsentsuperscriptsubscript𝑀𝑛28subscript𝐷2superscriptsubscriptsubscript𝑟𝑛𝑏subscript𝐶2superscript𝑎𝑏1𝑏1superscript𝑏𝑎𝑏superscript𝑏1𝑎subscript𝐶2superscript𝑎𝑏superscript𝑎𝑏1differential-d𝑎\displaystyle\leq\ \exp(-M_{n}^{2}/8)\ +\ D_{2}\int_{r_{n,b}}^{\infty}C_{2}a^{% b-1}((b+1)\log^{b}a+b\log^{b+1}a)\exp\left(-C_{2}a^{b}(\log a)^{b+1}\right)da≤ roman_exp ( - italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 8 ) + italic_D start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∫ start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_b - 1 end_POSTSUPERSCRIPT ( ( italic_b + 1 ) roman_log start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT italic_a + italic_b roman_log start_POSTSUPERSCRIPT italic_b + 1 end_POSTSUPERSCRIPT italic_a ) roman_exp ( - italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT ( roman_log italic_a ) start_POSTSUPERSCRIPT italic_b + 1 end_POSTSUPERSCRIPT ) italic_d italic_a
exp(Mn2/8)+D2exp(C2rn,bb(logrn,b)b+1)absentsuperscriptsubscript𝑀𝑛28subscript𝐷2subscript𝐶2superscriptsubscript𝑟𝑛𝑏𝑏superscriptsubscript𝑟𝑛𝑏𝑏1\displaystyle\leq\ \exp(-M_{n}^{2}/8)\ +\ D_{2}\exp\left(-C_{2}r_{n,b}^{b}(% \log r_{n,b})^{b+1}\right)≤ roman_exp ( - italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT / 8 ) + italic_D start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT roman_exp ( - italic_C start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT ( roman_log italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_b + 1 end_POSTSUPERSCRIPT )
12exp(5nεn2)+12exp(5nεn2)absent125𝑛superscriptsubscript𝜀𝑛2125𝑛superscriptsubscript𝜀𝑛2\displaystyle\leq\ \frac{1}{2}\exp(-5n\varepsilon_{n}^{2})\ +\ \frac{1}{2}\exp% (-5n\varepsilon_{n}^{2})≤ divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_exp ( - 5 italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) + divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_exp ( - 5 italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT )
=exp(5nεn2),absent5𝑛superscriptsubscript𝜀𝑛2\displaystyle=\ \exp(-5n\varepsilon_{n}^{2}),= roman_exp ( - 5 italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ,

where the last inequality holds because Crsubscript𝐶𝑟C_{r}italic_C start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT and CMsubscript𝐶𝑀C_{M}italic_C start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT are supposed to be large enough.

Now considering the prior on the sparsity pattern, we obtain

Πn(WA,Γ,Θ𝔹n)subscriptΠ𝑛superscript𝑊𝐴ΓΘsubscript𝔹𝑛\displaystyle\Pi_{n}(W^{A,\Gamma,\Theta}\notin\mathbb{B}_{n})\ roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_W start_POSTSUPERSCRIPT italic_A , roman_Γ , roman_Θ end_POSTSUPERSCRIPT ∉ blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) b=1dmaxΠn(Γ=b)𝒪dnΠn(WA,b,q𝔹n)𝑑qexp(5nεn2).absentsuperscriptsubscript𝑏1subscript𝑑maxsubscriptΠ𝑛Γ𝑏subscriptsubscript𝒪subscript𝑑𝑛subscriptΠ𝑛superscript𝑊𝐴𝑏𝑞subscript𝔹𝑛differential-d𝑞5𝑛superscriptsubscript𝜀𝑛2\displaystyle\leq\ \sum_{b=1}^{d_{\mathrm{max}}}\Pi_{n}(\Gamma=b)\int_{% \mathcal{O}_{d_{n}}}\Pi_{n}(W^{A,b,q}\notin\mathbb{B}_{n})dq\ \leq\ \exp(-5n% \varepsilon_{n}^{2}).≤ ∑ start_POSTSUBSCRIPT italic_b = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_POSTSUPERSCRIPT roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( roman_Γ = italic_b ) ∫ start_POSTSUBSCRIPT caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_W start_POSTSUPERSCRIPT italic_A , italic_b , italic_q end_POSTSUPERSCRIPT ∉ blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) italic_d italic_q ≤ roman_exp ( - 5 italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) .

5.1.4 Entropy condition (5.4)

We use again the notation and quantities of the previous section. According to Lemma 5.6, for all q𝒪dn𝑞subscript𝒪subscript𝑑𝑛q\in\mathcal{O}_{d_{n}}italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT and b1,dmax𝑏1subscript𝑑maxb\in\llbracket 1,d_{\mathrm{max}}\rrbracketitalic_b ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ⟧, the metric entropy of n,b,qsubscript𝑛𝑏𝑞\mathcal{B}_{n,b,q}caligraphic_B start_POSTSUBSCRIPT italic_n , italic_b , italic_q end_POSTSUBSCRIPT is bounded as:

logN(2εn,Mnrn,b1rn,b,b,q+εnB1,)\displaystyle\log N\left(2\varepsilon_{n},M_{n}\sqrt{r_{n,b}}\mathbb{H}_{1}^{r% _{n,b},b,q}+\varepsilon_{n}B_{1},\left\|{\cdot}\right\|_{\infty}\right)\ roman_log italic_N ( 2 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT square-root start_ARG italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT end_ARG blackboard_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT , italic_b , italic_q end_POSTSUPERSCRIPT + italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ) logN(εn,Mnrn,b1rn,b,b,q,),\displaystyle\leq\ \log N\left(\varepsilon_{n},M_{n}\sqrt{r_{n,b}}\mathbb{H}_{% 1}^{r_{n,b},b,q},\left\|{\cdot}\right\|_{\infty}\right),≤ roman_log italic_N ( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT square-root start_ARG italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT end_ARG blackboard_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT , italic_b , italic_q end_POSTSUPERSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ) ,
rn,bblog(Mnrn,bεn1)b+1.\displaystyle\lesssim\ r_{n,b}^{b}\log\left(M_{n}\sqrt{r_{n,b}}\varepsilon_{n}% ^{-1}\right)^{b+1}.≲ italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT roman_log ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT square-root start_ARG italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT end_ARG italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT italic_b + 1 end_POSTSUPERSCRIPT .

The simple estimation log(Mnrn,bεn1)lognasymptotically-equalssubscript𝑀𝑛subscript𝑟𝑛𝑏superscriptsubscript𝜀𝑛1𝑛\log\left(M_{n}\sqrt{r_{n,b}}\varepsilon_{n}^{-1}\right)\asymp\log nroman_log ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT square-root start_ARG italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT end_ARG italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) ≍ roman_log italic_n gives then

(5.10) logN(2εn,n,b,q,)nεn2.\log N\left(2\varepsilon_{n},\mathcal{B}_{n,b,q},\left\|{\cdot}\right\|_{% \infty}\right)\ \lesssim\ n\varepsilon_{n}^{2}.roman_log italic_N ( 2 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , caligraphic_B start_POSTSUBSCRIPT italic_n , italic_b , italic_q end_POSTSUBSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ) ≲ italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT .

The metric entropy of n,qsubscript𝑛𝑞\mathcal{B}_{n,q}caligraphic_B start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT is derived as follows:

N(2εn,n,q,)b=1dmaxN(2εn,n,b,q,)dmaxmax1bdmaxN(2εn,n,b,q,).N\left(2\varepsilon_{n},\mathcal{B}_{n,q},\left\|{\cdot}\right\|_{\infty}% \right)\ \leq\ \sum_{b=1}^{d_{\mathrm{max}}}N\left(2\varepsilon_{n},\mathcal{B% }_{n,b,q},\left\|{\cdot}\right\|_{\infty}\right)\ \leq\ d_{\mathrm{max}}\max_{% 1\leq b\leq d_{\mathrm{max}}}N\left(2\varepsilon_{n},\mathcal{B}_{n,b,q},\left% \|{\cdot}\right\|_{\infty}\right).italic_N ( 2 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , caligraphic_B start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ) ≤ ∑ start_POSTSUBSCRIPT italic_b = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_N ( 2 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , caligraphic_B start_POSTSUBSCRIPT italic_n , italic_b , italic_q end_POSTSUBSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ) ≤ italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT 1 ≤ italic_b ≤ italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_N ( 2 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , caligraphic_B start_POSTSUBSCRIPT italic_n , italic_b , italic_q end_POSTSUBSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ) .

To extend these inequalities to the full sieve, we need the following lemma from [Tok11].

Lemma 5.2 (Tokdar 2011, Lemma 1).

Let a>0𝑎0a>0italic_a > 0, b<dn𝑏subscript𝑑𝑛b<d_{n}italic_b < italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT and q,q~𝒪dn𝑞normal-~𝑞subscript𝒪subscript𝑑𝑛q,\tilde{q}\in\mathcal{O}_{d_{n}}italic_q , over~ start_ARG italic_q end_ARG ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Then

1a,b,q1a,b,q~+a2b|qq~|B1,superscriptsubscript1𝑎𝑏𝑞superscriptsubscript1𝑎𝑏~𝑞𝑎2𝑏norm𝑞~𝑞subscript𝐵1\mathbb{H}_{1}^{a,b,q}\ \subseteq\ \mathbb{H}_{1}^{a,b,\tilde{q}}\ +\ a\sqrt{2% b}\cdot{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|q-\tilde{q}\right|% \kern-1.07639pt\right|\kern-1.07639pt\right|}B_{1},blackboard_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT ⊆ blackboard_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a , italic_b , over~ start_ARG italic_q end_ARG end_POSTSUPERSCRIPT + italic_a square-root start_ARG 2 italic_b end_ARG ⋅ | | | italic_q - over~ start_ARG italic_q end_ARG | | | italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ,

where B1subscript𝐵1B_{1}italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is the unit ball in (𝒞0(𝕌dn),)(\mathcal{C}^{0}(\mathbb{U}_{d_{n}}),\left\|{\cdot}\right\|_{\infty})( caligraphic_C start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ).

By examining the representation result in (3.2) for a,b,qsubscript𝑎𝑏𝑞\mathbb{H}_{a,b,q}blackboard_H start_POSTSUBSCRIPT italic_a , italic_b , italic_q end_POSTSUBSCRIPT, we see that, for all q𝒪dn(q1(E𝕓))superscript𝑞subscript𝒪subscript𝑑𝑛superscript𝑞1subscript𝐸𝕓q^{\prime}\in\mathcal{O}_{d_{n}}(q^{-1}(E_{\mathbb{b}}))italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT blackboard_b end_POSTSUBSCRIPT ) ), we have a,b,q=a,b,qqsubscript𝑎𝑏𝑞subscript𝑎𝑏𝑞superscript𝑞\mathbb{H}_{a,b,q}=\mathbb{H}_{a,b,qq^{\prime}}blackboard_H start_POSTSUBSCRIPT italic_a , italic_b , italic_q end_POSTSUBSCRIPT = blackboard_H start_POSTSUBSCRIPT italic_a , italic_b , italic_q italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT. Hence, Lemma 5.2 gives

1a,b,q1a,b,q~q+a2b|qq~|B1.superscriptsubscript1𝑎𝑏𝑞superscriptsubscript1𝑎𝑏~𝑞superscript𝑞𝑎2𝑏norm𝑞~𝑞subscript𝐵1\mathbb{H}_{1}^{a,b,q}\ \subseteq\ \mathbb{H}_{1}^{a,b,\tilde{q}q^{\prime}}\ +% \ a\sqrt{2b}\cdot{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|q-\tilde{q}% \right|\kern-1.07639pt\right|\kern-1.07639pt\right|}B_{1}.blackboard_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a , italic_b , italic_q end_POSTSUPERSCRIPT ⊆ blackboard_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a , italic_b , over~ start_ARG italic_q end_ARG italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT + italic_a square-root start_ARG 2 italic_b end_ARG ⋅ | | | italic_q - over~ start_ARG italic_q end_ARG | | | italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT .

If nsubscript𝑛\mathcal{R}_{n}caligraphic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is a net over 𝒪dnsubscript𝒪subscript𝑑𝑛\mathcal{O}_{d_{n}}caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT such that for all q𝒪dn𝑞subscript𝒪subscript𝑑𝑛q\in\mathcal{O}_{d_{n}}italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT, there exist q𝒪dn(q1(E𝐝𝟎))superscript𝑞subscript𝒪subscript𝑑𝑛superscript𝑞1subscript𝐸subscript𝐝0q^{\prime}\in\mathcal{O}_{d_{n}}(q^{-1}(E_{\mathbf{d_{0}}}))italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) and q¯n¯𝑞subscript𝑛\overline{q}\in\mathcal{R}_{n}over¯ start_ARG italic_q end_ARG ∈ caligraphic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT with |qqq¯|ζnnorm𝑞superscript𝑞¯𝑞subscript𝜁𝑛{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|qq^{\prime}-\overline{q}% \right|\kern-1.07639pt\right|\kern-1.07639pt\right|}\leq\zeta_{n}| | | italic_q italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - over¯ start_ARG italic_q end_ARG | | | ≤ italic_ζ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, where ζnsubscript𝜁𝑛\zeta_{n}italic_ζ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is the minimum of εn/(Mnrn,b3/22dn)subscript𝜀𝑛subscript𝑀𝑛superscriptsubscript𝑟𝑛𝑏322subscript𝑑𝑛\varepsilon_{n}/(M_{n}r_{n,b}^{3/2}\sqrt{2d_{n}})italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT / ( italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 / 2 end_POSTSUPERSCRIPT square-root start_ARG 2 italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ) when b𝑏bitalic_b runs through 1,dmax1subscript𝑑max\llbracket 1,d_{\mathrm{max}}\rrbracket⟦ 1 , italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ⟧, then

Mnrn,b1rn,b,b,qsubscript𝑀𝑛subscript𝑟𝑛𝑏superscriptsubscript1subscript𝑟𝑛𝑏𝑏𝑞\displaystyle M_{n}\sqrt{r_{n,b}}\cdot\mathbb{H}_{1}^{r_{n,b},b,q}\ italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT square-root start_ARG italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT end_ARG ⋅ blackboard_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT , italic_b , italic_q end_POSTSUPERSCRIPT Mnrn,b1rn,b,b,q¯+Mnrn,b3/22b|qqq¯|B1absentsubscript𝑀𝑛subscript𝑟𝑛𝑏superscriptsubscript1subscript𝑟𝑛𝑏𝑏¯𝑞subscript𝑀𝑛superscriptsubscript𝑟𝑛𝑏322𝑏norm𝑞superscript𝑞¯𝑞subscript𝐵1\displaystyle\subseteq\ M_{n}\sqrt{r_{n,b}}\cdot\mathbb{H}_{1}^{r_{n,b},b,% \overline{q}}\ +\ M_{n}r_{n,b}^{3/2}\sqrt{2b}\cdot{\left|\kern-1.07639pt\left|% \kern-1.07639pt\left|qq^{\prime}-\overline{q}\right|\kern-1.07639pt\right|% \kern-1.07639pt\right|}B_{1}⊆ italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT square-root start_ARG italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT end_ARG ⋅ blackboard_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT , italic_b , over¯ start_ARG italic_q end_ARG end_POSTSUPERSCRIPT + italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 / 2 end_POSTSUPERSCRIPT square-root start_ARG 2 italic_b end_ARG ⋅ | | | italic_q italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - over¯ start_ARG italic_q end_ARG | | | italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT
Mnrn,b1rn,b,b,q¯+εnB1absentsubscript𝑀𝑛subscript𝑟𝑛𝑏superscriptsubscript1subscript𝑟𝑛𝑏𝑏¯𝑞subscript𝜀𝑛subscript𝐵1\displaystyle\subseteq\ M_{n}\sqrt{r_{n,b}}\cdot\mathbb{H}_{1}^{r_{n,b},b,% \overline{q}}\ +\ \varepsilon_{n}B_{1}⊆ italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT square-root start_ARG italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT end_ARG ⋅ blackboard_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT , italic_b , over¯ start_ARG italic_q end_ARG end_POSTSUPERSCRIPT + italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT
=n,b,q¯.absentsubscript𝑛𝑏¯𝑞\displaystyle=\ \mathcal{B}_{n,b,\overline{q}}.= caligraphic_B start_POSTSUBSCRIPT italic_n , italic_b , over¯ start_ARG italic_q end_ARG end_POSTSUBSCRIPT .

This clearly implies

n,qn,q¯+εnB1,subscript𝑛𝑞subscript𝑛¯𝑞subscript𝜀𝑛subscript𝐵1\mathcal{B}_{n,q}\ \subseteq\ \mathcal{B}_{n,\overline{q}}+\varepsilon_{n}B_{1},caligraphic_B start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT ⊆ caligraphic_B start_POSTSUBSCRIPT italic_n , over¯ start_ARG italic_q end_ARG end_POSTSUBSCRIPT + italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ,

and hence

𝔹n=q𝒪dnn,qq¯n(n,q¯+εnB1).subscript𝔹𝑛subscript𝑞subscript𝒪subscript𝑑𝑛subscript𝑛𝑞subscript¯𝑞subscript𝑛subscript𝑛¯𝑞subscript𝜀𝑛subscript𝐵1\mathbb{B}_{n}\ =\ \bigcup_{q\in\mathcal{O}_{d_{n}}}\mathcal{B}_{n,q}\ % \subseteq\ \bigcup_{\overline{q}\in\mathcal{R}_{n}}\left(\mathcal{B}_{n,% \overline{q}}+\varepsilon_{n}B_{1}\right).blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = ⋃ start_POSTSUBSCRIPT italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT caligraphic_B start_POSTSUBSCRIPT italic_n , italic_q end_POSTSUBSCRIPT ⊆ ⋃ start_POSTSUBSCRIPT over¯ start_ARG italic_q end_ARG ∈ caligraphic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( caligraphic_B start_POSTSUBSCRIPT italic_n , over¯ start_ARG italic_q end_ARG end_POSTSUBSCRIPT + italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) .

Consequently, the 3εn3subscript𝜀𝑛3\varepsilon_{n}3 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT-entropy of 𝔹nsubscript𝔹𝑛\mathbb{B}_{n}blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT can be bounded by the cardinal of the net nsubscript𝑛\mathcal{R}_{n}caligraphic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT times the maximal 2εn2subscript𝜀𝑛2\varepsilon_{n}2 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT-entropy of sets n,b,qsubscript𝑛𝑏𝑞\mathcal{B}_{n,b,q}caligraphic_B start_POSTSUBSCRIPT italic_n , italic_b , italic_q end_POSTSUBSCRIPT:

N(3εn,𝔹n,)\displaystyle N\left(3\varepsilon_{n},\mathbb{B}_{n},\left\|{\cdot}\right\|_{% \infty}\right)\ italic_N ( 3 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ) q¯nN(3εn,n,q¯+εnB1,)\displaystyle\leq\ \sum_{\overline{q}\in\mathcal{R}_{n}}N\left(3\varepsilon_{n% },\mathcal{B}_{n,\overline{q}}+\varepsilon_{n}B_{1},\left\|{\cdot}\right\|_{% \infty}\right)≤ ∑ start_POSTSUBSCRIPT over¯ start_ARG italic_q end_ARG ∈ caligraphic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_N ( 3 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , caligraphic_B start_POSTSUBSCRIPT italic_n , over¯ start_ARG italic_q end_ARG end_POSTSUBSCRIPT + italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT )
q¯nN(2εn,n,q¯,)\displaystyle\leq\ \sum_{\overline{q}\in\mathcal{R}_{n}}N\left(2\varepsilon_{n% },\mathcal{B}_{n,\overline{q}},\left\|{\cdot}\right\|_{\infty}\right)≤ ∑ start_POSTSUBSCRIPT over¯ start_ARG italic_q end_ARG ∈ caligraphic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_N ( 2 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , caligraphic_B start_POSTSUBSCRIPT italic_n , over¯ start_ARG italic_q end_ARG end_POSTSUBSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT )
|n|dmaxmax1bdmaxq¯nN(2εn,n,b,q,).\displaystyle\leq\ \left|\mathcal{R}_{n}\right|\cdot d_{\mathrm{max}}\max_{% \begin{subarray}{c}1\leq b\leq d_{\mathrm{max}}\\[1.9919pt] \overline{q}\in\mathcal{R}_{n}\end{subarray}}N\left(2\varepsilon_{n},\mathcal{% B}_{n,b,q},\left\|{\cdot}\right\|_{\infty}\right).≤ | caligraphic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT | ⋅ italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 1 ≤ italic_b ≤ italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL over¯ start_ARG italic_q end_ARG ∈ caligraphic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_CELL end_ROW end_ARG end_POSTSUBSCRIPT italic_N ( 2 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , caligraphic_B start_POSTSUBSCRIPT italic_n , italic_b , italic_q end_POSTSUBSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ) .

It only remains to bound the cardinal of nsubscript𝑛\mathcal{R}_{n}caligraphic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT.

Lemma 5.3.

For ζ>0𝜁0\zeta>0italic_ζ > 0, there exists a net \mathcal{R}caligraphic_R over 𝒪dnsubscript𝒪subscript𝑑𝑛\mathcal{O}_{d_{n}}caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT such that

q¯𝒜q¯=𝒪dn,subscript¯𝑞subscript𝒜¯𝑞subscript𝒪subscript𝑑𝑛\bigcup_{\overline{q}\in\mathcal{R}}\mathcal{A}_{\overline{q}}\ =\ \mathcal{O}% _{d_{n}},⋃ start_POSTSUBSCRIPT over¯ start_ARG italic_q end_ARG ∈ caligraphic_R end_POSTSUBSCRIPT caligraphic_A start_POSTSUBSCRIPT over¯ start_ARG italic_q end_ARG end_POSTSUBSCRIPT = caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ,

where

𝒜q¯:={q𝒪dn|q𝒪dn(q1(E𝐝𝟎)),|qqq¯|ζ},assignsubscript𝒜¯𝑞conditional-set𝑞subscript𝒪subscript𝑑𝑛formulae-sequencesuperscript𝑞subscript𝒪subscript𝑑𝑛superscript𝑞1subscript𝐸subscript𝐝0norm𝑞superscript𝑞¯𝑞𝜁\mathcal{A}_{\overline{q}}\ :=\ \{q\in\mathcal{O}_{d_{n}}\ |\ \exists q^{% \prime}\in\mathcal{O}_{d_{n}}(q^{-1}(E_{\mathbf{d_{0}}})),\ {\left|\kern-1.076% 39pt\left|\kern-1.07639pt\left|qq^{\prime}-\overline{q}\right|\kern-1.07639pt% \right|\kern-1.07639pt\right|}\leq\zeta\},caligraphic_A start_POSTSUBSCRIPT over¯ start_ARG italic_q end_ARG end_POSTSUBSCRIPT := { italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT | ∃ italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) , | | | italic_q italic_q start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT - over¯ start_ARG italic_q end_ARG | | | ≤ italic_ζ } ,

and such that

||(πd0dn2)d0(16d0dnζ)d0(dn+d02).superscript𝜋subscript𝑑0subscript𝑑𝑛2subscript𝑑0superscript16subscript𝑑0subscript𝑑𝑛𝜁subscript𝑑0subscript𝑑𝑛subscript𝑑02\left|\mathcal{R}\right|\ \leq\ \left(\frac{\pi\sqrt{d_{0}d_{n}}}{2}\right)^{d% _{0}}\left(\frac{16\sqrt{d_{0}d_{n}}}{\zeta}\right)^{d_{0}(d_{n}+d_{0}-2)}.| caligraphic_R | ≤ ( divide start_ARG italic_π square-root start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG start_ARG 2 end_ARG ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( divide start_ARG 16 square-root start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG start_ARG italic_ζ end_ARG ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 2 ) end_POSTSUPERSCRIPT .
Proof.

Firstly, we remark that

𝒜q¯={q𝒪dn|q′′𝒪dn,q|q1(E𝐝𝟎)′′=q|q1(E𝐝𝟎)and|q′′q¯|ζ}.\mathcal{A}_{\overline{q}}\ =\ \bigl{\{}q\in\mathcal{O}_{d_{n}}\ |\ \exists q^% {\prime\prime}\in\mathcal{O}_{d_{n}},\ q^{\prime\prime}_{|q^{-1}(E_{\mathbf{d_% {0}}})}=q_{|q^{-1}(E_{\mathbf{d_{0}}})}\ \text{and}\ {\left|\kern-1.07639pt% \left|\kern-1.07639pt\left|q^{\prime\prime}-\overline{q}\right|\kern-1.07639pt% \right|\kern-1.07639pt\right|}\leq\zeta\bigr{\}}.caligraphic_A start_POSTSUBSCRIPT over¯ start_ARG italic_q end_ARG end_POSTSUBSCRIPT = { italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT | ∃ italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT | italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT = italic_q start_POSTSUBSCRIPT | italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT and | | | italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT - over¯ start_ARG italic_q end_ARG | | | ≤ italic_ζ } .

Thus, for q𝒪dn𝑞subscript𝒪subscript𝑑𝑛q\in\mathcal{O}_{d_{n}}italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT, we search to construct q¯¯𝑞\overline{q}over¯ start_ARG italic_q end_ARG such that there exists q′′𝒪dnsuperscript𝑞′′subscript𝒪subscript𝑑𝑛q^{\prime\prime}\in\mathcal{O}_{d_{n}}italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT satisfying q|q1(E𝐝𝟎)′′=q|q1(E𝐝𝟎)q^{\prime\prime}_{|q^{-1}(E_{\mathbf{d_{0}}})}=q_{|q^{-1}(E_{\mathbf{d_{0}}})}italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT | italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT = italic_q start_POSTSUBSCRIPT | italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT and |q′′q¯|ζnormsuperscript𝑞′′¯𝑞𝜁{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|q^{\prime\prime}-\overline{q}% \right|\kern-1.07639pt\right|\kern-1.07639pt\right|}\leq\zeta| | | italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT - over¯ start_ARG italic_q end_ARG | | | ≤ italic_ζ. Let (u1,,ud0,ud0+1,,udn)subscript𝑢1subscript𝑢subscript𝑑0subscript𝑢subscript𝑑01subscript𝑢subscript𝑑𝑛(u_{1},\ldots,u_{d_{0}},u_{d_{0}+1},\ldots,u_{d_{n}})( italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) be an orthonormal basis adapted to the direct sum dn=superscriptsubscript𝑑𝑛absent\mathbb{R}^{d_{n}}=blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT = (q)1(E𝐝𝟎)superscript𝑞1subscript𝐸subscript𝐝0(q)^{-1}(E_{\mathbf{d_{0}}})\ ( italic_q ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) (q)1(E1𝐝𝟎)perpendicular-todirect-sumsuperscript𝑞1subscript𝐸1subscript𝐝0\overset{\perp}{\bigoplus}\ (q)^{-1}(E_{1-\mathbf{d_{0}}})over⟂ start_ARG ⨁ end_ARG ( italic_q ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT 1 - bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ). We introduce \mathcal{F}caligraphic_F a set of orthonormal basis of E𝐝𝟎subscript𝐸subscript𝐝0E_{\mathbf{d_{0}}}italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT such that, for all orthonormal basis fsuperscript𝑓f^{\prime}italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT of E𝐝𝟎subscript𝐸subscript𝐝0E_{\mathbf{d_{0}}}italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, there exists f𝑓f\in\mathcal{F}italic_f ∈ caligraphic_F such that

supi1,d0fifiζ2d0dn,subscriptsupremum𝑖1subscript𝑑0normsubscript𝑓𝑖subscriptsuperscript𝑓𝑖𝜁2subscript𝑑0subscript𝑑𝑛\sup_{i\in\llbracket 1,d_{0}\rrbracket}\left\|{f_{i}-f^{\prime}_{i}}\right\|\ % \leq\ \frac{\zeta}{2\sqrt{d_{0}d_{n}}},roman_sup start_POSTSUBSCRIPT italic_i ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⟧ end_POSTSUBSCRIPT ∥ italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ ≤ divide start_ARG italic_ζ end_ARG start_ARG 2 square-root start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG ,

and we reuse the set 𝒢𝒢\mathcal{G}caligraphic_G of Lemma 5.1, replacing ε𝜀\varepsilonitalic_ε by ζ𝜁\zetaitalic_ζ. For all g𝒢𝑔𝒢g\in\mathcal{G}italic_g ∈ caligraphic_G and f𝑓f\in\mathcal{F}italic_f ∈ caligraphic_F, we fix an isometry rg,f𝒪dnsubscript𝑟𝑔𝑓subscript𝒪subscript𝑑𝑛r_{g,f}\in\mathcal{O}_{d_{n}}italic_r start_POSTSUBSCRIPT italic_g , italic_f end_POSTSUBSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT such that rg,f(gi)=fisubscript𝑟𝑔𝑓subscript𝑔𝑖subscript𝑓𝑖r_{g,f}(g_{i})=f_{i}italic_r start_POSTSUBSCRIPT italic_g , italic_f end_POSTSUBSCRIPT ( italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, for all i1,d0𝑖1subscript𝑑0i\in\llbracket 1,d_{0}\rrbracketitalic_i ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⟧. By construction, there exist f𝑓f\in\mathcal{F}italic_f ∈ caligraphic_F and g𝒢𝑔𝒢g\in\mathcal{G}italic_g ∈ caligraphic_G such that

supi1,d0fiq(ui)ζ2d0dnandsupi1,d0giuiζ2d0dn.formulae-sequencesubscriptsupremum𝑖1subscript𝑑0normsubscript𝑓𝑖𝑞subscript𝑢𝑖𝜁2subscript𝑑0subscript𝑑𝑛andsubscriptsupremum𝑖1subscript𝑑0normsubscript𝑔𝑖subscript𝑢𝑖𝜁2subscript𝑑0subscript𝑑𝑛\sup_{i\in\llbracket 1,d_{0}\rrbracket}\left\|{f_{i}-q(u_{i})}\right\|\ \leq\ % \frac{\zeta}{2\sqrt{d_{0}d_{n}}}\quad\text{and}\quad\sup_{i\in\llbracket 1,d_{% 0}\rrbracket}\left\|{g_{i}-u_{i}}\right\|\ \leq\ \frac{\zeta}{2\sqrt{d_{0}d_{n% }}}.roman_sup start_POSTSUBSCRIPT italic_i ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⟧ end_POSTSUBSCRIPT ∥ italic_f start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_q ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∥ ≤ divide start_ARG italic_ζ end_ARG start_ARG 2 square-root start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG and roman_sup start_POSTSUBSCRIPT italic_i ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⟧ end_POSTSUBSCRIPT ∥ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ ≤ divide start_ARG italic_ζ end_ARG start_ARG 2 square-root start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG .

Then we choose q¯=rg,f¯𝑞subscript𝑟𝑔𝑓\overline{q}=r_{g,f}over¯ start_ARG italic_q end_ARG = italic_r start_POSTSUBSCRIPT italic_g , italic_f end_POSTSUBSCRIPT. Using Lemma 5.8, we extend g𝑔gitalic_g to an orthonormal basis over dnsuperscriptsubscript𝑑𝑛\mathbb{R}^{d_{n}}blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT such that supj1,dngjujζ/dnsubscriptsupremum𝑗1subscript𝑑𝑛normsubscript𝑔𝑗subscript𝑢𝑗𝜁subscript𝑑𝑛\sup_{j\in\llbracket 1,d_{n}\rrbracket}\left\|{g_{j}-u_{j}}\right\|\ \leq\ % \zeta/\sqrt{d_{n}}roman_sup start_POSTSUBSCRIPT italic_j ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⟧ end_POSTSUBSCRIPT ∥ italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ ≤ italic_ζ / square-root start_ARG italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG and we define fj:=rg,f(gj)E𝐝𝟎assignsubscript𝑓𝑗subscript𝑟𝑔𝑓subscript𝑔𝑗superscriptsubscript𝐸subscript𝐝0perpendicular-tof_{j}:=r_{g,f}(g_{j})\in E_{\mathbf{d_{0}}}^{\perp}italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT := italic_r start_POSTSUBSCRIPT italic_g , italic_f end_POSTSUBSCRIPT ( italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ∈ italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT, for jd0+1,dn𝑗subscript𝑑01subscript𝑑𝑛j\in\llbracket d_{0}+1,d_{n}\rrbracketitalic_j ∈ ⟦ italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 , italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⟧. Now we choose q′′𝒪dnsuperscript𝑞′′subscript𝒪subscript𝑑𝑛q^{\prime\prime}\in\mathcal{O}_{d_{n}}italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT such that

{q′′(ui)=q(ui)if i1,d0,q′′(uj)=fjif jd0+1,dn.casessuperscript𝑞′′subscript𝑢𝑖𝑞subscript𝑢𝑖if 𝑖1subscript𝑑0superscript𝑞′′subscript𝑢𝑗subscript𝑓𝑗if 𝑗subscript𝑑01subscript𝑑𝑛\begin{cases}q^{\prime\prime}(u_{i})=q(u_{i})&\text{if }i\in\llbracket 1,d_{0}% \rrbracket,\\ q^{\prime\prime}(u_{j})=f_{j}&\text{if }j\in\llbracket d_{0}+1,d_{n}\rrbracket% .\end{cases}{ start_ROW start_CELL italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_q ( italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_CELL start_CELL if italic_i ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⟧ , end_CELL end_ROW start_ROW start_CELL italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = italic_f start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_CELL start_CELL if italic_j ∈ ⟦ italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 , italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⟧ . end_CELL end_ROW

This leads to q′′(uj)q¯(uj)ζ/dnnormsuperscript𝑞′′subscript𝑢𝑗¯𝑞subscript𝑢𝑗𝜁subscript𝑑𝑛\left\|{q^{\prime\prime}(u_{j})-\overline{q}(u_{j})}\right\|\leq\zeta/\sqrt{d_% {n}}∥ italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) - over¯ start_ARG italic_q end_ARG ( italic_u start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ∥ ≤ italic_ζ / square-root start_ARG italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG, for all j1,dn𝑗1subscript𝑑𝑛j\in\llbracket 1,d_{n}\rrbracketitalic_j ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ⟧, hence |q′′q¯|ζnormsuperscript𝑞′′¯𝑞𝜁{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|q^{\prime\prime}-\overline{q}% \right|\kern-1.07639pt\right|\kern-1.07639pt\right|}\leq\zeta| | | italic_q start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT - over¯ start_ARG italic_q end_ARG | | | ≤ italic_ζ. We can thus define the net \mathcal{R}caligraphic_R as the set of all isometries rg,fsubscript𝑟𝑔𝑓r_{g,f}italic_r start_POSTSUBSCRIPT italic_g , italic_f end_POSTSUBSCRIPT for g𝒢𝑔𝒢g\in\mathcal{G}italic_g ∈ caligraphic_G and f𝑓f\in\mathcal{F}italic_f ∈ caligraphic_F. According to Lemma 5.10, this yields the upper bound

||\displaystyle\left|\mathcal{R}\right|\ | caligraphic_R | =|𝒢|||absent𝒢\displaystyle=\ \left|\mathcal{G}\right|\cdot\left|\mathcal{F}\right|= | caligraphic_G | ⋅ | caligraphic_F |
(πdn2)d02(16d0dnζ)d0(dn1)(πd02)d02(16d0dnζ)d0(d01).absentsuperscript𝜋subscript𝑑𝑛2subscript𝑑02superscript16subscript𝑑0subscript𝑑𝑛𝜁subscript𝑑0subscript𝑑𝑛1superscript𝜋subscript𝑑02subscript𝑑02superscript16subscript𝑑0subscript𝑑𝑛𝜁subscript𝑑0subscript𝑑01\displaystyle\leq\ \left(\frac{\pi d_{n}}{2}\right)^{\tfrac{d_{0}}{2}}\left(% \frac{16\sqrt{d_{0}d_{n}}}{\zeta}\right)^{d_{0}(d_{n}-1)}\left(\frac{\pi d_{0}% }{2}\right)^{\tfrac{d_{0}}{2}}\left(\frac{16\sqrt{d_{0}d_{n}}}{\zeta}\right)^{% d_{0}(d_{0}-1)}.\qed≤ ( divide start_ARG italic_π italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG ) start_POSTSUPERSCRIPT divide start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ( divide start_ARG 16 square-root start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG start_ARG italic_ζ end_ARG ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - 1 ) end_POSTSUPERSCRIPT ( divide start_ARG italic_π italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG ) start_POSTSUPERSCRIPT divide start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT ( divide start_ARG 16 square-root start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG start_ARG italic_ζ end_ARG ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 1 ) end_POSTSUPERSCRIPT . italic_∎

Observing that the upper bound in (5.10) does not hide a constant depending on q𝑞qitalic_q, we can write

max1bdmaxq¯nN(2εn,n,b,q,)nεn2.\max_{\begin{subarray}{c}1\leq b\leq d_{\mathrm{max}}\\[1.9919pt] \overline{q}\in\mathcal{R}_{n}\end{subarray}}N\left(2\varepsilon_{n},\mathcal{% B}_{n,b,q},\left\|{\cdot}\right\|_{\infty}\right)\ \lesssim\ n\varepsilon_{n}^% {2}.roman_max start_POSTSUBSCRIPT start_ARG start_ROW start_CELL 1 ≤ italic_b ≤ italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL over¯ start_ARG italic_q end_ARG ∈ caligraphic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_CELL end_ROW end_ARG end_POSTSUBSCRIPT italic_N ( 2 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , caligraphic_B start_POSTSUBSCRIPT italic_n , italic_b , italic_q end_POSTSUBSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ) ≲ italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT .

Then, the lemma yields the following inequality:

N(3εn,𝔹n,)(πd0dn2)d0(16Mnrn3/2dn2d0εn)d0(dn+d02)dmaxnεn2,N\left(3\varepsilon_{n},\mathbb{B}_{n},\left\|{\cdot}\right\|_{\infty}\right)% \ \lesssim\ \left(\frac{\pi\sqrt{d_{0}d_{n}}}{2}\right)^{d_{0}}\left(\frac{16M% _{n}r_{n}^{3/2}d_{n}\sqrt{2d_{0}}}{\varepsilon_{n}}\right)^{d_{0}(d_{n}+d_{0}-% 2)}d_{\mathrm{max}}\cdot n\varepsilon_{n}^{2},italic_N ( 3 italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , blackboard_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ) ≲ ( divide start_ARG italic_π square-root start_ARG italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG end_ARG start_ARG 2 end_ARG ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( divide start_ARG 16 italic_M start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 / 2 end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT square-root start_ARG 2 italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG end_ARG start_ARG italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 2 ) end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ⋅ italic_n italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ,

where rn:=max{rn,b:b1,dmax}assignsubscript𝑟𝑛:subscript𝑟𝑛𝑏𝑏1subscript𝑑maxr_{n}:=\max\{r_{n,b}:b\in\llbracket 1,d_{\mathrm{max}}\rrbracket\}italic_r start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT := roman_max { italic_r start_POSTSUBSCRIPT italic_n , italic_b end_POSTSUBSCRIPT : italic_b ∈ ⟦ 1 , italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT ⟧ }, which, with the logarithm and for sufficiently large n𝑛nitalic_n, gives the desired result.

5.2 Proof of Theorem 4.1

5.2.1 Case Γ<d0Γsubscript𝑑0\Gamma<d_{0}roman_Γ < italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT

The idea of the proof is to show that the non-constancy of p0subscript𝑝0p_{0}italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT in all directions results in a significant difference (in the Hellinger sense) between the true density p*superscript𝑝p^{*}italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT and any density that is more parcimonious than p*superscript𝑝p^{*}italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT. If this difference can be bounded from below, then the set of over-parcimonious densities is expected to have an almost-null posterior mass as soon as the contraction rate falls below the lower bound.

Let q𝒪d𝑞subscript𝒪𝑑q\in\mathcal{O}_{d}italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT and let p~~𝑝\tilde{p}over~ start_ARG italic_p end_ARG be a density that satisfies the model with parameters ΓΓ\Gammaroman_Γ and q𝑞qitalic_q. Then, p~~𝑝\tilde{p}over~ start_ARG italic_p end_ARG is constant on q1(E1𝚪)+xsuperscript𝑞1subscript𝐸1𝚪𝑥q^{-1}(E_{1-\boldsymbol{\Gamma}})+xitalic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT 1 - bold_Γ end_POSTSUBSCRIPT ) + italic_x, for any x𝕌d𝑥subscript𝕌𝑑x\in\mathbb{U}_{d}italic_x ∈ blackboard_U start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT. Moreover, the intersection between 𝒮𝒮\mathcal{S}caligraphic_S and q1(E1𝚪)superscript𝑞1subscript𝐸1𝚪q^{-1}(E_{1-\boldsymbol{\Gamma}})italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT 1 - bold_Γ end_POSTSUBSCRIPT ) is non-null so p~|𝒮\tilde{p}_{|\mathcal{S}}over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT | caligraphic_S end_POSTSUBSCRIPT is constant in at least one direction, say 𝚫𝒮𝚫𝒮\boldsymbol{\Delta}\in\mathcal{S}bold_Δ ∈ caligraphic_S. We will use Assumption 4.1 and integrate the Hellinger distance over a small square inside the region where p*superscript𝑝p^{*}italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is non-constant in 𝚫𝚫\boldsymbol{\Delta}bold_Δ. As usual, we denote Δ:=Span(𝚫)assignΔSpan𝚫\Delta:=\operatorname{Span}(\boldsymbol{\Delta})roman_Δ := roman_Span ( bold_Δ ).

Let us introduce the operator

Ψ:d0:Ψsuperscriptsubscript𝑑0\displaystyle\Psi:\ \mathbb{R}^{d_{0}}roman_Ψ : blackboard_R start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT 𝒮absent𝒮\displaystyle\ \to\ \mathcal{S}→ caligraphic_S
x𝑥\displaystyle xitalic_x (q*)1(x𝐝𝟎).maps-toabsentsuperscriptsuperscript𝑞1superscript𝑥subscript𝐝0\displaystyle\ \mapsto\ (q^{*})^{-1}(x^{\mathbf{d_{0}}}).↦ ( italic_q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ) .

In particular, we have p*Ψ=p0superscript𝑝Ψsubscript𝑝0p^{*}\circ\Psi=p_{0}italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∘ roman_Ψ = italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. We use the notation of Assumption 4.1 with Ψ1(𝚫)superscriptΨ1𝚫\Psi^{-1}(\boldsymbol{\Delta})roman_Ψ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( bold_Δ ) instead of 𝚫𝚫\boldsymbol{\Delta}bold_Δ.

Let (𝚫,u1,,ud01;v1,,vdd0)𝚫subscript𝑢1subscript𝑢subscript𝑑01subscript𝑣1subscript𝑣𝑑subscript𝑑0(\boldsymbol{\Delta},u_{1},\ldots,u_{d_{0}-1};v_{1},\ldots,v_{d-d_{0}})( bold_Δ , italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT ; italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_v start_POSTSUBSCRIPT italic_d - italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) be an orthonormal basis adapted to the direct sum d=Δ(Δ𝒮)𝒮superscript𝑑direct-sumΔsuperscriptΔperpendicular-to𝒮superscript𝒮perpendicular-to\mathbb{R}^{d}=\Delta\oplus(\Delta^{\perp}\cap\mathcal{S})\oplus\mathcal{S}^{\perp}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT = roman_Δ ⊕ ( roman_Δ start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT ∩ caligraphic_S ) ⊕ caligraphic_S start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT and let R𝑅Ritalic_R be a solid square with edges parallel to this basis, of size L/d𝐿𝑑L/\sqrt{d}italic_L / square-root start_ARG italic_d end_ARG and centered on Ψ(o)Ψ𝑜\Psi(o)roman_Ψ ( italic_o ). Then, Rd(L/2)+Ψ(o)𝑅subscript𝑑𝐿2Ψ𝑜R\subset\mathcal{B}_{d}(L/2)+\Psi(o)italic_R ⊂ caligraphic_B start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( italic_L / 2 ) + roman_Ψ ( italic_o ) and the inequality of Assumption 4.1 is valid when tR𝑡𝑅t\in Ritalic_t ∈ italic_R. Considering the basis previously introduced, integrating over R𝑅Ritalic_R amounts to integrate with respect to each variables. To simplify, we bundle these variables in three groups: a variable δ𝛿\deltaitalic_δ parallel to ΔΔ\Deltaroman_Δ, a variable u𝑢uitalic_u parallel to Δ𝒮superscriptΔperpendicular-to𝒮\Delta^{\perp}\cap\mathcal{S}roman_Δ start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT ∩ caligraphic_S and a variable v𝑣vitalic_v parallel to 𝒮superscript𝒮perpendicular-to\mathcal{S}^{\perp}caligraphic_S start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT. In this coordinate system, we can write Ψ(o)=(Ψ(o)1,Ψ(o)2,0)Ψ𝑜Ψsubscript𝑜1Ψsubscript𝑜20\Psi(o)=(\Psi(o)_{1},\Psi(o)_{2},0)roman_Ψ ( italic_o ) = ( roman_Ψ ( italic_o ) start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , roman_Ψ ( italic_o ) start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , 0 ) and we have p*(δ,u,v)=p0(Ψ1(δ,u,0))superscript𝑝𝛿𝑢𝑣subscript𝑝0superscriptΨ1𝛿𝑢0p^{*}(\delta,u,v)=p_{0}(\Psi^{-1}(\delta,u,0))italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_δ , italic_u , italic_v ) = italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( roman_Ψ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_δ , italic_u , 0 ) ).Then

h2(p|R*;p~|R)=\displaystyle h^{2}(p^{*}_{|R}\,;\,\tilde{p}_{|R})\ =\ italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT | italic_R end_POSTSUBSCRIPT ; over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT | italic_R end_POSTSUBSCRIPT ) = R|p*(δ,u,0)p~(0,u,v)|2𝑑δ𝑑u𝑑vsubscripttriple-integral𝑅superscriptsuperscript𝑝𝛿𝑢0~𝑝0𝑢𝑣2differential-d𝛿differential-d𝑢differential-d𝑣\displaystyle\iiint_{R}\left|\sqrt{p^{*}(\delta,u,0)}-\sqrt{\tilde{p}(0,u,v)}% \right|^{2}d\delta\,du\,dv∭ start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT | square-root start_ARG italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_δ , italic_u , 0 ) end_ARG - square-root start_ARG over~ start_ARG italic_p end_ARG ( 0 , italic_u , italic_v ) end_ARG | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d italic_δ italic_d italic_u italic_d italic_v
=\displaystyle\ =\ = (|p0(Ψ1(δ,u,0))p~(0,u,v)|2𝑑δ)𝑑u𝑑vdouble-integralsuperscriptsubscript𝑝0superscriptΨ1𝛿𝑢0~𝑝0𝑢𝑣2differential-d𝛿differential-d𝑢differential-d𝑣\displaystyle\iint\left(\int\left|\sqrt{p_{0}(\Psi^{-1}(\delta,u,0))}-\sqrt{% \tilde{p}(0,u,v)}\right|^{2}d\delta\right)du\,dv∬ ( ∫ | square-root start_ARG italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( roman_Ψ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_δ , italic_u , 0 ) ) end_ARG - square-root start_ARG over~ start_ARG italic_p end_ARG ( 0 , italic_u , italic_v ) end_ARG | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d italic_δ ) italic_d italic_u italic_d italic_v
=\displaystyle\ =\ = h2(p0|Iu;p~(0,u,v))𝑑u𝑑v,\displaystyle\iint h^{2}\left({p_{0}}_{|I_{u}}\,;\,\tilde{p}(0,u,v)\right)du\,dv,∬ italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUBSCRIPT | italic_I start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT end_POSTSUBSCRIPT ; over~ start_ARG italic_p end_ARG ( 0 , italic_u , italic_v ) ) italic_d italic_u italic_d italic_v ,

where Iusubscript𝐼𝑢I_{u}italic_I start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT is the inverse image via ΨΨ\Psiroman_Ψ of the range of the integral in δ𝛿\deltaitalic_δ. Hence

Ψ(Iu)=(Ψ(o)1,u,0)+]L2d𝚫;L2d𝚫[with uΨ(o)2+]L2d;L2d[d01.\displaystyle\Psi(I_{u})\ =\ (\Psi(o)_{1},u,0)\ +\ \Big{]}-\frac{L}{2\sqrt{d}}% \boldsymbol{\Delta}\,;\,\frac{L}{2\sqrt{d}}\boldsymbol{\Delta}\Big{[}\qquad% \text{with }u\in\Psi(o)_{2}\ +\ \Big{]}-\frac{L}{2\sqrt{d}}\,;\,\frac{L}{2% \sqrt{d}}\Big{[}^{\,d_{0}-1}.roman_Ψ ( italic_I start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ) = ( roman_Ψ ( italic_o ) start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_u , 0 ) + ] - divide start_ARG italic_L end_ARG start_ARG 2 square-root start_ARG italic_d end_ARG end_ARG bold_Δ ; divide start_ARG italic_L end_ARG start_ARG 2 square-root start_ARG italic_d end_ARG end_ARG bold_Δ [ with italic_u ∈ roman_Ψ ( italic_o ) start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + ] - divide start_ARG italic_L end_ARG start_ARG 2 square-root start_ARG italic_d end_ARG end_ARG ; divide start_ARG italic_L end_ARG start_ARG 2 square-root start_ARG italic_d end_ARG end_ARG [ start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT .

Then because Ψ1(Ψ(o)1,u,0)o+d0(L/2)superscriptΨ1Ψsubscript𝑜1𝑢0𝑜subscriptsubscript𝑑0𝐿2\Psi^{-1}(\Psi(o)_{1},u,0)\in o+\mathcal{B}_{d_{0}}(L/2)roman_Ψ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( roman_Ψ ( italic_o ) start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_u , 0 ) ∈ italic_o + caligraphic_B start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_L / 2 ), there exists td0(L/2)𝑡subscriptsubscript𝑑0𝐿2t\in\mathcal{B}_{d_{0}}(L/2)italic_t ∈ caligraphic_B start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_L / 2 ) such that

(5.11) Iu=o+t+]L2dΨ1(𝚫);L2dΨ1(𝚫)[.I_{u}\ =\ o\ +\ t\ +\ \Big{]}-\frac{L}{2\sqrt{d}}\Psi^{-1}(\boldsymbol{\Delta}% )\,;\,\frac{L}{2\sqrt{d}}\Psi^{-1}(\boldsymbol{\Delta})\Big{[}.italic_I start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT = italic_o + italic_t + ] - divide start_ARG italic_L end_ARG start_ARG 2 square-root start_ARG italic_d end_ARG end_ARG roman_Ψ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( bold_Δ ) ; divide start_ARG italic_L end_ARG start_ARG 2 square-root start_ARG italic_d end_ARG end_ARG roman_Ψ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( bold_Δ ) [ .

Now we can use Assumption 4.1 and bound from below the Hellinger distance in the last integral, which gives

h2(p|R*;p~|R)DL2d𝑑u𝑑v=D(Ld)d+1.\displaystyle h^{2}(p^{*}_{|R}\,;\,\tilde{p}_{|R})\ \geq\ \iint D\cdot\frac{L^% {2}}{d}\,du\,dv\ =\ D\cdot\left(\frac{L}{\sqrt{d}}\right)^{d+1}.italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT | italic_R end_POSTSUBSCRIPT ; over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT | italic_R end_POSTSUBSCRIPT ) ≥ ∬ italic_D ⋅ divide start_ARG italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_d end_ARG italic_d italic_u italic_d italic_v = italic_D ⋅ ( divide start_ARG italic_L end_ARG start_ARG square-root start_ARG italic_d end_ARG end_ARG ) start_POSTSUPERSCRIPT italic_d + 1 end_POSTSUPERSCRIPT .

Finally, Πn(Γ<d0|X1,,Xn)=0subscriptΠ𝑛Γbrasubscript𝑑0subscript𝑋1subscript𝑋𝑛0\Pi_{n}(\Gamma<d_{0}\ |\ X_{1},\ldots,X_{n})=0roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( roman_Γ < italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT | italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) = 0 as soon as the contraction rate achieves εnD(Ld)d+12subscript𝜀𝑛𝐷superscript𝐿𝑑𝑑12\varepsilon_{n}\leq\sqrt{D}\left(\frac{L}{\sqrt{d}}\right)^{\frac{d+1}{2}}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≤ square-root start_ARG italic_D end_ARG ( divide start_ARG italic_L end_ARG start_ARG square-root start_ARG italic_d end_ARG end_ARG ) start_POSTSUPERSCRIPT divide start_ARG italic_d + 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT.


5.2.2 Case Γ=d0Γsubscript𝑑0\Gamma=d_{0}roman_Γ = italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT

Case Γ=𝒅0normal-Γsubscript𝒅0\boldsymbol{\Gamma=d_{0}}bold_Γ bold_= bold_italic_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT, with d=2normal-d2\mathbf{d=2}bold_d = bold_2 and d0=1subscriptnormal-d01\mathbf{d_{0}=1}bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT = bold_1.

To simplify the presentation, we first restrict ourselves to the case d=2𝑑2d=2italic_d = 2 and d0=1subscript𝑑01d_{0}=1italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 1. Assumption 4.1 specializes as follows: for all 0<lL0𝑙𝐿0<l\leq L0 < italic_l ≤ italic_L, there exists o[1+L,1L]𝑜1𝐿1𝐿o\in[-1+L,1-L]italic_o ∈ [ - 1 + italic_L , 1 - italic_L ] such that, for all t[l/2,l/2]𝑡𝑙2𝑙2t\in[-l/2,l/2]italic_t ∈ [ - italic_l / 2 , italic_l / 2 ] and all constant c>0𝑐0c>0italic_c > 0,

h2(p0|]o+tl2;o+t+l2[;c)=o+tl2o+t+l2|p0(λ)c|2𝑑λDl2.h^{2}\left({p_{0}}_{|]o+t-\frac{l}{2};o+t+\frac{l}{2}[}\,;\,c\right)\ =\ \int_% {o+t-\frac{l}{2}}^{o+t+\frac{l}{2}}\left|\sqrt{p_{0}(\lambda)}-\sqrt{c}\right|% ^{2}d\lambda\ \geq\ D\cdot l^{2}.italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUBSCRIPT | ] italic_o + italic_t - divide start_ARG italic_l end_ARG start_ARG 2 end_ARG ; italic_o + italic_t + divide start_ARG italic_l end_ARG start_ARG 2 end_ARG [ end_POSTSUBSCRIPT ; italic_c ) = ∫ start_POSTSUBSCRIPT italic_o + italic_t - divide start_ARG italic_l end_ARG start_ARG 2 end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_o + italic_t + divide start_ARG italic_l end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT | square-root start_ARG italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_λ ) end_ARG - square-root start_ARG italic_c end_ARG | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d italic_λ ≥ italic_D ⋅ italic_l start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT .

We use the fact that the non-constancy of p*superscript𝑝p^{*}italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT over 𝒮𝒮\mathcal{S}caligraphic_S induces a non-constancy over any one-dimensional space not parallel to 𝒮superscript𝒮perpendicular-to\mathcal{S}^{\perp}caligraphic_S start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT. It is then possible to set a lower bound on the Hellinger distance between p*superscript𝑝p^{*}italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT and any density that is constant on a space not parallel to 𝒮superscript𝒮perpendicular-to\mathcal{S}^{\perp}caligraphic_S start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT. For q𝒪2𝑞subscript𝒪2q\in\mathcal{O}_{2}italic_q ∈ caligraphic_O start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, we denote E:=q1(E𝐝𝟎)assign𝐸superscript𝑞1subscript𝐸subscript𝐝0E:=q^{-1}(E_{\mathbf{d_{0}}})italic_E := italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) and F:=Eassign𝐹superscript𝐸perpendicular-toF:=E^{\perp}italic_F := italic_E start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT. If q𝑞qitalic_q is not in 𝒬*superscript𝒬\mathcal{Q}^{*}caligraphic_Q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, then there exists 0<ϑπ/20italic-ϑ𝜋20<\vartheta\leq\pi/20 < italic_ϑ ≤ italic_π / 2 such that for all q¯𝒬*¯𝑞superscript𝒬\overline{q}\in\mathcal{Q}^{*}over¯ start_ARG italic_q end_ARG ∈ caligraphic_Q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, we have |q¯q|>ϑnorm¯𝑞𝑞italic-ϑ{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\overline{q}-q\right|\kern-1.% 07639pt\right|\kern-1.07639pt\right|}>\vartheta| | | over¯ start_ARG italic_q end_ARG - italic_q | | | > italic_ϑ. Then, the intersections of F𝐹Fitalic_F and 𝒮superscript𝒮perpendicular-to\mathcal{S}^{\perp}caligraphic_S start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT with the unit circle are separated by at least ϑitalic-ϑ\varthetaitalic_ϑ.

With this setting, any square of size L/2𝐿2L/\sqrt{2}italic_L / square-root start_ARG 2 end_ARG centered in Ψ(o)Ψ𝑜\Psi(o)roman_Ψ ( italic_o ) is included in 𝕌2subscript𝕌2\mathbb{U}_{2}blackboard_U start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. Let R𝑅Ritalic_R be a solid square of size L/2𝐿2L/\sqrt{2}italic_L / square-root start_ARG 2 end_ARG, parallel to the line F𝐹Fitalic_F and centered on Ψ(o)Ψ𝑜\Psi(o)roman_Ψ ( italic_o ). The line F+Ψ(o)𝐹Ψ𝑜F+\Psi(o)italic_F + roman_Ψ ( italic_o ) intersects the border of R𝑅Ritalic_R at two points (see Figure 1), and using arguments from geometry on the two-dimensional Euclidean space, we can show that the orthogonal projections of these points over 𝒮𝒮\mathcal{S}caligraphic_S are at a distance ζLϑ424ϑ2𝜁𝐿italic-ϑ424superscriptitalic-ϑ2\zeta\geq\frac{L\vartheta}{4\sqrt{2}}\sqrt{4-\vartheta^{2}}italic_ζ ≥ divide start_ARG italic_L italic_ϑ end_ARG start_ARG 4 square-root start_ARG 2 end_ARG end_ARG square-root start_ARG 4 - italic_ϑ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG from Ψ(o)Ψ𝑜\Psi(o)roman_Ψ ( italic_o ). Similarly, the line E+Ψ(o)𝐸Ψ𝑜E+\Psi(o)italic_E + roman_Ψ ( italic_o ) intersects the border of R𝑅Ritalic_R at two points whose orthogonal projections on 𝒮𝒮\mathcal{S}caligraphic_S are at a distance χL221ϑ2+ϑ4/4𝜒𝐿221superscriptitalic-ϑ2superscriptitalic-ϑ44\chi\leq\frac{L}{2\sqrt{2}}\sqrt{1-\vartheta^{2}+\vartheta^{4}/4}italic_χ ≤ divide start_ARG italic_L end_ARG start_ARG 2 square-root start_ARG 2 end_ARG end_ARG square-root start_ARG 1 - italic_ϑ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_ϑ start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT / 4 end_ARG from Ψ(o)Ψ𝑜\Psi(o)roman_Ψ ( italic_o ).

0011111111F𝐹Fitalic_F𝒮superscript𝒮perpendicular-to\mathcal{S}^{\perp}caligraphic_S start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPTϑitalic-ϑ\varthetaitalic_ϑ𝒮𝒮\mathcal{S}caligraphic_SE+Ψ(o)𝐸Ψ𝑜E+\Psi(o)italic_E + roman_Ψ ( italic_o )Ψ(o)Ψ𝑜\Psi(o)roman_Ψ ( italic_o )F+Ψ(o)𝐹Ψ𝑜F+\Psi(o)italic_F + roman_Ψ ( italic_o )R𝑅Ritalic_RL/2𝐿2L/\sqrt{2}italic_L / square-root start_ARG 2 end_ARGχ𝜒\chiitalic_χζ𝜁\zetaitalic_ζ
Figure 1: Illustration of the proof of Theorem 4.1 in the case Γ=d0Γsubscript𝑑0\Gamma=d_{0}roman_Γ = italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT with d=2𝑑2d=2italic_d = 2 and d0=1subscript𝑑01d_{0}=1italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 1.

Let (𝐮,𝐯)𝐮𝐯(\mathbf{u},\mathbf{v})( bold_u , bold_v ) be an orthogonal basis of 2superscript2\mathbb{R}^{2}blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT adapted to the decomposition EFdirect-sum𝐸𝐹E\oplus Fitalic_E ⊕ italic_F and such that pr𝒮(𝐮)=22LχΨ(1)subscriptpr𝒮𝐮22𝐿𝜒Ψ1\mathrm{pr}_{\mathcal{S}}(\mathbf{u})=\frac{2\sqrt{2}}{L}\chi\cdot\Psi(1)roman_pr start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT ( bold_u ) = divide start_ARG 2 square-root start_ARG 2 end_ARG end_ARG start_ARG italic_L end_ARG italic_χ ⋅ roman_Ψ ( 1 ) and pr𝒮(𝐯)=22LζΨ(1)subscriptpr𝒮𝐯22𝐿𝜁Ψ1\mathrm{pr}_{\mathcal{S}}(\mathbf{v})=\frac{2\sqrt{2}}{L}\zeta\cdot\Psi(1)roman_pr start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT ( bold_v ) = divide start_ARG 2 square-root start_ARG 2 end_ARG end_ARG start_ARG italic_L end_ARG italic_ζ ⋅ roman_Ψ ( 1 ). In this system of coordinates, Ψ(o)Ψ𝑜\Psi(o)roman_Ψ ( italic_o ) can be written (o1,o2)subscript𝑜1subscript𝑜2(o_{1},o_{2})( italic_o start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_o start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) and for all u,v2𝑢𝑣superscript2u,v\in\mathbb{R}^{2}italic_u , italic_v ∈ blackboard_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, we have

Ψ1(pr𝒮(u,v))=χ22Lu+ζ22Lv.superscriptΨ1subscriptpr𝒮𝑢𝑣𝜒22𝐿𝑢𝜁22𝐿𝑣\Psi^{-1}\left(\mathrm{pr}_{\mathcal{S}}(u,v)\right)\ =\ \chi\cdot\frac{2\sqrt% {2}}{L}u\ +\ \zeta\cdot\frac{2\sqrt{2}}{L}v.roman_Ψ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( roman_pr start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT ( italic_u , italic_v ) ) = italic_χ ⋅ divide start_ARG 2 square-root start_ARG 2 end_ARG end_ARG start_ARG italic_L end_ARG italic_u + italic_ζ ⋅ divide start_ARG 2 square-root start_ARG 2 end_ARG end_ARG start_ARG italic_L end_ARG italic_v .

We will also use the fact that p*(u,v)=p0(Ψ1(pr𝒮(u,v)))superscript𝑝𝑢𝑣subscript𝑝0superscriptΨ1subscriptpr𝒮𝑢𝑣p^{*}(u,v)=p_{0}\left(\Psi^{-1}\left(\mathrm{pr}_{\mathcal{S}}(u,v)\right)\right)italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_u , italic_v ) = italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( roman_Ψ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( roman_pr start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT ( italic_u , italic_v ) ) ). Then, for all density p~~𝑝\tilde{p}over~ start_ARG italic_p end_ARG constant in the direction F𝐹Fitalic_F, we have

h2(p|R*;p~|R)=\displaystyle h^{2}(p^{*}_{|R}\,;\,\tilde{p}_{|R})\ =\ italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT | italic_R end_POSTSUBSCRIPT ; over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT | italic_R end_POSTSUBSCRIPT ) = R|p*(u,v)p~(u,0)|2𝑑u𝑑vsubscriptdouble-integral𝑅superscriptsuperscript𝑝𝑢𝑣~𝑝𝑢02differential-d𝑢differential-d𝑣\displaystyle\iint_{R}|\sqrt{p^{*}(u,v)}-\sqrt{\tilde{p}(u,0)}|^{2}du\,dv∬ start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT | square-root start_ARG italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_u , italic_v ) end_ARG - square-root start_ARG over~ start_ARG italic_p end_ARG ( italic_u , 0 ) end_ARG | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d italic_u italic_d italic_v
=\displaystyle\ =\ = R|p0(Ψ1(pr𝒮(u,v)))p~(u,0)|2𝑑u𝑑vsubscriptdouble-integral𝑅superscriptsubscript𝑝0superscriptΨ1subscriptpr𝒮𝑢𝑣~𝑝𝑢02differential-d𝑢differential-d𝑣\displaystyle\iint_{R}|\sqrt{p_{0}\left(\Psi^{-1}\left(\mathrm{pr}_{\mathcal{S% }}(u,v)\right)\right)}-\sqrt{\tilde{p}(u,0)}|^{2}du\,dv∬ start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT | square-root start_ARG italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( roman_Ψ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( roman_pr start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT ( italic_u , italic_v ) ) ) end_ARG - square-root start_ARG over~ start_ARG italic_p end_ARG ( italic_u , 0 ) end_ARG | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d italic_u italic_d italic_v
=\displaystyle\ =\ = o1L/(22)o1+L/(22)o2L/(22)o2+L/(22)|p0(χ22Lu+ζ22Lv)1/2p~(u,0)|2𝑑v𝑑usuperscriptsubscriptsubscript𝑜1𝐿22subscript𝑜1𝐿22superscriptsubscriptsubscript𝑜2𝐿22subscript𝑜2𝐿22superscriptsubscript𝑝0superscript𝜒22𝐿𝑢𝜁22𝐿𝑣12~𝑝𝑢02differential-d𝑣differential-d𝑢\displaystyle\int_{o_{1}-L/(2\sqrt{2})}^{o_{1}+L/(2\sqrt{2})}\int_{o_{2}-L/(2% \sqrt{2})}^{o_{2}+L/(2\sqrt{2})}\left|p_{0}\Bigl{(}\chi\cdot\frac{2\sqrt{2}}{L% }u\ +\ \zeta\cdot\frac{2\sqrt{2}}{L}v\Bigr{)}^{1/2}-\sqrt{\tilde{p}(u,0)}% \right|^{2}dv\,du∫ start_POSTSUBSCRIPT italic_o start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_L / ( 2 square-root start_ARG 2 end_ARG ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_o start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_L / ( 2 square-root start_ARG 2 end_ARG ) end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT italic_o start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_L / ( 2 square-root start_ARG 2 end_ARG ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_o start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + italic_L / ( 2 square-root start_ARG 2 end_ARG ) end_POSTSUPERSCRIPT | italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_χ ⋅ divide start_ARG 2 square-root start_ARG 2 end_ARG end_ARG start_ARG italic_L end_ARG italic_u + italic_ζ ⋅ divide start_ARG 2 square-root start_ARG 2 end_ARG end_ARG start_ARG italic_L end_ARG italic_v ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT - square-root start_ARG over~ start_ARG italic_p end_ARG ( italic_u , 0 ) end_ARG | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d italic_v italic_d italic_u
=\displaystyle\ =\ = L/(22)L/(22)L/(22)L/(22)|p0(o+χ22Lu+ζ22Lv)1/2p~(u,0)|2𝑑v𝑑usuperscriptsubscript𝐿22𝐿22superscriptsubscript𝐿22𝐿22superscriptsubscript𝑝0superscript𝑜𝜒22𝐿𝑢𝜁22𝐿𝑣12~𝑝𝑢02differential-d𝑣differential-d𝑢\displaystyle\int_{-L/(2\sqrt{2})}^{L/(2\sqrt{2})}\int_{-L/(2\sqrt{2})}^{L/(2% \sqrt{2})}\left|p_{0}\Bigl{(}o+\chi\cdot\frac{2\sqrt{2}}{L}u\ +\ \zeta\cdot% \frac{2\sqrt{2}}{L}v\Bigr{)}^{1/2}-\sqrt{\tilde{p}(u,0)}\right|^{2}dv\,du∫ start_POSTSUBSCRIPT - italic_L / ( 2 square-root start_ARG 2 end_ARG ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L / ( 2 square-root start_ARG 2 end_ARG ) end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT - italic_L / ( 2 square-root start_ARG 2 end_ARG ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L / ( 2 square-root start_ARG 2 end_ARG ) end_POSTSUPERSCRIPT | italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_o + italic_χ ⋅ divide start_ARG 2 square-root start_ARG 2 end_ARG end_ARG start_ARG italic_L end_ARG italic_u + italic_ζ ⋅ divide start_ARG 2 square-root start_ARG 2 end_ARG end_ARG start_ARG italic_L end_ARG italic_v ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT - square-root start_ARG over~ start_ARG italic_p end_ARG ( italic_u , 0 ) end_ARG | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d italic_v italic_d italic_u
=\displaystyle\ =\ = L/(22)L/(22)L2ζ2(ζζ|p0(o+χ22Lu+w)1/2p~(u,0)|2𝑑w)𝑑usuperscriptsubscript𝐿22𝐿22𝐿2𝜁2superscriptsubscript𝜁𝜁superscriptsubscript𝑝0superscript𝑜𝜒22𝐿𝑢𝑤12~𝑝𝑢02differential-d𝑤differential-d𝑢\displaystyle\int_{-L/(2\sqrt{2})}^{L/(2\sqrt{2})}\frac{L}{2\zeta\cdot\sqrt{2}% }\left(\int_{-\zeta}^{\zeta}\left|p_{0}\Bigl{(}o+\chi\cdot\frac{2\sqrt{2}}{L}u% \ +\ w\Bigr{)}^{1/2}-\sqrt{\tilde{p}(u,0)}\right|^{2}dw\right)\,du∫ start_POSTSUBSCRIPT - italic_L / ( 2 square-root start_ARG 2 end_ARG ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L / ( 2 square-root start_ARG 2 end_ARG ) end_POSTSUPERSCRIPT divide start_ARG italic_L end_ARG start_ARG 2 italic_ζ ⋅ square-root start_ARG 2 end_ARG end_ARG ( ∫ start_POSTSUBSCRIPT - italic_ζ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_ζ end_POSTSUPERSCRIPT | italic_p start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_o + italic_χ ⋅ divide start_ARG 2 square-root start_ARG 2 end_ARG end_ARG start_ARG italic_L end_ARG italic_u + italic_w ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT - square-root start_ARG over~ start_ARG italic_p end_ARG ( italic_u , 0 ) end_ARG | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d italic_w ) italic_d italic_u
(Assumption 4.1)Assumption 4.1\displaystyle(\text{Assumption }\ref{ass:directdetect})\quad\geq\ ( Assumption ) ≥ L/(22)L/(22)L2ζ2D4ζ2𝑑u=DL2ζDL342ϑ4ϑ2.superscriptsubscript𝐿22𝐿22𝐿2𝜁2𝐷4superscript𝜁2differential-d𝑢𝐷superscript𝐿2𝜁𝐷superscript𝐿342italic-ϑ4superscriptitalic-ϑ2\displaystyle\int_{-L/(2\sqrt{2})}^{L/(2\sqrt{2})}\frac{L}{2\zeta\cdot\sqrt{2}% }\cdot D\cdot 4\zeta^{2}\,du\ =\ DL^{2}\cdot\zeta\ \geq\ D\cdot\frac{L^{3}}{4% \sqrt{2}}\vartheta\sqrt{4-\vartheta^{2}}.∫ start_POSTSUBSCRIPT - italic_L / ( 2 square-root start_ARG 2 end_ARG ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_L / ( 2 square-root start_ARG 2 end_ARG ) end_POSTSUPERSCRIPT divide start_ARG italic_L end_ARG start_ARG 2 italic_ζ ⋅ square-root start_ARG 2 end_ARG end_ARG ⋅ italic_D ⋅ 4 italic_ζ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_d italic_u = italic_D italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⋅ italic_ζ ≥ italic_D ⋅ divide start_ARG italic_L start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG start_ARG 4 square-root start_ARG 2 end_ARG end_ARG italic_ϑ square-root start_ARG 4 - italic_ϑ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG .

Finally, Πn(Γ=d0 and minq𝒬*|Θq|ϑ|X1,,Xn)=0subscriptΠ𝑛Γsubscript𝑑0 and subscript𝑞superscript𝒬normΘ𝑞conditionalitalic-ϑsubscript𝑋1subscript𝑋𝑛0\Pi_{n}\left(\Gamma=d_{0}\text{ and }\min_{q\in\mathcal{Q}^{*}}{\left|\kern-1.% 07639pt\left|\kern-1.07639pt\left|\Theta-q\right|\kern-1.07639pt\right|\kern-1% .07639pt\right|}\geq\vartheta\ |\ X_{1},\ldots,X_{n}\right)=0roman_Π start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( roman_Γ = italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and roman_min start_POSTSUBSCRIPT italic_q ∈ caligraphic_Q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | | | roman_Θ - italic_q | | | ≥ italic_ϑ | italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) = 0 as soon as εn<DL2ζsubscript𝜀𝑛𝐷superscript𝐿2𝜁\varepsilon_{n}<\sqrt{DL^{2}\cdot\zeta}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT < square-root start_ARG italic_D italic_L start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⋅ italic_ζ end_ARG.


Case Γ=𝒅0normal-Γsubscript𝒅0\boldsymbol{\Gamma=d_{0}}bold_Γ bold_= bold_italic_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT, with arbitrary d>d0normal-dsubscriptnormal-d0\mathbf{d>d_{0}}bold_d > bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT.

Given a non-optimal isometry q𝑞qitalic_q, we need to quantify how far from 𝒮superscript𝒮perpendicular-to\mathcal{S}^{\perp}caligraphic_S start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT the inverse image of the subspace E1𝐝𝟎subscript𝐸1subscript𝐝0E_{1-\mathbf{d_{0}}}italic_E start_POSTSUBSCRIPT 1 - bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT via q𝑞qitalic_q is. This result, elementary when d=2𝑑2d=2italic_d = 2, is stated for arbitrary d>d0𝑑subscript𝑑0d>d_{0}italic_d > italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT in the following lemma. A proof is given in Appendix 5.3.

Lemma 5.4.

Let q𝒪d𝑞subscript𝒪𝑑q\in\mathcal{O}_{d}italic_q ∈ caligraphic_O start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT. If for all q¯𝒬*normal-¯𝑞superscript𝒬\overline{q}\in\mathcal{Q}^{*}over¯ start_ARG italic_q end_ARG ∈ caligraphic_Q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, we have |q¯q|>ϑnormnormal-¯𝑞𝑞italic-ϑ{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\overline{q}-q\right|\kern-1.% 07639pt\right|\kern-1.07639pt\right|}>\vartheta| | | over¯ start_ARG italic_q end_ARG - italic_q | | | > italic_ϑ, 0<ϑπ/20italic-ϑ𝜋20<\vartheta\leq\pi/20 < italic_ϑ ≤ italic_π / 2, then there exists rE1𝐝𝟎𝑟subscript𝐸1subscript𝐝0r\in E_{1-\mathbf{d_{0}}}italic_r ∈ italic_E start_POSTSUBSCRIPT 1 - bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, r=1norm𝑟1\left\|{r}\right\|=1∥ italic_r ∥ = 1, such that the distance between q1(r)superscript𝑞1𝑟q^{-1}(r)italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_r ) and 𝒮𝕊dsuperscript𝒮perpendicular-tosubscript𝕊𝑑\mathcal{S}^{\perp}\cap\mathbb{S}_{d}caligraphic_S start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT ∩ blackboard_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT is at least ϑ/2d=:ϑ¯\vartheta/2d=:\overline{\vartheta}italic_ϑ / 2 italic_d = : over¯ start_ARG italic_ϑ end_ARG, where 𝕊d:={xd:x=1}assignsubscript𝕊𝑑conditional-set𝑥superscript𝑑norm𝑥1\mathbb{S}_{d}:=\{x\in\mathbb{R}^{d}:\left\|{x}\right\|=1\}blackboard_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT := { italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT : ∥ italic_x ∥ = 1 }.

Now we work under the assumptions of Lemma 5.4. Let G𝐺Gitalic_G be the linear span of q1(r)superscript𝑞1𝑟q^{-1}(r)italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_r ) and its orthogonal projection 𝚲𝚲\boldsymbol{\Lambda}bold_Λ on 𝒮superscript𝒮perpendicular-to\mathcal{S}^{\perp}caligraphic_S start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT (or any vector of 𝒮superscript𝒮perpendicular-to\mathcal{S}^{\perp}caligraphic_S start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT if the orthogonal projection is zero). Then G𝐺Gitalic_G has a non-zero intersection with 𝒮𝒮\mathcal{S}caligraphic_S. Let ΔΔ\Deltaroman_Δ be this one-dimensional intersection.

Let R𝑅Ritalic_R be a solid hypercube centered on Ψ(o)Ψ𝑜\Psi(o)roman_Ψ ( italic_o ), with size L¯:=L/dassign¯𝐿𝐿𝑑\overline{L}:=L/\sqrt{d}over¯ start_ARG italic_L end_ARG := italic_L / square-root start_ARG italic_d end_ARG, and aligned with an orthogonal basis (𝚫,u1,,ud01,𝚲,v1,,vdd01)𝚫subscript𝑢1subscript𝑢subscript𝑑01𝚲subscript𝑣1subscript𝑣𝑑subscript𝑑01(\boldsymbol{\Delta},u_{1},\ldots,u_{d_{0}-1},\boldsymbol{\Lambda},v_{1},% \ldots,v_{d-d_{0}-1})( bold_Δ , italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT , bold_Λ , italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_v start_POSTSUBSCRIPT italic_d - italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT ) adapted to the direct sum d=𝒮𝒮superscript𝑑direct-sum𝒮superscript𝒮perpendicular-to\mathbb{R}^{d}=\mathcal{S}\oplus\mathcal{S}^{\perp}blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT = caligraphic_S ⊕ caligraphic_S start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT. With the restrictions on o𝑜oitalic_o, R𝑅Ritalic_R is included in 𝕌dsubscript𝕌𝑑\mathbb{U}_{d}blackboard_U start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT.

We will bound from below the quantity h2(p|R*;p~|R)h^{2}(p^{*}_{|R};\tilde{p}_{|R})italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT | italic_R end_POSTSUBSCRIPT ; over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT | italic_R end_POSTSUBSCRIPT ) by using the preceding two-dimensional case on slices of R𝑅Ritalic_R. For t{0}×i=1d01[oL¯/2ui;o+L¯/2ui]×{0}×j=1dd01[oL¯/2vj;o+L¯/2vj]𝑡0superscriptsubscriptproduct𝑖1subscript𝑑01𝑜¯𝐿2subscript𝑢𝑖𝑜¯𝐿2subscript𝑢𝑖0superscriptsubscriptproduct𝑗1𝑑subscript𝑑01𝑜¯𝐿2subscript𝑣𝑗𝑜¯𝐿2subscript𝑣𝑗t\in\{0\}\times\prod_{i=1}^{d_{0}-1}[o-\overline{L}/2\cdot u_{i};o+\overline{L% }/2\cdot u_{i}]\times\{0\}\times\prod_{j=1}^{d-d_{0}-1}[o-\overline{L}/2\cdot v% _{j};o+\overline{L}/2\cdot v_{j}]italic_t ∈ { 0 } × ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT [ italic_o - over¯ start_ARG italic_L end_ARG / 2 ⋅ italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ; italic_o + over¯ start_ARG italic_L end_ARG / 2 ⋅ italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] × { 0 } × ∏ start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d - italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT [ italic_o - over¯ start_ARG italic_L end_ARG / 2 ⋅ italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ; italic_o + over¯ start_ARG italic_L end_ARG / 2 ⋅ italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ], the plane G+t𝐺𝑡G+titalic_G + italic_t contains one element parallel to 𝒮𝒮\mathcal{S}caligraphic_S and one element parallel to 𝒮superscript𝒮perpendicular-to\mathcal{S}^{\perp}caligraphic_S start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT, so the situation is analogue to the previous case, replacing ζ𝜁\zetaitalic_ζ by ζ¯:=L¯2ϑ¯4ϑ¯2assign¯𝜁¯𝐿2¯italic-ϑ4superscript¯italic-ϑ2\overline{\zeta}:=\frac{\overline{L}}{2}\overline{\vartheta}\sqrt{4-\overline{% \vartheta}^{2}}over¯ start_ARG italic_ζ end_ARG := divide start_ARG over¯ start_ARG italic_L end_ARG end_ARG start_ARG 2 end_ARG over¯ start_ARG italic_ϑ end_ARG square-root start_ARG 4 - over¯ start_ARG italic_ϑ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG (Figure 2). With all this in mind, for all density p~~𝑝\tilde{p}over~ start_ARG italic_p end_ARG constant in the direction q1(r)superscript𝑞1𝑟q^{-1}(r)italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_r ), one has

h2(p|R*;p~|R)=th2(p|R(G+t)*;p~|R(G+t))𝑑tt2DL¯2ζ¯𝑑t= 2DL¯dζ¯,h^{2}(p^{*}_{|R}\,;\,\tilde{p}_{|R})\ =\ \int_{t}h^{2}(p^{*}_{|R\cap(G+t)};% \tilde{p}_{|R\cap(G+t)})dt\ \geq\ \int_{t}2D\overline{L}^{2}\overline{\zeta}dt% \ =\ 2D\overline{L}^{d}\overline{\zeta},italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT | italic_R end_POSTSUBSCRIPT ; over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT | italic_R end_POSTSUBSCRIPT ) = ∫ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_p start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT | italic_R ∩ ( italic_G + italic_t ) end_POSTSUBSCRIPT ; over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT | italic_R ∩ ( italic_G + italic_t ) end_POSTSUBSCRIPT ) italic_d italic_t ≥ ∫ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT 2 italic_D over¯ start_ARG italic_L end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT over¯ start_ARG italic_ζ end_ARG italic_d italic_t = 2 italic_D over¯ start_ARG italic_L end_ARG start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT over¯ start_ARG italic_ζ end_ARG ,

which is sufficient to conclude.

The case Γ>d0Γsubscript𝑑0\Gamma>d_{0}roman_Γ > italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT can be proven in a similar way.

G+t𝐺𝑡G+titalic_G + italic_t1111𝒮+tsuperscript𝒮perpendicular-to𝑡\mathcal{S}^{\perp}+tcaligraphic_S start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT + italic_tΔ+tΔ𝑡\Delta+troman_Δ + italic_tE+t𝐸𝑡E+titalic_E + italic_tt𝑡titalic_tF+t𝐹𝑡F+titalic_F + italic_tR𝑅Ritalic_RL/d𝐿𝑑L/\sqrt{d}italic_L / square-root start_ARG italic_d end_ARGχ¯¯𝜒\overline{\chi}over¯ start_ARG italic_χ end_ARGζ¯¯𝜁\overline{\zeta}over¯ start_ARG italic_ζ end_ARG
Figure 2: Illustration of the proof of Theorem 4.1 in the case Γ=d0Γsubscript𝑑0\Gamma=d_{0}roman_Γ = italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT for arbitrary d>d0𝑑subscript𝑑0d>d_{0}italic_d > italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT.

5.3 Lemmas

The next three lemmas are related to Lemmas 4.3, 4.5, and 4.6 in [VV09], hence their proofs can be omitted.

Lemma 5.5.

Let n*𝑛superscriptn\in\mathds{N}^{*}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT and β>0𝛽0\beta>0italic_β > 0. If f0β(𝕌d0)subscript𝑓0superscript𝛽subscript𝕌subscript𝑑0f_{0}\in\mathfrak{C}^{\beta}(\mathbb{U}_{d_{0}})italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ fraktur_C start_POSTSUPERSCRIPT italic_β end_POSTSUPERSCRIPT ( blackboard_U start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ), then, for all a>0𝑎0a>0italic_a > 0 and qn𝒪dnsubscript𝑞𝑛subscript𝒪subscript𝑑𝑛q_{n}\in\mathcal{O}_{d_{n}}italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT, there exist constants Cf0subscript𝐶subscript𝑓0C_{f_{0}}italic_C start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT and Df0subscript𝐷subscript𝑓0D_{f_{0}}italic_D start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT that depend only on f0subscript𝑓0f_{0}italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT such that

inf{h¯a,d0,qn2:h¯a,d0,qn,h¯fn,qnCf0aβ}Df0ad0.\inf\left\{\left\|{\overline{h}}\right\|_{\mathbb{H}_{a,d_{0},q_{n}}}^{2}\ :\ % \overline{h}\in\mathbb{H}_{a,d_{0},q_{n}},\ \left\|{\overline{h}-f_{n,q_{n}}}% \right\|_{\infty}\leq C_{f_{0}}\cdot a^{-\beta}\right\}\ \leq\ D_{f_{0}}\cdot a% ^{d_{0}}.roman_inf { ∥ over¯ start_ARG italic_h end_ARG ∥ start_POSTSUBSCRIPT blackboard_H start_POSTSUBSCRIPT italic_a , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT : over¯ start_ARG italic_h end_ARG ∈ blackboard_H start_POSTSUBSCRIPT italic_a , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT , ∥ over¯ start_ARG italic_h end_ARG - italic_f start_POSTSUBSCRIPT italic_n , italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_C start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⋅ italic_a start_POSTSUPERSCRIPT - italic_β end_POSTSUPERSCRIPT } ≤ italic_D start_POSTSUBSCRIPT italic_f start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⋅ italic_a start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT .
Lemma 5.6.

Let n*𝑛superscriptn\in\mathds{N}^{*}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, a>0𝑎0a>0italic_a > 0, bdmax𝑏subscript𝑑normal-maxb\leq d_{\mathrm{max}}italic_b ≤ italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT and qn𝒪dnsubscript𝑞𝑛subscript𝒪subscript𝑑𝑛q_{n}\in\mathcal{O}_{d_{n}}italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Then, there exists a constant Lbsubscript𝐿𝑏L_{b}italic_L start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT that depends only on b𝑏bitalic_b such that, for ε<1/2𝜀12\varepsilon<1/2italic_ε < 1 / 2,

logN(ε,1a,b,qn,)Lbab(log1ε)b+1.\log N(\varepsilon,\mathbb{H}_{1}^{a,b,q_{n}},\left\|{\cdot}\right\|_{\infty})% \ \leq\ L_{b}\cdot a^{b}\left(\log\frac{1}{\varepsilon}\right)^{b+1}.roman_log italic_N ( italic_ε , blackboard_H start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a , italic_b , italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ) ≤ italic_L start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT ⋅ italic_a start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT ( roman_log divide start_ARG 1 end_ARG start_ARG italic_ε end_ARG ) start_POSTSUPERSCRIPT italic_b + 1 end_POSTSUPERSCRIPT .
Lemma 5.7.

Let n*𝑛superscriptn\in\mathds{N}^{*}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, bdmax𝑏subscript𝑑normal-maxb\leq d_{\mathrm{max}}italic_b ≤ italic_d start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT and qn𝒪dnsubscript𝑞𝑛subscript𝒪subscript𝑑𝑛q_{n}\in\mathcal{O}_{d_{n}}italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ∈ caligraphic_O start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Then, for a0>0subscript𝑎00a_{0}>0italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT > 0, there exist constants Ca0,bsubscript𝐶subscript𝑎0𝑏C_{a_{0},b}italic_C start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_b end_POSTSUBSCRIPT and ε0a0,bsuperscriptsubscript𝜀0subscript𝑎0𝑏\varepsilon_{0}^{a_{0},b}italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_b end_POSTSUPERSCRIPT that depends only on a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and b𝑏bitalic_b such that, for all aa0𝑎subscript𝑎0a\geq a_{0}italic_a ≥ italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and ε<ε0a0,b𝜀superscriptsubscript𝜀0subscript𝑎0𝑏\varepsilon<\varepsilon_{0}^{a_{0},b}italic_ε < italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_b end_POSTSUPERSCRIPT,

log(Wa,b,qnε)Ca0,bab(logaε)b+1.subscriptnormsuperscript𝑊𝑎𝑏subscript𝑞𝑛𝜀subscript𝐶subscript𝑎0𝑏superscript𝑎𝑏superscript𝑎𝜀𝑏1-\log\operatorname{\mathds{P}}\left(\left\|{W^{a,b,q_{n}}}\right\|_{\infty}% \leq\varepsilon\right)\ \leq\ C_{a_{0},b}\cdot a^{b}\left(\log\frac{a}{% \varepsilon}\right)^{b+1}.- roman_log blackboard_P ( ∥ italic_W start_POSTSUPERSCRIPT italic_a , italic_b , italic_q start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT ∞ end_POSTSUBSCRIPT ≤ italic_ε ) ≤ italic_C start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_b end_POSTSUBSCRIPT ⋅ italic_a start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT ( roman_log divide start_ARG italic_a end_ARG start_ARG italic_ε end_ARG ) start_POSTSUPERSCRIPT italic_b + 1 end_POSTSUPERSCRIPT .
Lemma 5.8.

Let n*𝑛superscriptn\in\mathds{N}^{*}italic_n ∈ blackboard_N start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT and let (e1,,en)subscript𝑒1normal-…subscript𝑒𝑛(e_{1},\ldots,e_{n})( italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_e start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) be an orthonormal basis of nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT. For dn𝑑𝑛d\leq nitalic_d ≤ italic_n, let (g1,,gd)n×dsubscript𝑔1normal-…subscript𝑔𝑑superscript𝑛𝑑(g_{1},\ldots,g_{d})\in\mathbb{R}^{n\times d}( italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_g start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_d end_POSTSUPERSCRIPT be a collection of orthonormal vectors in nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT such that

eigiε,for all i1,d.formulae-sequencenormsubscript𝑒𝑖subscript𝑔𝑖𝜀for all 𝑖1𝑑\left\|{e_{i}-g_{i}}\right\|\leq\varepsilon,\quad\text{for all }i\in\llbracket 1% ,d\rrbracket.∥ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ ≤ italic_ε , for all italic_i ∈ ⟦ 1 , italic_d ⟧ .

Then we can complete this collection to obtain an orthonormal basis (g1,,gn)subscript𝑔1normal-…subscript𝑔𝑛(g_{1},\ldots,g_{n})( italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_g start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) of nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT satisfying

ejgj2dε,for all j1,n.formulae-sequencenormsubscript𝑒𝑗subscript𝑔𝑗2𝑑𝜀for all 𝑗1𝑛\left\|{e_{j}-g_{j}}\right\|\leq 2\sqrt{d}\cdot\varepsilon,\quad\text{for all % }j\in\llbracket 1,n\rrbracket.∥ italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ ≤ 2 square-root start_ARG italic_d end_ARG ⋅ italic_ε , for all italic_j ∈ ⟦ 1 , italic_n ⟧ .
Proof of Lemma 5.8.

We denote by F𝐹Fitalic_F the subspace Span(g1,,gd)Spansubscript𝑔1subscript𝑔𝑑\operatorname{Span}(g_{1},\ldots,g_{d})roman_Span ( italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_g start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ). Let us determine the distance between a vector ejsubscript𝑒𝑗e_{j}italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT and its orthogonal projection on Fsuperscript𝐹perpendicular-toF^{\perp}italic_F start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT, for jd+1,n𝑗𝑑1𝑛j\in\llbracket d+1,n\rrbracketitalic_j ∈ ⟦ italic_d + 1 , italic_n ⟧. By Cauchy-Schwartz inequality, we have

|ej,gi|ejgieiε,subscript𝑒𝑗subscript𝑔𝑖normsubscript𝑒𝑗normsubscript𝑔𝑖subscript𝑒𝑖𝜀\left|\langle e_{j},g_{i}\rangle\right|\ \leq\ \left\|{e_{j}}\right\|\left\|{g% _{i}-e_{i}}\right\|\leq\ \varepsilon,| ⟨ italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⟩ | ≤ ∥ italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ ∥ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ ≤ italic_ε ,

for all i1,d𝑖1𝑑i\in\llbracket 1,d\rrbracketitalic_i ∈ ⟦ 1 , italic_d ⟧. Then

(5.12) ejPF(ej)=PF(ej)=(i=1dej,gi2gi2)1/2dε.normsubscript𝑒𝑗subscript𝑃superscript𝐹perpendicular-tosubscript𝑒𝑗normsubscript𝑃𝐹subscript𝑒𝑗superscriptsuperscriptsubscript𝑖1𝑑superscriptsubscript𝑒𝑗subscript𝑔𝑖2superscriptnormsubscript𝑔𝑖212𝑑𝜀\left\|{e_{j}-P_{F^{\perp}}(e_{j})}\right\|\ =\ \left\|{P_{F}(e_{j})}\right\|=% \left(\sum_{i=1}^{d}\langle e_{j},g_{i}\rangle^{2}\left\|{g_{i}}\right\|^{2}% \right)^{1/2}\ \leq\ \sqrt{d}\cdot\varepsilon.∥ italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_P start_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ∥ = ∥ italic_P start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) ∥ = ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT ⟨ italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⟩ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∥ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ≤ square-root start_ARG italic_d end_ARG ⋅ italic_ε .

Thus the problem reduces to find a family of nd𝑛𝑑n-ditalic_n - italic_d orthonormal vectors in Fsuperscript𝐹perpendicular-toF^{\perp}italic_F start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT with elements as close as possible to the vectors PF(ej)subscript𝑃superscript𝐹perpendicular-tosubscript𝑒𝑗P_{F^{\perp}}(e_{j})italic_P start_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ), for jd+1,n𝑗𝑑1𝑛j\in\llbracket d+1,n\rrbracketitalic_j ∈ ⟦ italic_d + 1 , italic_n ⟧. This is related to what is known as procruste problem. We denote by A𝐴Aitalic_A the matrix A:=(PF(ed+1)||PF(en))n×ndassign𝐴subscript𝑃superscript𝐹perpendicular-tosubscript𝑒𝑑1subscript𝑃superscript𝐹perpendicular-tosubscript𝑒𝑛superscript𝑛𝑛𝑑A:=\left(P_{F^{\perp}}(e_{d+1})|\cdots|P_{F^{\perp}}(e_{n})\right)\in\mathbb{R% }^{n\times n-d}italic_A := ( italic_P start_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_d + 1 end_POSTSUBSCRIPT ) | ⋯ | italic_P start_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_n - italic_d end_POSTSUPERSCRIPT and we use Theorem 4.1 stated in [Hig89]:

Theorem 5.9 ([Hig89]).

If A𝐴Aitalic_A admits a polar decomposition A=UH𝐴𝑈𝐻A=UHitalic_A = italic_U italic_H, and if Qn×nd𝑄superscript𝑛𝑛𝑑Q\in\mathbb{R}^{n\times n-d}italic_Q ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_n - italic_d end_POSTSUPERSCRIPT has orthonormal columns, then

|AU|2|AQ|2.subscriptnorm𝐴𝑈2subscriptnorm𝐴𝑄2{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|A-U\right|\kern-1.07639pt% \right|\kern-1.07639pt\right|}_{2}\ \leq\ {\left|\kern-1.07639pt\left|\kern-1.% 07639pt\left|A-Q\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{2}.| | | italic_A - italic_U | | | start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ | | | italic_A - italic_Q | | | start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT .

Let us show that the columns of U𝑈Uitalic_U can be chosen in Fsuperscript𝐹perpendicular-toF^{\perp}italic_F start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT. A singular value decomposition of A𝐴Aitalic_A can be written, A=WDVt𝐴𝑊𝐷superscript𝑉tA=WD\prescript{\mathrm{t}}{}{V}italic_A = italic_W italic_D start_FLOATSUPERSCRIPT roman_t end_FLOATSUPERSCRIPT italic_V, where W𝑊Witalic_W has orthonormal columns, V𝒪nd𝑉subscript𝒪𝑛𝑑V\in\mathcal{O}_{n-d}italic_V ∈ caligraphic_O start_POSTSUBSCRIPT italic_n - italic_d end_POSTSUBSCRIPT, and Dnd×nd𝐷superscript𝑛𝑑𝑛𝑑D\in\mathbb{R}^{n-d\times n-d}italic_D ∈ blackboard_R start_POSTSUPERSCRIPT italic_n - italic_d × italic_n - italic_d end_POSTSUPERSCRIPT is diagonal. Therefore, A=(WVt)VDVt𝐴𝑊superscript𝑉t𝑉𝐷superscript𝑉tA=(W\prescript{\mathrm{t}}{}{V})VD\prescript{\mathrm{t}}{}{V}italic_A = ( italic_W start_FLOATSUPERSCRIPT roman_t end_FLOATSUPERSCRIPT italic_V ) italic_V italic_D start_FLOATSUPERSCRIPT roman_t end_FLOATSUPERSCRIPT italic_V. Taking U:=WVtassign𝑈𝑊superscript𝑉tU:=W\prescript{\mathrm{t}}{}{V}italic_U := italic_W start_FLOATSUPERSCRIPT roman_t end_FLOATSUPERSCRIPT italic_V and H:=VDVtassign𝐻𝑉𝐷superscript𝑉tH:=VD\prescript{\mathrm{t}}{}{V}italic_H := italic_V italic_D start_FLOATSUPERSCRIPT roman_t end_FLOATSUPERSCRIPT italic_V, we have the polar decomposition A=UH𝐴𝑈𝐻A=UHitalic_A = italic_U italic_H where U𝑈Uitalic_U has orthonormal columns. Because Im(A)=Span(PF(ej),jd+1,n)FIm𝐴Spansubscript𝑃superscript𝐹perpendicular-tosubscript𝑒𝑗𝑗𝑑1𝑛superscript𝐹perpendicular-to\operatorname{Im}(A)=\operatorname{Span}\left(P_{F^{\perp}}(e_{j}),j\in% \llbracket d+1,n\rrbracket\right)\subset F^{\perp}roman_Im ( italic_A ) = roman_Span ( italic_P start_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) , italic_j ∈ ⟦ italic_d + 1 , italic_n ⟧ ) ⊂ italic_F start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT, it is possible to choose W𝑊Witalic_W with columns in Fsuperscript𝐹perpendicular-toF^{\perp}italic_F start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT, whence the desired result. Now, taking Q=(ed+1||en)𝑄subscript𝑒𝑑1subscript𝑒𝑛Q=\left(e_{d+1}|\cdots|e_{n}\right)italic_Q = ( italic_e start_POSTSUBSCRIPT italic_d + 1 end_POSTSUBSCRIPT | ⋯ | italic_e start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ), we have, for all unit vector xnd𝑥superscript𝑛𝑑x\in\mathbb{R}^{n-d}italic_x ∈ blackboard_R start_POSTSUPERSCRIPT italic_n - italic_d end_POSTSUPERSCRIPT,

PF(Qx)=Ax.subscript𝑃superscript𝐹perpendicular-to𝑄𝑥𝐴𝑥P_{F^{\perp}}(Qx)\ =\ Ax.italic_P start_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_Q italic_x ) = italic_A italic_x .

Moreover, using that |Qx,gi|Qxgieiε𝑄𝑥subscript𝑔𝑖norm𝑄𝑥normsubscript𝑔𝑖subscript𝑒𝑖𝜀\left|\langle Qx,g_{i}\rangle\right|\leq\left\|{Qx}\right\|\left\|{g_{i}-e_{i}% }\right\|\leq\varepsilon| ⟨ italic_Q italic_x , italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⟩ | ≤ ∥ italic_Q italic_x ∥ ∥ italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ ≤ italic_ε for all i1,d𝑖1𝑑i\in\llbracket 1,d\rrbracketitalic_i ∈ ⟦ 1 , italic_d ⟧, we finally have

QxAx2=QxPF(Qx)2dε2,superscriptnorm𝑄𝑥𝐴𝑥2superscriptnorm𝑄𝑥subscript𝑃superscript𝐹perpendicular-to𝑄𝑥2𝑑superscript𝜀2\left\|{Qx-Ax}\right\|^{2}\ =\ \left\|{Qx-P_{F^{\perp}}(Qx)}\right\|^{2}\ \leq% \ d\varepsilon^{2},∥ italic_Q italic_x - italic_A italic_x ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ∥ italic_Q italic_x - italic_P start_POSTSUBSCRIPT italic_F start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_Q italic_x ) ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ italic_d italic_ε start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ,

thus |AQ|dεnorm𝐴𝑄𝑑𝜀{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|A-Q\right|\kern-1.07639pt% \right|\kern-1.07639pt\right|}\leq\sqrt{d}\cdot\varepsilon| | | italic_A - italic_Q | | | ≤ square-root start_ARG italic_d end_ARG ⋅ italic_ε. According to Theorem 5.9, the last inequality is also true if we replace Q𝑄Qitalic_Q by U𝑈Uitalic_U. Because the columns ud+1,,unsubscript𝑢𝑑1subscript𝑢𝑛u_{d+1},\ldots,u_{n}italic_u start_POSTSUBSCRIPT italic_d + 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT of U𝑈Uitalic_U are in Fsuperscript𝐹perpendicular-toF^{\perp}italic_F start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT, the family (g1,,gd,ud+1,,un)subscript𝑔1subscript𝑔𝑑subscript𝑢𝑑1subscript𝑢𝑛(g_{1},\ldots,g_{d},u_{d+1},\ldots,u_{n})( italic_g start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_g start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT , italic_u start_POSTSUBSCRIPT italic_d + 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) is orthonormal and moreover satisfies (5.12) by the triangle inequality. ∎

Notation.

Let d,n*𝑑𝑛superscriptd,n\in\mathds{N}^{*}italic_d , italic_n ∈ blackboard_N start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT with d<n𝑑𝑛d<nitalic_d < italic_n and let onn(d)superscriptsubscripton𝑛𝑑\mathcal{B}_{\mathrm{on}}^{n}(d)caligraphic_B start_POSTSUBSCRIPT roman_on end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_d ) be the set of all d𝑑ditalic_d-tuples of orthonormal vectors in nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT.

Lemma 5.10.

Let d,n*𝑑𝑛superscriptd,n\in\mathds{N}^{*}italic_d , italic_n ∈ blackboard_N start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT with dn𝑑𝑛d\leq nitalic_d ≤ italic_n and 0<ε10𝜀10<\varepsilon\leq 10 < italic_ε ≤ 1. Then there exists a set 𝒢onn(d)𝒢superscriptsubscriptnormal-on𝑛𝑑\mathcal{G}\subset\mathcal{B}_{\mathrm{on}}^{n}(d)caligraphic_G ⊂ caligraphic_B start_POSTSUBSCRIPT roman_on end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_d ) such that for all eonn(d)𝑒superscriptsubscriptnormal-on𝑛𝑑e\in\mathcal{B}_{\mathrm{on}}^{n}(d)italic_e ∈ caligraphic_B start_POSTSUBSCRIPT roman_on end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_d ), there exists g𝒢𝑔𝒢g\in\mathcal{G}italic_g ∈ caligraphic_G such that

maxi1,deigi2ε𝑎𝑛𝑑|𝒢|(πn2)d/2(8ε)d(n1).formulae-sequencesubscript𝑖1𝑑subscriptnormsubscript𝑒𝑖subscript𝑔𝑖2𝜀𝑎𝑛𝑑𝒢superscript𝜋𝑛2𝑑2superscript8𝜀𝑑𝑛1\max_{i\in\llbracket 1,d\rrbracket}\left\|{e_{i}-g_{i}}\right\|_{2}\ \leq\ % \varepsilon\qquad\text{and}\qquad\left|\mathcal{G}\right|\ \leq\ \left(\frac{% \pi n}{2}\right)^{d/2}\left(\frac{8}{\varepsilon}\right)^{d(n-1)}.roman_max start_POSTSUBSCRIPT italic_i ∈ ⟦ 1 , italic_d ⟧ end_POSTSUBSCRIPT ∥ italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_g start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ italic_ε and | caligraphic_G | ≤ ( divide start_ARG italic_π italic_n end_ARG start_ARG 2 end_ARG ) start_POSTSUPERSCRIPT italic_d / 2 end_POSTSUPERSCRIPT ( divide start_ARG 8 end_ARG start_ARG italic_ε end_ARG ) start_POSTSUPERSCRIPT italic_d ( italic_n - 1 ) end_POSTSUPERSCRIPT .
Proof of Lemma 5.10.

Let us construct 𝒢𝒢\mathcal{G}caligraphic_G. Let 𝒯𝒯\mathcal{T}caligraphic_T be a set of balls in nsuperscript𝑛\mathbb{R}^{n}blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT with radius ε/2𝜀2\varepsilon/2italic_ε / 2 which cover 𝕊n1superscript𝕊𝑛1\mathbb{S}^{n-1}blackboard_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT and such that |𝒯|=N(𝕊n1,ε/2,2)\left|\mathcal{T}\right|=N(\mathbb{S}^{n-1},\varepsilon/2,\left\|{\cdot}\right% \|_{2})| caligraphic_T | = italic_N ( blackboard_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT , italic_ε / 2 , ∥ ⋅ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ). We denote by 𝒯¯dsuperscript¯𝒯𝑑\overline{\mathcal{T}}^{d}over¯ start_ARG caligraphic_T end_ARG start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT the set of d𝑑ditalic_d-tuples of balls (B1,,Bd)𝒯dsubscript𝐵1subscript𝐵𝑑superscript𝒯𝑑(B_{1},\ldots,B_{d})\in\mathcal{T}^{d}( italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_B start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) ∈ caligraphic_T start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT such that B1××Bdsubscript𝐵1subscript𝐵𝑑B_{1}\times\cdots\times B_{d}italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT × ⋯ × italic_B start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT contains at least one element of onn(d)superscriptsubscripton𝑛𝑑\mathcal{B}_{\mathrm{on}}^{n}(d)caligraphic_B start_POSTSUBSCRIPT roman_on end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_d ). Then, for each eonn(d)𝑒superscriptsubscripton𝑛𝑑e\in\mathcal{B}_{\mathrm{on}}^{n}(d)italic_e ∈ caligraphic_B start_POSTSUBSCRIPT roman_on end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_d ), there exists (B1,,Bd)𝒯¯dsubscript𝐵1subscript𝐵𝑑superscript¯𝒯𝑑(B_{1},\ldots,B_{d})\in\overline{\mathcal{T}}^{d}( italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_B start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) ∈ over¯ start_ARG caligraphic_T end_ARG start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT such that eB1××Bd𝑒subscript𝐵1subscript𝐵𝑑e\in B_{1}\times\cdots\times B_{d}italic_e ∈ italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT × ⋯ × italic_B start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT. For each B𝒯¯d𝐵superscript¯𝒯𝑑B\in\overline{\mathcal{T}}^{d}italic_B ∈ over¯ start_ARG caligraphic_T end_ARG start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, choose one particular d𝑑ditalic_d-tuple gonn(d)𝑔superscriptsubscripton𝑛𝑑g\in\mathcal{B}_{\mathrm{on}}^{n}(d)italic_g ∈ caligraphic_B start_POSTSUBSCRIPT roman_on end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_d ) such that gB𝑔𝐵g\in Bitalic_g ∈ italic_B and let 𝒢𝒢\mathcal{G}caligraphic_G be the set of these d𝑑ditalic_d-tuples when B𝐵Bitalic_B runs through 𝒯¯dsuperscript¯𝒯𝑑\overline{\mathcal{T}}^{d}over¯ start_ARG caligraphic_T end_ARG start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT. It is clear that 𝒢𝒢\mathcal{G}caligraphic_G satisfy the first condition of the lemma. Moreover,

|𝒢|=|𝒯¯d||𝒯d|=N(ε/2,𝕊n1,2)d.\left|\mathcal{G}\right|\ =\ \bigl{|}\overline{\mathcal{T}}^{d}\bigr{|}\ \leq% \ \left|\mathcal{T}^{d}\right|\ =\ N\left(\varepsilon/2,\mathbb{S}^{n-1},\left% \|{\cdot}\right\|_{2}\right)^{d}.| caligraphic_G | = | over¯ start_ARG caligraphic_T end_ARG start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT | ≤ | caligraphic_T start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT | = italic_N ( italic_ε / 2 , blackboard_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT .

Let us estimate the last quantity. We use the inequality

N(ε,𝕊n1,2)D(ε,𝕊n1,2),N\left(\varepsilon,\mathbb{S}^{n-1},\left\|{\cdot}\right\|_{2}\right)\ \leq\ D% \left(\varepsilon,\mathbb{S}^{n-1},\left\|{\cdot}\right\|_{2}\right),italic_N ( italic_ε , blackboard_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ≤ italic_D ( italic_ε , blackboard_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ,

where D(ε,𝕊n1,2)D\left(\varepsilon,\mathbb{S}^{n-1},\left\|{\cdot}\right\|_{2}\right)italic_D ( italic_ε , blackboard_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) is the maximum number of disjoint balls with radius ε/2𝜀2\varepsilon/2italic_ε / 2 and with center in 𝕊n1superscript𝕊𝑛1\mathbb{S}^{n-1}blackboard_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT. Recall that

𝒜(𝕊n1)=2πn/2Γ(n/2)and𝒱(Bn1(ε))=πn12εn1Γ(n+12).formulae-sequence𝒜superscript𝕊𝑛12superscript𝜋𝑛2Γ𝑛2and𝒱subscript𝐵𝑛1𝜀superscript𝜋𝑛12superscript𝜀𝑛1Γ𝑛12\mathcal{A}\left(\mathbb{S}^{n-1}\right)\ =\ \frac{2\pi^{n/2}}{\Gamma(n/2)}% \qquad\text{and}\qquad\mathcal{V}\left(B_{n-1}(\varepsilon)\right)\ =\ \frac{% \pi^{\frac{n-1}{2}}\varepsilon^{n-1}}{\Gamma\left(\frac{n+1}{2}\right)}.caligraphic_A ( blackboard_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT ) = divide start_ARG 2 italic_π start_POSTSUPERSCRIPT italic_n / 2 end_POSTSUPERSCRIPT end_ARG start_ARG roman_Γ ( italic_n / 2 ) end_ARG and caligraphic_V ( italic_B start_POSTSUBSCRIPT italic_n - 1 end_POSTSUBSCRIPT ( italic_ε ) ) = divide start_ARG italic_π start_POSTSUPERSCRIPT divide start_ARG italic_n - 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT italic_ε start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_ARG start_ARG roman_Γ ( divide start_ARG italic_n + 1 end_ARG start_ARG 2 end_ARG ) end_ARG .

Consider the measure ν(ε/2)𝜈𝜀2\nu(\varepsilon/2)italic_ν ( italic_ε / 2 ) of the hyperspherical cap defined by the intersection of 𝕊n1superscript𝕊𝑛1\mathbb{S}^{n-1}blackboard_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT and a ball with center in 𝕊n1superscript𝕊𝑛1\mathbb{S}^{n-1}blackboard_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT and with radius ε/2𝜀2\varepsilon/2italic_ε / 2. The colatitude angle of the cap is ϕ=2arcsin(ε/4)italic-ϕ2𝜀4\phi=2\arcsin(\varepsilon/4)italic_ϕ = 2 roman_arcsin ( italic_ε / 4 ) and, according to [Li11],

ν(ε/2)=(n1)πn12Γ(n+12)0ϕsinn2(θ)𝑑θ.𝜈𝜀2𝑛1superscript𝜋𝑛12Γ𝑛12superscriptsubscript0italic-ϕsuperscript𝑛2𝜃differential-d𝜃\nu(\varepsilon/2)\ =\ \frac{(n-1)\pi^{\frac{n-1}{2}}}{\Gamma\left(\frac{n+1}{% 2}\right)}\int_{0}^{\phi}\sin^{n-2}(\theta)d\theta.italic_ν ( italic_ε / 2 ) = divide start_ARG ( italic_n - 1 ) italic_π start_POSTSUPERSCRIPT divide start_ARG italic_n - 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT end_ARG start_ARG roman_Γ ( divide start_ARG italic_n + 1 end_ARG start_ARG 2 end_ARG ) end_ARG ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_ϕ end_POSTSUPERSCRIPT roman_sin start_POSTSUPERSCRIPT italic_n - 2 end_POSTSUPERSCRIPT ( italic_θ ) italic_d italic_θ .

Since ϕε/2italic-ϕ𝜀2\phi\geq\varepsilon/2italic_ϕ ≥ italic_ε / 2,

0ϕsinn2(θ)𝑑θ0ϕ(sinϕϕθ)n2𝑑θ=(sinϕϕ)n2ϕn1n1(sinϕϕ)n21n1(ε2)n1superscriptsubscript0italic-ϕsuperscript𝑛2𝜃differential-d𝜃superscriptsubscript0italic-ϕsuperscriptitalic-ϕitalic-ϕ𝜃𝑛2differential-d𝜃superscriptitalic-ϕitalic-ϕ𝑛2superscriptitalic-ϕ𝑛1𝑛1superscriptitalic-ϕitalic-ϕ𝑛21𝑛1superscript𝜀2𝑛1\int_{0}^{\phi}\sin^{n-2}(\theta)d\theta\ \geq\ \int_{0}^{\phi}\left(\frac{% \sin\phi}{\phi}\cdot\theta\right)^{n-2}d\theta\ =\ \left(\frac{\sin\phi}{\phi}% \right)^{n-2}\frac{\phi^{n-1}}{n-1}\ \geq\ \left(\frac{\sin\phi}{\phi}\right)^% {n-2}\frac{1}{n-1}\left(\frac{\varepsilon}{2}\right)^{n-1}∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_ϕ end_POSTSUPERSCRIPT roman_sin start_POSTSUPERSCRIPT italic_n - 2 end_POSTSUPERSCRIPT ( italic_θ ) italic_d italic_θ ≥ ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_ϕ end_POSTSUPERSCRIPT ( divide start_ARG roman_sin italic_ϕ end_ARG start_ARG italic_ϕ end_ARG ⋅ italic_θ ) start_POSTSUPERSCRIPT italic_n - 2 end_POSTSUPERSCRIPT italic_d italic_θ = ( divide start_ARG roman_sin italic_ϕ end_ARG start_ARG italic_ϕ end_ARG ) start_POSTSUPERSCRIPT italic_n - 2 end_POSTSUPERSCRIPT divide start_ARG italic_ϕ start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT end_ARG start_ARG italic_n - 1 end_ARG ≥ ( divide start_ARG roman_sin italic_ϕ end_ARG start_ARG italic_ϕ end_ARG ) start_POSTSUPERSCRIPT italic_n - 2 end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG italic_n - 1 end_ARG ( divide start_ARG italic_ε end_ARG start_ARG 2 end_ARG ) start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT

and, using the facts that ε1𝜀1\varepsilon\leq 1italic_ε ≤ 1, ϕεitalic-ϕ𝜀\phi\leq\varepsilonitalic_ϕ ≤ italic_ε, and (sinϕ)/ϕ1/2italic-ϕitalic-ϕ12(\sin\phi)/\phi\geq 1/2( roman_sin italic_ϕ ) / italic_ϕ ≥ 1 / 2, we have

D(ε,𝕊n1,2)𝒜(𝕊n1)ν(ε/2)<𝒜(𝕊n1)𝒱(Bn1(ε/2))(12)n2=π(4ε)n1Γ(n+12)Γ(n/2).D\left(\varepsilon,\mathbb{S}^{n-1},\left\|{\cdot}\right\|_{2}\right)\ \leq\ % \frac{\mathcal{A}(\mathbb{S}^{n-1})}{\nu(\varepsilon/2)}\ <\ \frac{\mathcal{A}% (\mathbb{S}^{n-1})}{\mathcal{V}\left(B_{n-1}(\varepsilon/2)\right)\cdot\left(% \frac{1}{2}\right)^{n-2}}\ =\ \sqrt{\pi}\cdot\left(\frac{4}{\varepsilon}\right% )^{n-1}\cdot\frac{\Gamma\left(\frac{n+1}{2}\right)}{\Gamma(n/2)}.italic_D ( italic_ε , blackboard_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ≤ divide start_ARG caligraphic_A ( blackboard_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_ν ( italic_ε / 2 ) end_ARG < divide start_ARG caligraphic_A ( blackboard_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT ) end_ARG start_ARG caligraphic_V ( italic_B start_POSTSUBSCRIPT italic_n - 1 end_POSTSUBSCRIPT ( italic_ε / 2 ) ) ⋅ ( divide start_ARG 1 end_ARG start_ARG 2 end_ARG ) start_POSTSUPERSCRIPT italic_n - 2 end_POSTSUPERSCRIPT end_ARG = square-root start_ARG italic_π end_ARG ⋅ ( divide start_ARG 4 end_ARG start_ARG italic_ε end_ARG ) start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT ⋅ divide start_ARG roman_Γ ( divide start_ARG italic_n + 1 end_ARG start_ARG 2 end_ARG ) end_ARG start_ARG roman_Γ ( italic_n / 2 ) end_ARG .

The ratio of two Gamma functions can be bounded as follows

x+1/4<Γ(x+1)Γ(x+1/2)<x+1/2,𝑥14Γ𝑥1Γ𝑥12𝑥12\sqrt{x+1/4}\ <\ \frac{\Gamma(x+1)}{\Gamma(x+1/2)}\ <\ \sqrt{x+1/2},square-root start_ARG italic_x + 1 / 4 end_ARG < divide start_ARG roman_Γ ( italic_x + 1 ) end_ARG start_ARG roman_Γ ( italic_x + 1 / 2 ) end_ARG < square-root start_ARG italic_x + 1 / 2 end_ARG ,

for x>1/2𝑥12x>-1/2italic_x > - 1 / 2 (see [Wat59] and [LQ12], Section 2.3). Choosing x=(n1)/2𝑥𝑛12x=(n-1)/2italic_x = ( italic_n - 1 ) / 2, we obtain

N(ε,𝕊n1,2)<πn2(4ε)n1,N\left(\varepsilon,\mathbb{S}^{n-1},\left\|{\cdot}\right\|_{2}\right)\ <\ % \sqrt{\frac{\pi n}{2}}\cdot\left(\frac{4}{\varepsilon}\right)^{n-1},italic_N ( italic_ε , blackboard_S start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT , ∥ ⋅ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) < square-root start_ARG divide start_ARG italic_π italic_n end_ARG start_ARG 2 end_ARG end_ARG ⋅ ( divide start_ARG 4 end_ARG start_ARG italic_ε end_ARG ) start_POSTSUPERSCRIPT italic_n - 1 end_POSTSUPERSCRIPT ,

hence the result. ∎


Proof of Lemma 5.4.

Suppose that, for all rE𝐝𝟎𝑟subscript𝐸subscript𝐝0r\in E_{\mathbf{d_{0}}}italic_r ∈ italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, we have d(q1(r),𝒮𝕊d)<ϑ¯𝑑superscript𝑞1𝑟𝒮subscript𝕊𝑑¯italic-ϑd(q^{-1}(r),\mathcal{S}\cap\mathbb{S}_{d})<\overline{\vartheta}italic_d ( italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_r ) , caligraphic_S ∩ blackboard_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) < over¯ start_ARG italic_ϑ end_ARG and, for all rE1𝐝𝟎superscript𝑟subscript𝐸1subscript𝐝0r^{\prime}\in E_{1-\mathbf{d_{0}}}italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_E start_POSTSUBSCRIPT 1 - bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, d(q1(r),𝒮𝕊d)<ϑ¯𝑑superscript𝑞1superscript𝑟superscript𝒮perpendicular-tosubscript𝕊𝑑¯italic-ϑd(q^{-1}(r^{\prime}),\mathcal{S}^{\perp}\cap\mathbb{S}_{d})<\overline{\vartheta}italic_d ( italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) , caligraphic_S start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT ∩ blackboard_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) < over¯ start_ARG italic_ϑ end_ARG. Let us show that for all vectors eisubscript𝑒𝑖e_{i}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT of the canonical basis, q1(ei)q¯1(ei)<2dϑ¯normsuperscript𝑞1subscript𝑒𝑖superscript¯𝑞1subscript𝑒𝑖2𝑑¯italic-ϑ\left\|{q^{-1}(e_{i})-\overline{q}^{-1}(e_{i})}\right\|<2\sqrt{d}\cdot% \overline{\vartheta}∥ italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) - over¯ start_ARG italic_q end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∥ < 2 square-root start_ARG italic_d end_ARG ⋅ over¯ start_ARG italic_ϑ end_ARG. We begin with the first d0subscript𝑑0d_{0}italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT vectors (e1,,ed0)subscript𝑒1subscript𝑒subscript𝑑0(e_{1},\ldots,e_{d_{0}})( italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_e start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ). Define p𝒮subscriptp𝒮\operatorname{p_{\mathcal{S}}}roman_p start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT an operator which maps rE𝐝𝟎𝑟subscript𝐸subscript𝐝0r\in E_{\mathbf{d_{0}}}italic_r ∈ italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT to argminu𝒮𝕊dq1(r)usubscriptargmin𝑢𝒮subscript𝕊𝑑normsuperscript𝑞1𝑟𝑢\operatorname*{arg\,min}_{u\in\mathcal{S}\cap\mathbb{S}_{d}}\left\|{q^{-1}(r)-% u}\right\|start_OPERATOR roman_arg roman_min end_OPERATOR start_POSTSUBSCRIPT italic_u ∈ caligraphic_S ∩ blackboard_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_r ) - italic_u ∥. Then, for i=1,,d0𝑖1subscript𝑑0i=1,\ldots,d_{0}italic_i = 1 , … , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, we have q1(ei)p𝒮(q1(ei))<ϑ¯normsuperscript𝑞1subscript𝑒𝑖subscriptp𝒮superscript𝑞1subscript𝑒𝑖¯italic-ϑ\left\|{q^{-1}(e_{i})-\operatorname{p_{\mathcal{S}}}(q^{-1}(e_{i}))}\right\|<% \overline{\vartheta}∥ italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) - start_OPFUNCTION roman_p start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT end_OPFUNCTION ( italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) ∥ < over¯ start_ARG italic_ϑ end_ARG. Now, we reuse the arguments of the proof of Lemma 5.8, with A:=(p𝒮(q1(e1))||p𝒮(q1(ed0)))assign𝐴subscriptp𝒮superscript𝑞1subscript𝑒1subscriptp𝒮superscript𝑞1subscript𝑒subscript𝑑0A:=\left(\operatorname{p_{\mathcal{S}}}(q^{-1}(e_{1}))|\cdots|\operatorname{p_% {\mathcal{S}}}(q^{-1}(e_{d_{0}}))\right)italic_A := ( start_OPFUNCTION roman_p start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT end_OPFUNCTION ( italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ) | ⋯ | start_OPFUNCTION roman_p start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT end_OPFUNCTION ( italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ). We can write A=UH𝐴𝑈𝐻A=UHitalic_A = italic_U italic_H where U=(u1||ud0)𝑈subscript𝑢1subscript𝑢subscript𝑑0U=(u_{1}|\cdots|u_{d_{0}})italic_U = ( italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | ⋯ | italic_u start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) is a rectangular matrix with orthonormal columns in 𝒮𝒮\mathcal{S}caligraphic_S and where H𝐻Hitalic_H is symmetric. Moreover, taking Q:=(q1(e1)||q1(ed0))assign𝑄superscript𝑞1subscript𝑒1superscript𝑞1subscript𝑒subscript𝑑0Q:=(q^{-1}(e_{1})|\cdots|q^{-1}(e_{d_{0}}))italic_Q := ( italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) | ⋯ | italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ), and xE𝐝𝟎𝕊d𝑥subscript𝐸subscript𝐝0subscript𝕊𝑑x\in E_{\mathbf{d_{0}}}\cap\mathbb{S}_{d}italic_x ∈ italic_E start_POSTSUBSCRIPT bold_d start_POSTSUBSCRIPT bold_0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∩ blackboard_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT, x=i=1d0aiei𝑥superscriptsubscript𝑖1subscript𝑑0subscript𝑎𝑖subscript𝑒𝑖x=\sum_{i=1}^{d_{0}}a_{i}e_{i}italic_x = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, we have

QxAx=i=1d0ai(q1(ei)p𝒮(q1(ei)))<dϑ¯.norm𝑄𝑥𝐴𝑥normsuperscriptsubscript𝑖1subscript𝑑0subscript𝑎𝑖superscript𝑞1subscript𝑒𝑖subscriptp𝒮superscript𝑞1subscript𝑒𝑖𝑑¯italic-ϑ\left\|{Qx-Ax}\right\|\ =\ \left\|{\sum_{i=1}^{d_{0}}a_{i}\cdot\left(q^{-1}(e_% {i})-\operatorname{p_{\mathcal{S}}}(q^{-1}(e_{i}))\right)}\right\|\ <\ \sqrt{d% }\cdot\overline{\vartheta}.∥ italic_Q italic_x - italic_A italic_x ∥ = ∥ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ ( italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) - start_OPFUNCTION roman_p start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT end_OPFUNCTION ( italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) ) ∥ < square-root start_ARG italic_d end_ARG ⋅ over¯ start_ARG italic_ϑ end_ARG .

So, by Theorem 4.1 in [Hig89] (Theorem 5.9 in the present document), |AU|<dϑ¯norm𝐴𝑈𝑑¯italic-ϑ{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|A-U\right|\kern-1.07639pt% \right|\kern-1.07639pt\right|}<\sqrt{d}\cdot\overline{\vartheta}| | | italic_A - italic_U | | | < square-root start_ARG italic_d end_ARG ⋅ over¯ start_ARG italic_ϑ end_ARG. Then, (u1,,ud0)subscript𝑢1subscript𝑢subscript𝑑0(u_{1},\ldots,u_{d_{0}})( italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_u start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) is an orthonormal basis of 𝒮𝒮\mathcal{S}caligraphic_S such that p𝒮(q1(ei))ui<dϑ¯normsubscriptp𝒮superscript𝑞1subscript𝑒𝑖subscript𝑢𝑖𝑑¯italic-ϑ\left\|{\operatorname{p_{\mathcal{S}}}(q^{-1}(e_{i}))-u_{i}}\right\|<\sqrt{d}% \cdot\overline{\vartheta}∥ start_OPFUNCTION roman_p start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT end_OPFUNCTION ( italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) - italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ < square-root start_ARG italic_d end_ARG ⋅ over¯ start_ARG italic_ϑ end_ARG, for i=1,,d0𝑖1subscript𝑑0i=1,\ldots,d_{0}italic_i = 1 , … , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. Let q¯Q*¯𝑞superscript𝑄\overline{q}\in Q^{*}over¯ start_ARG italic_q end_ARG ∈ italic_Q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT be an isometry such that q¯(ei)=ui¯𝑞subscript𝑒𝑖subscript𝑢𝑖\overline{q}(e_{i})=u_{i}over¯ start_ARG italic_q end_ARG ( italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, i=1,,d0𝑖1subscript𝑑0i=1,\ldots,d_{0}italic_i = 1 , … , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. Then

q1(ei)q¯1(ei)q1(ei)p𝒮(q1(ei))+p𝒮(q1(ei))q¯(ei)< 2dϑ¯,i=1,,d0.formulae-sequencenormsuperscript𝑞1subscript𝑒𝑖superscript¯𝑞1subscript𝑒𝑖normsuperscript𝑞1subscript𝑒𝑖subscriptp𝒮superscript𝑞1subscript𝑒𝑖normsubscriptp𝒮superscript𝑞1subscript𝑒𝑖¯𝑞subscript𝑒𝑖2𝑑¯italic-ϑ𝑖1subscript𝑑0\left\|{q^{-1}(e_{i})-\overline{q}^{-1}(e_{i})}\right\|\ \leq\ \left\|{q^{-1}(% e_{i})-\operatorname{p_{\mathcal{S}}}(q^{-1}(e_{i}))}\right\|+\left\|{% \operatorname{p_{\mathcal{S}}}(q^{-1}(e_{i}))-\overline{q}(e_{i})}\right\|\ <% \ 2\sqrt{d}\cdot\overline{\vartheta},\qquad i=1,\ldots,d_{0}.∥ italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) - over¯ start_ARG italic_q end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∥ ≤ ∥ italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) - start_OPFUNCTION roman_p start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT end_OPFUNCTION ( italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) ∥ + ∥ start_OPFUNCTION roman_p start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT end_OPFUNCTION ( italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) - over¯ start_ARG italic_q end_ARG ( italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ∥ < 2 square-root start_ARG italic_d end_ARG ⋅ over¯ start_ARG italic_ϑ end_ARG , italic_i = 1 , … , italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT .

The same reasoning occurs with the remaining vectors, (ed0+1,,ed)subscript𝑒subscript𝑑01subscript𝑒𝑑(e_{d_{0}+1},\ldots,e_{d})( italic_e start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT , … , italic_e start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ), by replacing 𝒮𝒮\mathcal{S}caligraphic_S by 𝒮superscript𝒮perpendicular-to\mathcal{S}^{\perp}caligraphic_S start_POSTSUPERSCRIPT ⟂ end_POSTSUPERSCRIPT, and taking A=UHsuperscript𝐴superscript𝑈superscript𝐻A^{\prime}=U^{\prime}H^{\prime}italic_A start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_H start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, with U=(ud0+1||ud)superscript𝑈subscript𝑢subscript𝑑01subscript𝑢𝑑U^{\prime}=(u_{d_{0}+1}|\cdots|u_{d})italic_U start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = ( italic_u start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT | ⋯ | italic_u start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ). The isometry q¯Q*¯𝑞superscript𝑄\overline{q}\in Q^{*}over¯ start_ARG italic_q end_ARG ∈ italic_Q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is now the one that maps eisubscript𝑒𝑖e_{i}italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT to uisubscript𝑢𝑖u_{i}italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for i=1,,d𝑖1𝑑i=1,\ldots,ditalic_i = 1 , … , italic_d. As a result, for all x𝕊d𝑥subscript𝕊𝑑x\in\mathbb{S}_{d}italic_x ∈ blackboard_S start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT, x=i=1daiei𝑥superscriptsubscript𝑖1𝑑subscript𝑎𝑖subscript𝑒𝑖x=\sum_{i=1}^{d}a_{i}e_{i}italic_x = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, we have

q1(x)q¯1(x)=i=1dai(q1(ei)q¯1(ei))< 2dϑ¯=ϑ,normsuperscript𝑞1𝑥superscript¯𝑞1𝑥normsuperscriptsubscript𝑖1𝑑subscript𝑎𝑖superscript𝑞1subscript𝑒𝑖superscript¯𝑞1subscript𝑒𝑖2𝑑¯italic-ϑitalic-ϑ\left\|{q^{-1}(x)-\overline{q}^{-1}(x)}\right\|\ =\ \left\|{\sum_{i=1}^{d}a_{i% }\cdot\left(q^{-1}(e_{i})-\overline{q}^{-1}(e_{i})\right)}\right\|\ <\ 2d\cdot% \overline{\vartheta}\ =\ \vartheta,∥ italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_x ) - over¯ start_ARG italic_q end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_x ) ∥ = ∥ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ ( italic_q start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) - over¯ start_ARG italic_q end_ARG start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_e start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) ∥ < 2 italic_d ⋅ over¯ start_ARG italic_ϑ end_ARG = italic_ϑ ,

which contradicts the fact that |q¯q|>ϑnorm¯𝑞𝑞italic-ϑ{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\overline{q}-q\right|\kern-1.% 07639pt\right|\kern-1.07639pt\right|}>\vartheta| | | over¯ start_ARG italic_q end_ARG - italic_q | | | > italic_ϑ. Finally, d(q,Q*)<ϑ𝑑𝑞superscript𝑄italic-ϑd(q,Q^{*})<\varthetaitalic_d ( italic_q , italic_Q start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) < italic_ϑ. ∎

Acknowledgement

We acknowledge the support of the French Agence Nationale de la Recherche (ANR) under reference ANR-21-CE40-0007 (GAP Project).

References

  • [Bir86] Lucien Birgé “On estimating a density using Hellinger distance and some other strange facts” In Probability Theory and Related Fields 71, 1986, pp. 271–291
  • [Bor75] Christer Borell “The Brunn–Minkowski inequality in Gauss space” In Inventiones mathematicae 30, 1975, pp. 207–216
  • [CD12] Laëtitia Comminges and Arnak S. Dalalyan “Tight conditions for consistency of variable selection in the context of high dimensionality” In The Annals of Statistics 40.5, 2012, pp. 2667–2696
  • [Coo98] R.Dennis Cook “Regression graphics: Ideas for studying regressions through graphics” John Wiley & Sons, 1998
  • [FS23] Gianluca Finocchio and Johannes Schmidt-Hieber “Posterior contraction for deep Gaussian process priors” In Journal of Machine Learning Research 24.66, 2023, pp. 1–49 URL: http://jmlr.org/papers/v24/21-0556.html
  • [GGV00] Subhashis Ghosal, Jayanta K. Ghosh and Aad W. Van Der Vaart “Convergence rates of posterior distributions” In The Annals of Statistics 28.2, 2000, pp. 500–531
  • [GN11] Evarist Giné and Richard Nickl “Rates of contraction for posterior distributions in Lrsuperscript𝐿𝑟L^{r}italic_L start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT-metrics, 1r1𝑟1\leq r\leq\infty1 ≤ italic_r ≤ ∞ In The Annals of Statistics 39.6, 2011, pp. 2883–2911
  • [Hig89] Nicholas J. Higham “Matrix nearness problems and applications” In Applications of Matrix Theory Oxford University Press, 1989, pp. 1–27
  • [JT21] Sheng Jiang and Surya T. Tokdar “Variable selection consistency of Gaussian process regression” In The Annals of Statistics 49.5, 2021, pp. 2491–2505
  • [Li11] Shengqiao Li “Concise formulas for the area and volume of a hyperspherical cap” In Asian Journal of Mathematics and Statistics 4.1 ANSInet, 2011, pp. 66–70
  • [Li91] Ker-Chau Li “Sliced inverse regression for dimension reduction” In Journal of the American Statistical Association 86.414 Taylor & Francis, 1991, pp. 316–327
  • [Lin+21] Qian Lin, Xinran Li, Dongming Huang and Jun S. Liu “On the optimality of sliced inverse regression in high dimensions” In The Annals of Statistics 49.1 Institute of Mathematical Statistics, 2021, pp. 1–20
  • [LQ12] Qiu-Ming Luo and Feng Qi “Bounds for the ratio of two gamma functions—From Wendel’s and related inequalities to logarithmically completely monotonic functions” In Banach Journal of Mathematical Analysis 6.2 Tusi Mathematical Research Group, 2012, pp. 132–158
  • [LZL18] Qian Lin, Zhigen Zhao and Jun S. Liu “On consistency and sparsity for sliced inverse regression in high dimensions” In The Annals of Statistics 46.2, 2018, pp. 580–610
  • [LZL19] Qian Lin, Zhigen Zhao and Jun S. Liu “Sparse sliced inverse regression via lasso” In Journal of the American Statistical Association 114.528 Taylor & Francis, 2019, pp. 1726–1739
  • [STG13] Weining Shen, Surya T. Tokdar and Subhashis Ghosal “Adaptive Bayesian multivariate density estimation with Dirichlet mixtures” In Biometrika 100.3 Oxford University Press, 2013, pp. 623–640
  • [Sto82] Charles J. Stone “Optimal global rates of convergence for nonparametric regression” In The Annals of Statistics 10.4 Institute of Mathematical Statistics, 1982, pp. 1040–1053
  • [Tok11] Surya T. Tokdar “Dimension adaptability of Gaussian process models with variable selection and projection” Preprint . Available at arXiv:1112.0716, 2011
  • [TSY20] Kai Tan, Lei Shi and Zhou Yu “Sparse SIR: Optimal rates and adaptive estimation” In The Annals of Statistics 48.1 Institute of Mathematical Statistics, 2020, pp. 64–85
  • [TZG10] Surya T. Tokdar, Yu M. Zhu and Jayanta K. Ghosh “Bayesian density regression with logistic Gaussian process and subspace projection” In Bayesian Analysis 5.2 Institute of Mathematical Statistics, 2010, pp. 319
  • [Ver12] Nicolas Verzelen “Minimax risks for sparse regressions: Ultra-high dimensional phenomenons” In Electronic Journal of Statistics 6 Institute of Mathematical StatisticsBernoulli Society, 2012, pp. 38–90
  • [VV08] Aad W. Van Der Vaart and J.Harry Van Zanten “Rates of contraction of posterior distributions based on Gaussian process priors” In The Annals of Statistics 36.3, 2008, pp. 1435–1463
  • [VV08a] Aad W. Van Der Vaart and J.Harry Van Zanten “Reproducing kernel Hilbert spaces of Gaussian priors” In Pushing the Limits of Contemporary Statistics: Contributions in Honor of Jayanta K. Ghosh. Inst. Math. Stat. (IMS) Collect. 3, 2008, pp. 200–222
  • [VV09] Aad W. Van Der Vaart and J.Harry Van Zanten “Adaptative Bayesian estimation using a Gaussian random field with inverse gamma bandwidth” In The Annals of Statistics 37.5B, 2009, pp. 2655–2675
  • [Wai09] Martin J. Wainwright “Sharp thresholds for high-Dimensional and noisy sparsity recovery using 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-constrained quadratic programming (Lasso)” In IEEE transactions on information theory 55.5 IEEE, 2009, pp. 2183–2202
  • [Wat59] G.N. Watson “A note on gamma functions” In Edinburgh Mathematical Notes 42 Cambridge University Press, 1959, pp. 7–9
  • [YD16] Yun Yang and David B. Dunson “Bayesian manifold regression” In The Annals of Statistics 44.2, 2016, pp. 876–905
  • [YT15] Yun Yang and Surya T. Tokdar “Minimax-optimal nonparametric regression in high dimensions” In The Annals of Statistics 43.2, 2015, pp. 652–674
  • [ZMP06] Lixing Zhu, Baiqi Miao and Heng Peng “On sliced inverse regression with high-dimensional covariates” In Journal of the American Statistical Association 101.474 Taylor & Francis, 2006, pp. 630–643
  • [ZMZ22] **g Zeng, Qing Mai and Xin Zhang “Subspace estimation with automatic dimension and variable selection in sufficient dimension reduction” In Journal of the American Statistical Association Taylor & Francis, 2022, pp. 1–13