HTML conversions sometimes display errors due to content that did not convert correctly from the source. This paper uses the following packages that are not yet supported by the HTML conversion tool. Feedback on these issues are not necessary; they are known and are being worked on.

  • failed: nowidow

Authors: achieve the best HTML results from your LaTeX submissions by following these best practices.

License: CC BY-SA 4.0
arXiv:2401.02883v1 [cs.RO] 05 Jan 2024

iPolicy: Incremental Policy Algorithms for Feedback Motion Planning

Guoxiang Zhao1,*1{}^{1,*}start_FLOATSUPERSCRIPT 1 , * end_FLOATSUPERSCRIPT Devesh K. Jha2,*2{}^{2,*}start_FLOATSUPERSCRIPT 2 , * end_FLOATSUPERSCRIPT Yebin Wang22{}^{2}start_FLOATSUPERSCRIPT 2 end_FLOATSUPERSCRIPT Minghui Zhu33{}^{3}start_FLOATSUPERSCRIPT 3 end_FLOATSUPERSCRIPT
*{}^{*}start_FLOATSUPERSCRIPT * end_FLOATSUPERSCRIPT Equal contributions.11{}^{1}start_FLOATSUPERSCRIPT 1 end_FLOATSUPERSCRIPT Guoxiang Zhao is with School of Future Technology and Institute of Artificial Intelligence, Shanghai University, Shanghai, 200444, China. (email: [email protected])22{}^{2}start_FLOATSUPERSCRIPT 2 end_FLOATSUPERSCRIPT Devesh K. Jha and Yebin Wang are with Mitsubishi Electric Research Laboratories, Cambridge, MA 02139, USA. (email: [email protected], [email protected])33{}^{3}start_FLOATSUPERSCRIPT 3 end_FLOATSUPERSCRIPT Minghui Zhu is with School of Electrical Engineering and Computer Science, Pennsylvania State University, University Park, PA 16802, USA. This work by M. Zhu was partially supported by the grant U.S. NSF CNS1830390. (email: [email protected])
Abstract

This paper presents policy-based motion planning for robotic systems. The motion planning literature has been mostly focused on open-loop trajectory planning which is followed by tracking online. In contrast, we solve the problem of path planning and controller synthesis simultaneously by solving the related feedback control problem. We present a novel incremental policy (iPolicy) algorithm for motion planning, which integrates sampling-based methods and set-valued optimal control methods to compute feedback controllers for the robotic system. In particular, we use sampling to incrementally construct the state space of the system. Asynchronous value iterations are performed on the sampled state space to synthesize the incremental policy feedback controller. We show the convergence of the estimates to the optimal value function in continuous state space. Numerical results with various different dynamical systems (including nonholonomic systems) verify the optimality and effectiveness of iPolicy.

I Introduction

Informally speaking, given a robot with a description of its dynamics, a description of its environment and a set of goal states, the motion planning problem is to find a sequence of control inputs so as to guide the robot from the initial state to one of the goal states while avoiding collision in the cluttered environment. It is well-known that robotic motion planning is at least as difficult as the generalized piano mover’s problem, which has been proven to be PSPACE-hard [1]. Many planning algorithms have been proposed. The discrete approaches, such as Dijkstra’s Algorithm and A* [2], usually depend on the structured environment and apply graph search to find the shortest path. The optimality of their returned trajectories are formally guaranteed for discrete problems but their heavy dependency on structured environment makes them suffer from inaccuracies and infeasibility of the solution when dealing with continuous systems [3]. Recently, sampling-based geometric planning algorithms such as the rapidly-exploring random trees (RRT) [4] and its optimal variant RRT* [5] are arguably the most influential and widely-used motion planning algorithms since the last two decades. They are shown to compute quickly in high-dimensional continuous space and possess theoretical guarantees such as probabilistic completeness and optimality.

Robots, in most of the practical applications, have stringent differential constraints on their motion which need to be properly incorporated during motion planning. Motion planning for dynamical systems has a rich history and much work has been done on this topic [6, 7, 2, 8]. However, the problem of motion planning for nonholonomic systems is still open in many aspects [9]. Sampling-based algorithms have received a lot of attention for their efficiency in handling obstacle-cluttered environments. There are two main bodies of research to make these methods more suitable for dealing with differential constraints: one develops steering functions for dynamical systems using concepts from feedback control theory (see e.g., [10, 11]); the other direction of work improves the computational effectiveness of the algorithms using geometric planning (see e.g., [9, 12, 13]). Despite the tremendous body of work on this topic, most of the work focuses on planning open-loop trajectories.

In practical implementation, the planned open-loop trajectories are tracked by a feedback controller. [14]. The presence of differential, state and input constraints, however, makes the design of the feedback tracking controller for highly nonlinear and complex robots still a difficult problem. Furthermore, the performance guarantees of the closed-loop trajectories could be lost in the presence of these constraints, especially in terms of minimizing the aggregated cost. It necessitates feedback motion planning which explicitly takes into account the feedback tracking during the planning process. The feedback planner computes the map** from state space to control space subject to constraints and equivalently searches for the solution for every initial condition of the robot; this facilitates the completeness and performance guarantee of the planner. However, the feedback motion planning problem is challenging in various aspects and most of the computational issues involving feedback planning are still unexplored and open. Interested readers are referred to [3] for a comprehensive survey of the challenges in feedback motion planning. Most of the previous approaches for feedback motion planning struggle with computational tractability in continuous state-space or local minima issues [2]. In this paper, we present a method for feedback motion planning of dynamical systems which makes use of sampling-based methods and value iterations to recover the optimal feedback controller. As a synthesis of the computational advantages of sampling-based algorithms and the approximation consistency of set-valued algorithms [15], the proposed iPolicy algorithm is an incremental and anytime feedback motion planner with formal guarantee of asymptotic optimality.

Contributions. This paper presents a sampling-based algorithm for feedback motion planning for a class of nonlinear robotic systems. In this work, we leverage the existing set-valued analysis tool for motion planning and asynchronous value iterations to synthesize feedback motion planners in an incremental fashion. In particular, the continual sampling creates an approximation of the original minimal time problem in every iteration and the policy on the discretized space provides an incrementally refined estimate of the value function. Using value iterations in an asynchronous and incremental manner limits the computational requirements while retaining optimality conditions. We show the convergence of the estimated value functions to the optimal value function for the motion planning problem in the continuous state space. For clarification of presentation, the various salient features of the algorithms are demonstrated with simulations using point mass system. Some numerical simulations are then provided for feedback motion planning of simple car and Dubins car (these are well studied nonholonomic systems [2, 8]) using the proposed motion planning algorithm. Several simulations are provided to show the value functions calculated by the proposed algorithm and trajectories under the guidance of the computed controller in cluttered environments. Through simulation study of various different dynamical systems, we show the applicability of the proposed algorithms across a wide range of robotic systems (two different classic nonholonomic systems).

Organization. This paper is organized in eight sections including the current one. We present work related to our proposed problem in Section II. Commonly sued notations and notions are clarified in Section III. In Section IV, we present a formal statement of the problem solved in this paper. The main algorithms of the paper are presented in Section V followed by analysis of the same in Section VI. Numerical results for the proposed algorithm are presented in Section VII with related interpretation and discussion. The paper is finally concluded with a summary and future work in Section VIII.

II Related Work

Motion planning is central to robotics. Consequently, it has received a lot of attention in robotics and controls community. Broadly speaking, there are mostly three kinds of approaches for motion planning – optimization-based methods, gradient-based methods and sampling-based methods.

Some of the most popular optimization-based approaches could be found in [16, 17, 18, 19, 20]. The main idea of these approaches is to formulate a dynamic optimization problem with smooth formulation of constraints (like collision, etc.). The resulting optimization problem can then be solved to generate an optimal trajectory. However, the main shortcoming of these optimization-based techniques is that they struggle to find solution in obstacle cluttered, non-convex spaces. Trajectory optimization techniques using optimal control literature can generate control trajectories in presence of state and input constraints for nonlinear dynamical systems – however, they may not be able to find feasible trajectories in the presence of arbitrary obstacles  [21, 22]. These methods are mostly used to compute trajectories under dynamical constraints which can be followed online using a trajectory tracking controller using desirable constraints for execution [23, 24, 25].

Gradient-based approaches, such as potential fields [26, 27] and navigation functions [28], consider the composition of a repulsive component for collision avoidance and an attractive component for reaching goal, and derive the control law based on the gradient of the synthesized field. The computations of both fields usually only depend on local information and this brings computational efficiency to gradient-based approaches; however, the cancellation of different components in the composition can also trap the robot at local minima [2]. Thus, these approaches tend to struggle when a robot has to operate in obstacle cluttered environment.

Sampling-based approaches are shown to be able to successfully address irregularly shaped environments and arguably the most widely used methods for motion planning in robotics [2, 29]. However, most of these methods are used to compute open-loop plans for the robot to follow. It is, in general, desirable that robots operate in a feedback fashion using state estimates during execution of a task. Consequently, there has been some work on using sampling-based algorithms to develop feedback planners by using different metrics to select the best path. Some methods to solve for feedback planners are presented in [30, 31, 10, 32, 33]. However, very little work has been done on using these algorithms to incrementally build the state-space of systems to solve for continuous time and space optimal control problems for the dynamical systems. So, even though the motion planning problem has been thoroughly studied in literature, there seems to be lack of techniques and algorithms which can compute feedback motion planners for dynamical systems. We make an attempt to address this problem in this paper.

Our approach is closely related to the value iterations-based approach in [34]; however, we use asynchronous value iterations, provide results for nonholonomic systems and also provide rigorous numerical simulation results. Our results on optimal performance do not require the explicit description of steering functions between any two states of the system in the collision-free space. As such, we expect our results to cater to a rich set of robotic motion planning problems. Our approach is also related to [35] but differs in the way to solve the approximate motion planning problem, where [35] spends heavy computations searching for the optimal solution over the entire graph before refinement while our approach swiftly solves for a small subset of the graph and proceeds to finer graphs. This brings the continuously increasing optimality to our approach and it incrementally improves solutions once more computational resources are given. The problem and algorithms presented in this paper are different from some other sampling-based feedback planning like [36], and the proposed algorithms have the advantage of being incremental (instead of batch). Furthermore, optimal performance guarantees for the feedback control problem have been provided.

III Notations and notions

Let \|\cdot\|∥ ⋅ ∥ be the 2222-norm in n𝑛{}^{n}start_FLOATSUPERSCRIPT italic_n end_FLOATSUPERSCRIPT. Define the distance from a point xnsuperscript𝑛𝑥absentx\in^{n}italic_x ∈ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT to a set 𝒳nsuperscript𝑛𝒳absent\mathcal{X}\subseteq^{n}caligraphic_X ⊆ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT by dist(x,𝒳)infx𝒳xxdist𝑥𝒳subscriptinfimumsuperscript𝑥𝒳norm𝑥superscript𝑥\operatorname{dist}(x,\mathcal{X})\triangleq\inf_{x^{\prime}\in\mathcal{X}}\|x% -x^{\prime}\|roman_dist ( italic_x , caligraphic_X ) ≜ roman_inf start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_X end_POSTSUBSCRIPT ∥ italic_x - italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∥. Given a compact set 𝒳nsuperscript𝑛𝒳absent\mathcal{X}\subseteq^{n}caligraphic_X ⊆ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and a function v:𝒳:𝑣𝒳absentv:\mathcal{X}\toitalic_v : caligraphic_X →, denote the supremum norm over 𝒳𝒳\mathcal{X}caligraphic_X by v𝒳supx𝒳|v(x)|subscriptnorm𝑣𝒳subscriptsupremum𝑥𝒳𝑣𝑥\|v\|_{\mathcal{X}}\triangleq\sup_{x\in\mathcal{X}}|v(x)|∥ italic_v ∥ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ≜ roman_sup start_POSTSUBSCRIPT italic_x ∈ caligraphic_X end_POSTSUBSCRIPT | italic_v ( italic_x ) |. Let the Lebesgue measure of the set 𝒳𝒳\mathcal{X}caligraphic_X be μ(𝒳)𝜇𝒳\mu(\mathcal{X})italic_μ ( caligraphic_X ). Denote the unit ball in n𝑛{}^{n}start_FLOATSUPERSCRIPT italic_n end_FLOATSUPERSCRIPT by nsubscript𝑛\mathcal{B}_{n}caligraphic_B start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT and the volume of the unit ball by Cnπn/2(n/2)!subscript𝐶𝑛superscript𝜋𝑛2𝑛2C_{n}\triangleq\frac{\pi^{n/2}}{(n/2)!}italic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≜ divide start_ARG italic_π start_POSTSUPERSCRIPT italic_n / 2 end_POSTSUPERSCRIPT end_ARG start_ARG ( italic_n / 2 ) ! end_ARG. When no ambiguity is caused, the dimensionality in the subscript may be omitted. Define the Minkowski sum in n𝑛{}^{n}start_FLOATSUPERSCRIPT italic_n end_FLOATSUPERSCRIPT for 𝒳nsuperscript𝑛𝒳absent\mathcal{X}\subseteq^{n}caligraphic_X ⊆ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT and 𝒴nsuperscript𝑛𝒴absent\mathcal{Y}\subseteq^{n}caligraphic_Y ⊆ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT by 𝒳+𝒴{x+y|x𝒳,y𝒴}𝒳𝒴conditional-set𝑥𝑦formulae-sequence𝑥𝒳𝑦𝒴\mathcal{X}+\mathcal{Y}\triangleq\{x+y|x\in\mathcal{X},y\in\mathcal{Y}\}caligraphic_X + caligraphic_Y ≜ { italic_x + italic_y | italic_x ∈ caligraphic_X , italic_y ∈ caligraphic_Y }. When 𝒳={x}𝒳𝑥\mathcal{X}=\{x\}caligraphic_X = { italic_x } is a singleton, with slight abuse of notations, the Minkowski sum can be written as x+𝒴{x+y|y𝒴}𝑥𝒴conditional-set𝑥𝑦𝑦𝒴x+\mathcal{Y}\triangleq\{x+y|y\in\mathcal{Y}\}italic_x + caligraphic_Y ≜ { italic_x + italic_y | italic_y ∈ caligraphic_Y }. Define the value assignment operator by ab𝑎𝑏a\leftarrow bitalic_a ← italic_b where the value of b𝑏bitalic_b is assigned to a𝑎aitalic_a.

IV Problem Formulation

Consider a robot associated with a time-invariant dynamic system governed by the following differential equation:

x˙(t)=f(x(t),u(t)),˙𝑥𝑡𝑓𝑥𝑡𝑢𝑡\dot{x}(t)=f(x(t),u(t)),over˙ start_ARG italic_x end_ARG ( italic_t ) = italic_f ( italic_x ( italic_t ) , italic_u ( italic_t ) ) , (1)

where x(t)𝒳n𝑥𝑡𝒳superscript𝑛absentx(t)\in\mathcal{X}\subseteq^{n}italic_x ( italic_t ) ∈ caligraphic_X ⊆ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is the state and u(t)𝒰m𝑢𝑡𝒰superscript𝑚absentu(t)\in\mathcal{U}\subseteq^{m}italic_u ( italic_t ) ∈ caligraphic_U ⊆ start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT is the control of robot. The following mild assumptions are imposed throughout the paper and they are inherited from [15].

Assumption IV.1.

The following assumptions hold for (1):

  1. (A1)

    𝒳𝒳\mathcal{X}caligraphic_X and 𝒰𝒰\mathcal{U}caligraphic_U are non-empty and compact;

  2. (A2)

    f𝑓fitalic_f is continuous in u𝑢uitalic_u and Lipschitz continuous in x𝑥xitalic_x for any u𝒰𝑢𝒰u\in\mathcal{U}italic_u ∈ caligraphic_U with a Lipschitz constant l𝑙litalic_l;

  3. (A3)

    f𝑓fitalic_f is linear growth; i.e., c0𝑐0\exists c\geq 0∃ italic_c ≥ 0 s.t. x𝒳,u𝒰,f(x,u)c(x+u+1)formulae-sequencefor-all𝑥𝒳formulae-sequence𝑢𝒰norm𝑓𝑥𝑢𝑐norm𝑥norm𝑢1\forall x\in\mathcal{X},u\in\mathcal{U},\|f(x,u)\|\leq c(\|x\|+\|u\|+1)∀ italic_x ∈ caligraphic_X , italic_u ∈ caligraphic_U , ∥ italic_f ( italic_x , italic_u ) ∥ ≤ italic_c ( ∥ italic_x ∥ + ∥ italic_u ∥ + 1 );

  4. (A4)

    u𝒰f(x,u)subscript𝑢𝒰𝑓𝑥𝑢\bigcup_{u\in\mathcal{U}}f(x,u)⋃ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT italic_f ( italic_x , italic_u ) is a convex set for any x𝒳𝑥𝒳x\in\mathcal{X}italic_x ∈ caligraphic_X.

Assumptions (A1) and (A2) imply f(x,u)norm𝑓𝑥𝑢\|f(x,u)\|∥ italic_f ( italic_x , italic_u ) ∥ is bounded. Denote the upper bound of f(x,u)norm𝑓𝑥𝑢\|f(x,u)\|∥ italic_f ( italic_x , italic_u ) ∥ by Mmaxx𝒳,u𝒰f(x,u)𝑀subscriptformulae-sequence𝑥𝒳𝑢𝒰norm𝑓𝑥𝑢M\triangleq\max_{x\in\mathcal{X},u\in\mathcal{U}}\|f(x,u)\|italic_M ≜ roman_max start_POSTSUBSCRIPT italic_x ∈ caligraphic_X , italic_u ∈ caligraphic_U end_POSTSUBSCRIPT ∥ italic_f ( italic_x , italic_u ) ∥.

Let 𝒳obssubscript𝒳obs\mathcal{X}_{\mathrm{obs}}caligraphic_X start_POSTSUBSCRIPT roman_obs end_POSTSUBSCRIPT and 𝒳goalsubscript𝒳goal\mathcal{X}_{\mathrm{goal}}caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT be the obstacle region and the goal region, respectively. Define the obstacle free region as 𝒳free𝒳𝒳obssubscript𝒳free𝒳subscript𝒳obs\mathcal{X}_{\mathrm{free}}\triangleq\mathcal{X}\setminus\mathcal{X}_{\mathrm{% obs}}caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT ≜ caligraphic_X ∖ caligraphic_X start_POSTSUBSCRIPT roman_obs end_POSTSUBSCRIPT. Denote the trajectory of system (1) as ϕ(;x,π):𝒳\phi(\cdot;x,\pi):\to\mathcal{X}italic_ϕ ( ⋅ ; italic_x , italic_π ) : → caligraphic_X given the initial state x𝑥xitalic_x and a state feedback controller π:𝒳𝒰:𝜋𝒳𝒰\pi:\mathcal{X}\to\mathcal{U}italic_π : caligraphic_X → caligraphic_U. Let 𝕋π(x)superscript𝕋𝜋𝑥\mathds{T}^{\pi}(x)blackboard_T start_POSTSUPERSCRIPT italic_π end_POSTSUPERSCRIPT ( italic_x ) be the first time when the trajectory ϕ(;x,π)italic-ϕ𝑥𝜋\phi(\cdot;x,\pi)italic_ϕ ( ⋅ ; italic_x , italic_π ) hits 𝒳goalsubscript𝒳goal\mathcal{X}_{\mathrm{goal}}caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT while staying in 𝒳freesubscript𝒳free\mathcal{X}_{\mathrm{free}}caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT before 𝕋π(x)superscript𝕋𝜋𝑥\mathds{T}^{\pi}(x)blackboard_T start_POSTSUPERSCRIPT italic_π end_POSTSUPERSCRIPT ( italic_x ); i.e.,

𝕋π(x)inf{t0|ϕ(t;x,π)𝒳goal,ϕ(τ;x,π)𝒳free,τ[0,t]}.superscript𝕋𝜋𝑥infimumconditional-set𝑡0formulae-sequenceitalic-ϕ𝑡𝑥𝜋subscript𝒳goalformulae-sequenceitalic-ϕ𝜏𝑥𝜋subscript𝒳freefor-all𝜏0𝑡\begin{split}\mathds{T}^{\pi}(x)&\triangleq\inf\{t\geq 0|\phi(t;x,\pi)\in% \mathcal{X}_{\mathrm{goal}},\\ &\phi(\tau;x,\pi)\in\mathcal{X}_{\mathrm{free}},\forall\tau\in[0,t]\}.\end{split}start_ROW start_CELL blackboard_T start_POSTSUPERSCRIPT italic_π end_POSTSUPERSCRIPT ( italic_x ) end_CELL start_CELL ≜ roman_inf { italic_t ≥ 0 | italic_ϕ ( italic_t ; italic_x , italic_π ) ∈ caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT , end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_ϕ ( italic_τ ; italic_x , italic_π ) ∈ caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT , ∀ italic_τ ∈ [ 0 , italic_t ] } . end_CELL end_ROW

The problem of interest in this paper is to find the control policy π*:𝒳𝒰:superscript𝜋𝒳𝒰\pi^{*}:\mathcal{X}\to\mathcal{U}italic_π start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT : caligraphic_X → caligraphic_U for system (1) that incurs the minimal traveling time 𝕋*superscript𝕋\mathds{T}^{*}blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT for every x𝒳𝑥𝒳x\in\mathcal{X}italic_x ∈ caligraphic_X; that is,

𝕋*(x)=minπ𝕋π(x),x𝒳formulae-sequencesuperscript𝕋𝑥subscript𝜋superscript𝕋𝜋𝑥for-all𝑥𝒳\displaystyle\mathds{T}^{*}(x)=\min_{\pi}\mathds{T}^{\pi}(x),\forall x\in% \mathcal{X}blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) = roman_min start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT blackboard_T start_POSTSUPERSCRIPT italic_π end_POSTSUPERSCRIPT ( italic_x ) , ∀ italic_x ∈ caligraphic_X

and the minimizer π*superscript𝜋\pi^{*}italic_π start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is referred to as the optimal policy.

Notice that the minimal traveling time function 𝕋*superscript𝕋\mathds{T}^{*}blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is not necessarily bounded, as 𝒳goalsubscript𝒳goal\mathcal{X}_{\mathrm{goal}}caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT may not be reachable for every x𝒳free𝑥subscript𝒳freex\in\mathcal{X}_{\mathrm{free}}italic_x ∈ caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT, and 𝕋*(x)superscript𝕋𝑥\mathds{T}^{*}(x)blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) would be positive infinity for such states. The positive infinity can cause difficulties in both computational expressions and mathematical analysis, and we address this issue by leveraging the Kruzhkov transform Ψ:0{+}[0,1]\Psi:_{\geq 0}\bigcup\{+\infty\}\to[0,1]roman_Ψ : start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT ⋃ { + ∞ } → [ 0 , 1 ], defined as

(Ψ𝕋)(x)1exp(𝕋(x)),Ψ𝕋𝑥1𝕋𝑥\displaystyle(\Psi\circ\mathds{T})(x)\triangleq 1-\exp(-\mathds{T}(x)),( roman_Ψ ∘ blackboard_T ) ( italic_x ) ≜ 1 - roman_exp ( - blackboard_T ( italic_x ) ) ,

and we let the transformed value function be denoted by ΘΨ𝕋ΘΨ𝕋\varTheta\triangleq\Psi\circ\mathds{T}roman_Θ ≜ roman_Ψ ∘ blackboard_T inheriting all subscripts and superscripts of 𝕋𝕋\mathds{T}blackboard_T. Notice that the Kruzhkov transform is bijective and monotonically increasing. The positive infity is transformed to 1111 after the Kruzhkov transform while 00 remains the same. The objective of this paper is equivalently to find the transformed minimal traveling time function Θ*superscriptΘ\varTheta^{*}roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT and the optimal policy π*superscript𝜋\pi^{*}italic_π start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT.

V The Incremental Policy (iPolicy) Algorithm

In this section, we present the incremental policy algorithm iPolicy. We leverage sampling-based methods in [5, 12, 37] and set-valued methods in [15, 35] along with asynchronous value iterations to incrementally approximate the minimal traveling time function. In particular, iPolicy consists of two components: graph expansion and minimal traveling time estimation. In graph expansion, iPolicy continually samples the free region 𝒳freesubscript𝒳free\mathcal{X}_{\mathrm{free}}caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT and constructs a graph using the set-valued method to approximate the continuous time dynamic of system (1); in minimal traveling time estimation, asynchronous value iterations are executed over the approximate graph in a back propogation fashion, where only stale values are updated to save computations, to obtain an estimate of the minimal traveling time function. As more samples are added, iPolicy iteratively expands graph and estimates the minimal traveling time function until a certain limit (e.g., time limit, approximation error threshold) is reached. Denote the set of sampled states from 𝒳freesubscript𝒳free\mathcal{X}_{\mathrm{free}}caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT after k𝑘kitalic_k iterations of iPolicy by Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. With slight abuse of notations, denote the estimate of minimal traveling time, also known as the value function, after k𝑘kitalic_k iterations by 𝕋k:Vk[0,+]:subscript𝕋𝑘subscript𝑉𝑘0\mathds{T}_{k}:V_{k}\to[0,+\infty]blackboard_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT : italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT → [ 0 , + ∞ ]. The staleness function k:Vk0:subscript𝑘subscript𝑉𝑘subscriptabsent0\mathcal{F}_{k}:V_{k}\to\mathbb{N}_{\geq 0}caligraphic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT : italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT → blackboard_N start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT is defined as the number of iterations since the last update of a sample xVk𝑥subscript𝑉𝑘x\in V_{k}italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. We proceed to explain iPolicy in the rest of this section.

1 Initialization
2 for xV0𝑥subscript𝑉0x\in V_{0}italic_x ∈ italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT do
3       Θ0(x)0subscriptΘ0𝑥0\varTheta_{0}(x)\leftarrow 0roman_Θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_x ) ← 0;
4       0(x)Psubscript0𝑥𝑃\mathcal{F}_{0}(x)\leftarrow Pcaligraphic_F start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_x ) ← italic_P;
5      
6 k1𝑘1k\leftarrow 1italic_k ← 1;
7 Main Loop
8 while k<K𝑘𝐾k<Kitalic_k < italic_K do
9       xnew𝚂𝚊𝚖𝚙𝚕𝚎(𝒳free+dk)subscript𝑥new𝚂𝚊𝚖𝚙𝚕𝚎subscript𝒳freesubscript𝑑𝑘x_{\mathrm{new}}\leftarrow\texttt{Sample}(\mathcal{X}_{\mathrm{free}}+d_{k}% \mathcal{B})italic_x start_POSTSUBSCRIPT roman_new end_POSTSUBSCRIPT ← Sample ( caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B );
10       VkVk1{xnew}subscript𝑉𝑘subscript𝑉𝑘1subscript𝑥newV_{k}\leftarrow V_{k-1}\cup\{x_{\mathrm{new}}\}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ← italic_V start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ∪ { italic_x start_POSTSUBSCRIPT roman_new end_POSTSUBSCRIPT };
11       if xnew𝒳goal+(Mϵk+dk)subscript𝑥normal-newsubscript𝒳normal-goal𝑀subscriptitalic-ϵ𝑘subscript𝑑𝑘x_{\mathrm{new}}\in\mathcal{X}_{\mathrm{goal}}+(M\epsilon_{k}+d_{k})\mathcal{B}italic_x start_POSTSUBSCRIPT roman_new end_POSTSUBSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT + ( italic_M italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B then
12             Θ^k(xnew)0subscript^Θ𝑘subscript𝑥new0\hat{\varTheta}_{k}(x_{\mathrm{new}})\leftarrow 0over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT roman_new end_POSTSUBSCRIPT ) ← 0;
13            
14      else
15             Θ^k(xnew)1subscript^Θ𝑘subscript𝑥new1\hat{\varTheta}_{k}(x_{\mathrm{new}})\leftarrow 1over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT roman_new end_POSTSUBSCRIPT ) ← 1;
16            
17      ^k(xnew)Psubscript^𝑘subscript𝑥new𝑃\hat{\mathcal{F}}_{k}(x_{\mathrm{new}})\leftarrow Pover^ start_ARG caligraphic_F end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT roman_new end_POSTSUBSCRIPT ) ← italic_P;
18       for xVk1𝑥subscript𝑉𝑘1x\in V_{k-1}italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT do
19             Θ^k(x)Θk1(x)subscript^Θ𝑘𝑥subscriptΘ𝑘1𝑥\hat{\varTheta}_{k}(x)\leftarrow\varTheta_{k-1}(x)over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) ← roman_Θ start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ( italic_x );
20             ^k(x)k1(x)subscript^𝑘𝑥subscript𝑘1𝑥\hat{\mathcal{F}}_{k}(x)\leftarrow\mathcal{F}_{k-1}(x)over^ start_ARG caligraphic_F end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) ← caligraphic_F start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ( italic_x );
21             Compute Fk(x)subscript𝐹𝑘𝑥F_{k}(x)italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) in (3);
22            
23       (Θk,k)𝚅𝚊𝚕𝚞𝚎𝙸𝚝𝚎𝚛𝚊𝚝𝚒𝚘𝚗(Vk,Θ^k,^k,mk)subscriptΘ𝑘subscript𝑘𝚅𝚊𝚕𝚞𝚎𝙸𝚝𝚎𝚛𝚊𝚝𝚒𝚘𝚗subscript𝑉𝑘subscript^Θ𝑘subscript^𝑘subscript𝑚𝑘(\varTheta_{k},\mathcal{F}_{k})\leftarrow{\texttt{ValueIteration}}{(V_{k},\hat% {\varTheta}_{k},\hat{\mathcal{F}}_{k},m_{k})}( roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , caligraphic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ← ValueIteration ( italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , over^ start_ARG caligraphic_F end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT );
24       kk+1𝑘𝑘1k\leftarrow{k+1}italic_k ← italic_k + 1;
25return Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT;
Algorithm 1 Incremental Policy (iPolicy) Algorithm
1 Input: Samples V𝑉Vitalic_V, transformed value function ΘΘ\varThetaroman_Θ, staleness function \mathcal{F}caligraphic_F, recursion allowance m𝑚mitalic_m;
2 def ValueIteration(V,Θ,,m𝑉normal-Θ𝑚V,\varTheta,\mathcal{F},mitalic_V , roman_Θ , caligraphic_F , italic_m):
3       ΔΨ(ϵd),β=1Δformulae-sequenceΔΨitalic-ϵ𝑑𝛽1Δ\Delta\leftarrow\Psi(\epsilon-d),\beta=1-\Deltaroman_Δ ← roman_Ψ ( italic_ϵ - italic_d ) , italic_β = 1 - roman_Δ;
4       S{xV|(x)=P,x𝒳free+d}𝑆conditional-set𝑥𝑉formulae-sequence𝑥𝑃𝑥subscript𝒳free𝑑S\leftarrow\{x\in V|\mathcal{F}(x)=P,x\in\mathcal{X}_{\mathrm{free}}+d\mathcal% {B}\}italic_S ← { italic_x ∈ italic_V | caligraphic_F ( italic_x ) = italic_P , italic_x ∈ caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT + italic_d caligraphic_B };
5       for xS𝑥𝑆x\in Sitalic_x ∈ italic_S do
6             Θ(x)𝙱𝚊𝚌𝚔𝙿𝚛𝚘𝚙(x,m,Θ)Θ𝑥𝙱𝚊𝚌𝚔𝙿𝚛𝚘𝚙𝑥𝑚Θ\varTheta(x)\leftarrow\texttt{BackProp}(x,m,\varTheta)roman_Θ ( italic_x ) ← BackProp ( italic_x , italic_m , roman_Θ );
7             (x)0𝑥0\mathcal{F}(x)\leftarrow 0caligraphic_F ( italic_x ) ← 0;
8            
9      for xVS𝑥𝑉𝑆x\in V\setminus Sitalic_x ∈ italic_V ∖ italic_S do
10             (x)(x)+1𝑥𝑥1\mathcal{F}(x)\leftarrow\mathcal{F}(x)+1caligraphic_F ( italic_x ) ← caligraphic_F ( italic_x ) + 1;
11            
12      return (Θ,)normal-Θ(\varTheta,\mathcal{F})( roman_Θ , caligraphic_F )
Algorithm 2 The ValueIteration procedure
1 Input: State xV𝑥𝑉x\in Vitalic_x ∈ italic_V, recursion allowance m𝑚mitalic_m, transformed value function ΘmsuperscriptΘ𝑚\varTheta^{m}roman_Θ start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT;
2 def BackProp(x𝑥xitalic_x, m𝑚mitalic_m, Θmsuperscriptnormal-Θ𝑚\varTheta^{m}roman_Θ start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT):
3       if x𝒳goal+(Mϵ+d)𝑥subscript𝒳normal-goal𝑀italic-ϵ𝑑x\in\mathcal{X}_{\mathrm{goal}}+(M\epsilon+d)\mathcal{B}italic_x ∈ caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT + ( italic_M italic_ϵ + italic_d ) caligraphic_B or m=0𝑚0m=0italic_m = 0 then
4             return Θm(x)superscriptΘ𝑚𝑥\varTheta^{m}(x)roman_Θ start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( italic_x );
5            
6      for xF(x)superscript𝑥normal-′𝐹𝑥x^{\prime}\in F(x)italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F ( italic_x ) do
7             Θm1(x)𝙱𝚊𝚌𝚔𝙿𝚛𝚘𝚙(x,m1,Θm)superscriptΘ𝑚1𝑥𝙱𝚊𝚌𝚔𝙿𝚛𝚘𝚙superscript𝑥𝑚1superscriptΘ𝑚\varTheta^{m-1}(x)\leftarrow\texttt{BackProp}(x^{\prime},m-1,\varTheta^{m})roman_Θ start_POSTSUPERSCRIPT italic_m - 1 end_POSTSUPERSCRIPT ( italic_x ) ← BackProp ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_m - 1 , roman_Θ start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT );
8            
9      Θm(x)Δ+βminxF(x)Θm1(x)superscriptΘ𝑚𝑥Δ𝛽subscriptsuperscript𝑥𝐹𝑥superscriptΘ𝑚1superscript𝑥\displaystyle\varTheta^{m}(x)\leftarrow\Delta+\beta\min_{x^{\prime}\in F(x)}% \varTheta^{m-1}(x^{\prime})roman_Θ start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( italic_x ) ← roman_Δ + italic_β roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F ( italic_x ) end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT italic_m - 1 end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT );
10       return Θm(x)superscriptΘ𝑚𝑥\varTheta^{m}(x)roman_Θ start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( italic_x )
Algorithm 3 The BackProp procedure

V-A Algorithm statement

We initialize iPolicy with a set of states V0subscript𝑉0V_{0}italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, which we assume it intersects with the goal set 𝒳goalsubscript𝒳goal\mathcal{X}_{\mathrm{goal}}caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT to ensure the availability of boundary condition in value iteration execution. Each sampled state xV0𝑥subscript𝑉0x\in V_{0}italic_x ∈ italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is initialized with its value set by Θ0(x)=0subscriptΘ0𝑥0\varTheta_{0}(x)=0roman_Θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_x ) = 0 and the its staleness by a staleness threshold P0𝑃0P\geq 0italic_P ≥ 0, meaning their values shall be updated at the beginning of the main loop of iPolicy. See lines 1-1 in Algorithm 1.

Then iPolicy enters the main loop in line 1. Similar to exploration, the main loop samples 𝒳newsubscript𝒳new\mathcal{X}_{\mathrm{new}}caligraphic_X start_POSTSUBSCRIPT roman_new end_POSTSUBSCRIPT and add newly sampled state xnewsubscript𝑥newx_{\mathrm{new}}italic_x start_POSTSUBSCRIPT roman_new end_POSTSUBSCRIPT to Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT as line 1. Then the value and staleness of xnewsubscript𝑥newx_{\mathrm{new}}italic_x start_POSTSUBSCRIPT roman_new end_POSTSUBSCRIPT are initialized in the same way as those in the initialization stage, while for sampled states from the last grid Vk1subscript𝑉𝑘1V_{k-1}italic_V start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT, we retain their staleness and values as k1subscript𝑘1\mathcal{F}_{k-1}caligraphic_F start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT and Θk1subscriptΘ𝑘1\varTheta_{k-1}roman_Θ start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT. Since the new sample may incur paths with lower cost, value iterations shall be executed on Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT as line 1 in Algorithm 1 and Algorithm 2 to obtain a better estimate of the minimal traveling time 𝕋ksubscript𝕋𝑘\mathds{T}_{k}blackboard_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. The conventional value iteration is executed in a synchronous fashion, where the value at every sampled state on Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is updated in every iteration, and its computations grow quickly as more and more samples are added to Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. We employ the asynchronous value iteration to mitigate the computational complexity. Particularly, the value at x𝑥xitalic_x is updated when two circumstances occur: first, a sampled state is newly added to Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, since its initial value may not be accurate; second, the value of a sampled state is too stale to reflect the true estimate of the minimal traveling time. Recall the staleness function k:Vk0:subscript𝑘subscript𝑉𝑘subscriptabsent0\mathcal{F}_{k}:V_{k}\to\mathbb{N}_{\geq 0}caligraphic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT : italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT → blackboard_N start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT represents the number of iterations since the last update of a sample xVk𝑥subscript𝑉𝑘x\in V_{k}italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Let ^ksubscript^𝑘\hat{\mathcal{F}}_{k}over^ start_ARG caligraphic_F end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT be the staleness function before the update at iteration k𝑘kitalic_k. Then ^k(x)=k1(x),xVk1formulae-sequencesubscript^𝑘𝑥subscript𝑘1𝑥for-all𝑥subscript𝑉𝑘1\hat{\mathcal{F}}_{k}(x)=\mathcal{F}_{k-1}(x),\forall x\in V_{k-1}over^ start_ARG caligraphic_F end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) = caligraphic_F start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ( italic_x ) , ∀ italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT and for xVkVk1𝑥subscript𝑉𝑘subscript𝑉𝑘1x\in V_{k}\setminus V_{k-1}italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_V start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT, we have ^k(x)=Psubscript^𝑘𝑥𝑃\hat{\mathcal{F}}_{k}(x)=Pover^ start_ARG caligraphic_F end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) = italic_P. The staleness threshold P0𝑃subscriptabsent0P\in\mathbb{N}_{\geq 0}italic_P ∈ blackboard_N start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT is the maximum number of value iterations a sampled state can skip. When P=0𝑃0P=0italic_P = 0, the asynchronous update is identical to the synchronous update, while when P𝑃Pitalic_P is positive infinity, each sample will only be updated once. In Algorithm 2 ValueItertion, the set of stale samples Sk{xVk|^k(x)=P,x𝒳free+dk}subscript𝑆𝑘conditional-set𝑥subscript𝑉𝑘formulae-sequencesubscript^𝑘𝑥𝑃𝑥subscript𝒳freesubscript𝑑𝑘S_{k}\triangleq\{x\in V_{k}|\hat{\mathcal{F}}_{k}(x)=P,x\in\mathcal{X}_{% \mathrm{free}}+d_{k}\mathcal{B}\}italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≜ { italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | over^ start_ARG caligraphic_F end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) = italic_P , italic_x ∈ caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B } are updated using Algorithm 3 BackProp that recursively searches for the minimum values from one’s neighbors. After updates, the staleness for the samples in Sksubscript𝑆𝑘S_{k}italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT will be cleared as line 2 in Algorithm 2 as they have been freshly updated while staleness of other samples VkSksubscript𝑉𝑘subscript𝑆𝑘V_{k}\setminus S_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT would be increased by 1111 as line 2.

Algorithm 3 BackProp leverages the Bellman equation to recursively update values and partially solve for the estimate of minimal traveling time on Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. For simplicity, we drop the dependency on k𝑘kitalic_k in the psuedo codes of BackProp. The vanilla update procedure of value iteration is summarized as the Bellman operator 𝒯ksubscript𝒯𝑘\mathcal{T}_{k}caligraphic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT:

(𝒯k𝕋k)(x){ϵkdk+minxFk(x)𝕋k(x),if xVk(𝒳goal+(Mϵk+dk));𝕋k(x),otherwise,subscript𝒯𝑘subscript𝕋𝑘𝑥cases𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒subscriptitalic-ϵ𝑘subscript𝑑𝑘subscriptsuperscript𝑥subscript𝐹𝑘𝑥subscript𝕋𝑘superscript𝑥𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒if 𝑥subscript𝑉𝑘subscript𝒳goal𝑀subscriptitalic-ϵ𝑘subscript𝑑𝑘𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒subscript𝕋𝑘𝑥otherwise(\mathcal{T}_{k}\circ\mathds{T}_{k})(x)\triangleq\begin{cases}&\displaystyle% \epsilon_{k}-d_{k}+\min_{x^{\prime}\in F_{k}(x)}\mathds{T}_{k}(x^{\prime}),\\ &\quad\text{if }x\in V_{k}\setminus(\mathcal{X}_{\mathrm{goal}}+(M\epsilon_{k}% +d_{k})\mathcal{B});\\ &\mathds{T}_{k}(x),\text{otherwise},\end{cases}( caligraphic_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∘ blackboard_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ( italic_x ) ≜ { start_ROW start_CELL end_CELL start_CELL italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT blackboard_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) , end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL if italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ ( caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT + ( italic_M italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ) ; end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL blackboard_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) , otherwise , end_CELL end_ROW (2)

where

Fk(x)subscript𝐹𝑘𝑥absent\displaystyle F_{k}(x)\triangleqitalic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) ≜ (x+ϵku𝒰f(x,u)+ρ(dk,ϵk))𝑥subscriptitalic-ϵ𝑘subscript𝑢𝒰𝑓𝑥𝑢𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘\displaystyle(x+\epsilon_{k}\bigcup_{u\in\mathcal{U}}f(x,u)+\rho(d_{k},% \epsilon_{k})\mathcal{B})( italic_x + italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ⋃ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT italic_f ( italic_x , italic_u ) + italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ) (3)
(𝒳free+dk)Vksubscript𝒳freesubscript𝑑𝑘subscript𝑉𝑘\displaystyle\cap(\mathcal{X}_{\mathrm{free}}+d_{k}\mathcal{B})\cap V_{k}∩ ( caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT

is the set of one-hop neighbors of x𝑥xitalic_x using Euler discretiztion and ρ(d,k)2dk+lϵk(dk+Mϵk)𝜌𝑑𝑘2subscript𝑑𝑘𝑙subscriptitalic-ϵ𝑘subscript𝑑𝑘𝑀subscriptitalic-ϵ𝑘\rho(d,k)\triangleq 2d_{k}+l\epsilon_{k}(d_{k}+M\epsilon_{k})italic_ρ ( italic_d , italic_k ) ≜ 2 italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_l italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_M italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) is the perturbation radius inherited from [15]. See Figure 1 for the visualization of the discretization, where the robot transits from x𝑥xitalic_x to xsuperscript𝑥x^{\prime}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT after applying a constant control u𝑢uitalic_u for a fixed time ϵksubscriptitalic-ϵ𝑘\epsilon_{k}italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, and we treat all sampled states (orange dots) in the circle of radius ρ𝜌\rhoitalic_ρ centered at xsuperscript𝑥x^{\prime}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT as the one-hop neighbors of x𝑥xitalic_x, as xsuperscript𝑥x^{\prime}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is not necessarily in Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. With the Kruzhkov transform, one may rewrite the update procedure (2) and define a tranformed Bellman operator as:

(𝔗kΘk)(x){Δk+βkminxFk(x)Θk(x),if xVk(𝒳goal+(Mϵk+dk));Θk(x), otherwise,subscript𝔗𝑘subscriptΘ𝑘𝑥cases𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒subscriptΔ𝑘subscript𝛽𝑘subscriptsuperscript𝑥subscript𝐹𝑘𝑥subscriptΘ𝑘superscript𝑥𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒if 𝑥subscript𝑉𝑘subscript𝒳goal𝑀subscriptitalic-ϵ𝑘subscript𝑑𝑘𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒subscriptΘ𝑘𝑥 otherwise\displaystyle(\mathfrak{T}_{k}\circ\varTheta_{k})(x)\triangleq\begin{cases}&% \displaystyle\Delta_{k}+\beta_{k}\min_{x^{\prime}\in F_{k}(x)}\varTheta_{k}(x^% {\prime}),\\ &\quad\text{if }x\in V_{k}\setminus(\mathcal{X}_{\mathrm{goal}}+(M\epsilon_{k}% +d_{k})\mathcal{B});\\ &\varTheta_{k}(x),\text{ otherwise},\end{cases}( fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∘ roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ( italic_x ) ≜ { start_ROW start_CELL end_CELL start_CELL roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) , end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL if italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ ( caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT + ( italic_M italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ) ; end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) , otherwise , end_CELL end_ROW

where ΔkΨ(ϵkdk)subscriptΔ𝑘Ψsubscriptitalic-ϵ𝑘subscript𝑑𝑘\Delta_{k}\triangleq\Psi(\epsilon_{k}-d_{k})roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≜ roman_Ψ ( italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) is the transformed running cost, βk1Δksubscript𝛽𝑘1subscriptΔ𝑘\beta_{k}\triangleq 1-\Delta_{k}italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≜ 1 - roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is the total discount factor in the transformed form and 𝔗ksubscript𝔗𝑘\mathfrak{T}_{k}fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is the transformed Bellman operator. For each xS𝑥𝑆x\in Sitalic_x ∈ italic_S, BackProp computes Θmk(x)superscriptΘsubscript𝑚𝑘𝑥\varTheta^{m_{k}}(x)roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x ) in a recursive manner. See Figure 1 again for the visualization of BackProp. Algorithm 3 BackProp first checks whether the goal region 𝒳goalsubscript𝒳goal\mathcal{X}_{\mathrm{goal}}caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT is reached or the recursion allowance mk=0subscript𝑚𝑘0m_{k}=0italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 0 runs out as line 3, which are the boundary condition of Bellman equation and the limit of computations for each iteration, respectively. If neither is satisfied, BackProp will check and compute Θmk1(x)superscriptΘsubscript𝑚𝑘1superscript𝑥\varTheta^{m_{k}-1}(x^{\prime})roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) for each neighbor xFk(x)superscript𝑥subscript𝐹𝑘𝑥x^{\prime}\in F_{k}(x)italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) by calling itself as line 3. This will temporarily halt the exeuction of current BackProp (recursion allowance mksubscript𝑚𝑘m_{k}italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT) with all values saved, while processor will create a duplicate of BackProp with a reduced recursion allowance mk1subscript𝑚𝑘1m_{k}-1italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - 1 and continues executing the new BackProp When the new BackProp finishes according to the boundary conditions in line 3, the processor will exit it and resume the execution of the old BackProp with recursion allowance mksubscript𝑚𝑘m_{k}italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT with its previously saved values and the returned value at a one-hop neighbor Θmk1(x)superscriptΘsubscript𝑚𝑘1superscript𝑥\varTheta^{m_{k}-1}(x^{\prime})roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) from the new BackProp. With all values at one-hop neighbors being ready, BackProp computes Θm(x)superscriptΘ𝑚𝑥\varTheta^{m}(x)roman_Θ start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( italic_x ) using the Bellman operator as line 3. In other words, a recursion with maximum allowance mksubscript𝑚𝑘m_{k}italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT starting from x𝑥xitalic_x updates the value at x𝑥xitalic_x by propogating values at neighbors of x𝑥xitalic_x within mksubscript𝑚𝑘m_{k}italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT hops. This limits the search to a subset of Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT to save computations. When mk|Vk|subscript𝑚𝑘subscript𝑉𝑘m_{k}\geq|V_{k}|italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≥ | italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT |, BackProp updates all sampled states on Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT in the depth-first search manner.

Refer to caption
Figure 1: Illustration of the approximation of dynamic and BackProp. Orange dots are sampled states, orange crosses are resting states when applying constant control u𝑢uitalic_u for time ϵitalic-ϵ\epsilonitalic_ϵ and blue arrows imply discrete time transition.

The consistency of iPolicy follows the discretization scheme in [15], where convergence rate conditions are imposed for the temporal and spatial resolutions. Specifically, let dksubscript𝑑𝑘d_{k}italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT be a conservative estimate of the spatial dispersion, ϵksubscriptitalic-ϵ𝑘\epsilon_{k}italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT be the associated temporal resolution and ρ(ϵk,dk)𝜌subscriptitalic-ϵ𝑘subscript𝑑𝑘\rho(\epsilon_{k},d_{k})italic_ρ ( italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) be the perturbation radius. Then dksubscript𝑑𝑘d_{k}italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is a distance within which every point x𝒳𝑥𝒳x\in\mathcal{X}italic_x ∈ caligraphic_X can find a sampled state in Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and ϵksubscriptitalic-ϵ𝑘\epsilon_{k}italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT would be the minimal time elapse system (1) can transit. Since system (1) may not fall on a sample in Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT if it takes a constant control for ϵksubscriptitalic-ϵ𝑘\epsilon_{k}italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT time units, the discrepancy of approximation of the dynamic system is accommodated by the perturbation ρ(dk,ϵk)𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘\rho(d_{k},\epsilon_{k})italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) so that the one-hop neighbors of state x𝑥xitalic_x with pertubation would surely fall on Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT; i.e., (x+ϵkf(x,u)+ρ(dk,ϵk))Vk𝑥subscriptitalic-ϵ𝑘𝑓𝑥𝑢𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘subscript𝑉𝑘(x+\epsilon_{k}f(x,u)+\rho(d_{k},\epsilon_{k})\mathcal{B})\cap V_{k}\neq\emptyset( italic_x + italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_f ( italic_x , italic_u ) + italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≠ ∅. The choice of dksubscript𝑑𝑘d_{k}italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and ϵksubscriptitalic-ϵ𝑘\epsilon_{k}italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is not unique but is subject to the following assumption for the purpose of consistent approximation.

Assumption V.1.

The selection of dksubscript𝑑𝑘d_{k}italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and ϵksubscriptitalic-ϵ𝑘\epsilon_{k}italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT satsifies the following relations:

  1. (A5)

    dkB(log|Vk||Vk|)1/n,k0formulae-sequencesubscript𝑑𝑘𝐵superscriptsubscript𝑉𝑘subscript𝑉𝑘1𝑛for-all𝑘subscriptabsent0d_{k}\geq B\Big{(}\frac{\log|V_{k}|}{|V_{k}|}\Big{)}^{1/n},\forall k\in\mathbb% {N}_{\geq 0}italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≥ italic_B ( divide start_ARG roman_log | italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | end_ARG start_ARG | italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | end_ARG ) start_POSTSUPERSCRIPT 1 / italic_n end_POSTSUPERSCRIPT , ∀ italic_k ∈ blackboard_N start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT, where B>(μ(𝒳)/Cn)1/n𝐵superscript𝜇𝒳subscript𝐶𝑛1𝑛B>(\mu(\mathcal{X})/C_{n})^{1/n}italic_B > ( italic_μ ( caligraphic_X ) / italic_C start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 1 / italic_n end_POSTSUPERSCRIPT;

  2. (A6)

    ϵksubscriptitalic-ϵ𝑘\epsilon_{k}italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and dk/ϵksubscript𝑑𝑘subscriptitalic-ϵ𝑘d_{k}/\epsilon_{k}italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT / italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT monotonically decreases to 00 as k+𝑘k\to+\inftyitalic_k → + ∞;

  3. (A7)

    ϵk>dk,k0formulae-sequencesubscriptitalic-ϵ𝑘subscript𝑑𝑘for-all𝑘subscriptabsent0\epsilon_{k}>d_{k},\forall k\in\mathbb{N}_{\geq 0}italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT > italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , ∀ italic_k ∈ blackboard_N start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT;

  4. (A8)

    ρ(dk,ϵk)dk1,kformulae-sequence𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘subscript𝑑𝑘1for-all𝑘\rho(d_{k},\epsilon_{k})\geq d_{k-1},\forall k\in\mathbb{N}italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ≥ italic_d start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT , ∀ italic_k ∈ blackboard_N.

Assumption (A5) follows from [5] and implies the lower bound of the spatial resolution that the union of balls centered at every sample xVk𝑥subscript𝑉𝑘x\in V_{k}italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT with radius dksubscript𝑑𝑘d_{k}italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT covers the whole region 𝒳𝒳\mathcal{X}caligraphic_X. Assumption (A6) characterizes the convergence rate for temporal and spatial resolutions. Notice that the perturbation radius ρ(dk,ϵk)𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘\rho(d_{k},\epsilon_{k})italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) is dimishing faster than ϵksubscriptitalic-ϵ𝑘\epsilon_{k}italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT; this ensures the discrete time transition always falls on the graph for sufficiently small temporal-spatial resolutions. In Assumption (A6), the faster convergence of dksubscript𝑑𝑘d_{k}italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT compared to ϵksubscriptitalic-ϵ𝑘\epsilon_{k}italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT indicates the discrete time transition on Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT would be close to the discrete time transition on 𝒳𝒳\mathcal{X}caligraphic_X for sufficiently small ϵksubscriptitalic-ϵ𝑘\epsilon_{k}italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT; and ϵk0subscriptitalic-ϵ𝑘0\epsilon_{k}\to 0italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT → 0 finalizes the consistency of approximation. Assumptions (A7) and (A8) are purely for the validity of computations and analysis of iPolicy. Assumption (A7) trivially holds for sufficiently large k𝑘kitalic_k by Assumption (A6) while assumption (A8) holds when dksubscript𝑑𝑘d_{k}italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT takes the lower bound in Assumption (A5).

V-B Performance guarantee

The consistency of the estimates requires proper selection of temporal resolution ϵksubscriptitalic-ϵ𝑘\epsilon_{k}italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and sufficient value iterations mksubscript𝑚𝑘{m_{k}}italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT executed on Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. The following assumption is adapted from Assumption IV.1 in [35] and it imposes the minimum required value iterations {mk}subscript𝑚𝑘\{m_{k}\}{ italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } executed on each graph with convergence guarantee.

Assumption V.2.

There exists a graph index k0subscript𝑘0k_{0}italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, an upper bound of intervals T0𝑇0T\geq 0italic_T ≥ 0 and an upper bound of accumulated discount β¯(0,1)normal-¯𝛽01\bar{\beta}\in(0,1)over¯ start_ARG italic_β end_ARG ∈ ( 0 , 1 ) such that k¯k0for-allnormal-¯𝑘subscript𝑘0\forall\bar{k}\geq k_{0}∀ over¯ start_ARG italic_k end_ARG ≥ italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, the total numbers of value iterations on each graph {mk}subscript𝑚𝑘\{m_{k}\}{ italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } satisfy

t=0Tmax{0,,P}βk˘(t,)mk˘(t,)β¯<1,superscriptsubscriptproduct𝑡0𝑇subscript0𝑃superscriptsubscript𝛽˘𝑘𝑡subscript𝑚˘𝑘𝑡¯𝛽1\displaystyle\prod_{t=0}^{T}\max_{\ell\in\{0,\dots,P\}}\beta_{\breve{k}(t,\ell% )}^{m_{\breve{k}(t,\ell)}}\leq\bar{\beta}<1,∏ start_POSTSUBSCRIPT italic_t = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_max start_POSTSUBSCRIPT roman_ℓ ∈ { 0 , … , italic_P } end_POSTSUBSCRIPT italic_β start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( italic_t , roman_ℓ ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( italic_t , roman_ℓ ) end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ≤ over¯ start_ARG italic_β end_ARG < 1 ,

where the graph index is denoted by k˘(t,)k¯+t(P+1)+normal-≜normal-˘𝑘𝑡normal-ℓnormal-¯𝑘𝑡𝑃1normal-ℓ\breve{k}(t,\ell)\triangleq\bar{k}+t(P+1)+\ellover˘ start_ARG italic_k end_ARG ( italic_t , roman_ℓ ) ≜ over¯ start_ARG italic_k end_ARG + italic_t ( italic_P + 1 ) + roman_ℓ.

In particular, Assumption V.2 partitions the graph indices into intervals of equal length P+1𝑃1P+1italic_P + 1, and the aggregation of the worst-case discount on each interval k˘[k¯+t(P+1),k¯+t(P+1)+P]˘𝑘¯𝑘𝑡𝑃1¯𝑘𝑡𝑃1𝑃\breve{k}\in[\bar{k}+t(P+1),\bar{k}+t(P+1)+P]over˘ start_ARG italic_k end_ARG ∈ [ over¯ start_ARG italic_k end_ARG + italic_t ( italic_P + 1 ) , over¯ start_ARG italic_k end_ARG + italic_t ( italic_P + 1 ) + italic_P ] over T+1𝑇1T+1italic_T + 1 consecutive intervals should be strictly lower than 1111. This implies the distance between the estimated value function ΘkmksuperscriptsubscriptΘ𝑘subscript𝑚𝑘\varTheta_{k}^{m_{k}}roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT and the fixed point Θk*superscriptsubscriptΘ𝑘\varTheta_{k}^{*}roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is strictly decreasing in a window of (T+1)(P+1)𝑇1𝑃1(T+1)(P+1)( italic_T + 1 ) ( italic_P + 1 ) graphs.

The main result is summarized below, where the last estimate of minimal traveling time ΘkmksubscriptsuperscriptΘsubscript𝑚𝑘𝑘\varTheta^{m_{k}}_{k}roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT converges to the ground truth pointwise with probability one.

Theorem V.1.

Suppose Assumptions IV.1, V.1 and V.2 hold. Then the following holds with probability one:

limk+minx(x+dk)VkΘkmk=Θ*(x),x𝒳free𝒵,formulae-sequencesubscript𝑘subscriptsuperscript𝑥𝑥subscript𝑑𝑘subscript𝑉𝑘subscriptsuperscriptΘsubscript𝑚𝑘𝑘superscriptΘ𝑥for-all𝑥subscript𝒳free𝒵\displaystyle\lim_{k\to+\infty}\min_{x^{\prime}\in(x+d_{k}\mathcal{B})\cap V_{% k}}\varTheta^{m_{k}}_{k}=\varTheta^{*}(x),\forall x\in\mathcal{X}_{\mathrm{% free}}\setminus\mathcal{Z},roman_lim start_POSTSUBSCRIPT italic_k → + ∞ end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) , ∀ italic_x ∈ caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT ∖ caligraphic_Z ,

where 𝒵𝒵\mathcal{Z}caligraphic_Z is a set of sufficiently small measure.

VI Analysis

In this section, we analyze the performance of iPolicy. Towards the proof of Theorem V.1, we proceed with the following result with a supplementary definition.

Theorem VI.1.

Suppose Assumptions IV.1, V.1 and V.2 hold and let Θk*(x)=Θkm(x)=1,x𝒳(𝒳free+dk),k1formulae-sequencesuperscriptsubscriptnormal-Θ𝑘𝑥superscriptsubscriptnormal-Θ𝑘𝑚𝑥1formulae-sequencefor-all𝑥𝒳subscript𝒳normal-freesubscript𝑑𝑘𝑘1\varTheta_{k}^{*}(x)=\varTheta_{k}^{m}(x)=1,\forall x\in\mathcal{X}\setminus(% \mathcal{X}_{\mathrm{free}}+d_{k}\mathcal{B}),k\geq 1roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) = roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( italic_x ) = 1 , ∀ italic_x ∈ caligraphic_X ∖ ( caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) , italic_k ≥ 1 and 0mmk0𝑚subscript𝑚𝑘0\leq m\leq m_{k}0 ≤ italic_m ≤ italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Then it holds with probability one:

limk+minx(x+dk)VkΘkmk=Θ*(x),x𝒳𝒵,formulae-sequencesubscript𝑘subscriptsuperscript𝑥𝑥subscript𝑑𝑘subscript𝑉𝑘subscriptsuperscriptΘsubscript𝑚𝑘𝑘superscriptΘ𝑥for-all𝑥𝒳𝒵\displaystyle\lim_{k\to+\infty}\min_{x^{\prime}\in(x+d_{k}\mathcal{B})\cap V_{% k}}\varTheta^{m_{k}}_{k}=\varTheta^{*}(x),\forall x\in\mathcal{X}\setminus% \mathcal{Z},roman_lim start_POSTSUBSCRIPT italic_k → + ∞ end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) , ∀ italic_x ∈ caligraphic_X ∖ caligraphic_Z ,

where 𝒵𝒵\mathcal{Z}caligraphic_Z is a set of sufficiently small measure.

Remark VI.1.

Theorem VI.1 extends Theorem V.1 to the whole state space 𝒳𝒳\mathcal{X}caligraphic_X by manually setting Θkmk(x)superscriptsubscriptnormal-Θ𝑘subscript𝑚𝑘𝑥\varTheta_{k}^{m_{k}}(x)roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_x ) as 1111 for every x𝒳obs𝑥subscript𝒳normal-obsx\in\mathcal{X}_{\mathrm{obs}}italic_x ∈ caligraphic_X start_POSTSUBSCRIPT roman_obs end_POSTSUBSCRIPT, indicating there is no feasible path connecting the goal set 𝒳goalsubscript𝒳normal-goal\mathcal{X}_{\mathrm{goal}}caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT. This practice consolidates the obstacle regions 𝒳obssubscript𝒳normal-obs\mathcal{X}_{\mathrm{obs}}caligraphic_X start_POSTSUBSCRIPT roman_obs end_POSTSUBSCRIPT where Θkmsubscriptsuperscriptnormal-Θ𝑚𝑘\varTheta^{m}_{k}roman_Θ start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is only defined with the free regions 𝒳freesubscript𝒳normal-free\mathcal{X}_{\mathrm{free}}caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT and simplifies analysis by reducing the total number of cases considered in the analysis.

The following lemma shows the supplementary definition in Theorem VI.1 is not involved in any part in the execution of the proposed algorithms.

Lemma VI.1.

For any k0𝑘0k\geq 0italic_k ≥ 0 and x𝒳(𝒳free+dk)𝑥𝒳subscript𝒳normal-freesubscript𝑑𝑘x\in\mathcal{X}\setminus(\mathcal{X}_{\mathrm{free}}+d_{k}\mathcal{B})italic_x ∈ caligraphic_X ∖ ( caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ), it holds that xSκ𝑥subscript𝑆𝜅x\notin S_{\kappa}italic_x ∉ italic_S start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT and xFκ(x)𝑥subscript𝐹𝜅superscript𝑥normal-′x\notin F_{\kappa}(x^{\prime})italic_x ∉ italic_F start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ), κkfor-all𝜅𝑘\forall\kappa\geq k∀ italic_κ ≥ italic_k and xVk{x}superscript𝑥normal-′subscript𝑉𝑘𝑥x^{\prime}\in V_{k}\setminus\{x\}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ { italic_x }.

Proof.

It follows from Assumption V.1 that dksubscript𝑑𝑘d_{k}italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is monotonically decreasing. Then for every x𝒳(𝒳free+dk)𝑥𝒳subscript𝒳freesubscript𝑑𝑘x\in\mathcal{X}\setminus(\mathcal{X}_{\mathrm{free}}+d_{k}\mathcal{B})italic_x ∈ caligraphic_X ∖ ( caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ), x𝒳free+dκ,κkformulae-sequence𝑥subscript𝒳freesubscript𝑑𝜅for-all𝜅𝑘x\notin\mathcal{X}_{\mathrm{free}}+d_{\kappa}\mathcal{B},\forall\kappa\geq kitalic_x ∉ caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT caligraphic_B , ∀ italic_κ ≥ italic_k. This implies x𝑥xitalic_x will never be included in Sκ,κksubscript𝑆𝜅for-all𝜅𝑘S_{\kappa},\forall\kappa\geq kitalic_S start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT , ∀ italic_κ ≥ italic_k per line 2 in Algorithm 2. It follows from the definition of Fk(x)subscript𝐹𝑘𝑥F_{k}(x)italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) in (3) that xFκ(x),xVκformulae-sequence𝑥subscript𝐹𝜅superscript𝑥for-allsuperscript𝑥subscript𝑉𝜅x\notin F_{\kappa}(x^{\prime}),\forall x^{\prime}\in V_{\kappa}italic_x ∉ italic_F start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) , ∀ italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_V start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT and κk𝜅𝑘\kappa\geq kitalic_κ ≥ italic_k. This completes the proof. ∎

Lemma VI.1 shows that the supplementary definition of ΘkmsuperscriptsubscriptΘ𝑘𝑚\varTheta_{k}^{m}roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT and Θk*superscriptsubscriptΘ𝑘\varTheta_{k}^{*}roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT over 𝒳(𝒳free+dk)𝒳subscript𝒳freesubscript𝑑𝑘\mathcal{X}\setminus(\mathcal{X}_{\mathrm{free}}+d_{k}\mathcal{B})caligraphic_X ∖ ( caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) will not affect computations of ΘkmsuperscriptsubscriptΘ𝑘𝑚\varTheta_{k}^{m}roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT and Θk*superscriptsubscriptΘ𝑘\varTheta_{k}^{*}roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT over the free region, and Theorem VI.1 is identical to Theorem V.1 over 𝒳free+dksubscript𝒳freesubscript𝑑𝑘\mathcal{X}_{\mathrm{free}}+d_{k}\mathcal{B}caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B. Throughout the analysis in this section, we assume all conditions in Theorem VI.1 hold.

VI-A Preliminaries

In this subsection, we introduce a couple of preliminary results that facilicate the proof of our main theorem. The following lemma is borrowed from Lemma VII.4 in [35] and is listed below for the completeness of the paper.

Lemma VI.2 (Lemma VII.4 in [35]).

Given 𝒳1,𝒳2𝒳subscript𝒳1subscript𝒳2𝒳\mathcal{X}_{1},\mathcal{X}_{2}\subseteq\mathcal{X}caligraphic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , caligraphic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⊆ caligraphic_X, consider Vi:𝒳[0,1]normal-:subscript𝑉𝑖normal-→𝒳01V_{i}:\mathcal{X}\to[0,1]italic_V start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : caligraphic_X → [ 0 , 1 ] and ηi>0subscript𝜂𝑖0\eta_{i}>0italic_η start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > 0 s.t. (x+ηi)𝒳i,x𝒳formulae-sequence𝑥subscript𝜂𝑖subscript𝒳𝑖for-all𝑥𝒳(x+\eta_{i}\mathcal{B})\cap\mathcal{X}_{i}\neq\emptyset,\forall x\in\mathcal{X}( italic_x + italic_η start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT caligraphic_B ) ∩ caligraphic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≠ ∅ , ∀ italic_x ∈ caligraphic_X, where i{1,2}𝑖12i\in\{1,2\}italic_i ∈ { 1 , 2 }. Then for any set-valued map Y:𝒳𝒳normal-:𝑌normal-⇉𝒳𝒳Y:\mathcal{X}\rightrightarrows\mathcal{X}italic_Y : caligraphic_X ⇉ caligraphic_X s.t. Y(x),x𝒳formulae-sequence𝑌𝑥for-all𝑥𝒳Y(x)\neq\emptyset,\forall x\in\mathcal{X}italic_Y ( italic_x ) ≠ ∅ , ∀ italic_x ∈ caligraphic_X, we have

minx~(Y(x)+η1)𝒳1V1(x~)minx~(Y(x)+η2)𝒳2V2(x~)𝒳subscriptnormsubscript~𝑥𝑌𝑥subscript𝜂1subscript𝒳1subscript𝑉1~𝑥subscript~𝑥𝑌𝑥subscript𝜂2subscript𝒳2subscript𝑉2~𝑥𝒳\displaystyle\|\min_{\tilde{x}\in(Y(x)+\eta_{1}\mathcal{B})\cap\mathcal{X}_{1}% }V_{1}(\tilde{x})-\min_{\tilde{x}\in(Y(x)+\eta_{2}\mathcal{B})\cap\mathcal{X}_% {2}}V_{2}(\tilde{x})\|_{\mathcal{X}}∥ roman_min start_POSTSUBSCRIPT over~ start_ARG italic_x end_ARG ∈ ( italic_Y ( italic_x ) + italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT caligraphic_B ) ∩ caligraphic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( over~ start_ARG italic_x end_ARG ) - roman_min start_POSTSUBSCRIPT over~ start_ARG italic_x end_ARG ∈ ( italic_Y ( italic_x ) + italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT caligraphic_B ) ∩ caligraphic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( over~ start_ARG italic_x end_ARG ) ∥ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT (4)
\displaystyle\leq minx~(x+η1)𝒳1V1(x~)minx~(x+η2)𝒳2V2(x~)𝒳subscriptnormsubscript~𝑥𝑥subscript𝜂1subscript𝒳1subscript𝑉1~𝑥subscript~𝑥𝑥subscript𝜂2subscript𝒳2subscript𝑉2~𝑥𝒳\displaystyle\|\min_{\tilde{x}\in(x+\eta_{1}\mathcal{B})\cap\mathcal{X}_{1}}V_% {1}(\tilde{x})-\min_{\tilde{x}\in(x+\eta_{2}\mathcal{B})\cap\mathcal{X}_{2}}V_% {2}(\tilde{x})\|_{\mathcal{X}}∥ roman_min start_POSTSUBSCRIPT over~ start_ARG italic_x end_ARG ∈ ( italic_x + italic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT caligraphic_B ) ∩ caligraphic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( over~ start_ARG italic_x end_ARG ) - roman_min start_POSTSUBSCRIPT over~ start_ARG italic_x end_ARG ∈ ( italic_x + italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT caligraphic_B ) ∩ caligraphic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( over~ start_ARG italic_x end_ARG ) ∥ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT

If 𝒳1=𝒳2𝒳¯subscript𝒳1subscript𝒳2normal-≜normal-¯𝒳\mathcal{X}_{1}=\mathcal{X}_{2}\triangleq\bar{\mathcal{X}}caligraphic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = caligraphic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≜ over¯ start_ARG caligraphic_X end_ARG and η1=η2ηsubscript𝜂1subscript𝜂2normal-≜𝜂\eta_{1}=\eta_{2}\triangleq\etaitalic_η start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_η start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≜ italic_η, then

minx~(x+η)𝒳¯V1(x~)minx~(x+η)𝒳¯V2(x~)𝒳V1V2𝒳¯.subscriptnormsubscript~𝑥𝑥𝜂¯𝒳subscript𝑉1~𝑥subscript~𝑥𝑥𝜂¯𝒳subscript𝑉2~𝑥𝒳subscriptnormsubscript𝑉1subscript𝑉2¯𝒳\displaystyle\|\min_{\tilde{x}\in(x+\eta\mathcal{B})\cap\bar{\mathcal{X}}}V_{1% }(\tilde{x})-\min_{\tilde{x}\in(x+\eta\mathcal{B})\cap\bar{\mathcal{X}}}V_{2}(% \tilde{x})\|_{\mathcal{X}}\leq\|V_{1}-V_{2}\|_{\bar{\mathcal{X}}}.∥ roman_min start_POSTSUBSCRIPT over~ start_ARG italic_x end_ARG ∈ ( italic_x + italic_η caligraphic_B ) ∩ over¯ start_ARG caligraphic_X end_ARG end_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( over~ start_ARG italic_x end_ARG ) - roman_min start_POSTSUBSCRIPT over~ start_ARG italic_x end_ARG ∈ ( italic_x + italic_η caligraphic_B ) ∩ over¯ start_ARG caligraphic_X end_ARG end_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( over~ start_ARG italic_x end_ARG ) ∥ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ≤ ∥ italic_V start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_V start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT over¯ start_ARG caligraphic_X end_ARG end_POSTSUBSCRIPT . (5)

The following lemma shows that the maximum over a finite horizon of a convergent sequence is also convergent.

Lemma VI.3.

Given a sequence {ak}subscript𝑎𝑘\{a_{k}\}{ italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } s.t. limk+ak=0subscriptnormal-→𝑘subscript𝑎𝑘0\lim_{k\to+\infty}a_{k}=0roman_lim start_POSTSUBSCRIPT italic_k → + ∞ end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 0, consider a new sequence {bk}subscript𝑏𝑘\{b_{k}\}{ italic_b start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } s.t. bk=maxak+subscript𝑏𝑘subscriptnormal-ℓsubscript𝑎𝑘normal-ℓb_{k}=\max_{\ell\in\mathcal{I}}a_{k+\ell}italic_b start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = roman_max start_POSTSUBSCRIPT roman_ℓ ∈ caligraphic_I end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k + roman_ℓ end_POSTSUBSCRIPT, where 0subscriptabsent0\mathcal{I}\subseteq\mathbb{N}_{\geq 0}caligraphic_I ⊆ blackboard_N start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT is finite set of integers. Then limk+bk=0subscriptnormal-→𝑘subscript𝑏𝑘0\lim_{k\to+\infty}b_{k}=0roman_lim start_POSTSUBSCRIPT italic_k → + ∞ end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 0.

Proof.

Since limk+ak=0subscript𝑘subscript𝑎𝑘0\lim_{k\to+\infty}a_{k}=0roman_lim start_POSTSUBSCRIPT italic_k → + ∞ end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 0, then ϵ>0for-allitalic-ϵ0\forall\epsilon>0∀ italic_ϵ > 0, K>0𝐾0\exists K>0∃ italic_K > 0 s.t. kKfor-all𝑘𝐾\forall k\geq K∀ italic_k ≥ italic_K, |ak|ϵsubscript𝑎𝑘italic-ϵ|a_{k}|\leq\epsilon| italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | ≤ italic_ϵ. Then it holds that |bk|=|maxak+|max|ak+|ϵsubscript𝑏𝑘subscriptsubscript𝑎𝑘subscriptsubscript𝑎𝑘italic-ϵ|b_{k}|=|\max_{\ell\in\mathcal{I}}a_{k+\ell}|\leq\max_{\ell\in\mathcal{I}}|a_{% k+\ell}|\leq\epsilon| italic_b start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | = | roman_max start_POSTSUBSCRIPT roman_ℓ ∈ caligraphic_I end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT italic_k + roman_ℓ end_POSTSUBSCRIPT | ≤ roman_max start_POSTSUBSCRIPT roman_ℓ ∈ caligraphic_I end_POSTSUBSCRIPT | italic_a start_POSTSUBSCRIPT italic_k + roman_ℓ end_POSTSUBSCRIPT | ≤ italic_ϵ. This implies limk+bk=0subscript𝑘subscript𝑏𝑘0\lim_{k\to+\infty}b_{k}=0roman_lim start_POSTSUBSCRIPT italic_k → + ∞ end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 0. ∎

The following lemma from Lemma 7.2 in [38] characterizes the upper bound of the spatial resolution dksubscript𝑑𝑘d_{k}italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT.

Lemma VI.4.

Consider an estimate of spatial dispersion dB(log|V||V|)1/n𝑑𝐵superscript𝑉𝑉1𝑛d\geq B(\frac{\log|V|}{|V|})^{1/n}italic_d ≥ italic_B ( divide start_ARG roman_log | italic_V | end_ARG start_ARG | italic_V | end_ARG ) start_POSTSUPERSCRIPT 1 / italic_n end_POSTSUPERSCRIPT. Then it holds that

lim sup|V|+[x𝒳 s.t. (x+d)V=]=0.subscriptlimit-supremum𝑉delimited-[]𝑥𝒳 s.t. 𝑥𝑑𝑉0\displaystyle\limsup_{|V|\to+\infty}\mathbb{P}[\exists x\in\mathcal{X}\text{ s% .t. }(x+d\mathcal{B})\cap V=\emptyset]=0.lim sup start_POSTSUBSCRIPT | italic_V | → + ∞ end_POSTSUBSCRIPT blackboard_P [ ∃ italic_x ∈ caligraphic_X s.t. ( italic_x + italic_d caligraphic_B ) ∩ italic_V = ∅ ] = 0 .
Remark VI.2.

Lemma VI.4 indicates that for any x𝒳𝑥𝒳x\in\mathcal{X}italic_x ∈ caligraphic_X, there always is at lease one node in Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT within the spatial dispersion dksubscript𝑑𝑘d_{k}italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT of x𝑥xitalic_x with probability one in an asymptotic sense. Via the spatial dispersion, Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT can be treated as a grid in [15] and we can leverage all set-valued tools for analysis.

The following lemma shows the whole graph will be periodically updated within an interval of length P𝑃Pitalic_P.

Lemma VI.5.

It holds that Vk+P=kk+PSsubscript𝑉𝑘𝑃superscriptsubscriptnormal-ℓ𝑘𝑘𝑃subscript𝑆normal-ℓV_{k+P}\subseteq\bigcup_{\ell=k}^{k+P}S_{\ell}italic_V start_POSTSUBSCRIPT italic_k + italic_P end_POSTSUBSCRIPT ⊆ ⋃ start_POSTSUBSCRIPT roman_ℓ = italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k + italic_P end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT.

Proof.

Fix xVk+P𝑥superscript𝑉𝑘𝑃x\in V^{k+P}italic_x ∈ italic_V start_POSTSUPERSCRIPT italic_k + italic_P end_POSTSUPERSCRIPT and three cases arise:

  • Case 1: xSk𝑥subscript𝑆𝑘x\in S_{k}italic_x ∈ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Then the lemma trivially holds;

  • Case 2: xVkSk𝑥subscript𝑉𝑘subscript𝑆𝑘x\in V_{k}\setminus S_{k}italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. This implies ^k(x)<Psubscript^𝑘𝑥𝑃\hat{\mathcal{F}}_{k}(x)<Pover^ start_ARG caligraphic_F end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) < italic_P. Then the staleness of x𝑥xitalic_x increases to P𝑃Pitalic_P at iteration P^k(x)+k𝑃subscript^𝑘𝑥𝑘P-\hat{\mathcal{F}}_{k}(x)+kitalic_P - over^ start_ARG caligraphic_F end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) + italic_k. With that being said, xSP^k(x)+k=kk+PS𝑥subscript𝑆𝑃subscript^𝑘𝑥𝑘superscriptsubscript𝑘𝑘𝑃subscript𝑆x\in S_{P-\hat{\mathcal{F}}_{k}(x)+k}\subseteq\bigcup_{\ell=k}^{k+P}S_{\ell}italic_x ∈ italic_S start_POSTSUBSCRIPT italic_P - over^ start_ARG caligraphic_F end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) + italic_k end_POSTSUBSCRIPT ⊆ ⋃ start_POSTSUBSCRIPT roman_ℓ = italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k + italic_P end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT and this completes the proof in Case 2;

  • Case 3: xVk+PVk𝑥subscript𝑉𝑘𝑃subscript𝑉𝑘x\in V_{k+P}\setminus V_{k}italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k + italic_P end_POSTSUBSCRIPT ∖ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Then x𝑥xitalic_x is a newly added node at some iteration k[k+1,k+P]superscript𝑘𝑘1𝑘𝑃k^{\prime}\in[k+1,k+P]italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ [ italic_k + 1 , italic_k + italic_P ] and xSk=kk+PS𝑥subscript𝑆superscript𝑘superscriptsubscript𝑘𝑘𝑃subscript𝑆x\in S_{k^{\prime}}\bigcup_{\ell=k}^{k+P}S_{\ell}italic_x ∈ italic_S start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ⋃ start_POSTSUBSCRIPT roman_ℓ = italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k + italic_P end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT. This completes the proof in Case 3.

Since the above holds xVk+Pfor-all𝑥superscript𝑉𝑘𝑃\forall x\in V^{k+P}∀ italic_x ∈ italic_V start_POSTSUPERSCRIPT italic_k + italic_P end_POSTSUPERSCRIPT, proof is completed. ∎

The following lemma characterizes the values of nodes within goal regions.

Lemma VI.6.

Given k1𝑘1k\geq 1italic_k ≥ 1 and any x𝒳+(Mϵk+dk)𝑥𝒳𝑀subscriptitalic-ϵ𝑘subscript𝑑𝑘x\in\mathcal{X}+(M\epsilon_{k}+d_{k})\mathcal{B}italic_x ∈ caligraphic_X + ( italic_M italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B, it holds that Θk(x)=Θk*(x)=0superscriptsubscriptnormal-Θ𝑘normal-ℓ𝑥superscriptsubscriptnormal-Θ𝑘𝑥0\varTheta_{k}^{\ell}(x)=\varTheta_{k}^{*}(x)=0roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT ( italic_x ) = roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) = 0.

Proof.

Following the definition of the Bellman operator 𝔗ksubscript𝔗𝑘\mathfrak{T}_{k}fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and the initial values of Θ0subscriptΘ0\varTheta_{0}roman_Θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, one sees that Θk(x)=Θk1(x)==Θk0(x)=0superscriptsubscriptΘ𝑘𝑥superscriptsubscriptΘ𝑘1𝑥superscriptsubscriptΘ𝑘0𝑥0\varTheta_{k}^{\ell}(x)=\varTheta_{k}^{\ell-1}(x)=\cdots=\varTheta_{k}^{0}(x)=0roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT ( italic_x ) = roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ - 1 end_POSTSUPERSCRIPT ( italic_x ) = ⋯ = roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_x ) = 0. The same holds for Θk*(x)superscriptsubscriptΘ𝑘𝑥\varTheta_{k}^{*}(x)roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ). This completes the proof. ∎

VI-B Contraction property on a fixed graph

In this subsection, we analyze the monotonicity of the Bellman operator 𝔗ksubscript𝔗𝑘\mathfrak{T}_{k}fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT on a fixed graph, which is summarized in the following lemma.

Lemma VI.7.

The distance between a value function Θksubscriptnormal-Θ𝑘\varTheta_{k}roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and the fixed point Θk*subscriptsuperscriptnormal-Θ𝑘\varTheta^{*}_{k}roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is monotonically decreasing over the Bellman operator 𝔗ksubscript𝔗𝑘\mathfrak{T}_{k}fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT; i.e.,

𝔗k(Θk)Θk*VkΘkΘk*Vk.subscriptnormsubscript𝔗𝑘subscriptΘ𝑘superscriptsubscriptΘ𝑘subscript𝑉𝑘subscriptnormsubscriptΘ𝑘superscriptsubscriptΘ𝑘subscript𝑉𝑘\displaystyle\|\mathfrak{T}_{k}(\varTheta_{k})-\varTheta_{k}^{*}\|_{V_{k}}\leq% \|\varTheta_{k}-\varTheta_{k}^{*}\|_{V_{k}}.∥ fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) - roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≤ ∥ roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (6)

Specially, strict monotonic decrease holds xSkfor-all𝑥subscript𝑆𝑘\forall x\in S_{k}∀ italic_x ∈ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT; that is,

|𝔗k(Θk)(x)Θk*(x)|βkΘkΘk*Fk(x).subscript𝔗𝑘subscriptΘ𝑘𝑥superscriptsubscriptΘ𝑘𝑥subscript𝛽𝑘subscriptnormsubscriptΘ𝑘superscriptsubscriptΘ𝑘subscript𝐹𝑘𝑥\displaystyle|\mathfrak{T}_{k}(\varTheta_{k})(x)-\varTheta_{k}^{*}(x)|\leq% \beta_{k}\|\varTheta_{k}-\varTheta_{k}^{*}\|_{F_{k}(x)}.| fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ( italic_x ) - roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) | ≤ italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT . (7)

Moreover, the perturbed version of the above special case holds with probablity one in the asymptotic sense:

|𝔗k(Θk)(x)Θk*(x)|subscript𝔗𝑘subscriptΘ𝑘𝑥superscriptsubscriptΘ𝑘𝑥\displaystyle|\mathfrak{T}_{k}(\varTheta_{k})(x)-\varTheta_{k}^{*}(x)|| fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ( italic_x ) - roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) | (8)
\displaystyle\leq βkminx(x+dk)VkΘk(x)minx(x+dk)VkΘk*(x)𝒳𝒵subscript𝛽𝑘subscriptnormsubscriptsuperscript𝑥𝑥subscript𝑑𝑘subscript𝑉𝑘subscriptΘ𝑘superscript𝑥subscriptsuperscript𝑥𝑥subscript𝑑𝑘subscript𝑉𝑘subscriptsuperscriptΘ𝑘superscript𝑥𝒳𝒵\displaystyle\beta_{k}\|\min_{x^{\prime}\in(x+d_{k}\mathcal{B})\cap V_{k}}% \varTheta_{k}(x^{\prime})-\min_{x^{\prime}\in(x+d_{k}\mathcal{B})\cap V_{k}}% \varTheta^{*}_{k}(x^{\prime})\|_{\mathcal{X}\setminus\mathcal{Z}}italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT
\displaystyle\leq βkΘkΘk*Vk,subscript𝛽𝑘subscriptnormsubscriptΘ𝑘superscriptsubscriptΘ𝑘subscript𝑉𝑘\displaystyle\beta_{k}\|\varTheta_{k}-\varTheta_{k}^{*}\|_{V_{k}},italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ,

where 𝒵𝒵\mathcal{Z}caligraphic_Z is a set of sufficiently small measure.

Proof.

We first proceed to show the proofs of (7) and (8). Once (7) and (8) are proven, we extend the results to (6).

Claim VI.1.

Inequality (7) holds for every xSk𝑥subscript𝑆𝑘x\in S_{k}italic_x ∈ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT.

Proof.

Notice that 𝔗k(Θk*)=Θk*subscript𝔗𝑘subscriptsuperscriptΘ𝑘subscriptsuperscriptΘ𝑘\mathfrak{T}_{k}(\varTheta^{*}_{k})=\varTheta^{*}_{k}fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, since Θk*subscriptsuperscriptΘ𝑘\varTheta^{*}_{k}roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is the fixed point of 𝔗ksubscript𝔗𝑘\mathfrak{T}_{k}fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Two cases arise.

  • Case 1, x𝒳goal+(Mϵk+dk)𝑥subscript𝒳goal𝑀subscriptitalic-ϵ𝑘subscript𝑑𝑘x\in\mathcal{X}_{\mathrm{goal}}+(M\epsilon_{k}+d_{k})\mathcal{B}italic_x ∈ caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT + ( italic_M italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B. Then it follows from Lemma VI.6 that |𝔗k(Θk)(x)𝔗k(Θk*)(x)|=0βkΘkΘk*Vksubscript𝔗𝑘subscriptΘ𝑘𝑥subscript𝔗𝑘subscriptsuperscriptΘ𝑘𝑥0subscript𝛽𝑘subscriptnormsubscriptΘ𝑘superscriptsubscriptΘ𝑘subscript𝑉𝑘|\mathfrak{T}_{k}(\varTheta_{k})(x)-\mathfrak{T}_{k}(\varTheta^{*}_{k})(x)|=0% \leq\beta_{k}\|\varTheta_{k}-\varTheta_{k}^{*}\|_{V_{k}}| fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ( italic_x ) - fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ( italic_x ) | = 0 ≤ italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT.

  • Case 2, xSk(𝒳goal+(Mϵk+dk))𝑥subscript𝑆𝑘subscript𝒳goal𝑀subscriptitalic-ϵ𝑘subscript𝑑𝑘x\in S_{k}\setminus(\mathcal{X}_{\mathrm{goal}}+(M\epsilon_{k}+d_{k})\mathcal{% B})italic_x ∈ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ ( caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT + ( italic_M italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ). Then the following inequality holds:

    |𝔗k(Θk)(x)𝔗k(Θk*)(x)|subscript𝔗𝑘subscriptΘ𝑘𝑥subscript𝔗𝑘subscriptsuperscriptΘ𝑘𝑥\displaystyle|\mathfrak{T}_{k}(\varTheta_{k})(x)-\mathfrak{T}_{k}(\varTheta^{*% }_{k})(x)|| fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ( italic_x ) - fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ( italic_x ) | (9)
    =\displaystyle== |Δk+βkminxFk(x)Θ(x)ΔkβkminxFk(x)Θk*(x)|subscriptΔ𝑘subscript𝛽𝑘subscriptsuperscript𝑥subscript𝐹𝑘𝑥Θsuperscript𝑥subscriptΔ𝑘subscript𝛽𝑘subscriptsuperscript𝑥subscript𝐹𝑘𝑥subscriptsuperscriptΘ𝑘superscript𝑥\displaystyle\Big{|}\Delta_{k}+\beta_{k}\min_{x^{\prime}\in F_{k}(x)}\varTheta% (x^{\prime})-\Delta_{k}-\beta_{k}\min_{x^{\prime}\in F_{k}(x)}\varTheta^{*}_{k% }(x^{\prime})\Big{|}| roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT roman_Θ ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - roman_Δ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) |
    =\displaystyle== βk|minxFk(x)Θ(x)minxFk(x)Θk*(x)|subscript𝛽𝑘subscriptsuperscript𝑥subscript𝐹𝑘𝑥Θsuperscript𝑥subscriptsuperscript𝑥subscript𝐹𝑘𝑥subscriptsuperscriptΘ𝑘superscript𝑥\displaystyle\beta_{k}\Big{|}\min_{x^{\prime}\in F_{k}(x)}\varTheta(x^{\prime}% )-\min_{x^{\prime}\in F_{k}(x)}\varTheta^{*}_{k}(x^{\prime})\Big{|}italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT roman_Θ ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) |
    \displaystyle\leq βkΘkΘk*Fk(x).subscript𝛽𝑘subscriptnormsubscriptΘ𝑘superscriptsubscriptΘ𝑘subscript𝐹𝑘𝑥\displaystyle\beta_{k}\|\varTheta_{k}-\varTheta_{k}^{*}\|_{F_{k}(x)}.italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT .

In summary, we have 𝔗k(Θk)𝔗k(Θk*)SkβkΘkΘk*Vksubscriptnormsubscript𝔗𝑘subscriptΘ𝑘subscript𝔗𝑘subscriptsuperscriptΘ𝑘subscript𝑆𝑘subscript𝛽𝑘subscriptnormsubscriptΘ𝑘superscriptsubscriptΘ𝑘subscript𝑉𝑘\|\mathfrak{T}_{k}(\varTheta_{k})-\mathfrak{T}_{k}(\varTheta^{*}_{k})\|_{S_{k}% }\leq\beta_{k}\|\varTheta_{k}-\varTheta_{k}^{*}\|_{V_{k}}∥ fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) - fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ∥ start_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≤ italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT. This completes the proof of Claim VI.1 and (7) is proven. ∎

Claim VI.2.

Inequality (8) holds with probability one in the asymptotic sense.

Proof.

Following the argument towards Claim VI.1, Case 1 trivially holds for Claim VI.2. We rewrite (9) for xSk(𝒳goal+(Mϵk+dk))𝑥subscript𝑆𝑘subscript𝒳goal𝑀subscriptitalic-ϵ𝑘subscript𝑑𝑘x\in S_{k}\setminus(\mathcal{X}_{\mathrm{goal}}+(M\epsilon_{k}+d_{k})\mathcal{% B})italic_x ∈ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ ( caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT + ( italic_M italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ) and arrive at

|𝔗k(Θk)(x)𝔗k(Θk*)(x)|subscript𝔗𝑘subscriptΘ𝑘𝑥subscript𝔗𝑘subscriptsuperscriptΘ𝑘𝑥\displaystyle|\mathfrak{T}_{k}(\varTheta_{k})(x)-\mathfrak{T}_{k}(\varTheta^{*% }_{k})(x)|| fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ( italic_x ) - fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ( italic_x ) | (10)
=\displaystyle== βk|minxFk(x)Θk(x)minxFk(x)Θk*(x)|subscript𝛽𝑘subscriptsuperscript𝑥subscript𝐹𝑘𝑥subscriptΘ𝑘superscript𝑥subscriptsuperscript𝑥subscript𝐹𝑘𝑥subscriptsuperscriptΘ𝑘superscript𝑥\displaystyle\beta_{k}\Big{|}\min_{x^{\prime}\in F_{k}(x)}\varTheta_{k}(x^{% \prime})-\min_{x^{\prime}\in F_{k}(x)}\varTheta^{*}_{k}(x^{\prime})\Big{|}italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) |
\displaystyle\leq βkminxFk(x)Θk(x)minxFk(x)Θk*(x)Vk.subscript𝛽𝑘subscriptnormsubscriptsuperscript𝑥subscript𝐹𝑘𝑥subscriptΘ𝑘superscript𝑥subscriptsuperscript𝑥subscript𝐹𝑘𝑥subscriptsuperscriptΘ𝑘superscript𝑥subscript𝑉𝑘\displaystyle\beta_{k}\Big{\|}\min_{x^{\prime}\in F_{k}(x)}\varTheta_{k}(x^{% \prime})-\min_{x^{\prime}\in F_{k}(x)}\varTheta^{*}_{k}(x^{\prime})\Big{\|}_{V% _{k}}.italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT .

It follows from the supplementary definition in Theorem VI.1 that Θk(x)=1,x𝒳(𝒳free+dk)formulae-sequencesubscriptΘ𝑘𝑥1for-all𝑥𝒳subscript𝒳freesubscript𝑑𝑘\varTheta_{k}(x)=1,\forall x\in\mathcal{X}\setminus(\mathcal{X}_{\mathrm{free}% }+d_{k}\mathcal{B})roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) = 1 , ∀ italic_x ∈ caligraphic_X ∖ ( caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ). Then it follow from the definition of Fk(x)subscript𝐹𝑘𝑥F_{k}(x)italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) in (3) that

minxFk(x)Θk(x)=minx(x+ϵkuUf(x,u)+ρ(dk,ϵk))VkΘ(x).subscriptsuperscript𝑥subscript𝐹𝑘𝑥subscriptΘ𝑘superscript𝑥subscriptsuperscript𝑥𝑥subscriptitalic-ϵ𝑘subscript𝑢𝑈𝑓𝑥𝑢𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘subscript𝑉𝑘Θsuperscript𝑥\displaystyle\min_{x^{\prime}\in F_{k}(x)}\varTheta_{k}(x^{\prime})=\min_{x^{% \prime}\in(x+\epsilon_{k}\bigcup_{u\in U}f(x,u)+\rho(d_{k},\epsilon_{k})% \mathcal{B})\cap V_{k}}\varTheta(x^{\prime}).roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ⋃ start_POSTSUBSCRIPT italic_u ∈ italic_U end_POSTSUBSCRIPT italic_f ( italic_x , italic_u ) + italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) .

Same holds for Θk*superscriptsubscriptΘ𝑘\varTheta_{k}^{*}roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT. Then by Lemma VI.4 and (4) in Lemma VI.2, (10) renders with probability one at

minxFk(x)Θk(x)minxFk(x)Θk*(x)Vksubscriptnormsubscriptsuperscript𝑥subscript𝐹𝑘𝑥subscriptΘ𝑘superscript𝑥subscriptsuperscript𝑥subscript𝐹𝑘𝑥subscriptsuperscriptΘ𝑘superscript𝑥subscript𝑉𝑘\displaystyle\|\min_{x^{\prime}\in F_{k}(x)}\varTheta_{k}(x^{\prime})-\min_{x^% {\prime}\in F_{k}(x)}\varTheta^{*}_{k}(x^{\prime})\|_{V_{k}}∥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT (11)
\displaystyle\leq minx(x+ρ(dk,ϵk))VkΘk(x)\displaystyle\|\min_{x^{\prime}\in(x+\rho(d_{k},\epsilon_{k})\mathcal{B})\cap V% _{k}}\varTheta_{k}(x^{\prime})∥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT )
minx(x+ρ(dk,ϵk))VkΘk*(x)Vk.evaluated-atsubscriptsuperscript𝑥𝑥𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘subscript𝑉𝑘subscriptsuperscriptΘ𝑘superscript𝑥subscript𝑉𝑘\displaystyle\quad-\min_{x^{\prime}\in(x+\rho(d_{k},\epsilon_{k})\mathcal{B})% \cap V_{k}}\varTheta^{*}_{k}(x^{\prime})\|_{V_{k}}.- roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT .

Since ρ(dk,ϵk)dk𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘subscript𝑑𝑘\rho(d_{k},\epsilon_{k})\geq d_{k}italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ≥ italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, we apply (4) again and (11) renders at

minx(x+ρ(dk,ϵk))VkΘk(x)\displaystyle\|\min_{x^{\prime}\in(x+\rho(d_{k},\epsilon_{k})\mathcal{B})\cap V% _{k}}\varTheta_{k}(x^{\prime})∥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) (12)
minx(x+ρ(dk,ϵk))VkΘk*(x)Vkevaluated-atsubscriptsuperscript𝑥𝑥𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘subscript𝑉𝑘subscriptsuperscriptΘ𝑘superscript𝑥subscript𝑉𝑘\displaystyle\quad-\min_{x^{\prime}\in(x+\rho(d_{k},\epsilon_{k})\mathcal{B})% \cap V_{k}}\varTheta^{*}_{k}(x^{\prime})\|_{V_{k}}- roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT
\displaystyle\leq minx(x+dk)VkΘk(x)minx(x+dk)VkΘk*(x)Vk.subscriptnormsubscriptsuperscript𝑥𝑥subscript𝑑𝑘subscript𝑉𝑘subscriptΘ𝑘superscript𝑥subscriptsuperscript𝑥𝑥subscript𝑑𝑘subscript𝑉𝑘subscriptsuperscriptΘ𝑘superscript𝑥subscript𝑉𝑘\displaystyle\|\min_{x^{\prime}\in(x+d_{k}\mathcal{B})\cap V_{k}}\varTheta_{k}% (x^{\prime})-\min_{x^{\prime}\in(x+d_{k}\mathcal{B})\cap V_{k}}\varTheta^{*}_{% k}(x^{\prime})\|_{V_{k}}.∥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT .

Since 𝒵𝒵\mathcal{Z}caligraphic_Z is a set of sufficiently small measure, a new node falls into 𝒵𝒵\mathcal{Z}caligraphic_Z with sufficiently small probability; i.e., [Vk𝒵]=0delimited-[]subscript𝑉𝑘𝒵0\mathbb{P}[V_{k}\cap\mathcal{Z}\neq\emptyset]=0blackboard_P [ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∩ caligraphic_Z ≠ ∅ ] = 0. With that being said, it follows again from (4) and (12) that the following holds with probability one:

minx(x+dk)VkΘk(x)minx(x+dk)VkΘk*(x)Vksubscriptnormsubscriptsuperscript𝑥𝑥subscript𝑑𝑘subscript𝑉𝑘subscriptΘ𝑘superscript𝑥subscriptsuperscript𝑥𝑥subscript𝑑𝑘subscript𝑉𝑘subscriptsuperscriptΘ𝑘superscript𝑥subscript𝑉𝑘\displaystyle\|\min_{x^{\prime}\in(x+d_{k}\mathcal{B})\cap V_{k}}\varTheta_{k}% (x^{\prime})-\min_{x^{\prime}\in(x+d_{k}\mathcal{B})\cap V_{k}}\varTheta^{*}_{% k}(x^{\prime})\|_{V_{k}}∥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT
\displaystyle\leq minx(x+dk)VkΘk(x)minx(x+dk)VkΘk*(x)𝒳𝒵.subscriptnormsubscriptsuperscript𝑥𝑥subscript𝑑𝑘subscript𝑉𝑘subscriptΘ𝑘superscript𝑥subscriptsuperscript𝑥𝑥subscript𝑑𝑘subscript𝑉𝑘subscriptsuperscriptΘ𝑘superscript𝑥𝒳𝒵\displaystyle\|\min_{x^{\prime}\in(x+d_{k}\mathcal{B})\cap V_{k}}\varTheta_{k}% (x^{\prime})-\min_{x^{\prime}\in(x+d_{k}\mathcal{B})\cap V_{k}}\varTheta^{*}_{% k}(x^{\prime})\|_{\mathcal{X}\setminus\mathcal{Z}}.∥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT .

Then it follows from (5), the above inequality renders at

minx(x+dk)VkΘk(x)minx(x+dk)VkΘk*(x)𝒳𝒵subscriptnormsubscriptsuperscript𝑥𝑥subscript𝑑𝑘subscript𝑉𝑘subscriptΘ𝑘superscript𝑥subscriptsuperscript𝑥𝑥subscript𝑑𝑘subscript𝑉𝑘subscriptsuperscriptΘ𝑘superscript𝑥𝒳𝒵\displaystyle\|\min_{x^{\prime}\in(x+d_{k}\mathcal{B})\cap V_{k}}\varTheta_{k}% (x^{\prime})-\min_{x^{\prime}\in(x+d_{k}\mathcal{B})\cap V_{k}}\varTheta^{*}_{% k}(x^{\prime})\|_{\mathcal{X}\setminus\mathcal{Z}}\allowdisplaybreaks∥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT
\displaystyle\leq minx(x+dk)VkΘk(x)minx(x+dk)VkΘk*(x)𝒳subscriptnormsubscriptsuperscript𝑥𝑥subscript𝑑𝑘subscript𝑉𝑘subscriptΘ𝑘superscript𝑥subscriptsuperscript𝑥𝑥subscript𝑑𝑘subscript𝑉𝑘subscriptsuperscriptΘ𝑘superscript𝑥𝒳\displaystyle\|\min_{x^{\prime}\in(x+d_{k}\mathcal{B})\cap V_{k}}\varTheta_{k}% (x^{\prime})-\min_{x^{\prime}\in(x+d_{k}\mathcal{B})\cap V_{k}}\varTheta^{*}_{% k}(x^{\prime})\|_{\mathcal{X}}\allowdisplaybreaks∥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT
\displaystyle\leq ΘkΘk*Vk.subscriptnormsubscriptΘ𝑘subscriptsuperscriptΘ𝑘subscript𝑉𝑘\displaystyle\|\varTheta_{k}-\varTheta^{*}_{k}\|_{V_{k}}.∥ roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT .

Combining all above relations completes the proof. ∎

Claim VI.3.

Inequality (6) holds true.

Proof.

It follows from Lemma VI.6 that for xSk(𝒳goal+(Mϵk+dk))𝑥subscript𝑆𝑘subscript𝒳goal𝑀subscriptitalic-ϵ𝑘subscript𝑑𝑘x\in S_{k}\setminus(\mathcal{X}_{\mathrm{goal}}+(M\epsilon_{k}+d_{k})\mathcal{% B})italic_x ∈ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ ( caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT + ( italic_M italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ), |𝔗k(Θk)(x)𝔗k(Θk*)(x)|=0ΘkΘk*Vksubscript𝔗𝑘subscriptΘ𝑘𝑥subscript𝔗𝑘superscriptsubscriptΘ𝑘𝑥0subscriptnormsubscriptΘ𝑘superscriptsubscriptΘ𝑘subscript𝑉𝑘|\mathfrak{T}_{k}(\varTheta_{k})(x)-\mathfrak{T}_{k}(\varTheta_{k}^{*})(x)|=0% \leq\|\varTheta_{k}-\varTheta_{k}^{*}\|_{V_{k}}| fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ( italic_x ) - fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ( italic_x ) | = 0 ≤ ∥ roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Then it follows from Claim VI.1 that (6) holds for xSk𝑥subscript𝑆𝑘x\in S_{k}italic_x ∈ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. This completes the proof. ∎

It follows from Claims VI.1, VI.2 and VI.3 that Lemma VI.7 holds. This completes the proof. ∎

Remark VI.3.

The sufficiently small set 𝒵𝒵\mathcal{Z}caligraphic_Z is to accommodate the result of Corollary .2 in the proof of Theorem V.1, where the uniform convergence of the Kruzhkov transformed functions holds in an almost sure sense.

VI-C Asynchronous contraction over graphs

Following Algorithm 3, we recursively define the set of updated nodes by SkmkSksuperscriptsubscript𝑆𝑘subscript𝑚𝑘subscript𝑆𝑘S_{k}^{m_{k}}\triangleq S_{k}italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ≜ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, and Sk=xSk+1Fk(x),{0,,mk1}formulae-sequencesuperscriptsubscript𝑆𝑘subscript𝑥superscriptsubscript𝑆𝑘1subscript𝐹𝑘𝑥for-all0subscript𝑚𝑘1S_{k}^{\ell}=\bigcup_{x\in S_{k}^{\ell+1}}F_{k}(x),\forall\ell\in\{0,\dots,m_{% k}-1\}italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT = ⋃ start_POSTSUBSCRIPT italic_x ∈ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ + 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) , ∀ roman_ℓ ∈ { 0 , … , italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - 1 }. Correspondingly, define the value function by Θk(x)𝔗k(Θk1)(x),xSkformulae-sequencesubscriptsuperscriptΘ𝑘𝑥subscript𝔗𝑘subscriptsuperscriptΘ1𝑘𝑥for-all𝑥superscriptsubscript𝑆𝑘\varTheta^{\ell}_{k}(x)\triangleq\mathfrak{T}_{k}(\varTheta^{\ell-1}_{k})(x),% \forall x\in S_{k}^{\ell}roman_Θ start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) ≜ fraktur_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( roman_Θ start_POSTSUPERSCRIPT roman_ℓ - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ( italic_x ) , ∀ italic_x ∈ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT and {1,,mk}1subscript𝑚𝑘\ell\in\{1,\dots,m_{k}\}roman_ℓ ∈ { 1 , … , italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT }, and Θk0(x)=Θ^k(x),xVkformulae-sequencesubscriptsuperscriptΘ0𝑘𝑥subscript^Θ𝑘𝑥for-all𝑥subscript𝑉𝑘\varTheta^{0}_{k}(x)=\hat{\varTheta}_{k}(x),\forall x\in V_{k}roman_Θ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) = over^ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) , ∀ italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Starting from this subsection, we consider two perturbed estimates of the minimal traveling time function Θ~k*(x)minx(x+dk)VkΘk*(x)superscriptsubscript~Θ𝑘𝑥subscriptsuperscript𝑥𝑥subscript𝑑𝑘subscript𝑉𝑘superscriptsubscriptΘ𝑘superscript𝑥\displaystyle\tilde{\varTheta}_{k}^{*}(x)\triangleq\min_{x^{\prime}\in(x+d_{k}% \mathcal{B})\cap V_{k}}\varTheta_{k}^{*}(x^{\prime})over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) ≜ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) and Θ~k*,(x)minx(x+dk1)VkΘk*(x)superscriptsubscript~Θ𝑘𝑥subscriptsuperscript𝑥𝑥subscript𝑑𝑘1subscript𝑉𝑘superscriptsubscriptΘ𝑘superscript𝑥\displaystyle\tilde{\varTheta}_{k}^{*,-}(x)\triangleq\min_{x^{\prime}\in(x+d_{% k-1}\mathcal{B})\cap V_{k}}\varTheta_{k}^{*}(x^{\prime})over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * , - end_POSTSUPERSCRIPT ( italic_x ) ≜ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ). Notice that the perturbation radii dksubscript𝑑𝑘d_{k}italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and dk1subscript𝑑𝑘1d_{k-1}italic_d start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT below the minimization operator are different. Similar notations apply to Θ~kmsuperscriptsubscript~Θ𝑘𝑚\tilde{\varTheta}_{k}^{m}over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT and Θ~km,superscriptsubscript~Θ𝑘𝑚\tilde{\varTheta}_{k}^{m,-}over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m , - end_POSTSUPERSCRIPT respectively for m0𝑚subscriptabsent0m\in\mathbb{N}_{\geq 0}italic_m ∈ blackboard_N start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT. The following lemma characterizes the contraction property between two consecutive graphs.

Lemma VI.8.

Consider xSk𝑥subscript𝑆𝑘x\in S_{k}italic_x ∈ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. It holds with probability one:

|Θkmk(x)Θk*(x)|βkmkΘk1mk1Θk1*Vk1+βkmkck,subscriptsuperscriptΘsubscript𝑚𝑘𝑘𝑥subscriptsuperscriptΘ𝑘𝑥superscriptsubscript𝛽𝑘subscript𝑚𝑘subscriptnormsubscriptsuperscriptΘsubscript𝑚𝑘1𝑘1subscriptsuperscriptΘ𝑘1subscript𝑉𝑘1superscriptsubscript𝛽𝑘subscript𝑚𝑘subscript𝑐𝑘\displaystyle|\varTheta^{m_{k}}_{k}(x)-\varTheta^{*}_{k}(x)|\leq\beta_{k}^{m_{% k}}\|\varTheta^{m_{k-1}}_{k-1}-\varTheta^{*}_{k-1}\|_{V_{k-1}}+\beta_{k}^{m_{k% }}c_{k},| roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) | ≤ italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∥ roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ,

where ckΘ~k1*Θ~k*,𝒳𝒵normal-≜subscript𝑐𝑘subscriptnormsubscriptsuperscriptnormal-~normal-Θ𝑘1subscriptsuperscriptnormal-~normal-Θ𝑘𝒳𝒵c_{k}\triangleq\|\tilde{\varTheta}^{*}_{k-1}-\tilde{\varTheta}^{*,-}_{k}\|_{% \mathcal{X}\setminus\mathcal{Z}}italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≜ ∥ over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * , - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT and 𝒵𝒵\mathcal{Z}caligraphic_Z is sufficiently small.

Proof.

Fix xSk𝑥subscript𝑆𝑘x\in S_{k}italic_x ∈ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. By line 2 in Algorithm 2, we apply (7) in Lemma VI.7 for mk1subscript𝑚𝑘1m_{k}-1italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - 1 times and it renders at

|Θkmk(x)Θk*(x)|βkmk1Θk1Θk*Vk.subscriptsuperscriptΘsubscript𝑚𝑘𝑘𝑥subscriptsuperscriptΘ𝑘𝑥superscriptsubscript𝛽𝑘subscript𝑚𝑘1subscriptnormsubscriptsuperscriptΘ1𝑘subscriptsuperscriptΘ𝑘subscript𝑉𝑘\displaystyle|\varTheta^{m_{k}}_{k}(x)-\varTheta^{*}_{k}(x)|\leq\beta_{k}^{m_{% k}-1}\|\varTheta^{1}_{k}-\varTheta^{*}_{k}\|_{V_{k}}.| roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) | ≤ italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT ∥ roman_Θ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (13)

Then it again follows from (6), (8) in Lemma VI.7 and Assumption (A8) that

Θk1Θk*VksubscriptnormsubscriptsuperscriptΘ1𝑘subscriptsuperscriptΘ𝑘subscript𝑉𝑘\displaystyle\|\varTheta^{1}_{k}-\varTheta^{*}_{k}\|_{V_{k}}∥ roman_Θ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT (14)
\displaystyle\leq βkminx(x+ρ(dk,ϵk))VkΘk0(x)conditionalsubscript𝛽𝑘subscriptsuperscript𝑥𝑥𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘subscript𝑉𝑘subscriptsuperscriptΘ0𝑘superscript𝑥\displaystyle\beta_{k}\|\min_{x^{\prime}\in(x+\rho(d_{k},\epsilon_{k})\mathcal% {B})\cap V_{k}}\varTheta^{0}_{k}(x^{\prime})italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT )
minx(x+ρ(dk,ϵk))VkΘk*(x)𝒳𝒵evaluated-atsubscriptsuperscript𝑥𝑥𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘subscript𝑉𝑘subscriptsuperscriptΘ𝑘superscript𝑥𝒳𝒵\displaystyle-\min_{x^{\prime}\in(x+\rho(d_{k},\epsilon_{k})\mathcal{B})\cap V% _{k}}\varTheta^{*}_{k}(x^{\prime})\|_{\mathcal{X}\setminus\mathcal{Z}}- roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT
\displaystyle\leq βkminx(x+dk1)VkΘk0(x)minx(x+dk1)VkΘk*(x)𝒳𝒵subscript𝛽𝑘subscriptnormsubscriptsuperscript𝑥𝑥subscript𝑑𝑘1subscript𝑉𝑘superscriptsubscriptΘ𝑘0superscript𝑥subscriptsuperscript𝑥𝑥subscript𝑑𝑘1subscript𝑉𝑘subscriptsuperscriptΘ𝑘superscript𝑥𝒳𝒵\displaystyle\beta_{k}\|\min_{x^{\prime}\in(x+d_{k-1}\mathcal{B})\cap V_{k}}% \varTheta_{k}^{0}(x^{\prime})-\min_{x^{\prime}\in(x+d_{k-1}\mathcal{B})\cap V_% {k}}\varTheta^{*}_{k}(x^{\prime})\|_{\mathcal{X}\setminus\mathcal{Z}}italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT
=\displaystyle== βkΘ~k0,Θ~k*,𝒳𝒵.subscript𝛽𝑘subscriptnormsuperscriptsubscript~Θ𝑘0superscriptsubscript~Θ𝑘𝒳𝒵\displaystyle\beta_{k}\|\tilde{\varTheta}_{k}^{0,-}-\tilde{\varTheta}_{k}^{*,-% }\|_{\mathcal{X}\setminus\mathcal{Z}}.italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 , - end_POSTSUPERSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * , - end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT .

The last inequality follows from (4) in Lemma VI.2. Let xkVkVk1subscript𝑥𝑘subscript𝑉𝑘subscript𝑉𝑘1x_{k}\in V_{k}\setminus V_{k-1}italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_V start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT and two cases arise. We proceed to show both cases render at the same result, summarized in (20).

Case 1, xk𝒳(𝒳goal+(Mϵk+dk))subscript𝑥𝑘𝒳subscript𝒳goal𝑀subscriptitalic-ϵ𝑘subscript𝑑𝑘x_{k}\in\mathcal{X}\setminus(\mathcal{X}_{\mathrm{goal}}+(M\epsilon_{k}+d_{k})% \mathcal{B})italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ caligraphic_X ∖ ( caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT + ( italic_M italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ). It follows from Lemma VI.6 that Θk0(xk)=Θk*(xk)=0superscriptsubscriptΘ𝑘0subscript𝑥𝑘superscriptsubscriptΘ𝑘subscript𝑥𝑘0\varTheta_{k}^{0}(x_{k})=\varTheta_{k}^{*}(x_{k})=0roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = 0; that is, Θ~k0,(x)=Θ~k*,(x)=0,xxk+dk1formulae-sequencesuperscriptsubscript~Θ𝑘0𝑥superscriptsubscript~Θ𝑘𝑥0for-all𝑥subscript𝑥𝑘subscript𝑑𝑘1\tilde{\varTheta}_{k}^{0,-}(x)=\tilde{\varTheta}_{k}^{*,-}(x)=0,\forall x\in x% _{k}+d_{k-1}\mathcal{B}over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 , - end_POSTSUPERSCRIPT ( italic_x ) = over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * , - end_POSTSUPERSCRIPT ( italic_x ) = 0 , ∀ italic_x ∈ italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT caligraphic_B. Then we have

Θ~k0,Θ~k*,𝒳𝒵subscriptnormsuperscriptsubscript~Θ𝑘0superscriptsubscript~Θ𝑘𝒳𝒵\displaystyle\|\tilde{\varTheta}_{k}^{0,-}-\tilde{\varTheta}_{k}^{*,-}\|_{% \mathcal{X}\setminus\mathcal{Z}}∥ over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 , - end_POSTSUPERSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * , - end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT (15)
=\displaystyle== Θ~k0,Θ~k*,𝒳(𝒵(xk+dk1))subscriptnormsuperscriptsubscript~Θ𝑘0superscriptsubscript~Θ𝑘𝒳𝒵subscript𝑥𝑘subscript𝑑𝑘1\displaystyle\|\tilde{\varTheta}_{k}^{0,-}-\tilde{\varTheta}_{k}^{*,-}\|_{% \mathcal{X}\setminus(\mathcal{Z}\cup(x_{k}+d_{k-1}\mathcal{B}))}\allowdisplaybreaks∥ over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 , - end_POSTSUPERSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * , - end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ ( caligraphic_Z ∪ ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT caligraphic_B ) ) end_POSTSUBSCRIPT
\displaystyle\leq Θ~k0,Θ~k1*𝒳(xk+dk1)+ck.subscriptnormsuperscriptsubscript~Θ𝑘0superscriptsubscript~Θ𝑘1𝒳subscript𝑥𝑘subscript𝑑𝑘1subscript𝑐𝑘\displaystyle\|\tilde{\varTheta}_{k}^{0,-}-\tilde{\varTheta}_{k-1}^{*}\|_{% \mathcal{X}\setminus(x_{k}+d_{k-1}\mathcal{B})}+c_{k}.∥ over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 , - end_POSTSUPERSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT caligraphic_B ) end_POSTSUBSCRIPT + italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT .

We focus on the first term. For x𝒳(𝒵(xk+dk1))𝑥𝒳𝒵subscript𝑥𝑘subscript𝑑𝑘1x\in\mathcal{X}\setminus(\mathcal{Z}\cup(x_{k}+d_{k-1}\mathcal{B}))italic_x ∈ caligraphic_X ∖ ( caligraphic_Z ∪ ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT caligraphic_B ) ), since no changes are made to values at xVk1𝑥subscript𝑉𝑘1x\in V_{k-1}italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT, we have

Θ~k0,(x)=minx(x+dk1)VkΘk0(x)subscriptsuperscript~Θ0𝑘𝑥subscriptsuperscript𝑥𝑥subscript𝑑𝑘1subscript𝑉𝑘subscriptsuperscriptΘ0𝑘superscript𝑥\displaystyle\tilde{\varTheta}^{0,-}_{k}(x)=\min_{x^{\prime}\in(x+d_{k-1}% \mathcal{B})\cap V_{k}}\varTheta^{0}_{k}(x^{\prime})\allowdisplaybreaksover~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT 0 , - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) = roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) (16)
=\displaystyle== minx(x+dk1)Vk1Θk1mk1(x)=Θ~k1mk1(x).subscriptsuperscript𝑥𝑥subscript𝑑𝑘1subscript𝑉𝑘1subscriptsuperscriptΘsubscript𝑚𝑘1𝑘1superscript𝑥subscriptsuperscript~Θsubscript𝑚𝑘1𝑘1𝑥\displaystyle\min_{x^{\prime}\in(x+d_{k-1}\mathcal{B})\cap V_{k-1}}\varTheta^{% m_{k-1}}_{k-1}(x^{\prime})=\tilde{\varTheta}^{m_{k-1}}_{k-1}(x).roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ( italic_x ) .

Combining (15) and (16) renders at

Θ~k0,Θ~k*,𝒳𝒵subscriptnormsuperscriptsubscript~Θ𝑘0superscriptsubscript~Θ𝑘𝒳𝒵absent\displaystyle\|\tilde{\varTheta}_{k}^{0,-}-\tilde{\varTheta}_{k}^{*,-}\|_{% \mathcal{X}\setminus\mathcal{Z}}\leq∥ over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 , - end_POSTSUPERSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * , - end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT ≤ Θ~k1mk1Θ~k1*𝒳(xk+dk1)+cksubscriptnormsubscriptsuperscript~Θsubscript𝑚𝑘1𝑘1subscriptsuperscript~Θ𝑘1𝒳subscript𝑥𝑘subscript𝑑𝑘1subscript𝑐𝑘\displaystyle\|\tilde{\varTheta}^{m_{k-1}}_{k-1}-\tilde{\varTheta}^{*}_{k-1}\|% _{\mathcal{X}\setminus(x_{k}+d_{k-1}\mathcal{B})}+c_{k}∥ over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT caligraphic_B ) end_POSTSUBSCRIPT + italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT (17)
\displaystyle\leq Θ~k1mk1Θ~k1*𝒳+ck.subscriptnormsubscriptsuperscript~Θsubscript𝑚𝑘1𝑘1subscriptsuperscript~Θ𝑘1𝒳subscript𝑐𝑘\displaystyle\|\tilde{\varTheta}^{m_{k-1}}_{k-1}-\tilde{\varTheta}^{*}_{k-1}\|% _{\mathcal{X}}+c_{k}.∥ over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT + italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT .

Case 2, xk𝒳goal+(Mϵk+dk)subscript𝑥𝑘subscript𝒳goal𝑀subscriptitalic-ϵ𝑘subscript𝑑𝑘x_{k}\in\mathcal{X}_{\mathrm{goal}}+(M\epsilon_{k}+d_{k})\mathcal{B}italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT + ( italic_M italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B. Then by the triangular inequality of supremum norm, one has a result similar to (15):

Θ~k0,Θ~k*,𝒳𝒵Θ~k0,Θ~k1*𝒳+ck.subscriptnormsuperscriptsubscript~Θ𝑘0superscriptsubscript~Θ𝑘𝒳𝒵subscriptnormsuperscriptsubscript~Θ𝑘0superscriptsubscript~Θ𝑘1𝒳subscript𝑐𝑘\displaystyle\|\tilde{\varTheta}_{k}^{0,-}-\tilde{\varTheta}_{k}^{*,-}\|_{% \mathcal{X}\setminus\mathcal{Z}}\leq\|\tilde{\varTheta}_{k}^{0,-}-\tilde{% \varTheta}_{k-1}^{*}\|_{\mathcal{X}}+c_{k}.∥ over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 , - end_POSTSUPERSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * , - end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT ≤ ∥ over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 , - end_POSTSUPERSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT + italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT . (18)

We focus on the first term Θ~k0,Θ~k1*𝒳subscriptnormsubscriptsuperscript~Θ0𝑘subscriptsuperscript~Θ𝑘1𝒳\|\tilde{\varTheta}^{0,-}_{k}-\tilde{\varTheta}^{*}_{k-1}\|_{\mathcal{X}}∥ over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT 0 , - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT. For any x𝒳(xk+dk1)𝑥𝒳subscript𝑥𝑘subscript𝑑𝑘1x\in\mathcal{X}\setminus(x_{k}+d_{k-1}\mathcal{B})italic_x ∈ caligraphic_X ∖ ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT caligraphic_B ), it renders at the identical result as (16). For any xxk+dk1𝑥subscript𝑥𝑘subscript𝑑𝑘1x\in x_{k}+d_{k-1}\mathcal{B}italic_x ∈ italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT caligraphic_B, since xk𝒳(𝒳goal+(Mϵk+dk))subscript𝑥𝑘𝒳subscript𝒳goal𝑀subscriptitalic-ϵ𝑘subscript𝑑𝑘x_{k}\in\mathcal{X}\setminus(\mathcal{X}_{\mathrm{goal}}+(M\epsilon_{k}+d_{k})% \mathcal{B})italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ caligraphic_X ∖ ( caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT + ( italic_M italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ), it follows line 1 in Algorithm 1 that minx(x+dk1)VkΘk0(x)Θk0(xk)=1subscriptsuperscript𝑥𝑥subscript𝑑𝑘1subscript𝑉𝑘subscriptsuperscriptΘ0𝑘superscript𝑥subscriptsuperscriptΘ0𝑘subscript𝑥𝑘1\min_{x^{\prime}\in(x+d_{k-1}\mathcal{B})\cap V_{k}}\varTheta^{0}_{k}(x^{% \prime})\leq\varTheta^{0}_{k}(x_{k})=1roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ≤ roman_Θ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = 1. Then we have

Θ~k0,(x)=minx(x+dk1)Vk1Θk1mk1(x)=Θ~k1mk1(x).subscriptsuperscript~Θ0𝑘𝑥subscriptsuperscript𝑥𝑥subscript𝑑𝑘1subscript𝑉𝑘1subscriptsuperscriptΘsubscript𝑚𝑘1𝑘1superscript𝑥subscriptsuperscript~Θsubscript𝑚𝑘1𝑘1𝑥\displaystyle\tilde{\varTheta}^{0,-}_{k}(x)=\min_{x^{\prime}\in(x+d_{k-1}% \mathcal{B})\cap V_{k-1}}\varTheta^{m_{k-1}}_{k-1}(x^{\prime})=\tilde{% \varTheta}^{m_{k-1}}_{k-1}(x).over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT 0 , - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) = roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ( italic_x ) . (19)

Combining (18) and (19) renders at

Θ~k0,Θ~k*,𝒳𝒵Θ~k1mk1Θ~k1*𝒳+ck,subscriptnormsuperscriptsubscript~Θ𝑘0superscriptsubscript~Θ𝑘𝒳𝒵subscriptnormsubscriptsuperscript~Θsubscript𝑚𝑘1𝑘1subscriptsuperscript~Θ𝑘1𝒳subscript𝑐𝑘\displaystyle\|\tilde{\varTheta}_{k}^{0,-}-\tilde{\varTheta}_{k}^{*,-}\|_{% \mathcal{X}\setminus\mathcal{Z}}\leq\|\tilde{\varTheta}^{m_{k-1}}_{k-1}-\tilde% {\varTheta}^{*}_{k-1}\|_{\mathcal{X}}+c_{k},∥ over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 , - end_POSTSUPERSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * , - end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT ≤ ∥ over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT + italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , (20)

which is identical to (17). This verifies that the two cases on xksubscript𝑥𝑘x_{k}italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT render at the same result.

Notice that (20) can be fruther relaxed as

Θ~k0,Θ~k1*𝒳=subscriptnormsubscriptsuperscript~Θ0𝑘subscriptsuperscript~Θ𝑘1𝒳absent\displaystyle\|\tilde{\varTheta}^{0,-}_{k}-\tilde{\varTheta}^{*}_{k-1}\|_{% \mathcal{X}}=∥ over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT 0 , - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT = Θ~k1mk1Θ~k1*𝒳subscriptnormsubscriptsuperscript~Θsubscript𝑚𝑘1𝑘1subscriptsuperscript~Θ𝑘1𝒳\displaystyle\|\tilde{\varTheta}^{m_{k-1}}_{k-1}-\tilde{\varTheta}^{*}_{k-1}\|% _{\mathcal{X}}∥ over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT (21)
\displaystyle\leq Θk1mk1Θk1*Vk1,subscriptnormsubscriptsuperscriptΘsubscript𝑚𝑘1𝑘1subscriptsuperscriptΘ𝑘1subscript𝑉𝑘1\displaystyle\|\varTheta^{m_{k-1}}_{k-1}-\varTheta^{*}_{k-1}\|_{V_{k-1}},∥ roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ,

where the inequality follows from (5) in Lemma VI.2. Combining (13) to (21) completes the proof. ∎

The following corollary characterizes the upper bound of the approximation errors on Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT using the error on Gk1subscript𝐺𝑘1G_{k-1}italic_G start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT.

Corollary VI.1.

The following inequality holds for any {1,,mk}normal-ℓ1normal-…subscript𝑚𝑘\ell\in\{1,\dots,m_{k}\}roman_ℓ ∈ { 1 , … , italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } with probability one:

ΘkΘk*VkΘk1mk1Θk1*Vk1+ck,subscriptnormsubscriptsuperscriptΘ𝑘subscriptsuperscriptΘ𝑘subscript𝑉𝑘subscriptnormsubscriptsuperscriptΘsubscript𝑚𝑘1𝑘1subscriptsuperscriptΘ𝑘1subscript𝑉𝑘1subscript𝑐𝑘\displaystyle\|\varTheta^{\ell}_{k}-\varTheta^{*}_{k}\|_{V_{k}}\leq\|\varTheta% ^{m_{k-1}}_{k-1}-\varTheta^{*}_{k-1}\|_{V_{k-1}}+c_{k},∥ roman_Θ start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≤ ∥ roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ,

where ckΘ~k1*Θ~k*,𝒳𝒵normal-≜subscript𝑐𝑘subscriptnormsubscriptsuperscriptnormal-~normal-Θ𝑘1subscriptsuperscriptnormal-~normal-Θ𝑘𝒳𝒵c_{k}\triangleq\|\tilde{\varTheta}^{*}_{k-1}-\tilde{\varTheta}^{*,-}_{k}\|_{% \mathcal{X}\setminus\mathcal{Z}}italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≜ ∥ over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * , - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT and 𝒵𝒵\mathcal{Z}caligraphic_Z is sufficiently small.

Proof.

Fix xVk𝑥subscript𝑉𝑘x\in V_{k}italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. When xSk𝑥subscript𝑆𝑘x\in S_{k}italic_x ∈ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, it follows from Lemma VI.8 that |Θk(x)Θk*(x)|βkΘk1Θk*Vk+βkck1kΘk1Θk*Vk+cksubscriptsuperscriptΘ𝑘𝑥subscriptsuperscriptΘ𝑘𝑥subscript𝛽𝑘subscriptnormsubscriptsuperscriptΘ1𝑘subscriptsuperscriptΘ𝑘subscript𝑉𝑘subscript𝛽𝑘subscriptsuperscript𝑐𝑘𝑘1subscriptnormsubscriptsuperscriptΘ1𝑘subscriptsuperscriptΘ𝑘subscript𝑉𝑘subscript𝑐𝑘|\varTheta^{\ell}_{k}(x)-\varTheta^{*}_{k}(x)|\leq\beta_{k}\|\varTheta^{\ell-1% }_{k}-\varTheta^{*}_{k}\|_{V_{k}}+\beta_{k}c^{k}_{k-1}\leq\|\varTheta^{\ell-1}% _{k}-\varTheta^{*}_{k}\|_{V_{k}}+c_{k}| roman_Θ start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) | ≤ italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ roman_Θ start_POSTSUPERSCRIPT roman_ℓ - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_c start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ≤ ∥ roman_Θ start_POSTSUPERSCRIPT roman_ℓ - 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. When xVkSk𝑥subscript𝑉𝑘subscript𝑆𝑘x\in V_{k}\setminus S_{k}italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, the worst case is that no updates are imposed on x𝑥xitalic_x; in such case, we have Θk(x)=Θk0(x)superscriptsubscriptΘ𝑘𝑥superscriptsubscriptΘ𝑘0𝑥\varTheta_{k}^{\ell}(x)=\varTheta_{k}^{0}(x)roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT ( italic_x ) = roman_Θ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_x ). Therefore, |Θk(x)Θk*(x)|=|Θk0(x)Θk*(x)|Θk0Θk*Vk+cksubscriptsuperscriptΘ𝑘𝑥subscriptsuperscriptΘ𝑘𝑥subscriptsuperscriptΘ0𝑘𝑥subscriptsuperscriptΘ𝑘𝑥subscriptnormsubscriptsuperscriptΘ0𝑘subscriptsuperscriptΘ𝑘subscript𝑉𝑘subscript𝑐𝑘|\varTheta^{\ell}_{k}(x)-\varTheta^{*}_{k}(x)|=|\varTheta^{0}_{k}(x)-\varTheta% ^{*}_{k}(x)|\leq\|\varTheta^{0}_{k}-\varTheta^{*}_{k}\|_{V_{k}}+c_{k}| roman_Θ start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) | = | roman_Θ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) | ≤ ∥ roman_Θ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. This completes the proof. ∎

The following corollary characterizes the contraction property over P𝑃Pitalic_P consecutive graphs.

Corollary VI.2.

Consider k{k¯+1,,k¯+P}𝑘normal-¯𝑘1normal-⋯normal-¯𝑘𝑃k\in\{\bar{k}+1,\cdots,\bar{k}+P\}italic_k ∈ { over¯ start_ARG italic_k end_ARG + 1 , ⋯ , over¯ start_ARG italic_k end_ARG + italic_P } for k¯0normal-¯𝑘subscriptabsent0\bar{k}\in\mathbb{N}_{\geq 0}over¯ start_ARG italic_k end_ARG ∈ blackboard_N start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT and xSk𝑥subscript𝑆𝑘x\in S_{k}italic_x ∈ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. The following holds with probability one:

|Θkmk(x)Θk*(x)|βkmkΘk¯mk¯Θk¯*Vk¯+βkmkκ=k¯k1cκ,subscriptsuperscriptΘsubscript𝑚𝑘𝑘𝑥subscriptsuperscriptΘ𝑘𝑥superscriptsubscript𝛽𝑘subscript𝑚𝑘subscriptnormsubscriptsuperscriptΘsubscript𝑚¯𝑘¯𝑘subscriptsuperscriptΘ¯𝑘subscript𝑉¯𝑘superscriptsubscript𝛽𝑘subscript𝑚𝑘superscriptsubscript𝜅¯𝑘𝑘1subscript𝑐𝜅\displaystyle|\varTheta^{m_{k}}_{k}(x)-\varTheta^{*}_{k}(x)|\leq\beta_{k}^{m_{% k}}\|\varTheta^{m_{\bar{k}}}_{\bar{k}}-\varTheta^{*}_{\bar{k}}\|_{V_{\bar{k}}}% +\beta_{k}^{m_{k}}\sum_{\kappa=\bar{k}}^{k-1}c_{\kappa},| roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) | ≤ italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∥ roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_κ = over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ,

where cκΘ~κ1*Θ~κ*,𝒳𝒵normal-≜subscript𝑐𝜅subscriptnormsubscriptsuperscriptnormal-~normal-Θ𝜅1subscriptsuperscriptnormal-~normal-Θ𝜅𝒳𝒵c_{\kappa}\triangleq\|\tilde{\varTheta}^{*}_{\kappa-1}-\tilde{\varTheta}^{*,-}% _{\kappa}\|_{\mathcal{X}\setminus\mathcal{Z}}italic_c start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ≜ ∥ over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_κ - 1 end_POSTSUBSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * , - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT and 𝒵𝒵\mathcal{Z}caligraphic_Z is sufficiently small.

Proof.

Fix xSk𝑥subscript𝑆𝑘x\in S_{k}italic_x ∈ italic_S start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Applying Lemma VI.8 once and iteratively applying Corollary VI.1 to the left side renders at

|Θkmk(x)Θk*(x)|subscriptsuperscriptΘsubscript𝑚𝑘𝑘𝑥subscriptsuperscriptΘ𝑘𝑥\displaystyle|\varTheta^{m_{k}}_{k}(x)-\varTheta^{*}_{k}(x)|| roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) |
\displaystyle\leq βkmkΘk1mk1Θk1*Vk1+βkmkcksuperscriptsubscript𝛽𝑘subscript𝑚𝑘subscriptnormsubscriptsuperscriptΘsubscript𝑚𝑘1𝑘1subscriptsuperscriptΘ𝑘1subscript𝑉𝑘1superscriptsubscript𝛽𝑘subscript𝑚𝑘subscript𝑐𝑘\displaystyle\beta_{k}^{m_{k}}\|\varTheta^{m_{k-1}}_{k-1}-\varTheta^{*}_{k-1}% \|_{V_{k-1}}+\beta_{k}^{m_{k}}c_{k}italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∥ roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT
\displaystyle\leq βkmkΘk2mk2Θk2*Vk2+βkmkck1+βkmkcksuperscriptsubscript𝛽𝑘subscript𝑚𝑘subscriptnormsubscriptsuperscriptΘsubscript𝑚𝑘2𝑘2subscriptsuperscriptΘ𝑘2subscript𝑉𝑘2superscriptsubscript𝛽𝑘subscript𝑚𝑘subscript𝑐𝑘1superscriptsubscript𝛽𝑘subscript𝑚𝑘subscript𝑐𝑘\displaystyle\beta_{k}^{m_{k}}\|\varTheta^{m_{k-2}}_{k-2}-\varTheta^{*}_{k-2}% \|_{V_{k-2}}+\beta_{k}^{m_{k}}c_{k-1}+\beta_{k}^{m_{k}}c_{k}italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∥ roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k - 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 2 end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k - 2 end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_k - 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT
\displaystyle\leq βkmkΘk¯mk¯Θk¯*Vk¯+βkmkκ=k¯k1cκ+1.superscriptsubscript𝛽𝑘subscript𝑚𝑘subscriptnormsubscriptsuperscriptΘsubscript𝑚¯𝑘¯𝑘subscriptsuperscriptΘ¯𝑘subscript𝑉¯𝑘superscriptsubscript𝛽𝑘subscript𝑚𝑘superscriptsubscript𝜅¯𝑘𝑘1subscript𝑐𝜅1\displaystyle\cdots\leq\beta_{k}^{m_{k}}\|\varTheta^{m_{\bar{k}}}_{\bar{k}}-% \varTheta^{*}_{\bar{k}}\|_{V_{\bar{k}}}+\beta_{k}^{m_{k}}\sum_{\kappa=\bar{k}}% ^{k-1}c_{\kappa+1}.⋯ ≤ italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∥ roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_κ = over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k - 1 end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_κ + 1 end_POSTSUBSCRIPT .

This completes the proof. ∎

VI-D Proof of Theorem VI.1

In this subsection, we present the proof of Theorem VI.1.

Proof of Theorem VI.1.

Let k¯¯𝑘\bar{k}\in\mathbb{N}over¯ start_ARG italic_k end_ARG ∈ blackboard_N and consider an interval {k¯,k¯+1,,k¯+P}¯𝑘¯𝑘1¯𝑘𝑃\{\bar{k},\bar{k}+1,\dots,\bar{k}+P\}{ over¯ start_ARG italic_k end_ARG , over¯ start_ARG italic_k end_ARG + 1 , … , over¯ start_ARG italic_k end_ARG + italic_P }. We start with minx(x+dk¯+P)Vk¯+PΘk¯+Pmk¯+PΘ*𝒳𝒵subscriptnormsubscriptsuperscript𝑥𝑥subscript𝑑¯𝑘𝑃subscript𝑉¯𝑘𝑃subscriptsuperscriptΘsubscript𝑚¯𝑘𝑃¯𝑘𝑃superscriptΘ𝒳𝒵\|\min_{x^{\prime}\in(x+d_{\bar{k}+P}\mathcal{B})\cap V_{\bar{k}+P}}\varTheta^% {m_{\bar{k}+P}}_{\bar{k}+P}-\varTheta^{*}\|_{\mathcal{X}\setminus\mathcal{Z}}∥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT. It follows from the triangular inequality of the maximum norm that

minx(x+dk¯+P)Vk¯+PΘk¯+Pmk¯+PΘ*𝒳𝒵subscriptnormsubscriptsuperscript𝑥𝑥subscript𝑑¯𝑘𝑃subscript𝑉¯𝑘𝑃subscriptsuperscriptΘsubscript𝑚¯𝑘𝑃¯𝑘𝑃superscriptΘ𝒳𝒵\displaystyle\|\min_{x^{\prime}\in(x+d_{\bar{k}+P}\mathcal{B})\cap V_{\bar{k}+% P}}\varTheta^{m_{\bar{k}+P}}_{\bar{k}+P}-\varTheta^{*}\|_{\mathcal{X}\setminus% \mathcal{Z}}∥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT (22)
=\displaystyle== Θ~k¯+Pmk¯+PΘ*𝒳𝒵Θ~k¯+Pmk¯+PΘ~k¯+P*𝒳𝒵+𝔅k¯+Psubscriptnormsubscriptsuperscript~Θsubscript𝑚¯𝑘𝑃¯𝑘𝑃superscriptΘ𝒳𝒵subscriptnormsubscriptsuperscript~Θsubscript𝑚¯𝑘𝑃¯𝑘𝑃subscriptsuperscript~Θ¯𝑘𝑃𝒳𝒵subscript𝔅¯𝑘𝑃\displaystyle\|\tilde{\varTheta}^{m_{\bar{k}+P}}_{\bar{k}+P}-\varTheta^{*}\|_{% \mathcal{X}\setminus\mathcal{Z}}\leq\|\tilde{\varTheta}^{m_{\bar{k}+P}}_{\bar{% k}+P}-\tilde{\varTheta}^{*}_{\bar{k}+P}\|_{\mathcal{X}\setminus\mathcal{Z}}+% \mathfrak{B}_{\bar{k}+P}∥ over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT ≤ ∥ over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT + fraktur_B start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT
\displaystyle\leq 𝔄k¯+P+𝔅k¯+P,subscript𝔄¯𝑘𝑃subscript𝔅¯𝑘𝑃\displaystyle\mathfrak{A}_{\bar{k}+P}+\mathfrak{B}_{\bar{k}+P},fraktur_A start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT + fraktur_B start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ,

where 𝔄k¯+PΘk¯+Pmk¯+PΘk¯+P*Vk¯+Psubscript𝔄¯𝑘𝑃subscriptnormsubscriptsuperscriptΘsubscript𝑚¯𝑘𝑃¯𝑘𝑃subscriptsuperscriptΘ¯𝑘𝑃subscript𝑉¯𝑘𝑃\mathfrak{A}_{\bar{k}+P}\triangleq\|\varTheta^{m_{\bar{k}+P}}_{\bar{k}+P}-% \varTheta^{*}_{\bar{k}+P}\|_{V_{\bar{k}+P}}fraktur_A start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ≜ ∥ roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUBSCRIPT and 𝔅k¯+PΘ~k¯+P*Θ*𝒳𝒵subscript𝔅¯𝑘𝑃subscriptnormsubscriptsuperscript~Θ¯𝑘𝑃superscriptΘ𝒳𝒵\mathfrak{B}_{\bar{k}+P}\triangleq\|\tilde{\varTheta}^{*}_{\bar{k}+P}-% \varTheta^{*}\|_{\mathcal{X}\setminus\mathcal{Z}}fraktur_B start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ≜ ∥ over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT and the last inequality follows from (5) in Lemma VI.2.

We fix xVk¯+P𝑥subscript𝑉¯𝑘𝑃x\in V_{\bar{k}+P}italic_x ∈ italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT and let k^argmaxk¯kk¯+P^k(x)^𝑘subscriptargmax¯𝑘𝑘¯𝑘𝑃subscript^𝑘𝑥\hat{k}\triangleq\operatorname{argmax}_{\bar{k}\leq k\leq\bar{k}+P}\hat{% \mathcal{F}}_{k}(x)over^ start_ARG italic_k end_ARG ≜ roman_argmax start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG ≤ italic_k ≤ over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT over^ start_ARG caligraphic_F end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) be the last time when value at x𝑥xitalic_x is updated. Notice that k^^𝑘\hat{k}over^ start_ARG italic_k end_ARG is a function of x𝑥xitalic_x, but for the conciseness of the proof, we omit the depnedency on x𝑥xitalic_x in notation. It follows from Lemma VI.5 that k^k¯^𝑘¯𝑘\hat{k}\geq\bar{k}over^ start_ARG italic_k end_ARG ≥ over¯ start_ARG italic_k end_ARG and xSk^𝑥subscript𝑆^𝑘x\in S_{\hat{k}}italic_x ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT. Then the following holds:

|Θk¯+Pmk¯+P(x)Θk¯+P*(x)|=|Θk^mk^(x)Θk¯+P*(x)|subscriptsuperscriptΘsubscript𝑚¯𝑘𝑃¯𝑘𝑃𝑥subscriptsuperscriptΘ¯𝑘𝑃𝑥subscriptsuperscriptΘsubscript𝑚^𝑘^𝑘𝑥subscriptsuperscriptΘ¯𝑘𝑃𝑥\displaystyle|\varTheta^{m_{\bar{k}+P}}_{\bar{k}+P}(x)-\varTheta^{*}_{\bar{k}+% P}(x)|=|\varTheta^{m_{\hat{k}}}_{\hat{k}}(x)-\varTheta^{*}_{\bar{k}+P}(x)|| roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ( italic_x ) | = | roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ( italic_x ) | (23)
\displaystyle\leq |Θk^mk^(x)Θk^*(x)|+|Θk^*(x)Θk¯+P*(x)|,subscriptsuperscriptΘsubscript𝑚^𝑘^𝑘𝑥subscriptsuperscriptΘ^𝑘𝑥subscriptsuperscriptΘ^𝑘𝑥subscriptsuperscriptΘ¯𝑘𝑃𝑥\displaystyle|\varTheta^{m_{\hat{k}}}_{\hat{k}}(x)-\varTheta^{*}_{\hat{k}}(x)|% +|\varTheta^{*}_{\hat{k}}(x)-\varTheta^{*}_{\bar{k}+P}(x)|,| roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ( italic_x ) | + | roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ( italic_x ) | ,

where the inequality is a direct result of the triangular inequality of the maximum norm.

We first analyze the first term in (23). Since xSk^𝑥subscript𝑆^𝑘x\in S_{\hat{k}}italic_x ∈ italic_S start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT, it follows from Lemma VI.8 that

|Θk^mk^(x)Θk^*(x)|βk^mk^Θk^1mk^1Θk^1*Vk^1+βk^mk^ck^.subscriptsuperscriptΘsubscript𝑚^𝑘^𝑘𝑥subscriptsuperscriptΘ^𝑘𝑥superscriptsubscript𝛽^𝑘subscript𝑚^𝑘subscriptnormsubscriptsuperscriptΘsubscript𝑚^𝑘1^𝑘1subscriptsuperscriptΘ^𝑘1subscript𝑉^𝑘1superscriptsubscript𝛽^𝑘subscript𝑚^𝑘subscript𝑐^𝑘\displaystyle|\varTheta^{m_{\hat{k}}}_{\hat{k}}(x)-\varTheta^{*}_{\hat{k}}(x)|% \leq\beta_{\hat{k}}^{m_{\hat{k}}}\|\varTheta^{m_{\hat{k}-1}}_{\hat{k}-1}-% \varTheta^{*}_{\hat{k}-1}\|_{V_{\hat{k}-1}}+\beta_{\hat{k}}^{m_{\hat{k}}}c_{% \hat{k}}.| roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ( italic_x ) | ≤ italic_β start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∥ roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_β start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT .

Then we apply Corollary VI.2 and it arrives at

|Θk^mk^(x)Θk^*(x)|subscriptsuperscriptΘsubscript𝑚^𝑘^𝑘𝑥subscriptsuperscriptΘ^𝑘𝑥absent\displaystyle|\varTheta^{m_{\hat{k}}}_{\hat{k}}(x)-\varTheta^{*}_{\hat{k}}(x)|\leq| roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ( italic_x ) | ≤ βk^mk^Θk¯1mk¯1Θk¯1*Vk¯1+k¯1(x)superscriptsubscript𝛽^𝑘subscript𝑚^𝑘subscriptnormsubscriptsuperscriptΘsubscript𝑚¯𝑘1¯𝑘1subscriptsuperscriptΘ¯𝑘1subscript𝑉¯𝑘1subscript¯𝑘1𝑥\displaystyle\beta_{\hat{k}}^{m_{\hat{k}}}\|\varTheta^{m_{\bar{k}-1}}_{\bar{k}% -1}-\varTheta^{*}_{\bar{k}-1}\|_{V_{\bar{k}-1}}+\mathfrak{C}_{\bar{k}-1}(x)italic_β start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∥ roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + fraktur_C start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT ( italic_x ) (24)
=\displaystyle== βk^mk^𝔄k¯1+k¯1(x),superscriptsubscript𝛽^𝑘subscript𝑚^𝑘subscript𝔄¯𝑘1subscript¯𝑘1𝑥\displaystyle\beta_{\hat{k}}^{m_{\hat{k}}}\mathfrak{A}_{\bar{k}-1}+\mathfrak{C% }_{\bar{k}-1}(x),italic_β start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT fraktur_A start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT + fraktur_C start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT ( italic_x ) ,

where k¯(x)βk^mk^κ=k¯k^cκsubscript¯𝑘𝑥superscriptsubscript𝛽^𝑘subscript𝑚^𝑘superscriptsubscript𝜅¯𝑘^𝑘subscript𝑐𝜅\mathfrak{C}_{\bar{k}}(x)\triangleq\beta_{\hat{k}}^{m_{\hat{k}}}\sum_{\kappa=% \bar{k}}^{\hat{k}}c_{\kappa}fraktur_C start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ( italic_x ) ≜ italic_β start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_κ = over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT is a function of x𝑥xitalic_x, as k^^𝑘\hat{k}over^ start_ARG italic_k end_ARG inside the definition is a function of x𝑥xitalic_x.

We then analyze the second term. It follows from the triangular inequality that the second term renders at xVkfor-all𝑥subscript𝑉𝑘\forall x\in V_{k}∀ italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT,

|Θk^+1*(x)Θk¯+P*(x)|𝒞¯k^+1(x)+𝒞¯k¯+P(x),subscriptsuperscriptΘ^𝑘1𝑥subscriptsuperscriptΘ¯𝑘𝑃𝑥subscript¯𝒞^𝑘1𝑥subscript¯𝒞¯𝑘𝑃𝑥\displaystyle|\varTheta^{*}_{\hat{k}+1}(x)-\varTheta^{*}_{\bar{k}+P}(x)|\leq% \bar{\mathcal{C}}_{\hat{k}+1}(x)+\bar{\mathcal{C}}_{\bar{k}+P}(x),| roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG + 1 end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ( italic_x ) | ≤ over¯ start_ARG caligraphic_C end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG + 1 end_POSTSUBSCRIPT ( italic_x ) + over¯ start_ARG caligraphic_C end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ( italic_x ) , (25)

where 𝒞¯k(x)|Θk*(x)Θ*(x)|,xVkformulae-sequencesubscript¯𝒞𝑘𝑥subscriptsuperscriptΘ𝑘𝑥superscriptΘ𝑥for-all𝑥subscript𝑉𝑘\bar{\mathcal{C}}_{k}(x)\triangleq|\varTheta^{*}_{k}(x)-\varTheta^{*}(x)|,% \forall x\in V_{k}over¯ start_ARG caligraphic_C end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) ≜ | roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) | , ∀ italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and 𝒞¯k(x)=0subscript¯𝒞𝑘𝑥0\bar{\mathcal{C}}_{k}(x)=0over¯ start_ARG caligraphic_C end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) = 0 otherwise. Combining (24) and (25), we have

|Θk¯+Pmk¯+P(x)Θk¯+P*(x)|subscriptsuperscriptΘsubscript𝑚¯𝑘𝑃¯𝑘𝑃𝑥subscriptsuperscriptΘ¯𝑘𝑃𝑥\displaystyle|\varTheta^{m_{\bar{k}+P}}_{\bar{k}+P}(x)-\varTheta^{*}_{\bar{k}+% P}(x)|| roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ( italic_x ) | (26)
\displaystyle\leq βk^mk^𝔄k¯1+k¯1(x)+𝒞¯k^+1(x)+𝒞¯k¯+P(x).superscriptsubscript𝛽^𝑘subscript𝑚^𝑘subscript𝔄¯𝑘1subscript¯𝑘1𝑥subscript¯𝒞^𝑘1𝑥subscript¯𝒞¯𝑘𝑃𝑥\displaystyle\beta_{\hat{k}}^{m_{\hat{k}}}\mathfrak{A}_{\bar{k}-1}+\mathfrak{C% }_{\bar{k}-1}(x)+\bar{\mathcal{C}}_{\hat{k}+1}(x)+\bar{\mathcal{C}}_{\bar{k}+P% }(x).italic_β start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG end_POSTSUBSCRIPT end_POSTSUPERSCRIPT fraktur_A start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT + fraktur_C start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT ( italic_x ) + over¯ start_ARG caligraphic_C end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG + 1 end_POSTSUBSCRIPT ( italic_x ) + over¯ start_ARG caligraphic_C end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ( italic_x ) .

Since (26) holds for every xVk¯+P𝑥subscript𝑉¯𝑘𝑃x\in V_{\bar{k}+P}italic_x ∈ italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT, taking the maximum over Vk¯+Psubscript𝑉¯𝑘𝑃V_{\bar{k}+P}italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT renders at

𝔄k¯+P=Θk^+Pmk¯+P(x)Θk¯+P*(x)Vk¯+Psubscript𝔄¯𝑘𝑃subscriptnormsubscriptsuperscriptΘsubscript𝑚¯𝑘𝑃^𝑘𝑃𝑥subscriptsuperscriptΘ¯𝑘𝑃𝑥subscript𝑉¯𝑘𝑃\displaystyle\mathfrak{A}_{\bar{k}+P}=\|\varTheta^{m_{\bar{k}+P}}_{\hat{k}+P}(% x)-\varTheta^{*}_{\bar{k}+P}(x)\|_{V_{\bar{k}+P}}fraktur_A start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT = ∥ roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ( italic_x ) - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ( italic_x ) ∥ start_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUBSCRIPT (27)
\displaystyle\leq β¯k¯𝔄k¯1+maxxVk¯+P[k¯1(x)+𝒞¯k^+1(x)+𝒞¯k¯+P(x)],subscript¯𝛽¯𝑘subscript𝔄¯𝑘1subscript𝑥subscript𝑉¯𝑘𝑃subscript¯𝑘1𝑥subscript¯𝒞^𝑘1𝑥subscript¯𝒞¯𝑘𝑃𝑥\displaystyle\bar{\beta}_{\bar{k}}\mathfrak{A}_{\bar{k}-1}+\max_{x\in V_{\bar{% k}+P}}\Big{[}\mathfrak{C}_{\bar{k}-1}(x)+\bar{\mathcal{C}}_{\hat{k}+1}(x)+\bar% {\mathcal{C}}_{\bar{k}+P}(x)\Big{]},over¯ start_ARG italic_β end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT fraktur_A start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT + roman_max start_POSTSUBSCRIPT italic_x ∈ italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ fraktur_C start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT ( italic_x ) + over¯ start_ARG caligraphic_C end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG + 1 end_POSTSUBSCRIPT ( italic_x ) + over¯ start_ARG caligraphic_C end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ( italic_x ) ] ,

where β¯k¯+Pmaxk¯kk¯+Pβkmksubscript¯𝛽¯𝑘𝑃subscript¯𝑘𝑘¯𝑘𝑃superscriptsubscript𝛽𝑘subscript𝑚𝑘\displaystyle\bar{\beta}_{\bar{k}+P}\triangleq\max_{\bar{k}\leq k\leq\bar{k}+P% }\beta_{k}^{m_{k}}over¯ start_ARG italic_β end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ≜ roman_max start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG ≤ italic_k ≤ over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. Now we would like to show the maximum term in (27) converges to zero by Lemma VI.3.

Claim VI.4.

The following holds true:

limk¯maxxVk¯+P[k¯1(x)+𝒞¯k^+1(x)+𝒞¯k¯+P(x)]=0.subscript¯𝑘subscript𝑥subscript𝑉¯𝑘𝑃subscript¯𝑘1𝑥subscript¯𝒞^𝑘1𝑥subscript¯𝒞¯𝑘𝑃𝑥0\displaystyle\lim_{\bar{k}\to\infty}\max_{x\in V_{\bar{k}+P}}\Big{[}\mathfrak{% C}_{\bar{k}-1}(x)+\bar{\mathcal{C}}_{\hat{k}+1}(x)+\bar{\mathcal{C}}_{\bar{k}+% P}(x)\Big{]}=0.roman_lim start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG → ∞ end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT italic_x ∈ italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ fraktur_C start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT ( italic_x ) + over¯ start_ARG caligraphic_C end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG + 1 end_POSTSUBSCRIPT ( italic_x ) + over¯ start_ARG caligraphic_C end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ( italic_x ) ] = 0 .
Proof.

We first show k¯1(x)subscript¯𝑘1𝑥\mathfrak{C}_{\bar{k}-1}(x)fraktur_C start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT ( italic_x ) converges to zero for every xVk¯+P𝑥subscript𝑉¯𝑘𝑃x\in V_{\bar{k}+P}italic_x ∈ italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT. For notational simplicity, we use k¯(x)subscript¯𝑘𝑥\mathfrak{C}_{\bar{k}}(x)fraktur_C start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ( italic_x ) instead of k¯1(x)subscript¯𝑘1𝑥\mathfrak{C}_{\bar{k}-1}(x)fraktur_C start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT ( italic_x ) in this proof. Fix xVk¯+P𝑥subscript𝑉¯𝑘𝑃x\in V_{\bar{k}+P}italic_x ∈ italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT. Since βk1subscript𝛽𝑘1\beta_{k}\leq 1italic_β start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≤ 1, we have k¯(x)κ=k¯k¯+Pcκsubscript¯𝑘𝑥superscriptsubscript𝜅¯𝑘¯𝑘𝑃subscript𝑐𝜅\mathfrak{C}_{\bar{k}}(x)\leq\sum_{\kappa=\bar{k}}^{\bar{k}+P}c_{\kappa}fraktur_C start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ( italic_x ) ≤ ∑ start_POSTSUBSCRIPT italic_κ = over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT, a finite sum of cκsubscript𝑐𝜅c_{\kappa}italic_c start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT. It follows from Corollary .2 that

limκ+cκ=limκ+Θ~κ1*Θ~κ*,𝒳𝒵subscript𝜅subscript𝑐𝜅subscript𝜅subscriptnormsubscriptsuperscript~Θ𝜅1subscriptsuperscript~Θ𝜅𝒳𝒵\displaystyle\lim_{\kappa\to+\infty}c_{\kappa}=\lim_{\kappa\to+\infty}\|\tilde% {\varTheta}^{*}_{\kappa-1}-\tilde{\varTheta}^{*,-}_{\kappa}\|_{\mathcal{X}% \setminus\mathcal{Z}}roman_lim start_POSTSUBSCRIPT italic_κ → + ∞ end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT = roman_lim start_POSTSUBSCRIPT italic_κ → + ∞ end_POSTSUBSCRIPT ∥ over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_κ - 1 end_POSTSUBSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * , - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT
\displaystyle\leq limκ+Θ~κ1*Θ*𝒳𝒵+limκ+Θ*Θ~κ*,𝒳𝒵=0.subscript𝜅subscriptnormsubscriptsuperscript~Θ𝜅1superscriptΘ𝒳𝒵subscript𝜅subscriptnormsuperscriptΘsubscriptsuperscript~Θ𝜅𝒳𝒵0\displaystyle\lim_{\kappa\to+\infty}\|\tilde{\varTheta}^{*}_{\kappa-1}-% \varTheta^{*}\|_{\mathcal{X}\setminus\mathcal{Z}}+\lim_{\kappa\to+\infty}\|% \varTheta^{*}-\tilde{\varTheta}^{*,-}_{\kappa}\|_{\mathcal{X}\setminus\mathcal% {Z}}=0.roman_lim start_POSTSUBSCRIPT italic_κ → + ∞ end_POSTSUBSCRIPT ∥ over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_κ - 1 end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT + roman_lim start_POSTSUBSCRIPT italic_κ → + ∞ end_POSTSUBSCRIPT ∥ roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT - over~ start_ARG roman_Θ end_ARG start_POSTSUPERSCRIPT * , - end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT = 0 .

Then we have

limk¯+¯k¯(x)limk¯+κ=k¯k¯+Pcκ=i=0Plimk¯+ck¯+i=0.subscript¯𝑘subscript¯¯𝑘𝑥subscript¯𝑘superscriptsubscript𝜅¯𝑘¯𝑘𝑃subscript𝑐𝜅superscriptsubscript𝑖0𝑃subscript¯𝑘subscript𝑐¯𝑘𝑖0\displaystyle\lim_{\bar{k}\to+\infty}\bar{\mathfrak{C}}_{\bar{k}}(x)\leq\lim_{% \bar{k}\to+\infty}\sum_{\kappa=\bar{k}}^{\bar{k}+P}c_{\kappa}=\sum_{i=0}^{P}% \lim_{\bar{k}\to+\infty}c_{\bar{k}+i}=0.roman_lim start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG → + ∞ end_POSTSUBSCRIPT over¯ start_ARG fraktur_C end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ( italic_x ) ≤ roman_lim start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG → + ∞ end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_κ = over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUPERSCRIPT italic_c start_POSTSUBSCRIPT italic_κ end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_P end_POSTSUPERSCRIPT roman_lim start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG → + ∞ end_POSTSUBSCRIPT italic_c start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_i end_POSTSUBSCRIPT = 0 .

Since ¯k¯(x)0subscript¯¯𝑘𝑥0\bar{\mathfrak{C}}_{\bar{k}}(x)\geq 0over¯ start_ARG fraktur_C end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ( italic_x ) ≥ 0, this completes the proof of limk¯+k¯(x)=0subscript¯𝑘subscript¯𝑘𝑥0\lim_{\bar{k}\to+\infty}\mathfrak{C}_{\bar{k}}(x)=0roman_lim start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG → + ∞ end_POSTSUBSCRIPT fraktur_C start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT ( italic_x ) = 0.

Then we show the rest two terms also converge to zero. It follows from Corollary .1 that both 𝒞¯k^+1(x)subscript¯𝒞^𝑘1𝑥\bar{\mathcal{C}}_{\hat{k}+1}(x)over¯ start_ARG caligraphic_C end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG + 1 end_POSTSUBSCRIPT ( italic_x ) and 𝒞¯k¯+P(x)subscript¯𝒞¯𝑘𝑃𝑥\bar{\mathcal{C}}_{\bar{k}+P}(x)over¯ start_ARG caligraphic_C end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ( italic_x ) converges to zero for every xVk¯+P𝑥subscript𝑉¯𝑘𝑃x\in V_{\bar{k}+P}italic_x ∈ italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT as k¯+¯𝑘\bar{k}\to+\inftyover¯ start_ARG italic_k end_ARG → + ∞. Then it follows from Lemma VI.3 that limk¯maxxVk¯+P[k¯1(x)+𝒞¯k^+1(x)+𝒞¯k¯+P(x)]=0.subscript¯𝑘subscript𝑥subscript𝑉¯𝑘𝑃subscript¯𝑘1𝑥subscript¯𝒞^𝑘1𝑥subscript¯𝒞¯𝑘𝑃𝑥0\lim_{\bar{k}\to\infty}\max_{x\in V_{\bar{k}+P}}\Big{[}\mathfrak{C}_{\bar{k}-1% }(x)+\bar{\mathcal{C}}_{\hat{k}+1}(x)+\bar{\mathcal{C}}_{\bar{k}+P}(x)\Big{]}=0.roman_lim start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG → ∞ end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT italic_x ∈ italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ fraktur_C start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT ( italic_x ) + over¯ start_ARG caligraphic_C end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG + 1 end_POSTSUBSCRIPT ( italic_x ) + over¯ start_ARG caligraphic_C end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ( italic_x ) ] = 0 . This completes the proof. ∎

For brevity, we rewrite (27) as

𝔄k˘(τ)subscript𝔄˘𝑘𝜏absent\displaystyle\mathfrak{A}_{\breve{k}(\tau)}\leqfraktur_A start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( italic_τ ) end_POSTSUBSCRIPT ≤ β¯k˘(τ)𝔄k˘(τ1)+𝔇k˘(τ),subscript¯𝛽˘𝑘𝜏subscript𝔄˘𝑘𝜏1subscript𝔇˘𝑘𝜏\displaystyle\bar{\beta}_{\breve{k}(\tau)}\mathfrak{A}_{\breve{k}(\tau-1)}+% \mathfrak{D}_{\breve{k}(\tau)},over¯ start_ARG italic_β end_ARG start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( italic_τ ) end_POSTSUBSCRIPT fraktur_A start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( italic_τ - 1 ) end_POSTSUBSCRIPT + fraktur_D start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( italic_τ ) end_POSTSUBSCRIPT , (28)

where k˘(τ)k¯+τ(P+1)˘𝑘𝜏¯𝑘𝜏𝑃1\breve{k}(\tau)\triangleq\bar{k}+\tau(P+1)over˘ start_ARG italic_k end_ARG ( italic_τ ) ≜ over¯ start_ARG italic_k end_ARG + italic_τ ( italic_P + 1 ) and 𝔇k¯+PmaxxVk¯+P[k¯1(x)+𝒞¯k^+1(x)+𝒞¯k¯+P(x)]subscript𝔇¯𝑘𝑃subscript𝑥subscript𝑉¯𝑘𝑃subscript¯𝑘1𝑥subscript¯𝒞^𝑘1𝑥subscript¯𝒞¯𝑘𝑃𝑥\displaystyle\mathfrak{D}_{\bar{k}+P}\triangleq\max_{x\in V_{\bar{k}+P}}\Big{[% }\mathfrak{C}_{\bar{k}-1}(x)+\bar{\mathcal{C}}_{\hat{k}+1}(x)+\bar{\mathcal{C}% }_{\bar{k}+P}(x)\Big{]}fraktur_D start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ≜ roman_max start_POSTSUBSCRIPT italic_x ∈ italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ fraktur_C start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG - 1 end_POSTSUBSCRIPT ( italic_x ) + over¯ start_ARG caligraphic_C end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_k end_ARG + 1 end_POSTSUBSCRIPT ( italic_x ) + over¯ start_ARG caligraphic_C end_ARG start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT ( italic_x ) ]. Notice that limk+𝔇k=0subscript𝑘subscript𝔇𝑘0\lim_{k\to+\infty}\mathfrak{D}_{k}=0roman_lim start_POSTSUBSCRIPT italic_k → + ∞ end_POSTSUBSCRIPT fraktur_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 0 as a result of Claim VI.4. By Assumption V.2, we continue to relax 𝔄k˘(τ1)subscript𝔄˘𝑘𝜏1\mathfrak{A}_{\breve{k}(\tau-1)}fraktur_A start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( italic_τ - 1 ) end_POSTSUBSCRIPT over T𝑇Titalic_T intervals, each of which consists of P+1𝑃1P+1italic_P + 1 graphs, and it holds for any ζ0𝜁subscriptabsent0\zeta\in\mathbb{N}_{\geq 0}italic_ζ ∈ blackboard_N start_POSTSUBSCRIPT ≥ 0 end_POSTSUBSCRIPT that

𝔄k˘((ζ+1)T)subscript𝔄˘𝑘𝜁1𝑇\displaystyle\mathfrak{A}_{\breve{k}((\zeta+1)T)}fraktur_A start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( ( italic_ζ + 1 ) italic_T ) end_POSTSUBSCRIPT (29)
\displaystyle\leq τ=ζT+1(ζ+1)Tβ¯k˘(τ)𝔄k˘(ζT)+τ=ζT+1(ζ+1)T𝔇k˘(τ)τ=τ+1(ζ+1)Tβ¯k˘(τ)superscriptsubscriptproduct𝜏𝜁𝑇1𝜁1𝑇subscript¯𝛽˘𝑘𝜏subscript𝔄˘𝑘𝜁𝑇superscriptsubscript𝜏𝜁𝑇1𝜁1𝑇subscript𝔇˘𝑘𝜏superscriptsubscriptproductsuperscript𝜏𝜏1𝜁1𝑇subscript¯𝛽˘𝑘superscript𝜏\displaystyle\prod_{\tau=\zeta T+1}^{(\zeta+1)T}\bar{\beta}_{\breve{k}(\tau)}% \mathfrak{A}_{\breve{k}(\zeta T)}+\sum_{\tau=\zeta T+1}^{(\zeta+1)T}\mathfrak{% D}_{\breve{k}(\tau)}\prod_{\tau^{\prime}=\tau+1}^{(\zeta+1)T}\bar{\beta}_{% \breve{k}(\tau^{\prime})}∏ start_POSTSUBSCRIPT italic_τ = italic_ζ italic_T + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ζ + 1 ) italic_T end_POSTSUPERSCRIPT over¯ start_ARG italic_β end_ARG start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( italic_τ ) end_POSTSUBSCRIPT fraktur_A start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( italic_ζ italic_T ) end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_τ = italic_ζ italic_T + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ζ + 1 ) italic_T end_POSTSUPERSCRIPT fraktur_D start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( italic_τ ) end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_τ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_τ + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ζ + 1 ) italic_T end_POSTSUPERSCRIPT over¯ start_ARG italic_β end_ARG start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( italic_τ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUBSCRIPT
\displaystyle\leq β¯𝔄k˘(ζT)+𝔇˘ζT,¯𝛽subscript𝔄˘𝑘𝜁𝑇subscript˘𝔇𝜁𝑇\displaystyle\bar{\beta}\mathfrak{A}_{\breve{k}(\zeta T)}+\breve{\mathfrak{D}}% _{\zeta T},over¯ start_ARG italic_β end_ARG fraktur_A start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( italic_ζ italic_T ) end_POSTSUBSCRIPT + over˘ start_ARG fraktur_D end_ARG start_POSTSUBSCRIPT italic_ζ italic_T end_POSTSUBSCRIPT ,

where 𝔇˘ζTτ=ζT+1(ζ+1)T𝔇k˘(τ)τ=τ+1(ζ+1)Tβ¯k˘(τ)subscript˘𝔇𝜁𝑇superscriptsubscript𝜏𝜁𝑇1𝜁1𝑇subscript𝔇˘𝑘𝜏superscriptsubscriptproductsuperscript𝜏𝜏1𝜁1𝑇subscript¯𝛽˘𝑘superscript𝜏\breve{\mathfrak{D}}_{\zeta T}\triangleq\sum_{\tau=\zeta T+1}^{(\zeta+1)T}% \mathfrak{D}_{\breve{k}(\tau)}\prod_{\tau^{\prime}=\tau+1}^{(\zeta+1)T}\bar{% \beta}_{\breve{k}(\tau^{\prime})}over˘ start_ARG fraktur_D end_ARG start_POSTSUBSCRIPT italic_ζ italic_T end_POSTSUBSCRIPT ≜ ∑ start_POSTSUBSCRIPT italic_τ = italic_ζ italic_T + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ζ + 1 ) italic_T end_POSTSUPERSCRIPT fraktur_D start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( italic_τ ) end_POSTSUBSCRIPT ∏ start_POSTSUBSCRIPT italic_τ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_τ + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_ζ + 1 ) italic_T end_POSTSUPERSCRIPT over¯ start_ARG italic_β end_ARG start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( italic_τ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_POSTSUBSCRIPT is a finite sum of 𝔇ksubscript𝔇𝑘\mathfrak{D}_{k}fraktur_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Since limτ+𝔇k˘(τ)=limk+𝔇k=0subscript𝜏subscript𝔇˘𝑘𝜏subscript𝑘subscript𝔇𝑘0\displaystyle\lim_{\tau\to+\infty}\mathfrak{D}_{\breve{k}(\tau)}=\lim_{k\to+% \infty}\mathfrak{D}_{k}=0roman_lim start_POSTSUBSCRIPT italic_τ → + ∞ end_POSTSUBSCRIPT fraktur_D start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( italic_τ ) end_POSTSUBSCRIPT = roman_lim start_POSTSUBSCRIPT italic_k → + ∞ end_POSTSUBSCRIPT fraktur_D start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 0, 𝔇˘ζTsubscript˘𝔇𝜁𝑇\breve{\mathfrak{D}}_{\zeta T}over˘ start_ARG fraktur_D end_ARG start_POSTSUBSCRIPT italic_ζ italic_T end_POSTSUBSCRIPT converges to 00 as ζ+𝜁\zeta\to+\inftyitalic_ζ → + ∞. It follows from Lemma VII.5 in [35] that limk+𝔄k=limζ+𝔄k˘(ζT)=0subscript𝑘subscript𝔄𝑘subscript𝜁subscript𝔄˘𝑘𝜁𝑇0\displaystyle\lim_{k\to+\infty}\mathfrak{A}_{k}=\lim_{\zeta\to+\infty}% \mathfrak{A}_{\breve{k}(\zeta T)}=0roman_lim start_POSTSUBSCRIPT italic_k → + ∞ end_POSTSUBSCRIPT fraktur_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = roman_lim start_POSTSUBSCRIPT italic_ζ → + ∞ end_POSTSUBSCRIPT fraktur_A start_POSTSUBSCRIPT over˘ start_ARG italic_k end_ARG ( italic_ζ italic_T ) end_POSTSUBSCRIPT = 0. It follows from Corollary .2 that limk¯+𝔅k¯=0subscript¯𝑘subscript𝔅¯𝑘0\lim_{\bar{k}\to+\infty}\mathfrak{B}_{\bar{k}}=0roman_lim start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG → + ∞ end_POSTSUBSCRIPT fraktur_B start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG end_POSTSUBSCRIPT = 0. Together with (22), we have

limk+minx(x+dk)VkΘkmkΘ*𝒳𝒵subscript𝑘subscriptnormsubscriptsuperscript𝑥𝑥subscript𝑑𝑘subscript𝑉𝑘subscriptsuperscriptΘsubscript𝑚𝑘𝑘superscriptΘ𝒳𝒵\displaystyle\lim_{k\to+\infty}\|\min_{x^{\prime}\in(x+d_{k}\mathcal{B})\cap V% _{k}}\varTheta^{m_{k}}_{k}-\varTheta^{*}\|_{\mathcal{X}\setminus\mathcal{Z}}roman_lim start_POSTSUBSCRIPT italic_k → + ∞ end_POSTSUBSCRIPT ∥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT
=\displaystyle== limk¯+minx(x+dk¯+P)Vk¯+PΘk¯+Pmk¯+PΘ*𝒳𝒵subscript¯𝑘subscriptnormsubscriptsuperscript𝑥𝑥subscript𝑑¯𝑘𝑃subscript𝑉¯𝑘𝑃subscriptsuperscriptΘsubscript𝑚¯𝑘𝑃¯𝑘𝑃superscriptΘ𝒳𝒵\displaystyle\lim_{\bar{k}\to+\infty}\|\min_{x^{\prime}\in(x+d_{\bar{k}+P}% \mathcal{B})\cap V_{\bar{k}+P}}\varTheta^{m_{\bar{k}+P}}_{\bar{k}+P}-\varTheta% ^{*}\|_{\mathcal{X}\setminus\mathcal{Z}}roman_lim start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG → + ∞ end_POSTSUBSCRIPT ∥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT italic_m start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT end_POSTSUPERSCRIPT start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT - roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT
\displaystyle\leq limk¯+𝔄k¯+P+limk¯+𝔅k¯+P=0.subscript¯𝑘subscript𝔄¯𝑘𝑃subscript¯𝑘subscript𝔅¯𝑘𝑃0\displaystyle\lim_{\bar{k}\to+\infty}\mathfrak{A}_{\bar{k}+P}+\lim_{\bar{k}\to% +\infty}\mathfrak{B}_{\bar{k}+P}=0.roman_lim start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG → + ∞ end_POSTSUBSCRIPT fraktur_A start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT + roman_lim start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG → + ∞ end_POSTSUBSCRIPT fraktur_B start_POSTSUBSCRIPT over¯ start_ARG italic_k end_ARG + italic_P end_POSTSUBSCRIPT = 0 .

This completes the proof. ∎

VII Numerical Results and Discussion

In this section, we first present a computation-saving adaptation of the proposed iPolicy algorithm for a class of simpler but widely used dynamic systems. Then we present results of numerical experiments for three differential systems - point mass, simple car and Dubins car. Using the point mass, we show the correctness of the theoretical results. We show several simulations for unicycle robots in obstacle-free as well as cluttered environment to perform complex maneuvers like automatic parking. The experiments presented in this section are done to investigate the following characteristics of the proposed iPolicy algorithm.

  1. 1.

    Can iPolicy recover the optimal value function for systems with differential constraints?

  2. 2.

    Can the state-based feedback controllers handle complex motion maneuvers in the presence of obstacles?

  3. 3.

    Can we demonstrate the anytime and incremental nature of the algorithm? Is the algorithm able to monotonically improve the plan towards the goal?

In the rest of this section, we will try to answer the above questions with systems of increasing complexity.

Refer to caption
(a) The estimated value function with 761761761761 samples after 21.821.821.821.8 seconds.
Refer to caption
(b) The estimated value function with 1,93419341,9341 , 934 samples after 205.7205.7205.7205.7 seconds.
Refer to caption
(c) The estimated value function with 16,9791697916,97916 , 979 samples after 20062.720062.720062.720062.7 seconds.
Refer to caption
(d) The root of mean squared errors over total number of samples on 5555 runs.
Figure 2: The estimated value function obtained for a point mass in the presence of obstacles and its convergence with the computational time. Goal region for the point-mass is centered at (0,0)00(0,0)( 0 , 0 ) with red dash line. The colorbar in the plots represent the approximate time to the goal region. Figure 2d shows the errors over 5555 independent runs. As seen in Figure 2d, iPolicy achieves faster convergence compared to the multigrid method.

VII-A Computation-saving query

In this subsection, we introduce a class of simpler but widely used dynamical systems in the robotics community and an adaptation for iPolicy that significantly reduces computations in practice but maintains the optimality in Theorem V.1. Consider a class of systems that, in addition to the satisfaction of Assumption IV.1, also satisfy the following assumption:

Assumption VII.1.

System (1) satisfies that for any x𝒳,u𝒰formulae-sequence𝑥𝒳𝑢𝒰x\in\mathcal{X},\exists u\in\mathcal{U}italic_x ∈ caligraphic_X , ∃ italic_u ∈ caligraphic_U s.t. f(x,u)=0𝑓𝑥𝑢0f(x,u)=0italic_f ( italic_x , italic_u ) = 0.

In other words, Assumption VII.1 indicates the robot can instantly stop at any state. The following lemma shows the one-hop neighbors Fk(x)subscript𝐹𝑘𝑥F_{k}(x)italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) is monotonically decreasing in k𝑘kitalic_k.

Lemma VII.1.

Suppose Assumptions IV.1, V.1 and VII.1 hold. Then it holds that Fk(x)Fk(x)(VkVk),x𝒳freeformulae-sequencesubscript𝐹superscript𝑘normal-′𝑥subscript𝐹𝑘𝑥subscript𝑉superscript𝑘normal-′subscript𝑉𝑘for-all𝑥subscript𝒳normal-freeF_{k^{\prime}}(x)\subseteq F_{k}(x)\cup(V_{k^{\prime}}\setminus V_{k}),\forall x% \in\mathcal{X}_{\mathrm{free}}italic_F start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_x ) ⊆ italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) ∪ ( italic_V start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∖ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) , ∀ italic_x ∈ caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT and kksuperscript𝑘normal-′𝑘k^{\prime}\geq kitalic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≥ italic_k.

Proof.

It follows from Assumption V.1 that dk,ϵksubscript𝑑𝑘subscriptitalic-ϵ𝑘d_{k},\epsilon_{k}italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and ρ(dk,ϵk)𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘\rho(d_{k},\epsilon_{k})italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) are monotonically decreasing in k𝑘kitalic_k. Recall the definition of Fk(x)subscript𝐹𝑘𝑥F_{k}(x)italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) in (3) follows Fk(x)=(x+ϵku𝒰f(x,u)+ρ(dk,ϵk))Vk(𝒳free+dk)subscript𝐹𝑘𝑥𝑥subscriptitalic-ϵ𝑘subscript𝑢𝒰𝑓𝑥𝑢𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘subscript𝑉𝑘subscript𝒳freesubscript𝑑𝑘F_{k}(x)=(x+\epsilon_{k}\bigcup_{u\in\mathcal{U}}f(x,u)+\rho(d_{k},\epsilon_{k% })\mathcal{B})\cap V_{k}\cap(\mathcal{X}_{\mathrm{free}}+d_{k}\mathcal{B})italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) = ( italic_x + italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ⋃ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT italic_f ( italic_x , italic_u ) + italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∩ ( caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ). Fix x𝒳free𝑥subscript𝒳freex\in\mathcal{X}_{\mathrm{free}}italic_x ∈ caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT and xFk(x)superscript𝑥subscript𝐹superscript𝑘𝑥x^{\prime}\in F_{k^{\prime}}(x)italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_x ). We proceed to prove xFk(x)(VkVk)superscript𝑥subscript𝐹𝑘𝑥subscript𝑉superscript𝑘subscript𝑉𝑘x^{\prime}\in F_{k}(x)\cup(V_{k^{\prime}}\setminus V_{k})italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) ∪ ( italic_V start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∖ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) by showing xsuperscript𝑥x^{\prime}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is included in each part of Fk(x)subscript𝐹𝑘𝑥F_{k}(x)italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x )

We first show xx+ϵku𝒰f(x,u)+ρ(dk,ϵk)superscript𝑥𝑥subscriptitalic-ϵ𝑘subscript𝑢𝒰𝑓𝑥𝑢𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘x^{\prime}\in x+\epsilon_{k}\bigcup_{u\in\mathcal{U}}f(x,u)+\rho(d_{k},% \epsilon_{k})\mathcal{B}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_x + italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ⋃ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT italic_f ( italic_x , italic_u ) + italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B. Since xx+ϵku𝒰f(x,u)+ρ(dk,ϵk)superscript𝑥𝑥subscriptitalic-ϵsuperscript𝑘subscript𝑢𝒰𝑓𝑥𝑢𝜌subscript𝑑superscript𝑘subscriptitalic-ϵsuperscript𝑘x^{\prime}\in x+\epsilon_{k^{\prime}}\bigcup_{u\in\mathcal{U}}f(x,u)+\rho(d_{k% ^{\prime}},\epsilon_{k^{\prime}})\mathcal{B}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_x + italic_ϵ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ⋃ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT italic_f ( italic_x , italic_u ) + italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) caligraphic_B, there exist u𝒰,bformulae-sequencesuperscript𝑢𝒰𝑏u^{\prime}\in\mathcal{U},b\in\mathcal{B}italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_U , italic_b ∈ caligraphic_B and α[0,ρ(dk,ϵk)]𝛼0𝜌subscript𝑑superscript𝑘subscriptitalic-ϵsuperscript𝑘\alpha\in[0,\rho(d_{k^{\prime}},\epsilon_{k^{\prime}})]italic_α ∈ [ 0 , italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) ] s.t. x=x+ϵkf(x,u)+αbsuperscript𝑥𝑥subscriptitalic-ϵsuperscript𝑘𝑓𝑥superscript𝑢𝛼𝑏x^{\prime}=x+\epsilon_{k^{\prime}}f(x,u^{\prime})+\alpha bitalic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_x + italic_ϵ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_f ( italic_x , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) + italic_α italic_b. It follows from Assumptions (A4) and VII.1 that for any u𝒰𝑢𝒰u\in\mathcal{U}italic_u ∈ caligraphic_U and ζ[0,1]𝜁01\zeta\in[0,1]italic_ζ ∈ [ 0 , 1 ], ζf(x,u)u𝒰f(x,u)𝜁𝑓𝑥𝑢subscript𝑢𝒰𝑓𝑥𝑢\zeta f(x,u)\in\bigcup_{u\in\mathcal{U}}f(x,u)italic_ζ italic_f ( italic_x , italic_u ) ∈ ⋃ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT italic_f ( italic_x , italic_u ). By the decreasing monotonicity of ϵksubscriptitalic-ϵ𝑘\epsilon_{k}italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT in k𝑘kitalic_k, this implies ϵkf(x,u)=ϵkϵkϵkf(x,u)ϵku𝒰f(x,u)subscriptitalic-ϵsuperscript𝑘𝑓𝑥superscript𝑢subscriptitalic-ϵsuperscript𝑘subscriptitalic-ϵ𝑘subscriptitalic-ϵ𝑘𝑓𝑥superscript𝑢subscriptitalic-ϵ𝑘subscript𝑢𝒰𝑓𝑥𝑢\epsilon_{k^{\prime}}f(x,u^{\prime})=\frac{\epsilon_{k^{\prime}}}{\epsilon_{k}% }\epsilon_{k}f(x,u^{\prime})\in\epsilon_{k}\bigcup_{u\in\mathcal{U}}f(x,u)italic_ϵ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_f ( italic_x , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = divide start_ARG italic_ϵ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT end_ARG start_ARG italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_f ( italic_x , italic_u start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∈ italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ⋃ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT italic_f ( italic_x , italic_u ). It again follows from the decreasing monotonicity of ρ(dk,ϵk)𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘\rho(d_{k},\epsilon_{k})italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) that αbρ(dk,ϵk)𝛼𝑏𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘\alpha b\in\rho(d_{k},\epsilon_{k})\mathcal{B}italic_α italic_b ∈ italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B. In summary, we have xx+ϵku𝒰f(x,u)+ρ(dk,ϵk)superscript𝑥𝑥subscriptitalic-ϵ𝑘subscript𝑢𝒰𝑓𝑥𝑢𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘x^{\prime}\in x+\epsilon_{k}\bigcup_{u\in\mathcal{U}}f(x,u)+\rho(d_{k},% \epsilon_{k})\mathcal{B}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_x + italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ⋃ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT italic_f ( italic_x , italic_u ) + italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B.

Then we show xVk(VkVk)superscript𝑥subscript𝑉𝑘subscript𝑉superscript𝑘subscript𝑉𝑘x^{\prime}\in V_{k}\cup(V_{k^{\prime}}\setminus V_{k})italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∪ ( italic_V start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∖ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ). Since Vk(VkVk)=Vksubscript𝑉𝑘subscript𝑉superscript𝑘subscript𝑉𝑘subscript𝑉superscript𝑘V_{k}\cup(V_{k^{\prime}}\setminus V_{k})=V_{k^{\prime}}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∪ ( italic_V start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∖ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = italic_V start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT, this trivially holds.

Finally, we show x𝒳free+dksuperscript𝑥subscript𝒳freesubscript𝑑𝑘x^{\prime}\in\mathcal{X}_{\mathrm{free}}+d_{k}\mathcal{B}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B. Since x𝒳free+ρ(dk,ϵk)superscript𝑥subscript𝒳free𝜌subscript𝑑superscript𝑘subscriptitalic-ϵsuperscript𝑘x^{\prime}\in\mathcal{X}_{\mathrm{free}}+\rho(d_{k^{\prime}},\epsilon_{k^{% \prime}})\mathcal{B}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT + italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) caligraphic_B, there exist xf𝒳free,bformulae-sequencesubscript𝑥𝑓subscript𝒳free𝑏x_{f}\in\mathcal{X}_{\mathrm{free}},b\in\mathcal{B}italic_x start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT , italic_b ∈ caligraphic_B and α[0,dk]𝛼0subscript𝑑superscript𝑘\alpha\in[0,d_{k^{\prime}}]italic_α ∈ [ 0 , italic_d start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ] that x=xf+αbsuperscript𝑥subscript𝑥𝑓𝛼𝑏x^{\prime}=x_{f}+\alpha bitalic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_x start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT + italic_α italic_b. It follows from the decreasing monotonicity of dksubscript𝑑𝑘d_{k}italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT that αbdk𝛼𝑏subscript𝑑𝑘\alpha b\in d_{k}\mathcal{B}italic_α italic_b ∈ italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B. The proof of x𝒳free+dksuperscript𝑥subscript𝒳freesubscript𝑑𝑘x^{\prime}\in\mathcal{X}_{\mathrm{free}}+d_{k}\mathcal{B}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B is completed.

The above arguments show that xF(x)(VkVk)superscript𝑥𝐹𝑥subscript𝑉superscript𝑘subscript𝑉𝑘x^{\prime}\in F(x)\cup(V_{k^{\prime}}\setminus V_{k})italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F ( italic_x ) ∪ ( italic_V start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∖ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ). Since this holds for every xFk(x)superscript𝑥subscript𝐹superscript𝑘𝑥x^{\prime}\in F_{k^{\prime}}(x)italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_x ) and x𝒳free𝑥subscript𝒳freex\in\mathcal{X}_{\mathrm{free}}italic_x ∈ caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT, the proof is completed. ∎

1 Input: Graph index k𝑘kitalic_k, newly added sample xnewsubscript𝑥newx_{\mathrm{new}}italic_x start_POSTSUBSCRIPT roman_new end_POSTSUBSCRIPT;
2 Compute F(xnew)𝐹subscript𝑥newF(x_{\mathrm{new}})italic_F ( italic_x start_POSTSUBSCRIPT roman_new end_POSTSUBSCRIPT ) as (3);
3 for xVk{xnew}𝑥subscript𝑉𝑘subscript𝑥normal-newx\in V_{k}\setminus\{x_{\mathrm{new}}\}italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ { italic_x start_POSTSUBSCRIPT roman_new end_POSTSUBSCRIPT } do
4       if xnewx+ϵku𝒰f(x,u)+ρ(dk,ϵk)subscript𝑥normal-new𝑥subscriptitalic-ϵ𝑘subscript𝑢𝒰𝑓𝑥𝑢𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘x_{\mathrm{new}}\in x+\epsilon_{k}\bigcup_{u\in\mathcal{U}}f(x,u)+\rho(d_{k},% \epsilon_{k})\mathcal{B}italic_x start_POSTSUBSCRIPT roman_new end_POSTSUBSCRIPT ∈ italic_x + italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ⋃ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT italic_f ( italic_x , italic_u ) + italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B then
5             F(x)F(x){xnew}𝐹𝑥𝐹𝑥subscript𝑥newF(x)\leftarrow F(x)\cup\{x_{\mathrm{new}}\}italic_F ( italic_x ) ← italic_F ( italic_x ) ∪ { italic_x start_POSTSUBSCRIPT roman_new end_POSTSUBSCRIPT };
6            
7      
8 for xVk{xnew}𝑥subscript𝑉𝑘subscript𝑥normal-newx\in V_{k}\setminus\{x_{\mathrm{new}}\}italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∖ { italic_x start_POSTSUBSCRIPT roman_new end_POSTSUBSCRIPT } do
9       for xF(x)superscript𝑥normal-′𝐹𝑥x^{\prime}\in F(x)italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_F ( italic_x ) do
10             if x(x+ϵku𝒰f(x,u)+ρ(dk,ϵk))superscript𝑥normal-′𝑥subscriptitalic-ϵ𝑘subscript𝑢𝒰𝑓𝑥𝑢𝜌subscript𝑑𝑘subscriptitalic-ϵ𝑘x^{\prime}\notin(x+\epsilon_{k}\bigcup_{u\in\mathcal{U}}f(x,u)+\rho(d_{k},% \epsilon_{k})\mathcal{B})italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∉ ( italic_x + italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ⋃ start_POSTSUBSCRIPT italic_u ∈ caligraphic_U end_POSTSUBSCRIPT italic_f ( italic_x , italic_u ) + italic_ρ ( italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ) then
11                   F(x)F(x){x}𝐹𝑥𝐹𝑥superscript𝑥F(x)\leftarrow F(x)\setminus\{x^{\prime}\}italic_F ( italic_x ) ← italic_F ( italic_x ) ∖ { italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT };
12                  
13            if x(𝒳free+dk)superscript𝑥normal-′subscript𝒳normal-freesubscript𝑑𝑘x^{\prime}\notin(\mathcal{X}_{\mathrm{free}}+d_{k}\mathcal{B})italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∉ ( caligraphic_X start_POSTSUBSCRIPT roman_free end_POSTSUBSCRIPT + italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) then
14                   F(x)F(x){x}𝐹𝑥𝐹𝑥superscript𝑥F(x)\leftarrow F(x)\setminus\{x^{\prime}\}italic_F ( italic_x ) ← italic_F ( italic_x ) ∖ { italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT };
15                  
16            
17      
return F𝐹Fitalic_F
Algorithm 4 Computation-saving query of F𝐹Fitalic_F

Based on Lemma VII.1, we can construct Fk(x)subscript𝐹𝑘𝑥F_{k}(x)italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) in line 3 of Algorithm 3 BackProp in a computationally efficient manner, where the execution of the computationally heaviest part, querying Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT within a range of given a random state x𝑥xitalic_x, is reduced. Specifically, we treat Fk(x)subscript𝐹𝑘𝑥F_{k}(x)italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) as a set of samples independent of graph index k𝑘kitalic_k; i.e., Fk(x)=F(x)subscript𝐹𝑘𝑥𝐹𝑥F_{k}(x)=F(x)italic_F start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) = italic_F ( italic_x ). See Algorithm 4. When x𝑥xitalic_x is added to Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT as xnewsubscript𝑥newx_{\mathrm{new}}italic_x start_POSTSUBSCRIPT roman_new end_POSTSUBSCRIPT, we initialize F(x)𝐹𝑥F(x)italic_F ( italic_x ) as its definition (3) using the resolutions ϵksubscriptitalic-ϵ𝑘\epsilon_{k}italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and dksubscript𝑑𝑘d_{k}italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT at that time, and update the one-hop neighbors of every other sampled state as lines 4 to 4; when querying for the one-hop neighbors of x𝑥xitalic_x in a later iteration, we update F(x)𝐹𝑥F(x)italic_F ( italic_x ) by removing faraway one-hop neighbors according to new resolutions as lines 4 to 4. In contrast to the vanilla query command as line 3 in BackProp, where searching the whole Vksubscript𝑉𝑘V_{k}italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is executed every time when F(x)𝐹𝑥F(x)italic_F ( italic_x ) is queried, the new adaptation skips this step and saves computations. This adaptation is leveraged throughout the results in Section VII.

VII-B Point mass

In this subsection, we try to show the correctness of the algorithm and the rate of convergence through a number of numerical simulations for a point mass. The point mass dynamic follows a single integrator [x˙y˙]=[u1u2]matrix˙𝑥˙𝑦matrixsubscript𝑢1subscript𝑢2\begin{bmatrix}\dot{x}\\ \dot{y}\end{bmatrix}=\begin{bmatrix}u_{1}\\ u_{2}\end{bmatrix}[ start_ARG start_ROW start_CELL over˙ start_ARG italic_x end_ARG end_CELL end_ROW start_ROW start_CELL over˙ start_ARG italic_y end_ARG end_CELL end_ROW end_ARG ] = [ start_ARG start_ROW start_CELL italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ], where the velocity on each coordinate can be directly manipulated. The control set is a unit disc; i.e., the maximum velocity is restricted to 1111. The environment is a square of size 20×20202020\times 2020 × 20 populated with a rectangular obstacle and a circular obstacle, and the robot desires to rest at a central circle. We used ϵk=(5dk)2/3subscriptitalic-ϵ𝑘superscript5subscript𝑑𝑘23\epsilon_{k}=(5d_{k})^{2/3}italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = ( 5 italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 / 3 end_POSTSUPERSCRIPT and ρ(ϵk,dk)=2dk𝜌subscriptitalic-ϵ𝑘subscript𝑑𝑘2subscript𝑑𝑘\rho(\epsilon_{k},d_{k})=2d_{k}italic_ρ ( italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = 2 italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT for computational simplicity. The experiment used P=50𝑃50P=50italic_P = 50 and mk=500subscript𝑚𝑘500m_{k}=500italic_m start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = 500; i.e., every vertex was updated at-most after 50505050 iterations and the recursion depth is a constant of 500500500500. Notice that a constant recursion depth does not necessarily satisfy Assumption V.2, but the results still show its applicability in practice.

Figures 2a to 2c qualitatively show the estimated value function over the whole environment after the computations of 21.821.821.821.8 seconds, 205.7205.7205.7205.7 seconds and 20062.720062.720062.720062.7 seconds, respectively, where the goal region is marked by the red dash circle. It is observed that the estimate of minimal traveling time to the goal region is monotonically improving.

Quantitative characterization of the approximation error is available, as the minimal traveling time function 𝕋*superscript𝕋\mathds{T}^{*}blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT can be analytically computed for a point mass in the presence of regularly shaped obstacles. The approximation error is computed between the estimated value function 𝕋ksubscript𝕋𝑘\mathds{T}_{k}blackboard_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and the ground truth 𝕋*superscript𝕋\mathds{T}^{*}blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT and is characterized by the root of mean squared error (RMSE), which follows as RMSE1|Vk|xVk|𝕋k(x)𝕋*(x)|2𝑅𝑀𝑆𝐸1subscript𝑉𝑘subscript𝑥subscript𝑉𝑘superscriptsubscript𝕋𝑘𝑥superscript𝕋𝑥2RMSE\triangleq\sqrt{\frac{1}{|V_{k}|}\sum_{x\in V_{k}}|\mathds{T}_{k}(x)-% \mathds{T}^{*}(x)|^{2}}italic_R italic_M italic_S italic_E ≜ square-root start_ARG divide start_ARG 1 end_ARG start_ARG | italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT | end_ARG ∑ start_POSTSUBSCRIPT italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT | blackboard_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) - blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) | start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG. Notice that the estimated value function 𝕋ksubscript𝕋𝑘\mathds{T}_{k}blackboard_T start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT can be positive infinity when value iterations are not sufficiently executed and we exclude them in the comparison. We compare iPolicy with the multi-grid method, the single-robot version of [35]. We use 4444 grids with spatial resolutions {0.8,0.4,0.2,0.1}0.80.40.20.1\{0.8,0.4,0.2,0.1\}{ 0.8 , 0.4 , 0.2 , 0.1 } and compute the temporal resolutions in the same way as [35] for the multi-grid method. All other parameters share the same values with those of iPolicy. Figure 2d quantitatively visualizes RMSEs over computational times for 5555 independently executed experiments. It is shown that the approximation error of iPolicy monotonically and continuously decreases in general as more computational time is consumed, and iPolicy obtains lower approximation errors compared to the multi-grid method within the allowed computational resources.

VII-C Simple car

In this subsection, we first show the optimality results with the simple car model. The simple car model follows the unicycle dynamic, a well-known driftless affine system in the configuration space ×2𝕊1{}^{2}\times\mathds{S}^{1}start_FLOATSUPERSCRIPT 2 end_FLOATSUPERSCRIPT × blackboard_S start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT [2]. The unicycle dynamics is given below:

[x˙y˙θ˙]=[cosθ0sinθ001][u1u2],matrix˙𝑥˙𝑦˙𝜃matrix𝜃0𝜃001matrixsubscript𝑢1subscript𝑢2\displaystyle\begin{bmatrix}\dot{x}\\ \dot{y}\\ \dot{\theta}\end{bmatrix}=\begin{bmatrix}\cos\theta&0\\ \sin\theta&0\\ 0&1\end{bmatrix}\begin{bmatrix}u_{1}\\ u_{2}\end{bmatrix},[ start_ARG start_ROW start_CELL over˙ start_ARG italic_x end_ARG end_CELL end_ROW start_ROW start_CELL over˙ start_ARG italic_y end_ARG end_CELL end_ROW start_ROW start_CELL over˙ start_ARG italic_θ end_ARG end_CELL end_ROW end_ARG ] = [ start_ARG start_ROW start_CELL roman_cos italic_θ end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL roman_sin italic_θ end_CELL start_CELL 0 end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL end_ROW end_ARG ] [ start_ARG start_ROW start_CELL italic_u start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_u start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] ,

where θ𝜃\thetaitalic_θ is the steering angle and is a circle 𝕊1superscript𝕊1\mathds{S}^{1}blackboard_S start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT. For computational simplicity, we redefine 𝕊1superscript𝕊1\mathds{S}^{1}blackboard_S start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT by θ[Θ,Θ]/\theta\in[-\Theta,\Theta]/\simitalic_θ ∈ [ - roman_Θ , roman_Θ ] / ∼, where similar-to\sim indicates ΘΘ-\Theta- roman_Θ and ΘΘ\Thetaroman_Θ are identical, and by using the Carnot-Carathéodory distance. To avoid the bias over different coordinates in computing distances, we let the region of interest be a square of 20×20202020\times 2020 × 20 and let Θ=10Θ10\Theta=10roman_Θ = 10. The control set of the simple car is given by U=[1,1]×[1,1]𝑈1111U=[-1,1]\times[-1,1]italic_U = [ - 1 , 1 ] × [ - 1 , 1 ], and the goal set is a sub-Riemannian ball of radius 1111 centered at (0,0,0)000(0,0,0)( 0 , 0 , 0 ). Figure 3 shows the estimated value function after approximately 13131313k seconds of computations over 5,00050005,0005 , 000 samples on the plane, where the orientation of the car is roughly parallel to the orientation at goal (θgoal=0subscript𝜃goal0\theta_{\mathrm{goal}}=0italic_θ start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT = 0) with a maximum difference of 10superscript1010^{\circ}10 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT. The shape of the level sets around the goal depict that the minimal traveling time is higher in the transverse direction when compared to that along the longitudinal direction parallel to the orientation of the goal state, as the car has a non-zero turn radius and it cannot move in the transverse direction.

Refer to caption
Figure 3: The estimated value function obtained by the proposed incremental algorithm for simple car obtained over 5,00050005,0005 , 000 samples with orientation θ=0𝜃0\theta=0italic_θ = 0.
Refer to caption
(a) The path of head-in parking
Refer to caption
(b) The path of parallel parking
Figure 4: Trajectory of the simple car accomplishing automated parking in the cluttered environment. Yellow arrows denote the orientation of the car while red stars denote the center. Goal region is marked by a pink dash circle.

In another simulation, we demonstrate the capability of iPolicy of handling automated parking, including head-in parking and parallel parking, of the simple car in the cluttered environment and show anytime property of the algorithm by visualizing the resulting trajectory under the guidance of the computed controller. See Figure 4a and Figure 4b. In the first simulation, a simple car desires to park in a one-way 90superscript9090^{\circ}90 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT head-in-only parking lot, and it starts from (6,0,0)600(-6,0,0)( - 6 , 0 , 0 ) and needs to safely stop at (4,7,π/2)47𝜋2(4,7,\pi/2)( 4 , 7 , italic_π / 2 ). In the second simulation, we simulate the parallel parking of a simple car on a two-way street, where the car starts from (7,2,0)720(-7,-2,0)( - 7 , - 2 , 0 ) and needs to park at (4,7,π/2)47𝜋2(4,7,\pi/2)( 4 , 7 , italic_π / 2 ) without causing any collisions. In Figure 4a, iPolicy uses 290290290290 samples and runs 53.7153.7153.7153.71 seconds to compute a safe and feasible controller for head-in parking in Figure 4a. In Figure 4b, iPolicy uses 200200200200 samples and runs 22.2322.2322.2322.23 seconds to command the simple car to safely park in parallel. Both cases imply the quick feasibility of iPolicy. Together with Theorem V.1 and Figure 2d, where the increasing optimality is verified, the anytime property of iPolicy is demonstrated. That is, iPolicy can produce a feasible solution given a short computational time, and can continuously optimize its solution if more computational time is given.

Refer to caption
Figure 5: The estimated value function obtained by iPolicy for Dubins car system obtained over 7500750075007500 samples. As the Dubins car can’t move backwards, the value functions are discontinuous and it results in more complex reachable sets in the sub-Riemannian manifold.

VII-D Dubins car

In this subsection, we show Algorithm 1 can handle more complex dynamic models. In particular, we also show the estimate of the value functions obtained using iPolicy for the Dubins car whose dynamics also lies in the ×2𝕊1{}^{2}\times\mathds{S}^{1}start_FLOATSUPERSCRIPT 2 end_FLOATSUPERSCRIPT × blackboard_S start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT space and is described by the same set of equations as the simple car; however, it can only move forward. Different from the Dubins car in [39] that can instantly stop, the Dubins car can only move forward at a constant speed, making its control set a singleton U={1}×[1,1]𝑈111U=\{1\}\times[-1,1]italic_U = { 1 } × [ - 1 , 1 ]. The Dubins car is another example of nonholonomic system but without a reverse gear or a brake [12, 2]. Moreover, the Dubins car does not satisfy Assumption VII.1; however, we still apply Algorithm 4 in implementation and later result will show this quick adaptation does not affect the The goal set in this case is the sub-Riemannian ball of radius 1111 around the point (0,0,0)000(0,0,0)( 0 , 0 , 0 ) and R=1𝑅1R=1italic_R = 1 (again the goal is the point where we reach first inside the ball). In Figure 5, the estimated cost-to-go on a plane are shown where the orientation of the car is parallel to the orientation at goal θgoal=0subscript𝜃goal0\theta_{\rm goal}=0italic_θ start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT = 0 with a maximum difference of 10superscript1010^{\circ}10 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT. The estimate of the value functions show the more complex level sets obtained for the Dubins car as it can not move backwards. Unlike the simple car, the level sets are not rotationally symmetric about the origin. Notice that area on the right-hand side of x=7.5𝑥7.5x=7.5italic_x = 7.5 in Figure 5 is mostly empty; this indicates iPolicy cannot find a solution in this area, mostly because the Dubins car starting from this area cannot reach the goal region without exiting the region of interest.

Remark VII.1.

Applying iPolicy to larger size environment does not require any additional modifications. However, at the early stage of computations, smaller size of samples may not display the desired probabilistic properties and some minor revisions may be needed for quick feasibility. Specifically, we can revise ϵksubscriptitalic-ϵ𝑘\epsilon_{k}italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT s.t. the one-hop neighbor of a sample does not exit the region of interest; i.e., x+f(x,u)ϵk+ρ(ϵk,dk)𝒳𝑥𝑓𝑥𝑢subscriptitalic-ϵ𝑘𝜌subscriptitalic-ϵ𝑘subscript𝑑𝑘𝒳x+f(x,u)\epsilon_{k}+\rho(\epsilon_{k},d_{k})\mathcal{B}\subseteq\mathcal{X}italic_x + italic_f ( italic_x , italic_u ) italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_ρ ( italic_ϵ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) caligraphic_B ⊆ caligraphic_X for most xVk𝑥subscript𝑉𝑘x\in V_{k}italic_x ∈ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and u𝒰𝑢𝒰u\in\mathcal{U}italic_u ∈ caligraphic_U.

VIII Conclusions and Future Work

In this paper we presented an algorithm for feedback motion planning of dynamical systems. Using results from set-valued control theory and sampling-based algorithms, we presented the iPolicy algorithm which can be used for feedback motion planning of robotic systems and guarantees asymptotic optimality of the value functions. Numerical experiments with point mass, a simple car model and the Dubins car model are presented where the time-to-go costs are recovered using iPolicy. We discussed an asynchronous update rule of the value functions for computational efficiency of the algorithms and proved that it retains asymptotic optimality guarantees.

It was found that majority of the time of the algorithm is used in local connection and collision checking using the sub-Riemannian metric. Computational time is a critical bottleneck for the class of algorithms presented here and needs further research. Our future work includes the research on speeding up the calculations. Using parallel machine learning algorithms which can assist in connecting the states sampled during state-space construction using sampling could potentially help in relaxing the compute times. Learning-based algorithms [40] are shown to be efficient in addressing high-dimensional problems, which has the potential to help reduce computations of planners and makes it more applicable in practice. Furthermore, use of iPolicy for sub-optimal, trajectory-centric control of underactuated systems [41, 42] could also be an interesting topic of future research. Extension to affine or complementarity dynamical systems could be another interesting direction of research [43, 44, 45].

References

  • [1] J. Reif, “Complexity of the mover’s problem and generalizations,” in Proceedings of the 20th Annual IEEE Conference on Foundations of Computer Science, pp. 421–427, 1979.
  • [2] S. M. LaValle, Planning algorithms. Cambridge university press, 2006.
  • [3] S. LaValle, “Motion planning,” Robotics Automation Magazine, IEEE, vol. 18, pp. 79–89, March 2011.
  • [4] S. M. LaValle and J. J. Kuffner, “Randomized kinodynamic planning,” The International Journal of Robotics Research, vol. 20, no. 5, pp. 378–400, 2001.
  • [5] S. Karaman and E. Frazzoli, “Sampling-based algorithms for optimal motion planning,” The International Journal of Robotics Research, vol. 30, no. 7, pp. 846–894, 2011.
  • [6] R. M. Murray and S. S. Sastry, “Nonholonomic motion planning: Steering using sinusoids,” Automatic Control, IEEE Transactions on, vol. 38, no. 5, pp. 700–716, 1993.
  • [7] J.-P. Laumond, S. Sekhavat, and F. Lamiraux, Guidelines in nonholonomic motion planning for mobile robots. Springer, 1998.
  • [8] Y. Wang, D. K. Jha, and Y. Akemi, “A two-stage RRT path planner for automated parking,” in 2017 13th IEEE Conference on Automation Science and Engineering (CASE), pp. 496–502, IEEE, 2017.
  • [9] E. Schmerling, L. Janson, and M. Pavone, “Optimal sampling-based motion planning under differential constraints: the driftless case,” in 2015 IEEE International Conference on Robotics and Automation (ICRA), pp. 2368–2375, IEEE, 2015.
  • [10] A. Perez, R. Platt, G. Konidaris, L. Kaelbling, and T. Lozano-Perez, “LQR-RRT*: Optimal sampling-based motion planning with automatically derived extension heuristics,” in Robotics and Automation (ICRA), 2012 IEEE International Conference on, pp. 2537–2542, IEEE, 2012.
  • [11] D. J. Webb and J. van den Berg, “Kinodynamic RRT*: Asymptotically optimal motion planning for robots with linear dynamics,” in Robotics and Automation (ICRA), 2013 IEEE International Conference on, pp. 5054–5061, IEEE, 2013.
  • [12] S. Karaman and E. Frazzoli, “Sampling-based optimal motion planning for non-holonomic dynamical systems,” in Robotics and Automation (ICRA), 2013 IEEE International Conference on, pp. 5041–5047, IEEE, 2013.
  • [13] K. Hauser, “Lazy collision checking in asymptotically-optimal motion planning,” in 2015 IEEE international conference on robotics and automation (ICRA), pp. 2951–2957, IEEE, 2015.
  • [14] H. Peng, F. Li, J. Liu, and Z. Ju, “A symplectic instantaneous optimal control for robot trajectory tracking with differential-algebraic equation models,” IEEE Transactions on Industrial Electronics, vol. 67, no. 5, pp. 3819–3829, 2020.
  • [15] P. Cardaliaguet, M. Quincampoix, and P. Saint-Pierre, “Set-valued numerical analysis for optimal control and differential games,” in Stochastic and differential games: theory and numerical methods, pp. 177–247, Birkhäuser Boston Boston, MA, 1999.
  • [16] M. Kalakrishnan, S. Chitta, E. Theodorou, P. Pastor, and S. Schaal, “Stomp: Stochastic trajectory optimization for motion planning,” in 2011 IEEE International Conference on Robotics and Automation, pp. 4569–4574, 2011.
  • [17] N. Ratliff, M. Zucker, J. A. Bagnell, and S. Srinivasa, “Chomp: Gradient optimization techniques for efficient motion planning,” in 2009 IEEE International Conference on Robotics and Automation, pp. 489–494, 2009.
  • [18] M. Kelly, “An introduction to trajectory optimization: How to do your own direct collocation,” SIAM Review, vol. 59, no. 4, pp. 849–904, 2017.
  • [19] J. Schulman, Y. Duan, J. Ho, A. Lee, I. Awwal, H. Bradlow, J. Pan, S. Patil, K. Goldberg, and P. Abbeel, “Motion planning with sequential convex optimization and convex collision checking,” The International Journal of Robotics Research, vol. 33, no. 9, pp. 1251–1270, 2014.
  • [20] A. U. Raghunathan, D. K. Jha, and D. Romeres, “PYROBOCOP : Python-based robotic control & optimization package for manipulation and collision avoidance,” CoRR, vol. abs/2106.03220, 2021.
  • [21] X. Wang, J. Liu, Y. Zhang, B. Shi, D. Jiang, and H. Peng, “A unified symplectic pseudospectral method for motion planning and tracking control of 3d underactuated overhead cranes,” International Journal of Robust and Nonlinear Control, vol. 29, no. 7, pp. 2236–2253, 2019.
  • [22] Y. Tassa, T. Erez, and E. Todorov, “Synthesis and stabilization of complex behaviors through online trajectory optimization,” in 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 4906–4913, IEEE, 2012.
  • [23] K. Zhang, Q. Sun, and Y. Shi, “Trajectory tracking control of autonomous ground vehicles using adaptive learning mpc,” IEEE Transactions on Neural Networks and Learning Systems, vol. 32, no. 12, pp. 5554–5564, 2021.
  • [24] J. De Schutter, M. Zanon, and M. Diehl, “Tunempc—a tool for economic tuning of tracking (n) mpc problems,” IEEE Control Systems Letters, vol. 4, no. 4, pp. 910–915, 2020.
  • [25] A. Majumdar and R. Tedrake, “Funnel libraries for real-time robust feedback motion planning,” The International Journal of Robotics Research, vol. 36, no. 8, pp. 947–982, 2017.
  • [26] O. Khatib, “Real-time obstacle avoidance for manipulators and mobile robots,” in Autonomous Robot Vehicles, pp. 396–404, Springer, 1986.
  • [27] J. Borenstein and Y. Koren, “The vector field histogram - fast obstacle avoidance for mobile robots,” IEEE Journal of Robotics and Automation, vol. 7, no. 3, pp. 278–288, 1991.
  • [28] D. E. Koditschek and E. Rimon, “Robot navigation functions on manifolds with boundary,” Advances in applied mathematics, vol. 11, no. 4, pp. 412–442, 1990.
  • [29] A. Orthey, C. Chamzas, and L. E. Kavraki, “Sampling-based motion planning: A comparative review,” Annual Review of Control, Robotics, and Autonomous Systems, vol. 7, no. 1, p. null, 2024.
  • [30] R. Tedrake, I. R. Manchester, M. Tobenkin, and J. W. Roberts, “Lqr-trees: Feedback motion planning via sums-of-squares verification,” The International Journal of Robotics Research, vol. 29, no. 8, pp. 1038–1052, 2010.
  • [31] R. Tedrake, “LQR-trees: Feedback motion planning on sparse randomized trees,” in Proceedings of Robotics: Science and Systems, (Seattle, USA), June 2009.
  • [32] G. J. Maeda, S. P. Singh, and H. Durrant-Whyte, “A tuned approach to feedback motion planning with rrts under model uncertainty,” in Robotics and Automation (ICRA), 2011 IEEE International Conference on, pp. 2288–2294, IEEE, 2011.
  • [33] V. A. Huynh, S. Karaman, and E. Frazzoli, “An incremental sampling-based algorithm for stochastic optimal control,” in Robotics and Automation (ICRA), 2012 IEEE International Conference on, pp. 2865–2872, IEEE, 2012.
  • [34] D. K. Jha, M. Zhu, and A. Ray, “Game theoretic controller synthesis for multi-robot motion planning-part II: Policy-based algorithms,” IFAC-PapersOnLine, vol. 48, no. 22, pp. 168–173, 2015.
  • [35] G. Zhao and M. Zhu, “Pareto optimal multirobot motion planning,” IEEE Transactions on Automatic Control, vol. 66, no. 9, pp. 3984–3999, 2020.
  • [36] A.-A. Agha-Mohammadi, S. Chakravorty, and N. M. Amato, “Firm: Sampling-based feedback motion-planning under motion uncertainty and imperfect measurements,” The International Journal of Robotics Research, vol. 33, no. 2, pp. 268–304, 2014.
  • [37] E. Schmerling, L. Janson, and M. Pavone, “Optimal sampling-based motion planning under differential constraints: the driftless case,” arXiv preprint arXiv:1403.2483, 2014.
  • [38] E. Mueller, M. Zhu, S. Karaman, and E. Frazzoli, “Anytime computation algorithms for approach-evasion differential games,” arXiv preprints, 2013. http://arxiv.longhoe.net/abs/1308.1174.
  • [39] S. Levine and P. Abbeel, “Learning neural network policies with guided policy search under unknown dynamics,” in Advances in Neural Information Processing Systems, pp. 1071–1079, 2014.
  • [40] C. Chi, S. Feng, Y. Du, Z. Xu, E. Cousineau, B. Burchfiel, and S. Song, “Diffusion policy: Visuomotor policy learning via action diffusion,” arXiv preprint arXiv:2303.04137, 2023.
  • [41] A. Majumdar, A. A. Ahmadi, and R. Tedrake, “Control design along trajectories with sums of squares programming,” in 2013 IEEE International Conference on Robotics and Automation, pp. 4054–4061, IEEE, 2013.
  • [42] P. Kolaric, D. K. Jha, A. U. Raghunathan, F. L. Lewis, M. Benosman, D. Romeres, and D. Nikovski, “Local policy optimization for trajectory-centric reinforcement learning,” International Conference on Robotics and Automation (ICRA), 2020. https://arxiv.longhoe.net/pdf/2001.08092.
  • [43] Y. Shirai, D. K. Jha, and A. U. Raghunathan, “Covariance steering for uncertain contact-rich systems,” arXiv preprint arXiv:2303.13382, 2023.
  • [44] Y. Shirai, D. K. Jha, A. U. Raghunathan, and D. Romeres, “Robust pivoting: Exploiting frictional stability using bilevel optimization,” in 2022 International Conference on Robotics and Automation (ICRA), pp. 992–998, 2022.
  • [45] G. Yang, M. Cai, A. Ahmad, C. Belta, and R. Tron, “Efficient LQR-CBF-RRT*: Safe and optimal motion planning,” arXiv preprint arXiv:2304.00790, 2023.
  • [46] H. L. Royden and P. Fitzpatrick, Real analysis. Prentice Hall, 4 ed., 2010.
[Uncaptioned image] Guoxiang Zhao is currently an Associate Professor in the School of Future Technology at Shanghai University, Shanghai, China. He received PhD in Electrical Engineering from Pennsylvania State University in May 2022. He received a M.S. degree in Mechanical Engineering from Purdue University, West Lafayette, IN, USA and a B.E. degree from Shanghai Jiao Tong University, Shanghai, China. He was a Software Engineer from 2022 to 2023 with TuSimple, Inc. in San Diego, CA, USA. His research interests are in the areas of motion planning and multi-robot systems.
[Uncaptioned image] Devesh K. Jha is currently a Principal Research Scientist at Mitsubishi Electric Research Laboratories (MERL) in Cambridge, MA, USA. He received PhD in Mechanical Engineering from Penn State in Decemeber 2016. He received M.S. degrees in Mechanical Engineering and Mathematics also from Penn State. His research interests are in the areas of Machine Learning, Robotics and Deep Learning. He is a recipient of several best paper awards including the Kalman Best Paper Award 2019 from the American Society of Mechanical Engineers (ASME) Dynamic Systems and Control Division. He is a senior member of IEEE and an associate editor of IEEE Robotics and Automation Letters (RA-L).
[Uncaptioned image] Yebin Wang (M’10-SM’16) received the B.Eng. degree in Mechatronics Engineering from Zhejiang University, Hangzhou, China, in 1997, M.Eng. degree in Control Theory & Control Engineering from Tsinghua University, Bei**g, China, in 2001, and Ph.D. in Electrical Engineering from the University of Alberta, Edmonton, Canada, in 2008. Dr. Wang has been with Mitsubishi Electric Research Laboratories in Cambridge, MA, USA, since 2009, and now is a Senior Principal Research Scientist. From 2001 to 2003 he was a Software Engineer, Project Manager, and Manager of R&D Dept. in industries, Bei**g, China. His research interests include nonlinear control and estimation, optimal control, adaptive systems and their applications including mechatronic systems.
[Uncaptioned image] Minghui Zhu is an Associate Professor in the School of Electrical Engineering and Computer Science at the Pennsylvania State University. Prior to joining Penn State in 2013, he was a postdoctoral associate in the Laboratory for Information and Decision Systems at the Massachusetts Institute of Technology. He received Ph.D. in Engineering Science (Mechanical Engineering) from the University of California, San Diego in 2011. His research interests lie in distributed control and decision-making of multi-agent networks with applications in robotic networks, security and the smart grid. He is the co-author of the book ”Distributed optimization-based control of multi-agent networks in complex environments” (Springer, 2015). He is a recipient of the award of Outstanding Graduate Student of Mechanical and Aerospace Engineering at UCSD in 2011, the Dorothy Quiggle Career Development Professorship in Engineering at Penn State in 2013, the award of Outstanding Reviewer of Automatica in 2013 and 2014, and the National Science Foundation CAREER award in 2019. He is an associate editor of the IEEE Open Journal of Control Systems, the IET Cyber-systems and Robotics and the Conference Editorial Board of the IEEE Control Systems Society.

This section contains preliminary results towards the complete proof of Theorem V.1. Most notations are borrowed from [15] but we make the following changes for the consistency throughout this paper. Let K=𝒳𝐾𝒳K=\mathcal{X}italic_K = caligraphic_X be a closed set and C=𝒳goal𝐶subscript𝒳goalC=\mathcal{X}_{\mathrm{goal}}italic_C = caligraphic_X start_POSTSUBSCRIPT roman_goal end_POSTSUBSCRIPT be a closed target set of K𝐾Kitalic_K following the definition in [15]. We rewrite the minimal time function ϑCKsubscriptsuperscriptitalic-ϑ𝐾𝐶\vartheta^{K}_{C}italic_ϑ start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_C end_POSTSUBSCRIPT, the estimated value function T𝑇Titalic_T and the the spatial resolution hpsubscript𝑝h_{p}italic_h start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT in [15] as 𝕋*superscript𝕋\mathds{T}^{*}blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, 𝕋𝕋\mathds{T}blackboard_T and dpsubscript𝑑𝑝d_{p}italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT respectively. Notice that all superscripts and subscripts of T𝑇Titalic_T are kept in 𝕋𝕋\mathds{T}blackboard_T. Denote the domain of 𝕋*superscript𝕋\mathds{T}^{*}blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT by 𝒟(𝕋*){xK|𝕋*(x)<+}𝒟superscript𝕋conditional-set𝑥𝐾superscript𝕋𝑥\mathcal{D}(\mathds{T}^{*})\triangleq\{x\in K|\mathds{T}^{*}(x)<+\infty\}caligraphic_D ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ≜ { italic_x ∈ italic_K | blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) < + ∞ }. The following theorem proves a slightly stronger result compared to Corollary 3.7 in [15], where the perturbation radius can be arbitrarily large.

Theorem .1.

Given the convergence in the epigraphic sense, i.e., pi(𝕋*)=Limp+pi(𝕋p)𝑝𝑖superscript𝕋subscriptnormal-Limnormal-→𝑝𝑝𝑖subscriptsuperscript𝕋𝑝\mathcal{E}pi(\mathds{T}^{*})=\operatorname*{Lim}_{p\to+\infty}\mathcal{E}pi(% \mathds{T}^{\infty}_{p})caligraphic_E italic_p italic_i ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = roman_Lim start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT caligraphic_E italic_p italic_i ( blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ), 𝕋psubscriptsuperscript𝕋𝑝\mathds{T}^{\infty}_{p}blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT converges pointwise to 𝕋*superscript𝕋\mathds{T}^{*}blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, i.e., x𝒟(𝕋*)for-all𝑥𝒟superscript𝕋\forall x\in\mathcal{D}(\mathds{T}^{*})∀ italic_x ∈ caligraphic_D ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ),

𝕋*(x)=limp+minx(x+ηpKdp)𝕋p(x),superscript𝕋𝑥subscript𝑝subscriptsuperscript𝑥𝑥subscript𝜂𝑝subscript𝐾subscript𝑑𝑝subscriptsuperscript𝕋𝑝superscript𝑥\displaystyle\mathds{T}^{*}(x)=\lim_{p\to+\infty}\min_{x^{\prime}\in(x+\eta_{p% }\mathcal{B}\cap K_{d_{p}})}\mathds{T}^{\infty}_{p}(x^{\prime}),blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) = roman_lim start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ∩ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ,

where limp+ηpdp[1,+)subscriptnormal-→𝑝subscript𝜂𝑝subscript𝑑𝑝1\lim_{p\to+\infty}\frac{\eta_{p}}{d_{p}}\in[1,+\infty)roman_lim start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT divide start_ARG italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_ARG start_ARG italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_ARG ∈ [ 1 , + ∞ ) and ηpdpsubscript𝜂𝑝subscript𝑑𝑝\eta_{p}\geq d_{p}italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ≥ italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT.

Proof.

Define 𝕋~psubscriptsuperscript~𝕋𝑝\tilde{\mathds{T}}^{\infty}_{p}over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT s.t. pi(𝕋p)+ηp=pi(𝕋~p)𝑝𝑖subscriptsuperscript𝕋𝑝subscript𝜂𝑝𝑝𝑖subscriptsuperscript~𝕋𝑝\mathcal{E}pi(\mathds{T}^{\infty}_{p})+\eta_{p}\mathcal{B}=\mathcal{E}pi(% \tilde{\mathds{T}}^{\infty}_{p})caligraphic_E italic_p italic_i ( blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B = caligraphic_E italic_p italic_i ( over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ).

Claim .1.

The upper and lower bound of 𝕋p(x)subscriptsuperscript𝕋𝑝𝑥\mathds{T}^{\infty}_{p}(x)blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) for x𝒟(𝕋*)𝑥𝒟superscript𝕋x\in\mathcal{D}(\mathds{T}^{*})italic_x ∈ caligraphic_D ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) follows respectively

𝕋~p(x)+ηpminx(x+ηp)Kdp𝕋p(x),subscriptsuperscript~𝕋𝑝𝑥subscript𝜂𝑝subscriptsuperscript𝑥𝑥subscript𝜂𝑝subscript𝐾subscript𝑑𝑝subscriptsuperscript𝕋𝑝superscript𝑥\displaystyle\tilde{\mathds{T}}^{\infty}_{p}(x)+\eta_{p}\geq\min_{x^{\prime}% \in(x+\eta_{p}\mathcal{B})\cap K_{d_{p}}}\mathds{T}^{\infty}_{p}(x^{\prime}),over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ≥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) , (30)
𝕋~p(x)minx(x+ηp)Kdp𝕋p(x).subscriptsuperscript~𝕋𝑝𝑥subscriptsuperscript𝑥𝑥subscript𝜂𝑝subscript𝐾subscript𝑑𝑝subscriptsuperscript𝕋𝑝superscript𝑥\displaystyle\tilde{\mathds{T}}^{\infty}_{p}(x)\leq\min_{x^{\prime}\in(x+\eta_% {p}\mathcal{B})\cap K_{d_{p}}}\mathds{T}^{\infty}_{p}(x^{\prime}).over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) ≤ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) . (31)
Proof.

For any x𝒟(𝕋*)𝑥𝒟superscript𝕋x\in\mathcal{D}(\mathds{T}^{*})italic_x ∈ caligraphic_D ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) and (x,𝕋~p(x))pi(𝕋~p)𝑥subscriptsuperscript~𝕋𝑝𝑥𝑝𝑖subscriptsuperscript~𝕋𝑝(x,\tilde{\mathds{T}}^{\infty}_{p}(x))\in\mathcal{E}pi(\tilde{\mathds{T}}^{% \infty}_{p})( italic_x , over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) ) ∈ caligraphic_E italic_p italic_i ( over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ), there exists (x,t)pi(𝕋p)superscript𝑥superscript𝑡𝑝𝑖subscriptsuperscript𝕋𝑝(x^{\prime},t^{\prime})\in\mathcal{E}pi(\mathds{T}^{\infty}_{p})( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∈ caligraphic_E italic_p italic_i ( blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) s.t. (x,𝕋~p(x))(x,t)ηpnorm𝑥subscriptsuperscript~𝕋𝑝𝑥superscript𝑥superscript𝑡subscript𝜂𝑝\|(x,\tilde{\mathds{T}}^{\infty}_{p}(x))-(x^{\prime},t^{\prime})\|\leq\eta_{p}∥ ( italic_x , over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) ) - ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ ≤ italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT. This implies xxηpnorm𝑥superscript𝑥subscript𝜂𝑝\|x-x^{\prime}\|\leq\eta_{p}∥ italic_x - italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∥ ≤ italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT and |𝕋~p(x)t|ηpsubscriptsuperscript~𝕋𝑝𝑥superscript𝑡subscript𝜂𝑝|\tilde{\mathds{T}}^{\infty}_{p}(x)-t^{\prime}|\leq\eta_{p}| over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) - italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | ≤ italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT. Therefore,

𝕋~p(x)+ηpt𝕋p(x)minx(x+ηp)Kdp𝕋p(x).subscriptsuperscript~𝕋𝑝𝑥subscript𝜂𝑝superscript𝑡subscriptsuperscript𝕋𝑝superscript𝑥subscriptsuperscript𝑥𝑥subscript𝜂𝑝subscript𝐾subscript𝑑𝑝subscriptsuperscript𝕋𝑝superscript𝑥\displaystyle\tilde{\mathds{T}}^{\infty}_{p}(x)+\eta_{p}\geq t^{\prime}\geq% \mathds{T}^{\infty}_{p}(x^{\prime})\geq\min_{x^{\prime}\in(x+\eta_{p}\mathcal{% B})\cap K_{d_{p}}}\mathds{T}^{\infty}_{p}(x^{\prime}).over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ≥ italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≥ blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ≥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) .

This completes the proof of (30).

Following the definition of 𝕋~psubscriptsuperscript~𝕋𝑝\tilde{\mathds{T}}^{\infty}_{p}over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT, we have

𝕋~p(x)=inf{t|(x,t)pi(𝕋p)+ηp}subscriptsuperscript~𝕋𝑝𝑥infimumconditional-set𝑡𝑥𝑡𝑝𝑖subscriptsuperscript𝕋𝑝subscript𝜂𝑝\displaystyle\tilde{\mathds{T}}^{\infty}_{p}(x)=\inf\{t|(x,t)\in\mathcal{E}pi(% \mathds{T}^{\infty}_{p})+\eta_{p}\mathcal{B}\}over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) = roman_inf { italic_t | ( italic_x , italic_t ) ∈ caligraphic_E italic_p italic_i ( blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B } (32)
=\displaystyle== inf{t|xKdp,t𝕋p(x) s.t. (x,t)(x,t)ηp}.infimumconditional-set𝑡formulae-sequencesuperscript𝑥subscript𝐾subscript𝑑𝑝superscript𝑡subscriptsuperscript𝕋𝑝superscript𝑥 s.t. norm𝑥𝑡superscript𝑥superscript𝑡subscript𝜂𝑝\displaystyle\inf\{t|\exists x^{\prime}\in K_{d_{p}},t^{\prime}\geq\mathds{T}^% {\infty}_{p}(x^{\prime})\text{ s.t. }\|(x,t)-(x^{\prime},t^{\prime})\|\leq\eta% _{p}\}.roman_inf { italic_t | ∃ italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≥ blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) s.t. ∥ ( italic_x , italic_t ) - ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ ≤ italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT } .

Notice that (x,t)(x,t)ηpxxηpnorm𝑥𝑡superscript𝑥superscript𝑡subscript𝜂𝑝norm𝑥superscript𝑥subscript𝜂𝑝\|(x,t)-(x^{\prime},t^{\prime})\|\leq\eta_{p}\Rightarrow\|x-x^{\prime}\|\leq% \eta_{p}∥ ( italic_x , italic_t ) - ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ ≤ italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ⇒ ∥ italic_x - italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∥ ≤ italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT. We can simplify the conditions on (x,t)𝑥𝑡(x,t)( italic_x , italic_t ) as

𝕋~p(x)=subscriptsuperscript~𝕋𝑝𝑥absent\displaystyle\tilde{\mathds{T}}^{\infty}_{p}(x)=over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) = inf{t|x(x+ηp)Kdp,t𝕋p(x)\displaystyle\inf\{t|\forall x^{\prime}\in(x+\eta_{p}\mathcal{B})\cap K_{d_{p}% },t^{\prime}\geq\mathds{T}^{\infty}_{p}(x^{\prime})roman_inf { italic_t | ∀ italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≥ blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT )
 s.t. (x,t)(x,t)ηp}.\displaystyle\quad\text{ s.t. }\|(x,t)-(x^{\prime},t^{\prime})\|\leq\eta_{p}\}.s.t. ∥ ( italic_x , italic_t ) - ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ ≤ italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT } .

It again follows from (32) that |tt|ηp𝑡superscript𝑡subscript𝜂𝑝|t-t^{\prime}|\leq\eta_{p}| italic_t - italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT | ≤ italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT. Since 𝕋~p(x)subscriptsuperscript~𝕋𝑝𝑥\tilde{\mathds{T}}^{\infty}_{p}(x)over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) searches for an infimum of t𝑡titalic_t, taking an infimum of tsuperscript𝑡t^{\prime}italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT would render at the same result but remove unattainable candidate minimizers. Thus, the following holds by letting t=𝕋p(x)superscript𝑡subscriptsuperscript𝕋𝑝superscript𝑥t^{\prime}=\mathds{T}^{\infty}_{p}(x^{\prime})italic_t start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ):

𝕋~p(x)=inf{t|x(x+ηp)Kdp\displaystyle\tilde{\mathds{T}}^{\infty}_{p}(x)=\inf\{t|\forall x^{\prime}\in(% x+\eta_{p}\mathcal{B})\cap K_{d_{p}}over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) = roman_inf { italic_t | ∀ italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT (33)
 s.t. (x,t)(x,𝕋p(x))ηp}\displaystyle\quad\quad\quad\quad\quad\text{ s.t. }\|(x,t)-(x^{\prime},\mathds% {T}^{\infty}_{p}(x^{\prime}))\|\leq\eta_{p}\}s.t. ∥ ( italic_x , italic_t ) - ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) ∥ ≤ italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT }
=\displaystyle== inf{t|x(x+dp)Kdp,(x,t)(x,𝕋p(x))+ηp}infimumconditional-set𝑡formulae-sequencefor-allsuperscript𝑥𝑥subscript𝑑𝑝subscript𝐾subscript𝑑𝑝𝑥𝑡superscript𝑥subscriptsuperscript𝕋𝑝superscript𝑥subscript𝜂𝑝\displaystyle\inf\{t|\forall x^{\prime}\in(x+d_{p}\mathcal{B})\cap K_{d_{p}},(% x,t)\in(x^{\prime},\mathds{T}^{\infty}_{p}(x^{\prime}))+\eta_{p}\mathcal{B}\}roman_inf { italic_t | ∀ italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT , ( italic_x , italic_t ) ∈ ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B }
\displaystyle\leq minx(x+ηp)Kdp𝕋p(x).subscriptsuperscript𝑥𝑥subscript𝜂𝑝subscript𝐾subscript𝑑𝑝subscriptsuperscript𝕋𝑝superscript𝑥\displaystyle\min_{x^{\prime}\in(x+\eta_{p}\mathcal{B})\cap K_{d_{p}}}\mathds{% T}^{\infty}_{p}(x^{\prime}).roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) .

where the last inequality is a direct result of expansion of dpsubscript𝑑𝑝d_{p}\mathcal{B}italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B on (x,𝕋p(x))superscript𝑥subscriptsuperscript𝕋𝑝superscript𝑥(x^{\prime},\mathds{T}^{\infty}_{p}(x^{\prime}))( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ). This completes the proof of (31). ∎

We proceed to show that 𝕋*(x)superscript𝕋𝑥\mathds{T}^{*}(x)blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) is bounded both from below and above by 𝕋p(x)subscriptsuperscript𝕋𝑝𝑥\mathds{T}^{\infty}_{p}(x)blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ). The following claim shows 𝕋*(x)superscript𝕋𝑥\mathds{T}^{*}(x)blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) is lower bounded by 𝕋p(x)subscriptsuperscript𝕋𝑝𝑥\mathds{T}^{\infty}_{p}(x)blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ).

Claim .2.

It holds that for any x𝒟(𝕋*)𝑥𝒟superscript𝕋x\in\mathcal{D}(\mathds{T}^{*})italic_x ∈ caligraphic_D ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ),

𝕋*(x)minx(x+ηp)Kdp𝕋p(x)ηp.superscript𝕋𝑥subscriptsuperscript𝑥𝑥subscript𝜂𝑝subscript𝐾subscript𝑑𝑝subscriptsuperscript𝕋𝑝superscript𝑥subscript𝜂𝑝\displaystyle\mathds{T}^{*}(x)\geq\min_{x^{\prime}\in(x+\eta_{p}\mathcal{B})% \cap K_{d_{p}}}\mathds{T}^{\infty}_{p}(x^{\prime})-\eta_{p}.blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) ≥ roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT .
Proof.

Following the argument in the proof of Corollary 3.7 in [15], we have pi(𝕋*)pi(𝕋~p)𝑝𝑖superscript𝕋𝑝𝑖subscriptsuperscript~𝕋𝑝\mathcal{E}pi(\mathds{T}^{*})\subseteq\mathcal{E}pi(\tilde{\mathds{T}}^{\infty% }_{p})caligraphic_E italic_p italic_i ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ⊆ caligraphic_E italic_p italic_i ( over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) and it renders at 𝕋~p(x)𝕋*(x)subscriptsuperscript~𝕋𝑝𝑥superscript𝕋𝑥\tilde{\mathds{T}}^{\infty}_{p}(x)\leq\mathds{T}^{*}(x)over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) ≤ blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ). Then it follows from (30) in Claim .1 that minx(x+ηp)Kdp𝕋p(x)𝕋~p(x)+ηp𝕋*(x)+ηpsubscriptsuperscript𝑥𝑥subscript𝜂𝑝subscript𝐾subscript𝑑𝑝subscriptsuperscript𝕋𝑝superscript𝑥subscriptsuperscript~𝕋𝑝𝑥subscript𝜂𝑝superscript𝕋𝑥subscript𝜂𝑝\min_{x^{\prime}\in(x+\eta_{p}\mathcal{B})\cap K_{d_{p}}}\mathds{T}^{\infty}_{% p}(x^{\prime})\leq\tilde{\mathds{T}}^{\infty}_{p}(x)+\eta_{p}\leq\mathds{T}^{*% }(x)+\eta_{p}roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ≤ over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ≤ blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT. This completes the proof. ∎

The following claim shows the 𝕋*(x)superscript𝕋𝑥\mathds{T}^{*}(x)blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) is upper bounded by the limit of 𝕋p(x)subscriptsuperscript𝕋𝑝𝑥\mathds{T}^{\infty}_{p}(x)blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ).

Claim .3.

For any x𝒟(𝕋*)𝑥𝒟superscript𝕋x\in\mathcal{D}(\mathds{T}^{*})italic_x ∈ caligraphic_D ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ), it holds that

𝕋*(x)lim infp+minx(x+ηp)Kdp𝕋p(x).superscript𝕋𝑥subscriptlimit-infimum𝑝subscriptsuperscript𝑥𝑥subscript𝜂𝑝subscript𝐾subscript𝑑𝑝subscriptsuperscript𝕋𝑝superscript𝑥\displaystyle\mathds{T}^{*}(x)\leq\liminf_{p\to+\infty}\min_{x^{\prime}\in(x+% \eta_{p}\mathcal{B})\cap K_{d_{p}}}\mathds{T}^{\infty}_{p}(x^{\prime}).blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) ≤ lim inf start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) .
Proof.

Following the argument in the proof of Corollary 3.7 in [15], we have Limsupp+pi(𝕋~p)pi(𝕋*)subscriptLimsup𝑝𝑝𝑖subscriptsuperscript~𝕋𝑝𝑝𝑖superscript𝕋\operatorname*{Limsup}_{p\to+\infty}\mathcal{E}pi(\tilde{\mathds{T}}^{\infty}_% {p})\subseteq\mathcal{E}pi(\mathds{T}^{*})roman_Limsup start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT caligraphic_E italic_p italic_i ( over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) ⊆ caligraphic_E italic_p italic_i ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ). By the definition of LimsupLimsup\operatorname*{Limsup}roman_Limsup, we have (x,t)Limsupp+pi(𝕋~p)for-all𝑥𝑡subscriptLimsup𝑝𝑝𝑖subscriptsuperscript~𝕋𝑝\forall(x,t)\in\operatorname*{Limsup}_{p\to+\infty}\mathcal{E}pi(\tilde{% \mathds{T}}^{\infty}_{p})∀ ( italic_x , italic_t ) ∈ roman_Limsup start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT caligraphic_E italic_p italic_i ( over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ), lim infp+dist((x,t),pi(𝕋~p))=0subscriptlimit-infimum𝑝dist𝑥𝑡𝑝𝑖subscriptsuperscript~𝕋𝑝0\liminf_{p\to+\infty}\operatorname{dist}((x,t),\mathcal{E}pi(\tilde{\mathds{T}% }^{\infty}_{p}))=0lim inf start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT roman_dist ( ( italic_x , italic_t ) , caligraphic_E italic_p italic_i ( over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) ) = 0. In other words, there exists a sequence {(xp,tp)pi(𝕋~p)}subscript𝑥𝑝subscript𝑡𝑝𝑝𝑖subscriptsuperscript~𝕋𝑝\{(x_{p},t_{p})\in\mathcal{E}pi(\tilde{\mathds{T}}^{\infty}_{p})\}{ ( italic_x start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) ∈ caligraphic_E italic_p italic_i ( over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) } s.t. lim infp+dist((x,t),(xp,tp))=0subscriptlimit-infimum𝑝dist𝑥𝑡subscript𝑥𝑝subscript𝑡𝑝0\liminf_{p\to+\infty}\operatorname{dist}((x,t),(x_{p},t_{p}))=0lim inf start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT roman_dist ( ( italic_x , italic_t ) , ( italic_x start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) ) = 0.

Fix x𝒟(𝕋*)𝑥𝒟superscript𝕋x\in\mathcal{D}(\mathds{T}^{*})italic_x ∈ caligraphic_D ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) and consider a sequence {(xp,tp)}subscript𝑥𝑝subscript𝑡𝑝\{(x_{p},t_{p})\}{ ( italic_x start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) } s.t. xp=x,tp=𝕋~p(xp)formulae-sequencesubscript𝑥𝑝𝑥subscript𝑡𝑝subscriptsuperscript~𝕋𝑝subscript𝑥𝑝x_{p}=x,t_{p}=\tilde{\mathds{T}}^{\infty}_{p}(x_{p})italic_x start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = italic_x , italic_t start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT = over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ). It follows from Claim .2 that tp𝕋*(x)<+subscript𝑡𝑝superscript𝕋𝑥t_{p}\leq\mathds{T}^{*}(x)<+\inftyitalic_t start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ≤ blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) < + ∞, where the last inequality follows from the definition of 𝒟(𝕋*)𝒟superscript𝕋\mathcal{D}(\mathds{T}^{*})caligraphic_D ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ). Therefore, {(xp,tp)}subscript𝑥𝑝subscript𝑡𝑝\{(x_{p},t_{p})\}{ ( italic_x start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) } is bounded. Then it follows from the supplementary statement about the definition of LimsupLimsup\operatorname*{Limsup}roman_Limsup on page 236 in [15] that Limsupp+{(xp,tp)}subscriptLimsup𝑝subscript𝑥𝑝subscript𝑡𝑝\mathrm{Limsup}_{p\to+\infty}\{(x_{p},t_{p})\}\neq\emptysetroman_Limsup start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT { ( italic_x start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) } ≠ ∅. Since (xp,tp)pi(𝕋~p)subscript𝑥𝑝subscript𝑡𝑝𝑝𝑖subscriptsuperscript~𝕋𝑝(x_{p},t_{p})\in\mathcal{E}pi(\tilde{\mathds{T}}^{\infty}_{p})( italic_x start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) ∈ caligraphic_E italic_p italic_i ( over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ), we have

Limsupp+{(xp,tp)}Limsupp+pi(𝕋~p)pi(𝕋*).subscriptLimsup𝑝subscript𝑥𝑝subscript𝑡𝑝subscriptLimsup𝑝𝑝𝑖subscriptsuperscript~𝕋𝑝𝑝𝑖superscript𝕋\displaystyle\operatorname*{Limsup}_{p\to+\infty}\{(x_{p},t_{p})\}\subseteq% \operatorname*{Limsup}_{p\to+\infty}\mathcal{E}pi(\tilde{\mathds{T}}^{\infty}_% {p})\subseteq\mathcal{E}pi(\mathds{T}^{*}).roman_Limsup start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT { ( italic_x start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) } ⊆ roman_Limsup start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT caligraphic_E italic_p italic_i ( over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) ⊆ caligraphic_E italic_p italic_i ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) .

Let (x*,t*)Limsupp+{(xp,tp)}superscript𝑥superscript𝑡subscriptLimsup𝑝subscript𝑥𝑝subscript𝑡𝑝(x^{*},t^{*})\in\operatorname*{Limsup}_{p\to+\infty}\{(x_{p},t_{p})\}( italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_t start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) ∈ roman_Limsup start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT { ( italic_x start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) } s.t. t*t,(x,t)Limsupp+{(xp,tp)}formulae-sequencesuperscript𝑡𝑡for-all𝑥𝑡subscriptLimsup𝑝subscript𝑥𝑝subscript𝑡𝑝t^{*}\leq t,\forall(x,t)\in\operatorname*{Limsup}_{p\to+\infty}\{(x_{p},t_{p})\}italic_t start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ≤ italic_t , ∀ ( italic_x , italic_t ) ∈ roman_Limsup start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT { ( italic_x start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , italic_t start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) }. Therefore, t*𝕋*(x*)=𝕋*(x)superscript𝑡superscript𝕋superscript𝑥superscript𝕋𝑥t^{*}\geq\mathds{T}^{*}(x^{*})=\mathds{T}^{*}(x)italic_t start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ≥ blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ). Then we have lim infp+𝕋~p(x)=t*𝕋*(x)subscriptlimit-infimum𝑝subscriptsuperscript~𝕋𝑝𝑥superscript𝑡superscript𝕋𝑥\liminf_{p\to+\infty}\tilde{\mathds{T}}^{\infty}_{p}(x)=t^{*}\geq\mathds{T}^{*% }(x)lim inf start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT over~ start_ARG blackboard_T end_ARG start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) = italic_t start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ≥ blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ). It follows from (31) in Claim .1 that

𝕋*(x)lim infp+minx(x+ηp)Kdp𝕋p(x).superscript𝕋𝑥subscriptlimit-infimum𝑝subscriptsuperscript𝑥𝑥subscript𝜂𝑝subscript𝐾subscript𝑑𝑝subscriptsuperscript𝕋𝑝superscript𝑥\displaystyle\mathds{T}^{*}(x)\leq\liminf_{p\to+\infty}\min_{x^{\prime}\in(x+% \eta_{p}\mathcal{B})\cap K_{d_{p}}}\mathds{T}^{\infty}_{p}(x^{\prime}).blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) ≤ lim inf start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) .

This completes the proof. ∎

Combining Claims .2 and .3 renders at

minx(x+ηp)Kdp𝕋p(x)ηp𝕋*(x)subscriptsuperscript𝑥𝑥subscript𝜂𝑝subscript𝐾subscript𝑑𝑝subscriptsuperscript𝕋𝑝superscript𝑥subscript𝜂𝑝superscript𝕋𝑥\displaystyle\min_{x^{\prime}\in(x+\eta_{p}\mathcal{B})\cap K_{d_{p}}}\mathds{% T}^{\infty}_{p}(x^{\prime})-\eta_{p}\leq\mathds{T}^{*}(x)roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) - italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ≤ blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x )
\displaystyle\leq lim infp+minx(x+ηp)Kdp𝕋p(x).subscriptlimit-infimum𝑝subscriptsuperscript𝑥𝑥subscript𝜂𝑝subscript𝐾subscript𝑑𝑝subscriptsuperscript𝕋𝑝superscript𝑥\displaystyle\liminf_{p\to+\infty}\min_{x^{\prime}\in(x+\eta_{p}\mathcal{B})% \cap K_{d_{p}}}\mathds{T}^{\infty}_{p}(x^{\prime}).lim inf start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) .

By taking p+𝑝p\to+\inftyitalic_p → + ∞ on the left side, both ηpsubscript𝜂𝑝\eta_{p}italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT and dpsubscript𝑑𝑝d_{p}italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT disappear at the same rate and we complete the proof. ∎

We proceed to show the convergence of the value function estimate on each point xKdp𝑥subscript𝐾subscript𝑑𝑝x\in K_{d_{p}}italic_x ∈ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Different from Theorem .1, we remove the perturbation x(x+ηp)Kdpsuperscript𝑥𝑥subscript𝜂𝑝subscript𝐾subscript𝑑𝑝x^{\prime}\in(x+\eta_{p}\mathcal{B})\cap K_{d_{p}}italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT in the theorem statement and directly analyze the values 𝕋p(x),xKdpsuperscriptsubscript𝕋𝑝𝑥for-all𝑥subscript𝐾subscript𝑑𝑝\mathds{T}_{p}^{\infty}(x),\forall x\in K_{d_{p}}blackboard_T start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_x ) , ∀ italic_x ∈ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Towards this end, we first prove the following lemma quantifying the lowerbound of 𝕋*(x),xKdpsuperscript𝕋𝑥for-all𝑥subscript𝐾subscript𝑑𝑝\mathds{T}^{*}(x),\forall x\in K_{d_{p}}blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) , ∀ italic_x ∈ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT.

Lemma .1.

Given all conditions in Theorem .1, the following relationship holds:

𝕋*(x)𝕋p(x)dp,xKdp.formulae-sequencesuperscript𝕋𝑥subscriptsuperscript𝕋𝑝𝑥subscript𝑑𝑝for-all𝑥subscript𝐾subscript𝑑𝑝\displaystyle\mathds{T}^{*}(x)\geq\mathds{T}^{\infty}_{p}(x)-d_{p},\forall x% \in K_{d_{p}}.blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) ≥ blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) - italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , ∀ italic_x ∈ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT .
Proof.

It follows from Proposition 2.21 in [15] that (ViabF()+dp)dp𝑉𝑖𝑎subscript𝑏𝐹subscript𝑑𝑝subscriptsubscript𝑑𝑝(Viab_{F}(\mathcal{H})+d_{p}\mathcal{B})\cap\mathcal{H}_{d_{p}}( italic_V italic_i italic_a italic_b start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ( caligraphic_H ) + italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ caligraphic_H start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT is a discrete viability domain for Γϵp,dpsubscriptΓsubscriptitalic-ϵ𝑝subscript𝑑𝑝\Gamma_{\epsilon_{p},d_{p}}roman_Γ start_POSTSUBSCRIPT italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Thus, it holds that (ViabF()+dp)dpViabΓϵp,dp(dp)𝑉𝑖𝑎subscript𝑏𝐹subscript𝑑𝑝subscriptsubscript𝑑𝑝𝑉𝑖𝑎subscript𝑏subscriptΓsubscriptitalic-ϵ𝑝subscript𝑑𝑝subscriptsubscript𝑑𝑝(Viab_{F}(\mathcal{H})+d_{p}\mathcal{B})\cap\mathcal{H}_{d_{p}}\subseteq Viab_% {\Gamma_{\epsilon_{p},d_{p}}}(\mathcal{H}_{d_{p}})( italic_V italic_i italic_a italic_b start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT ( caligraphic_H ) + italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ caligraphic_H start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊆ italic_V italic_i italic_a italic_b start_POSTSUBSCRIPT roman_Γ start_POSTSUBSCRIPT italic_ϵ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT , italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( caligraphic_H start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT ). It follows from Theorem 3.2 and Lemma 3.6 that we can replace the discrete viability kernals with epigraphs of traveling time functions; i.e.,

(pi(𝕋*)+dp)dppi(𝕋p).𝑝𝑖superscript𝕋subscript𝑑𝑝subscriptsubscript𝑑𝑝𝑝𝑖subscriptsuperscript𝕋𝑝\displaystyle(\mathcal{E}pi(\mathds{T}^{*})+d_{p}\mathcal{B})\cap\mathcal{H}_{% d_{p}}\subseteq\mathcal{E}pi(\mathds{T}^{\infty}_{p}).( caligraphic_E italic_p italic_i ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) + italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B ) ∩ caligraphic_H start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊆ caligraphic_E italic_p italic_i ( blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ) .

Fix xKdp𝑥subscript𝐾subscript𝑑𝑝x\in K_{d_{p}}italic_x ∈ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT end_POSTSUBSCRIPT and there exists t𝕋*(x)+dp𝑡superscript𝕋𝑥subscript𝑑𝑝t\in\mathds{T}^{*}(x)+d_{p}\mathcal{B}italic_t ∈ blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) + italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B s.t. (x,t)pi(𝕋p)𝑥𝑡𝑝𝑖subscriptsuperscript𝕋𝑝(x,t)\in\mathcal{E}pi(\mathds{T}^{\infty}_{p})( italic_x , italic_t ) ∈ caligraphic_E italic_p italic_i ( blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ); that is, t𝕋p(x)𝑡subscriptsuperscript𝕋𝑝𝑥t\geq\mathds{T}^{\infty}_{p}(x)italic_t ≥ blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ). This implies 𝕋*(x)+dp𝕋p(x)superscript𝕋𝑥subscript𝑑𝑝subscriptsuperscript𝕋𝑝𝑥\mathds{T}^{*}(x)+d_{p}\geq\mathds{T}^{\infty}_{p}(x)blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) + italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ≥ blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ). This completes the proof. ∎

The following corollary characterized the pointwise convergence of the value function estimate.

Corollary .1.

Given the conditions in Theorem .1 and xKdp¯𝒟(𝕋*)𝑥subscript𝐾subscript𝑑normal-¯𝑝𝒟superscript𝕋x\in K_{d_{\bar{p}}}\cap\mathcal{D}(\mathds{T}^{*})italic_x ∈ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT over¯ start_ARG italic_p end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∩ caligraphic_D ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) for some p¯1normal-¯𝑝1\bar{p}\geq 1over¯ start_ARG italic_p end_ARG ≥ 1. Then the following holds for every xKdp¯𝒟(𝕋*)𝑥subscript𝐾subscript𝑑normal-¯𝑝𝒟superscript𝕋x\in K_{d_{\bar{p}}}\cap\mathcal{D}(\mathds{T}^{*})italic_x ∈ italic_K start_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT over¯ start_ARG italic_p end_ARG end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∩ caligraphic_D ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ):

𝕋*(x)=limp+𝕋p(x).superscript𝕋𝑥subscript𝑝subscriptsuperscript𝕋𝑝𝑥\displaystyle\mathds{T}^{*}(x)=\lim_{p\to+\infty}\mathds{T}^{\infty}_{p}(x).blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) = roman_lim start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) .
Proof.

The proof generally follows the arguments towards Theorem .1. For T~psuperscriptsubscript~𝑇𝑝\tilde{T}_{p}^{\infty}over~ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT defined as pi(T~p)=pi(𝕋p)+ηp𝑝𝑖superscriptsubscript~𝑇𝑝𝑝𝑖superscriptsubscript𝕋𝑝subscript𝜂𝑝\mathcal{E}pi(\tilde{T}_{p}^{\infty})=\mathcal{E}pi(\mathds{T}_{p}^{\infty})+% \eta_{p}\mathcal{B}caligraphic_E italic_p italic_i ( over~ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) = caligraphic_E italic_p italic_i ( blackboard_T start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ) + italic_η start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT caligraphic_B, T~p(x)𝕋p(x)superscriptsubscript~𝑇𝑝𝑥superscriptsubscript𝕋𝑝𝑥\tilde{T}_{p}^{\infty}(x)\leq\mathds{T}_{p}^{\infty}(x)over~ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_x ) ≤ blackboard_T start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_x ). Then it follows from Claim .3 that lim infp+𝕋p(x)lim infp+T~p(x)𝕋*(x)subscriptlimit-infimum𝑝superscriptsubscript𝕋𝑝𝑥subscriptlimit-infimum𝑝superscriptsubscript~𝑇𝑝𝑥superscript𝕋𝑥\liminf_{p\to+\infty}\mathds{T}_{p}^{\infty}(x)\geq\liminf_{p\to+\infty}\tilde% {T}_{p}^{\infty}(x)\geq\mathds{T}^{*}(x)lim inf start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT blackboard_T start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_x ) ≥ lim inf start_POSTSUBSCRIPT italic_p → + ∞ end_POSTSUBSCRIPT over~ start_ARG italic_T end_ARG start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT ( italic_x ) ≥ blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ). It follows from Lemma .1 that 𝕋p(x)𝕋*(x)+dpsubscriptsuperscript𝕋𝑝𝑥superscript𝕋𝑥subscript𝑑𝑝\mathds{T}^{\infty}_{p}(x)\leq\mathds{T}^{*}(x)+d_{p}blackboard_T start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT ( italic_x ) ≤ blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) + italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT. Taking limits p+𝑝p\to+\inftyitalic_p → + ∞, dp0subscript𝑑𝑝0d_{p}\to 0italic_d start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT → 0 completes the proof. ∎

Notice that Theorem .1 only characterizes the pointwise convergence over 𝒟(𝕋*)𝒟superscript𝕋\mathcal{D}(\mathds{T}^{*})caligraphic_D ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ). On K𝒟(𝕋*)𝐾𝒟superscript𝕋K\setminus\mathcal{D}(\mathds{T}^{*})italic_K ∖ caligraphic_D ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ), the value of 𝕋*superscript𝕋\mathds{T}^{*}blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is not well defined and the convergence cannot be analyzed. The Kruzhkov transform can address this issue and let Θk*(x)=1,xK𝒟(𝕋*)formulae-sequencesubscriptsuperscriptΘ𝑘𝑥1for-all𝑥𝐾𝒟superscript𝕋\varTheta^{*}_{k}(x)=1,\forall x\in K\setminus\mathcal{D}(\mathds{T}^{*})roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x ) = 1 , ∀ italic_x ∈ italic_K ∖ caligraphic_D ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ). Without proof, the theorem is stated below.

Lemma .2.

Given the convergence in the epigraphic sense, i.e., pi(𝕋*)=Limk+pi(𝕋k*)𝑝𝑖superscript𝕋subscriptnormal-Limnormal-→𝑘𝑝𝑖subscriptsuperscript𝕋𝑘\mathcal{E}pi(\mathds{T}^{*})=\operatorname*{Lim}_{k\to+\infty}\mathcal{E}pi(% \mathds{T}^{*}_{k})caligraphic_E italic_p italic_i ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ) = roman_Lim start_POSTSUBSCRIPT italic_k → + ∞ end_POSTSUBSCRIPT caligraphic_E italic_p italic_i ( blackboard_T start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ), Θk*subscriptsuperscriptnormal-Θ𝑘\varTheta^{*}_{k}roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT converges pointwise to Θ*superscriptnormal-Θ\varTheta^{*}roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT, i.e., x𝒳for-all𝑥𝒳\forall x\in\mathcal{X}∀ italic_x ∈ caligraphic_X,

Θ*(x)=limk+minx(x+k)VkΘk*(x),superscriptΘ𝑥subscript𝑘subscriptsuperscript𝑥𝑥subscript𝑘subscript𝑉𝑘subscriptsuperscriptΘ𝑘superscript𝑥\displaystyle\varTheta^{*}(x)=\lim_{k\to+\infty}\min_{x^{\prime}\in(x+\ell_{k}% \mathcal{B})\cap V_{k}}\varTheta^{*}_{k}(x^{\prime}),roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ( italic_x ) = roman_lim start_POSTSUBSCRIPT italic_k → + ∞ end_POSTSUBSCRIPT roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + roman_ℓ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ,

where limk+kdk[1,+)subscriptnormal-→𝑘subscriptnormal-ℓ𝑘subscript𝑑𝑘1\lim_{k\to+\infty}\frac{\ell_{k}}{d_{k}}\in[1,+\infty)roman_lim start_POSTSUBSCRIPT italic_k → + ∞ end_POSTSUBSCRIPT divide start_ARG roman_ℓ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ∈ [ 1 , + ∞ ) and kdksubscriptnormal-ℓ𝑘subscript𝑑𝑘\ell_{k}\geq d_{k}roman_ℓ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≥ italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT.

The following corollary shows the uniform convergence for the transformed value function estimates.

Corollary .2.

The transformed value function estimate {Θk*}subscriptsuperscriptnormal-Θ𝑘\{\varTheta^{*}_{k}\}{ roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } converges uniformly to Θ*superscriptnormal-Θ\varTheta^{*}roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT almost everywhere on 𝒳𝒳\mathcal{X}caligraphic_X; i.e.,

limk+Θ*minx(x+k)VkΘk*(x)𝒳𝒵=0,subscript𝑘subscriptnormsuperscriptΘsubscriptsuperscript𝑥𝑥subscript𝑘subscript𝑉𝑘subscriptsuperscriptΘ𝑘superscript𝑥𝒳𝒵0\displaystyle\lim_{k\to+\infty}\|\varTheta^{*}-\min_{x^{\prime}\in(x+\ell_{k}% \mathcal{B})\cap V_{k}}\varTheta^{*}_{k}(x^{\prime})\|_{\mathcal{X}\setminus% \mathcal{Z}}=0,roman_lim start_POSTSUBSCRIPT italic_k → + ∞ end_POSTSUBSCRIPT ∥ roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT - roman_min start_POSTSUBSCRIPT italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ( italic_x + roman_ℓ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT caligraphic_B ) ∩ italic_V start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_x start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ∥ start_POSTSUBSCRIPT caligraphic_X ∖ caligraphic_Z end_POSTSUBSCRIPT = 0 ,

where 𝒵𝒵\mathcal{Z}caligraphic_Z has sufficiently small measure, limk+kdk[1,+)subscriptnormal-→𝑘subscriptnormal-ℓ𝑘subscript𝑑𝑘1\lim_{k\to+\infty}\frac{\ell_{k}}{d_{k}}\in[1,+\infty)roman_lim start_POSTSUBSCRIPT italic_k → + ∞ end_POSTSUBSCRIPT divide start_ARG roman_ℓ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG start_ARG italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG ∈ [ 1 , + ∞ ) and kdksubscriptnormal-ℓ𝑘subscript𝑑𝑘\ell_{k}\geq d_{k}roman_ℓ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≥ italic_d start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT.

Proof.

The proof leverages Egoroff’s theorem [46]. Since 𝒳𝒳\mathcal{X}caligraphic_X is compact, 𝒳𝒳\mathcal{X}caligraphic_X has finite measure. It follows from Theorem .1 that {Θk*}subscriptsuperscriptΘ𝑘\{\varTheta^{*}_{k}\}{ roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } are measurable functions converging on 𝒳𝒳\mathcal{X}caligraphic_X pointwise to Θ*superscriptΘ\varTheta^{*}roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT. Then {Θk*}subscriptsuperscriptΘ𝑘\{\varTheta^{*}_{k}\}{ roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } converges uniformly to Θ*superscriptΘ\varTheta^{*}roman_Θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT almost everywhere to 𝒳𝒵𝒳𝒵\mathcal{X}\setminus\mathcal{Z}caligraphic_X ∖ caligraphic_Z by the Egoroff’s theorem, where 𝒵𝒵\mathcal{Z}caligraphic_Z has sufficiently small measure. This completes the proof of the first relation. ∎