Adjoint Sensitivity Analysis on Multi-Scale bioprocess stochastic reaction network

Keilung Choy
Northeastern University &Wei Xie
Northeastern University
Corresponding author. Email: [email protected]
Abstract

Motivated by the pressing challenges in the digital twin development for biomanufacturing systems, we introduce an adjoint sensitivity analysis (SA) approach to expedite the learning of mechanistic model parameters. In this paper, we consider enzymatic stochastic reaction networks representing a multi-scale bioprocess mechanistic model that allows us to integrate disparate data from diverse production processes and leverage the information from existing macro-kinetic and genome-scale models. To support forward prediction and backward reasoning, we develop a convergent adjoint SA algorithm studying how the perturbations of model parameters and inputs (e.g., initial state) propagate through enzymatic reaction networks and impact on output trajectory predictions. This SA can provide a sample efficient and interpretable way to assess the sensitivities between inputs and outputs accounting for their causal dependencies. Our empirical study underscores the resilience of these sensitivities and illuminates a deeper comprehension of the regulatory mechanisms behind bioprocess through sensitivities.

Keywords Multi-Scale Bioprocess  \cdot Enzymatic Reaction Network  \cdot Diffusion Approximation  \cdot Stochastic Differential Equation  \cdot Sensitivity Analysis  \cdot Gradient-Based Optimization

1 INTRODUCTION

With prior knowledge of the reaction network structure and regulatory mechanisms, the trajectory output of bioprocess hinges on three key inputs: (1) initial states; (2) actions; and (3) parameters of the mechanistic model(Xie et al., 2022). Suppose the effect of actions, such as feeding strategies, on state change is immediate and known. Therefore, our objective is to develop an interpretable and sample-efficient sensitivity analysis (SA) approach for the multi-scale bioprocess mechanistic model that focuses on investigating the sensitivities between inputs (i.e., initial states and model parameters) and outputs.

A multi-scale bioprocess mechanistic model can facilitate the development of digital twins and BioFoundries for biomanufacturing processes. Within this paradigm, the model’s foundation is constructed upon fundamental building blocks, i.e., molecular reaction networks. There exist various challenges to conduct SA on the mechanistic model of enzymatic stochastic reaction networks. One significant feature is its double-stochasticity, which means at any time molecular reaction rates are contingent upon random states, such as species concentrations and environmental variables. Upon formulating the multi-scale bioprocess mechanistic model in the form of stochastic differential equations (SDEs), the drift and diffusion terms could be built based on Michaelis–Menten kinetics, which is the most frequently used kinetic model of enzymatic reactions found in existing literature (Kyriakopoulos et al., 2018). It leverages our existing understanding of the reaction rate structure and encapsulates the double stochasticity inherent in stochastic reaction networks, i.e., both the drift and diffusion terms are contingent upon the current states. Consequently, deriving analytical solutions for such SDEs, characterizing stochastic molecular reaction network dynamics and variations, could be difficult. Moreover, the interactions between input factors and intermediate states further introduce high complexity in SA.

Existing sensitivity analysis approaches could be divided into two categories: local and global sensitivity analysis. Global sensitivity analysis focuses on the impact of significant variations in model inputs. For example, the Sobol method (Sobol, 2001) is a global sensitivity analysis approach based on variance decomposition, which decomposes the variance of the model output into the contributions of different model inputs and parameters. In contrast, the objective of local sensitivity analysis is to quantify the effects of minor perturbations in model inputs and parameters on model predictions. There are a variety of local sensitivity analysis approaches including finite direct differential method (Kramer et al., 1984), Nominal Range Sensitivity Method (John Bailer, 2001), and automatic differentiation (Kedem, 1980). Unfortunately, most of these existing approaches are not applicable to our study since they scale poorly, either in terms of computational time or memory usage, as the number of parameters and states within the model increases. Given that the multi-scale bioprocess model typically represents a complex system with many inputs and parameters, these limitations pose a significant barrier to the learning process of underlying mechanisms.

In this paper, we formulate the multi-scale bioprocess mechanistic model in SDEs form, accounting for underlying causal interdependencies of an enzymatic stochastic reaction network, and then develop an adjoint SA approach studying the sensitivities between inputs and outputs. It can correctly and efficiently quantify the contribution and criticality of each input and model parameter impacting on the prediction errors of multivariate output trajectories, such as productivity and product critical quality attributes (CQAs). To support forward prediction and backward reasoning, this adjoint sensitivity analysis over the operator of SDEs, characterizing bioprocess mechanisms, exhibits robust scalability even when the complexity of the mechanistic model increases. In addition, since it leverages the structural information of the regulatory reaction network, the adjoint SA can provide sample efficient and interpretable guidance to search inputs and model parameters, accounting for their interactions, to speed up the learning process.

This paper addresses two key challenges in bioprocess modeling through the development of the adjoint SA approach. Firstly, it addresses the issue that the dimensionality of model parameters can be so high that searching for the optimal parameter estimation becomes challenging. The local SA approach assesses the impact of model parameters on prediction accuracy and expedites process mechanism learning. This facilitates digital twin development and more efficient experimental design, which is crucial in the costly, highly regulated biomanufacturing industry. Secondly, the proposed adjoint SA can incorporate the intricate interdependencies of bioprocesses. By leveraging structural information within enzymatic molecular reaction networks, the paper mitigates complexity and streamlines the calculation of input-output sensitivities, alleviating computational bottlenecks encountered in previous local SA studies.

The structure of the paper is organized as follows. In Section 2, we initiate by delineating the multi-scale bioprocess mechanistic model and summarizing its key characteristics. Subsequently, in Section 3, we proceed to construct a metamodel to the multi-scale bioprocess, leveraging the insights from the preceding section. Section 4 is dedicated to local sensitivity analysis algorithm development, which will help us investigate the barriers to the reduction of model prediction errors and provide a guidance to speed up the search for optimal inputs. Building upon this part, Section 4.2 introduces an adjoint SA algorithm on SDEs designed to enhance computational efficiency. Improvements brought by the algorithm will be validated in Section 5 through the empirical study of its finite-sample performance. Finally, in Section 6, we synthesize the findings and insights gathered throughout this study, leading to the paper’s conclusion.

2 Problem Description

2.1 Multi-Scale Bioprocess Stochastic Reaction Network

A multi-scale bioprocessing mechanistic model characterizes the causal dependence from molecular to macroscopic kinetics and it is built on the fundamental building block, i.e., enzymatic molecular reaction networks. Suppose the system is composed of I𝐼Iitalic_I species, denoted by 𝑿=(X1,X2,,XI)𝑿superscriptsubscript𝑋1subscript𝑋2subscript𝑋𝐼top\boldsymbol{X}=(X_{1},X_{2},\ldots,X_{I})^{\top}bold_italic_X = ( italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, interacting with each other through J𝐽Jitalic_J reactions. At any time t𝑡titalic_t, let 𝒔t=(st1,st2,,stI)+Isubscript𝒔𝑡superscriptsuperscriptsubscript𝑠𝑡1superscriptsubscript𝑠𝑡2superscriptsubscript𝑠𝑡𝐼topsuperscriptsubscript𝐼\boldsymbol{s}_{t}=(s_{t}^{1},s_{t}^{2},\ldots,s_{t}^{I})^{\top}\in\mathbb{R}_% {+}^{I}bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = ( italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , … , italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUBSCRIPT + end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT be the bioprocess state, where stisuperscriptsubscript𝑠𝑡𝑖s_{t}^{i}italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT denotes the number of molecules of species i𝑖iitalic_i. Each j𝑗jitalic_j-th reaction with j=1,2,,J𝑗12𝐽j=1,2,\ldots,Jitalic_j = 1 , 2 , … , italic_J is characterized by a reaction vector 𝑵jIsubscript𝑵𝑗superscript𝐼\boldsymbol{N}_{j}\in\mathbb{R}^{I}bold_italic_N start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_I end_POSTSUPERSCRIPT, describing the change in the numbers of I𝐼Iitalic_I species’ molecules when a j𝑗jitalic_j-th molecular reaction occurs. The associated reaction rate denoted by vjsubscript𝑣𝑗v_{j}italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, depending on state, such as the current number of molecules of each species, describes the rate at which the j𝑗jitalic_j-th reaction occurs. Specifically, for the j𝑗jitalic_j-th reaction equation given by

pj1X1+pj2X2++pjIXI𝒗jqj1X1+qj2X2++qjIXI,subscript𝒗𝑗subscript𝑝𝑗1subscript𝑋1subscript𝑝𝑗2subscript𝑋2subscript𝑝𝑗𝐼subscript𝑋𝐼subscript𝑞𝑗1subscript𝑋1subscript𝑞𝑗2subscript𝑋2subscript𝑞𝑗𝐼subscript𝑋𝐼p_{j1}X_{1}+p_{j2}X_{2}+\cdots+p_{jI}X_{I}\xrightarrow{\bm{v}_{j}}q_{j1}X_{1}+% q_{j2}X_{2}+\cdots+q_{jI}X_{I},italic_p start_POSTSUBSCRIPT italic_j 1 end_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_p start_POSTSUBSCRIPT italic_j 2 end_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + ⋯ + italic_p start_POSTSUBSCRIPT italic_j italic_I end_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT start_ARROW start_OVERACCENT bold_italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_OVERACCENT → end_ARROW italic_q start_POSTSUBSCRIPT italic_j 1 end_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_q start_POSTSUBSCRIPT italic_j 2 end_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + ⋯ + italic_q start_POSTSUBSCRIPT italic_j italic_I end_POSTSUBSCRIPT italic_X start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT ,

the reaction relational structure, specified by the vector 𝑵j=(qj1pj1,qj2pj2,,qjIpiI)subscript𝑵𝑗superscriptsubscript𝑞𝑗1subscript𝑝𝑗1subscript𝑞𝑗2subscript𝑝𝑗2subscript𝑞𝑗𝐼subscript𝑝𝑖𝐼top\boldsymbol{N}_{j}=(q_{j1}-p_{j1},q_{j2}-p_{j2},\ldots,q_{jI}-p_{iI})^{\top}bold_italic_N start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT = ( italic_q start_POSTSUBSCRIPT italic_j 1 end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_j 1 end_POSTSUBSCRIPT , italic_q start_POSTSUBSCRIPT italic_j 2 end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_j 2 end_POSTSUBSCRIPT , … , italic_q start_POSTSUBSCRIPT italic_j italic_I end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT italic_i italic_I end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, is known for j=1,2,,J𝑗12𝐽j=1,2,\ldots,Jitalic_j = 1 , 2 , … , italic_J. Thus, the stoichiometry matrix 𝑵=(𝑵1,𝑵2,,𝑵J)I×J𝑵subscript𝑵1subscript𝑵2subscript𝑵𝐽superscript𝐼𝐽\boldsymbol{N}=\left(\boldsymbol{N}_{1},\boldsymbol{N}_{2},\ldots,\boldsymbol{% N}_{J}\right)\in\mathbb{R}^{I\times J}bold_italic_N = ( bold_italic_N start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_N start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , bold_italic_N start_POSTSUBSCRIPT italic_J end_POSTSUBSCRIPT ) ∈ blackboard_R start_POSTSUPERSCRIPT italic_I × italic_J end_POSTSUPERSCRIPT characterizes the structure information of the reaction network composed of J𝐽Jitalic_J reactions. The (i,j)𝑖𝑗(i,j)( italic_i , italic_j )-th element of 𝑵𝑵\boldsymbol{N}bold_italic_N, denoted by Nijsubscript𝑁𝑖𝑗N_{ij}italic_N start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT, represents the number of molecules of the i𝑖iitalic_i-th species that are either consumed (indicated by a negative value) or produced (indicated by a positive value) in each random occurrence of the j𝑗jitalic_j-th reaction.

In this paper, we suppose that the structure of the reaction network represented by matrix 𝑵𝑵\boldsymbol{N}bold_italic_N is known. The regulation mechanism of each j𝑗jitalic_j-th reaction is characterized by the reaction rate function 𝒗jsubscript𝒗𝑗\bm{v}_{j}bold_italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, which is associated with the current system state 𝒔tsubscript𝒔𝑡\boldsymbol{s}_{t}bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and the mechanistic model parameters denoted by 𝜽tsubscript𝜽𝑡\boldsymbol{\theta}_{t}bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. Let 𝑹tsubscript𝑹𝑡\bm{R}_{t}bold_italic_R start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT be a vector of the occurrences of each molecular reaction in a given short time interval (t,t+Δt]𝑡𝑡Δ𝑡(t,t+\Delta t]( italic_t , italic_t + roman_Δ italic_t ] and the system state is updated from 𝒔tsubscript𝒔𝑡\boldsymbol{s}_{t}bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT to 𝒔t+1subscript𝒔𝑡1\boldsymbol{s}_{t+1}bold_italic_s start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT. Since a molecular reaction will occur when one molecule collides, binds, and reacts with another one while molecules move around randomly, driven by stochastic thermodynamics of Brownian motion Golightly and Wilkinson (2005), the occurrences of molecular reactions are modeled by non-homogeneous Poisson process. Therefore, the state transition model becomes,

𝒔t+1=𝒔t+𝑵𝑹𝒕with𝑹tPoisson(𝒗(𝒔t,𝜽t)),formulae-sequencesubscript𝒔𝑡1subscript𝒔𝑡𝑵subscript𝑹𝒕withsimilar-tosubscript𝑹𝑡Poisson𝒗subscript𝒔𝑡subscript𝜽𝑡\boldsymbol{s}_{t+1}=\boldsymbol{s}_{t}+\bm{N}\cdot\bm{R_{t}}\quad\mbox{with}% \quad\bm{R}_{t}\sim\mbox{Poisson}(\bm{v}(\boldsymbol{s}_{t},\bm{\theta}_{t})),bold_italic_s start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT = bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + bold_italic_N ⋅ bold_italic_R start_POSTSUBSCRIPT bold_italic_t end_POSTSUBSCRIPT with bold_italic_R start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∼ Poisson ( bold_italic_v ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) , (1)

where 𝑵𝑹𝒕𝑵subscript𝑹𝒕\bm{N}\cdot\bm{R_{t}}bold_italic_N ⋅ bold_italic_R start_POSTSUBSCRIPT bold_italic_t end_POSTSUBSCRIPT represents the net amount of reaction outputs during time interval (t,t+Δt]𝑡𝑡Δ𝑡(t,t+\Delta t]( italic_t , italic_t + roman_Δ italic_t ].

Michaelis–Menten (MM) kinetics is commonly used to model the regulation mechanisms of enzymatic reaction networks and the flux rates 𝒗(𝒔t,𝜽t)𝒗subscript𝒔𝑡subscript𝜽𝑡\bm{v}(\boldsymbol{s}_{t},\bm{\theta}_{t})bold_italic_v ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) depend on the state (Michaelis and Menten, 2007). In an enzymatic molecular reaction as shown in the first equation in (2), the substrate (S) initially forms a reversible complex (ES) with the enzyme (E), i.e., the enzyme and substrate have to interact for the enzyme to be able to perform its catalytic function to produce the product (P), with KFsubscript𝐾𝐹K_{F}italic_K start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT, KRsubscript𝐾𝑅K_{R}italic_K start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT, and Kcatsubscript𝐾𝑐𝑎𝑡K_{cat}italic_K start_POSTSUBSCRIPT italic_c italic_a italic_t end_POSTSUBSCRIPT representing kinetic rates. For MM kinetics as shown in the second equation in (2), we assume the enzyme is either present as the free enzyme or as the ES complex, i.e., [E]total=[E]+[ES]subscriptdelimited-[]𝐸𝑡𝑜𝑡𝑎𝑙delimited-[]𝐸delimited-[]𝐸𝑆[E]_{total}=[E]+[ES][ italic_E ] start_POSTSUBSCRIPT italic_t italic_o italic_t italic_a italic_l end_POSTSUBSCRIPT = [ italic_E ] + [ italic_E italic_S ] with [E]delimited-[]𝐸[E][ italic_E ] denoting the concentration of the enzyme. Suppose the rate of formation of the ES complex is equal to the rate of dissociation plus the breakdown, i.e., KF[E][S]=[ES](KR+Kcat)subscript𝐾𝐹delimited-[]𝐸delimited-[]𝑆delimited-[]𝐸𝑆subscript𝐾𝑅subscript𝐾𝑐𝑎𝑡K_{F}[E][S]=[ES](K_{R}+K_{cat})italic_K start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT [ italic_E ] [ italic_S ] = [ italic_E italic_S ] ( italic_K start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT + italic_K start_POSTSUBSCRIPT italic_c italic_a italic_t end_POSTSUBSCRIPT ). Thus, the parameters in MM kinetics characterize the regulation mechanisms of enzymatic reaction network: (1) Vmaxj=Kcat[E]totalsuperscriptsubscript𝑉𝑚𝑎𝑥𝑗subscript𝐾𝑐𝑎𝑡subscriptdelimited-[]𝐸𝑡𝑜𝑡𝑎𝑙V_{max}^{j}=K_{cat}[E]_{total}italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT = italic_K start_POSTSUBSCRIPT italic_c italic_a italic_t end_POSTSUBSCRIPT [ italic_E ] start_POSTSUBSCRIPT italic_t italic_o italic_t italic_a italic_l end_POSTSUBSCRIPT, is the maximum possible velocity of the j𝑗jitalic_j-th molecular reaction that can occur when all the enzyme molecules are bound with the substrate, i.e.,[E]total=[ES]subscriptdelimited-[]𝐸𝑡𝑜𝑡𝑎𝑙delimited-[]𝐸𝑆[E]_{total}=[ES][ italic_E ] start_POSTSUBSCRIPT italic_t italic_o italic_t italic_a italic_l end_POSTSUBSCRIPT = [ italic_E italic_S ]; and (2) Kmj=KR+KcatKF=[E][S][ES]superscriptsubscript𝐾𝑚𝑗subscript𝐾𝑅subscript𝐾𝑐𝑎𝑡subscript𝐾𝐹delimited-[]𝐸delimited-[]𝑆delimited-[]𝐸𝑆K_{m}^{j}=\frac{K_{R}+K_{cat}}{K_{F}}=\frac{[E][S]}{[ES]}italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT = divide start_ARG italic_K start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT + italic_K start_POSTSUBSCRIPT italic_c italic_a italic_t end_POSTSUBSCRIPT end_ARG start_ARG italic_K start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT end_ARG = divide start_ARG [ italic_E ] [ italic_S ] end_ARG start_ARG [ italic_E italic_S ] end_ARG is a dissociation constant for the ES complex,

E+SKRKFESKcatE+P(product)and𝒗j(𝒔t,𝜽t)=VmaxjstjKmj+stj.𝐸𝑆subscript𝐾𝑅subscript𝐾𝐹𝐸𝑆subscript𝐾𝑐𝑎𝑡𝐸𝑃(product)andsubscript𝒗𝑗subscript𝒔𝑡subscript𝜽𝑡superscriptsubscript𝑉𝑚𝑎𝑥𝑗superscriptsubscript𝑠𝑡𝑗superscriptsubscript𝐾𝑚𝑗superscriptsubscript𝑠𝑡𝑗E+S\underset{K_{R}}{\overset{K_{F}}{\rightleftarrows}}ES\overset{K_{cat}}{% \rightarrow}E+P\text{(product)}~{}~{}~{}\mbox{and}~{}~{}~{}\bm{v}_{j}(% \boldsymbol{s}_{t},\bm{\theta}_{t})=\frac{V_{max}^{j}s_{t}^{j}}{K_{m}^{j}+s_{t% }^{j}}.italic_E + italic_S start_UNDERACCENT italic_K start_POSTSUBSCRIPT italic_R end_POSTSUBSCRIPT end_UNDERACCENT start_ARG start_OVERACCENT italic_K start_POSTSUBSCRIPT italic_F end_POSTSUBSCRIPT end_OVERACCENT start_ARG ⇄ end_ARG end_ARG italic_E italic_S start_OVERACCENT italic_K start_POSTSUBSCRIPT italic_c italic_a italic_t end_POSTSUBSCRIPT end_OVERACCENT start_ARG → end_ARG italic_E + italic_P (product) and bold_italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) = divide start_ARG italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG start_ARG italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT + italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG . (2)

By applying diffusion approximation, the state transition model becomes,

p(𝒔t+1|𝒔t,𝜽t){𝒩(𝒔t+𝑵𝒗(𝒔t,𝜽t)Δt,𝑵diag(𝒗(𝒔t,𝜽t))𝑵Δt),𝒔t+1>𝟎,0,𝒔t+1𝟎.similar-to𝑝conditionalsubscript𝒔𝑡1subscript𝒔𝑡subscript𝜽𝑡cases𝒩subscript𝒔𝑡𝑵𝒗subscript𝒔𝑡subscript𝜽𝑡Δ𝑡𝑵𝑑𝑖𝑎𝑔𝒗subscript𝒔𝑡subscript𝜽𝑡superscript𝑵topΔ𝑡subscript𝒔𝑡100subscript𝒔𝑡10p(\boldsymbol{s}_{t+1}|\boldsymbol{s}_{t},\bm{\theta}_{t})\sim\left\{\begin{% array}[]{l}\mathcal{N}(\boldsymbol{s}_{t}+\bm{N}\boldsymbol{v}(\boldsymbol{s}_% {t},\bm{\theta}_{t})\Delta t,\bm{N}diag(\boldsymbol{v}(\boldsymbol{s}_{t},\bm{% \theta}_{t}))\bm{N}^{\top}\Delta t),\boldsymbol{s}_{t+1}>\bm{0},\\ 0,\boldsymbol{s}_{t+1}\leq\bm{0}.\end{array}\right.italic_p ( bold_italic_s start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ∼ { start_ARRAY start_ROW start_CELL caligraphic_N ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT + bold_italic_N bold_italic_v ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) roman_Δ italic_t , bold_italic_N italic_d italic_i italic_a italic_g ( bold_italic_v ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) bold_italic_N start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_Δ italic_t ) , bold_italic_s start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT > bold_0 , end_CELL end_ROW start_ROW start_CELL 0 , bold_italic_s start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ≤ bold_0 . end_CELL end_ROW end_ARRAY (3)

Thus, the multi-scale bioprocess mechanistic model is further represented as a system of Stochastic Differential Equations (SDEs), intricately reliant on parameters 𝜽tsubscript𝜽𝑡\bm{\theta}_{t}bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and current states 𝒔tsubscript𝒔𝑡\boldsymbol{s}_{t}bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. The diffusion term is necessary due to the inherent stochasticity of molecular reactions in the bioprocess:

(d𝒔t,d𝜽t)superscript𝑑subscript𝒔𝑡𝑑subscript𝜽𝑡top\displaystyle(d\boldsymbol{s}_{t},d\bm{\theta}_{t})^{\top}( italic_d bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_d bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT =\displaystyle== (μ(𝒔t,𝜽t),f(𝒔t,𝜽t))dt+(σ(𝒔t,𝜽t),g(𝒔t,𝜽t))dWt.superscript𝜇subscript𝒔𝑡subscript𝜽𝑡𝑓subscript𝒔𝑡subscript𝜽𝑡top𝑑𝑡superscript𝜎subscript𝒔𝑡subscript𝜽𝑡𝑔subscript𝒔𝑡subscript𝜽𝑡top𝑑subscript𝑊𝑡\displaystyle(\mu(\boldsymbol{s}_{t},\bm{\theta}_{t}),f(\boldsymbol{s}_{t},\bm% {\theta}_{t}))^{\top}dt+(\sigma(\boldsymbol{s}_{t},\bm{\theta}_{t}),g(% \boldsymbol{s}_{t},\bm{\theta}_{t}))^{\top}dW_{t}.( italic_μ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , italic_f ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_d italic_t + ( italic_σ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , italic_g ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT . (4)

where μ𝜇\muitalic_μ, f𝑓fitalic_f are the drift terms and σ𝜎\sigmaitalic_σ, g𝑔gitalic_g are the diffusion terms defined as:

(μ(𝒔t,𝜽t),f(𝒔t,𝜽t))superscript𝜇subscript𝒔𝑡subscript𝜽𝑡𝑓subscript𝒔𝑡subscript𝜽𝑡top\displaystyle(\mu(\boldsymbol{s}_{t},\bm{\theta}_{t}),f(\boldsymbol{s}_{t},\bm% {\theta}_{t}))^{\top}( italic_μ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , italic_f ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT =\displaystyle== E[(𝒔t+Δt,𝜽t+Δt)(𝒔t,𝜽t)|𝒔t,𝜽t],Edelimited-[]superscriptsubscript𝒔𝑡Δ𝑡subscript𝜽𝑡Δ𝑡topconditionalsuperscriptsubscript𝒔𝑡subscript𝜽𝑡topsubscript𝒔𝑡subscript𝜽𝑡\displaystyle\mbox{E}[(\boldsymbol{s}_{t+\Delta t},\bm{\theta}_{t+\Delta t})^{% \top}-(\boldsymbol{s}_{t},\bm{\theta}_{t})^{\top}|\boldsymbol{s}_{t},\bm{% \theta}_{t}],E [ ( bold_italic_s start_POSTSUBSCRIPT italic_t + roman_Δ italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t + roman_Δ italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT - ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT | bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ] ,
(σ2(𝒔t,𝜽t),g2(𝒔t,𝜽t))superscriptsuperscript𝜎2subscript𝒔𝑡subscript𝜽𝑡superscript𝑔2subscript𝒔𝑡subscript𝜽𝑡top\displaystyle(\sigma^{2}(\boldsymbol{s}_{t},\bm{\theta}_{t}),g^{2}(\boldsymbol% {s}_{t},\bm{\theta}_{t}))^{\top}( italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , italic_g start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT =\displaystyle== Var[(𝒔t+Δt,𝜽t+Δt)|𝒔t,𝜽t].Vardelimited-[]conditionalsuperscriptsubscript𝒔𝑡Δ𝑡subscript𝜽𝑡Δ𝑡topsubscript𝒔𝑡subscript𝜽𝑡\displaystyle\mbox{Var}[(\boldsymbol{s}_{t+\Delta t},\bm{\theta}_{t+\Delta t})% ^{\top}|\boldsymbol{s}_{t},\bm{\theta}_{t}].Var [ ( bold_italic_s start_POSTSUBSCRIPT italic_t + roman_Δ italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t + roman_Δ italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT | bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ] .

The specific form of infinitesimal mean and variance on the right side will be derived in Section 3. This SDE system is the foundational groundwork for the subsequent local sensitivity analysis we will conduct. In this paper, 𝜽tsubscript𝜽𝑡\bm{\theta}_{t}bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT remains constant, resulting in both f𝑓fitalic_f and g𝑔gitalic_g becoming 0. However, given that we model the dynamics of 𝜽tsubscript𝜽𝑡\bm{\theta}_{t}bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT in SDE form as shown in Equation (4), our approach can be readily extended to accommodate the scenarios where 𝜽tsubscript𝜽𝑡\bm{\theta}_{t}bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT varies over time.

Given any feasible policy with action, i.e., 𝒂t=π(𝒔t)subscript𝒂𝑡𝜋subscript𝒔𝑡\boldsymbol{a}_{t}=\pi(\boldsymbol{s}_{t})bold_italic_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_π ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) at any decision time t𝑡titalic_t, we consider the state transition model characterizing the process mechanistic dynamics and inherent stochasticity, i.e.,

𝒔t+1p(𝒔t+1|𝒔t,𝒂t;𝜽).similar-tosubscript𝒔𝑡1𝑝conditionalsubscript𝒔𝑡1subscript𝒔𝑡subscript𝒂𝑡𝜽\boldsymbol{s}_{t+1}\sim p(\boldsymbol{s}_{t+1}|\boldsymbol{s}_{t},\boldsymbol% {a}_{t};\boldsymbol{\theta}).bold_italic_s start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ∼ italic_p ( bold_italic_s start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ; bold_italic_θ ) .

For the biomanufacturing process, suppose the impact of decision 𝒂tsubscript𝒂𝑡\boldsymbol{a}_{t}bold_italic_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT (e.g., feeding strategy) on the state is known and happens immediately; that means we get the post-decision state denoted by 𝒔t=𝒇(𝒔t,𝒂t)superscriptsubscript𝒔𝑡𝒇subscript𝒔𝑡subscript𝒂𝑡\boldsymbol{s}_{t}^{\prime}=\boldsymbol{f}(\boldsymbol{s}_{t},\boldsymbol{a}_{% t})bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = bold_italic_f ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) with a known function 𝒇𝒇\boldsymbol{f}bold_italic_f. For notation simplification, we ignore the impact of action. The proposed sensitivity analysis over inputs (i.e., states and model parameters) is extendable to account for the policy effect.

2.2 Parameter selection and calibration

Local sensitivity analysis studies the changes in the model prediction outputs with respect to initial input values, i.e., (𝒔0,𝜽0)subscript𝒔0subscript𝜽0(\boldsymbol{s}_{0},\bm{\theta}_{0})( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ). The variations around this local point are quantified by the sensitivity coefficients (Zi, 2011) In this way, we could generate the forward flow Φ0,T(𝒔,𝜽)subscriptΦ0𝑇𝒔𝜽\Phi_{0,T}(\boldsymbol{s},\bm{\theta})roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s , bold_italic_θ ) and derive both Φ0,T(𝒔,𝜽)𝒔0subscriptΦ0𝑇𝒔𝜽subscript𝒔0\frac{\partial\Phi_{0,T}(\boldsymbol{s},\bm{\theta})}{\partial\boldsymbol{s}_{% 0}}divide start_ARG ∂ roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s , bold_italic_θ ) end_ARG start_ARG ∂ bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG and Φ0,T(𝒔,𝜽)𝜽0subscriptΦ0𝑇𝒔𝜽subscript𝜽0\frac{\partial\Phi_{0,T}(\boldsymbol{s},\bm{\theta})}{\partial\bm{\theta}_{0}}divide start_ARG ∂ roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s , bold_italic_θ ) end_ARG start_ARG ∂ bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG, then (Φ0,T(𝒔,𝜽))ssubscriptΦ0𝑇𝒔𝜽𝑠\frac{\partial\mathcal{L}(\Phi_{0,T}(\boldsymbol{s},\bm{\theta}))}{\partial s}divide start_ARG ∂ caligraphic_L ( roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s , bold_italic_θ ) ) end_ARG start_ARG ∂ italic_s end_ARG and (Φ0,T(𝒔,𝜽))𝜽subscriptΦ0𝑇𝒔𝜽𝜽\frac{\partial\mathcal{L}(\Phi_{0,T}(\boldsymbol{s},\bm{\theta}))}{\partial\bm% {\theta}}divide start_ARG ∂ caligraphic_L ( roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s , bold_italic_θ ) ) end_ARG start_ARG ∂ bold_italic_θ end_ARG. that are intimately connected to the solution of the SDE system (4). In Section 3, we will derive μ(𝒔t,𝜽t)𝜇subscript𝒔𝑡subscript𝜽𝑡\mu(\boldsymbol{s}_{t},\bm{\theta}_{t})italic_μ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) and σ(𝒔t,𝜽t)𝜎subscript𝒔𝑡subscript𝜽𝑡\sigma(\boldsymbol{s}_{t},\bm{\theta}_{t})italic_σ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ), showing that both functions are characterized by infinite differentiability, with their first-order derivatives being bounded, i.e., μ,σCb,1𝜇𝜎superscriptsubscript𝐶𝑏1\mu,\sigma\in C_{b}^{\infty,1}italic_μ , italic_σ ∈ italic_C start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ , 1 end_POSTSUPERSCRIPT. Consequently, once the initial values are given, a unique solution to the system (4) is guaranteed to exist. We use Φ0,t2(𝒔0,𝜽0)subscriptΦ0subscript𝑡2subscript𝒔0subscript𝜽0\Phi_{0,t_{2}}(\boldsymbol{s}_{0},\bm{\theta}_{0})roman_Φ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) to represent the solution of the SDEs in (4) at time t2subscript𝑡2t_{2}italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and it is called forward flow satisfying the property:

Φ0,t2(𝒔0,𝜽0)=Φ0,t(Φt,t2(𝒔0,𝜽0)) for 0tt2.subscriptΦ0subscript𝑡2subscript𝒔0subscript𝜽0subscriptΦ0𝑡subscriptΦ𝑡subscript𝑡2subscript𝒔0subscript𝜽0 for 0𝑡subscript𝑡2\Phi_{0,t_{2}}(\boldsymbol{s}_{0},\bm{\theta}_{0})=\Phi_{0,t}(\Phi_{t,t_{2}}(% \boldsymbol{s}_{0},\bm{\theta}_{0}))~{}~{}\mbox{ for }~{}~{}0\leq t\leq t_{2}.roman_Φ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = roman_Φ start_POSTSUBSCRIPT 0 , italic_t end_POSTSUBSCRIPT ( roman_Φ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) for 0 ≤ italic_t ≤ italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT .

To simulate Φ0,t2(𝒔0,𝜽0)subscriptΦ0subscript𝑡2subscript𝒔0subscript𝜽0\Phi_{0,t_{2}}(\boldsymbol{s}_{0},\bm{\theta}_{0})roman_Φ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ), we consider its calculus form based on Equation (4):

Φ0,t2(𝒔0,𝜽0)subscriptΦ0subscript𝑡2subscript𝒔0subscript𝜽0\displaystyle\Phi_{0,t_{2}}(\boldsymbol{s}_{0},\bm{\theta}_{0})roman_Φ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) =\displaystyle== (𝒔0,𝜽0)+0t2(μ(Φ0,t(𝒔0,𝜽0)),f(Φ0,t(𝒔0,𝜽0)))𝑑tsuperscriptsubscript𝒔0subscript𝜽0topsuperscriptsubscript0subscript𝑡2superscript𝜇subscriptΦ0𝑡subscript𝒔0subscript𝜽0𝑓subscriptΦ0𝑡subscript𝒔0subscript𝜽0topdifferential-d𝑡\displaystyle(\boldsymbol{s}_{0},\bm{\theta}_{0})^{\top}+\int_{0}^{t_{2}}(\mu(% \Phi_{0,t}(\boldsymbol{s}_{0},\bm{\theta}_{0})),f(\Phi_{0,t}(\boldsymbol{s}_{0% },\bm{\theta}_{0})))^{\top}dt( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_μ ( roman_Φ start_POSTSUBSCRIPT 0 , italic_t end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) , italic_f ( roman_Φ start_POSTSUBSCRIPT 0 , italic_t end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_d italic_t (5)
+0t2(σ(Φ0,t(𝒔0,𝜽0)),g(Φ0,t(𝒔0,𝜽0)))𝑑Wt,superscriptsubscript0subscript𝑡2superscript𝜎subscriptΦ0𝑡subscript𝒔0subscript𝜽0𝑔subscriptΦ0𝑡subscript𝒔0subscript𝜽0topdifferential-dsubscript𝑊𝑡\displaystyle+\int_{0}^{t_{2}}(\sigma(\Phi_{0,t}(\boldsymbol{s}_{0},\bm{\theta% }_{0})),g(\Phi_{0,t}(\boldsymbol{s}_{0},\bm{\theta}_{0})))^{\top}\circ dW_{t},+ ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_σ ( roman_Φ start_POSTSUBSCRIPT 0 , italic_t end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) , italic_g ( roman_Φ start_POSTSUBSCRIPT 0 , italic_t end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ∘ italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ,

where dWtabsent𝑑subscript𝑊𝑡\circ dW_{t}∘ italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT represents the Stratonovich stochastic integral. For a continuous semimartingale {ft}t<Tsubscriptsubscript𝑓𝑡𝑡𝑇\{f_{t}\}_{t<T}{ italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_t < italic_T end_POSTSUBSCRIPT adapted to the forward filtration {0,t}t<Tsubscriptsubscript0𝑡𝑡𝑇\{\mathcal{F}_{0,t}\}_{t<T}{ caligraphic_F start_POSTSUBSCRIPT 0 , italic_t end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_t < italic_T end_POSTSUBSCRIPT, the Stratonovich stochastic integral is:

0Tft𝑑Wt=lim|Π|0i=1N(fti1+fti)2(WtiWti1),superscriptsubscript0𝑇subscript𝑓𝑡differential-dsubscript𝑊𝑡subscriptΠ0superscriptsubscript𝑖1𝑁subscript𝑓subscript𝑡𝑖1subscript𝑓subscript𝑡𝑖2subscript𝑊subscript𝑡𝑖subscript𝑊subscript𝑡𝑖1\int_{0}^{T}f_{t}\circ dW_{t}=\lim_{|\Pi|\to 0}\sum\limits_{i=1}^{N}\frac{(f_{% t_{i-1}}+f_{t_{i}})}{2}(W_{t_{i}}-W_{t_{i-1}}),∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ∘ italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = roman_lim start_POSTSUBSCRIPT | roman_Π | → 0 end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT divide start_ARG ( italic_f start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + italic_f start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG start_ARG 2 end_ARG ( italic_W start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_W start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ,

where Π={0=t0<0<<tN=T}Π0subscript𝑡00subscript𝑡𝑁𝑇\Pi=\{0=t_{0}<0<\ldots<t_{N}=T\}roman_Π = { 0 = italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < 0 < … < italic_t start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT = italic_T } is a partition of time interval [0,T]0𝑇[0,T][ 0 , italic_T ] and |Π|=maxn(tntn1)Πsubscript𝑛subscript𝑡𝑛subscript𝑡𝑛1|\Pi|=\max\limits_{n}(t_{n}-t_{n-1})| roman_Π | = roman_max start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_t start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT - italic_t start_POSTSUBSCRIPT italic_n - 1 end_POSTSUBSCRIPT ). The reason to introduce Stratonovich stochastic integral is that we could generate the inverse flow ψ0,t2=Φ0,t21subscript𝜓0subscript𝑡2superscriptsubscriptΦ0subscript𝑡21\psi_{0,t_{2}}=\Phi_{0,t_{2}}^{-1}italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = roman_Φ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT from SDE system (4) based on Equation (5) Kunita (2019):

ψ0,t2(𝒔t2,𝜽t2)subscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2\displaystyle\psi_{0,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) =\displaystyle== (𝒔t2,𝜽t2)0t2(μ(ψt,t2(𝒔t2,𝜽t2)),f(ψt,t2(𝒔t2,𝜽t2)))𝑑tsuperscriptsubscript𝒔subscript𝑡2subscript𝜽subscript𝑡2topsuperscriptsubscript0subscript𝑡2superscript𝜇subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2𝑓subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2topdifferential-d𝑡\displaystyle(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})^{\top}-\int_{0}^{t_{% 2}}(\mu(\psi_{t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})),f(\psi_{t,% t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})))^{\top}dt( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT - ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_μ ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) , italic_f ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_d italic_t (6)
0t2(σ(ψt,t2(𝒔t2,𝜽t2)),g(ψt,t2(𝒔t2,𝜽t2)))𝑑W~t,superscriptsubscript0subscript𝑡2superscript𝜎subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2𝑔subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2topdifferential-dsubscript~𝑊𝑡\displaystyle-\int_{0}^{t_{2}}(\sigma(\psi_{t,t_{2}}(\boldsymbol{s}_{t_{2}},% \bm{\theta}_{t_{2}})),g(\psi_{t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{% 2}})))^{\top}\circ d\widetilde{W}_{t},- ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_σ ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) , italic_g ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ∘ italic_d over~ start_ARG italic_W end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ,

where W~tsubscript~𝑊𝑡\widetilde{W}_{t}over~ start_ARG italic_W end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the backward Wiener process defined as W~t=WtWTsubscript~𝑊𝑡subscript𝑊𝑡subscript𝑊𝑇\widetilde{W}_{t}=W_{t}-W_{T}over~ start_ARG italic_W end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_W start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT for any t<T𝑡𝑇t<Titalic_t < italic_T. It is adapted to the backward filtration {t,T}t<Tsubscriptsubscript𝑡𝑇𝑡𝑇\{\mathcal{F}_{t,T}\}_{t<T}{ caligraphic_F start_POSTSUBSCRIPT italic_t , italic_T end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_t < italic_T end_POSTSUBSCRIPT. The difference between Equations (5) and (6) is only the negative sign, and such symmetry is attributed to the use of Stratonovich stochastic integral.

We could further define a scalar loss function \mathcal{L}caligraphic_L of Φ0,T(𝒔0,𝜽0)subscriptΦ0𝑇subscript𝒔0subscript𝜽0\Phi_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0})roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ). Then, the loss for system (4) becomes (Φ0,T(𝒔0,𝜽0))subscriptΦ0𝑇subscript𝒔0subscript𝜽0\mathcal{L}(\Phi_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0}))caligraphic_L ( roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) and the sensitivity coefficient for 𝒔0subscript𝒔0\boldsymbol{s}_{0}bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and 𝜽0subscript𝜽0\bm{\theta}_{0}bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT becomes,

A0,T(𝒔0,𝜽0)((Φ0,T(𝒔0,𝜽0))𝒔0,(Φ0,T(𝒔0,𝜽0))𝜽0).subscript𝐴0𝑇subscript𝒔0subscript𝜽0superscriptsubscriptΦ0𝑇subscript𝒔0subscript𝜽0subscript𝒔0subscriptΦ0𝑇subscript𝒔0subscript𝜽0subscript𝜽0topA_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0})\equiv\left(\frac{\partial\mathcal{% L}(\Phi_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0}))}{\partial\boldsymbol{s}_{0}% },\frac{\partial\mathcal{L}(\Phi_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0}))}{% \partial\bm{\theta}_{0}}\right)^{\top}.italic_A start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ≡ ( divide start_ARG ∂ caligraphic_L ( roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) end_ARG start_ARG ∂ bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG , divide start_ARG ∂ caligraphic_L ( roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) end_ARG start_ARG ∂ bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT . (7)

Then, based on the chain rule:

A0,T(𝒔0,𝜽0)=Φ(Φ0,T(𝒔0,𝜽0))Φ0,T(𝒔0,𝜽0).subscript𝐴0𝑇subscript𝒔0subscript𝜽0subscriptΦsubscriptΦ0𝑇subscript𝒔0subscript𝜽0subscriptΦ0𝑇subscript𝒔0subscript𝜽0\displaystyle A_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0})=\triangledown_{\Phi}% \mathcal{L}(\Phi_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0}))\triangledown\Phi_{% 0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0}).italic_A start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = ▽ start_POSTSUBSCRIPT roman_Φ end_POSTSUBSCRIPT caligraphic_L ( roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) ▽ roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) . (8)

For Φ0,T(𝒔0,𝜽0)subscriptΦ0𝑇subscript𝒔0subscript𝜽0\triangledown\Phi_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0})▽ roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) in (7), we first derive (𝒔T𝒔0,𝜽T𝜽0)superscriptsubscript𝒔𝑇subscript𝒔0subscript𝜽𝑇subscript𝜽0top\left(\frac{\partial\boldsymbol{s}_{T}}{\partial\boldsymbol{s}_{0}},\frac{% \partial\bm{\theta}_{T}}{\partial\bm{\theta}_{0}}\right)^{\top}( divide start_ARG ∂ bold_italic_s start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT end_ARG start_ARG ∂ bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG , divide start_ARG ∂ bold_italic_θ start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT end_ARG start_ARG ∂ bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT under the assumption that (𝒔T,𝜽T)superscriptsubscript𝒔𝑇subscript𝜽𝑇top(\boldsymbol{s}_{T},\bm{\theta}_{T})^{\top}( bold_italic_s start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT is deterministic, i.e., not dependent on the Wiener process Wtsubscript𝑊𝑡W_{t}italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. We then extend to the case that (𝒔T,𝜽T)superscriptsubscript𝒔𝑇subscript𝜽𝑇top(\boldsymbol{s}_{T},\bm{\theta}_{T})^{\top}( bold_italic_s start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT is stochastically obtained from forward flow Φ0,T(𝒔0,𝜽0)subscriptΦ0𝑇subscript𝒔0subscript𝜽0\Phi_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0})roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) from SDE system (4) and derive (Φ0,T(𝒔0,𝜽0)𝒔0,Φ0,T(𝒔0,𝜽0)𝜽0)superscriptsubscriptΦ0𝑇subscript𝒔0subscript𝜽0subscript𝒔0subscriptΦ0𝑇subscript𝒔0subscript𝜽0subscript𝜽0top\left(\frac{\partial\Phi_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0})}{\partial% \boldsymbol{s}_{0}},\frac{\partial\Phi_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0% })}{\partial\bm{\theta}_{0}}\right)^{\top}( divide start_ARG ∂ roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_ARG start_ARG ∂ bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG , divide start_ARG ∂ roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_ARG start_ARG ∂ bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, which implements the structural information in the stochastic reaction network through a dual process of forward and backward propagation.

Then, for Φ(Φ0,T(𝒔0,𝜽0))subscriptΦsubscriptΦ0𝑇subscript𝒔0subscript𝜽0\triangledown_{\Phi}\mathcal{L}(\Phi_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0}))▽ start_POSTSUBSCRIPT roman_Φ end_POSTSUBSCRIPT caligraphic_L ( roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) in (8), it is determined by the loss function \mathcal{L}caligraphic_L. Denote the observed bioprocess states as 𝒔csuperscript𝒔𝑐\boldsymbol{s}^{c}bold_italic_s start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT, the underlying true parameters of the real reaction network as 𝜽𝒄superscript𝜽𝒄\bm{\theta^{c}}bold_italic_θ start_POSTSUPERSCRIPT bold_italic_c end_POSTSUPERSCRIPT and the output as O(𝒔c)𝑂superscript𝒔𝑐O(\boldsymbol{s}^{c})italic_O ( bold_italic_s start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ) which is a scalar function of states. Here the superscript “c" indicates observations and parameters from the real system. Then the simulated output is denoted by O(𝒔)𝑂𝒔O(\boldsymbol{s})italic_O ( bold_italic_s ), where 𝒔𝒔\boldsymbol{s}bold_italic_s is the simulated state with initial state 𝒔0=𝒔0csubscript𝒔0subscriptsuperscript𝒔𝑐0\boldsymbol{s}_{0}=\boldsymbol{s}^{c}_{0}bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = bold_italic_s start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and 𝜽0subscript𝜽0\bm{\theta}_{0}bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT as the input of simulation model. \mathcal{L}caligraphic_L is a scalar function measuring the difference of observed and simulated outputs, such as mean squared error (MSE) ,i.e.,E[(O(𝒔c)O(𝒔))2|𝒔0,𝜽0],i.e.,\mbox{E}[(O(\boldsymbol{s}^{c})-O(\boldsymbol{s}))^{2}|\boldsymbol{s}_{0% },\bm{\theta}_{0}], italic_i . italic_e . , E [ ( italic_O ( bold_italic_s start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ) - italic_O ( bold_italic_s ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ]. Then we could generate Φ(Φ0,T(𝒔0))subscriptΦsubscriptΦ0𝑇subscript𝒔0\triangledown_{\Phi}\mathcal{L}(\Phi_{0,T}(\boldsymbol{s}_{0}))▽ start_POSTSUBSCRIPT roman_Φ end_POSTSUBSCRIPT caligraphic_L ( roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ). Combining the above results, we can derive the sensitivities A0,T(𝒔0,𝜽0)subscript𝐴0𝑇subscript𝒔0subscript𝜽0A_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0})italic_A start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ), which can identify the key contributor to the model prediction MSE.

3 Metamodeling and Prediction

In this section, we will derive the expression of μ𝜇\muitalic_μ and σ𝜎\sigmaitalic_σ, we start with two-step transition from initial states,

E(𝒔2|𝒔0,𝜽0)Econditionalsubscript𝒔2subscript𝒔0subscript𝜽0\displaystyle\mbox{E}(\boldsymbol{s}_{2}|\boldsymbol{s}_{0},\bm{\theta}_{0})E ( bold_italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) =\displaystyle== p(𝒔1,𝜽1|𝒔0,𝜽0)E(𝒔2|𝒔1,𝜽1)d(𝒔1,𝜽1)=p(𝒔1,𝜽1|𝒔0,𝜽0)[𝒔1+𝑵𝒗(𝒔1,𝜽1)Δt]d(𝒔1,𝜽1)𝑝subscript𝒔1conditionalsubscript𝜽1subscript𝒔0subscript𝜽0Econditionalsubscript𝒔2subscript𝒔1subscript𝜽1𝑑subscript𝒔1subscript𝜽1𝑝subscript𝒔1conditionalsubscript𝜽1subscript𝒔0subscript𝜽0delimited-[]subscript𝒔1𝑵𝒗subscript𝒔1subscript𝜽1Δ𝑡𝑑subscript𝒔1subscript𝜽1\displaystyle\int p(\boldsymbol{s}_{1},\bm{\theta}_{1}|\boldsymbol{s}_{0},\bm{% \theta}_{0})\mbox{E}(\boldsymbol{s}_{2}|\boldsymbol{s}_{1},\bm{\theta}_{1})d(% \boldsymbol{s}_{1},\bm{\theta}_{1})=\int p(\boldsymbol{s}_{1},\bm{\theta}_{1}|% \boldsymbol{s}_{0},\bm{\theta}_{0})[\boldsymbol{s}_{1}+\bm{N}\bm{v}(% \boldsymbol{s}_{1},\bm{\theta}_{1})\Delta t]d(\boldsymbol{s}_{1},\bm{\theta}_{% 1})∫ italic_p ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) E ( bold_italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_d ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) = ∫ italic_p ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) [ bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + bold_italic_N bold_italic_v ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) roman_Δ italic_t ] italic_d ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) (9)
=\displaystyle== E(𝒔1|𝒔0,𝜽0)+p(𝒔1,𝜽1|𝒔0,𝜽0)𝑵𝒗(𝒔1,𝜽1)Δtd(𝒔1,𝜽1).Econditionalsubscript𝒔1subscript𝒔0subscript𝜽0𝑝subscript𝒔1conditionalsubscript𝜽1subscript𝒔0subscript𝜽0𝑵𝒗subscript𝒔1subscript𝜽1Δ𝑡𝑑subscript𝒔1subscript𝜽1\displaystyle\mbox{E}(\boldsymbol{s}_{1}|\boldsymbol{s}_{0},\bm{\theta}_{0})+% \int p(\boldsymbol{s}_{1},\bm{\theta}_{1}|\boldsymbol{s}_{0},\bm{\theta}_{0})% \bm{N}\bm{v}(\boldsymbol{s}_{1},\bm{\theta}_{1})\Delta td(\boldsymbol{s}_{1},% \bm{\theta}_{1}).E ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) + ∫ italic_p ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) bold_italic_N bold_italic_v ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) roman_Δ italic_t italic_d ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) .

By combining Equations (9) and E(𝒔1|𝒔0,𝜽0)=𝒔0+𝑵𝒗(𝒔0,𝜽0)Δt.Econditionalsubscript𝒔1subscript𝒔0subscript𝜽0subscript𝒔0𝑵𝒗subscript𝒔0subscript𝜽0Δ𝑡\mbox{E}(\boldsymbol{s}_{1}|\boldsymbol{s}_{0},\bm{\theta}_{0})=\boldsymbol{s}% _{0}+\bm{N}\bm{v}(\boldsymbol{s}_{0},\bm{\theta}_{0})\Delta t.E ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + bold_italic_N bold_italic_v ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) roman_Δ italic_t ., we have

E(𝒔2|𝒔0,𝜽0)=𝒔0+𝑵𝒗(𝒔0,𝜽0)Δt+𝑵p(𝒔1,𝜽1|𝒔0,𝜽0)𝒗(𝒔1,𝜽1)Δtd(𝒔1,𝜽1).Econditionalsubscript𝒔2subscript𝒔0subscript𝜽0subscript𝒔0𝑵𝒗subscript𝒔0subscript𝜽0Δ𝑡𝑵𝑝subscript𝒔1conditionalsubscript𝜽1subscript𝒔0subscript𝜽0𝒗subscript𝒔1subscript𝜽1Δ𝑡𝑑subscript𝒔1subscript𝜽1\mbox{E}(\boldsymbol{s}_{2}|\boldsymbol{s}_{0},\bm{\theta}_{0})=\boldsymbol{s}% _{0}+\bm{N}\bm{v}(\boldsymbol{s}_{0},\bm{\theta}_{0})\Delta t+\bm{N}\int p(% \boldsymbol{s}_{1},\bm{\theta}_{1}|\boldsymbol{s}_{0},\bm{\theta}_{0})\bm{v}(% \boldsymbol{s}_{1},\bm{\theta}_{1})\Delta td(\boldsymbol{s}_{1},\bm{\theta}_{1% }).E ( bold_italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + bold_italic_N bold_italic_v ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) roman_Δ italic_t + bold_italic_N ∫ italic_p ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) bold_italic_v ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) roman_Δ italic_t italic_d ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) . (10)

We need to calculate p(𝒔1,𝜽1|𝒔0,𝜽0)𝒗(𝒔1,𝜽1)Δtd(𝒔1,𝜽1)𝑝subscript𝒔1conditionalsubscript𝜽1subscript𝒔0subscript𝜽0𝒗subscript𝒔1subscript𝜽1Δ𝑡𝑑subscript𝒔1subscript𝜽1\int p(\boldsymbol{s}_{1},\bm{\theta}_{1}|\boldsymbol{s}_{0},\bm{\theta}_{0})% \bm{v}(\boldsymbol{s}_{1},\bm{\theta}_{1})\Delta td(\boldsymbol{s}_{1},\bm{% \theta}_{1})∫ italic_p ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) bold_italic_v ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) roman_Δ italic_t italic_d ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) in Equation (10), i.e., the expectation of flux rate after one step transition. For each j𝑗jitalic_j-th reaction, its reaction rate vj(𝒔1,𝜽1)subscript𝑣𝑗subscript𝒔1subscript𝜽1{v_{j}}(\boldsymbol{s}_{1},\bm{\theta}_{1})italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) is related to s1jsuperscriptsubscript𝑠1𝑗s_{1}^{j}italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT, and by applying the MM kinetics in (2), we have

p(𝒔1,𝜽1|𝒔0,𝜽0)𝒗𝒋(𝒔1,𝜽1)Δtd(𝒔1,𝜽1)=p(s1j,𝜽1|𝒔0,𝜽0)Vmaxjs1jKmj+s1jΔtd(𝒔1,𝜽1)𝑝subscript𝒔1conditionalsubscript𝜽1subscript𝒔0subscript𝜽0subscript𝒗𝒋subscript𝒔1subscript𝜽1Δ𝑡𝑑subscript𝒔1subscript𝜽1𝑝superscriptsubscript𝑠1𝑗conditionalsubscript𝜽1subscript𝒔0subscript𝜽0superscriptsubscript𝑉𝑚𝑎𝑥𝑗superscriptsubscript𝑠1𝑗superscriptsubscript𝐾𝑚𝑗superscriptsubscript𝑠1𝑗Δ𝑡𝑑subscript𝒔1subscript𝜽1\displaystyle\int p(\boldsymbol{s}_{1},\bm{\theta}_{1}|\boldsymbol{s}_{0},\bm{% \theta}_{0})\bm{v_{j}}(\boldsymbol{s}_{1},\bm{\theta}_{1})\Delta td(% \boldsymbol{s}_{1},\bm{\theta}_{1})=\int p(s_{1}^{j},\bm{\theta}_{1}|% \boldsymbol{s}_{0},\bm{\theta}_{0})\frac{V_{max}^{j}s_{1}^{j}}{K_{m}^{j}+s_{1}% ^{j}}\Delta td(\boldsymbol{s}_{1},\bm{\theta}_{1})∫ italic_p ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) bold_italic_v start_POSTSUBSCRIPT bold_italic_j end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) roman_Δ italic_t italic_d ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) = ∫ italic_p ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) divide start_ARG italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG start_ARG italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT + italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG roman_Δ italic_t italic_d ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) (11)
=\displaystyle== p(s1j,𝜽1|𝒔0,𝜽0)(VmaxjVmaxjKmjKmj+s1j)Δtd(𝒔1,𝜽1)𝑝superscriptsubscript𝑠1𝑗conditionalsubscript𝜽1subscript𝒔0subscript𝜽0superscriptsubscript𝑉𝑚𝑎𝑥𝑗superscriptsubscript𝑉𝑚𝑎𝑥𝑗superscriptsubscript𝐾𝑚𝑗superscriptsubscript𝐾𝑚𝑗superscriptsubscript𝑠1𝑗Δ𝑡𝑑subscript𝒔1subscript𝜽1\displaystyle\int p(s_{1}^{j},\bm{\theta}_{1}|\boldsymbol{s}_{0},\bm{\theta}_{% 0})\left(V_{max}^{j}-\frac{V_{max}^{j}K_{m}^{j}}{K_{m}^{j}+s_{1}^{j}}\right)% \Delta td(\boldsymbol{s}_{1},\bm{\theta}_{1})∫ italic_p ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ( italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT - divide start_ARG italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG start_ARG italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT + italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG ) roman_Δ italic_t italic_d ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT )
=\displaystyle== VmaxjΔtVmaxjKmjΔtp(s1j,𝜽1|𝒔0,𝜽0)1Kmj+s1jd(𝒔1,𝜽1).superscriptsubscript𝑉𝑚𝑎𝑥𝑗Δ𝑡superscriptsubscript𝑉𝑚𝑎𝑥𝑗superscriptsubscript𝐾𝑚𝑗Δ𝑡𝑝superscriptsubscript𝑠1𝑗conditionalsubscript𝜽1subscript𝒔0subscript𝜽01superscriptsubscript𝐾𝑚𝑗superscriptsubscript𝑠1𝑗𝑑subscript𝒔1subscript𝜽1\displaystyle V_{max}^{j}\Delta t-V_{max}^{j}K_{m}^{j}\Delta t\int p(s_{1}^{j}% ,\bm{\theta}_{1}|\boldsymbol{s}_{0},\bm{\theta}_{0})\frac{1}{K_{m}^{j}+s_{1}^{% j}}d(\boldsymbol{s}_{1},\bm{\theta}_{1}).italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT roman_Δ italic_t - italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT roman_Δ italic_t ∫ italic_p ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) divide start_ARG 1 end_ARG start_ARG italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT + italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG italic_d ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) .

To compute the second term in Equation (11), we first consider the probability distribution function p(s1j,𝜽1|𝒔0,𝜽0)𝑝superscriptsubscript𝑠1𝑗conditionalsubscript𝜽1subscript𝒔0subscript𝜽0p(s_{1}^{j},\bm{\theta}_{1}|\boldsymbol{s}_{0},\bm{\theta}_{0})italic_p ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ). Since 𝜽tsubscript𝜽𝑡\bm{\theta}_{t}bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is constant over time, we will put our focus on p(s1j|𝒔0,𝜽0)𝑝conditionalsuperscriptsubscript𝑠1𝑗subscript𝒔0subscript𝜽0p(s_{1}^{j}|\boldsymbol{s}_{0},\bm{\theta}_{0})italic_p ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ). As shown in Section 2.1, we’ve already derived the state transition model in Equation (3). This model adopts a truncated normal distribution, which is motivated by the fact that the concentration of molecules, represented by s1jsuperscriptsubscript𝑠1𝑗s_{1}^{j}italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT, cannot be negative in reality. Furthermore, the introduction of the truncated normal distribution is due to the absence of an inverse moment for the standard normal distribution. To address this issue, we will first review Theorem 1 and then apply it to approximate the second term in Equation (11).

Theorem 1 (Hall (1979)).

Suppose μ>0𝜇0\mu>0italic_μ > 0 and X𝑋Xitalic_X is a random variable with density

ϕ(X)={k2πσe(Xμ)22σ2,Xa>0,0,X<a,italic-ϕ𝑋cases𝑘2𝜋𝜎superscript𝑒superscript𝑋𝜇22superscript𝜎2𝑋𝑎00𝑋𝑎\phi(X)=\left\{\begin{array}[]{l}\frac{k}{\sqrt{2\pi}\sigma}e^{\frac{-(X-\mu)^% {2}}{2\sigma^{2}}},X\geq a>0,\\ 0,X<a,\end{array}\right.italic_ϕ ( italic_X ) = { start_ARRAY start_ROW start_CELL divide start_ARG italic_k end_ARG start_ARG square-root start_ARG 2 italic_π end_ARG italic_σ end_ARG italic_e start_POSTSUPERSCRIPT divide start_ARG - ( italic_X - italic_μ ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_POSTSUPERSCRIPT , italic_X ≥ italic_a > 0 , end_CELL end_ROW start_ROW start_CELL 0 , italic_X < italic_a , end_CELL end_ROW end_ARRAY

where k𝑘kitalic_k is the normalization constant. Then for each σ^=σμ15^𝜎𝜎𝜇15\hat{\sigma}=\frac{\sigma}{\mu}\leq\frac{1}{5}over^ start_ARG italic_σ end_ARG = divide start_ARG italic_σ end_ARG start_ARG italic_μ end_ARG ≤ divide start_ARG 1 end_ARG start_ARG 5 end_ARG and any value of a𝑎aitalic_a satisfying the inequality σ^2aμ125superscript^𝜎2𝑎𝜇125\hat{\sigma}^{2}\leq\frac{a}{\mu}\leq\frac{1}{25}over^ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ divide start_ARG italic_a end_ARG start_ARG italic_μ end_ARG ≤ divide start_ARG 1 end_ARG start_ARG 25 end_ARG, we have: μE[X1]=I(σ^)+e1,|e1|<8000σ^12<3.3×105,formulae-sequence𝜇Edelimited-[]superscript𝑋1𝐼^𝜎subscript𝑒1subscript𝑒18000superscript^𝜎123.3superscript105\mu\mbox{E}[X^{-1}]=I(\hat{\sigma})+e_{1},\left|e_{1}\right|<8000\hat{\sigma}^% {12}<3.3\times 10^{-5},italic_μ E [ italic_X start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ] = italic_I ( over^ start_ARG italic_σ end_ARG ) + italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , | italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | < 8000 over^ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT 12 end_POSTSUPERSCRIPT < 3.3 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT , where I(σ^)𝐼^𝜎I(\hat{\sigma})italic_I ( over^ start_ARG italic_σ end_ARG ) is given in terms of Dawson’s Integral by the following equations:

I(σ^)=2σ^D(12σ^),D(x)=ex20xet2𝑑t.formulae-sequence𝐼^𝜎2^𝜎𝐷12^𝜎𝐷𝑥superscript𝑒superscript𝑥2superscriptsubscript0𝑥superscript𝑒superscript𝑡2differential-d𝑡\displaystyle I(\hat{\sigma})=\frac{\sqrt{2}}{\hat{\sigma}}D\left(\frac{1}{% \sqrt{2}\hat{\sigma}}\right),\quad D(x)=e^{-x^{2}}\int_{0}^{x}e^{t^{2}}dt.italic_I ( over^ start_ARG italic_σ end_ARG ) = divide start_ARG square-root start_ARG 2 end_ARG end_ARG start_ARG over^ start_ARG italic_σ end_ARG end_ARG italic_D ( divide start_ARG 1 end_ARG start_ARG square-root start_ARG 2 end_ARG over^ start_ARG italic_σ end_ARG end_ARG ) , italic_D ( italic_x ) = italic_e start_POSTSUPERSCRIPT - italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_x end_POSTSUPERSCRIPT italic_e start_POSTSUPERSCRIPT italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT italic_d italic_t . (12)

Before applying Theorem 1, we rewrite Equation (11) by changing the variable,

p(s1j|𝒔0,𝜽0)1Kmj+s1j𝑑s1j=s^1j=s1j+Kmjp(s^1j|𝒔0,𝜽0)1s^1j𝑑s^1j.𝑝conditionalsuperscriptsubscript𝑠1𝑗subscript𝒔0subscript𝜽01superscriptsubscript𝐾𝑚𝑗superscriptsubscript𝑠1𝑗differential-dsuperscriptsubscript𝑠1𝑗superscriptsubscript^𝑠1𝑗superscriptsubscript𝑠1𝑗superscriptsubscript𝐾𝑚𝑗𝑝conditionalsuperscriptsubscript^𝑠1𝑗subscript𝒔0subscript𝜽01superscriptsubscript^𝑠1𝑗differential-dsuperscriptsubscript^𝑠1𝑗\int p(s_{1}^{j}|\boldsymbol{s}_{0},\bm{\theta}_{0})\frac{1}{K_{m}^{j}+s_{1}^{% j}}ds_{1}^{j}\overset{\hat{s}_{1}^{j}=s_{1}^{j}+K_{m}^{j}}{=}\int p(\hat{s}_{1% }^{j}|\boldsymbol{s}_{0},\bm{\theta}_{0})\frac{1}{\hat{s}_{1}^{j}}d\hat{s}_{1}% ^{j}.∫ italic_p ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) divide start_ARG 1 end_ARG start_ARG italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT + italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG italic_d italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT start_OVERACCENT over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT = italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT + italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_OVERACCENT start_ARG = end_ARG ∫ italic_p ( over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) divide start_ARG 1 end_ARG start_ARG over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG italic_d over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT .

When 𝒔0Δtmuch-greater-thansubscript𝒔0Δ𝑡\boldsymbol{s}_{0}\gg\Delta tbold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≫ roman_Δ italic_t, both conditions σ^15^𝜎15\hat{\sigma}\leq\frac{1}{5}over^ start_ARG italic_σ end_ARG ≤ divide start_ARG 1 end_ARG start_ARG 5 end_ARG and σ^2aμ125superscript^𝜎2𝑎𝜇125\hat{\sigma}^{2}\leq\frac{a}{\mu}\leq\frac{1}{25}over^ start_ARG italic_σ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≤ divide start_ARG italic_a end_ARG start_ARG italic_μ end_ARG ≤ divide start_ARG 1 end_ARG start_ARG 25 end_ARG could be satisfied. Thus, Theorem 1 is applicable to the approximation of p(s1j|𝒔0,𝜽0)1Kmj+s1j𝑑s1j𝑝conditionalsuperscriptsubscript𝑠1𝑗subscript𝒔0subscript𝜽01superscriptsubscript𝐾𝑚𝑗superscriptsubscript𝑠1𝑗differential-dsuperscriptsubscript𝑠1𝑗\int p(s_{1}^{j}|\boldsymbol{s}_{0},\bm{\theta}_{0})\frac{1}{K_{m}^{j}+s_{1}^{% j}}ds_{1}^{j}∫ italic_p ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) divide start_ARG 1 end_ARG start_ARG italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT + italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG italic_d italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT, which means:

p(s1j|𝒔0,𝜽0)1Kmj+s1j𝑑s1jI(σ^0j)μ0j,𝑝conditionalsuperscriptsubscript𝑠1𝑗subscript𝒔0subscript𝜽01superscriptsubscript𝐾𝑚𝑗superscriptsubscript𝑠1𝑗differential-dsuperscriptsubscript𝑠1𝑗𝐼superscriptsubscript^𝜎0𝑗superscriptsubscript𝜇0𝑗\int p(s_{1}^{j}|\boldsymbol{s}_{0},\bm{\theta}_{0})\frac{1}{K_{m}^{j}+s_{1}^{% j}}ds_{1}^{j}\approx\frac{I(\hat{\sigma}_{0}^{j})}{\mu_{0}^{j}},∫ italic_p ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) divide start_ARG 1 end_ARG start_ARG italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT + italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG italic_d italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ≈ divide start_ARG italic_I ( over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG , (13)
μ0j=s0j+k=1R𝑵𝒋,𝒌𝒗k(𝒔0,𝜽0)Δt+Kmj,σ0j=k=1R𝑵j,k2𝒗k(𝒔0,𝜽0)Δt,σ^0j=σ0jμ0j.formulae-sequencesuperscriptsubscript𝜇0𝑗superscriptsubscript𝑠0𝑗superscriptsubscript𝑘1𝑅subscript𝑵𝒋𝒌subscript𝒗𝑘subscript𝒔0subscript𝜽0Δ𝑡superscriptsubscript𝐾𝑚𝑗formulae-sequencesuperscriptsubscript𝜎0𝑗superscriptsubscript𝑘1𝑅superscriptsubscript𝑵𝑗𝑘2subscript𝒗𝑘subscript𝒔0subscript𝜽0Δ𝑡superscriptsubscript^𝜎0𝑗superscriptsubscript𝜎0𝑗superscriptsubscript𝜇0𝑗\mu_{0}^{j}=s_{0}^{j}+\sum\limits_{k=1}^{R}\bm{N_{j,k}}\boldsymbol{v}_{k}(% \boldsymbol{s}_{0},\bm{\theta}_{0})\Delta t+K_{m}^{j},~{}~{}~{}\sigma_{0}^{j}=% \sqrt{\sum\limits_{k=1}^{R}\bm{N}_{j,k}^{2}\boldsymbol{v}_{k}(\boldsymbol{s}_{% 0},\bm{\theta}_{0})\Delta t},~{}~{}~{}\hat{\sigma}_{0}^{j}=\frac{\sigma_{0}^{j% }}{\mu_{0}^{j}}.italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT = italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT bold_italic_N start_POSTSUBSCRIPT bold_italic_j bold_, bold_italic_k end_POSTSUBSCRIPT bold_italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) roman_Δ italic_t + italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT , italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT = square-root start_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R end_POSTSUPERSCRIPT bold_italic_N start_POSTSUBSCRIPT italic_j , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT bold_italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) roman_Δ italic_t end_ARG , over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT = divide start_ARG italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG . (14)

Since Dawson’s Integral does not have an analytic expression, we introduce an analytic approximation of Dawson’s Integral (Filobello-Nino et al., 2019): D(x)12x+14x3𝐷𝑥12𝑥14superscript𝑥3D(x)\approx\frac{1}{2x}+\frac{1}{4x^{3}}italic_D ( italic_x ) ≈ divide start_ARG 1 end_ARG start_ARG 2 italic_x end_ARG + divide start_ARG 1 end_ARG start_ARG 4 italic_x start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG for x>2.68.𝑥2.68x>2.68.italic_x > 2.68 . Based on Filobello-Nino’s results, the approximation above can control the relative error below 2.5% (Filobello-Nino et al., 2019). Since we follow Theorem 1’s assumption that σ^015subscript^𝜎015\hat{\sigma}_{0}\leq\frac{1}{5}over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≤ divide start_ARG 1 end_ARG start_ARG 5 end_ARG, we could derive 12σ^0125>2.6812subscript^𝜎01252.68\frac{1}{\sqrt{2}\hat{\sigma}_{0}}\geq\frac{1}{\frac{\sqrt{2}}{5}}>2.68divide start_ARG 1 end_ARG start_ARG square-root start_ARG 2 end_ARG over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ≥ divide start_ARG 1 end_ARG start_ARG divide start_ARG square-root start_ARG 2 end_ARG end_ARG start_ARG 5 end_ARG end_ARG > 2.68. Thus:

I(σ^0j)μ0j=2σ^0jD(12σ^0j)μ0j2σ^0j2σ^0j2(1+(2σ^0j)22)μ0j=1+(σ^0j)2μ0j=(σ0j)2+(μ0j)2(μ0j)3.𝐼superscriptsubscript^𝜎0𝑗superscriptsubscript𝜇0𝑗2superscriptsubscript^𝜎0𝑗𝐷12superscriptsubscript^𝜎0𝑗superscriptsubscript𝜇0𝑗2superscriptsubscript^𝜎0𝑗2superscriptsubscript^𝜎0𝑗21superscript2superscriptsubscript^𝜎0𝑗22superscriptsubscript𝜇0𝑗1superscriptsuperscriptsubscript^𝜎0𝑗2superscriptsubscript𝜇0𝑗superscriptsuperscriptsubscript𝜎0𝑗2superscriptsuperscriptsubscript𝜇0𝑗2superscriptsuperscriptsubscript𝜇0𝑗3\frac{I(\hat{\sigma}_{0}^{j})}{\mu_{0}^{j}}=\frac{\frac{\sqrt{2}}{\hat{\sigma}% _{0}^{j}}D\left(\frac{1}{\sqrt{2}\hat{\sigma}_{0}^{j}}\right)}{\mu_{0}^{j}}% \approx\frac{\frac{\sqrt{2}}{\hat{\sigma}_{0}^{j}}\frac{\sqrt{2}\hat{\sigma}_{% 0}^{j}}{2}\left(1+\frac{(\sqrt{2}\hat{\sigma}_{0}^{j})^{2}}{2}\right)}{\mu_{0}% ^{j}}=\frac{1+{(\hat{\sigma}_{0}^{j})^{2}}}{\mu_{0}^{j}}=\frac{{(\sigma_{0}^{j% })^{2}}+{(\mu_{0}^{j})^{2}}}{{(\mu_{0}^{j})^{3}}}.divide start_ARG italic_I ( over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG = divide start_ARG divide start_ARG square-root start_ARG 2 end_ARG end_ARG start_ARG over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG italic_D ( divide start_ARG 1 end_ARG start_ARG square-root start_ARG 2 end_ARG over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG ) end_ARG start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG ≈ divide start_ARG divide start_ARG square-root start_ARG 2 end_ARG end_ARG start_ARG over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG divide start_ARG square-root start_ARG 2 end_ARG over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG ( 1 + divide start_ARG ( square-root start_ARG 2 end_ARG over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 2 end_ARG ) end_ARG start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG = divide start_ARG 1 + ( over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG = divide start_ARG ( italic_σ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG . (15)

Combining results from Equations (10), (11), (13) and (15), we can get,

E(𝒔2|𝒔0,𝜽0)f(𝒗(𝒔0,𝜽0))𝒔0+𝑵𝒗(𝒔0,𝜽0)Δt+𝑵VmaxΔt𝑵Δt(VmaxKmI(σ^0)μ0),Econditionalsubscript𝒔2subscript𝒔0subscript𝜽0𝑓𝒗subscript𝒔0subscript𝜽0subscript𝒔0𝑵𝒗subscript𝒔0subscript𝜽0Δ𝑡𝑵subscript𝑉𝑚𝑎𝑥Δ𝑡𝑵Δ𝑡direct-productsubscript𝑉𝑚𝑎𝑥subscript𝐾𝑚𝐼subscript^𝜎0subscript𝜇0\mbox{E}(\boldsymbol{s}_{2}|\boldsymbol{s}_{0},\bm{\theta}_{0})\approx f(\bm{v% }(\boldsymbol{s}_{0},\bm{\theta}_{0}))\triangleq\boldsymbol{s}_{0}+\bm{N}\bm{v% }(\boldsymbol{s}_{0},\bm{\theta}_{0})\Delta t+\bm{N}V_{max}\Delta t-\bm{N}% \Delta t\left(V_{max}\odot K_{m}\odot\frac{I(\hat{\sigma}_{0})}{\mu_{0}}\right),E ( bold_italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ≈ italic_f ( bold_italic_v ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) ≜ bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + bold_italic_N bold_italic_v ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) roman_Δ italic_t + bold_italic_N italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT roman_Δ italic_t - bold_italic_N roman_Δ italic_t ( italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT ⊙ italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ⊙ divide start_ARG italic_I ( over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ) , (16)

where I(σ^0)μ0𝐼subscript^𝜎0subscript𝜇0\frac{I(\hat{\sigma}_{0})}{\mu_{0}}divide start_ARG italic_I ( over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG,Vmaxsubscript𝑉𝑚𝑎𝑥V_{max}italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT and Kmsubscript𝐾𝑚K_{m}italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT are R𝑅Ritalic_R-dimensional vectors with j𝑗jitalic_jth component equal to I(σ^0j)μ0j𝐼superscriptsubscript^𝜎0𝑗superscriptsubscript𝜇0𝑗\frac{I(\hat{\sigma}_{0}^{j})}{\mu_{0}^{j}}divide start_ARG italic_I ( over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ) end_ARG start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_ARG,Vmaxjsuperscriptsubscript𝑉𝑚𝑎𝑥𝑗V_{max}^{j}italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT and Kmjsuperscriptsubscript𝐾𝑚𝑗K_{m}^{j}italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT.
In addition, we can derive Var(𝒔2|s0,𝜽0)Varconditionalsubscript𝒔2subscript𝑠0subscript𝜽0\mbox{Var}(\boldsymbol{s}_{2}|{s}_{0},\bm{\theta}_{0})Var ( bold_italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) following above process:

Var(𝒔2|𝒔0,𝜽0)Varconditionalsubscript𝒔2subscript𝒔0subscript𝜽0\displaystyle\mbox{Var}(\boldsymbol{s}_{2}|\boldsymbol{s}_{0},\bm{\theta}_{0})Var ( bold_italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) =\displaystyle== p(𝒔1,𝜽1|𝒔0,𝜽0)Var(𝒔2|𝒔1,𝜽1)d(𝒔1,𝜽1)𝑝subscript𝒔1conditionalsubscript𝜽1subscript𝒔0subscript𝜽0Varconditionalsubscript𝒔2subscript𝒔1subscript𝜽1𝑑subscript𝒔1subscript𝜽1\displaystyle\int p(\boldsymbol{s}_{1},\bm{\theta}_{1}|\boldsymbol{s}_{0},\bm{% \theta}_{0})\mbox{Var}(\boldsymbol{s}_{2}|\boldsymbol{s}_{1},\bm{\theta}_{1})d% (\boldsymbol{s}_{1},\bm{\theta}_{1})∫ italic_p ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) Var ( bold_italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_d ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) (17)
=\displaystyle== p(𝒔1,𝜽1|𝒔0,𝜽0)[𝑵diag(𝒗(𝒔1,𝜽1))𝑵Δt]d(𝒔1,𝜽1)𝑝subscript𝒔1conditionalsubscript𝜽1subscript𝒔0subscript𝜽0delimited-[]𝑵𝑑𝑖𝑎𝑔𝒗subscript𝒔1subscript𝜽1superscript𝑵topΔ𝑡𝑑subscript𝒔1subscript𝜽1\displaystyle\int p(\boldsymbol{s}_{1},\bm{\theta}_{1}|\boldsymbol{s}_{0},\bm{% \theta}_{0})\left[\bm{N}diag(\bm{v}(\boldsymbol{s}_{1},\bm{\theta}_{1}))\bm{N}% ^{\top}\Delta t\right]d(\boldsymbol{s}_{1},\bm{\theta}_{1})∫ italic_p ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) [ bold_italic_N italic_d italic_i italic_a italic_g ( bold_italic_v ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ) bold_italic_N start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_Δ italic_t ] italic_d ( bold_italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT )
=\displaystyle== 𝑵diag(Vmax(VmaxKmI(σ^0)μ0))𝑵Δt.𝑵𝑑𝑖𝑎𝑔subscript𝑉𝑚𝑎𝑥direct-productsubscript𝑉𝑚𝑎𝑥subscript𝐾𝑚𝐼subscript^𝜎0subscript𝜇0superscript𝑵topΔ𝑡\displaystyle\bm{N}diag\left(V_{max}-\left(V_{max}\odot K_{m}\odot\frac{I(\hat% {\sigma}_{0})}{\mu_{0}}\right)\right)\bm{N}^{\top}\Delta t.bold_italic_N italic_d italic_i italic_a italic_g ( italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT - ( italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT ⊙ italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ⊙ divide start_ARG italic_I ( over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ) ) bold_italic_N start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_Δ italic_t .

Since we know the distribution of 𝒔2subscript𝒔2\boldsymbol{s}_{2}bold_italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT conditioned on 𝒔0subscript𝒔0\boldsymbol{s}_{0}bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, we could extend it to time t𝑡titalic_t and write the form of μ𝜇\muitalic_μ and σ𝜎\sigmaitalic_σ:

d𝒔t=μ(𝒔t,𝜽t)dt+σ(𝒔t,𝜽t)dWt,𝑑subscript𝒔𝑡𝜇subscript𝒔𝑡subscript𝜽𝑡𝑑𝑡𝜎subscript𝒔𝑡subscript𝜽𝑡𝑑subscript𝑊𝑡\displaystyle d\boldsymbol{s}_{t}=\mu(\boldsymbol{s}_{t},\bm{\theta}_{t})dt+% \sigma(\boldsymbol{s}_{t},\bm{\theta}_{t})dW_{t},italic_d bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_μ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) italic_d italic_t + italic_σ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) italic_d italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ,
μ(𝒔t,𝜽t)=E[𝒔t+2𝒔t|𝒔t,𝜽t]𝑵𝒗(𝒔t,𝜽t)Δt+𝑵VmaxΔt𝑵Δt(VmaxKmI(σ^t)μt),𝜇subscript𝒔𝑡subscript𝜽𝑡Edelimited-[]subscript𝒔𝑡2conditionalsubscript𝒔𝑡subscript𝒔𝑡subscript𝜽𝑡𝑵𝒗subscript𝒔𝑡subscript𝜽𝑡Δ𝑡𝑵subscript𝑉𝑚𝑎𝑥Δ𝑡𝑵Δ𝑡direct-productsubscript𝑉𝑚𝑎𝑥subscript𝐾𝑚𝐼subscript^𝜎𝑡subscript𝜇𝑡\displaystyle\mu(\boldsymbol{s}_{t},\bm{\theta}_{t})=\mbox{E}[\boldsymbol{s}_{% t+2}-\boldsymbol{s}_{t}|\boldsymbol{s}_{t},\bm{\theta}_{t}]\approx\bm{N}\bm{v}% (\boldsymbol{s}_{t},\bm{\theta}_{t})\Delta t+\bm{N}V_{max}\Delta t-\bm{N}% \Delta t\left(V_{max}\odot K_{m}\odot\frac{I(\hat{\sigma}_{t})}{\mu_{t}}\right),italic_μ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) = E [ bold_italic_s start_POSTSUBSCRIPT italic_t + 2 end_POSTSUBSCRIPT - bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ] ≈ bold_italic_N bold_italic_v ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) roman_Δ italic_t + bold_italic_N italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT roman_Δ italic_t - bold_italic_N roman_Δ italic_t ( italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT ⊙ italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ⊙ divide start_ARG italic_I ( over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) end_ARG start_ARG italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG ) ,
σ2(𝒔t,𝜽t)=Var[𝒔t+2|𝒔t,𝜽t]𝑵diag(Vmax(VmaxKmI(σ^t)μt))𝑵Δt.superscript𝜎2subscript𝒔𝑡subscript𝜽𝑡Vardelimited-[]conditionalsubscript𝒔𝑡2subscript𝒔𝑡subscript𝜽𝑡𝑵𝑑𝑖𝑎𝑔subscript𝑉𝑚𝑎𝑥direct-productsubscript𝑉𝑚𝑎𝑥subscript𝐾𝑚𝐼subscript^𝜎𝑡subscript𝜇𝑡superscript𝑵topΔ𝑡\displaystyle\sigma^{2}(\boldsymbol{s}_{t},\bm{\theta}_{t})=\mbox{Var}[% \boldsymbol{s}_{t+2}|\boldsymbol{s}_{t},\bm{\theta}_{t}]\approx\bm{N}diag\left% (V_{max}-\left(V_{max}\odot K_{m}\odot\frac{I(\hat{\sigma}_{t})}{\mu_{t}}% \right)\right)\bm{N}^{\top}\Delta t.italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) = Var [ bold_italic_s start_POSTSUBSCRIPT italic_t + 2 end_POSTSUBSCRIPT | bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ] ≈ bold_italic_N italic_d italic_i italic_a italic_g ( italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT - ( italic_V start_POSTSUBSCRIPT italic_m italic_a italic_x end_POSTSUBSCRIPT ⊙ italic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ⊙ divide start_ARG italic_I ( over^ start_ARG italic_σ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) end_ARG start_ARG italic_μ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG ) ) bold_italic_N start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_Δ italic_t .

We could also calculate μ(𝒔t,𝜽t)𝒔t𝜇subscript𝒔𝑡subscript𝜽𝑡subscript𝒔𝑡\frac{\partial\mu(\boldsymbol{s}_{t},\bm{\theta}_{t})}{\partial\boldsymbol{s}_% {t}}divide start_ARG ∂ italic_μ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) end_ARG start_ARG ∂ bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG, μ(𝒔t,𝜽t)𝜽t𝜇subscript𝒔𝑡subscript𝜽𝑡subscript𝜽𝑡\frac{\partial\mu(\boldsymbol{s}_{t},\bm{\theta}_{t})}{\partial\bm{\theta}_{t}}divide start_ARG ∂ italic_μ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) end_ARG start_ARG ∂ bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG, σ(𝒔t,𝜽t)𝒔t𝜎subscript𝒔𝑡subscript𝜽𝑡subscript𝒔𝑡\frac{\partial\sigma(\boldsymbol{s}_{t},\bm{\theta}_{t})}{\partial\boldsymbol{% s}_{t}}divide start_ARG ∂ italic_σ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) end_ARG start_ARG ∂ bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG and σ(𝒔t,𝜽t)𝜽t𝜎subscript𝒔𝑡subscript𝜽𝑡subscript𝜽𝑡\frac{\partial\sigma(\boldsymbol{s}_{t},\bm{\theta}_{t})}{\partial\bm{\theta}_% {t}}divide start_ARG ∂ italic_σ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) end_ARG start_ARG ∂ bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG which will be the input of the algorithm in Section 4.

4 Local Sensitivity Analysis

4.1 Convergence Analysis

We start from the gradient of the backward flow represented in Equation (6),

ψ0,t2(𝒔t2,𝜽t2)subscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2\displaystyle\triangledown\psi_{0,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t% _{2}})▽ italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) =\displaystyle== 𝒔t20t2(μ(ψt,t2(𝒔t2,𝜽t2)),f(ψt,t2(𝒔t2,𝜽t2)))𝑑tsubscript𝒔subscript𝑡2superscriptsubscript0subscript𝑡2superscript𝜇subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2𝑓subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2topdifferential-d𝑡\displaystyle\triangledown\boldsymbol{s}_{t_{2}}-\triangledown\int_{0}^{t_{2}}% (\mu(\psi_{t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})),f(\psi_{t,t_{% 2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})))^{\top}dt▽ bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT - ▽ ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_μ ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) , italic_f ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_d italic_t
0t2(σ(ψt,t2(𝒔t2,𝜽t2)),g(ψt,t2(𝒔t2,𝜽t2)))𝑑W~t.superscriptsubscript0subscript𝑡2superscript𝜎subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2𝑔subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2topdifferential-dsubscript~𝑊𝑡\displaystyle-\triangledown\int_{0}^{t_{2}}(\sigma(\psi_{t,t_{2}}(\boldsymbol{% s}_{t_{2}},\bm{\theta}_{t_{2}})),g(\psi_{t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{% \theta}_{t_{2}})))^{\top}\circ d\widetilde{W}_{t}.- ▽ ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( italic_σ ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) , italic_g ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ∘ italic_d over~ start_ARG italic_W end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT .

The gradient 𝒔0subscript𝒔0\triangledown\boldsymbol{s}_{0}▽ bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is the identity matrix 𝕀𝕀\mathbb{I}blackboard_I with d𝑑ditalic_d dimension, where d𝑑ditalic_d is the sum of the dimension of bioprocess states 𝒔tsubscript𝒔𝑡\boldsymbol{s}_{t}bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT and the parameter set 𝜽tsubscript𝜽𝑡\bm{\theta}_{t}bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT. For the last two terms on the right side of equation, based on Proposition 2.4.3 and Theorem 3.4.3 from Kunita (2019), we can switch the order of derivative and integral, i.e.,

ψ0,t2(𝒔t2,𝜽t2)subscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2\displaystyle\triangledown\psi_{0,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t% _{2}})▽ italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) =\displaystyle== 𝕀d0t2(ψμ(ψt,t2(𝒔t2,𝜽t2)),ψf(ψt,t2(𝒔t2,𝜽t2)))ψt,t2(𝒔t2,𝜽t2)𝑑tsubscript𝕀𝑑superscriptsubscript0subscript𝑡2superscriptsubscript𝜓𝜇subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝜓𝑓subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2topsubscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2differential-d𝑡\displaystyle\mathbb{I}_{d}-\int_{0}^{t_{2}}(\triangledown_{\psi}\mu(\psi_{t,t% _{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})),\triangledown_{\psi}f(\psi_% {t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})))^{\top}\triangledown% \psi_{t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})dtblackboard_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT - ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( ▽ start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT italic_μ ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) , ▽ start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT italic_f ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ▽ italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) italic_d italic_t (18)
0t2(ψσ(ψt,t2(𝒔t2,𝜽t2)),ψg(ψt,t2(𝒔t2,𝜽t2)))ψt,t2(𝒔t2,𝜽t2)𝑑W~t.superscriptsubscript0subscript𝑡2superscriptsubscript𝜓𝜎subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝜓𝑔subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2topsubscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2differential-dsubscript~𝑊𝑡\displaystyle-\int_{0}^{t_{2}}(\triangledown_{\psi}\sigma(\psi_{t,t_{2}}(% \boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})),\triangledown_{\psi}g(\psi_{t,t_{% 2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})))^{\top}\triangledown\psi_{t,t% _{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})\circ d\widetilde{W}_{t}.- ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( ▽ start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT italic_σ ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) , ▽ start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT italic_g ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ▽ italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ∘ italic_d over~ start_ARG italic_W end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT .

Since ψ0,t2subscript𝜓0subscript𝑡2\psi_{0,t_{2}}italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT is the inverse function of Φ0,t2subscriptΦ0subscript𝑡2\Phi_{0,t_{2}}roman_Φ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, we have Φ0,t2(ψ0,t2(𝒔t2,𝜽t2))=(𝒔t2,𝜽t2)subscriptΦ0subscript𝑡2subscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2\Phi_{0,t_{2}}(\psi_{0,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}}))=(% \boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})roman_Φ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) = ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ). Therefore, by applying the chain rule, Φ0,t2(ψ0,t2(𝒔t2,𝜽t2))ψ0,t2(𝒔t2,𝜽t2)=𝕀dsubscriptΦ0subscript𝑡2subscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝕀𝑑\triangledown\Phi_{0,t_{2}}(\psi_{0,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_% {t_{2}}))\triangledown\psi_{0,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}% })=\mathbb{I}_{d}▽ roman_Φ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ▽ italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = blackboard_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT. Then by applying Stratonovich version of Itô’s formula (Theorem 2.4.1 (Kunita, 2019)) on Equation (18), we have,

Φ0,t2(ψ0,t2(𝒔t2,𝜽t2))ψ0,t2(𝒔t2,𝜽t2)=(ψ0,t2(𝒔t2,𝜽t2))1ψ0,t2(𝒔t2,𝜽t2)=1ψ0,t2(𝒔t2,𝜽t2)2,subscriptΦ0subscript𝑡2subscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2superscriptsubscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡21subscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡21subscript𝜓0subscript𝑡2superscriptsubscript𝒔subscript𝑡2subscript𝜽subscript𝑡22\frac{\partial\triangledown\Phi_{0,t_{2}}(\psi_{0,t_{2}}(\boldsymbol{s}_{t_{2}% },\bm{\theta}_{t_{2}}))}{\partial\triangledown\psi_{0,t_{2}}(\boldsymbol{s}_{t% _{2}},\bm{\theta}_{t_{2}})}=\frac{\partial(\triangledown\psi_{0,t_{2}}(% \boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}}))^{-1}}{\partial\triangledown\psi_{% 0,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})}=-\frac{1}{\triangledown% \psi_{0,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})^{2}},divide start_ARG ∂ ▽ roman_Φ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) end_ARG start_ARG ∂ ▽ italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG = divide start_ARG ∂ ( ▽ italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_ARG start_ARG ∂ ▽ italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG = - divide start_ARG 1 end_ARG start_ARG ▽ italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ,

where

Φ0,t2(ψ0,t2(𝒔t2,𝜽t2))subscriptΦ0subscript𝑡2subscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2\displaystyle\triangledown\Phi_{0,t_{2}}(\psi_{0,t_{2}}(\boldsymbol{s}_{t_{2}}% ,\bm{\theta}_{t_{2}}))▽ roman_Φ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) (19)
=\displaystyle== 𝕀d0t2(ψμ(ψt,t2(𝒔t2,𝜽t2)),ψf(ψt,t2(𝒔t2,𝜽t2)))ψt,t2(𝒔t2,𝜽t2)Φt,t2(ψt,t2(𝒔t2,𝜽t2))ψt,t2(𝒔t2,𝜽t2)𝑑tsubscript𝕀𝑑superscriptsubscript0subscript𝑡2superscriptsubscript𝜓𝜇subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝜓𝑓subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2topsubscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscriptΦ𝑡subscript𝑡2subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2differential-d𝑡\displaystyle\mathbb{I}_{d}-\int_{0}^{t_{2}}(\triangledown_{\psi}\mu(\psi_{t,t% _{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})),\triangledown_{\psi}f(\psi_% {t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})))^{\top}\triangledown% \psi_{t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})\frac{\partial% \triangledown\Phi_{t,t_{2}}(\psi_{t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_% {t_{2}}))}{\partial\triangledown\psi_{t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{% \theta}_{t_{2}})}dtblackboard_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT - ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( ▽ start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT italic_μ ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) , ▽ start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT italic_f ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ▽ italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) divide start_ARG ∂ ▽ roman_Φ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) end_ARG start_ARG ∂ ▽ italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG italic_d italic_t
0t2(ψσ(ψt,t2(𝒔t2,𝜽t2)),ψg(ψt,t2(𝒔t2,𝜽t2)))ψt,t2(𝒔t2,𝜽t2)Φt,t2(ψt,t2(𝒔t2,𝜽t2))ψt,t2(𝒔t2,𝜽t2)𝑑W~tsuperscriptsubscript0subscript𝑡2superscriptsubscript𝜓𝜎subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝜓𝑔subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2topsubscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscriptΦ𝑡subscript𝑡2subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2differential-dsubscript~𝑊𝑡\displaystyle-\int_{0}^{t_{2}}(\triangledown_{\psi}\sigma(\psi_{t,t_{2}}(% \boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})),\triangledown_{\psi}g(\psi_{t,t_{% 2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})))^{\top}\triangledown\psi_{t,t% _{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})\frac{\partial\triangledown% \Phi_{t,t_{2}}(\psi_{t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}}))}{% \partial\triangledown\psi_{t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}}% )}\circ d\widetilde{W}_{t}- ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( ▽ start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT italic_σ ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) , ▽ start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT italic_g ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ▽ italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) divide start_ARG ∂ ▽ roman_Φ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) end_ARG start_ARG ∂ ▽ italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG ∘ italic_d over~ start_ARG italic_W end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT
=\displaystyle== 𝕀d+0t2(ψμ(ψt,t2(𝒔t2,𝜽t2)),ψf(ψt,t2(𝒔t2,𝜽t2)))Φt,t2(ψt,t2(𝒔t2,𝜽t2))𝑑tsubscript𝕀𝑑superscriptsubscript0subscript𝑡2superscriptsubscript𝜓𝜇subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝜓𝑓subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2topsubscriptΦ𝑡subscript𝑡2subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2differential-d𝑡\displaystyle\mathbb{I}_{d}+\int_{0}^{t_{2}}(\triangledown_{\psi}\mu(\psi_{t,t% _{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})),\triangledown_{\psi}f(\psi_% {t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})))^{\top}\triangledown% \Phi_{t,t_{2}}(\psi_{t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}}))dtblackboard_I start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT + ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( ▽ start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT italic_μ ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) , ▽ start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT italic_f ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ▽ roman_Φ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) italic_d italic_t
+0t2(ψσ(ψt,t2(𝒔t2,𝜽t2)),ψg(ψt,t2(𝒔t2,𝜽t2)))Φt,t2(ψt,t2(𝒔t2,𝜽t2))𝑑W~t.superscriptsubscript0subscript𝑡2superscriptsubscript𝜓𝜎subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝜓𝑔subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2topsubscriptΦ𝑡subscript𝑡2subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2differential-dsubscript~𝑊𝑡\displaystyle+\int_{0}^{t_{2}}(\triangledown_{\psi}\sigma(\psi_{t,t_{2}}(% \boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})),\triangledown_{\psi}g(\psi_{t,t_{% 2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})))^{\top}\triangledown\Phi_{t,t% _{2}}(\psi_{t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}}))\circ d% \widetilde{W}_{t}.+ ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( ▽ start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT italic_σ ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) , ▽ start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT italic_g ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ▽ roman_Φ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ∘ italic_d over~ start_ARG italic_W end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT .

Let A~0,t2(𝒔t2,𝜽t2)=A0,t2(ψ0,t2(𝒔t2,𝜽t2))subscript~𝐴0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝐴0subscript𝑡2subscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2\widetilde{A}_{0,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})=A_{0,t_{2}% }(\psi_{0,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}}))over~ start_ARG italic_A end_ARG start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_A start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ). Then, by combining Equation (19) and

A0,t2(𝒔0,𝜽0)=(Φ0,t2(𝒔0,𝜽0))Φ0,t2(𝒔0,𝜽0),subscript𝐴0subscript𝑡2subscript𝒔0subscript𝜽0subscriptΦ0subscript𝑡2subscript𝒔0subscript𝜽0subscriptΦ0subscript𝑡2subscript𝒔0subscript𝜽0A_{0,t_{2}}(\boldsymbol{s}_{0},\bm{\theta}_{0})=\triangledown\mathcal{L}(\Phi_% {0,t_{2}}(\boldsymbol{s}_{0},\bm{\theta}_{0}))\triangledown\Phi_{0,t_{2}}(% \boldsymbol{s}_{0},\bm{\theta}_{0}),\\ italic_A start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = ▽ caligraphic_L ( roman_Φ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) ▽ roman_Φ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ,

we can derive

A~0,t2(𝒔t2,𝜽t2)=A0,t2(ψ0,t2(𝒔t2,𝜽t2))=(Φ0,t2(ψ0,t2(𝒔t2,𝜽t2)))Φ0,t2(ψ0,t2(𝒔t2,𝜽t2))subscript~𝐴0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝐴0subscript𝑡2subscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscriptΦ0subscript𝑡2subscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscriptΦ0subscript𝑡2subscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2\displaystyle\widetilde{A}_{0,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}% })=A_{0,t_{2}}(\psi_{0,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}}))=% \triangledown\mathcal{L}(\Phi_{0,t_{2}}(\psi_{0,t_{2}}(\boldsymbol{s}_{t_{2}},% \bm{\theta}_{t_{2}})))\triangledown\Phi_{0,t_{2}}(\psi_{0,t_{2}}(\boldsymbol{s% }_{t_{2}},\bm{\theta}_{t_{2}}))over~ start_ARG italic_A end_ARG start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_A start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) = ▽ caligraphic_L ( roman_Φ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ) ▽ roman_Φ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) (20)
=\displaystyle== (𝒔t2)+0t2(ψμ(ψt,t2(𝒔t2,𝜽t2)),ψf(ψt,t2(𝒔t2,𝜽t2)))A~t,t2(𝒔t2,𝜽t2)𝑑tsubscript𝒔subscript𝑡2superscriptsubscript0subscript𝑡2superscriptsubscript𝜓𝜇subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝜓𝑓subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2topsubscript~𝐴𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2differential-d𝑡\displaystyle\triangledown\mathcal{L}(\boldsymbol{s}_{t_{2}})+\int_{0}^{t_{2}}% (\triangledown_{\psi}\mu(\psi_{t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_% {2}})),\triangledown_{\psi}f(\psi_{t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}% _{t_{2}})))^{\top}\widetilde{A}_{t,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{% t_{2}})dt▽ caligraphic_L ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) + ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( ▽ start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT italic_μ ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) , ▽ start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT italic_f ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over~ start_ARG italic_A end_ARG start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) italic_d italic_t
+0t2(ψσ(ψt,t2(𝒔t2,𝜽t2)),ψg(ψt,t2(𝒔t2,𝜽t2)))A~t,t2(𝒔t2,𝜽t2)𝑑W~t.superscriptsubscript0subscript𝑡2superscriptsubscript𝜓𝜎subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝜓𝑔subscript𝜓𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2topsubscript~𝐴𝑡subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2differential-dsubscript~𝑊𝑡\displaystyle+\int_{0}^{t_{2}}(\triangledown_{\psi}\sigma(\psi_{t,t_{2}}(% \boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})),\triangledown_{\psi}g(\psi_{t,t_{% 2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})))^{\top}\widetilde{A}_{t,t_{2}% }(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})\circ d\widetilde{W}_{t}.+ ∫ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( ▽ start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT italic_σ ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) , ▽ start_POSTSUBSCRIPT italic_ψ end_POSTSUBSCRIPT italic_g ( italic_ψ start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over~ start_ARG italic_A end_ARG start_POSTSUBSCRIPT italic_t , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ∘ italic_d over~ start_ARG italic_W end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT .

From Equations (6) and (20), we can see that the drift and diffusion terms for ψ0,t2(𝒔t2,𝜽t2)subscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2\psi_{0,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) and A~0,t2(𝒔t2,𝜽t2)subscript~𝐴0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2\widetilde{A}_{0,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})over~ start_ARG italic_A end_ARG start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) are Cb,1superscriptsubscript𝐶𝑏1C_{b}^{\infty,1}italic_C start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ , 1 end_POSTSUPERSCRIPT, which means the system (A~0,t2(𝒔t2,𝜽t2),ψ0,t2(𝒔t2,𝜽t2))subscript~𝐴0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2subscript𝜓0subscript𝑡2subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2(\widetilde{A}_{0,t_{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}}),\psi_{0,t% _{2}}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}}))( over~ start_ARG italic_A end_ARG start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , italic_ψ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) has a unique strong solution. Thus, we could define A~0,T(𝒔t2,𝜽t2)=F(𝒔t2,𝜽t2,W)subscript~𝐴0𝑇subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2𝐹subscript𝒔subscript𝑡2subscript𝜽subscript𝑡2𝑊\widetilde{A}_{0,T}(\boldsymbol{s}_{t_{2}},\bm{\theta}_{t_{2}})=F(\boldsymbol{% s}_{t_{2}},\bm{\theta}_{t_{2}},W)over~ start_ARG italic_A end_ARG start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = italic_F ( bold_italic_s start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_W ) where W={Wt}0tT𝑊subscriptsubscript𝑊𝑡0𝑡𝑇W=\{W_{t}\}_{0\leq t\leq T}italic_W = { italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT } start_POSTSUBSCRIPT 0 ≤ italic_t ≤ italic_T end_POSTSUBSCRIPT is a path of Wiener process and F:d×C([0,1],J)d:𝐹superscript𝑑𝐶01superscript𝐽superscript𝑑F:\mathbb{R}^{d}\times C([0,1],\mathbb{R}^{J})\to\mathbb{R}^{d}italic_F : blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT × italic_C ( [ 0 , 1 ] , blackboard_R start_POSTSUPERSCRIPT italic_J end_POSTSUPERSCRIPT ) → blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT is a deterministic measurable function, which is also called Itô map. In our case, the number of reactions denoted by J𝐽Jitalic_J defines the dimension of the Wiener process deployed to simulate the stochastic reaction network. Note that A0,t2(𝒔0,𝜽0)=A~0,t2(Φ0,t2(𝒔0,𝜽0))subscript𝐴0subscript𝑡2subscript𝒔0subscript𝜽0subscript~𝐴0subscript𝑡2subscriptΦ0subscript𝑡2subscript𝒔0subscript𝜽0A_{0,t_{2}}(\boldsymbol{s}_{0},\bm{\theta}_{0})=\widetilde{A}_{0,t_{2}}(\Phi_{% 0,t_{2}}(\boldsymbol{s}_{0},\bm{\theta}_{0}))italic_A start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = over~ start_ARG italic_A end_ARG start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( roman_Φ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ). In Section 2.2, we’ve already shown that Φ0,t2(𝒔0,𝜽0)subscriptΦ0subscript𝑡2subscript𝒔0subscript𝜽0\Phi_{0,t_{2}}(\boldsymbol{s}_{0},\bm{\theta}_{0})roman_Φ start_POSTSUBSCRIPT 0 , italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) has a unique strong solution. Similar to F𝐹Fitalic_F, we define G:d×C([0,1],J)d:𝐺superscript𝑑𝐶01superscript𝐽superscript𝑑G:\mathbb{R}^{d}\times C([0,1],\mathbb{R}^{J})\to\mathbb{R}^{d}italic_G : blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT × italic_C ( [ 0 , 1 ] , blackboard_R start_POSTSUPERSCRIPT italic_J end_POSTSUPERSCRIPT ) → blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT as the solution map for forward flow, i.e., G(𝒔0,𝜽0,W)=Φ0,T(𝒔0,𝜽0)𝐺subscript𝒔0subscript𝜽0𝑊subscriptΦ0𝑇subscript𝒔0subscript𝜽0G(\boldsymbol{s}_{0},\bm{\theta}_{0},W)=\Phi_{0,T}(\boldsymbol{s}_{0},\bm{% \theta}_{0})italic_G ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_W ) = roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ). Then, apparently:

A0,T(𝒔0,𝜽0)=A~0,T(G(𝒔0,𝜽0,W))=F(G(𝒔0,𝜽0,W),W).subscript𝐴0𝑇subscript𝒔0subscript𝜽0subscript~𝐴0𝑇𝐺subscript𝒔0subscript𝜽0𝑊𝐹𝐺subscript𝒔0subscript𝜽0𝑊𝑊A_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0})=\widetilde{A}_{0,T}(G(\boldsymbol{% s}_{0},\bm{\theta}_{0},W))=F(G(\boldsymbol{s}_{0},\bm{\theta}_{0},W),W).italic_A start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = over~ start_ARG italic_A end_ARG start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( italic_G ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_W ) ) = italic_F ( italic_G ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_W ) , italic_W ) .

4.2 Algorithm Development

Based on Equations (5), (6), and (20), with μ(𝒔t,𝜽t)𝒔t𝜇subscript𝒔𝑡subscript𝜽𝑡subscript𝒔𝑡\frac{\partial\mu(\boldsymbol{s}_{t},\bm{\theta}_{t})}{\partial\boldsymbol{s}_% {t}}divide start_ARG ∂ italic_μ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) end_ARG start_ARG ∂ bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG, μ(𝒔t,𝜽t)𝜽t𝜇subscript𝒔𝑡subscript𝜽𝑡subscript𝜽𝑡\frac{\partial\mu(\boldsymbol{s}_{t},\bm{\theta}_{t})}{\partial\bm{\theta}_{t}}divide start_ARG ∂ italic_μ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) end_ARG start_ARG ∂ bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG, σ(𝒔t,𝜽t)𝒔t𝜎subscript𝒔𝑡subscript𝜽𝑡subscript𝒔𝑡\frac{\partial\sigma(\boldsymbol{s}_{t},\bm{\theta}_{t})}{\partial\boldsymbol{% s}_{t}}divide start_ARG ∂ italic_σ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) end_ARG start_ARG ∂ bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG and σ(𝒔t,𝜽t)𝜽t𝜎subscript𝒔𝑡subscript𝜽𝑡subscript𝜽𝑡\frac{\partial\sigma(\boldsymbol{s}_{t},\bm{\theta}_{t})}{\partial\bm{\theta}_% {t}}divide start_ARG ∂ italic_σ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) end_ARG start_ARG ∂ bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG as inputs, we could approximate A0,T(𝒔0,𝜽0)subscript𝐴0𝑇subscript𝒔0subscript𝜽0A_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0})italic_A start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ). Since the system is described in Stratonovich integral, the most commonly used Euler-Maruyama Scheme is not applicable, which requires the system to be represented in Itô integral. Thus, we instead deployed the Euler-Heun method below:

(𝒔t+1,𝜽t+1)superscriptsubscript𝒔𝑡1subscript𝜽𝑡1top\displaystyle(\boldsymbol{s}_{t+1},\boldsymbol{\theta}_{t+1})^{\top}( bold_italic_s start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t + 1 end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT =\displaystyle== (𝒔t,𝜽t)+(μ(𝒔t,𝜽t),f(𝒔t,𝜽t))Δtsuperscriptsubscript𝒔𝑡subscript𝜽𝑡topsuperscript𝜇subscript𝒔𝑡subscript𝜽𝑡𝑓subscript𝒔𝑡subscript𝜽𝑡topΔ𝑡\displaystyle(\boldsymbol{s}_{t},\boldsymbol{\theta}_{t})^{\top}+(\mu(% \boldsymbol{s}_{t},\bm{\theta}_{t}),f(\boldsymbol{s}_{t},\bm{\theta}_{t}))^{% \top}\Delta t( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + ( italic_μ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , italic_f ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT roman_Δ italic_t
+12[(σ(𝒔t,𝜽t),g(𝒔t,𝜽t))+(σ(𝒔¯t,𝜽¯t),g(𝒔¯t,𝜽¯t))](WtWt1),12delimited-[]superscript𝜎subscript𝒔𝑡subscript𝜽𝑡𝑔subscript𝒔𝑡subscript𝜽𝑡topsuperscript𝜎subscript¯𝒔𝑡subscript¯𝜽𝑡𝑔subscript¯𝒔𝑡subscript¯𝜽𝑡topsubscript𝑊𝑡subscript𝑊𝑡1\displaystyle+\frac{1}{2}\left[(\sigma(\boldsymbol{s}_{t},\bm{\theta}_{t}),g(% \boldsymbol{s}_{t},\bm{\theta}_{t}))^{\top}+(\sigma(\overline{\boldsymbol{s}}_% {t},\overline{\bm{\theta}}_{t}),g(\overline{\boldsymbol{s}}_{t},\overline{\bm{% \theta}}_{t}))^{\top}\right](W_{t}-W_{t-1}),+ divide start_ARG 1 end_ARG start_ARG 2 end_ARG [ ( italic_σ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , italic_g ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + ( italic_σ ( over¯ start_ARG bold_italic_s end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , over¯ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , italic_g ( over¯ start_ARG bold_italic_s end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , over¯ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] ( italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_W start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ) ,
(𝒔¯t,𝜽¯t)superscriptsubscript¯𝒔𝑡subscript¯𝜽𝑡top\displaystyle(\overline{\boldsymbol{s}}_{t},\overline{\bm{\theta}}_{t})^{\top}( over¯ start_ARG bold_italic_s end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , over¯ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT =\displaystyle== (𝒔t,𝜽t)+(σ(𝒔t,𝜽t),g(𝒔t,𝜽t))(WtWt1).superscriptsubscript𝒔𝑡subscript𝜽𝑡topsuperscript𝜎subscript𝒔𝑡subscript𝜽𝑡𝑔subscript𝒔𝑡subscript𝜽𝑡topsubscript𝑊𝑡subscript𝑊𝑡1\displaystyle(\boldsymbol{s}_{t},\bm{\theta}_{t})^{\top}+(\sigma(\boldsymbol{s% }_{t},\bm{\theta}_{t}),g(\boldsymbol{s}_{t},\bm{\theta}_{t}))^{\top}(W_{t}-W_{% t-1}).( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT + ( italic_σ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , italic_g ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT - italic_W start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT ) .

Based on the schemes, the procedure of adjoint sensitivity analysis on the SDEs-based mechanistic model is described in Algorithm 1. The algorithm is built on the Euler-Heun Scheme, using the gradients of the drift and diffusion terms as inputs, and produces the expected gradient of the simulation prediction MSE as its output. The algorithm begins with the generation of sample paths of the Wiener process Wtnsuperscriptsubscript𝑊𝑡𝑛W_{t}^{n}italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT in Step 1. Subsequently, as depicted in Steps 2 and 3, we conduct simulations under 𝒔0subscript𝒔0\boldsymbol{s}_{0}bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and 𝜽0subscript𝜽0\bm{\theta}_{0}bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT to obtain the simulated output denoted by O(𝒔)𝑂𝒔O(\boldsymbol{s})italic_O ( bold_italic_s ). In the empirical study, the observed states 𝒔csuperscript𝒔𝑐\bm{s}^{c}bold_italic_s start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT are generated from the model with a predefined set of true parameters 𝜽csuperscript𝜽𝑐\bm{\theta}^{c}bold_italic_θ start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT, using the same path Wtnsuperscriptsubscript𝑊𝑡𝑛W_{t}^{n}italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT to control randomness. Wtnsuperscriptsubscript𝑊𝑡𝑛W_{t}^{n}italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is queried again in the backward pass W~tnsuperscriptsubscript~𝑊𝑡𝑛\widetilde{W}_{t}^{n}over~ start_ARG italic_W end_ARG start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT to generate the backward flow ψ(t1)Δt,Tn(Φ0,Tn(𝒔0,𝜽0))subscriptsuperscript𝜓𝑛𝑡1Δ𝑡𝑇superscriptsubscriptΦ0𝑇𝑛subscript𝒔0subscript𝜽0\psi^{n}_{(t-1)\Delta t,T}(\Phi_{0,T}^{n}(\boldsymbol{s}_{0},\bm{\theta}_{0}))italic_ψ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( italic_t - 1 ) roman_Δ italic_t , italic_T end_POSTSUBSCRIPT ( roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ). By combining observed output, simulated output, and the backward flow, we derive sensitivity A~(t1)Δt,Tn(Φ0,Tn(𝒔0,𝜽0))subscriptsuperscript~𝐴𝑛𝑡1Δ𝑡𝑇superscriptsubscriptΦ0𝑇𝑛subscript𝒔0subscript𝜽0\widetilde{A}^{n}_{(t-1)\Delta t,T}(\Phi_{0,T}^{n}(\boldsymbol{s}_{0},\bm{% \theta}_{0}))over~ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( italic_t - 1 ) roman_Δ italic_t , italic_T end_POSTSUBSCRIPT ( roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) corresponding to path Wtnsuperscriptsubscript𝑊𝑡𝑛W_{t}^{n}italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT in Step 6. By repeating these steps, we generate N𝑁Nitalic_N sample paths, and the mean value of A~(t1)Δt,Tn(Φ0,Tn(𝒔0,𝜽0))subscriptsuperscript~𝐴𝑛𝑡1Δ𝑡𝑇superscriptsubscriptΦ0𝑇𝑛subscript𝒔0subscript𝜽0\widetilde{A}^{n}_{(t-1)\Delta t,T}(\Phi_{0,T}^{n}(\boldsymbol{s}_{0},\bm{% \theta}_{0}))over~ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( italic_t - 1 ) roman_Δ italic_t , italic_T end_POSTSUBSCRIPT ( roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) for each sample path serves as the final sensitivity, as shown in Step 7.

Input:
  • Observed states 𝒔csuperscript𝒔𝑐\boldsymbol{s}^{c}bold_italic_s start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT from the real system.

  • Initial parameters 𝜽0subscript𝜽0\bm{\theta}_{0}bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT.

  • Start time t0subscript𝑡0t_{0}italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, end time T𝑇Titalic_T.

  • Gradient of drift and diffusion term μ(𝒔t,𝜽t)𝒔t𝜇subscript𝒔𝑡subscript𝜽𝑡subscript𝒔𝑡\frac{\partial\mu(\boldsymbol{s}_{t},\bm{\theta}_{t})}{\partial\boldsymbol{s}_% {t}}divide start_ARG ∂ italic_μ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) end_ARG start_ARG ∂ bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG, μ(𝒔t,𝜽t)𝜽t𝜇subscript𝒔𝑡subscript𝜽𝑡subscript𝜽𝑡\frac{\partial\mu(\boldsymbol{s}_{t},\bm{\theta}_{t})}{\partial\bm{\theta}_{t}}divide start_ARG ∂ italic_μ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) end_ARG start_ARG ∂ bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG, σ(𝒔t,𝜽t)𝒔t𝜎subscript𝒔𝑡subscript𝜽𝑡subscript𝒔𝑡\frac{\partial\sigma(\boldsymbol{s}_{t},\bm{\theta}_{t})}{\partial\boldsymbol{% s}_{t}}divide start_ARG ∂ italic_σ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) end_ARG start_ARG ∂ bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG and σ(𝒔t,𝜽t)𝜽t𝜎subscript𝒔𝑡subscript𝜽𝑡subscript𝜽𝑡\frac{\partial\sigma(\boldsymbol{s}_{t},\bm{\theta}_{t})}{\partial\bm{\theta}_% {t}}divide start_ARG ∂ italic_σ ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) end_ARG start_ARG ∂ bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG.

  • Number of iterations N𝑁Nitalic_N.

  • Grid size ΔtΔ𝑡\Delta troman_Δ italic_t, number of steps S=TΔt𝑆𝑇Δ𝑡S=\frac{T}{\Delta t}italic_S = divide start_ARG italic_T end_ARG start_ARG roman_Δ italic_t end_ARG.

Output: E[((O(𝒔)O(𝒔c))2𝒔0,(O(𝒔)O(𝒔c))2𝜽0)|(𝒔0,𝜽0)]Edelimited-[]conditionalsuperscriptsuperscript𝑂𝒔𝑂superscript𝒔𝑐2subscript𝒔0superscript𝑂𝒔𝑂superscript𝒔𝑐2subscript𝜽0topsubscript𝒔0subscript𝜽0\mbox{E}\left[\left.\left(\frac{\partial(O(\boldsymbol{s})-O(\boldsymbol{s}^{c% }))^{2}}{\partial\boldsymbol{s}_{0}},\frac{\partial(O(\boldsymbol{s})-O(% \boldsymbol{s}^{c}))^{2}}{\partial\bm{\theta}_{0}}\right)^{\top}\right|(% \boldsymbol{s}_{0},\bm{\theta}_{0})\right]E [ ( divide start_ARG ∂ ( italic_O ( bold_italic_s ) - italic_O ( bold_italic_s start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∂ bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG , divide start_ARG ∂ ( italic_O ( bold_italic_s ) - italic_O ( bold_italic_s start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∂ bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT | ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ]
1. Calculate the real system output O(𝒔c)𝑂superscript𝒔𝑐O(\boldsymbol{s}^{c})italic_O ( bold_italic_s start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ) and define 𝒔0=𝒔0csubscript𝒔0superscriptsubscript𝒔0𝑐\boldsymbol{s}_{0}=\boldsymbol{s}_{0}^{c}bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT;
for n=1,,N𝑛1𝑁n=1,\ldots,Nitalic_n = 1 , … , italic_N do
       for t1𝑡1t\leftarrow 1italic_t ← 1 to S𝑆Sitalic_S do
             2. Generate a sample path of Wiener process Wtnsubscriptsuperscript𝑊𝑛𝑡W^{n}_{t}italic_W start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT with grid size ΔtΔ𝑡\Delta troman_Δ italic_t;
             3. Based on Wtnsubscriptsuperscript𝑊𝑛𝑡W^{n}_{t}italic_W start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, generate Φ0,tΔtn(𝒔0,𝜽0)subscriptsuperscriptΦ𝑛0𝑡Δ𝑡subscript𝒔0subscript𝜽0\Phi^{n}_{0,t\Delta t}(\boldsymbol{s}_{0},\bm{\theta}_{0})roman_Φ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 , italic_t roman_Δ italic_t end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) from Equation (5) with Euler-Heun Scheme;
            
       end for
      4. Generate backward pass W~tnsubscriptsuperscript~𝑊𝑛𝑡\widetilde{W}^{n}_{t}over~ start_ARG italic_W end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT based on forward pass Wtnsuperscriptsubscript𝑊𝑡𝑛W_{t}^{n}italic_W start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT;
       5. Calculate the model prediction output O(Φ0,Tn(𝒔0,𝜽0))𝑂subscriptsuperscriptΦ𝑛0𝑇subscript𝒔0subscript𝜽0O(\Phi^{n}_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0}))italic_O ( roman_Φ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) );
      
      for tS𝑡𝑆t\leftarrow Sitalic_t ← italic_S to 1111 do
             6. Based on W~tnsubscriptsuperscript~𝑊𝑛𝑡\widetilde{W}^{n}_{t}over~ start_ARG italic_W end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, generate ψ(t1)Δt,Tn(Φ0,Tn(𝒔0,𝜽0))subscriptsuperscript𝜓𝑛𝑡1Δ𝑡𝑇superscriptsubscriptΦ0𝑇𝑛subscript𝒔0subscript𝜽0\psi^{n}_{(t-1)\Delta t,T}(\Phi_{0,T}^{n}(\boldsymbol{s}_{0},\bm{\theta}_{0}))italic_ψ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( italic_t - 1 ) roman_Δ italic_t , italic_T end_POSTSUBSCRIPT ( roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) from Equation (6) with Euler-Heun Scheme;
             7. Based on ψ(t1)Δt,Tn(Φ0,Tn(𝒔0,𝜽0))subscriptsuperscript𝜓𝑛𝑡1Δ𝑡𝑇superscriptsubscriptΦ0𝑇𝑛subscript𝒔0subscript𝜽0\psi^{n}_{(t-1)\Delta t,T}(\Phi_{0,T}^{n}(\boldsymbol{s}_{0},\bm{\theta}_{0}))italic_ψ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( italic_t - 1 ) roman_Δ italic_t , italic_T end_POSTSUBSCRIPT ( roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ), generate A~(t1)Δt,Tn(Φ0,Tn(𝒔0,𝜽0))subscriptsuperscript~𝐴𝑛𝑡1Δ𝑡𝑇superscriptsubscriptΦ0𝑇𝑛subscript𝒔0subscript𝜽0\widetilde{A}^{n}_{(t-1)\Delta t,T}(\Phi_{0,T}^{n}(\boldsymbol{s}_{0},\bm{% \theta}_{0}))over~ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( italic_t - 1 ) roman_Δ italic_t , italic_T end_POSTSUBSCRIPT ( roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ) from Equation (20) with Euler-Heun Scheme, (𝒔T)=2|O(𝒔)O(𝒔c)|subscript𝒔𝑇2𝑂𝒔𝑂superscript𝒔𝑐\triangledown\mathcal{L}(\boldsymbol{s}_{T})=2|O(\boldsymbol{s})-O(\boldsymbol% {s}^{c})|▽ caligraphic_L ( bold_italic_s start_POSTSUBSCRIPT italic_T end_POSTSUBSCRIPT ) = 2 | italic_O ( bold_italic_s ) - italic_O ( bold_italic_s start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ) |;
       end for
      
end for
8. Calculate E[((O(𝒔)O(𝒔c))2𝒔0,(O(𝒔)O(𝒔c))2𝜽0)|(𝒔0,𝜽0)]=1Nn=1NA~0,Tn(Φ0,Tn(𝒔0,𝜽0))Edelimited-[]conditionalsuperscriptsuperscript𝑂𝒔𝑂superscript𝒔𝑐2subscript𝒔0superscript𝑂𝒔𝑂superscript𝒔𝑐2subscript𝜽0topsubscript𝒔0subscript𝜽01𝑁superscriptsubscript𝑛1𝑁subscriptsuperscript~𝐴𝑛0𝑇superscriptsubscriptΦ0𝑇𝑛subscript𝒔0subscript𝜽0\mbox{E}\left[\left.\left(\frac{\partial(O(\boldsymbol{s})-O(\boldsymbol{s}^{c% }))^{2}}{\partial\boldsymbol{s}_{0}},\frac{\partial(O(\boldsymbol{s})-O(% \boldsymbol{s}^{c}))^{2}}{\partial\bm{\theta}_{0}}\right)^{\top}\right|(% \boldsymbol{s}_{0},\bm{\theta}_{0})\right]=\frac{1}{N}\sum\limits_{n=1}^{N}% \widetilde{A}^{n}_{0,T}(\Phi_{0,T}^{n}(\boldsymbol{s}_{0},\bm{\theta}_{0}))E [ ( divide start_ARG ∂ ( italic_O ( bold_italic_s ) - italic_O ( bold_italic_s start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∂ bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG , divide start_ARG ∂ ( italic_O ( bold_italic_s ) - italic_O ( bold_italic_s start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT ) ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∂ bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT | ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ] = divide start_ARG 1 end_ARG start_ARG italic_N end_ARG ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT over~ start_ARG italic_A end_ARG start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( roman_Φ start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) )
Algorithm 1 Adjoint Sensitivity Analysis on SDEs based on the MSE of Output Prediction.

From Theorem 2, as the grid or time size Δt0Δ𝑡0\Delta t\to 0roman_Δ italic_t → 0, the output of Algorithm 1 will converge pathwise (i.e., almost surely) from any fixed starting point to true local sensitivity. This sensitivity analysis, accounting for the interdependency of multi-scale mechanistic model, accelerates the search for the optimal parameters 𝜽𝜽\boldsymbol{\theta}bold_italic_θ that minimizes the model prediction’s MSE.

Theorem 2 (Li et al. (2020)).

Suppose the schemes Fhsubscript𝐹F_{h}italic_F start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT and Ghsubscript𝐺G_{h}italic_G start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT, with h=TL𝑇𝐿h=\frac{T}{L}italic_h = divide start_ARG italic_T end_ARG start_ARG italic_L end_ARG denoting the size of the grid for solvers, satisfy the following conditions:

  • Fh(𝒔t,𝜽t,W)F(𝒔t,𝜽t,W)subscript𝐹subscript𝒔𝑡subscript𝜽𝑡𝑊𝐹subscript𝒔𝑡subscript𝜽𝑡𝑊F_{h}(\boldsymbol{s}_{t},\boldsymbol{\theta}_{t},W)\to F(\boldsymbol{s}_{t},% \boldsymbol{\theta}_{t},W)italic_F start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_W ) → italic_F ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_W ), Gh(𝒔t,𝜽t,W)G(𝒔t,𝜽t,W)subscript𝐺subscript𝒔𝑡subscript𝜽𝑡𝑊𝐺subscript𝒔𝑡subscript𝜽𝑡𝑊G_{h}(\boldsymbol{s}_{t},\boldsymbol{\theta}_{t},W)\to G(\boldsymbol{s}_{t},% \boldsymbol{\theta}_{t},W)italic_G start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_W ) → italic_G ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_W ) as h00h\to 0italic_h → 0;

  • For M>0for-all𝑀0\forall M>0∀ italic_M > 0, sup𝒔t;𝜽tM|Fh(𝒔t,𝜽t,W)F(𝒔t,𝜽t,W)|0subscriptsupremumsubscript𝒔𝑡subscript𝜽𝑡𝑀subscript𝐹subscript𝒔𝑡subscript𝜽𝑡𝑊𝐹subscript𝒔𝑡subscript𝜽𝑡𝑊0\sup_{\boldsymbol{s}_{t};\boldsymbol{\theta}_{t}\leq M}|F_{h}(\boldsymbol{s}_{% t},\boldsymbol{\theta}_{t},W)-F(\boldsymbol{s}_{t},\boldsymbol{\theta}_{t},W)|\to 0roman_sup start_POSTSUBSCRIPT bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ; bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ≤ italic_M end_POSTSUBSCRIPT | italic_F start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_W ) - italic_F ( bold_italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_W ) | → 0 as h00h\to 0italic_h → 0.

Then for any starting point 𝐬0subscript𝐬0\boldsymbol{s}_{0}bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and 𝛉0subscript𝛉0\boldsymbol{\theta}_{0}bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, we have,

Fh(Gh(𝒔0,𝜽0,W),W)F(G(𝒔0,𝜽0,W),W)=A0,T(𝒔0,𝜽0),𝒔0d.formulae-sequencesubscript𝐹subscript𝐺subscript𝒔0subscript𝜽0𝑊𝑊𝐹𝐺subscript𝒔0subscript𝜽0𝑊𝑊subscript𝐴0𝑇subscript𝒔0subscript𝜽0for-allsubscript𝒔0superscript𝑑F_{h}(G_{h}(\boldsymbol{s}_{0},\bm{\theta}_{0},W),W)\to F(G(\boldsymbol{s}_{0}% ,\bm{\theta}_{0},W),W)=A_{0,T}(\boldsymbol{s}_{0},\bm{\theta}_{0}),\forall% \boldsymbol{s}_{0}\in\mathbb{R}^{d}.italic_F start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( italic_G start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_W ) , italic_W ) → italic_F ( italic_G ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_W ) , italic_W ) = italic_A start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , bold_italic_θ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) , ∀ bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT . (21)

5 Empirical Study

In this section, we conduct the empirical study by using the In-Vitro Transcription (IVT) system example to validate our SA algorithm. The bioprocess model from Wang et al. (2024) with the true model parameters denoted by 𝜽csuperscript𝜽𝑐\bm{\theta}^{c}bold_italic_θ start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT is used to assess the performance of proposed adjoint SA algorithm. We start the section by providing an in-depth interpretation of the sensitivity analysis results obtained from Algorithm 1. We then assess the efficacy of our approach by showing the decrease rate in the simulation output prediction MSE over multiple search iterations of model parameters.

The IVT process is mainly composed of four sub-processes: (1) initiation; (2) elongation; (3) termination; and (4) degradation of generated RNA molecule product. The molecular reaction network is illustrated in Figure 1. One insightful interpretation of the sensitivity analysis results is to investigate the relative importance of each subprocess, which is inferred from the sensitivity of their corresponding parameters. Aligned with this reasoning, Figure 1 presents the relative importance of each subprocess within the IVT system. Notably, the result underscores the pivotal role of elongation reactions in the IVT process regardless of termination time T𝑇Titalic_T. Furthermore, it is observed that the impact of degradation increases over time, which fits our expectation since the synthesis rate of mRNA transcripts decreases as the IVT process approaches completion. Concurrently, the accumulation of generated RNA transcripts leads to an increase in the degradation rate at the system level. Therefore, the relevance of parameters associated with the degradation model becomes increasingly significant over time. Conversely, the influence of the initiation process appears to diminish over time as the raw materials (i.e., NTPs) used to synthesize RNA products are consumed.

Refer to caption
((a))
Refer to caption
((b))
Figure 1: Interaction between subprocesses of the IVT system and their relative importance. (a) The reaction network for the IVT process (Wang et al., 2024). (b) Relative importance of the IVT system subprocesses parameter estimation on the output predictions at different times based on SA results.

To further assess the performance of the proposed adjoint SA, we benchmark our approach with a state-of-art gradient estimation approach (Fu, 2015) that estimates the gradient by using finite difference (FD) simulation estimator, i.e.,

O(Φ0,Tn(𝒔0,𝜽^+ck𝒆k))O(Φ0,Tn(𝒔0,𝜽^ck𝒆k))2ck𝑂subscriptsuperscriptΦ𝑛0𝑇subscript𝒔0^𝜽subscript𝑐𝑘subscript𝒆𝑘𝑂subscriptsuperscriptΦ𝑛0𝑇subscript𝒔0^𝜽subscript𝑐𝑘subscript𝒆𝑘2subscript𝑐𝑘\frac{O(\Phi^{n}_{0,T}(\boldsymbol{s}_{0},\hat{\bm{\theta}}+c_{k}\boldsymbol{e% }_{k}))-O(\Phi^{n}_{0,T}(\boldsymbol{s}_{0},\hat{\bm{\theta}}-c_{k}\boldsymbol% {e}_{k}))}{2c_{k}}divide start_ARG italic_O ( roman_Φ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , over^ start_ARG bold_italic_θ end_ARG + italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ) - italic_O ( roman_Φ start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 0 , italic_T end_POSTSUBSCRIPT ( bold_italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , over^ start_ARG bold_italic_θ end_ARG - italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT bold_italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) ) end_ARG start_ARG 2 italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_ARG

for the n𝑛nitalic_n-th iteration of parameter search, where cksubscript𝑐𝑘c_{k}italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is the perturbation and 𝒆ksubscript𝒆𝑘\boldsymbol{e}_{k}bold_italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT denotes a unit vector with the element in k𝑘kitalic_k-th dimension equal to 1 and all remaining elements equal to 0. In our case, we define ck=p×𝜽^ksubscript𝑐𝑘𝑝subscript^𝜽𝑘c_{k}=p\times\hat{\bm{\theta}}_{k}italic_c start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_p × over^ start_ARG bold_italic_θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT where p𝑝pitalic_p is a percentage selected to be 5%percent55\%5 %, 10%percent1010\%10 %, and 20%percent2020\%20 %.

Approach Observed Adjoint sensitivity Finite difference (p=5%percent55\%5 %) Finite difference (p=10%percent1010\%10 %) Finite difference (p=20%percent2020\%20 %)
Yield (μM𝜇𝑀\mu Mitalic_μ italic_M) 73.48±0.28plus-or-minus73.480.2873.48\pm 0.2873.48 ± 0.28 73.05±0.27plus-or-minus73.050.2773.05\pm 0.2773.05 ± 0.27 72.66±0.32plus-or-minus72.660.3272.66\pm 0.3272.66 ± 0.32 74.37±0.39plus-or-minus74.370.3974.37\pm 0.3974.37 ± 0.39 74.51±0.31plus-or-minus74.510.3174.51\pm 0.3174.51 ± 0.31
Table 1: Final RNA yield and its 95%percent\%% confidence interval with 𝜽^^𝜽\hat{\bm{\theta}}over^ start_ARG bold_italic_θ end_ARG calibrated with different approaches.
Refer to caption
((a))
Refer to caption
((b))
Figure 2: The results of IVT output prediction MSE and model parameter estimation relative error obtained by using adjoint SA and finite difference approaches.

Figure 2 illustrates the convergence of the Mean Squared Error (MSE) of IVT process output prediction over the iterations. It’s evident that as the number of iterations increases, the adjoint SA approach surpasses the finite difference gradient estimation approach. Nonetheless, same as the finite difference estimator, the convergence speed gradually decreases, and the lower bound shows no further improvement once it reaches approximately 0.4(g/L)20.4superscript𝑔𝐿20.4(g/L)^{2}0.4 ( italic_g / italic_L ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, which could be associated with a fixed step size, i.e., α=0.01𝛼0.01\alpha=0.01italic_α = 0.01, used during the gradient search process. This slowdown could also be attributed to our MSE objective measure that solely relies on the final RNA yield. This discrete data provide very limited information about the interactions within the complex reaction network. We also show the specific RNA yield prediction by using the model with parameters 𝜽^^𝜽\hat{\bm{\theta}}over^ start_ARG bold_italic_θ end_ARG estimated with different approaches in Table 1. In contrast, the final RNA yield and its 95%percent9595\%95 % confidence interval (CI) obtained from the system with true parameters 𝜽csuperscript𝜽𝑐\bm{\theta}^{c}bold_italic_θ start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT are 73.48±0.28plus-or-minus73.480.2873.48\pm 0.2873.48 ± 0.28. There is an overlap between the 95% confidence interval of the adjoint SA approach’s predictions and the observed RNA yield, indicating no statistically significant difference between them. In contrast, the finite difference approach shows a significant prediction discrepancy from the observed values, demonstrating that the adjoint SA approach outperforms the finite difference method.

Furthermore, we present the results of the expected relative error between 𝜽^^𝜽\hat{\bm{\theta}}over^ start_ARG bold_italic_θ end_ARG and 𝜽csuperscript𝜽𝑐\bm{\theta}^{c}bold_italic_θ start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT, i.e., E[|𝜽^𝜽c𝜽c|]Edelimited-[]^𝜽superscript𝜽𝑐superscript𝜽𝑐\mbox{E}\big{[}|\frac{\hat{\bm{\theta}}-\bm{\theta}^{c}}{\bm{\theta}^{c}}|\big% {]}E [ | divide start_ARG over^ start_ARG bold_italic_θ end_ARG - bold_italic_θ start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT end_ARG start_ARG bold_italic_θ start_POSTSUPERSCRIPT italic_c end_POSTSUPERSCRIPT end_ARG | ], in Figure 2. Our approach consistently outperforms the finite difference estimator, as the expected relative error diminishes more significantly after 50 iterations compared with the finite difference estimator regardless of the p𝑝pitalic_p value chosen. However, similar to the MSE results, the lower bound of the relative difference stagnates at approximately 15%percent1515\%15 %, which could be attributed to the same limitation mentioned above.

6 CONCLUSION

In this paper, we present an adjoint sensitivity analysis approach designed to expedite the search for stochastic reaction network model parameters . We first formulate the multi-scale bioprocess stochastic reaction network in the form of SDEs. Then, we develop an adjoint SA algorithm on SDEs for computing local sensitivities of model parameters and validate the convergence of the algorithm. Our empirical study underscores the importance of model parameters of enzymatic stochastic reaction networks and provides empirical evidence of the efficacy of our approach. Moving forward, we aim to leverage this capability by refining the loss function. Rather than solely relying on the Mean Squared Error (MSE) of the final output, we intend to integrate additional bioprocess states and consider the disparities between simulated and real trajectories, which promises to enhance the effectiveness and accuracy of our method.

References

  • Filobello-Nino et al. (2019) Uriel A. Filobello-Nino, Hector Vazquez-Leal, Agustin Herrera-May, ROBERTO Ambrosio, Roberto Castañeda-Sheissa, V.M. Jimenez-Fernandez, Mario Sandoval-Hernandez, and Ana Contreras-Hernandez. A handy, accurate, invertible and integrable expression for dawson’s function. 29:18, 09 2019. doi: 10.15174/au.2019.2124.
  • Fu (2015) Michael C. Fu. Stochastic Gradient Estimation, pages 105–147. Springer New York, New York, NY, 2015. ISBN 978-1-4939-1384-8. doi: 10.1007/978-1-4939-1384-8_5. URL https://doi.org/10.1007/978-1-4939-1384-8_5.
  • Golightly and Wilkinson (2005) Andrew Golightly and Darren J Wilkinson. Bayesian inference for stochastic kinetic models using a diffusion approximation. Biometrics, 61(3):781–788, 2005.
  • Hall (1979) Richard L. Hall. Inverse moments for a class of truncated normal distributions. Sankhyā: The Indian Journal of Statistics, Series B (1960-2002), 41(1/2):66–76, 1979. ISSN 05815738. URL http://www.jstor.org/stable/25052135.
  • John Bailer (2001) A. John Bailer. Probabilistic techniques in exposure assessment. a handbook for dealing with variability and uncertainty in models and inputs. a. c. cullen and h. c. frey, plenum press, new york and london, 1999. no. of pages: ix + 335. price: $99.50. isbn 0-306-45956-6. Statistics in Medicine, 20(14):2211–2213, 2001. doi: https://doi.org/10.1002/sim.958. URL https://onlinelibrary.wiley.com/doi/abs/10.1002/sim.958.
  • Kedem (1980) Gershon Kedem. Automatic differentiation of computer programs. ACM Trans. Math. Softw., 6(2):150–165, jun 1980. ISSN 0098-3500. doi: 10.1145/355887.355890. URL https://doi.org/10.1145/355887.355890.
  • Kramer et al. (1984) Mark A. Kramer, Herschel Rabitz, Joseph M. Calo, and Robert J. Kee. Sensitivity analysis in chemical kinetics: Recent developments and computational comparisons. International Journal of Chemical Kinetics, 16(5):559–578, 1984. doi: https://doi.org/10.1002/kin.550160506. URL https://onlinelibrary.wiley.com/doi/abs/10.1002/kin.550160506.
  • Kunita (2019) H. Kunita. Stochastic Flows and Jump-diffusions. Probability theory and stochastic modelling. Springer, 2019. ISBN 9789811338021. URL https://books.google.com/books?id=_hftxgEACAAJ.
  • Kyriakopoulos et al. (2018) Sarantos Kyriakopoulos, Kok Siong Ang, Meiyappan Lakshmanan, Zhuangrong Huang, Seongkyu Yoon, Rudiyanto Gunawan, and Dong-Yup Lee. Kinetic modeling of mammalian cell culture bioprocessing: The quest to advance biomanufacturing. Biotechnology Journal, 13(3):1700229, 2018. doi: https://doi.org/10.1002/biot.201700229. URL https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/abs/10.1002/biot.201700229.
  • Li et al. (2020) Xuechen Li, Ting-Kam Leonard Wong, Ricky T. Q. Chen, and David Kristjanson Duvenaud. Scalable gradients for stochastic differential equations. ArXiv, abs/2001.01328, 2020. URL https://api.semanticscholar.org/CorpusID:209862121.
  • Michaelis and Menten (2007) Leonor Michaelis and Maud Leonora Menten. Die kinetik der invertinwirkung. Biochemische Zeitschrift, 49:333 – 369, 2007.
  • Sobol (2001) Ilya M. Sobol. Global sensitivity indices for nonlinear mathematical models and their monte carlo estimates. 2001. URL https://api.semanticscholar.org/CorpusID:202584415.
  • Wang et al. (2024) Keqi Wang, Wei Xie, and Hua Zheng. Stochastic molecular reaction queueing network modeling for in vitro transcription process. In Proceedings of the Winter Simulation Conference, WSC ’23, page 1900–1911. IEEE Press, 2024. ISBN 9798350369663.
  • Xie et al. (2022) Wei Xie, Bo Wang, Cheng Li, Dongming Xie, and Jared Auclair. Interpretable biomanufacturing process risk and sensitivity analyses for quality-by-design and stability control. Naval Research Logistics (NRL), 69(3):461–483, 2022. doi: https://doi.org/10.1002/nav.22019. URL https://onlinelibrary.wiley.com/doi/abs/10.1002/nav.22019.
  • Zi (2011) Zhike Zi. Sensitivity analysis approaches applied to systems biology models. IET systems biology, 5 6:336–6, 2011. URL https://api.semanticscholar.org/CorpusID:473778.