A Two-stage Identification Method for Switched Linear Systems

Zheng Wenju [email protected]    Ye Hao [email protected] Department of Automation, Tsinghua University, Bei**g
Abstract

In this work, a new two-stage identification method based on dynamic programming and sparsity inducing is proposed for switched linear systems. Our method achieves sparsity inducing in the identification of switched linear systems by the constrained switching mechanism, in contrast to previous optimization-based identification techniques that rely on the rigid data distribution assumption in the parameter space. The proposed mechanism assumes the existence of a minimal interval between adjacent switching instants. First, an efficient iterative dynamic programming approach is used to determine the switching instants and segments using the constrained switching mechanism. Then, each submodel is identified as a combinatorial 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT optimization problem, and the true parameter for each submodel is determined by solving the problem. The problem of combinatorial 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT optimization is solved by relaxing it into a convex 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm optimization problem. Furthermore, the unbiasedness of the switched linear system identification is discussed thoroughly with the constrained switching mechanism and a new persistent excitation condition is proposed. Simulation experiments are conducted to indicate that our algorithms exhibit strong robustness against noise.

keywords:
Switched linear system, System Identification, Sparsity Optimization, Persistent Excitation Condition
thanks: This paper was not presented at any IFAC meeting.

,

1 Introduction

Identification of switched linear systems has attracted significant interest in recent years and has been widely applied to real-world problems [1], such as computer vision, benchmark detection, system biology, electromechanical systems, etc. This interest stems from the fact that the complex inherently nonlinear systems and phenomenon can be effectively modeled as a combination of a finite number of linear systems [2]. Generally speaking, a switched linear system which comprises a finite number of linear subsystems and a switched mechanism determining the active subsystem at each time instant, can be parameterized and formulated as [3]:

yksubscript𝑦𝑘\displaystyle y_{k}italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT =θλtxk+ekabsentsuperscriptsubscript𝜃subscript𝜆𝑡topsubscript𝑥𝑘subscript𝑒𝑘\displaystyle=\theta_{\lambda_{t}}^{\top}x_{k}+e_{k}= italic_θ start_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT (1a)
xksubscript𝑥𝑘\displaystyle x_{k}italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT =[yk1,,ykna,uk1,,uknb]absentlimit-fromsubscript𝑦𝑘1subscript𝑦𝑘subscript𝑛𝑎limit-from𝑢subscripttop𝑘1limit-from𝑢subscripttop𝑘subscript𝑛𝑏top\displaystyle=\left[y_{k-1},\cdots,y_{k-n_{a}},u\top_{k-1},\cdots,u\top_{k-n_{% b}}\right]\top= [ italic_y start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT , ⋯ , italic_y start_POSTSUBSCRIPT italic_k - italic_n start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_u ⊤ start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT , ⋯ , italic_u ⊤ start_POSTSUBSCRIPT italic_k - italic_n start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] ⊤ (1b)

where xknsubscript𝑥𝑘superscript𝑛x_{k}\in\mathbb{R}^{n}italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is the regressor vector, uksubscript𝑢𝑘u_{k}\in\mathbb{R}italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ blackboard_R and yksubscript𝑦𝑘y_{k}\in\mathbb{R}italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ blackboard_R are designated as the input and the output of the system, nasubscript𝑛𝑎n_{a}italic_n start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT and nbsubscript𝑛𝑏n_{b}italic_n start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT are the orders of the systems and eksubscript𝑒𝑘e_{k}\in\mathbb{R}italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ blackboard_R is an unknown noise. λk{1,,S}subscript𝜆𝑘1𝑆\lambda_{k}\in\{1,\cdots,S\}italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ { 1 , ⋯ , italic_S } is the discrete state which indicates the active subsystem at time k𝑘kitalic_k where S𝑆Sitalic_S is the number of the subsystems and θλkn,n=na+nbformulae-sequencesubscript𝜃subscript𝜆𝑘superscript𝑛𝑛subscript𝑛𝑎subscript𝑛𝑏\theta_{\lambda_{k}}\in\mathbb{R}^{n},n=n_{a}+n_{b}italic_θ start_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , italic_n = italic_n start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT + italic_n start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT is the parameter vector corresponding to the active subsystem at time k𝑘kitalic_k.

In this paper, we focus on the identification problem of switched linear system from a collection of time series input-output data. Besides the estimation of each subsystem, the identification of the switched linear system also need to confirm the discrete state which is generated by the switched mechanism and indicates the active subsystem at each time. Therefore, the identification of the switched linear system is essentially a mixed integer programming problem. Moreover, the identification of the number of the switching segments and submodels remains a critical and open issue [1].

1.1 Prior works on the identification of switched linear systems

There have been a good deal of literature on the identification of switched linear systems. In recent years, data-driven methods are widely used to the identification of switched linear systems, such as model selection techniques based on statistical learning [4, 1], regression trees [5], self-adaptative multi-kernel algorithm [6], semi-supervised learning approach [7] etc. However, these methods lack of convergence analysis. The algebraic recursive methods are proposed in literature [8, 9, 10, 11, 12, 13] by transforming a switched linear system into a lifted linear model independent of the switching sequence. The optimization based methods address the identification problem of switched linear systems through formulations involving sum-of-norms regularization [14], mixed-integer programming [15], sparse optimization [3], and so forth. Sparse optimization has been widely used in inverse problems and signal and image processing problems [16] in recent years and the identification algorithm for switched linear systems based on sparse optimization is proposed in [3], leveraging a specific combinatorial 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm optimization problem, where the optimal solution is related to the subsystem parameters. This decouples the identification of system parameters and the determination of the discrete states in the switched linear system. The unbiasedness of the identification in the algebraic and optimization-based methods mentioned above relies on the strict assumption of data distribution in the parameter space, as follows.

Assumption 1.

(Assumption 6 in [3]) There is no data pair (xk,yk)subscript𝑥𝑘subscript𝑦𝑘(x_{k},y_{k})( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) that fits two different submodels of the switched linear system (1), i.e., yk=θixk+ek=θjxk+eki=j.subscript𝑦𝑘superscriptsubscript𝜃𝑖topsubscript𝑥𝑘subscript𝑒𝑘superscriptsubscript𝜃𝑗topsubscript𝑥𝑘subscript𝑒𝑘𝑖𝑗y_{k}=\theta_{i}^{\top}x_{k}+e_{k}=\theta_{j}^{\top}x_{k}+e_{k}\Rightarrow i=j.italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_θ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT + italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ⇒ italic_i = italic_j .

Assumption 1 requires that all regressor vectors {xk}subscript𝑥𝑘\{x_{k}\}{ italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } cannot lie on the intersection of the planes formed by the parameters of each subsystem. Based on Assumption 1, the persistent excitation condition for the identification of the switched linear systems are proposed in [17] in the absence of noise. However, Assumption 1 cannot be guaranteed in practical applications, especially in the presence of noise.

1.2 Contributions of this paper

In this paper, we address the identification of switched linear systems based on a constrained switching mechanism, which assumes the existence of a minimal interval between adjacent switching instants. Considering the identification of the switching instants, we propose a two-stage identification framework for switched linear systems, departing from the assumption of the distribution of data in parameter space, as described in Assumption 1.

Firstly, inspired by the qualitative trend analysis [18] and curve fitting [19, 20, 21], an efficient iterative dynamic programming algorithm proposed in our previous work [22] is utilized to identify the switching instants under the constrained switching mechanism. Meanwhile, the collected time series data can be partitioned into different time segments according to the estimated switching instants. Subsequently, in the second stage inspired by the sparse optimization identification algorithm [3], the parameters of each submodel can be identified by solving a specific combinatorial 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm optimization problem based on the segmented data. The combinatorial 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm optimization problem is solved by relaxing it into a convex 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm optimization problem. Furthermore, the unbiasedness of the proposed two-stage algorithm is guaranteed under the assumption of the constrained switching mechanism and a new sufficient condition based on the constrained switched mechanism for the persistent excitation condition of switched linear systems is proposed as a comparison of that in review [17] based on Assumption 1.

1.3 Outline of this paper

This paper is organized as follows. In the next section, the identification problem of the switched linear system is presented, and the conceptions of the sparse optimization are briefly discussed. Later in Section III, a two-stage identification framework is proposed based on both dynamic programming and sparse optimization. The unbiasedness of the identification algorithms based on 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm optimization and 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm optimization are detailed discussed. In Section IV, a new persistent excitation condition for switched linear systems is proposed. Experimental results based on simulation and real data of high speed trains are presented in Section V. Finally, brief conclusions are drawn in Section VI.

2 Preliminaries and problem formulation

In this section, we introduce the basic conceptions and notations of the sparsity presentation, which will be extensively used throughout the rest of the paper. The identification problem of the switched linear system will be also formulated.

2.1 Sparse Optimization

Recent years, sparse optimization has been widely used in inverse problems and signal and image processing problems [16]. The general optimization problem with the equations of linear systems can be formulated as

(PJ):minzJ(z)subject to Az=b:subscript𝑃𝐽subscript𝑧𝐽𝑧subject to 𝐴𝑧𝑏(P_{J}):\min_{z}J(z)~{}\text{subject to }Az=b( italic_P start_POSTSUBSCRIPT italic_J end_POSTSUBSCRIPT ) : roman_min start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT italic_J ( italic_z ) subject to italic_A italic_z = italic_b (2)

where A𝐴Aitalic_A is always given as a full-rank matrix and the objective function J()𝐽J(\cdot)italic_J ( ⋅ ) is selected to guarantee a unique solution. In this section, we primarily introduce the most commonly used sparse optimization criterion, 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT norm and 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT norm. The problem with J(z)=z0𝐽𝑧subscriptdelimited-∥∥𝑧0J(z)=\lVert z\rVert_{0}italic_J ( italic_z ) = ∥ italic_z ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, representing the nonzero entries of the vector z𝑧zitalic_z, is denoted as the 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm optimization problem, P0subscript𝑃0P_{0}italic_P start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, as well as the 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm optimization problem, P1subscript𝑃1P_{1}italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, with the objective function J(z)=z1𝐽𝑧subscriptdelimited-∥∥𝑧1J(z)=\lVert z\rVert_{1}italic_J ( italic_z ) = ∥ italic_z ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, representing the sum of the absolute values of the elements in the vector z𝑧zitalic_z.

There are some measures of how sparse the columns of the given matrix are and here we mainly introduce the spark𝑠𝑝𝑎𝑟𝑘sparkitalic_s italic_p italic_a italic_r italic_k, mutual coherence and k𝑘kitalic_k-genericity index [3, 16] for the convenience of the uniqueness of sparse solutions as well as the unbiasedness of the identification which will be discussed in the later Section III. The definition of spark𝑠𝑝𝑎𝑟𝑘sparkitalic_s italic_p italic_a italic_r italic_k is written as:

Definition 1.

The spark of a given matrix A𝐴Aitalic_A is the smallest number of columns from A𝐴Aitalic_A that are linearly dependent, written as spark(A)𝑠𝑝𝑎𝑟𝑘𝐴spark(A)italic_s italic_p italic_a italic_r italic_k ( italic_A ).

In fact, the spark𝑠𝑝𝑎𝑟𝑘sparkitalic_s italic_p italic_a italic_r italic_k of the given matrix A𝐴Aitalic_A can be formulated as [3]

spark(A)=minzker(A),z0z0spark(A)=\min_{z\in\ker(A),z\neq 0}\lVert z\rVert_{0}italic_s italic_p italic_a italic_r italic_k ( italic_A ) = roman_min start_POSTSUBSCRIPT italic_z ∈ roman_ker ( italic_A ) , italic_z ≠ 0 end_POSTSUBSCRIPT ∥ italic_z ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT (3)

where ker(A)kernel𝐴\ker(A)roman_ker ( italic_A ) represents the kernel space or the null space of the matrix A𝐴Aitalic_A and z0subscriptdelimited-∥∥𝑧0\lVert z\rVert_{0}∥ italic_z ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT represents the 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm of the vector z𝑧zitalic_z which is inherently the number of the nonzero entries of the vector z𝑧zitalic_z. There is a crucial theorem about spark𝑠𝑝𝑎𝑟𝑘sparkitalic_s italic_p italic_a italic_r italic_k as follows:

Theorem 1.

(See Theorem 2 in [16]) If a system of linear equations Az=b𝐴𝑧𝑏Az=bitalic_A italic_z = italic_b with full row rank matrix A𝐴Aitalic_A has a solution z𝑧zitalic_z obeying z0<spark(A)/2subscriptdelimited-∥∥𝑧0𝑠𝑝𝑎𝑟𝑘𝐴2\lVert z\rVert_{0}<{spark(A)}/{2}∥ italic_z ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < italic_s italic_p italic_a italic_r italic_k ( italic_A ) / 2, this solution is necessarily the sparsest possible.

The spark𝑠𝑝𝑎𝑟𝑘sparkitalic_s italic_p italic_a italic_r italic_k gives a simple criterion for uniqueness of sparse solutions and this criterion forms the basis of many results in the field of the sparse optimization. A simpler way to guarantee the uniqueness is to exploit the mutual coherence of the given matrix A𝐴Aitalic_A, defined as follows.

Definition 2.

The mutual coherence of a given matrix A𝐴Aitalic_A is the largest absolute normalized inner product between different columns from A𝐴Aitalic_A. Denoting the k𝑘kitalic_kth column in A𝐴Aitalic_A by aksubscript𝑎𝑘a_{k}italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, the mutual coherence can be formulated as

μ(A)=max1i,jm,ij|aiaj|ai2aj2.𝜇𝐴subscriptformulae-sequence1𝑖formulae-sequence𝑗𝑚𝑖𝑗superscriptsubscript𝑎𝑖topsubscript𝑎𝑗subscriptnormsubscript𝑎𝑖2subscriptnormsubscript𝑎𝑗2\mu(A)=\max_{{1\leq i,j\leq m},~{}{i\neq j}}\frac{|a_{i}^{\top}a_{j}|}{\|a_{i}% \|_{2}\|a_{j}\|_{2}}.italic_μ ( italic_A ) = roman_max start_POSTSUBSCRIPT 1 ≤ italic_i , italic_j ≤ italic_m , italic_i ≠ italic_j end_POSTSUBSCRIPT divide start_ARG | italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT | end_ARG start_ARG ∥ italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ∥ italic_a start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG . (4)

By introducing the definition of mutual coherence, an analogue of Theorem 1 can be formulated as follows:

Theorem 2.

(See Theorem 5 and Theorem 7 in [16]) If Az=b𝐴𝑧𝑏Az=bitalic_A italic_z = italic_b, where A𝐴Aitalic_A is a matrix with full row rank, and there exists a solution z𝑧zitalic_z obeying

z0<12(1+1μ(A))subscriptnorm𝑧01211𝜇𝐴\|z\|_{0}<\frac{1}{2}(1+\frac{1}{\mu(A)})∥ italic_z ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( 1 + divide start_ARG 1 end_ARG start_ARG italic_μ ( italic_A ) end_ARG ) (5)

then this solution is necessarily the sparsest possible. Moreover, it represents the unique solution to the sparse problem (2) where J(z)=z0𝐽𝑧subscriptdelimited-∥∥𝑧0J(z)=\lVert z\rVert_{0}italic_J ( italic_z ) = ∥ italic_z ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT as well as when J(z)=z1𝐽𝑧subscriptdelimited-∥∥𝑧1J(z)=\lVert z\rVert_{1}italic_J ( italic_z ) = ∥ italic_z ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT.

The k𝑘kitalic_k-genericity index of the given matrix A𝐴Aitalic_A, denoted as vk(A)subscript𝑣𝑘𝐴v_{k}(A)italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_A ), is another way to measure the linear independence of the columns of A𝐴Aitalic_A and is first introduced in [3].

Definition 3.

(See Definition 1 in [3]) For a given matrix An×N,nNformulae-sequence𝐴superscript𝑛𝑁𝑛𝑁A\in\mathbb{R}^{n\times N},~{}n\leq Nitalic_A ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_N end_POSTSUPERSCRIPT , italic_n ≤ italic_N with the k𝑘kitalic_kth column denoted as aksubscript𝑎𝑘a_{k}italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, the k𝑘kitalic_k-genericity index, vk(A)subscript𝑣𝑘𝐴v_{k}(A)italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_A ) is the minimum integer m𝑚mitalic_m such that any n×m𝑛𝑚n\times mitalic_n × italic_m submatrix of A𝐴Aitalic_A has rank k𝑘kitalic_k:

vk(A)=subscript𝑣𝑘𝐴absent\displaystyle v_{k}(A)=italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_A ) = min{m:(t1,,tm) with titj for ij,\displaystyle\min\left\{m:\forall(t_{1},\cdots,t_{m})\text{ with }t_{i}\neq t_% {j}\text{ for }i\neq j,\right.roman_min { italic_m : ∀ ( italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) with italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≠ italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT for italic_i ≠ italic_j , (6)
rank[at1,,atm]=k}.\displaystyle rank[a_{t_{1}},\cdots,a_{t_{m}}]=k\left.\right\}.italic_r italic_a italic_n italic_k [ italic_a start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , ⋯ , italic_a start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT ] = italic_k } .

Moreover, if k>rank(A)𝑘𝑟𝑎𝑛𝑘𝐴k>rank(A)italic_k > italic_r italic_a italic_n italic_k ( italic_A ), vk(A)=+subscript𝑣𝑘𝐴v_{k}(A)=+\inftyitalic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ( italic_A ) = + ∞ and if k=0𝑘0k=0italic_k = 0, v0(A)=0subscript𝑣0𝐴0v_{0}(A)=0italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_A ) = 0.

2.2 Identification Problem

As mentioned in Section I, the identification of switched linear system (1) involves not only identifying the parameters of each subsystem, {θi}i=1Ssuperscriptsubscriptsubscript𝜃𝑖𝑖1𝑆\{\theta_{i}\}_{i=1}^{S}{ italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT, but also determining the structure of the systems as well as the number of the subsystems, S𝑆Sitalic_S, and the switching mechanism, which refers to the integer discrete states, {λk}k=1Nsuperscriptsubscriptsubscript𝜆𝑘𝑘1𝑁\{\lambda_{k}\}_{k=1}^{N}{ italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT.

Remark 1.

The orders of the switched linear system (1), denoted as nasubscript𝑛𝑎n_{a}italic_n start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT and nbsubscript𝑛𝑏n_{b}italic_n start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT, are assumed to be known and equal for all subsystems which is adopted in most methods [5, 23, 7, 3, 10].

Then, the identification problem of the switched linear system (1) can be formulated as follows:

Identification Problem 1.

Given a collection of input-output time series data 𝒟={(uk,yk)k=1N}𝒟superscriptsubscriptsubscript𝑢𝑘subscript𝑦𝑘𝑘1𝑁\mathcal{D}=\{(u_{k},y_{k})_{k=1}^{N}\}caligraphic_D = { ( italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT } generated by the switched linear system (1), estimate the number of the subsystems S𝑆Sitalic_S, the parameters of each subsystem {θi}i=1Ssuperscriptsubscriptsubscript𝜃𝑖𝑖1𝑆\{\theta_{i}\}_{i=1}^{S}{ italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT and the discrete states {λk}k=1Nsuperscriptsubscriptsubscript𝜆𝑘𝑘1𝑁\{\lambda_{k}\}_{k=1}^{N}{ italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT.

With a given nonnegative cost function, l()𝑙l(\cdot)italic_l ( ⋅ ), to measure the fitting errors of the identified model (1), the above identification problem can be written as the following mixed integer programming problem:

minS,{θi}i=1S,{λk}k=1Nsubscript𝑆superscriptsubscriptsubscript𝜃𝑖𝑖1𝑆superscriptsubscriptsubscript𝜆𝑘𝑘1𝑁\displaystyle\min\limits_{S,\{\theta_{i}\}_{i=1}^{S},\{\lambda_{k}\}_{k=1}^{N}}roman_min start_POSTSUBSCRIPT italic_S , { italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT , { italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT end_POSTSUBSCRIPT k=1Nl(ykθλkxk)superscriptsubscript𝑘1𝑁𝑙subscript𝑦𝑘superscriptsubscript𝜃subscript𝜆𝑘topsubscript𝑥𝑘\displaystyle\quad\sum_{k=1}^{N}l(y_{k}-\theta_{\lambda_{k}}^{\top}x_{k})∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_l ( italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_θ start_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) (7)
subj. to λk{1,,S}k=1,,N,formulae-sequencesubscript𝜆𝑘1𝑆for-all𝑘1𝑁\displaystyle\quad\lambda_{k}\in\{1,\cdots,S\}~{}\forall k=1,\cdots,N,italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ { 1 , ⋯ , italic_S } ∀ italic_k = 1 , ⋯ , italic_N ,
S,θin,i=1,,S.formulae-sequence𝑆formulae-sequencesubscript𝜃𝑖superscript𝑛for-all𝑖1𝑆\displaystyle~{}S\in\mathbb{N},~{}\theta_{i}\in\mathbb{R}^{n},~{}\forall i=1,% \cdots,S.italic_S ∈ blackboard_N , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , ∀ italic_i = 1 , ⋯ , italic_S .

With the definition of the regressor vector xksubscript𝑥𝑘x_{k}italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT in (1), we consider the vector of the bias:

ϕ(θ)=[x¯1θ¯x¯Nθ¯]=𝒚Xθ𝒆italic-ϕ𝜃delimited-[]superscriptsubscript¯𝑥1top¯𝜃missing-subexpressionmissing-subexpressionmissing-subexpressionmissing-subexpressionsuperscriptsubscript¯𝑥𝑁top¯𝜃missing-subexpressionmissing-subexpressiontop𝒚𝑋𝜃𝒆\phi(\theta)=\left[\begin{array}[]{ccc}\bar{x}_{1}^{\top}\bar{\theta}\\ \vdots\\ \bar{x}_{N}^{\top}\bar{\theta}\end{array}\right]=\bm{y}-X\top\theta-\bm{e}italic_ϕ ( italic_θ ) = [ start_ARRAY start_ROW start_CELL over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¯ start_ARG italic_θ end_ARG end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW start_ROW start_CELL over¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¯ start_ARG italic_θ end_ARG end_CELL start_CELL end_CELL start_CELL end_CELL end_ROW end_ARRAY ] = bold_italic_y - italic_X ⊤ italic_θ - bold_italic_e (8)

where x¯k=[yk,xk],θ¯k=[1,θ]n+1,k=1,,Nformulae-sequenceformulae-sequencesubscript¯𝑥𝑘superscriptsubscript𝑦𝑘superscriptsubscript𝑥𝑘toptopsubscript¯𝜃𝑘superscript1superscript𝜃toptopsuperscript𝑛1𝑘1𝑁\bar{x}_{k}=[y_{k},-x_{k}^{\top}]^{\top},\bar{\theta}_{k}=[1,\theta^{\top}]^{% \top}\in\mathbb{R}^{n+1},k=1,\cdots,Nover¯ start_ARG italic_x end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = [ italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , - italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , over¯ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = [ 1 , italic_θ start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n + 1 end_POSTSUPERSCRIPT , italic_k = 1 , ⋯ , italic_N, X=[x1,,xN]n×N𝑋subscript𝑥1subscript𝑥𝑁superscript𝑛𝑁X=[x_{1},\cdots,x_{N}]\in\mathbb{R}^{n\times N}italic_X = [ italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_x start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ] ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_N end_POSTSUPERSCRIPT, 𝒚=[y1,,yN]𝒚superscriptsubscript𝑦1subscript𝑦𝑁top\bm{y}=[y_{1},\cdots,y_{N}]^{\top}bold_italic_y = [ italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_y start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT and 𝒆=[e1,,eN]𝒆superscriptsubscript𝑒1subscript𝑒𝑁top\bm{e}=[e_{1},\cdots,e_{N}]^{\top}bold_italic_e = [ italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_e start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT.

Inspired by [3], the linear equation of the identification problem with an orthogonal projection matrix A=INX(XX)1X𝐴subscript𝐼𝑁superscript𝑋topsuperscripttop𝑋𝑋1𝑋A=I_{N}-X^{\top}(X\top X)^{-1}Xitalic_A = italic_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT - italic_X start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( italic_X ⊤ italic_X ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_X (INsubscript𝐼𝑁I_{N}italic_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT represents the N𝑁Nitalic_N identity matrix) can be formulated as follows:

Az=b𝐴𝑧𝑏Az=bitalic_A italic_z = italic_b (9)

where 𝒛=ϕ(θ)𝒛italic-ϕ𝜃\bm{z}=\phi(\theta)bold_italic_z = italic_ϕ ( italic_θ ) and b=A𝒚𝑏𝐴𝒚b=A\bm{y}italic_b = italic_A bold_italic_y. Considering that the matrix AN×N𝐴superscript𝑁𝑁A\in\mathbb{R}^{N\times N}italic_A ∈ blackboard_R start_POSTSUPERSCRIPT italic_N × italic_N end_POSTSUPERSCRIPT can be replaced by the full row rank matrix AX(Nn)×Nsubscript𝐴𝑋superscript𝑁𝑛𝑁A_{X}\in\mathbb{R}^{(N-n)\times N}italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT ( italic_N - italic_n ) × italic_N end_POSTSUPERSCRIPT spanning the orthogonal complement of the column space of X𝑋Xitalic_X. Then the Identification Problem 1 can be regarded as the sparse optimization problem (2) with J(z)=k=1Nl(ykθλkxk)𝐽𝑧superscriptsubscript𝑘1𝑁𝑙subscript𝑦𝑘superscriptsubscript𝜃subscript𝜆𝑘topsubscript𝑥𝑘J(z)=\sum_{k=1}^{N}l(y_{k}-\theta_{\lambda_{k}}^{\top}x_{k})italic_J ( italic_z ) = ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT italic_l ( italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_θ start_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) and AXz=AX𝒚=bXsubscript𝐴𝑋𝑧subscript𝐴𝑋𝒚subscript𝑏𝑋A_{X}z=A_{X}\bm{y}=b_{X}italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT italic_z = italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT bold_italic_y = italic_b start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT. For convenience, τ(X)𝜏𝑋\tau(X)italic_τ ( italic_X ) is introduced to replace the mutual coherence μ(AX)𝜇subscript𝐴𝑋\mu(A_{X})italic_μ ( italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) inspired by [3],

τ(X)𝜏𝑋\displaystyle\tau(X)italic_τ ( italic_X ) =max1t,kN,tk|ΓX(t,k)|(1ΓX(t,t))(1ΓX(k,k))absentsubscriptformulae-sequence1𝑡formulae-sequence𝑘𝑁𝑡𝑘subscriptΓ𝑋𝑡𝑘1subscriptΓ𝑋𝑡𝑡1subscriptΓ𝑋𝑘𝑘\displaystyle=\max_{{1\leq t,k\leq N},~{}{t\neq k}}\frac{\lvert\Gamma_{X}(t,k)% \rvert}{\sqrt{(1-\Gamma_{X}(t,t))(1-\Gamma_{X}(k,k))}}= roman_max start_POSTSUBSCRIPT 1 ≤ italic_t , italic_k ≤ italic_N , italic_t ≠ italic_k end_POSTSUBSCRIPT divide start_ARG | roman_Γ start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( italic_t , italic_k ) | end_ARG start_ARG square-root start_ARG ( 1 - roman_Γ start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( italic_t , italic_t ) ) ( 1 - roman_Γ start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( italic_k , italic_k ) ) end_ARG end_ARG (10)
ΓXsubscriptΓ𝑋\displaystyle\Gamma_{X}roman_Γ start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT =X(XX)1Xabsentsuperscript𝑋topsuperscript𝑋superscript𝑋top1𝑋\displaystyle=X^{\top}(XX^{\top})^{-1}X= italic_X start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( italic_X italic_X start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_X

where ΓX(t,k)subscriptΓ𝑋𝑡𝑘\Gamma_{X}(t,k)roman_Γ start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ( italic_t , italic_k ) represents the value of the (t,k)𝑡𝑘(t,k)( italic_t , italic_k )-entry of ΓXsubscriptΓ𝑋\Gamma_{X}roman_Γ start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT. According to the properties of idempotent matrix A𝐴Aitalic_A, it follows μ(AX)=τ(X)𝜇subscript𝐴𝑋𝜏𝑋\mu(A_{X})=\tau(X)italic_μ ( italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ) = italic_τ ( italic_X ).

2.3 Persistence of excitation problem

The convergence of the identification algorithm always requires the input signal or the regressor vector constructed from input-output data to be persistently exciting [24]. The persistence of excitation is given as follows.

Definition 4.

{xk}k=1Nsuperscriptsubscriptsubscript𝑥𝑘𝑘1𝑁\{x_{k}\}_{k=1}^{N}{ italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT is a persistently exciting sequence of order oedim(xk)=nsubscript𝑜𝑒dimensionsubscript𝑥𝑘𝑛o_{e}\geq\dim(x_{k})=nitalic_o start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT ≥ roman_dim ( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) = italic_n if there exist ρ1,ρ2>0subscript𝜌1subscript𝜌20\rho_{1},\rho_{2}>0italic_ρ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_ρ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT > 0 such that

ρ1In1oek=0oe1xk0+kxk0+kρ2In,k0formulae-sequencesubscript𝜌1subscript𝐼𝑛1subscript𝑜𝑒superscriptsubscript𝑘0subscript𝑜𝑒1superscriptsubscript𝑥subscript𝑘0𝑘topsubscript𝑥subscript𝑘0𝑘subscript𝜌2subscript𝐼𝑛for-allsubscript𝑘0\rho_{1}I_{n}\leq\frac{1}{o_{e}}\sum_{k=0}^{o_{e}-1}x_{k_{0}+k}^{\top}x_{k_{0}% +k}\leq\rho_{2}I_{n},\forall k_{0}\in\mathbb{N}italic_ρ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ≤ divide start_ARG 1 end_ARG start_ARG italic_o start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_k = 0 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_o start_POSTSUBSCRIPT italic_e end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_k end_POSTSUBSCRIPT ≤ italic_ρ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , ∀ italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ blackboard_N (11)

where Insubscript𝐼𝑛I_{n}italic_I start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT denotes the n𝑛nitalic_n identity matrix.

In the absence of noise, ek0subscript𝑒𝑘0e_{k}\equiv 0italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≡ 0, the persistence of excitation problem of the switched linear system (1) becomes whether the following equation has a unique solution for {θi,ξi,k,k=1,,N}i=1S\{\theta_{i},\xi_{i,k},k=1,\cdots,N\}_{i=1}^{S}{ italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_ξ start_POSTSUBSCRIPT italic_i , italic_k end_POSTSUBSCRIPT , italic_k = 1 , ⋯ , italic_N } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT [17],

k=1Ni=1Sξi,k(ykxkθi)2=0,superscriptsubscript𝑘1𝑁superscriptsubscript𝑖1𝑆subscript𝜉𝑖𝑘superscriptsubscript𝑦𝑘superscriptsubscript𝑥𝑘topsubscript𝜃𝑖20\displaystyle\sum_{k=1}^{N}\sum_{i=1}^{S}\xi_{i,k}(y_{k}-x_{k}^{\top}\theta_{i% })^{2}=0,∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT italic_ξ start_POSTSUBSCRIPT italic_i , italic_k end_POSTSUBSCRIPT ( italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 0 , (12)
subject to ξi,k=I{λk=i}subscript𝜉𝑖𝑘𝐼subscript𝜆𝑘𝑖\displaystyle\xi_{i,k}=I\{\lambda_{k}=i\}italic_ξ start_POSTSUBSCRIPT italic_i , italic_k end_POSTSUBSCRIPT = italic_I { italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = italic_i }
i=1Sξi,k=1superscriptsubscript𝑖1𝑆subscript𝜉𝑖𝑘1\displaystyle\sum_{i=1}^{S}\xi_{i,k}=1∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT italic_ξ start_POSTSUBSCRIPT italic_i , italic_k end_POSTSUBSCRIPT = 1

where I{}𝐼I\{\cdot\}italic_I { ⋅ } represents the indicator function. Then for switched linear system (1), a PE condition was proposed based on Assumption 1 as follows [17].

Assumption 2.

(See Assumption 1 in [17]) For switched linear systems (1):

  1. 1.

    θiθj,ij,i,j{1,,S}formulae-sequencesubscript𝜃𝑖subscript𝜃𝑗formulae-sequence𝑖𝑗for-all𝑖𝑗1𝑆\theta_{i}\neq\theta_{j},~{}i\neq j,~{}\forall i,j\in\{1,\cdots,S\}italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≠ italic_θ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_i ≠ italic_j , ∀ italic_i , italic_j ∈ { 1 , ⋯ , italic_S }

  2. 2.

    Assumption 1 holds.

  3. 3.

    There exists an ordered set sequence (κp1,,κpS)subscript𝜅subscript𝑝1subscript𝜅subscript𝑝𝑆(\kappa_{p_{1}},\cdots,\kappa_{p_{S}})( italic_κ start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , ⋯ , italic_κ start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) in the order of (p1,,pS)subscript𝑝1subscript𝑝𝑆(p_{1},\cdots,p_{S})( italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_p start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ) with (p1,,pS)subscript𝑝1subscript𝑝𝑆(p_{1},\cdots,p_{S})( italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_p start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ) being a permutation of (1,,S)1𝑆(1,\cdots,S)( 1 , ⋯ , italic_S ) such that the following statements hold sequentially:

    1. (a)

      For any non-overlap** partition having the form κp1=i=1Sκp1(S,i)subscript𝜅subscript𝑝1superscriptsubscript𝑖1𝑆superscriptsubscript𝜅subscript𝑝1𝑆𝑖\kappa_{p_{1}}=\cup_{i=1}^{S}\kappa_{p_{1}}^{(S,i)}italic_κ start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = ∪ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT italic_κ start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_S , italic_i ) end_POSTSUPERSCRIPT, there exists some subset κp1s,i1superscriptsubscript𝜅subscript𝑝1𝑠subscript𝑖1\kappa_{p_{1}}^{s,i_{1}}italic_κ start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_s , italic_i start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT with i1{1,,S}subscript𝑖11𝑆i_{1}\in\{1,\cdots,S\}italic_i start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ { 1 , ⋯ , italic_S } such that kκp1S,i1xkxksubscript𝑘superscriptsubscript𝜅subscript𝑝1𝑆subscript𝑖1subscript𝑥𝑘superscriptsubscript𝑥𝑘top\sum_{k\in\kappa_{p_{1}}^{S,i_{1}}}x_{k}x_{k}^{\top}∑ start_POSTSUBSCRIPT italic_k ∈ italic_κ start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S , italic_i start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT is nonsingular.

    2. (b)

      For any non-overlap** partition of κprsubscript𝜅subscript𝑝𝑟\kappa_{p_{r}}italic_κ start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT having the following form with r{2,,S}𝑟2𝑆r\in\{2,\cdots,S\}italic_r ∈ { 2 , ⋯ , italic_S }

      {κpr(Sr+1,i),i{1,,S}\{i1,,ir1}},superscriptsubscript𝜅subscript𝑝𝑟𝑆𝑟1𝑖𝑖\1𝑆subscript𝑖1subscript𝑖subscript𝑟1\{\kappa_{p_{r}}^{(S-r+1,i)},i\in\{1,\cdots,S\}\backslash\{i_{1},\cdots,i_{r_{% 1}}\}\},{ italic_κ start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_S - italic_r + 1 , italic_i ) end_POSTSUPERSCRIPT , italic_i ∈ { 1 , ⋯ , italic_S } \ { italic_i start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_i start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT } } , (13)

      there exists ir{1,,S}\{i1,,ir1}subscript𝑖𝑟\1𝑆subscript𝑖1subscript𝑖subscript𝑟1i_{r}\in\{1,\cdots,S\}\backslash\{i_{1},\cdots,i_{r_{1}}\}italic_i start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ∈ { 1 , ⋯ , italic_S } \ { italic_i start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_i start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT } such that kκprSr+1,irxkxksubscript𝑘superscriptsubscript𝜅subscript𝑝𝑟𝑆𝑟1subscript𝑖𝑟subscript𝑥𝑘superscriptsubscript𝑥𝑘top\sum_{k\in\kappa_{p_{r}}^{S-r+1,i_{r}}}x_{k}x_{k}^{\top}∑ start_POSTSUBSCRIPT italic_k ∈ italic_κ start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S - italic_r + 1 , italic_i start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT is nonsingular.

    3. (c)

      kκpSxkxksubscript𝑘subscript𝜅subscript𝑝𝑆subscript𝑥𝑘superscriptsubscript𝑥𝑘top\sum_{k\in\kappa_{p_{S}}}x_{k}x_{k}^{\top}∑ start_POSTSUBSCRIPT italic_k ∈ italic_κ start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT is nonsingular.

3 Identification based on dynamic programming and sparsity inducing

In this section, a two-stage identification method for switched linear systems (1) is proposed. In the first stage, the switching instants are identified based on an iterative dynamic programming algorithm in our prior work [22] and the computational complexity of the algorithm has been further simplified. Under the segmented data which are divided according to the estimated switching instants, the parameters of each subsystem are determined based on the sparsity inducing methods.

3.1 Constrained switching mechanism and problem formulation

We begin with the limitation of Assumption 1 for the identification of switched linear system (1).

Actually, if we use fitting error to distinguish the mode of the data at that moment (xk,yk)subscript𝑥𝑘subscript𝑦𝑘(x_{k},y_{k})( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) as mentioned in most literature, it is also known that [yk,xk]superscriptsubscript𝑦𝑘superscriptsubscript𝑥𝑘toptop[y_{k},-x_{k}^{\top}]^{\top}[ italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , - italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT belongs to the kernel space (or the null space) of the parameter vector [1,θi]1superscriptsubscript𝜃𝑖top[1,\theta_{i}^{\top}][ 1 , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] which is written as, [yk,xk]ker([1,θi])superscriptsubscript𝑦𝑘superscriptsubscript𝑥𝑘toptopkernel1superscriptsubscript𝜃𝑖top[y_{k},-x_{k}^{\top}]^{\top}\in\ker([-1,\theta_{i}^{\top}])[ italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , - italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ∈ roman_ker ( [ - 1 , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] ). Obviously, the data that belongs to the intersection of the two kernel spaces, ker([1,θi])ker([1,θj]),ijkernel1superscriptsubscript𝜃𝑖topkernel1superscriptsubscript𝜃𝑗top𝑖𝑗\ker([1,\theta_{i}^{\top}])\cap\ker([1,\theta_{j}^{\top}]),i\neq jroman_ker ( [ 1 , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] ) ∩ roman_ker ( [ 1 , italic_θ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] ) , italic_i ≠ italic_j, cannot be distinguished whether it belongs to the i𝑖iitalic_ith or j𝑗jitalic_jth submodel. In the absence of noise, partitioning the data sample (xk,yk)subscript𝑥𝑘subscript𝑦𝑘(x_{k},y_{k})( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) into the i𝑖iitalic_ith submodel or j𝑗jitalic_jth submodel does not affect the identification of the parameters {θi}i=1Ssuperscriptsubscriptsubscript𝜃𝑖𝑖1𝑆\{\theta_{i}\}_{i=1}^{S}{ italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT. While in the presence of noise, we cannot distinguish whether the distance from a sample point, (xk,yk)subscript𝑥𝑘subscript𝑦𝑘(x_{k},y_{k})( italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ), to the kernel space, ker([1,θi])kernel1superscriptsubscript𝜃𝑖top\ker([1,\theta_{i}^{\top}])roman_ker ( [ 1 , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] ), is influenced by noise or not and this leads to the inevitable identification errors and the influence on the unbiasedness of the identification. We make a graphical illustration for the result of the mismatch estimated partition in Fig. 1.

Refer to caption
Figure 1: A noisy example with data samples {xk}k=18superscriptsubscriptsubscript𝑥𝑘𝑘18\{x_{k}\}_{k=1}^{8}{ italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 8 end_POSTSUPERSCRIPT for the illustration of the mismatch in data partition. The original data partition corresponding to the true switching instants, {λk}k=18superscriptsubscriptsubscript𝜆𝑘𝑘18\{\lambda_{k}\}_{k=1}^{8}{ italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 8 end_POSTSUPERSCRIPT, is shown on the left graph while the estimated partition with mismatch samples, {λ^k}k=18superscriptsubscriptsubscript^𝜆𝑘𝑘18\{\hat{\lambda}_{k}\}_{k=1}^{8}{ over^ start_ARG italic_λ end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 8 end_POSTSUPERSCRIPT, is on the right.

As depicted in Fig. 1, {xk}k=14superscriptsubscriptsubscript𝑥𝑘𝑘14\{x_{k}\}_{k=1}^{4}{ italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT and {xk}k=58superscriptsubscriptsubscript𝑥𝑘𝑘58\{x_{k}\}_{k=5}^{8}{ italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 8 end_POSTSUPERSCRIPT belong to different subsystems. If we only use the fitting errors to identify the switched linear system with {xk}k=18superscriptsubscriptsubscript𝑥𝑘𝑘18\{x_{k}\}_{k=1}^{8}{ italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 8 end_POSTSUPERSCRIPT, the estimated partition with the least fitting error can be shown on the right graph in Fig. 1, with {x1,x2,x4,x5}subscript𝑥1subscript𝑥2subscript𝑥4subscript𝑥5\{x_{1},x_{2},x_{4},x_{5}\}{ italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 5 end_POSTSUBSCRIPT } and {x3,x6,x7,x8}subscript𝑥3subscript𝑥6subscript𝑥7subscript𝑥8\{x_{3},x_{6},x_{7},x_{8}\}{ italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 6 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 7 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 8 end_POSTSUBSCRIPT }, thereby introducing bias in the identification of the parameter θ^2subscript^𝜃2\hat{\theta}_{2}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT.

However, by incorporating prior knowledge about the switching instants in time, specifically that mode switching only occurs once, the identification result is clearly formulated as the true partition with two segments: xkk=14subscript𝑥𝑘𝑘superscript14{x_{k}}{k=1}^{4}italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_k = 1 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT and xkk=58subscript𝑥𝑘𝑘superscript58{x_{k}}{k=5}^{8}italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_k = 5 start_POSTSUPERSCRIPT 8 end_POSTSUPERSCRIPT. Therefore, methods that only rely on fitting errors for data partitioning depend on the noise distribution, while method leveraging the prior knowledge about switching instants in time may enhance the robustness of the identification. This explains why the aforementioned methods based on Assumption 1 exhibit poor robustness against noise in practical applications.

To introduce the prior knowledge about switching instants in time, we need the segmentation of time series data before the identification for switched linear systems. Then we propose the constrained switching mechanism which assumes the existence of a minimal interval between adjacent switching instants and stated as follows:

Assumption 3.

Assumed that there exist M𝑀Mitalic_M different switching instants, {sm}m=1M+1,s1=1superscriptsubscriptsubscript𝑠𝑚𝑚1𝑀1subscript𝑠11\{s_{m}\}_{m=1}^{M+1},s_{1}=1{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M + 1 end_POSTSUPERSCRIPT , italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1 in the collected time series data. The configuration of adjacent switching instants satisfies

sm+1smΔsmin,subscript𝑠𝑚1subscript𝑠𝑚Δsubscript𝑠s_{m+1}-s_{m}\geq\Delta s_{\min},italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT - italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ≥ roman_Δ italic_s start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT , (14)

where ΔsminΔsubscript𝑠\Delta s_{\min}roman_Δ italic_s start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT is an integer and regulates the minimum distance between adjacent segmentation points.

Here we make a comment on Assumption 3, which not only avoid the nontrivial solutions in our proposed identification methods but also holds significant practical implications. This is because in practical applications, such as industrial processes, there exists a time interval between the switching of system modes due to the change of the environment or system operating points. This is because in industrial processes, the switching instants of systems always stem from changes in the environment and adjustments in operating points. Therefore, system switches cannot occur frequently. This accounts for the prior knowledge of the constrained switching mechanism as described in Assumption 3 on the timescale.

Next, we will introduce our two-stage identification framework for switched linear systems based on Assumption 3. The identification problem (7) of the switched linear system (1) are transformed into the identification of the switching instants {sm}m=1M+1superscriptsubscriptsubscript𝑠𝑚𝑚1𝑀1\{s_{m}\}_{m=1}^{M+1}{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M + 1 end_POSTSUPERSCRIPT and the identification of the parameters {θi}i=1S,{λk}k=1Ssuperscriptsubscriptsubscript𝜃𝑖𝑖1𝑆superscriptsubscriptsubscript𝜆𝑘𝑘1𝑆\{\theta_{i}\}_{i=1}^{S},\{\lambda_{k}\}_{k=1}^{S}{ italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT , { italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT. The qualitative trend analysis [18] and curve fitting [19, 20, 21] are introduced for the identification of switching instants {sm}m=1M+1superscriptsubscriptsubscript𝑠𝑚𝑚1𝑀1\{s_{m}\}_{m=1}^{M+1}{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M + 1 end_POSTSUPERSCRIPT, since the parameter vectors θλksubscript𝜃subscript𝜆𝑘\theta_{\lambda_{k}}italic_θ start_POSTSUBSCRIPT italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT can be considered as the trend of the segment, βmsubscript𝛽𝑚\beta_{m}italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT. Then the identification problem for the switching instants are formulated as,

minM,{sm}m=1M,{βm}m=1Msubscript𝑀superscriptsubscriptsubscript𝑠𝑚𝑚1𝑀superscriptsubscriptsubscript𝛽𝑚𝑚1𝑀\displaystyle\min\limits_{M,\{s_{m}\}_{m=1}^{M},\{\beta_{m}\}_{m=1}^{M}}roman_min start_POSTSUBSCRIPT italic_M , { italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT , { italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT end_POSTSUBSCRIPT m=1M+1k=smsm+11l(ykβmxk)superscriptsubscript𝑚1𝑀1superscriptsubscript𝑘subscript𝑠𝑚subscript𝑠𝑚11𝑙subscript𝑦𝑘superscriptsubscript𝛽𝑚topsubscript𝑥𝑘\displaystyle\quad\sum_{m=1}^{M+1}\sum_{k=s_{m}}^{s_{m+1}-1}l(y_{k}-\beta_{m}^% {\top}x_{k})∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M + 1 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_k = italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT italic_l ( italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) (15)
subj. to M,s1=1,sM+1=N+1formulae-sequence𝑀formulae-sequencesubscript𝑠11subscript𝑠𝑀1𝑁1\displaystyle~{}M\in\mathbb{N},~{}s_{1}=1,~{}s_{M+1}=N+1italic_M ∈ blackboard_N , italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1 , italic_s start_POSTSUBSCRIPT italic_M + 1 end_POSTSUBSCRIPT = italic_N + 1
βmn,m=1,,M.formulae-sequencesubscript𝛽𝑚superscript𝑛for-all𝑚1𝑀\displaystyle~{}\beta_{m}\in\mathbb{R}^{n},~{}\forall m=1,\cdots,M.italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , ∀ italic_m = 1 , ⋯ , italic_M .

If the data segmentation is sufficiently precise, we can make the following assumption.

Assumption 4.

Each segment of data belongs to the same subsystem. That is, m=1,,Mfor-all𝑚1𝑀\forall m=1,\cdots,M∀ italic_m = 1 , ⋯ , italic_M, we have

λk=Λm,k=sm,sm+1,,sm+11formulae-sequencesubscript𝜆𝑘subscriptΛ𝑚𝑘subscript𝑠𝑚subscript𝑠𝑚1subscript𝑠𝑚11\lambda_{k}=\Lambda_{m},~{}k=s_{m},s_{m}+1,\cdots,s_{m+1}-1italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT = roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , italic_k = italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 , ⋯ , italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT - 1 (16)

where ΛmsubscriptΛ𝑚\Lambda_{m}\in\mathbb{N}roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∈ blackboard_N indicates the order of the active subsystem among the time instants smsubscript𝑠𝑚s_{m}italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT and sm+1subscript𝑠𝑚1s_{m+1}italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT.

Remark 2.

Assumption 4 is based on the segmented data by introducing the prior knowledge about the switching instants in time, and it degenerates into Assumption 1, when there is only one data sample in each segment, denoted as M=N𝑀𝑁M=Nitalic_M = italic_N, sm=msubscript𝑠𝑚𝑚s_{m}=mitalic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = italic_m, m=1,,Mfor-all𝑚1𝑀\forall m=1,\cdots,M∀ italic_m = 1 , ⋯ , italic_M and Δsmin=1Δsubscript𝑠𝑚𝑖𝑛1\Delta{s_{min}}=1roman_Δ italic_s start_POSTSUBSCRIPT italic_m italic_i italic_n end_POSTSUBSCRIPT = 1.

Considering that data is segmented by {sm}m=1M+1superscriptsubscriptsubscript𝑠𝑚𝑚1𝑀1\{s_{m}\}_{m=1}^{M+1}{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M + 1 end_POSTSUPERSCRIPT and Assumption 4 holds, the identification of parameters {θi}i=1S,{λk}k=1Ssuperscriptsubscriptsubscript𝜃𝑖𝑖1𝑆superscriptsubscriptsubscript𝜆𝑘𝑘1𝑆\{\theta_{i}\}_{i=1}^{S},\{\lambda_{k}\}_{k=1}^{S}{ italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT , { italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT can be formulated as

minS,{θi}i=1S,{Λm}m=1Msubscript𝑆superscriptsubscriptsubscript𝜃𝑖𝑖1𝑆superscriptsubscriptsubscriptΛ𝑚𝑚1𝑀\displaystyle\min\limits_{S,\{\theta_{i}\}_{i=1}^{S},\{\Lambda_{m}\}_{m=1}^{M}}roman_min start_POSTSUBSCRIPT italic_S , { italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT , { roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT end_POSTSUBSCRIPT m=1Mk=smsm+11l(ykθΛmxk)superscriptsubscript𝑚1𝑀superscriptsubscript𝑘subscript𝑠𝑚subscript𝑠𝑚11𝑙subscript𝑦𝑘superscriptsubscript𝜃subscriptΛ𝑚topsubscript𝑥𝑘\displaystyle\quad\sum_{m=1}^{M}\sum_{k=s_{m}}^{s_{m+1}-1}l(y_{k}-\theta_{% \Lambda_{m}}^{\top}x_{k})∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_k = italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT italic_l ( italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_θ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) (17)
subj. to S,θin,i=1,,S,formulae-sequence𝑆formulae-sequencesubscript𝜃𝑖superscript𝑛for-all𝑖1𝑆\displaystyle~{}S\in\mathbb{N},~{}\theta_{i}\in\mathbb{R}^{n},~{}\forall i=1,% \cdots,S,italic_S ∈ blackboard_N , italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT , ∀ italic_i = 1 , ⋯ , italic_S ,
Λm{1,,S}m=1,,M.formulae-sequencesubscriptΛ𝑚1𝑆for-all𝑚1𝑀\displaystyle~{}\quad\Lambda_{m}\in\{1,\cdots,S\}~{}\forall m=1,\cdots,M.roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∈ { 1 , ⋯ , italic_S } ∀ italic_m = 1 , ⋯ , italic_M .

Next, we will introduce the two-stage identification method to address the identification problems of switched linear system (1), which is divided into the optimization problems (15) and (17).

3.2 Identification of the switching instants by dynamic programming

Before introducing the algorithm, the expressions of the data segmentation are defined for the sake of convenience. Suppose that the input-output time series data 𝒯={xk}k=1N,𝒚={y1,,yN}formulae-sequence𝒯superscriptsubscriptsubscript𝑥𝑘𝑘1𝑁𝒚subscript𝑦1subscript𝑦𝑁\mathcal{T}=\{x_{k}\}_{k=1}^{N},~{}\bm{y}=\{y_{1},\cdots,y_{N}\}caligraphic_T = { italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT , bold_italic_y = { italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_y start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT } are collected by the switched linear system (1) and there exist M𝑀Mitalic_M switching instants. Identification of the switching instants is to divide the time series data into M𝑀Mitalic_M non-overlap** data 𝒯=m=1M𝒯m,𝒚=m=1M𝒚m,𝒆=m=1M𝒆mformulae-sequence𝒯superscriptsubscript𝑚1𝑀subscript𝒯𝑚formulae-sequence𝒚superscriptsubscript𝑚1𝑀superscript𝒚𝑚𝒆superscriptsubscript𝑚1𝑀superscript𝒆𝑚\mathcal{T}=\cup_{m=1}^{M}\mathcal{T}_{m},~{}\bm{y}=\cup_{m=1}^{M}\bm{y}^{m},~% {}\bm{e}=\cup_{m=1}^{M}\bm{e}^{m}caligraphic_T = ∪ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , bold_italic_y = ∪ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT bold_italic_y start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , bold_italic_e = ∪ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT bold_italic_e start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT, where each segment is expressed as:

𝒯m={xsm,xsm+1,,xsm+11}subscript𝒯𝑚subscript𝑥subscript𝑠𝑚subscript𝑥subscript𝑠𝑚1subscript𝑥subscript𝑠𝑚11\displaystyle\mathcal{T}_{m}=\left\{x_{s_{m}},x_{s_{m}+1},\cdots,x_{s_{m+1}-1}\right\}caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = { italic_x start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT , ⋯ , italic_x start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT } (18)
𝒚m={ysm,ysm+1,,ysm+11}superscript𝒚𝑚subscript𝑦subscript𝑠𝑚subscript𝑦subscript𝑠𝑚1subscript𝑦subscript𝑠𝑚11\displaystyle\bm{y}^{m}=\left\{y_{s_{m}},y_{s_{m}+1},\cdots,y_{s_{m+1}-1}\right\}bold_italic_y start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT = { italic_y start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT , ⋯ , italic_y start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT }
𝒆m={esm,esm+1,,esm+11}.superscript𝒆𝑚subscript𝑒subscript𝑠𝑚subscript𝑒subscript𝑠𝑚1subscript𝑒subscript𝑠𝑚11\displaystyle\bm{e}^{m}=\left\{e_{s_{m}},e_{s_{m}+1},\cdots,e_{s_{m+1}-1}% \right\}.bold_italic_e start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT = { italic_e start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_e start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT , ⋯ , italic_e start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT } .

Here, a grid of segmentation points {sm}m=1M+1superscriptsubscriptsubscript𝑠𝑚𝑚1𝑀1\{s_{m}\}_{m=1}^{M+1}{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M + 1 end_POSTSUPERSCRIPT is introduced that satisfy 1=s1<s2<<sM<sM+1=N+11subscript𝑠1subscript𝑠2subscript𝑠𝑀subscript𝑠𝑀1𝑁11=s_{1}<s_{2}<\cdots<s_{M}<s_{M+1}=N+11 = italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < ⋯ < italic_s start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT < italic_s start_POSTSUBSCRIPT italic_M + 1 end_POSTSUBSCRIPT = italic_N + 1, where smsubscript𝑠𝑚s_{m}italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT indicates the index of the first data sample in 𝒯msubscript𝒯𝑚\mathcal{T}_{m}caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT, m=1,,M𝑚1𝑀m=1,\cdots,Mitalic_m = 1 , ⋯ , italic_M. For convenience, the restriction of entire time series 𝒯={xk,k=1,,N}\mathcal{T}=\{x_{k},~{}k=1,\cdots,N\}caligraphic_T = { italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_k = 1 , ⋯ , italic_N } and 𝒚={yk,k=1,,N}\bm{y}=\{y_{k},~{}k=1,\cdots,N\}bold_italic_y = { italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_k = 1 , ⋯ , italic_N } to a specific interval [k,k],k<k𝑘superscript𝑘𝑘superscript𝑘[k,k^{\prime}],k<k^{\prime}[ italic_k , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ] , italic_k < italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is denoted as:

𝒚(k:k)=[yk,yk+1,,yk]\displaystyle\bm{y}(k:k^{\prime})=[y_{k},y_{k+1},\cdots,y_{k^{\prime}}]^{\top}bold_italic_y ( italic_k : italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = [ italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT , ⋯ , italic_y start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT (19)
𝒯(k:k)=[xk,xk+1,,xk].\displaystyle\mathcal{T}({k:k^{\prime}})=[x_{k},x_{k+1},\cdots,x_{k^{\prime}}].caligraphic_T ( italic_k : italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = [ italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_k + 1 end_POSTSUBSCRIPT , ⋯ , italic_x start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ] .

Then with the monotonic increasing 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT-norm function l()=2𝑙subscriptdelimited-∥∥2l(\cdot)=\lVert\cdot\rVert_{2}italic_l ( ⋅ ) = ∥ ⋅ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, the objective function of the mixed integer programming problem (15) with known number of switching instants M𝑀Mitalic_M can be rewritten as

𝒥(M)=min{sm+1,βm}m=1Mm=1M𝒚(sm:sm+1)𝒯mβm22\displaystyle\mathcal{J}(M)=\min_{\{s_{m+1},\beta_{m}\}_{m=1}^{M}}\sum_{m=1}^{% M}\lVert\bm{y}(s_{m}:s_{m+1})-\mathcal{T}_{m}^{\top}\beta_{m}\rVert_{2}^{2}caligraphic_J ( italic_M ) = roman_min start_POSTSUBSCRIPT { italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT ∥ bold_italic_y ( italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT : italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT ) - caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (20)

where {sj}j=1M+1,s1=1superscriptsubscriptsubscript𝑠𝑗𝑗1𝑀1subscript𝑠11\{s_{j}\}_{j=1}^{M+1},s_{1}=1{ italic_s start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_j = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M + 1 end_POSTSUPERSCRIPT , italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1 represent the switching instants. According to the least squares algorithm for the parameter estimation of linear systems, βmsubscript𝛽𝑚\beta_{m}italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT can be constructed from the segmented data 𝒯msubscript𝒯𝑚\mathcal{T}_{m}caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT and 𝒚(sm:sm+1)\bm{y}(s_{m}:s_{m+1})bold_italic_y ( italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT : italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT ), denoted as βm=(𝒯m𝒯m)1𝒯m𝒚(sm:sm+1)\beta_{m}=(\mathcal{T}_{m}\mathcal{T}_{m}^{\top})^{-1}\mathcal{T}_{m}\bm{y}(s_% {m}:s_{m+1})italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = ( caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT bold_italic_y ( italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT : italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT ). Therefore, the problem (20) will be addressed with the decisive parameters {sm}m=1Msuperscriptsubscriptsubscript𝑠𝑚𝑚1𝑀\{s_{m}\}_{m=1}^{M}{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT of 𝒥(M)𝒥𝑀\mathcal{J}(M)caligraphic_J ( italic_M ) by an iterative dynamic programming method in [22].

For a subproblem involving optimally dividing a shorter data trajectory 𝒯(1:k)\mathcal{T}({1:k})caligraphic_T ( 1 : italic_k ), where kN𝑘𝑁k\leq Nitalic_k ≤ italic_N, into m𝑚mitalic_m segments, the total fitting error and the optimal segmentation choice are defined as follows:

Cm(k)subscript𝐶𝑚𝑘\displaystyle C_{m}(k)italic_C start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_k ) =mins1,,smi=1m𝒚(si:si+1)𝒯iβi22,\displaystyle=\min_{s_{1},\cdots,s_{m}}\sum_{i=1}^{m}\lVert\bm{y}(s_{i}:s_{i+1% })-\mathcal{T}_{i}^{\top}\beta_{i}\rVert_{2}^{2},= roman_min start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∥ bold_italic_y ( italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : italic_s start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ) - caligraphic_T start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (21)
𝒦m(k)subscript𝒦𝑚𝑘\displaystyle\mathcal{K}_{m}(k)caligraphic_K start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_k ) =argmins1,,smi=1m𝒚(si:si+1)𝒯iβi22,\displaystyle=\mathop{\arg\min}_{s_{1},\cdots,s_{m}}\sum_{i=1}^{m}\lVert\bm{y}% (s_{i}:s_{i+1})-\mathcal{T}_{i}^{\top}\beta_{i}\rVert_{2}^{2},= start_BIGOP roman_arg roman_min end_BIGOP start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∥ bold_italic_y ( italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT : italic_s start_POSTSUBSCRIPT italic_i + 1 end_POSTSUBSCRIPT ) - caligraphic_T start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ,
={s^1m(k),,s^mm(k)}absentsuperscriptsubscript^𝑠1𝑚𝑘superscriptsubscript^𝑠𝑚𝑚𝑘\displaystyle=\{\hat{s}_{1}^{m}(k),\cdots,\hat{s}_{m}^{m}(k)\}= { over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( italic_k ) , ⋯ , over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( italic_k ) }

where s^1m(k)=s1=1superscriptsubscript^𝑠1𝑚𝑘subscript𝑠11\hat{s}_{1}^{m}(k)=s_{1}=1over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( italic_k ) = italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1 is enforced and sm+1=ksubscript𝑠𝑚1𝑘s_{m+1}=kitalic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT = italic_k is assumed to simplify notations. Besides, we define Pm(k)=s^mm(k)subscript𝑃𝑚𝑘superscriptsubscript^𝑠𝑚𝑚𝑘P_{m}(k)=\hat{s}_{m}^{m}(k)italic_P start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_k ) = over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ( italic_k ). Obviously, the optimal value of (20) is given by 𝒥(M)=CM(N)𝒥𝑀subscript𝐶𝑀𝑁\mathcal{J}(M)=C_{M}(N)caligraphic_J ( italic_M ) = italic_C start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( italic_N ) and the optimal choice of segmentation can be readily recovered from PM(N)subscript𝑃𝑀𝑁P_{M}(N)italic_P start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( italic_N ). In virtue of dynamic programming, Cm(k)subscript𝐶𝑚𝑘C_{m}(k)italic_C start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_k ) and Pm(k)subscript𝑃𝑚𝑘P_{m}(k)italic_P start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_k ) can be conveniently calculated based on the following recursions [22] with Assumption 3.

  • Step 1: Initialize C1(k),k=1,,Nformulae-sequencesubscript𝐶1𝑘𝑘1𝑁C_{1}(k),~{}k=1,\cdots,Nitalic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_k ) , italic_k = 1 , ⋯ , italic_N:

    C1(k)=0,P1(k)=k.formulae-sequencesubscript𝐶1𝑘0subscript𝑃1𝑘𝑘C_{1}(k)=0,~{}P_{1}(k)=k.italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_k ) = 0 , italic_P start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_k ) = italic_k . (22)
  • Step 2: Calculate Cm(k)subscript𝐶𝑚𝑘C_{m}(k)italic_C start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_k ) and Pm(k)subscript𝑃𝑚𝑘P_{m}(k)italic_P start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_k ) for m=2,,M𝑚2𝑀m=2,\cdots,Mitalic_m = 2 , ⋯ , italic_M by following the Bellman principle of optimality:

    Cm(k)=subscript𝐶𝑚𝑘absent\displaystyle C_{m}(k)=italic_C start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_k ) = min(m1)ΔsminjkCm1(j)+dm,k(j),subscript𝑚1Δsubscript𝑠𝑗𝑘subscript𝐶𝑚1𝑗subscript𝑑𝑚𝑘𝑗\displaystyle\min_{(m-1)\Delta s_{\min}\leq j\leq k}C_{m-1}(j)+d_{m,k}(j),roman_min start_POSTSUBSCRIPT ( italic_m - 1 ) roman_Δ italic_s start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ≤ italic_j ≤ italic_k end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_m - 1 end_POSTSUBSCRIPT ( italic_j ) + italic_d start_POSTSUBSCRIPT italic_m , italic_k end_POSTSUBSCRIPT ( italic_j ) , (23)
    Pm(k)=subscript𝑃𝑚𝑘absent\displaystyle P_{m}(k)=italic_P start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_k ) = argmin(m1)ΔsminjkCm1(j)+dm,k(j),subscript𝑚1Δsubscript𝑠𝑗𝑘subscript𝐶𝑚1𝑗subscript𝑑𝑚𝑘𝑗\displaystyle\mathop{\arg\min}_{(m-1)\Delta s_{\min}\leq j\leq k}C_{m-1}(j)+d_% {m,k}(j),start_BIGOP roman_arg roman_min end_BIGOP start_POSTSUBSCRIPT ( italic_m - 1 ) roman_Δ italic_s start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT ≤ italic_j ≤ italic_k end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_m - 1 end_POSTSUBSCRIPT ( italic_j ) + italic_d start_POSTSUBSCRIPT italic_m , italic_k end_POSTSUBSCRIPT ( italic_j ) ,

    where k=(m1)Δsmin,(m1)Δsmin+1,,Nfor-all𝑘𝑚1Δsubscript𝑠𝑚1Δsubscript𝑠1𝑁\forall k=(m-1)\Delta s_{\min},(m-1)\Delta s_{\min}+1,\cdots,N∀ italic_k = ( italic_m - 1 ) roman_Δ italic_s start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT , ( italic_m - 1 ) roman_Δ italic_s start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT + 1 , ⋯ , italic_N, dm,k(j)=minβn𝒚(j+1:k)𝒯(j+1:k)β22d_{m,k}(j)=\min_{\beta\in\mathbb{R}^{n}}\lVert\bm{y}(j+1:k)-\mathcal{T}({j+1:k% })^{\top}\beta\rVert_{2}^{2}italic_d start_POSTSUBSCRIPT italic_m , italic_k end_POSTSUBSCRIPT ( italic_j ) = roman_min start_POSTSUBSCRIPT italic_β ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ bold_italic_y ( italic_j + 1 : italic_k ) - caligraphic_T ( italic_j + 1 : italic_k ) start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_β ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT.

  • Step 3: Recover {s^m}m=1M+1superscriptsubscriptsubscript^𝑠𝑚𝑚1𝑀1\{\hat{s}_{m}\}_{m=1}^{M+1}{ over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M + 1 end_POSTSUPERSCRIPT successively with s^M+1=Nsubscript^𝑠𝑀1𝑁\hat{s}_{M+1}=Nover^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_M + 1 end_POSTSUBSCRIPT = italic_N:

    s^M=PM(s^M+1),,s^2=P2(s^3),s^1=1.formulae-sequencesubscript^𝑠𝑀subscript𝑃𝑀subscript^𝑠𝑀1formulae-sequencesubscript^𝑠2subscript𝑃2subscript^𝑠3subscript^𝑠11\displaystyle\hat{s}_{M}=P_{M}(\hat{s}_{M+1}),\dots,~{}\hat{s}_{2}=P_{2}(\hat{% s}_{3}),~{}\hat{s}_{1}=1.over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT = italic_P start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ( over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_M + 1 end_POSTSUBSCRIPT ) , … , over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_P start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) , over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1 . (24)

During the implementation of the identification process for the switching instants, we utilize the following criterion to optimally determine the number of segments M𝑀Mitalic_M [25, 22]:

M=argminMlog(𝒥(1)/𝒥(M))M1.superscript𝑀subscript𝑀𝒥1𝒥𝑀𝑀1M^{*}=\arg\min_{M}\frac{\log(\mathcal{J}(1)/\mathcal{J}(M))}{M-1}.italic_M start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = roman_arg roman_min start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT divide start_ARG roman_log ( caligraphic_J ( 1 ) / caligraphic_J ( italic_M ) ) end_ARG start_ARG italic_M - 1 end_ARG . (25)
Remark 3.

Here we make a comment on the calculation of the linear estimation problem dm,k(j)subscript𝑑𝑚𝑘𝑗d_{m,k}(j)italic_d start_POSTSUBSCRIPT italic_m , italic_k end_POSTSUBSCRIPT ( italic_j ). Although the problem can be directly solved using the batch least-squares algorithm, the recursive least-squares algorithm [26] is introduced to significantly reduce the complexity of the algorithm. Then, dm,k(j+1)subscript𝑑𝑚𝑘𝑗1d_{m,k}(j+1)italic_d start_POSTSUBSCRIPT italic_m , italic_k end_POSTSUBSCRIPT ( italic_j + 1 ) can be derived in the following recursive equations for j=(m1)Δsmin,,k𝑗𝑚1Δsubscript𝑠𝑚𝑖𝑛𝑘j=(m-1)\Delta s_{min},\cdots,kitalic_j = ( italic_m - 1 ) roman_Δ italic_s start_POSTSUBSCRIPT italic_m italic_i italic_n end_POSTSUBSCRIPT , ⋯ , italic_k:

{dm,k(j+1)=dm,k(j)+[zj+1xj+1𝜷^jm]21+xj+1TQjmxj+1Kj+1m=Qjmxj+1[1+xj+1Qjmxj+1]1Qj+1m=[IKjmxj+1]1Qjm𝜷^j+1m=𝜷^jm+Kjm[yj+1xj+1𝜷^jm].\left\{\begin{aligned} d_{m,k}(j+1)&=d_{m,k}(j)+\frac{[z_{j+1}-x_{j+1}^{\top}% \bm{\hat{\beta}}_{j}^{m}]^{2}}{1+x_{j+1}^{T}Q_{j}^{m}x_{j+1}}\\ K_{j+1}^{m}&=Q_{j}^{m}x_{j+1}[1+x_{j+1}^{\top}Q_{j}^{m}x_{j+1}]^{-1}\\ Q_{j+1}^{m}&=[I-K_{j}^{m}x_{j+1}^{\top}]^{-1}Q_{j}^{m}\\ \bm{\hat{\beta}}_{j+1}^{m}&=\bm{\hat{\beta}}_{j}^{m}+K_{j}^{m}[y_{j+1}-x_{j+1}% ^{\top}\bm{\hat{\beta}}_{j}^{m}]\\ \end{aligned}\right..{ start_ROW start_CELL italic_d start_POSTSUBSCRIPT italic_m , italic_k end_POSTSUBSCRIPT ( italic_j + 1 ) end_CELL start_CELL = italic_d start_POSTSUBSCRIPT italic_m , italic_k end_POSTSUBSCRIPT ( italic_j ) + divide start_ARG [ italic_z start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT overbold_^ start_ARG bold_italic_β end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 1 + italic_x start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_Q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT end_ARG end_CELL end_ROW start_ROW start_CELL italic_K start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL start_CELL = italic_Q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT [ 1 + italic_x start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_Q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL italic_Q start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL start_CELL = [ italic_I - italic_K start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT italic_x start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_Q start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL overbold_^ start_ARG bold_italic_β end_ARG start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL start_CELL = overbold_^ start_ARG bold_italic_β end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT + italic_K start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT [ italic_y start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT overbold_^ start_ARG bold_italic_β end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ] end_CELL end_ROW . (26)

The procedure for implementing the identification of switching instants by dynamic programming can be outlined as Algorithm 1.

Algorithm 1 Identification of Switching Instants

Require:
  Time series dataset 𝒯={x1,x2,,xN},𝒚={y1,,yN}formulae-sequence𝒯subscript𝑥1subscript𝑥2subscript𝑥𝑁𝒚subscript𝑦1subscript𝑦𝑁\mathcal{T}=\{x_{1},x_{2},\cdots,x_{N}\},~{}\bm{y}=\{y_{1},\cdots,y_{N}\}caligraphic_T = { italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_x start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT } , bold_italic_y = { italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_y start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT } and hyperparameters ΔsminΔsubscript𝑠\Delta s_{\min}roman_Δ italic_s start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT.

1:Calculate {Cm(n)}m=1Msuperscriptsubscriptsubscript𝐶𝑚𝑛𝑚1𝑀\{C_{m}(n)\}_{m=1}^{M}{ italic_C start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_n ) } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT and {Pm(n)}m=1Msuperscriptsubscriptsubscript𝑃𝑚𝑛𝑚1𝑀\{P_{m}(n)\}_{m=1}^{M}{ italic_P start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( italic_n ) } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT as per (22) and (23).
2:Decide M𝑀Mitalic_M using the criterion (25) and derive {s^m}m=1Msuperscriptsubscriptsubscript^𝑠𝑚𝑚1𝑀\{\hat{s}_{m}\}_{m=1}^{M}{ over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT with (24).
3:Derive M𝑀Mitalic_M segments 𝒯=m=1M𝒯m𝒯superscriptsubscript𝑚1𝑀subscript𝒯𝑚\mathcal{T}=\cup_{m=1}^{M}\mathcal{T}_{m}caligraphic_T = ∪ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT based on the switching instants {s^m}m=1Msuperscriptsubscriptsubscript^𝑠𝑚𝑚1𝑀\{\hat{s}_{m}\}_{m=1}^{M}{ over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT.
4:return Segmented data 𝒯=m=1M𝒯m,𝒚=m=1M𝒚mformulae-sequence𝒯superscriptsubscript𝑚1𝑀subscript𝒯𝑚𝒚superscriptsubscript𝑚1𝑀superscript𝒚𝑚\mathcal{T}=\cup_{m=1}^{M}\mathcal{T}_{m},~{}\bm{y}=\cup_{m=1}^{M}\bm{y}^{m}caligraphic_T = ∪ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , bold_italic_y = ∪ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT bold_italic_y start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT.

3.3 Identification for the switched linear system by sparsity inducing

In the previous section, the switching instants of system (1) are identified, and segmented data are derived, denoted as D¯={D¯1,,D¯M},D¯m=[𝒚m,𝒯m]pm×(n+1),m=1,,Mformulae-sequenceformulae-sequence¯𝐷subscript¯𝐷1subscript¯𝐷𝑀subscript¯𝐷𝑚superscript𝒚𝑚superscriptsubscript𝒯𝑚topsuperscriptsubscript𝑝𝑚𝑛1𝑚1𝑀\bar{D}=\{\bar{D}_{1},\cdots,\bar{D}_{M}\},~{}\bar{D}_{m}=[\bm{y}^{m},-% \mathcal{T}_{m}^{\top}]\in\mathbb{R}^{p_{m}\times(n+1)},m=1,\cdots,Mover¯ start_ARG italic_D end_ARG = { over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT } , over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = [ bold_italic_y start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , - caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] ∈ blackboard_R start_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT × ( italic_n + 1 ) end_POSTSUPERSCRIPT , italic_m = 1 , ⋯ , italic_M where pmsubscript𝑝𝑚p_{m}italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT represents the number of the samples in D¯msubscript¯𝐷𝑚\bar{D}_{m}over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT. The following assumption can be clearly made throughout the paper according to the identification of the switching instants. Based on Assumption 4, the identification of the discrete states {λk}k=1Nsuperscriptsubscriptsubscript𝜆𝑘𝑘1𝑁\{\lambda_{k}\}_{k=1}^{N}{ italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT in the switched linear system (1) can be transformed into the identification of the discrete states for the segmented data, denoted as {Λm}m=1MsuperscriptsubscriptsubscriptΛ𝑚𝑚1𝑀\{\Lambda_{m}\}_{m=1}^{M}{ roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT, where Λm{1,,S}subscriptΛ𝑚1𝑆\Lambda_{m}\in\{1,\cdots,S\}roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∈ { 1 , ⋯ , italic_S } indicates the order of the active subsystem among the time instants corresponding to D¯msubscript¯𝐷𝑚\bar{D}_{m}over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT. Given that the number of data samples in the data segment 𝒯msubscript𝒯𝑚\mathcal{T}_{m}caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT, denoted as pmsubscript𝑝𝑚p_{m}italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT, exceeds the minimal time interval vn(X)subscript𝑣𝑛𝑋v_{n}(X)italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_X ), and considering rank(𝒯m)n=dim(θ)𝑟𝑎𝑛𝑘subscript𝒯𝑚𝑛dimension𝜃rank(\mathcal{T}_{m})\geq n=\dim(\theta)italic_r italic_a italic_n italic_k ( caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ≥ italic_n = roman_dim ( italic_θ ) as defined in (6), it follows that the linear system of equations 𝒚m=𝒯mθ+𝒆msuperscript𝒚𝑚superscriptsubscript𝒯𝑚top𝜃superscript𝒆𝑚\bm{y}^{m}=\mathcal{T}_{m}^{\top}\theta+\bm{e}^{m}bold_italic_y start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT = caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_θ + bold_italic_e start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT possesses a unique solution for θ𝜃\thetaitalic_θ. Then, if Assumption 3 with Δsmin>vn(X)Δsubscript𝑠𝑚𝑖𝑛subscript𝑣𝑛𝑋\Delta s_{min}>v_{n}(X)roman_Δ italic_s start_POSTSUBSCRIPT italic_m italic_i italic_n end_POSTSUBSCRIPT > italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_X ) and Assumption 4 holds, an analogue of Assumption 1 can be formulated as follows.

Assumption 5.

The data samples of each segment (𝒯m,𝐲m)subscript𝒯𝑚superscript𝐲𝑚(\mathcal{T}_{m},\bm{y}^{m})( caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , bold_italic_y start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) belong to the same subsystem ΛmsubscriptΛ𝑚\Lambda_{m}roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT after the identification for the switching instants of the system (1). There is no segment (𝒯m,𝐲m)subscript𝒯𝑚superscript𝐲𝑚(\mathcal{T}_{m},\bm{y}^{m})( caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , bold_italic_y start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) that fits two different submodels of the switched linear system (1), i.e., 𝐲m=𝒯mθi+𝐞m=𝒯mθj+𝐞mi=j.superscript𝐲𝑚superscriptsubscript𝒯𝑚topsubscript𝜃𝑖superscript𝐞𝑚superscriptsubscript𝒯𝑚topsubscript𝜃𝑗superscript𝐞𝑚𝑖𝑗\bm{y}^{m}=\mathcal{T}_{m}^{\top}\theta_{i}+\bm{e}^{m}=\mathcal{T}_{m}^{\top}% \theta_{j}+\bm{e}^{m}\Rightarrow i=j.bold_italic_y start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT = caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + bold_italic_e start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT = caligraphic_T start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT + bold_italic_e start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ⇒ italic_i = italic_j .

Based on the segmented data D¯={D¯1,,D¯M}¯𝐷subscript¯𝐷1subscript¯𝐷𝑀\bar{D}=\{\bar{D}_{1},\cdots,\bar{D}_{M}\}over¯ start_ARG italic_D end_ARG = { over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT }, the identification problem (17) of the switched linear system (1) can be formulated as an analogue of the sparse optimization (2) with the regressor matrix X𝑋Xitalic_X, the output vector 𝒚𝒚\bm{y}bold_italic_y and J(z)=l(z)=z0𝐽𝑧𝑙𝑧subscriptdelimited-∥∥𝑧0J(z)=l(z)=\lVert z\rVert_{0}italic_J ( italic_z ) = italic_l ( italic_z ) = ∥ italic_z ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT:

minzNsubscript𝑧superscript𝑁\displaystyle\min_{z\in\mathbb{R}^{N}}roman_min start_POSTSUBSCRIPT italic_z ∈ blackboard_R start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT end_POSTSUBSCRIPT m=1MD¯mθ¯0=m=1Mzm0superscriptsubscript𝑚1𝑀subscriptnormsubscript¯𝐷𝑚¯𝜃0superscriptsubscript𝑚1𝑀subscriptnormsuperscript𝑧𝑚0\displaystyle~{}\sum_{m=1}^{M}\|\bar{D}_{m}\bar{\theta}\|_{0}=\sum_{m=1}^{M}\|% z^{m}\|_{0}∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT ∥ over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT over¯ start_ARG italic_θ end_ARG ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT ∥ italic_z start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT (27)
subject to Az=b𝐴𝑧𝑏\displaystyle~{}Az=bitalic_A italic_z = italic_b

where θ¯=[1,θ]¯𝜃superscript1superscript𝜃toptop\bar{\theta}=[1,\theta^{\top}]^{\top}over¯ start_ARG italic_θ end_ARG = [ 1 , italic_θ start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, z=[D¯1,,D¯M]θ¯𝑧superscriptsubscriptsuperscript¯𝐷top1subscriptsuperscript¯𝐷top𝑀top¯𝜃z=[\bar{D}^{\top}_{1},\cdots,\bar{D}^{\top}_{M}]^{\top}\bar{\theta}italic_z = [ over¯ start_ARG italic_D end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG italic_D end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¯ start_ARG italic_θ end_ARG, zm=D¯mθ¯superscript𝑧𝑚subscript¯𝐷𝑚¯𝜃z^{m}=\bar{D}_{m}\bar{\theta}italic_z start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT = over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT over¯ start_ARG italic_θ end_ARG, A=INX(XX)1X,b=A𝒚formulae-sequence𝐴subscript𝐼𝑁superscript𝑋topsuperscripttop𝑋𝑋1𝑋𝑏𝐴𝒚A=I_{N}-X^{\top}(X\top X)^{-1}X,~{}b=A\bm{y}italic_A = italic_I start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT - italic_X start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( italic_X ⊤ italic_X ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_X , italic_b = italic_A bold_italic_y. However, the optimization problem (27) does not take into account the crucial prior knowledge summarized as Assumption 5. To incorporate this knowledge and transform the parameters into θ𝜃\thetaitalic_θ, the identification problem can be formulated as

minθnsubscript𝜃superscript𝑛\displaystyle\min_{\theta\in\mathbb{R}^{n}}roman_min start_POSTSUBSCRIPT italic_θ ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT m=1MpmI{D¯mθ¯0pm}superscriptsubscript𝑚1𝑀subscript𝑝𝑚𝐼subscriptnormsubscript¯𝐷𝑚¯𝜃0subscript𝑝𝑚\displaystyle~{}\sum_{m=1}^{M}p_{m}I\{\|\bar{D}_{m}\bar{\theta}\|_{0}\geq p_{m}\}∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_I { ∥ over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT over¯ start_ARG italic_θ end_ARG ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ≥ italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } (28)
subject to Az=b𝐴𝑧𝑏\displaystyle~{}Az=bitalic_A italic_z = italic_b

where I{}𝐼I\{\cdot\}italic_I { ⋅ } is the indicator function and pmsubscript𝑝𝑚p_{m}italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT represents the number of the samples in D¯msubscript¯𝐷𝑚\bar{D}_{m}over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT.

If Assumption 5 holds and all the subsystems are sufficient excited within the input-output time series data 𝒟𝒟\mathcal{D}caligraphic_D, the solution to the sparse problem (28) with J(z)=z0𝐽𝑧subscriptdelimited-∥∥𝑧0J(z)=\lVert z\rVert_{0}italic_J ( italic_z ) = ∥ italic_z ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is a parameter vector representing one of the subsystem of the system (1) as the following lemma.

Lemma 1.

Assume that the number of the data which belong to the i𝑖iitalic_ith submodel Ni>Svn(X),i=1,,Sformulae-sequencesubscript𝑁𝑖𝑆subscript𝑣𝑛𝑋for-all𝑖1𝑆N_{i}>Sv_{n}(X),~{}\forall i=1,\cdots,Sitalic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT > italic_S italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_X ) , ∀ italic_i = 1 , ⋯ , italic_S, where S𝑆Sitalic_S is the number of the submodels. If Assumption 5 holds, then the solution to the problem (28) θi0subscript𝜃subscript𝑖0\theta_{i_{0}}italic_θ start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT is the parameter of the i0subscript𝑖0i_{0}italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTth subsystem that has generated the most number of data.

Proof 1.

The proof is actually an extension of Lemma 7 in [3]. Suppose by contradiction that the solution θ𝜃\thetaitalic_θ to (28) does not lie in {θ1,,θS}subscript𝜃1subscript𝜃𝑆\{\theta_{1},\cdots,\theta_{S}\}{ italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_θ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT }. And if we let I(θ)={pm:D¯mθ¯=0}𝐼𝜃conditional-setsubscript𝑝𝑚subscript¯𝐷𝑚¯𝜃0I(\theta)=\{p_{m}:\bar{D}_{m}\bar{\theta}=0\}italic_I ( italic_θ ) = { italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT : over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT over¯ start_ARG italic_θ end_ARG = 0 }, then |I(θ)|Nisvn(X)𝐼𝜃subscript𝑁𝑖𝑠subscript𝑣𝑛𝑋|I(\theta)|\geq N_{i}\geq sv_{n}(X)| italic_I ( italic_θ ) | ≥ italic_N start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ italic_s italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_X ), where |I(θ)|𝐼𝜃|I(\theta)|| italic_I ( italic_θ ) | is the sum of elements belonging to I(θ)𝐼𝜃I(\theta)italic_I ( italic_θ ). Assumed that nisubscript𝑛𝑖n_{i}italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT denotes the number of data generated by the i𝑖iitalic_ith submodel whose indices are contained in I(θ)𝐼𝜃I(\theta)italic_I ( italic_θ ). Then we have

|I(θ)|=i=1SniSvn(X).𝐼𝜃superscriptsubscript𝑖1𝑆subscript𝑛𝑖𝑆subscript𝑣𝑛𝑋|I(\theta)|=\sum_{i=1}^{S}n_{i}\geq Sv_{n}(X).| italic_I ( italic_θ ) | = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ italic_S italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_X ) . (29)

In other words, there exists an index j𝑗jitalic_j such that njvn(X)subscript𝑛𝑗subscript𝑣𝑛𝑋n_{j}\geq v_{n}(X)italic_n start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≥ italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_X ). Such data vector forms a matrix 𝒳=[D¯t1jT,,D¯tojjT]Tnj×(n+1)𝒳superscriptsuperscriptsubscript¯𝐷superscriptsubscript𝑡1𝑗𝑇superscriptsubscript¯𝐷superscriptsubscript𝑡subscript𝑜𝑗𝑗𝑇𝑇superscriptsubscript𝑛𝑗𝑛1\mathcal{X}=[\bar{D}_{t_{1}^{j}}^{T},\cdots,\bar{D}_{t_{o_{j}}^{j}}^{T}]^{T}% \in\mathbb{R}^{n_{j}\times(n+1)}caligraphic_X = [ over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT , ⋯ , over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_o start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT × ( italic_n + 1 ) end_POSTSUPERSCRIPT where k=1ojptkj=njsuperscriptsubscript𝑘1subscript𝑜𝑗subscript𝑝superscriptsubscript𝑡𝑘𝑗subscript𝑛𝑗\sum_{k=1}^{o_{j}}p_{t_{k}^{j}}=n_{j}∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_o start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = italic_n start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT. Then, both θ¯¯𝜃\bar{\theta}over¯ start_ARG italic_θ end_ARG and θ¯jsubscript¯𝜃𝑗\bar{\theta}_{j}over¯ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT lie in the kernel space (or the null space) of 𝒳𝒳\mathcal{X}caligraphic_X. Since the initial entries of θ¯¯𝜃\bar{\theta}over¯ start_ARG italic_θ end_ARG and θ¯jsubscript¯𝜃𝑗\bar{\theta}_{j}over¯ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT are both equal to 1111, and njvn(X)subscript𝑛𝑗subscript𝑣𝑛𝑋n_{j}\geq v_{n}(X)italic_n start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ≥ italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_X ), it follows that θ=θj𝜃subscript𝜃𝑗{\theta}={\theta}_{j}italic_θ = italic_θ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, leading to a contradiction with the initial thesis.

Considering Theorem 2 and the definition (10), it follows that θi0subscript𝜃subscript𝑖0\theta_{i_{0}}italic_θ start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT is the unique solution to the combinatorial 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm optimization problem (28) if there is a solution z=[θ¯D¯1,,θ¯D¯M]𝑧superscriptsuperscript¯𝜃topsuperscriptsubscript¯𝐷1topsuperscript¯𝜃topsuperscriptsubscript¯𝐷𝑀toptopz=[\bar{\theta}^{\top}\bar{D}_{1}^{\top},\cdots,\bar{\theta}^{\top}\bar{D}_{M}% ^{\top}]^{\top}italic_z = [ over¯ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , ⋯ , over¯ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT obeying that,

z0<12(1+1μ(A)).subscriptnorm𝑧01211𝜇𝐴\|z\|_{0}<\frac{1}{2}(1+\frac{1}{\mu(A)}).∥ italic_z ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < divide start_ARG 1 end_ARG start_ARG 2 end_ARG ( 1 + divide start_ARG 1 end_ARG start_ARG italic_μ ( italic_A ) end_ARG ) . (30)

Although it is challenging to solve the 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm optimization problem directly, such as PJsubscript𝑃𝐽P_{J}italic_P start_POSTSUBSCRIPT italic_J end_POSTSUBSCRIPT with J(z)=z0𝐽𝑧subscriptnorm𝑧0J(z)=\|z\|_{0}italic_J ( italic_z ) = ∥ italic_z ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, the segmented data and prior knowledge contribute significantly to addressing this sparse problem. We improve the traditional orthogonal matching pursuit (OMP) algorithm [27, 28] to address the sparse problem (28) in Appendix A. However, this 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm based methods exhibit bad robustness against noise and have significant computational complexity [28]. Based on the above reasons, we consider the 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm relaxation of the combinatorial 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm sparse optimization problem (28) with the segmented data,

minθnsubscript𝜃superscript𝑛\displaystyle\min_{\theta\in\mathbb{R}^{n}}{}roman_min start_POSTSUBSCRIPT italic_θ ∈ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT WXz1subscriptdelimited-∥∥subscript𝑊𝑋𝑧1\displaystyle\lVert W_{X}z\rVert_{1}∥ italic_W start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT italic_z ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT (31)
subject to AXz=bXsubscript𝐴𝑋𝑧subscript𝑏𝑋\displaystyle A_{X}z=b_{X}italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT italic_z = italic_b start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT
z=[θ¯D¯1,,θ¯D¯M]𝑧superscriptsuperscript¯𝜃topsuperscriptsubscript¯𝐷1topsuperscript¯𝜃topsuperscriptsubscript¯𝐷𝑀toptop\displaystyle z=[\bar{\theta}^{\top}\bar{D}_{1}^{\top},\cdots,\bar{\theta}^{% \top}\bar{D}_{M}^{\top}]^{\top}italic_z = [ over¯ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , ⋯ , over¯ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT
WX=diag(ν(AX1)Ip1,,ν(AXM)IpM)subscript𝑊𝑋diag𝜈superscriptsubscript𝐴𝑋1subscript𝐼subscript𝑝1𝜈superscriptsubscript𝐴𝑋𝑀subscript𝐼subscript𝑝𝑀\displaystyle W_{X}=\text{diag}(\nu(A_{X}^{1})I_{p_{1}},\cdots,\nu(A_{X}^{M})I% _{p_{M}})italic_W start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT = diag ( italic_ν ( italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT ) italic_I start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , ⋯ , italic_ν ( italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT ) italic_I start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT end_POSTSUBSCRIPT )

where AX=[AX1,,AXM]subscript𝐴𝑋subscriptsuperscript𝐴1𝑋subscriptsuperscript𝐴𝑀𝑋A_{X}=[A^{1}_{X},\cdots,A^{M}_{X}]italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT = [ italic_A start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT , ⋯ , italic_A start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ] is partitioned in column order according to the switching instants {sm}m=1M+1superscriptsubscriptsubscript𝑠𝑚𝑚1𝑀1\{s_{m}\}_{m=1}^{M+1}{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M + 1 end_POSTSUPERSCRIPT, WXsubscript𝑊𝑋W_{X}italic_W start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT is the weighting matrix for increasing the sparsity of z𝑧zitalic_z and ν(AXm)𝜈superscriptsubscript𝐴𝑋𝑚\nu(A_{X}^{m})italic_ν ( italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) is defined as the average of the Euclidean norms of the column vectors in matrix AXm=[asm,asm+1,,asm+11]superscriptsubscript𝐴𝑋𝑚subscript𝑎subscript𝑠𝑚subscript𝑎subscript𝑠𝑚1subscript𝑎subscript𝑠𝑚11A_{X}^{m}=[a_{s_{m}},a_{s_{m}+1},\cdots,a_{s_{m+1}-1}]italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT = [ italic_a start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT , ⋯ , italic_a start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT - 1 end_POSTSUBSCRIPT ],

ν(AXm)=k=smsm+11ak2pm,m=1,,M.formulae-sequence𝜈superscriptsubscript𝐴𝑋𝑚superscriptsubscript𝑘subscript𝑠𝑚subscript𝑠𝑚11subscriptdelimited-∥∥subscript𝑎𝑘2subscript𝑝𝑚𝑚1𝑀\nu(A_{X}^{m})=\frac{\sum_{k=s_{m}}^{s_{m+1}-1}\lVert a_{k}\rVert_{2}}{p_{m}},% m=1,\cdots,M.italic_ν ( italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ) = divide start_ARG ∑ start_POSTSUBSCRIPT italic_k = italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT - 1 end_POSTSUPERSCRIPT ∥ italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_ARG , italic_m = 1 , ⋯ , italic_M . (32)

We have the following theorem to discuss the uniqueness of the solution, θ𝜃\thetaitalic_θ, for the sparse problem (31).

Theorem 3.

If Assumption 5 holds and there is a parameter vector θ𝜃\thetaitalic_θ achieving the error vector z=[θ¯D¯1,,θ¯D¯M]𝑧superscriptsuperscript¯𝜃topsuperscriptsubscript¯𝐷1topsuperscript¯𝜃topsuperscriptsubscript¯𝐷𝑀toptopz=[\bar{\theta}^{\top}\bar{D}_{1}^{\top},\cdots,\bar{\theta}^{\top}\bar{D}_{M}% ^{\top}]^{\top}italic_z = [ over¯ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , ⋯ , over¯ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT based on the segmented data D¯={D¯1,,D¯M}¯𝐷subscript¯𝐷1subscript¯𝐷𝑀\bar{D}=\{\bar{D}_{1},\cdots,\bar{D}_{M}\}over¯ start_ARG italic_D end_ARG = { over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT } such that

z0<1+τ(X)2,subscriptnorm𝑧01𝜏𝑋2\|z\|_{0}<\frac{1+\tau(X)}{2},∥ italic_z ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < divide start_ARG 1 + italic_τ ( italic_X ) end_ARG start_ARG 2 end_ARG , (33)

then θ𝜃\thetaitalic_θ is the unique solution to the 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm optimization problem. τ(X)𝜏𝑋\tau(X)italic_τ ( italic_X ) is defined in (10).

Proof 2.

According to the definition (10), we have τ(X)=μ(A)=μ(AX)𝜏𝑋𝜇𝐴𝜇subscript𝐴𝑋\tau(X)=\mu(A)=\mu(A_{X})italic_τ ( italic_X ) = italic_μ ( italic_A ) = italic_μ ( italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ). Then, if we only focus on the variable z𝑧zitalic_z, the 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm problem (28) and the 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm problem (31) have the unique solution z𝑧zitalic_z with the condition (33) according to Theorem 2. Furthermore, if z𝑧zitalic_z is unique and Assumption (5) holds, θ𝜃\thetaitalic_θ can be formulated uniquely based the z=[θ¯D¯1,,θ¯D¯M]𝑧superscriptsuperscript¯𝜃topsuperscriptsubscript¯𝐷1topsuperscript¯𝜃topsuperscriptsubscript¯𝐷𝑀toptopz=[\bar{\theta}^{\top}\bar{D}_{1}^{\top},\cdots,\bar{\theta}^{\top}\bar{D}_{M}% ^{\top}]^{\top}italic_z = [ over¯ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , ⋯ , over¯ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT.

The unbiasedness and uniqueness of the solution θ𝜃\thetaitalic_θ to the sparse problem have been guaranteed according to aforementioned Lemma 1 and Theorem 3. Then by solving the sparse optimization problem (31), the parameter of one of the subsystem θ𝜃\thetaitalic_θ can be extracted, and we use the following criterion to determine whether the segments belong to the extracted parameter vector θ𝜃\thetaitalic_θ in presence of noise,

I(θ)={D¯m:D¯mθ1ν(D¯m)θ2pmεthres,m=1,,M}𝐼𝜃conditional-setsubscript¯𝐷𝑚formulae-sequencesubscriptnormsubscript¯𝐷𝑚𝜃1𝜈subscript¯𝐷𝑚subscriptdelimited-∥∥𝜃2subscript𝑝𝑚subscript𝜀𝑡𝑟𝑒𝑠𝑚1𝑀\displaystyle I(\theta)=\{\bar{D}_{m}:\frac{\|\bar{D}_{m}\theta\|_{1}}{\nu(% \bar{D}_{m})\lVert\theta\rVert_{2}p_{m}}\leq\varepsilon_{thres},m=1,\cdots,M\}italic_I ( italic_θ ) = { over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT : divide start_ARG ∥ over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_θ ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_ν ( over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ∥ italic_θ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_ARG ≤ italic_ε start_POSTSUBSCRIPT italic_t italic_h italic_r italic_e italic_s end_POSTSUBSCRIPT , italic_m = 1 , ⋯ , italic_M } (34)

where ν()𝜈\nu(\cdot)italic_ν ( ⋅ ) is defined in (32).

Inspired by the reweighted 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT minimization method in [3], an 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm optimization algorithm with momentum method [29] is proposed for the sparse optimization problem (31), detailed in Algorithm 2.

Algorithm 2 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm Optimization Algorithm

Require:
AX(Nn)×N,bXNnformulae-sequencesubscript𝐴𝑋superscript𝑁𝑛𝑁subscript𝑏𝑋superscript𝑁𝑛A_{X}\in\mathbb{R}^{(N-n)\times N},~{}b_{X}\in\mathbb{R}^{N-n}italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT ( italic_N - italic_n ) × italic_N end_POSTSUPERSCRIPT , italic_b start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_N - italic_n end_POSTSUPERSCRIPT, switching instants {sm}m=1M+1superscriptsubscriptsubscript𝑠𝑚𝑚1𝑀1\{s_{m}\}_{m=1}^{M+1}{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M + 1 end_POSTSUPERSCRIPT, segmented data D¯={D¯1,,D¯M}¯𝐷subscript¯𝐷1subscript¯𝐷𝑀\bar{D}=\{\bar{D}_{1},\cdots,\bar{D}_{M}\}over¯ start_ARG italic_D end_ARG = { over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT } and hyperparameters ε0,εthres,α,η,v0subscript𝜀0subscript𝜀𝑡𝑟𝑒𝑠𝛼𝜂subscript𝑣0\varepsilon_{0},\varepsilon_{thres},\alpha,\eta,v_{0}italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ε start_POSTSUBSCRIPT italic_t italic_h italic_r italic_e italic_s end_POSTSUBSCRIPT , italic_α , italic_η , italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT.

1:Initial settings: j=0𝑗0j=0italic_j = 0, v0=0.1subscript𝑣00.1v_{0}=0.1italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0.1
  • Initial weights: W(0)=diag(w1(0)Ip1,,wM(0)IpM)superscript𝑊0𝑑𝑖𝑎𝑔superscriptsubscript𝑤10subscript𝐼subscript𝑝1superscriptsubscript𝑤𝑀0subscript𝐼subscript𝑝𝑀W^{(0)}=diag(w_{1}^{(0)}I_{p_{1}},\cdots,w_{M}^{(0)}I_{p_{M}})italic_W start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = italic_d italic_i italic_a italic_g ( italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , ⋯ , italic_w start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT italic_I start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) with wk(0)=1,k=1,,Nformulae-sequencesuperscriptsubscript𝑤𝑘01𝑘1𝑁w_{k}^{(0)}=1,k=1,\cdots,Nitalic_w start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( 0 ) end_POSTSUPERSCRIPT = 1 , italic_k = 1 , ⋯ , italic_N

2:While θjθ(j1)2ε0subscriptnormsuperscript𝜃𝑗superscript𝜃𝑗12subscript𝜀0\|\theta^{j}-\theta^{(j-1)}\|_{2}\leq\varepsilon_{0}∥ italic_θ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT - italic_θ start_POSTSUPERSCRIPT ( italic_j - 1 ) end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT
  • Solve the convex 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm optimization problem

    θj=argminθnW(j)WL1ϕ0(θ)2\displaystyle\theta^{j}=\arg\min_{\theta\mathbb{R}^{n}}\lVert W^{(j)}W_{L}^{-1% }\phi^{0}(\theta)\rVert_{2}italic_θ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT = roman_arg roman_min start_POSTSUBSCRIPT italic_θ blackboard_R start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ italic_W start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT italic_W start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_ϕ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_θ ) ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT (35)

    where ϕ0(θ)=[θ¯D¯1,,θ¯D¯M]superscriptitalic-ϕ0𝜃superscriptsuperscript¯𝜃topsuperscriptsubscript¯𝐷1topsuperscript¯𝜃topsuperscriptsubscript¯𝐷𝑀toptop\phi^{0}(\theta)=[\bar{\theta}^{\top}\bar{D}_{1}^{\top},\cdots,\bar{\theta}^{% \top}\bar{D}_{M}^{\top}]^{\top}italic_ϕ start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ( italic_θ ) = [ over¯ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , ⋯ , over¯ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT, WL=diag(ν(D¯1)Ip1,,ν(D¯M)IpM)subscript𝑊𝐿diag𝜈subscript¯𝐷1subscript𝐼subscript𝑝1𝜈subscript¯𝐷𝑀subscript𝐼subscript𝑝𝑀W_{L}=\text{diag}(\nu(\bar{D}_{1})I_{p_{1}},\cdots,\nu(\bar{D}_{M})I_{p_{M}})italic_W start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT = diag ( italic_ν ( over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_I start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , ⋯ , italic_ν ( over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ) italic_I start_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT end_POSTSUBSCRIPT ).

  • Update the weights with (32)

    vj+1subscript𝑣𝑗1\displaystyle v_{j+1}italic_v start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT =αvjη1ν(D¯mθ¯j)absent𝛼subscript𝑣𝑗𝜂1𝜈subscript¯𝐷𝑚superscript¯𝜃𝑗\displaystyle=\alpha v_{j}-\eta\frac{1}{\nu(\bar{D}_{m}\bar{\theta}^{j})}= italic_α italic_v start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT - italic_η divide start_ARG 1 end_ARG start_ARG italic_ν ( over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT over¯ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ) end_ARG (36)
    wm(j+1)superscriptsubscript𝑤𝑚𝑗1\displaystyle w_{m}^{(j+1)}italic_w start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_j + 1 ) end_POSTSUPERSCRIPT =1ν(D¯mθ¯j)+vj+1,m=1,,Mformulae-sequenceabsent1𝜈subscript¯𝐷𝑚superscript¯𝜃𝑗subscript𝑣𝑗1𝑚1𝑀\displaystyle=\frac{1}{\nu(\bar{D}_{m}\bar{\theta}^{j})+v_{j+1}},~{}m=1,\cdots,M= divide start_ARG 1 end_ARG start_ARG italic_ν ( over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT over¯ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ) + italic_v start_POSTSUBSCRIPT italic_j + 1 end_POSTSUBSCRIPT end_ARG , italic_m = 1 , ⋯ , italic_M
  • j=j+1𝑗𝑗1j=j+1italic_j = italic_j + 1

3:Concatenate the data,
I(θj)={D¯m:|D¯mθj|ν(D¯m)θj2εthres,m=1,,M},𝐼superscript𝜃𝑗conditional-setsubscript¯𝐷𝑚formulae-sequencesubscript¯𝐷𝑚superscript𝜃𝑗𝜈subscript¯𝐷𝑚subscriptdelimited-∥∥superscript𝜃𝑗2subscript𝜀𝑡𝑟𝑒𝑠𝑚1𝑀\displaystyle I(\theta^{j})=\{\bar{D}_{m}:\frac{|\bar{D}_{m}\theta^{j}|}{\nu(% \bar{D}_{m})\lVert\theta^{j}\rVert_{2}}\leq\varepsilon_{thres},m=1,\cdots,M\},italic_I ( italic_θ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ) = { over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT : divide start_ARG | over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_θ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT | end_ARG start_ARG italic_ν ( over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ) ∥ italic_θ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ≤ italic_ε start_POSTSUBSCRIPT italic_t italic_h italic_r italic_e italic_s end_POSTSUBSCRIPT , italic_m = 1 , ⋯ , italic_M } , (37)
and re-estimate the parameter vector θjsuperscript𝜃𝑗\theta^{j}italic_θ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT as (47).
4:return θj,I(θj)superscript𝜃𝑗𝐼superscript𝜃𝑗\theta^{j},~{}I(\theta^{j})italic_θ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT , italic_I ( italic_θ start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT )

3.4 The complete algorithm

Considering Theorem 3 and the definition of (10), all the parameter vectors {θ1,,θs}subscript𝜃1subscript𝜃𝑠\{\theta_{1},\cdots,\theta_{s}\}{ italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_θ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT } can be derived by solving the sparse problem (28) as Algorithm 2. If one of the subsystem parameters θ𝜃\thetaitalic_θ is extracted, the data segments belonging to this subsystem are removed from the data [θ¯D¯1,,θ¯D¯M]superscriptsuperscript¯𝜃topsuperscriptsubscript¯𝐷1topsuperscript¯𝜃topsuperscriptsubscript¯𝐷𝑀toptop[\bar{\theta}^{\top}\bar{D}_{1}^{\top},\cdots,\bar{\theta}^{\top}\bar{D}_{M}^{% \top}]^{\top}[ over¯ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , ⋯ , over¯ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT and z𝑧zitalic_z can be reformulated based on the remaining data. Obviously, if we want to extract the parameter vectors {θ1,,θs}subscript𝜃1subscript𝜃𝑠\{\theta_{1},\cdots,\theta_{s}\}{ italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_θ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT } one after another uniquely, the condition (33) still need to be satisfied by the remaining data according to Theorem 3.

Then we give the complete condition for the unbiasedness and uniqueness of {θ1,,θs}subscript𝜃1subscript𝜃𝑠\{\theta_{1},\cdots,\theta_{s}\}{ italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_θ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT } in the following theorem, which is mentioned in [3].

Theorem 4.

(Theorem 14141414 in [3]) Consider the data matrix Xn×N𝑋superscript𝑛𝑁X\in\mathbb{R}^{n\times N}italic_X ∈ blackboard_R start_POSTSUPERSCRIPT italic_n × italic_N end_POSTSUPERSCRIPT generated by the system (1) and assume that

N1>Nϑ(X1)>0,subscript𝑁1𝑁italic-ϑsubscript𝑋10\displaystyle N_{1}>N-\vartheta(X_{1})>0,italic_N start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > italic_N - italic_ϑ ( italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) > 0 , (38)
N2>NN1ϑ(X2)>0,subscript𝑁2𝑁subscript𝑁1italic-ϑsubscript𝑋20\displaystyle N_{2}>N-N_{1}-\vartheta(X_{2})>0,italic_N start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT > italic_N - italic_N start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - italic_ϑ ( italic_X start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) > 0 ,
\displaystyle\vdots
NS1>NN1NS2ϑ(XS1)>0subscript𝑁𝑆1𝑁subscript𝑁1subscript𝑁𝑆2italic-ϑsubscript𝑋𝑆10\displaystyle N_{S-1}>N-N_{1}-\cdots-N_{S-2}-\vartheta(X_{S-1})>0italic_N start_POSTSUBSCRIPT italic_S - 1 end_POSTSUBSCRIPT > italic_N - italic_N start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT - ⋯ - italic_N start_POSTSUBSCRIPT italic_S - 2 end_POSTSUBSCRIPT - italic_ϑ ( italic_X start_POSTSUBSCRIPT italic_S - 1 end_POSTSUBSCRIPT ) > 0

where ϑ(X)=1/2(1+1/τ(X))italic-ϑ𝑋1211𝜏𝑋\vartheta(X)=1/2(1+1/\tau(X))italic_ϑ ( italic_X ) = 1 / 2 ( 1 + 1 / italic_τ ( italic_X ) ) with τ(X)𝜏𝑋\tau(X)italic_τ ( italic_X ) defined in (10). Then all the parameter vectors {θ1,,θS}subscript𝜃1subscript𝜃𝑆\{\theta_{1},\cdots,\theta_{S}\}{ italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_θ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT } can be extracted one after another by solving the sparse problem (28).

Therefore, the complete framework for the identification of the switched linear systems (1), including the identification of switching instants, data segmentation and the identification of subsystem parameters, can be formulated as Algorithm 3.

Algorithm 3 Identification for switched linear system with 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm

Require:
  Time series dataset 𝒯={x1,x2,,xN},𝒚={y1,,yN}formulae-sequence𝒯subscript𝑥1subscript𝑥2subscript𝑥𝑁𝒚subscript𝑦1subscript𝑦𝑁\mathcal{T}=\{x_{1},x_{2},\cdots,x_{N}\},~{}\bm{y}=\{y_{1},\cdots,y_{N}\}caligraphic_T = { italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , ⋯ , italic_x start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT } , bold_italic_y = { italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_y start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT } and hyperparameters Δsmin,ε0,εthres,α,η,v0Δsubscript𝑠subscript𝜀0subscript𝜀𝑡𝑟𝑒𝑠𝛼𝜂subscript𝑣0\Delta s_{\min},\varepsilon_{0},\varepsilon_{thres},\alpha,\eta,v_{0}roman_Δ italic_s start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT , italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ε start_POSTSUBSCRIPT italic_t italic_h italic_r italic_e italic_s end_POSTSUBSCRIPT , italic_α , italic_η , italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT.

1:Determine the switching instants {sm}m=1M+1superscriptsubscriptsubscript𝑠𝑚𝑚1𝑀1\{s_{m}\}_{m=1}^{M+1}{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M + 1 end_POSTSUPERSCRIPT and derive the segmented data D¯={D¯1,,D¯M}¯𝐷subscript¯𝐷1subscript¯𝐷𝑀\bar{D}=\{\bar{D}_{1},\cdots,\bar{D}_{M}\}over¯ start_ARG italic_D end_ARG = { over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT } by Algorithm 1 with the hyperparameter ΔsminΔsubscript𝑠\Delta s_{\min}roman_Δ italic_s start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT.
2:Initial settings: S=0,Θ={},E=D¯formulae-sequence𝑆0formulae-sequenceΘ𝐸¯𝐷S=0,~{}\Theta=\{\hbox{}\},~{}E=\bar{D}italic_S = 0 , roman_Θ = { } , italic_E = over¯ start_ARG italic_D end_ARG
3:While E𝐸E\neq\emptysetitalic_E ≠ ∅
  • Based on the rest dataset E𝐸Eitalic_E, derive the parameter vector θ𝜃\thetaitalic_θ and its segment data I(θ)𝐼𝜃I(\theta)italic_I ( italic_θ ) through the 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm problem (31) by Algorithm 2 with ε0,εthres,α,η,v0subscript𝜀0subscript𝜀𝑡𝑟𝑒𝑠𝛼𝜂subscript𝑣0\varepsilon_{0},\varepsilon_{thres},\alpha,\eta,v_{0}italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_ε start_POSTSUBSCRIPT italic_t italic_h italic_r italic_e italic_s end_POSTSUBSCRIPT , italic_α , italic_η , italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT.

  • Update the set: EEI(θ),ΘΘ{θ}formulae-sequence𝐸𝐸𝐼𝜃ΘΘ𝜃E\leftarrow E\cap I(\theta),\Theta\leftarrow\Theta\cup\{\theta\}italic_E ← italic_E ∩ italic_I ( italic_θ ) , roman_Θ ← roman_Θ ∪ { italic_θ }.

  • S=S+1𝑆𝑆1S=S+1italic_S = italic_S + 1

4:return ΘΘ\Thetaroman_Θ and S𝑆Sitalic_S.

4 A new sufficient persistent excitation condition for switched linear system

In this section, we propose a new sufficient persistent excitation condition for switched linear system, based on the constrained switching mechanism, as a comparison with Assumption 1. The difference lies in the fact that in the previous section, we proposed Assumption 3 with Δsmin>vn(X)Δsubscript𝑠𝑚𝑖𝑛subscript𝑣𝑛𝑋\Delta{s_{min}}>v_{n}(X)roman_Δ italic_s start_POSTSUBSCRIPT italic_m italic_i italic_n end_POSTSUBSCRIPT > italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_X ) from the perspective of sparse optimization to address the identification problem, while in this section, we illustrate the sufficiency of the persistent excitation condition from an existential standpoint.

According to the data segmentation D¯={D¯1,,D¯M}¯𝐷subscript¯𝐷1subscript¯𝐷𝑀\bar{D}=\{\bar{D}_{1},\cdots,\bar{D}_{M}\}over¯ start_ARG italic_D end_ARG = { over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT }, if Assumption 4 holds, the PE problem (12) can be formulated as

m=1Mi=1Sζi,mD¯mθ¯i22=0,superscriptsubscript𝑚1𝑀superscriptsubscript𝑖1𝑆subscript𝜁𝑖𝑚superscriptsubscriptnormsubscript¯𝐷𝑚subscript¯𝜃𝑖220\displaystyle\sum_{m=1}^{M}\sum_{i=1}^{S}\zeta_{i,m}\|\bar{D}_{m}\bar{\theta}_% {i}\|_{2}^{2}=0,∑ start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT italic_ζ start_POSTSUBSCRIPT italic_i , italic_m end_POSTSUBSCRIPT ∥ over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT over¯ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 0 , (39)
subject to ζi,m=I{Λm=i}subscript𝜁𝑖𝑚𝐼subscriptΛ𝑚𝑖\displaystyle\zeta_{i,m}=I\{\Lambda_{m}=i\}italic_ζ start_POSTSUBSCRIPT italic_i , italic_m end_POSTSUBSCRIPT = italic_I { roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = italic_i }
i=1Sζi,m=1superscriptsubscript𝑖1𝑆subscript𝜁𝑖𝑚1\displaystyle\sum_{i=1}^{S}\zeta_{i,m}=1∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT italic_ζ start_POSTSUBSCRIPT italic_i , italic_m end_POSTSUBSCRIPT = 1

where Λm{1,,S}subscriptΛ𝑚1𝑆\Lambda_{m}\in\{1,\cdots,S\}roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∈ { 1 , ⋯ , italic_S } indicates the order of the active subsystem among the time instants corresponding to D¯msubscript¯𝐷𝑚\bar{D}_{m}over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT and I{}𝐼I\{\cdot\}italic_I { ⋅ } represents the indicator function. To discuss the uniqueness of the solution to the problem (39), the identification for the switching instants and the data segmentation are further investigated. Considering the segmentation with ek0subscript𝑒𝑘0e_{k}\equiv 0italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≡ 0, we have βm=θΛmsubscript𝛽𝑚subscript𝜃subscriptΛ𝑚\beta_{m}=\theta_{\Lambda_{m}}italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = italic_θ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Then how to guarantee the identifiability of the switching instants or the uniqueness of the solution to the segmentation problem (20) can be quite challenge and confusing.

Suppose that the adjacent two segments D¯m,D¯m+1subscript¯𝐷𝑚subscript¯𝐷𝑚1\bar{D}_{m},\bar{D}_{m+1}over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT are partitioned with the switching instant sm+1subscript𝑠𝑚1s_{m+1}italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT. Then, if there exist some samples fitting two subsystems, xsm+1jθΛm=xsm+1jθΛm+1,j=1,,kformulae-sequencesuperscriptsubscript𝑥subscript𝑠𝑚1𝑗topsubscript𝜃subscriptΛ𝑚superscriptsubscript𝑥subscript𝑠𝑚1𝑗topsubscript𝜃subscriptΛ𝑚1for-all𝑗1superscript𝑘x_{s_{m+1}-j}^{\top}\theta_{\Lambda_{m}}=x_{s_{m+1}-j}^{\top}\theta_{\Lambda_{% m+1}},~{}\forall j=1,\cdots,k^{\prime}italic_x start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT - italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_x start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT - italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , ∀ italic_j = 1 , ⋯ , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, then sm+1=sm+1j,j=1,,kformulae-sequencesuperscriptsubscript𝑠𝑚1subscript𝑠𝑚1𝑗𝑗1superscript𝑘s_{m+1}^{\prime}=s_{m+1}-j,j=1,\cdots,k^{\prime}italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT - italic_j , italic_j = 1 , ⋯ , italic_k start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT can be another solution to the segmentation problem (20) for sm+1subscript𝑠𝑚1s_{m+1}italic_s start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT. This example is provided to illustrate the challenge of identifiability of the switching instants, and to avoid the ambiguity, an assumption for the uniqueness of the solution to the segmentation problem is proposed as Assumption 6.

Assumption 6.

Consider the data segmentation, {sm}m=1M+1superscriptsubscriptsubscript𝑠𝑚𝑚1𝑀1\{s_{m}\}_{m=1}^{M+1}{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M + 1 end_POSTSUPERSCRIPT, {βm}m=1Msuperscriptsubscriptsubscript𝛽𝑚𝑚1𝑀\{\beta_{m}\}_{m=1}^{M}{ italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT and D¯={D¯1,,D¯M}¯𝐷subscript¯𝐷1subscript¯𝐷𝑀\bar{D}=\{\bar{D}_{1},\cdots,\bar{D}_{M}\}over¯ start_ARG italic_D end_ARG = { over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT }, for m=1,,M𝑚1𝑀m=1,\cdots,Mitalic_m = 1 , ⋯ , italic_M:

  1. 1.

    βm{θ1,,θS}subscript𝛽𝑚subscript𝜃1subscript𝜃𝑆\beta_{m}\in\{\theta_{1},\cdots,\theta_{S}\}italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∈ { italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_θ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT }

  2. 2.

    βmβm+1,m=1,,M1formulae-sequencesubscript𝛽𝑚subscript𝛽𝑚1𝑚1𝑀1\beta_{m}\neq\beta_{m+1},m=1,\cdots,M-1italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ≠ italic_β start_POSTSUBSCRIPT italic_m + 1 end_POSTSUBSCRIPT , italic_m = 1 , ⋯ , italic_M - 1.

  3. 3.

    xsmβm=0,xsmβm0,m={1,,M}\{m}formulae-sequencesuperscriptsubscript𝑥subscript𝑠𝑚topsubscript𝛽𝑚0formulae-sequencesuperscriptsubscript𝑥subscript𝑠𝑚topsubscript𝛽superscript𝑚0for-allsuperscript𝑚\1𝑀𝑚x_{s_{m}}^{\top}\beta_{m}=0,x_{s_{m}}^{\top}\beta_{m^{\prime}}\neq 0,\forall m% ^{\prime}=\{1,\cdots,M\}\backslash\{m\}italic_x start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = 0 , italic_x start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_m start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ≠ 0 , ∀ italic_m start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = { 1 , ⋯ , italic_M } \ { italic_m }

  4. 4.

    There exists a subset D¯mD¯m,m=1,,Mformulae-sequencesuperscriptsubscript¯𝐷𝑚subscript¯𝐷𝑚for-all𝑚1𝑀\bar{D}_{m}^{\prime}\subseteq\bar{D}_{m},\forall m=1,\cdots,Mover¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ⊆ over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , ∀ italic_m = 1 , ⋯ , italic_M such that xkD¯mxkxksubscriptsubscript𝑥𝑘superscriptsubscript¯𝐷𝑚subscript𝑥𝑘superscriptsubscript𝑥𝑘top\sum_{x_{k}\in\bar{D}_{m}^{\prime}}x_{k}x_{k}^{\top}∑ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT is nonsingular.

Theorem 5.

In the absence of noise, ek0subscript𝑒𝑘0e_{k}\equiv 0italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≡ 0, if Assumption 6 holds, then {sm}m=1M+1superscriptsubscriptsubscript𝑠𝑚𝑚1𝑀1\{s_{m}\}_{m=1}^{M+1}{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M + 1 end_POSTSUPERSCRIPT, {βm}m=1Msuperscriptsubscriptsubscript𝛽𝑚𝑚1𝑀\{\beta_{m}\}_{m=1}^{M}{ italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT is the unique solution to the problem (20) with fixed M𝑀Mitalic_M.

Proof 3.

We will prove the theorem by the Mathematical Induction and take M𝑀Mitalic_M as the induction variable.

To the basic case, consider M=2,{sm}m=13𝑀2superscriptsubscriptsubscript𝑠𝑚𝑚13M=2,\{s_{m}\}_{m=1}^{3}italic_M = 2 , { italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT with s1=1,s3=N+1formulae-sequencesubscript𝑠11subscript𝑠3𝑁1s_{1}=1,s_{3}=N+1italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 1 , italic_s start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT = italic_N + 1. Obviously, the case M=2𝑀2M=2italic_M = 2 is true and the problem (20) with M=2𝑀2M=2italic_M = 2 have the unique solution according to the Assumption 6.

Then we make the induction that the theorem holds for M1𝑀1M-1italic_M - 1 and consider the case with M𝑀Mitalic_M. Suppose by contradiction that there are two solutions {sm,βm}m=1Msuperscriptsubscriptsubscript𝑠𝑚subscript𝛽𝑚𝑚1𝑀\{s_{m},\beta_{m}\}_{m=1}^{M}{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT and {sm,βm}m=1Msuperscriptsubscriptsuperscriptsubscript𝑠𝑚superscriptsubscript𝛽𝑚𝑚1𝑀\{s_{m}^{\prime},\beta_{m}^{\prime}\}_{m=1}^{M}{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_β start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT to the problem (20) with M𝑀Mitalic_M, and we have s1=s1=1,sM+1=sM+1=N+1formulae-sequencesubscript𝑠1superscriptsubscript𝑠11subscript𝑠𝑀1superscriptsubscript𝑠𝑀1𝑁1s_{1}=s_{1}^{\prime}=1,s_{M+1}=s_{M+1}^{\prime}=N+1italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = 1 , italic_s start_POSTSUBSCRIPT italic_M + 1 end_POSTSUBSCRIPT = italic_s start_POSTSUBSCRIPT italic_M + 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_N + 1. According to the last segment and the Assumption 6, we have βM1βM=βMβM1subscript𝛽𝑀1subscript𝛽𝑀superscriptsubscript𝛽𝑀superscriptsubscript𝛽𝑀1\beta_{M-1}\neq\beta_{M}=\beta_{M}^{\prime}\neq\beta_{M-1}^{\prime}italic_β start_POSTSUBSCRIPT italic_M - 1 end_POSTSUBSCRIPT ≠ italic_β start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT = italic_β start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≠ italic_β start_POSTSUBSCRIPT italic_M - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. Considering the beginning of the last segment, sMsuperscriptsubscript𝑠𝑀s_{M}^{\prime}italic_s start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, and the Assumption 6 with sMsubscript𝑠𝑀s_{M}italic_s start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT, xsMβM=0,xsMβM10formulae-sequencesuperscriptsubscript𝑥subscript𝑠𝑀topsubscript𝛽𝑀0superscriptsubscript𝑥subscript𝑠𝑀topsubscript𝛽𝑀10x_{s_{M}}^{\top}\beta_{M}=0,x_{s_{M}}^{\top}\beta_{M-1}\neq 0italic_x start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT = 0 , italic_x start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_M - 1 end_POSTSUBSCRIPT ≠ 0, we have sMsMsuperscriptsubscript𝑠𝑀subscript𝑠𝑀s_{M}^{\prime}\leq s_{M}italic_s start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≤ italic_s start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT. This is because if not, this mismatch will result in xsMβm¯0,m¯{1,,M1}formulae-sequencesuperscriptsubscript𝑥subscript𝑠𝑀topsuperscriptsubscript𝛽¯𝑚0¯𝑚1𝑀1x_{s_{M}}^{\top}\beta_{\bar{m}}^{\prime}\neq 0,\bar{m}\in\{1,\cdots,M-1\}italic_x start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT over¯ start_ARG italic_m end_ARG end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≠ 0 , over¯ start_ARG italic_m end_ARG ∈ { 1 , ⋯ , italic_M - 1 } which is contradicted to the third condition in Assumption 6. For the same reason with sMsubscript𝑠𝑀s_{M}italic_s start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT and xsMβM=0,xsMβM10formulae-sequencesuperscriptsubscript𝑥superscriptsubscript𝑠𝑀topsubscript𝛽𝑀0superscriptsubscript𝑥superscriptsubscript𝑠𝑀topsuperscriptsubscript𝛽𝑀10x_{s_{M}^{\prime}}^{\top}\beta_{M}=0,x_{s_{M}^{\prime}}^{\top}\beta_{M-1}^{% \prime}\neq 0italic_x start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT = 0 , italic_x start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_β start_POSTSUBSCRIPT italic_M - 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≠ 0, we have sMsMsubscript𝑠𝑀superscriptsubscript𝑠𝑀s_{M}\leq s_{M}^{\prime}italic_s start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT ≤ italic_s start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. Then we can derive sM=sMsubscript𝑠𝑀superscriptsubscript𝑠𝑀s_{M}=s_{M}^{\prime}italic_s start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT = italic_s start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT and the partition for the last data segment D¯msubscript¯𝐷𝑚\bar{D}_{m}over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT is unique. According to the induction, the solution to the partition of the data D¯={D¯1,,D¯M1}¯𝐷subscript¯𝐷1subscript¯𝐷𝑀1\bar{D}=\{\bar{D}_{1},\cdots,\bar{D}_{M-1}\}over¯ start_ARG italic_D end_ARG = { over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M - 1 end_POSTSUBSCRIPT } with M1𝑀1M-1italic_M - 1 segments is unique and the case with M𝑀Mitalic_M is proved. Finally, according to the induction, the theorem is proved.

Although the uniqueness of the solution to the segmentation problem (20) and the identifiability of the switching instants are guaranteed by Theorem 5, The uniqueness of the solution to the segmentation problem (20) and the identifiability of the switching instants are not necessary for the identification of the parameters of the switched linear system (1), although they are guaranteed by Theorem 5. As a matter of fact, Assumption 6 is a sufficient but not necessary condition for Assumption 5. Then, we continue to discuss the persistent excitation condition for the switched linear system based on Assumption 5.

Assumption 7.

For the switched linear system (1) with segmented data D¯={D¯1,,D¯M}¯𝐷subscript¯𝐷1subscript¯𝐷𝑀\bar{D}=\{\bar{D}_{1},\cdots,\bar{D}_{M}\}over¯ start_ARG italic_D end_ARG = { over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT } and switching instants {sm}m=1M+1superscriptsubscriptsubscript𝑠𝑚𝑚1𝑀1\{s_{m}\}_{m=1}^{M+1}{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M + 1 end_POSTSUPERSCRIPT:

  1. 1.

    θiθj,ij,i,j{1,,S}formulae-sequencesubscript𝜃𝑖subscript𝜃𝑗formulae-sequence𝑖𝑗for-all𝑖𝑗1𝑆\theta_{i}\neq\theta_{j},~{}i\neq j,~{}\forall i,j\in\{1,\cdots,S\}italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≠ italic_θ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , italic_i ≠ italic_j , ∀ italic_i , italic_j ∈ { 1 , ⋯ , italic_S }.

  2. 2.

    Assumption 5 holds.

  3. 3.

    There exists a subset D¯mD¯m,m=1,,Mformulae-sequencesuperscriptsubscript¯𝐷𝑚subscript¯𝐷𝑚for-all𝑚1𝑀\bar{D}_{m}^{\prime}\subseteq\bar{D}_{m},\forall m=1,\cdots,Mover¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ⊆ over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , ∀ italic_m = 1 , ⋯ , italic_M such that xkD¯mxkxksubscriptsubscript𝑥𝑘superscriptsubscript¯𝐷𝑚subscript𝑥𝑘superscriptsubscript𝑥𝑘top\sum_{x_{k}\in\bar{D}_{m}^{\prime}}x_{k}x_{k}^{\top}∑ start_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT is nonsingular.

  4. 4.

    For any segment D¯msubscript¯𝐷𝑚\bar{D}_{m}over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT, there exists xkmD¯msubscript𝑥superscript𝑘𝑚subscript¯𝐷𝑚x_{k^{m}}\in\bar{D}_{m}italic_x start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∈ over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT such that xkmθΛm0,i{1,,S}\{Λm}formulae-sequencesuperscriptsubscript𝑥superscript𝑘𝑚topsubscript𝜃subscriptΛ𝑚0for-all𝑖\1𝑆subscriptΛ𝑚x_{k^{m}}^{\top}\theta_{\Lambda_{m}}\neq 0,\forall i\in\{1,\cdots,S\}% \backslash\{\Lambda_{m}\}italic_x start_POSTSUBSCRIPT italic_k start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_θ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT ≠ 0 , ∀ italic_i ∈ { 1 , ⋯ , italic_S } \ { roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT }.

Theorem 6.

In the absence of noise ek0,kNformulae-sequencesubscript𝑒𝑘0for-all𝑘superscript𝑁e_{k}\equiv 0,~{}\forall k\in\mathbb{R}^{N}italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ≡ 0 , ∀ italic_k ∈ blackboard_R start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT and suppose Assumption 7 holds. Then the segmented data D¯={D¯1,,D¯M}¯𝐷subscript¯𝐷1subscript¯𝐷𝑀\bar{D}=\{\bar{D}_{1},\cdots,\bar{D}_{M}\}over¯ start_ARG italic_D end_ARG = { over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT } and {ζi,m,i=1,,S}m=1M\{\zeta_{i,m},i=1,\cdots,S\}_{m=1}^{M}{ italic_ζ start_POSTSUBSCRIPT italic_i , italic_m end_POSTSUBSCRIPT , italic_i = 1 , ⋯ , italic_S } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT are persistently exciting for the switched linear system (1).

Proof 4.

Considering each segment D¯msubscript¯𝐷𝑚\bar{D}_{m}over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT for m=1,,M𝑚1𝑀m=1,\cdots,Mitalic_m = 1 , ⋯ , italic_M, we have the unique solution to the equation D¯mθ¯22=0superscriptsubscriptnormsubscript¯𝐷𝑚¯𝜃220\|\bar{D}_{m}\bar{\theta}\|_{2}^{2}=0∥ over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT over¯ start_ARG italic_θ end_ARG ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 0 with θ¯=θ¯Λm,ζΛm,m=1,ζi,m=0,i{1,,S}\{Λm}formulae-sequence¯𝜃subscript¯𝜃subscriptΛ𝑚formulae-sequencesubscript𝜁subscriptΛ𝑚𝑚1formulae-sequencesubscript𝜁𝑖𝑚0for-all𝑖\1𝑆subscriptΛ𝑚\bar{\theta}=\bar{\theta}_{\Lambda_{m}},\zeta_{\Lambda_{m},m}=1,\zeta_{i,m}=0,% \forall i\in\{1,\cdots,S\}\backslash\{\Lambda_{m}\}over¯ start_ARG italic_θ end_ARG = over¯ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_ζ start_POSTSUBSCRIPT roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT , italic_m end_POSTSUBSCRIPT = 1 , italic_ζ start_POSTSUBSCRIPT italic_i , italic_m end_POSTSUBSCRIPT = 0 , ∀ italic_i ∈ { 1 , ⋯ , italic_S } \ { roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } according to the fourth condition of Assumption 5. Then ignoring the permutation of the subsystem order (1,,S)1𝑆(1,\cdots,S)( 1 , ⋯ , italic_S ), {θi,ζi,m,m=1,,M}i=1S\{\theta_{i},\zeta_{i,m},m=1,\cdots,M\}_{i=1}^{S}{ italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_ζ start_POSTSUBSCRIPT italic_i , italic_m end_POSTSUBSCRIPT , italic_m = 1 , ⋯ , italic_M } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT is the unique solution to the segmentation problem (39) and D¯={D¯1,,D¯M}¯𝐷subscript¯𝐷1subscript¯𝐷𝑀\bar{D}=\{\bar{D}_{1},\cdots,\bar{D}_{M}\}over¯ start_ARG italic_D end_ARG = { over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT } and {ζi,m,i=1,,S}m=1M\{\zeta_{i,m},i=1,\cdots,S\}_{m=1}^{M}{ italic_ζ start_POSTSUBSCRIPT italic_i , italic_m end_POSTSUBSCRIPT , italic_i = 1 , ⋯ , italic_S } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT are persistently exciting.

5 Experiments

In this section, experiments with both periodic switching instants and random switching instants are conducted to assess the performance of our identification methods for switched linear systems.

5.1 Numerical Experiments with periodic switching instants

In this section, our identification algorithms are applied into the switched linear system (1) composed of three linear subsystems of order two na=nb=2,n=4formulae-sequencesubscript𝑛𝑎subscript𝑛𝑏2𝑛4n_{a}=n_{b}=2,n=4italic_n start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT = italic_n start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT = 2 , italic_n = 4 which is similar in [3] with their parameters λk{1,2,3},k=1,,Nformulae-sequencesubscript𝜆𝑘123𝑘1𝑁\lambda_{k}\in\{1,2,3\},k=1,\cdots,Nitalic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ { 1 , 2 , 3 } , italic_k = 1 , ⋯ , italic_N and {θi}i=13superscriptsubscriptsubscript𝜃𝑖𝑖13\{\theta_{i}\}_{i=1}^{3}{ italic_θ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT:

θ1subscript𝜃1\displaystyle\theta_{1}italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT =[0.4,0.25,0.15,0.08]absentsuperscript0.40.250.150.08top\displaystyle=[-0.4,0.25,-0.15,0.08]^{\top}= [ - 0.4 , 0.25 , - 0.15 , 0.08 ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT (40)
θ2subscript𝜃2\displaystyle\theta_{2}italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT =[0.55,0.58,1.1,1.2]absentsuperscript0.550.581.11.2top\displaystyle=[0.55,-0.58,-1.1,1.2]^{\top}= [ 0.55 , - 0.58 , - 1.1 , 1.2 ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT
θ3subscript𝜃3\displaystyle\theta_{3}italic_θ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT =[1,0.24,0.65,0.3].absentsuperscript10.240.650.3top\displaystyle=[1,-0.24,-0.65,0.3]^{\top}.= [ 1 , - 0.24 , - 0.65 , 0.3 ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT .

The time series input-output data 𝒟={(uk,yk)k=1N}𝒟superscriptsubscriptsubscript𝑢𝑘subscript𝑦𝑘𝑘1𝑁\mathcal{D}=\{(u_{k},y_{k})_{k=1}^{N}\}caligraphic_D = { ( italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT } with N=1000𝑁1000N=1000italic_N = 1000 samples is generated by (1) with the above parameters and the following conditions:

  1. 1.

    The input {uk}k=1Nsuperscriptsubscriptsubscript𝑢𝑘𝑘1𝑁\{u_{k}\}_{k=1}^{N}{ italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT is generated from Gaussian distribution with zero mean and variance unity.

  2. 2.

    The Gaussian noise term {ek}k=1Nsuperscriptsubscriptsubscript𝑒𝑘𝑘1𝑁\{e_{k}\}_{k=1}^{N}{ italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT with zero mean is generated such that the Signal to Noise Ratio (SNR) is equal to 30303030dB with respect to the output {yk}k=1Nsuperscriptsubscriptsubscript𝑦𝑘𝑘1𝑁\{y_{k}\}_{k=1}^{N}{ italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT by adjusting the variance.

  3. 3.

    The switching instants are periodic {sm}m=211={100,200,,1000}superscriptsubscriptsubscript𝑠𝑚𝑚2111002001000\{s_{m}\}_{m=2}^{11}=\{100,200,\cdots,1000\}{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 11 end_POSTSUPERSCRIPT = { 100 , 200 , ⋯ , 1000 } with the discrete state of each segment {Λm}m=110={1,2,3,1,2,1,2,1,3,2}superscriptsubscriptsubscriptΛ𝑚𝑚1101231212132\{\Lambda_{m}\}_{m=1}^{10}=\{1,2,3,1,2,1,2,1,3,2\}{ roman_Λ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 10 end_POSTSUPERSCRIPT = { 1 , 2 , 3 , 1 , 2 , 1 , 2 , 1 , 3 , 2 }.

Although the input {uk}k=1Nsuperscriptsubscriptsubscript𝑢𝑘𝑘1𝑁\{u_{k}\}_{k=1}^{N}{ italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT is generated randomly, the intersection of the kernel spaces corresponding to subsystem parameters θ1,θ2,θ3subscript𝜃1subscript𝜃2subscript𝜃3\theta_{1},\theta_{2},\theta_{3}italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT are inevitable, which is summarized as follows:

|{I(θ1)I(θ2)}|/N=30.4%𝐼subscript𝜃1𝐼subscript𝜃2𝑁percent30.4\displaystyle|\{I(\theta_{1})\cap I(\theta_{2})\}|/N=30.4\%| { italic_I ( italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ∩ italic_I ( italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) } | / italic_N = 30.4 % (41)
|{I(θ1)I(θ3)}|/N=26.9%𝐼subscript𝜃1𝐼subscript𝜃3𝑁percent26.9\displaystyle|\{I(\theta_{1})\cap I(\theta_{3})\}|/N=26.9\%| { italic_I ( italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ∩ italic_I ( italic_θ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) } | / italic_N = 26.9 %
|{I(θ2)I(θ3)}|/N=32.1%𝐼subscript𝜃2𝐼subscript𝜃3𝑁percent32.1\displaystyle|\{I(\theta_{2})\cap I(\theta_{3})\}|/N=32.1\%| { italic_I ( italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ∩ italic_I ( italic_θ start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) } | / italic_N = 32.1 %

where |||\cdot|| ⋅ | denotes the cardinality of the set and I(θ)𝐼𝜃I(\theta)italic_I ( italic_θ ) is defined in (34) with εthres=104subscript𝜀𝑡𝑟𝑒𝑠superscript104\varepsilon_{thres}=10^{-4}italic_ε start_POSTSUBSCRIPT italic_t italic_h italic_r italic_e italic_s end_POSTSUBSCRIPT = 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT. The first 800800800800 samples are defined as training set while the last 200200200200 samples are used for test set. A criterion named ”Fit” is used to estimate the fitting error and the prediction accuracy which is proposed in [30]:

Fit=(1y^y2yy¯𝟏N2)×100%\text{Fit}=\left(1-\frac{\lVert\hat{y}-y\|_{2}}{\lVert y-\bar{y}\bm{1}_{N}\|_{% 2}}\right)\times 100\%Fit = ( 1 - divide start_ARG ∥ over^ start_ARG italic_y end_ARG - italic_y ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG start_ARG ∥ italic_y - over¯ start_ARG italic_y end_ARG bold_1 start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG ) × 100 % (42)

where y𝑦yitalic_y is the true output, y^^𝑦\hat{y}over^ start_ARG italic_y end_ARG is the model output, y¯¯𝑦\bar{y}over¯ start_ARG italic_y end_ARG is the mean of y𝑦yitalic_y and 𝟏Nsubscript1𝑁\bm{1}_{N}bold_1 start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT represents the N𝑁Nitalic_N-dimensional column vector with all entries equal to one.

Then the reproducible algorithm in [3] is compared with our proposed Algorithm 3 which utilizes 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm optimization, with hyperparameters set as outlined in Table 1. The prediction results are depicted in Fig. 2. Additionally, the improved OMP algorithm in Appendix A is employed to identify the segmented data as for further comparison.

Table 1: Parameter Settings
Parameters Value
ΔsminsubscriptΔsubscript𝑠𝑚𝑖𝑛\Delta_{s_{min}}roman_Δ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_m italic_i italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT 10101010
ε0subscript𝜀0\varepsilon_{0}italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT 0.0010.0010.0010.001
εthressubscript𝜀𝑡𝑟𝑒𝑠\varepsilon_{thres}italic_ε start_POSTSUBSCRIPT italic_t italic_h italic_r italic_e italic_s end_POSTSUBSCRIPT 1×1041superscript1041\times 10^{-4}1 × 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT
α𝛼\alphaitalic_α 0.90.90.90.9
η𝜂\etaitalic_η 0.010.010.010.01
v0subscript𝑣0v_{0}italic_v start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT 0.10.10.10.1
Refer to caption
Figure 2: 300 identification and 200 test data with SNR=30dB𝑆𝑁𝑅30𝑑𝐵SNR=30dBitalic_S italic_N italic_R = 30 italic_d italic_B. The blue solid line, marked with ”Origin”, represents the true output. The green solid line, marked with ”Sparse”, denotes the model prediction by the reproducible algorithm in [3]. The red dotted line, marked as ”0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm” represents the model prediction by the improved OMP algorithm, as Algorithm 4 in Appendix A while the purple dot-dashed line, marked as ”1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm” represents the result of Algorithm 3 with 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm.

To test the robustness of the identification algorithm for the switched linear system, 100100100100 different independent runs are also conducted as the Monte Carlo experiments. The results for the single run and the Monte Carlo experiments are summarized in Table 2 and Table 3. The histogram of the 100100100100 independent runs for Algorithm 3 with 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm is depicted in Fig. 3 while the result for 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm is shown in Fig. 4.

As depicted in Fig. 2 and the summary in Table 2 and Table 3, illustrating the prediction results, both \ellroman_ℓ-norm and 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm methods in Algorithm 3 exhibit superior performance in reconstructing the true system output compared to the reproducible identification algorithm in [3] and Algorithm 3 based on the identification of switching instants exhibits better prediction accuracy in output fitting with periodic switching instants. Moreover, Algorithm 3 with 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm exhibits stronger robustness against noise than 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm.

Table 2: Comparison of the proposed algorithms with periodic switching instants
Algorithm Sparse 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm
Fit (Single run in Fig. 2) 52.14% 85.17% 91.16%
Average Fit (Monte Carlo) 47.02% 90.67% 92.51%
Table 3: Comparison of the proposed identification algorithms over 100100100100 independent runs
Method θ^1subscript^𝜃1\hat{\theta}_{1}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT θ^2subscript^𝜃2\hat{\theta}_{2}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT θ^3subscript^𝜃3\hat{\theta}_{3}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT
0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT norm 0.321±0.0860.166±0.0750.260±0.0890.210±0.117plus-or-minus0.3210.086plus-or-minus0.1660.075plus-or-minus0.2600.089plus-or-minus0.2100.117\begin{aligned} -0.321\pm 0.086\\ 0.166\pm 0.075\\ -0.260\pm 0.089\\ 0.210\pm 0.117\end{aligned}start_ROW start_CELL - 0.321 ± 0.086 end_CELL end_ROW start_ROW start_CELL 0.166 ± 0.075 end_CELL end_ROW start_ROW start_CELL - 0.260 ± 0.089 end_CELL end_ROW start_ROW start_CELL 0.210 ± 0.117 end_CELL end_ROW 0.505±0.0180.468±0.0720.986±0.0550.992±0.145plus-or-minus0.5050.018plus-or-minus0.4680.072plus-or-minus0.9860.055plus-or-minus0.9920.145\begin{aligned} 0.505\pm 0.018\\ -0.468\pm 0.072\\ -0.986\pm 0.055\\ 0.992\pm 0.145\\ \end{aligned}start_ROW start_CELL 0.505 ± 0.018 end_CELL end_ROW start_ROW start_CELL - 0.468 ± 0.072 end_CELL end_ROW start_ROW start_CELL - 0.986 ± 0.055 end_CELL end_ROW start_ROW start_CELL 0.992 ± 0.145 end_CELL end_ROW 0.638±0.1140.082±0.0480.624±0.0340.156±0.065plus-or-minus0.6380.114plus-or-minus0.0820.048plus-or-minus0.6240.034plus-or-minus0.1560.065\begin{aligned} 0.638\pm 0.114\\ -0.082\pm 0.048\\ -0.624\pm 0.034\\ 0.156\pm 0.065\end{aligned}start_ROW start_CELL 0.638 ± 0.114 end_CELL end_ROW start_ROW start_CELL - 0.082 ± 0.048 end_CELL end_ROW start_ROW start_CELL - 0.624 ± 0.034 end_CELL end_ROW start_ROW start_CELL 0.156 ± 0.065 end_CELL end_ROW
1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT norm 0.368±0.0430.221±0.0040.155±0.0210.086±0.001plus-or-minus0.3680.043plus-or-minus0.2210.004plus-or-minus0.1550.021plus-or-minus0.0860.001\begin{aligned} -0.368\pm 0.043\\ 0.221\pm 0.004\\ -0.155\pm 0.021\\ 0.086\pm 0.001\end{aligned}start_ROW start_CELL - 0.368 ± 0.043 end_CELL end_ROW start_ROW start_CELL 0.221 ± 0.004 end_CELL end_ROW start_ROW start_CELL - 0.155 ± 0.021 end_CELL end_ROW start_ROW start_CELL 0.086 ± 0.001 end_CELL end_ROW 0.550±0.0000.579±0.0001.096±0.0001.191±0.001plus-or-minus0.5500.000plus-or-minus0.5790.000plus-or-minus1.0960.000plus-or-minus1.1910.001\begin{aligned} 0.550\pm 0.000\\ -0.579\pm 0.000\\ -1.096\pm 0.000\\ 1.191\pm 0.001\end{aligned}start_ROW start_CELL 0.550 ± 0.000 end_CELL end_ROW start_ROW start_CELL - 0.579 ± 0.000 end_CELL end_ROW start_ROW start_CELL - 1.096 ± 0.000 end_CELL end_ROW start_ROW start_CELL 1.191 ± 0.001 end_CELL end_ROW 0.832±0.0060.182±0.0470.648±0.0000.302±0.024plus-or-minus0.8320.006plus-or-minus0.1820.047plus-or-minus0.6480.000plus-or-minus0.3020.024\begin{aligned} 0.832\pm 0.006\\ -0.182\pm 0.047\\ -0.648\pm 0.000\\ 0.302\pm 0.024\\ \end{aligned}start_ROW start_CELL 0.832 ± 0.006 end_CELL end_ROW start_ROW start_CELL - 0.182 ± 0.047 end_CELL end_ROW start_ROW start_CELL - 0.648 ± 0.000 end_CELL end_ROW start_ROW start_CELL 0.302 ± 0.024 end_CELL end_ROW
Refer to caption
Figure 3: The histogram of Fit for 100100100100 independent runs by Algorithm 3 with 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm.
Refer to caption
Figure 4: The histogram of Fit for 100100100100 independent runs by Algorithm 3 with 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm.

5.2 Numerical Experiment with random switching instants

In this section, experiments are conducted to test the identification of switching instants and its impact on the identification for the parameters of switched linear systems (1). We consider a switched system (1) with two subsystems:

θ1subscript𝜃1\displaystyle\theta_{1}italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT =[0.9,0.2,0.16,0.2]absentsuperscript0.90.20.160.2top\displaystyle=[-0.9,-0.2,0.16,0.2]^{\top}= [ - 0.9 , - 0.2 , 0.16 , 0.2 ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT (43)
θ2subscript𝜃2\displaystyle\theta_{2}italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT =[0.8,0.1,0.26,0.15]absentsuperscript0.80.10.260.15top\displaystyle=[-0.8,-0.1,0.26,0.15]^{\top}= [ - 0.8 , - 0.1 , 0.26 , 0.15 ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT

where the number of the modes S=2𝑆2S=2italic_S = 2, the mode λk{0,1}subscript𝜆𝑘01\lambda_{k}\in\{0,1\}italic_λ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∈ { 0 , 1 }, the input uk𝒩(0,1)subscript𝑢𝑘𝒩01u_{k}\thicksim\mathcal{N}(0,1)italic_u start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∼ caligraphic_N ( 0 , 1 ), noise ek𝒩(0,σ2)subscript𝑒𝑘𝒩0superscript𝜎2e_{k}\thicksim\mathcal{N}(0,\sigma^{2})italic_e start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∼ caligraphic_N ( 0 , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) and the number of the time segments M=15𝑀15M=15italic_M = 15. 14141414 switching instants are randomly selected between 1111 and 1000100010001000. The first 800800800800 samples are used to identify the switched linear system (1) and the last 200200200200 samples are regarded as the test data. In the first experiment, the output yksubscript𝑦𝑘y_{k}italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is noiseless, with σ=0,SNR=formulae-sequence𝜎0𝑆𝑁𝑅\sigma=0,SNR=\inftyitalic_σ = 0 , italic_S italic_N italic_R = ∞, and in the second experiment, we set the variance of the noise σ𝜎\sigmaitalic_σ so that SNR=10𝑆𝑁𝑅10SNR=10italic_S italic_N italic_R = 10. We also compute the number of the elements in the intersection of the kernel spaces corresponding to subsystem parameters θ1,θ2subscript𝜃1subscript𝜃2\theta_{1},\theta_{2}italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT:

SNR=:|{I(θ1)I(θ2)}|/N=51.7%:𝑆𝑁𝑅𝐼subscript𝜃1𝐼subscript𝜃2𝑁percent51.7\displaystyle SNR=\infty:~{}|\{I(\theta_{1})\cap I(\theta_{2})\}|/N=51.7\%italic_S italic_N italic_R = ∞ : | { italic_I ( italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ∩ italic_I ( italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) } | / italic_N = 51.7 % (44)
SNR=10:|{I(θ1)I(θ2)}|/N=38.6%.:𝑆𝑁𝑅10𝐼subscript𝜃1𝐼subscript𝜃2𝑁percent38.6\displaystyle SNR=10:~{}|\{I(\theta_{1})\cap I(\theta_{2})\}|/N=38.6\%.italic_S italic_N italic_R = 10 : | { italic_I ( italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ∩ italic_I ( italic_θ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) } | / italic_N = 38.6 % .

The results of the identification for the switching instants based on both train data and test data are summarized in Table 4 and Table 5 while the results of the identified system parameters on test data are summarized in Table 6. The prediction results are depicted in Fig. 5 with SNR=𝑆𝑁𝑅SNR=\inftyitalic_S italic_N italic_R = ∞ and Fig. 6 with SNR=10𝑆𝑁𝑅10SNR=10italic_S italic_N italic_R = 10. The results of the above experiments indicate that our proposed identification algorithms for switching instants exhibit strong robustness and the robust identification of switching instants can assist in achieving higher accuracy in the identification of system parameters than the algorithm proposed in [3].

Table 4: Identification for switching instants With SNR=SNR\text{SNR}=\inftySNR = ∞
Index m𝑚mitalic_m 2 3 4 5 6 7 8
True smsubscript𝑠𝑚s_{m}italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT 34 57 237 295 451 605 636
Result s^msubscript^𝑠𝑚\hat{s}_{m}over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT 33 56 236 295 450 604 636
Index m𝑚mitalic_m 9 10 11 12 13 14 15
True smsubscript𝑠𝑚s_{m}italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT 715 770 777 822 845 962 968
Result s^msubscript^𝑠𝑚\hat{s}_{m}over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT 714 769 777 821 844 961 968
Table 5: Identification for switching instants With SNR=10SNR10\text{SNR}=10SNR = 10
Index m𝑚mitalic_m 2 3 4 5 6 7 8
True smsubscript𝑠𝑚s_{m}italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT 39 59 239 269 439 579 599
Result s^msubscript^𝑠𝑚\hat{s}_{m}over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT 39 54 239 268 439 574 599
Index m𝑚mitalic_m 9 10 11 12 13 14 15
True smsubscript𝑠𝑚s_{m}italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT 659 729 749 779 809 939 969
Result s^msubscript^𝑠𝑚\hat{s}_{m}over^ start_ARG italic_s end_ARG start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT 668 728 751 778 806 935 969
Table 6: Comparison of the proposed algorithms with random switching instants
Algorithm Sparse Zero norm One norm
Fit (SNR=𝑆𝑁𝑅SNR=\inftyitalic_S italic_N italic_R = ∞) 79.79.14% 96.03% 98.02%
Fit (SNR=10𝑆𝑁𝑅10SNR=10italic_S italic_N italic_R = 10) 31.22% 39.34% 79.01%
Refer to caption
Figure 5: 500 identification and 200 test data with SNR=𝑆𝑁𝑅SNR=\inftyitalic_S italic_N italic_R = ∞.
Refer to caption
Figure 6: 500 identification and 200 test data with SNR=10𝑆𝑁𝑅10SNR=10italic_S italic_N italic_R = 10.

5.3 Case study on high speed train modeling

In this section, we adopt data collected from field trials on CRH3 high speed train [31] in China to investigate the performance of our proposed Algorithm 3 as compared to the linear system by the least squares identification without data segmentation.

High speed trains exhibit different modalities under various states and inputs, which can be modeled as a switched linear system. The output vector yksubscript𝑦𝑘y_{k}italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and the regressor vector xksubscript𝑥𝑘x_{k}italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT in the switched linear system (1) are defined as

yksubscript𝑦𝑘\displaystyle y_{k}italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT =[vk,sk]absentsuperscriptsubscript𝑣𝑘subscript𝑠𝑘top\displaystyle=[v_{k},s_{k}]^{\top}= [ italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT (45)
xksubscript𝑥𝑘\displaystyle x_{k}italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT =[yk,,yk5,σk,,σk5]absentsuperscriptsubscript𝑦𝑘topsuperscriptsubscript𝑦𝑘5topsubscript𝜎𝑘subscript𝜎𝑘5\displaystyle=[y_{k}^{\top},\cdots,y_{k-5}^{\top},\sigma_{k},\cdots,\sigma_{k-% 5}]= [ italic_y start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , ⋯ , italic_y start_POSTSUBSCRIPT italic_k - 5 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT , italic_σ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , ⋯ , italic_σ start_POSTSUBSCRIPT italic_k - 5 end_POSTSUBSCRIPT ]

where vksubscript𝑣𝑘v_{k}italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, sksubscript𝑠𝑘s_{k}italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT and σksubscript𝜎𝑘\sigma_{k}italic_σ start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT represents the velocity, position and the motor power output variable of the train.

Refer to caption
Figure 7: High speed train modeling.

The results of model prediction and error output are depicted in Fig. 7. It is evident that Algorithm 3 based on the switched linear system with data segmentation and sparsity inducing methods has much smaller fitting error than the traditional linear model. This indicates that the segmentation of time series data and the identification of switching rules can significantly improve the performance of high speed train modeling.

6 Conclusion

In this paper, we introduce a new two-stage identification algorithm for switched linear systems with constrained switching mechanisms. Initially, the switching instants are determined using the iterative dynamic programming method. Subsequently, system parameters are identified based on data segmentation between these switching instants using sparse inducing methods. We formulate a specific combinatorial 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm optimization problem, which can be relaxed into an 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm optimization problem for ease of solution, to decouple the identification of discrete mode states and system parameters. Furthermore, we discuss the unbiasedness of our algorithm’s identification in detail. Additionally, we propose a new persistence of excitation criterion for switched linear systems based on the constrained switching mechanism. Experimental results demonstrate robustness in identifying switching instants, leading to improved prediction accuracy of the identified switched linear system.

{ack}

This work is supported in part by the National Natural Science Foundation of China under Grant 61933015 and Grant 61803163, in part by the Bei**g National Research Center for Information Science and Technology (BNRist) Program under Grant BNR2019TD01009, and in part by the National Innovation Center of High Speed Train R&D project “Modeling and comprehensive intelligent optimization for new high-efficiency urban rail transit system” (Grant No. CX/KJ-2020-0006).

References

  • [1] Louis Massucci, Fabien Lauer, and Marion Gilson. A statistical learning perspective on switched linear system identification. Automatica, 145:110532, 2022.
  • [2] Andrea Garulli, Simone Paoletti, and Antonio Vicino. A survey on switched and piecewise affine system identification. IFAC Proceedings Volumes, 45(16):344–355, 2012.
  • [3] Laurent Bako. Identification of switched linear systems via sparse optimization. Automatica, 47(4):668–677, 2011.
  • [4] Louis Massucci, Fabien Lauer, and Marion Gilson. Structural risk minimization for switched system identification. In 2020 59th IEEE Conference on Decision and Control (CDC), pages 1002–1007. IEEE, 2020.
  • [5] Vittorio De Iuliis, Francesco Smarra, Costanzo Manes, and Alessandro D’Innocenzo. On the stability of switched arx models, with an application to learning via regression trees. IFAC-PapersOnLine, 54(5):61–66, 2021.
  • [6] Lamaa Sellami, Salah Zidi, and Kamel Abderrahim. Self-adaptative multi-kernel algorithm for switched linear systems identification. International Journal of Modelling, Identification and Control, 31(1):103–111, 2019.
  • [7] Yingwei Du, Fangzhou Liu, Jianbin Qiu, and Martin Buss. A semi-supervised learning approach for identification of piecewise affine systems. IEEE Transactions on Circuits and Systems I: Regular Papers, 67(10):3521–3532, 2020.
  • [8] René Vidal, Stefano Soatto, Yi Ma, and Shankar Sastry. An algebraic geometric approach to the identification of a class of linear hybrid systems. In 42nd IEEE International Conference on Decision and Control (IEEE Cat. No. 03CH37475), volume 1, pages 167–172. IEEE, 2003.
  • [9] René Vidal and Brian DO Anderson. Recursive identification of switched arx hybrid models: exponential convergence and persistence of excitation. In 2004 43rd IEEE Conference on Decision and Control (CDC)(IEEE Cat. No. 04CH37601), volume 1, pages 32–37. IEEE, 2004.
  • [10] Laurent Bako and René Vidal. Algebraic identification of mimo sarx models. In Hybrid Systems: Computation and Control: 11th International Workshop, HSCC 2008, St. Louis, MO, USA, April 22-24, 2008. Proceedings 11, pages 43–57. Springer, 2008.
  • [11] Abdelhak Goudjil, Mathieu Pouliquen, Eric Pigeon, Olivier Gehan, and Boubekeur Targui. Recursive output error identification algorithm for switched linear systems with bounded noise. IFAC-PapersOnLine, 50(1):14112–14117, 2017.
  • [12] Laurent Bako, Khaled Boukharouba, Eric Duviella, and Stéphane Lecoeuche. A recursive identification algorithm for switched linear/affine models. Nonlinear Analysis: Hybrid Systems, 5(2):242–253, 2011.
  • [13] Necmiye Ozay, Constantino Lagoa, and Mario Sznaier. Set membership identification of switched linear systems with known number of subsystems. Automatica, 51:180–191, 2015.
  • [14] Henrik Ohlsson and Lennart Ljung. Identification of switched linear regression models using sum-of-norms regularization. Automatica, 49(4):1045–1050, 2013.
  • [15] Jacob Roll, Alberto Bemporad, and Lennart Ljung. Identification of piecewise affine systems via mixed-integer programming. Automatica, 40(1):37–50, 2004.
  • [16] Alfred M Bruckstein, David L Donoho, and Michael Elad. From sparse solutions of systems of equations to sparse modeling of signals and images. SIAM review, 51(1):34–81, 2009.
  • [17] Biqiang Mu, Tianshi Chen, Changming Cheng, and Er-wei Bai. Persistence of excitation for identifying switched linear systems. Automatica, 137:110142, 2022.
  • [18] Bo Zhou, Hao Ye, Haifeng Zhang, and Mingliang Li. A new qualitative trend analysis algorithm based on global polynomial fit. AIChE Journal, 63(8):3374–3383, 2017.
  • [19] Richard Bellman and Robert Roth. Curve fitting by segmented straight lines. Journal of the American Statistical Association, 64(327):1079–1084, 1969.
  • [20] Richard Bellman. On the approximation of curves by line segments using dynamic programming. Communications of the ACM, 4(6):284, 1961.
  • [21] John Shawe-Taylor, Nello Cristianini, et al. Kernel methods for pattern analysis. Cambridge university press, 2004.
  • [22] Wenju Zheng and Hao Ye. Identification of switched linear system based on dynamic programming. In 2023 CAA Symposium on Fault Detection, Supervision and Safety for Technical Processes (SAFEPROCESS), pages 1–5. IEEE, 2023.
  • [23] Necmiye Ozay, Mario Sznaier, Constantino M Lagoa, and Octavia I Camps. A sparsification approach to set membership identification of switched affine systems. IEEE Transactions on Automatic Control, 57(3):634–648, 2011.
  • [24] Kumpati S Narendra and Anuradha M Annaswamy. Persistent excitation in adaptive systems. International Journal of Control, 45(1):127–160, 1987.
  • [25] Ilya Zaliapin, Andrei Gabrielov, and Vladimir Keilis-Borok. Multiscale trend analysis. Fractals, 12(03):275–292, 2004.
  • [26] Jacob Benesty, Constantin Paleologu, Tomas Gänsler, Silviu Ciochină, Jacob Benesty, Constantin Paleologu, Tomas Gänsler, and Silviu Ciochină. Recursive least-squares algorithms. A perspective on stereophonic acoustic echo cancellation, pages 63–69, 2011.
  • [27] Joel A Tropp and Anna C Gilbert. Signal recovery from random measurements via orthogonal matching pursuit. IEEE Transactions on information theory, 53(12):4655–4666, 2007.
  • [28] Jian Wang, Seokbeop Kwon, and Byonghyo Shim. Generalized orthogonal matching pursuit. IEEE Transactions on signal processing, 60(12):6202–6216, 2012.
  • [29] Zhiyong Hao, Yixuan Jiang, Huihua Yu, and Hsiao-Dong Chiang. Adaptive learning rate and momentum for training deep neural networks. In Machine Learning and Knowledge Discovery in Databases. Research Track: European Conference, ECML PKDD 2021, Bilbao, Spain, September 13–17, 2021, Proceedings, Part III 21, pages 381–396. Springer, 2021.
  • [30] Lennart Ljung. System identification toolbox: User’s guide. Citeseer, 1995.
  • [31] ZhenXu Sun, **g**g Song, and YiRan An. Optimization of the head shape of the CRH3 high speed train. Science China Technological Sciences, 53:3356–3364, 2010.
  • [32] David L Donoho, Michael Elad, and Vladimir N Temlyakov. Stable recovery of sparse overcomplete representations in the presence of noise. IEEE Transactions on information theory, 52(1):6–18, 2005.

Appendix A Improved OMP

The orthogonal matching pursuit (OMP) [27, 28] is proposed to address the normalized 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm optimization problem (2) with J(z)=z0𝐽𝑧subscriptdelimited-∥∥𝑧0J(z)=\lVert z\rVert_{0}italic_J ( italic_z ) = ∥ italic_z ∥ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. Although OMP is inherently greedy, the convergence and optimality of the solution to the general 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm optimization problem against noise is guaranteed when condition (5) holds, as discussed in [32]. To compare with the proposed 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm based algorithm, the OMP algorithm is improved to address the combinatorial 0subscript0\ell_{0}roman_ℓ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT-norm optimization problem (28) with segmented data D¯={D¯1,,D¯M}¯𝐷subscript¯𝐷1subscript¯𝐷𝑀\bar{D}=\{\bar{D}_{1},\cdots,\bar{D}_{M}\}over¯ start_ARG italic_D end_ARG = { over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT }. Notably, we focus on refining the ”Sweep” step and the ”Update Support” step, as the selection of basic elements now relies on segmented data rather than a single data sample. The improved OMP algorithm is outlined in the following Algorithm 4.

Algorithm 4 Improved OMP

Require:
AX(Nn)×N,bXNnformulae-sequencesubscript𝐴𝑋superscript𝑁𝑛𝑁subscript𝑏𝑋superscript𝑁𝑛A_{X}\in\mathbb{R}^{(N-n)\times N},~{}b_{X}\in\mathbb{R}^{N-n}italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT ( italic_N - italic_n ) × italic_N end_POSTSUPERSCRIPT , italic_b start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_N - italic_n end_POSTSUPERSCRIPT, switching instants {sm}m=1M+1superscriptsubscriptsubscript𝑠𝑚𝑚1𝑀1\{s_{m}\}_{m=1}^{M+1}{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M + 1 end_POSTSUPERSCRIPT, segmented data D¯={D¯1,,D¯M}¯𝐷subscript¯𝐷1subscript¯𝐷𝑀\bar{D}=\{\bar{D}_{1},\cdots,\bar{D}_{M}\}over¯ start_ARG italic_D end_ARG = { over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , over¯ start_ARG italic_D end_ARG start_POSTSUBSCRIPT italic_M end_POSTSUBSCRIPT } and hyperparameters ε0subscript𝜀0\varepsilon_{0}italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT.

1:According to the switching instants {sm}m=1M+1superscriptsubscriptsubscript𝑠𝑚𝑚1𝑀1\{s_{m}\}_{m=1}^{M+1}{ italic_s start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_m = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M + 1 end_POSTSUPERSCRIPT in column order, partition matrix AXsubscript𝐴𝑋A_{X}italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT into block matrices, AX=[AX1,,AXM]subscript𝐴𝑋subscriptsuperscript𝐴1𝑋subscriptsuperscript𝐴𝑀𝑋A_{X}=[A^{1}_{X},\cdots,A^{M}_{X}]italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT = [ italic_A start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT , ⋯ , italic_A start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ], where AXm(Nn)×pmsubscriptsuperscript𝐴𝑚𝑋superscript𝑁𝑛subscript𝑝𝑚A^{m}_{X}\in\mathbb{R}^{(N-n)\times p_{m}}italic_A start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT ( italic_N - italic_n ) × italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUPERSCRIPT.
2:Initial settings: j=0𝑗0j=0italic_j = 0
  • Initial solution: z0=0superscript𝑧00z^{0}=0italic_z start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT = 0

  • Initial residual: r0=bXAXz0superscript𝑟0subscript𝑏𝑋subscript𝐴𝑋superscript𝑧0r^{0}=b_{X}-A_{X}z^{0}italic_r start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT = italic_b start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT - italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT

  • Initial solution support: S0={}superscript𝑆0S^{0}=\{\emptyset\}italic_S start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT = { ∅ }

3:While rj2ε0subscriptnormsuperscript𝑟𝑗2subscript𝜀0\|r^{j}\|_{2}\leq\varepsilon_{0}∥ italic_r start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT
  • Sweep: Computing errors ϵ(m)italic-ϵ𝑚\epsilon(m)italic_ϵ ( italic_m ) with the optimal choice qmsuperscriptsubscript𝑞𝑚q_{m}^{*}italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT

    ϵ(m)italic-ϵ𝑚\displaystyle\epsilon(m)italic_ϵ ( italic_m ) =minqmpmAmqmrj122absentsubscriptsubscript𝑞𝑚superscriptsubscript𝑝𝑚superscriptsubscriptnormsubscript𝐴𝑚subscript𝑞𝑚superscript𝑟𝑗122\displaystyle=\min_{q_{m}\in\mathbb{R}^{p_{m}}}\|A_{m}q_{m}-r^{j-1}\|_{2}^{2}= roman_min start_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ italic_A start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - italic_r start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (46)
    qmsuperscriptsubscript𝑞𝑚\displaystyle q_{m}^{*}italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT =argminqmpmAmqmrj122.absentsubscriptsubscript𝑞𝑚superscriptsubscript𝑝𝑚superscriptsubscriptnormsubscript𝐴𝑚subscript𝑞𝑚superscript𝑟𝑗122\displaystyle=\arg\min_{q_{m}\in\mathbb{R}^{p_{m}}}\|A_{m}q_{m}-r^{j-1}\|_{2}^% {2}.= roman_arg roman_min start_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ italic_A start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT - italic_r start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT .
  • Update Support: Find a minimizer m0subscript𝑚0m_{0}italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT of ϵ(m)italic-ϵ𝑚\epsilon(m)italic_ϵ ( italic_m ) such that ϵ(m0)ϵ(m),mSj1formulae-sequenceitalic-ϵsubscript𝑚0italic-ϵ𝑚for-all𝑚superscript𝑆𝑗1\epsilon(m_{0})\leq\epsilon(m),\forall m\notin S^{j-1}italic_ϵ ( italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ≤ italic_ϵ ( italic_m ) , ∀ italic_m ∉ italic_S start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT and update Sj=Sj1{sm0,,sm0+11}superscript𝑆𝑗superscript𝑆𝑗1subscript𝑠subscript𝑚0subscript𝑠subscript𝑚011S^{j}=S^{j-1}\bigcup\{s_{m_{0}},\cdots,s_{m_{0}+1}-1\}italic_S start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT = italic_S start_POSTSUPERSCRIPT italic_j - 1 end_POSTSUPERSCRIPT ⋃ { italic_s start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , ⋯ , italic_s start_POSTSUBSCRIPT italic_m start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT - 1 }

  • Update provisional solution: Compute zjsuperscript𝑧𝑗z^{j}italic_z start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT the minimizer of AXzbX22superscriptsubscriptnormsubscript𝐴𝑋𝑧subscript𝑏𝑋22\|A_{X}z-b_{X}\|_{2}^{2}∥ italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT italic_z - italic_b start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT subject to Support{z}=Sj𝑧superscript𝑆𝑗\{z\}=S^{j}{ italic_z } = italic_S start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT

  • Update residual: Compute rj=bXAXzjsuperscript𝑟𝑗subscript𝑏𝑋subscript𝐴𝑋superscript𝑧𝑗r^{j}=b_{X}-A_{X}z^{j}italic_r start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT = italic_b start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT - italic_A start_POSTSUBSCRIPT italic_X end_POSTSUBSCRIPT italic_z start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT

4:Concatenate the data belonging to the i0subscript𝑖0i_{0}italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTth subsystem, I(θi0)={xk:kSj}𝐼subscript𝜃subscript𝑖0conditional-setsubscript𝑥𝑘𝑘superscript𝑆𝑗I(\theta_{i_{0}})=\{x_{k}:k\in S^{j}\}italic_I ( italic_θ start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = { italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT : italic_k ∈ italic_S start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT }, and re-estimate the parameters of the i0subscript𝑖0i_{0}italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTth subsystem as follows:
θi0=argminθkSjzkxkθ22.subscript𝜃subscript𝑖0subscript𝜃subscript𝑘superscript𝑆𝑗superscriptsubscriptnormsubscript𝑧𝑘superscriptsubscript𝑥𝑘top𝜃22\displaystyle\theta_{i_{0}}=\arg\min_{\theta}\sum_{k\in S^{j}}\|z_{k}-x_{k}^{% \top}\theta\|_{2}^{2}.italic_θ start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = roman_arg roman_min start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_k ∈ italic_S start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∥ italic_z start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT - italic_x start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_θ ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT . (47)
5:return θi0,I(θi0)subscript𝜃subscript𝑖0𝐼subscript𝜃subscript𝑖0\theta_{i_{0}},I(\theta_{i_{0}})italic_θ start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_I ( italic_θ start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT )