11institutetext: Centre for Medical Image Computing, Department of Medical Physics and Biomedical Engineering, University College London, United Kingdom 11email: [email protected] 22institutetext: Wellcome/EPSRC Centre for Interventional and Surgical Sciences, University College London, London, United Kingdom 33institutetext: Joint Department of Physics, The Institute of Cancer Research and The Royal Marsden NHS Foundation Trust, London, United Kingdom 44institutetext: Institute of Nuclear Medicine, University College London, London, United Kingdom

Resolving Variable Respiratory Motion From Unsorted 4D Computed Tomography

Yuliang Huang(🖂) 1122    Bjoern Eiben 33    Kris Thielemans 1144    Jamie R. McClelland 1122
Abstract

4D Computed Tomography (4DCT) is widely used for many clinical applications such as radiotherapy treatment planning, PET and ventilation imaging. However, common 4DCT methods reconstruct multiple breath cycles into a single, arbitrary breath cycle which can lead to various artefacts, impacting the downstream clinical applications. Surrogate driven motion models can estimate continuous variable motion across multiple cycles based on CT segments ‘unsorted’ from 4DCT, but it requires respiration surrogate signals with strong correlation to the internal motion, which are not always available. The method proposed in this study eliminates such dependency by adapting the hyper-gradient method to the optimization of surrogate signals as hyper-parameters, while achieving better or comparable performance, as demonstrated on digital phantom simulations and real patient data. Our method produces a high-quality motion-compensated image together with estimates of the motion, including breath-to-breath variability, throughout the image acquisition. Our method has the potential to improve downstream clinical applications, and also enables retrospective analysis of open access 4DCT dataset where no respiration signals are stored. Code is avaibale at https://github.com/Yuliang-Huang/4DCT-irregular-motion.

Keywords:
4DCT Irregular breath Motion model.

1 Introduction

4D Computed Tomography (4DCT) can capture 3DCT images of each respiration motion state in a breathing cycle [21], thus reducing motion artefacts. The basic principle is to synchronize the CT acquisition with a respiration signal measured by external devices such as a belt or skin marker. Most commonly, respiratory motion states are defined based on phase, i.e. intervals between adjacent peaks/valleys of the respiration signal are evenly divided into several (normally ten) respiration phases. CT data acquired at different couch positions are then sorted and stacked to form the 3DCT of each respiration phase [7].

4DCT is used to delineate tumors and critical organs for almost all the thoracic and abdominal cancer patients treated by radiotherapy [15]. The tendency to increase the use of proton radiotherapy will place a higher emphasis on motion management guided by 4DCT [20]. 4DCT also has been applied in other domains outside radiotherapy, such as PET-imaging for attenuation correction [4], and ventilation imaging [11] for ablative pulmonary interventions.

Despite its critical role in various applications, a fundamental challenge to 4DCT is irregular breathing, which has not been solved [22]. Fig. 1 illustrates an artefact caused by irregular breathing, using the end-inhalation phase as an example. As shown by Fig. 1, the patient had deeper inhalation at tjsubscript𝑡𝑗t_{j}italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT than tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, causing the diaphragm to be lower at tjsubscript𝑡𝑗t_{j}italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT and appear in the CT segments acquired at the two couch positions. The final end-inhalation images, formed by stacking these CT segments, will therefore show misalignment, duplicate, missing or distortion of structures. Indeed, clinical evidence has shown correlation between 4DCT motion artefacts and worse control of lung and liver cancer metastasis after radiotherapy [18], indicating the significance of overcoming the irregular breathing problem for 4DCT.

Refer to caption
Figure 1: Illustration of 4DCT artefacts caused by irregular breathing. Time points of the end-inhalation phase are identified as black dots in the respiration signal plot, with two of them being tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Itisubscript𝐼subscript𝑡𝑖I_{t_{i}}italic_I start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT and Itjsubscript𝐼subscript𝑡𝑗I_{t_{j}}italic_I start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT are the real-time volumes at tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and tjsubscript𝑡𝑗t_{j}italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT respectively. Ptisubscript𝑃subscript𝑡𝑖P_{t_{i}}italic_P start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT and Ptjsubscript𝑃subscript𝑡𝑗P_{t_{j}}italic_P start_POSTSUBSCRIPT italic_t start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT indicate the CT segments acquired at the couch positions corresponding to tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, which are stacked into the final end-inhalation phase image. Artefact of duplicated diaphragm can be observed in the end-inhalation phase image.

Different solutions have been proposed in the past two decades. Different sorting methods [1, 9, 23] could produce fewer artefacts, but might lead to gaps in slice locations for certain phase [1]. Other studies were based on pairwise/groupwise registration [24, 10, 19], where a motion-free template image could be obtained from all the phase images warped to the template image space, and then used to recover motion-free images of each phase by backward war**. It is unknown how the residual motion artefacts within 4DCT would impact the pairwise/groupwise registration and these methods could only recover arbitrary breath cycle, unable to address errors/uncertainties in the motion seen in the 4DCT. Low et al. [12] developed a 5DCT protocol using a 5D lung motion model [25] but required a special acquisition protocol. Similar to 5DCT method, McClelland et al. [13] proposed the surrogate-driven motion model method that fits in with standard protocol, which could estimate a motion-free reference image and the motion fields at each timepoint based on CT segments. Therefore, phase-binning was not required and the irregular breath would not cause artefact.

Although promising, the surrogate-driven motion model relies on existence of surrogate signals that have strong correlation with internal motion, which are challenging to obtain. Weak or unstable correlation between the surrogate signals and internal motion can significantly impact model precision [8]. More importantly, respiration signals are often absent from retrospective data, especially open access data, and some hospitals do not keep the respiration signals after 4DCT data are sorted, so the method cannot be applied retrospectively.

In this work, we propose a method that improves the surrogate-driven motion model. The proposed method uses the hypergradient method [3] to obtain the suitable surrogate signals when the clinical respiration signals are unavailable (denoted as surrogate-free motion model), or optimize existing surrogate signals so that the correlation with internal motion can be strengthened (denoted as surrogate-optimized motion model). A phantom simulation with ground-truth is used to quantitatively evaluate our methods against other methods in literature. Furthermore, to demonstrate the benefits of our method when no respiration signals are available, we retrospectively process 4DCTs from local cancer center and public available dataset by our method and show that the irregular breathing motion could be estimated by our method.

2 Methodology

2.0.1 Problem Formulation

During 4DCT scan, only a few slices of the whole volume will be acquired at each time and the couch will move to change the slice locations being imaged. The CT segment Ptsubscript𝑃𝑡P_{t}italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT at time t𝑡titalic_t can be obtained by

Pt=E𝒵t(It),𝒵t=[ztinf,ztsup]formulae-sequencesubscript𝑃𝑡subscript𝐸subscript𝒵𝑡subscript𝐼𝑡subscript𝒵𝑡superscriptsubscript𝑧𝑡infsuperscriptsubscript𝑧𝑡supP_{t}=E_{\mathcal{Z}_{t}}(I_{t}),\mathcal{Z}_{t}=[z_{t}^{\mathrm{inf}},z_{t}^{% \mathrm{sup}}]italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_E start_POSTSUBSCRIPT caligraphic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , caligraphic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = [ italic_z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_inf end_POSTSUPERSCRIPT , italic_z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_sup end_POSTSUPERSCRIPT ] (1)

where Itsubscript𝐼𝑡I_{t}italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the full dynamic volume at time t𝑡titalic_t, E𝐸Eitalic_E is the slice extraction operator and 𝒵tsubscript𝒵𝑡\mathcal{Z}_{t}caligraphic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the slice locations being imaged at time t𝑡titalic_t. If there is a reference volume I0subscript𝐼0I_{0}italic_I start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, the dynamic volumes could be transformed from I0subscript𝐼0I_{0}italic_I start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT as below:

It=T(I0,Mt)subscript𝐼𝑡𝑇subscript𝐼0subscript𝑀𝑡I_{t}=T(I_{0},M_{t})italic_I start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_T ( italic_I start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) (2)

where T𝑇Titalic_T is a function that can resample I0subscript𝐼0I_{0}italic_I start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT according to given transformation, and Mtsubscript𝑀𝑡M_{t}italic_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is the parameters that determine the transformation from dynamic volume space to reference volume space at time t𝑡titalic_t. Here the free-form cubic bspline transformation is adopted, so Mtsubscript𝑀𝑡M_{t}italic_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT refers to the control point grid displacement field. To get the dynamic volumes at all the Ntsubscript𝑁𝑡N_{t}italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT timepoints, a motion-free I0subscript𝐼0I_{0}italic_I start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and the transformations over time {Mt|t[1,Nt]}conditional-setsubscript𝑀𝑡𝑡1subscript𝑁𝑡\{M_{t}|t\in[1,N_{t}]\}{ italic_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT | italic_t ∈ [ 1 , italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ] } need to be estimated.

2.0.2 Surrogate-Driven Motion Model Approach

Directly solving Mtsubscript𝑀𝑡M_{t}italic_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT is impossible given the ill-posedness of the question. The surrogate-driven motion model approach [13] models the transformation parameters as linear combination of products of surrogate signals and spatial correspondence models:

Mt=Σi=1NsSi,tCisubscript𝑀𝑡superscriptsubscriptΣ𝑖1subscript𝑁𝑠subscript𝑆𝑖𝑡subscript𝐶𝑖M_{t}=\Sigma_{i=1}^{N_{s}}S_{i,t}C_{i}italic_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = roman_Σ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT (3)

where Nssubscript𝑁𝑠N_{s}italic_N start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is the number of surrogate signals, Si,tsubscript𝑆𝑖𝑡S_{i,t}italic_S start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT is the ithsuperscript𝑖𝑡i^{th}italic_i start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT surrogate signal at time t𝑡titalic_t, and Cisubscript𝐶𝑖C_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the corresponding spatial correspondence model with the same degree of freedom as Mtsubscript𝑀𝑡M_{t}italic_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT that indicates major breathing patterns over space. The surrogate signals can be adapted from respiration signals measured by external devices and are fixed during the motion model fitting.

The motion model parameters can be solved by optimizing the similarity such as mean-square-error between the estimated and acquired CT segments over time:

f(C)=Σt=1NtE𝒵t(T(I0,Σi=1NsSi,tCi))Pt22𝑓𝐶superscriptsubscriptΣ𝑡1subscript𝑁𝑡superscriptsubscriptnormsubscript𝐸subscript𝒵𝑡𝑇subscript𝐼0superscriptsubscriptΣ𝑖1subscript𝑁𝑠subscript𝑆𝑖𝑡subscript𝐶𝑖subscript𝑃𝑡22f(C)=\Sigma_{t=1}^{N_{t}}||E_{\mathcal{Z}_{t}}(T(I_{0},\Sigma_{i=1}^{N_{s}}S_{% i,t}C_{i}))-P_{t}||_{2}^{2}italic_f ( italic_C ) = roman_Σ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT | | italic_E start_POSTSUBSCRIPT caligraphic_Z start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_T ( italic_I start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , roman_Σ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) - italic_P start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT | | start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (4)

Gradient descent can be used to minimize the objective function f𝑓fitalic_f:

C(k)=C(k1)λ(k)Σt=1NtS,tMtf(C(k1))superscript𝐶𝑘superscript𝐶𝑘1superscript𝜆𝑘superscriptsubscriptΣ𝑡1subscript𝑁𝑡subscript𝑆𝑡subscriptsubscript𝑀𝑡𝑓superscript𝐶𝑘1C^{(k)}=C^{(k-1)}-\lambda^{(k)}\Sigma_{t=1}^{N_{t}}S_{\cdot,t}\cdot\nabla_{M_{% t}}{f(C^{(k-1)})}italic_C start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT = italic_C start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT - italic_λ start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT roman_Σ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT ⋅ , italic_t end_POSTSUBSCRIPT ⋅ ∇ start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f ( italic_C start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT ) (5)

where C(k)superscript𝐶𝑘C^{(k)}italic_C start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT is the updated motion model parameters after the kthsuperscript𝑘𝑡k^{th}italic_k start_POSTSUPERSCRIPT italic_t italic_h end_POSTSUPERSCRIPT iteration, and λ(k)superscript𝜆𝑘\lambda^{(k)}italic_λ start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT is the learning rate that can be determined by line search.

2.0.3 Proposed Method: Surrogate-Free/Optimized Motion Model

The surrogate signals are fixed in the method above. In contrast, we propose to treat the surrogate signals as optimizable hyper-parameters beside the spatial correspondence model. Inspired by hypergradient proposed in [3], we calculate the following hypergradient w.s.t. the surrogate signals through the chain rule:

hi,t(k)=f(C(k1))Si,t=(Σi=1NsSi,tCi(k1))Si,tMtf(C(k1))subscriptsuperscript𝑘𝑖𝑡𝑓superscript𝐶𝑘1subscript𝑆𝑖𝑡superscriptsubscriptΣ𝑖1subscript𝑁𝑠subscript𝑆𝑖𝑡superscriptsubscript𝐶𝑖𝑘1subscript𝑆𝑖𝑡subscriptsubscript𝑀𝑡𝑓superscript𝐶𝑘1h^{(k)}_{i,t}=\frac{\partial{f(C^{(k-1)})}}{\partial{S_{i,t}}}=\frac{\partial{% (\Sigma_{i=1}^{N_{s}}S_{i,t}C_{i}^{(k-1)}})}{\partial{S_{i,t}}}\cdot\nabla_{M_% {t}}{f(C^{(k-1)})}italic_h start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT = divide start_ARG ∂ italic_f ( italic_C start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT ) end_ARG start_ARG ∂ italic_S start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT end_ARG = divide start_ARG ∂ ( roman_Σ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT ) end_ARG start_ARG ∂ italic_S start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT end_ARG ⋅ ∇ start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f ( italic_C start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT ) (6)

By using the update rule of the previous iteration:

C(k1)=C(k2)λ(k1)Σt=1NtS,tMtf(C(k2))superscript𝐶𝑘1superscript𝐶𝑘2superscript𝜆𝑘1superscriptsubscriptΣ𝑡1subscript𝑁𝑡subscript𝑆𝑡subscriptsubscript𝑀𝑡𝑓superscript𝐶𝑘2C^{(k-1)}=C^{(k-2)}-\lambda^{(k-1)}\Sigma_{t=1}^{N_{t}}S_{\cdot,t}\cdot\nabla_% {M_{t}}{f(C^{(k-2)})}italic_C start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT = italic_C start_POSTSUPERSCRIPT ( italic_k - 2 ) end_POSTSUPERSCRIPT - italic_λ start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT roman_Σ start_POSTSUBSCRIPT italic_t = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT ⋅ , italic_t end_POSTSUBSCRIPT ⋅ ∇ start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f ( italic_C start_POSTSUPERSCRIPT ( italic_k - 2 ) end_POSTSUPERSCRIPT ) (7)

and assuming that optimal values of S𝑆Sitalic_S do not change much between two consecutive iterations, we can get

(Σi=1NsSi,tCi(k1))Si,t=Ci(k1)+Si,tCi(k1)Si,t=Ci(k1)λ(k1)Si,tMtf(C(k2))superscriptsubscriptΣ𝑖1subscript𝑁𝑠subscript𝑆𝑖𝑡superscriptsubscript𝐶𝑖𝑘1subscript𝑆𝑖𝑡superscriptsubscript𝐶𝑖𝑘1subscript𝑆𝑖𝑡superscriptsubscript𝐶𝑖𝑘1subscript𝑆𝑖𝑡superscriptsubscript𝐶𝑖𝑘1superscript𝜆𝑘1subscript𝑆𝑖𝑡subscriptsubscript𝑀𝑡𝑓superscript𝐶𝑘2\begin{split}\frac{\partial{(\Sigma_{i=1}^{N_{s}}S_{i,t}C_{i}^{(k-1)}})}{% \partial{S_{i,t}}}&=C_{i}^{(k-1)}+S_{i,t}\frac{\partial{C_{i}^{(k-1)}}}{% \partial{S_{i,t}}}\\ &=C_{i}^{(k-1)}-\lambda^{(k-1)}S_{i,t}\nabla_{M_{t}}{f(C^{(k-2)})}\end{split}start_ROW start_CELL divide start_ARG ∂ ( roman_Σ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT ) end_ARG start_ARG ∂ italic_S start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT end_ARG end_CELL start_CELL = italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT + italic_S start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT divide start_ARG ∂ italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT end_ARG start_ARG ∂ italic_S start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT end_ARG end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL = italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT - italic_λ start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT ∇ start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f ( italic_C start_POSTSUPERSCRIPT ( italic_k - 2 ) end_POSTSUPERSCRIPT ) end_CELL end_ROW (8)

Note that the second equality is obtained by plugging in Eqn. 7.

Substituting Eqn. 8 back into Eqn. 6 will yield

hi,t(k)=[Ci(k1)λ(k1)Si,t(k1)Mtf(C(k2))]Mtf(C(k1))subscriptsuperscript𝑘𝑖𝑡delimited-[]superscriptsubscript𝐶𝑖𝑘1superscript𝜆𝑘1superscriptsubscript𝑆𝑖𝑡𝑘1subscriptsubscript𝑀𝑡𝑓superscript𝐶𝑘2subscriptsubscript𝑀𝑡𝑓superscript𝐶𝑘1h^{(k)}_{i,t}=[C_{i}^{(k-1)}-\lambda^{(k-1)}S_{i,t}^{(k-1)}\nabla_{M_{t}}{f(C^% {(k-2)})}]\cdot\nabla_{M_{t}}{f(C^{(k-1)})}italic_h start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT = [ italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT - italic_λ start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT italic_S start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT ∇ start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f ( italic_C start_POSTSUPERSCRIPT ( italic_k - 2 ) end_POSTSUPERSCRIPT ) ] ⋅ ∇ start_POSTSUBSCRIPT italic_M start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_f ( italic_C start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT ) (9)

and the surrogate signals can then be updated accordingly:

Si,t(k)=Si,t(k1)αhi,t(k)superscriptsubscript𝑆𝑖𝑡𝑘superscriptsubscript𝑆𝑖𝑡𝑘1𝛼subscriptsuperscript𝑘𝑖𝑡S_{i,t}^{(k)}=S_{i,t}^{(k-1)}-\alpha h^{(k)}_{i,t}italic_S start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT = italic_S start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_k - 1 ) end_POSTSUPERSCRIPT - italic_α italic_h start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i , italic_t end_POSTSUBSCRIPT (10)

where the learning rate α𝛼\alphaitalic_α is fixed to 0.01, which is selected based on our empirical studies. The initial values of S𝑆Sitalic_S can be sinusoidal functions if no existing surrogate signals are available, in which case our method is called surrogate-free motion model. When respiration signals are provided, they can be used as the initial surrogates and then optimized by our method, which is denoted as surrogate-optimized motion model.

2.0.4 Motion Compensated Iterative Reconstruction

In all the methods above, the reference volume can be initialized with the average of all phase images in 4DCT, which will exhibit quite large artefacts. Once the motion model and surrogate signals are estimated with the method mentioned above, they can be used to update the reference volume by motion compensated iterative reconstruction (MCIR). Briefly, the transformation at each timepoint can be estimated by Eqn. 3 and used to compensate for the motion during an iterative reconstruction (similar to the motion compensated super resolution reconstruction performed in [13], but with reconstructed image having the same resolution as the CT segments), resulting in a motion-free volume if the transformations are accurate. Multi-resolution approach was used to update the motion model and reference volume alternatively at three levels. At each resolution level, our method would switch between motion model fitting and MCIR for up to six times or until the objective function did not improve. Each run of motion model fitting or MCIR would be terminated when the objective function stopped decreasing or up to 5 iterations of gradient descent had been used.

3 Experiments

3.0.1 Digital Phantom Simulation

A digital phantom with motion was generated by the 4DXCAT software [17] and post-processed by the cid-X software [5]. The simulation was controlled by two respiration traces, i.e. the motion of the chest and the diaphragm, which were measured from 2D Cine MRI from a real patient. The diaphragm trace was deliberately delayed by 1 second to add hysteresis, i.e. the breathing would follow different paths during inhalation and exhalation. The chest trace could simulate the skin marker or belt signals normally used to sort 4DCT data in clinical practice. The phantom dataset consisted of images at 182 timepoints, each with size 355×\times×280×\times×115 voxels and resolution of 1×\times×1×\times×3 mm. A movie of the simulated 4DCT can be found in the supplementary material.

Four motion modelling methods were compared, including the groupwise registration method [24], surrogate-driven motion model [13], and our proposed surrogate-free/optimized motion models. The original 4DCT was also included as a baseline. The groupwise registration method alternatively estimated the deformation field sequence and the mid-position image through 5 iterations, using stationary velocity based diffeomorphism [14]. For 4DCT and groupwise registration, the real-time volumes were obtained by repeating the ten phase images over time. For the surrogate-driven and surrogate-optimized methods, the chest signal and its temporal derivative [13] were used as the fixed or initial surrogate signals, respectively. For the surrogate-free method, the initial surrogate signals were set to the cos and sin functions derived from the respiration phases. Evaluation metrics include average DSC (dice similarity coefficient) between the estimated and the true tumor masks, TRE (target registration error) of the tumor centroid, and the average RMSE (root-mean-square-error) between the estimated and the true image intensities.

3.0.2 Real Patient Data

Our method was also demonstrated on four clinical 4DCT data, two of which (Pat1/Pat2) were from the ROSS-LC clinical trial [16] and the other two (Pat3/Pat4) were from the public TCIA 4D-Lung dataset [2]. Due to the absence of respiration signals used for 4DCT sorting, the surrogate-driven/optimized motion models could not be applied to these data retrospectively. Therefore, only the surrogate-free motion model was fit on the real patient data, and each slice was treated as an individual CT segment since no timing information was provided to determine which slices were acquired at the same time. Unlike the digital phantom data, there was no ground-truth for real data, so the end-inhalation images estimated by the surrogate-free motion model were visually compared with the end-inhalation phase of 4DCT. To show that our method could estimate irregular breathing motion, all the end-inhlation timepoints were identified according to the optimized surrogate signals and the two timepoints with the deepest and shallowest inhalation among them were selected for visual comparison.

4 Results

Table 1: Evaluation metrics in all scenarios. Unit for TRE and RMSE is mm and HU (Houndsfield unit) respectively.
Metrics 4DCT groupReg surrDriven surrFree surrOptimized
DSC 0.37±0.27plus-or-minus0.370.270.37\pm 0.270.37 ± 0.27 0.46±0.17plus-or-minus0.460.170.46\pm 0.170.46 ± 0.17 0.70±0.16plus-or-minus0.700.160.70\pm 0.160.70 ± 0.16 0.76±0.11plus-or-minus0.760.110.76\pm 0.110.76 ± 0.11 0.79±0.10plus-or-minus0.790.10\boldsymbol{0.79\pm 0.10}bold_0.79 bold_± bold_0.10
TRE 3.94±2.04plus-or-minus3.942.043.94\pm 2.043.94 ± 2.04 3.04±1.27plus-or-minus3.041.273.04\pm 1.273.04 ± 1.27 1.17±0.76plus-or-minus1.170.761.17\pm 0.761.17 ± 0.76 1.16±0.70plus-or-minus1.160.701.16\pm 0.701.16 ± 0.70 0.91±0.50plus-or-minus0.910.50\boldsymbol{0.91\pm 0.50}bold_0.91 bold_± bold_0.50
RMSE 108.6±28.8plus-or-minus108.628.8108.6\pm 28.8108.6 ± 28.8 107.1±31.1plus-or-minus107.131.1107.1\pm 31.1107.1 ± 31.1 40.6±13.4plus-or-minus40.613.4\boldsymbol{40.6\pm 13.4}bold_40.6 bold_± bold_13.4 57.5±14.8plus-or-minus57.514.857.5\pm 14.857.5 ± 14.8 40.6±13.9plus-or-minus40.613.9\boldsymbol{40.6\pm 13.9}bold_40.6 bold_± bold_13.9

4.0.1 Phantom Experiment Results

Table 1 displays the evaluation metrics for the 4DCT and the four motion models, as explained in section 3.0.1. The original 4DCT and groupwise registration method have the worst results, highlighting their inability to model the variable motion. The surrogate-free method is comparable to the surrogate-driven method, although the RMSE metric is slightly worse. By leveraging existing respiration signals, the surrogate-optimized method achieved the best results in all the metrics.

Refer to caption
Figure 2: Effects of artefact correction by different methods on digital phantom data.

Fig. 2 shows the end-inhalation phase image of the original 4DCT and the one obtained by groupwise registration, as well as the images obtained by the surrogate-driven/surrogate-free/surrogate-optimized methods, corresponding to the timepoint when the inhalation is deepest. Ground-truth image at the same timepoint as shown by the model is also plotted. While the four motion correction methods all reduce the artefacts in the 4DCT, the proposed surrogate-free/optimized methods obtain the sharpest images without the apparent distortion of the diaphragm and lung tissues, as illustrated by the red arrows.

Refer to caption
Figure 3: Visual comparison between end-inhalation 4DCT and the deepest/shallowest end-inhalation images estimated by the surrogate-free motion model.

4.0.2 Real Patient Results

Fig. 3 shows the end-inhalation 4DCTs (left column), and the shallowest (middle) and the deepest (right) end-inhalation images estimated by the surrogate-free motion model for the four real patients. The sorting artefacts around the diaphragm were reduced in all the patients. For Pat1/3, the duplicate and discontinuity of certain structures were repaired by our method, as indicated by the red arrow. Fig. 3 also showed the variability of the end-inhalation phases due to the irregular breathing, i.e. the depth of end-inhalation could vary across breath cycles as indicated by the difference between the middle and the right column in Fig. 3. Our method could estimate the variation of breath which 4DCT could not estimate. Movies showing the estimated variable motion for all the patients and the phantom can be found in the supplementary.

5 Discussion and Conclusion

Irregular breathing has been a long-standing problem for 4DCT, and the associated artefacts will impact many down-stream applications such as radiotherapy treatment planning, PET imaging and ventilation map derivation. Resolving the irregular breathing motion is challenging due to the limited data available at each slice location, which is only imaged for a single, arbitrary, breath cycle. By applying the surrogate-driven motion model, all the CT segments acquired at different timepoints can contribute to fitting the same set of motion model parameters and forming the motion-free reference image, which can solve the irregular breathing problem if synchronized surrogate signals exist and have strong correlation with internal motion. However, surrogate signals are not always stored after the 4DCT data have been sorted, which hinders retrospective application of surrogate-driven motion model on 4DCT data for clinical verification or research purpose. In addition, the existing surrogate signals often do not have strong correlation with the internal motion.

Our proposed method does not require existing surrogate signals so can be applied to any retrospective 4DCT dataset, including open datasets available online. It encourages revisiting the studies built on top of those dataset to check if the improved 4DCT images can lead to different clinical findings or provide better image-based biomarkers for outcome prediction, which is one of our future research directions. When surrogate signals are available our method can be used to optimise the signals, correcting for weak correlations between the measures signals and the internal motion.

The hypergradient method, although normally used in the meta-learning domain, can be adapted to our own problem and has been found to perform better than alternating the updates to the spatial correspondence models and the surrogate signals, with the other being fixed during the update. With the hypergradient method, most of the calculation of the gradient can be reused for both the gradient for the spatial correspondence models and for the surrogate signals, which can reduce the computation time by approximately a factor of 2. Currently, the execution time for each 4DCT is about 15 to 30 minutes on an Intel Core i7-10700K CPU of 3.8GHz and 8 cores. Acceleration by GPU will be investigated in the future.

Our method is not limited to 4DCT but can also be applied to other 4D imaging data such as interleaved 4DMRI [6] and CBCT [8]. By reducing the artefacts and more importantly estimating the variable breathing motion, our method can potentially benefit all thoracic cancer patients treated with radiotherapy, as well as other clinical applications that rely on 4DCT.

{credits}

5.0.1 Acknowledgements

This study was funded by Elekta Ltd., Crawley and the EPSRC-funded UCL Center for Doctoral Training in Intelligent, Integrated Imaging in Healthcare (i4health) (EP/S021930/1). We thank Dr. Gareth Price from the Christie NHS, University of Manchester, UK for providing data from ROSS-LC clinical trial.

5.0.2 \discintname

The authors have no competing interests to declare that are relevant to the content of this article.

References

  • [1] Abdelnour, A., Nehmeh, S., Pan, T., Humm, J., Vernon, P., Schöder, H., Rosenzweig, K., Mageras, G., Yorke, E., Larson, S., et al.: Phase and amplitude binning for 4d-ct imaging. Physics in medicine & biology 52(12),  3515 (2007)
  • [2] Balik, S., Weiss, E., Jan, N., Roman, N., Sleeman, W.C., Fatyga, M., Christensen, G.E., Zhang, C., Murphy, M.J., Lu, J., et al.: Evaluation of 4-dimensional computed tomography to 4-dimensional cone-beam computed tomography deformable image registration for lung cancer adaptive radiation therapy. International Journal of Radiation Oncology* Biology* Physics 86(2), 372–379 (2013)
  • [3] Baydin, A.G., Cornish, R., Rubio, D.M., Schmidt, M., Wood, F.: Online learning rate adaptation with hypergradient descent. arXiv preprint arXiv:1703.04782 (2017)
  • [4] Bettinardi, V., Picchio, M., Di Muzio, N., Gianolli, L., Gilardi, M.C., Messa, C.: Detection and compensation of organ/lesion motion using 4d-pet/ct respiratory gated acquisition techniques. Radiotherapy and oncology 96(3), 311–316 (2010)
  • [5] Eiben, B., Bertholet, J., Menten, M.J., Nill, S., Oelfke, U., McClelland, J.R.: Consistent and invertible deformation vector fields for a breathing anthropomorphic phantom: a post-processing framework for the xcat phantom. Physics in Medicine & Biology 65(16), 165005 (2020)
  • [6] Eiben, B., Bertholet, J., Tran, E.H., Wetscherek, A., Shiarli, A.M., Nill, S., Oelfke, U., McClelland, J.R.: Respiratory motion modelling for mr-guided lung cancer radiotherapy: model development and geometric accuracy evaluation. Physics in Medicine & Biology 69(5), 055009 (2024)
  • [7] Ford, E.C., Mageras, G., Yorke, E., Ling, C.: Respiration-correlated spiral ct: a method of measuring respiratory-induced anatomic motion for radiation treatment planning. Medical physics 30(1), 88–97 (2003)
  • [8] Huang, Y., Thielemans, K., Price, G., McClelland, J.R.: Surrogate-driven respiratory motion model for projection-resolved motion estimation and motion compensated cone-beam ct reconstruction from unsorted projection data. Physics in Medicine & Biology 69(2), 025020 (2024)
  • [9] Li, H., Noel, C., Garcia-Ramirez, J., Low, D., Bradley, J., Robinson, C., Mutic, S., Parikh, P.: Clinical evaluations of an amplitude-based binning algorithm for 4dct reconstruction in radiation therapy. Medical physics 39(2), 922–932 (2012)
  • [10] Li, M., Castillo, S.J., Castillo, R., Castillo, E., Guerrero, T., Xiao, L., Zheng, X.: Automated identification and reduction of artifacts in cine four-dimensional computed tomography (4dct) images using respiratory motion model. International Journal of Computer Assisted Radiology and Surgery 12, 1521–1532 (2017)
  • [11] Low, D.A., O’Connell, D., Lauria, M., Stiehl, B., Naumann, L., Lee, P., Hegde, J., Barjaktarevic, I., Goldin, J., Santhanam, A.: Ventilation measurements using fast-helical free-breathing computed tomography. Medical physics 48(10), 6094–6105 (2021)
  • [12] Low, D.A., White, B.M., Lee, P.P., Thomas, D.H., Gaudio, S., Jani, S.S., Wu, X., Lamb, J.M.: A novel ct acquisition and analysis technique for breathing motion modeling. Physics in Medicine & Biology 58(11),  L31 (2013)
  • [13] McClelland, J.R., Modat, M., Arridge, S., Grimes, H., D’Souza, D., Thomas, D., O’Connell, D., Low, D.A., Kaza, E., Collins, D.J., et al.: A generalized framework unifying image registration and respiratory motion models and incorporating image reconstruction, for partial image data or full images. Physics in Medicine & Biology 62(11),  4273 (2017)
  • [14] Modat, M., Daga, P., Cardoso, M.J., Ourselin, S., Ridgway, G.R., Ashburner, J.: Parametric non-rigid registration using a stationary velocity field. In: 2012 IEEE Workshop on Mathematical Methods in Biomedical Image Analysis. pp. 145–150. IEEE (2012)
  • [15] Muirhead, R., McNee, S.G., Featherstone, C., Moore, K., Muscat, S.: Use of maximum intensity projections (mips) for target outlining in 4dct radiotherapy planning. Journal of Thoracic Oncology 3(12), 1433–1438 (2008)
  • [16] Price, G.J., Faivre-Finn, C., Stratford, J., Chauhan, S., Bewley, M., Clarke, L., Johnson, C.N., Moore, C.J.: Results from a clinical trial evaluating the efficacy of real-time body surface visual feedback in reducing patient motion during lung cancer radiotherapy. Acta Oncologica 57(2), 211–218 (2018)
  • [17] Segars, W.P., Sturgeon, G., Mendonca, S., Grimes, J., Tsui, B.M.: 4d xcat phantom for multimodality imaging research. Medical physics 37(9), 4902–4915 (2010)
  • [18] Sentker, T., Schmidt, V., Ozga, A.K., Petersen, C., Madesta, F., Hofmann, C., Werner, R., Gauer, T.: 4d ct image artifacts affect local control in sbrt of lung and liver metastases. Radiotherapy and Oncology 148, 229–234 (2020)
  • [19] Shao, W., Pan, Y., Durumeric, O.C., Reinhardt, J.M., Bayouth, J.E., Rusu, M., Christensen, G.E.: Geodesic density regression for correcting 4dct pulmonary respiratory motion artifacts. Medical image analysis 72, 102140 (2021)
  • [20] Taasti, V.T., Hattu, D., Vaassen, F., Canters, R., Velders, M., Mannens, J., van Loon, J., Rinaldi, I., Unipan, M., van Elmpt, W.: Treatment planning and 4d robust evaluation strategy for proton therapy of lung tumors with large motion amplitude. Medical Physics 48(8), 4425–4437 (2021)
  • [21] Taguchi, K.: Temporal resolution and the evaluation of candidate algorithms for four-dimensional ct. Medical physics 30(4), 640–650 (2003)
  • [22] Tryggestad, E., Li, H., Rong, Y.: 4dct is long overdue for improvement. Journal of applied clinical medical physics 24(4) (2023)
  • [23] Werner, R., Hofmann, C., Gauer, T.: Optimized projection binning for improved helical amplitude-and phase-based 4dct reconstruction in the presence of breathing irregularity. In: Medical Imaging 2016: Physics of Medical Imaging. vol. 9783, pp. 271–276. SPIE (2016)
  • [24] Zhang, Y., Yang, J., Zhang, L., Court, L.E., Balter, P.A., Dong, L.: Modeling respiratory motion for reducing motion artifacts in 4d ct images. Medical physics 40(4), 041716 (2013)
  • [25] Zhao, T., Lu, W., Yang, D., Mutic, S., Noel, C.E., Parikh, P.J., Bradley, J.D., Low, D.A.: Characterization of free breathing patterns with 5d lung motion model. Medical physics 36(11), 5183–5189 (2009)