Intensity-based 3D motion correction for cardiac MR images

Abstract

Cardiac magnetic resonance (CMR) image acquisition requires subjects to hold their breath while 2D cine images are acquired. This process assumes that the heart remains in the same position across all slices. However, differences in breathhold positions or patient motion introduce 3D slice misalignments. In this work, we propose an algorithm that simultaneously aligns all SA and LA slices by maximizing the pair-wise intensity agreement between their intersections. Unlike previous works, our approach is formulated as a subject-specific optimization problem and requires no prior knowledge of the underlying anatomy. We quantitatively demonstrate that the proposed method is robust against a large range of rotations and translations by synthetically misaligning 10101010 motion-free datasets and aligning them back using the proposed method.

Index Terms—  cardiac magnetic resonance imaging, alignment, contour-free, single-subject, motion correction

1 Introduction

Cardiac magnetic resonance (CMR) imaging is typically acquired along acquisition planes aligned with the cardiac anatomy, each intended to allow the extraction of different clinically relevant measurements of target regions. Most notably, the long axis (LA) and short axis (SA) are planes that aim to visualize the left ventricle (LV) to derive longitudinal and radial motion information, respectively. Although the acquisition of slices is typically performed in a fixed scanner coordinate system, each slice is an independent acquisition originating from a separate breathhold. This makes anatomical misalignments between slices a common occurrence due to variations in the heart’s location across acquisitions.

Refer to caption

Fig. 1: Diagram of resulting optimization on slice alignment. Ventricle contours are hand-drawn for illustrative purposes. Top row: LA 4-chamber image with intersections lines of SA slices with original orientations (left/red) and optimized orientations (right/green). Bottom row: Nearest neighbour interpolation of LA 4-chamber view from SA slices with original orientations (left) and optimized orientations (right).

Acquisition planes. The long axis is the line that passes from the center of the mitral valve to the apex of the heart. The LV myocardium typically has three long-axis imaging planes: 2-chamber, 3-chamber, and 4-chamber views. These are named after the number of ventricles and atria they intersect and are meant to capture different sections of the myocardium. The short axis is perpendicular to the long axis, visualizing myocardial cross-sections at various ventricular steps. These slices are stacked for volumetric measurements.

Motion correction. Considerable research efforts using image registration have been made to correct these misalignments [1, 2, 3, 4, 5, 6, 7]. In [1], Chandler et al. aim to correct misaligned cardiac anatomy in multi-slice SA images by rigidly registering stacks of two slices to a high-resolution 3D MR axial cardiac volume. Although they demonstrate good results in synthetic and real misaligned data, the method requires a high-resolution 3D cardiac volume, which is not always available.

Refer to caption

Fig. 2: Result of intensity interpolation at slice intersections before (red) and after (green) optimization. Two slice pairs are displayed: short axis (SA) to long axis (LA) 3-chamber (left), and short axis to long axis 2-chamber (right). Red and green lines depict the intersection lines of each image pair. Each pair’s interpolated intersection intensities are displayed side-by-side on the vertical block. Blue segments depict padding points outside of image bounds. Areas of significant change are shown in yellow (ventricles) and cyan (stomach).

Another approach is to utilize shape information [3, 2, 6, 7]. Su et al. [3] combine 3D meshes and apply the constraint that the epicardial surface must remain smooth to generate a time-series model of the heart. The 3D meshes are created from border-delineated MRI data at every time frame of the cardiac cycle. Tarroni et al. [2] corrects inter-slice respiratory motion in SA CMR image stacks utilizing probabilistic segmentation maps (PSMs) of the left ventricular (LV) cavity generated with hybrid decision forests. PSMs are generated for each slice of the SA stack and rigidly registered in-plane to a target PSM. Both these methods utilize shape information that is not always available or might not generalize outside the intended subject cohort, limiting the methods’ applicability. A more recent approach [4] proposes a slice-to-slice group-registration framework based on the similarity of the local phase vectors of the images. As registration is notorious for being susceptible to local minima, they fix all slices and only allow one slice to ”move” at a time.

Contributions. In this work, we propose a method to mitigate the effect of inter-slice motion for all SA and LA slices simultaneously by optimizing the 3D rotation and translation parameters on sampled intensities along slice intersections. Unlike other approaches, our approach exclusively utilizes the image intensity information, which requires no prior knowledge about the underlying anatomy. We demonstrate that our GPU-accelerated approach can reliably converge in seconds despite large perturbations on the initial alignment parameters. Our contributions can be summarized as follows:

  • We propose a solution for motion correction in CMR that jointly aligns all SA and LA slices simultaneously.

  • We utilize all SA-LA and LA-LA intersections and minimize the intensity differences along the intersection lines without incorporating any anatomical information.

  • We evaluate our method quantitatively recovering synthetic rigid deformations using aligned CMR scans as a golden standard.

2 Method

With this method, we try to address SA-LA and LA-LA misalignments due to patient motion in the scanner with respect to their anatomy. To do that, we assume that any deformation applied to a slice during acquisition is strictly rigid. More precisely, our approach corrects for 3D3𝐷3D3 italic_D rotation and translation while it does not model scaling or shearing deformations. Our rigid rotation and translation matrices are defined as R(𝜽)𝑅𝜽R(\boldsymbol{\theta})italic_R ( bold_italic_θ ) and T(𝒕)𝑇𝒕T(\boldsymbol{t})\>italic_T ( bold_italic_t ), where 𝜽𝜽\boldsymbol{\theta}bold_italic_θ and 𝒕𝒕\boldsymbol{t}bold_italic_t are 3D vectors corresponding to rotation angles and translation coefficients for a given slice.

When two intersecting slices are perfectly aligned, a metric computing similarity of the intensities along the intersection line of the two planes should be at a global maximum. We formulate the alignment problem as a constraint optimization of parameters ϕN×6italic-ϕsuperscript𝑁6\phi\in\mathbb{R}^{N\times 6}italic_ϕ ∈ blackboard_R start_POSTSUPERSCRIPT italic_N × 6 end_POSTSUPERSCRIPT that aims to jointly minimize the pairwise intensity differences at slice intersections (where N𝑁Nitalic_N is the number of slices and 6 is the total rotation and translation deformation parameters in 3D). We use the summation of pair-wise L2 intensity differences as a similarity metric between the intensities along slice intersections, which minimizes the influence of shifts in intensities across slices.

The set of slice pairs is based on all possible LA-LA slice combinations and LA-SA slice combinations of a given subject. We do not consider SA-SA combinations since their planes do not overlap significantly. For every slice pair (A,B)𝐴𝐵(A,B)( italic_A , italic_B ), we compute their plane equations given their current rotation and translation parameters and thereby obtain the intersection line dABsuperscript𝑑𝐴𝐵d^{AB}italic_d start_POSTSUPERSCRIPT italic_A italic_B end_POSTSUPERSCRIPT. We define a sampling center 𝒄𝒄\boldsymbol{c}bold_italic_c by finding the closest points 𝒄𝑨superscript𝒄𝑨\boldsymbol{c^{A}}bold_italic_c start_POSTSUPERSCRIPT bold_italic_A end_POSTSUPERSCRIPT and 𝒄𝑩superscript𝒄𝑩\boldsymbol{c^{B}}bold_italic_c start_POSTSUPERSCRIPT bold_italic_B end_POSTSUPERSCRIPT along line dABsuperscript𝑑𝐴𝐵d^{AB}italic_d start_POSTSUPERSCRIPT italic_A italic_B end_POSTSUPERSCRIPT to the centers of images A𝐴Aitalic_A and B𝐵Bitalic_B, and subsequently taking the mid-point of the two. Using 𝒄𝒄\boldsymbol{c}bold_italic_c, we can define a sampling segment Kp×3𝐾superscript𝑝3K\in\mathbb{R}^{p\times 3}italic_K ∈ blackboard_R start_POSTSUPERSCRIPT italic_p × 3 end_POSTSUPERSCRIPT in 3D space, where p𝑝pitalic_p is the number of sampling points. We set p𝑝pitalic_p to 100 samples (50 points in each direction of 𝒄𝒄\boldsymbol{c}bold_italic_c along dABsuperscript𝑑𝐴𝐵d^{AB}italic_d start_POSTSUPERSCRIPT italic_A italic_B end_POSTSUPERSCRIPT) and the distance between samples to be 5 millimeters in world space.

Refer to caption

Fig. 3: Distributions of absolute error in parameters for all runs (with respect to motion-free parameters). Absolute errors are presented for parameters before and after optimization for both rotation and translation parameters. Results are shown under various ranges of uniformly sampled initial rigid transformations. Dataset consisted of 10 aligned subjects, each of which was randomly misaligned 10 times per condition.

Refer to caption

Fig. 4: Distributions of mean and maximum absolute error in parameters for a given optimization run (with respect to motion-free parameters). Absolute errors are presented both rotation and translation parameters. Results are shown under various ranges of uniformly sampled initial rigid transformations. Dataset consisted of 10 aligned subjects, each of which was randomly misaligned 10 times per condition.

We obtain sampling segment intensities KAsuperscript𝐾𝐴K^{A}italic_K start_POSTSUPERSCRIPT italic_A end_POSTSUPERSCRIPT and KBsuperscript𝐾𝐵K^{B}italic_K start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT by projecting the sampling segment onto both image planes and bi-linearly interpolating pixel intensities at each coordinate. We do this for all q𝑞qitalic_q time frames in the 2D+time slice, making KAsuperscript𝐾𝐴K^{A}italic_K start_POSTSUPERSCRIPT italic_A end_POSTSUPERSCRIPT and KBsuperscript𝐾𝐵K^{B}italic_K start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT matrices of size q×p𝑞𝑝q\times pitalic_q × italic_p. For each sampling line, we also create mask vectors 𝐦A,𝐦B{0,1}superscript𝐦𝐴superscript𝐦𝐵01\mathbf{m}^{A},\mathbf{m}^{B}\in\{0,1\}bold_m start_POSTSUPERSCRIPT italic_A end_POSTSUPERSCRIPT , bold_m start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT ∈ { 0 , 1 } to denote whether a sample is within the bounds of a given image.

The loss is the sum of L2 intensity differences between pair-wise sampled lines across all point pairs and time frames:

(ϕ)=A,Bjqip(KjiAKjiB)2(𝐦iA𝐦iB),italic-ϕsubscript𝐴𝐵subscriptsuperscript𝑞𝑗subscriptsuperscript𝑝𝑖superscriptsubscriptsuperscript𝐾𝐴𝑗𝑖subscriptsuperscript𝐾𝐵𝑗𝑖2subscriptsuperscript𝐦𝐴𝑖subscriptsuperscript𝐦𝐵𝑖\mathcal{L}\left(\phi\right)=\sum_{A,B}\sum^{q}_{j}\sum^{p}_{i}\left(K^{A}_{ji% }-K^{B}_{ji}\right)^{2}*\left(\mathbf{m}^{A}_{i}*\mathbf{m}^{B}_{i}\right),caligraphic_L ( italic_ϕ ) = ∑ start_POSTSUBSCRIPT italic_A , italic_B end_POSTSUBSCRIPT ∑ start_POSTSUPERSCRIPT italic_q end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∑ start_POSTSUPERSCRIPT italic_p end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( italic_K start_POSTSUPERSCRIPT italic_A end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_j italic_i end_POSTSUBSCRIPT - italic_K start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_j italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∗ ( bold_m start_POSTSUPERSCRIPT italic_A end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∗ bold_m start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , (1)

where (𝐦A𝐦B)superscript𝐦𝐴superscript𝐦𝐵\left(\mathbf{m}^{A}*\mathbf{m}^{B}\right)( bold_m start_POSTSUPERSCRIPT italic_A end_POSTSUPERSCRIPT ∗ bold_m start_POSTSUPERSCRIPT italic_B end_POSTSUPERSCRIPT ) is used to exclude pixel-pairs with intensity samples outside image bounds.

The loss is minimized by computing the gradient w.r.t. the rotation and translation parameters ϕitalic-ϕ\phiitalic_ϕ through the affine transformation and intensity interpolation. Adam optimizer is used to increase convergence speed. We implement the algorithm in Pytorch and run it on an NVIDIA RTX2070 consumer-grade GPU card. We make the code repository publicly available111https://github.com/NILOIDE/CMR_intensity_3Dmoco.git.

3 Experiments and results

As previously mentioned, we do not have any segmentation labels or contours available that could serve as a surrogate measure to evaluate the performance of the proposed method. Therefore, apart from demonstrating qualitatively that our method manages to recover the slice misalignments (Figure 2) due to patient breathing/motion, we also investigate the convergence properties of the algorithm and present quantitative results on synthetic deformations.

Dataset. We evaluate the performance of our method using 10 CMR datasets from the UK Biobank222UK Biobank Imaging Study: http://imaging.ukbiobank.ac.uk. Particularly, each dataset is comprised of a stack of 9139139-139 - 13 individual SA slices with 1.81.81.81.8mm ×\times× 1.81.81.81.8mm in-plane resolution and their corresponding 2222-chamber, 3333-chamber, and 4444-chamber LA scans with 1.81.81.81.8mm ×\times× 1.81.81.81.8mm in-plane resolution. Additionally, each scan has 50505050 time frames.

Evaluation. We use 10101010 motion-free datasets as our golden standard and we synthetically transform them using a random rigid transformation per slice and we recover the misalignment using the proposed method. In this manner, we conduct an algorithm stress-test by exercising control over the ranges. This enables us to assess the algorithm’s robustness against both small and large misalignments.

We do this at various magnitudes of uniformly sampled perturbations of ±0.0,2.5,7.5,22.5plus-or-minussuperscript0.0superscript2.5superscript7.5superscript22.5\pm 0.0^{\circ},2.5^{\circ},7.5^{\circ},22.5^{\circ}± 0.0 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , 2.5 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , 7.5 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT , 22.5 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT rotation and ±0.0,2.5,7.5,22.5plus-or-minus0.02.57.522.5\pm 0.0,2.5,7.5,22.5± 0.0 , 2.5 , 7.5 , 22.5 mm of translation around the motion-free parameters. We allow the algorithm 1000 steps to converge (approx. 30 seconds) before measuring its parameters’ absolute error. In Figure 3, we report the distribution of absolute errors before and after optimization for each misalignment interval. Figure 4 shows the post-optimization distributions of mean and max absolute errors of every run for the recovered rotation and translation parameters.

Discussion. As seen in Figure 3, our algorithm reliably converges to the motion-free parameters even under the largest tested rotation and translation misalignments. Figure 4 appears to show that our approach corrects rotations better than translations. Furthermore, the most challenging scenario is when only translations are present. We hypothesize that rotation motion is easier to correct as the intensities closer to the axes of rotation preserve enough intensity overlap to provide meaningful gradient information. This is however not the case with translation, where the entire intersection will stop displaying similarities in intensities given a large enough displacement, making gradient descent unable to discern an appropriate direction in which to shift the slices. Surprisingly, this phenomenon disappears when some rotation misalignment is also present.

4 Conclusion

This study explores the potential of intra-subject CMR slice alignment. For this reason, the proposed method involves aligning all SA and LA slices simultaneously. Unlike other approaches, our algorithm exclusively utilizes the image intensity information along slice intersections and requires no prior knowledge about the underlying anatomy. We demonstrate both quantitatively and qualitatively that the proposed method is capable of recovering a wide range of rigid transformations. However, our approach also demonstrates some limitations that we would like to address in the future. We believe it would be beneficial to utilize stochastic optimization techniques that would further allow the algorithm to overcome local minima. Moreover, we believe another beneficial addition would be to incorporate a loss function that is more robust against intensity changes across slices such as mutual information.

5 Acknowledgements

This research study was conducted retrospectively using human subject data made available in open access by the UK Biobank Resource under Application Number 87802. Ethical approval was not required as confirmed by the license attached with the open access data.

This work is funded in part by the European Research Council (ERC) project Deep4MI (884622) and the Munich Center for Machine Learning.

References

  • [1] A. G. Chandler, R. J. Pinder, T. Netsch, J. A. Schnabel, D. John Hawkes, D. L. G. Hill, and R. Razavi, “Correction of misaligned slices in multi-slice cardiovascular magnetic resonance using slice-to-volume registration,” Journal of Cardiovascular Magnetic Resonance, vol. 10, 2008.
  • [2] G. Tarroni, O. Oktay, M. Sinclair, W. Bai, A. Schuh, H. Suzuki, A. de Marvao, D. P. O’Regan, S. A. Cook, and D. Rueckert, “A comprehensive approach for learning-based fully-automated inter-slice motion correction for short-axis cine cardiac mr image stacks,” ArXiv, vol. abs/1810.02201, 2018.
  • [3] Yi Su, May-Ling Tan, Chi-Wan Lim, Soo-Kng Teo, Senthil Kumar Selvaraj, Min Wan, Liang Zhong, and Ru-San Tan, “Automatic correction of motion artifacts in 4d left ventricle model reconstructed from mri,” in Computing in Cardiology 2014, 2014.
  • [4] B. Villard, E. Zacur, E. D. Armellina, and V. Grau, “Correction of slice misalignment in multi-breath-hold cardiac mri scans,” in STACOM@MICCAI, 2016.
  • [5] J. Lötjönen, M. Pollari, S. Kivistö, and K. Lauerma, “Correction of movement artifacts from 4-d cardiac short and long-axis mr data,” in International Conference on Medical Image Computing and Computer-Assisted Intervention, 2004.
  • [6] M. Sinclair, W. Bai, E. Puyol-Antón, O. Oktay, D. Rueckert, and A. P. King, “Fully automated segmentation-based respiratory motion correction of multiplanar cardiac magnetic resonance images for large-scale datasets,” in International Conference on Medical Image Computing and Computer-Assisted Intervention, 2017.
  • [7] D. Yang, P. Wu, C. Tan, K. M. Pohl, L. Axel, and D. N. Metaxas, “3d motion modeling and reconstruction of left ventricle wall in cardiac mri,” Functional imaging and modeling of the heart (FIMH), 2017.