Intensity-based 3D motion correction for cardiac MR images

Abstract

Cardiac magnetic resonance (CMR) image acquisition requires subjects to hold their breath while 2D cine images are acquired. This process assumes that the heart remains in the same position across all slices. However, differences in breathhold positions or patient motion introduce 3D slice misalignments. In this work, we propose an algorithm that simultaneously aligns all SA and LA slices by maximizing the pair-wise intensity agreement between their intersections. Unlike previous works, our approach is formulated as a subject-specific optimization problem and requires no prior knowledge of the underlying anatomy. We quantitatively demonstrate that the proposed method is robust against a large range of rotations and translations by synthetically misaligning $10$ motion-free datasets and aligning them back using the proposed method.

Index Terms— cardiac magnetic resonance imaging, alignment, contour-free, single-subject, motion correction

1 Introduction

Cardiac magnetic resonance (CMR) imaging is typically acquired along acquisition planes aligned with the cardiac anatomy, each intended to allow the extraction of different clinically relevant measurements of target regions. Most notably, the long axis (LA) and short axis (SA) are planes that aim to visualize the left ventricle (LV) to derive longitudinal and radial motion information, respectively. Although the acquisition of slices is typically performed in a fixed scanner coordinate system, each slice is an independent acquisition originating from a separate breathhold. This makes anatomical misalignments between slices a common occurrence due to variations in the heart’s location across acquisitions.

Refer to caption — Fig. 1: Diagram of resulting optimization on slice alignment. Ventricle contours are hand-drawn for illustrative purposes. Top row: LA 4-chamber image with intersections lines of SA slices with original orientations (left/red) and optimized orientations (right/green). Bottom row: Nearest neighbour interpolation of LA 4-chamber view from SA slices with original orientations (left) and optimized orientations (right).

Acquisition planes. The long axis is the line that passes from the center of the mitral valve to the apex of the heart. The LV myocardium typically has three long-axis imaging planes: 2-chamber, 3-chamber, and 4-chamber views. These are named after the number of ventricles and atria they intersect and are meant to capture different sections of the myocardium. The short axis is perpendicular to the long axis, visualizing myocardial cross-sections at various ventricular steps. These slices are stacked for volumetric measurements.

Motion correction. Considerable research efforts using image registration have been made to correct these misalignments [1, 2, 3, 4, 5, 6, 7]. In [1], Chandler et al. aim to correct misaligned cardiac anatomy in multi-slice SA images by rigidly registering stacks of two slices to a high-resolution 3D MR axial cardiac volume. Although they demonstrate good results in synthetic and real misaligned data, the method requires a high-resolution 3D cardiac volume, which is not always available.

Another approach is to utilize shape information [3, 2, 6, 7]. Su et al. [3] combine 3D meshes and apply the constraint that the epicardial surface must remain smooth to generate a time-series model of the heart. The 3D meshes are created from border-delineated MRI data at every time frame of the cardiac cycle. Tarroni et al. [2] corrects inter-slice respiratory motion in SA CMR image stacks utilizing probabilistic segmentation maps (PSMs) of the left ventricular (LV) cavity generated with hybrid decision forests. PSMs are generated for each slice of the SA stack and rigidly registered in-plane to a target PSM. Both these methods utilize shape information that is not always available or might not generalize outside the intended subject cohort, limiting the methods’ applicability. A more recent approach [4] proposes a slice-to-slice group-registration framework based on the similarity of the local phase vectors of the images. As registration is notorious for being susceptible to local minima, they fix all slices and only allow one slice to ”move” at a time.

Contributions. In this work, we propose a method to mitigate the effect of inter-slice motion for all SA and LA slices simultaneously by optimizing the 3D rotation and translation parameters on sampled intensities along slice intersections. Unlike other approaches, our approach exclusively utilizes the image intensity information, which requires no prior knowledge about the underlying anatomy. We demonstrate that our GPU-accelerated approach can reliably converge in seconds despite large perturbations on the initial alignment parameters. Our contributions can be summarized as follows:

•

We propose a solution for motion correction in CMR that jointly aligns all SA and LA slices simultaneously.
•

We utilize all SA-LA and LA-LA intersections and minimize the intensity differences along the intersection lines without incorporating any anatomical information.
•

We evaluate our method quantitatively recovering synthetic rigid deformations using aligned CMR scans as a golden standard.

2 Method

With this method, we try to address SA-LA and LA-LA misalignments due to patient motion in the scanner with respect to their anatomy. To do that, we assume that any deformation applied to a slice during acquisition is strictly rigid. More precisely, our approach corrects for $3D$ rotation and translation while it does not model scaling or shearing deformations. Our rigid rotation and translation matrices are defined as $R(\boldsymbol{\theta})$ and $T(\boldsymbol{t})\>$ , where $\boldsymbol{\theta}$ and $\boldsymbol{t}$ are 3D vectors corresponding to rotation angles and translation coefficients for a given slice.

When two intersecting slices are perfectly aligned, a metric computing similarity of the intensities along the intersection line of the two planes should be at a global maximum. We formulate the alignment problem as a constraint optimization of parameters $\phi\in\mathbb{R}^{N\times 6}$ that aims to jointly minimize the pairwise intensity differences at slice intersections (where $N$ is the number of slices and 6 is the total rotation and translation deformation parameters in 3D). We use the summation of pair-wise L2 intensity differences as a similarity metric between the intensities along slice intersections, which minimizes the influence of shifts in intensities across slices.

The set of slice pairs is based on all possible LA-LA slice combinations and LA-SA slice combinations of a given subject. We do not consider SA-SA combinations since their planes do not overlap significantly. For every slice pair $(A,B)$ , we compute their plane equations given their current rotation and translation parameters and thereby obtain the intersection line $d^{AB}$ . We define a sampling center $\boldsymbol{c}$ by finding the closest points $\boldsymbol{c^{A}}$ and $\boldsymbol{c^{B}}$ along line $d^{AB}$ to the centers of images $A$ and $B$ , and subsequently taking the mid-point of the two. Using $\boldsymbol{c}$ , we can define a sampling segment $K\in\mathbb{R}^{p\times 3}$ in 3D space, where $p$ is the number of sampling points. We set $p$ to 100 samples (50 points in each direction of $\boldsymbol{c}$ along $d^{AB}$ ) and the distance between samples to be 5 millimeters in world space.

We obtain sampling segment intensities $K^{A}$ and $K^{B}$ by projecting the sampling segment onto both image planes and bi-linearly interpolating pixel intensities at each coordinate. We do this for all $q$ time frames in the 2D+time slice, making $K^{A}$ and $K^{B}$ matrices of size $q\times p$ . For each sampling line, we also create mask vectors $\mathbf{m}^{A},\mathbf{m}^{B}\in\{0,1\}$ to denote whether a sample is within the bounds of a given image.

The loss is the sum of L2 intensity differences between pair-wise sampled lines across all point pairs and time frames:

\mathcal{L}\left(\phi\right)=\sum_{A,B}\sum^{q}_{j}\sum^{p}_{i}\left(K^{A}_{ji% }-K^{B}_{ji}\right)^{2}*\left(\mathbf{m}^{A}_{i}*\mathbf{m}^{B}_{i}\right),

(1)

where $\left(\mathbf{m}^{A}*\mathbf{m}^{B}\right)$ is used to exclude pixel-pairs with intensity samples outside image bounds.

The loss is minimized by computing the gradient w.r.t. the rotation and translation parameters $\phi$ through the affine transformation and intensity interpolation. Adam optimizer is used to increase convergence speed. We implement the algorithm in Pytorch and run it on an NVIDIA RTX2070 consumer-grade GPU card. We make the code repository publicly available¹¹1https://github.com/NILOIDE/CMR_intensity_3Dmoco.git.

3 Experiments and results

As previously mentioned, we do not have any segmentation labels or contours available that could serve as a surrogate measure to evaluate the performance of the proposed method. Therefore, apart from demonstrating qualitatively that our method manages to recover the slice misalignments (Figure 2) due to patient breathing/motion, we also investigate the convergence properties of the algorithm and present quantitative results on synthetic deformations.

Dataset. We evaluate the performance of our method using 10 CMR datasets from the UK Biobank²²2UK Biobank Imaging Study: http://imaging.ukbiobank.ac.uk. Particularly, each dataset is comprised of a stack of $9-13$ individual SA slices with $1.8$ mm $\times$ $1.8$ mm in-plane resolution and their corresponding $2$ -chamber, $3$ -chamber, and $4$ -chamber LA scans with $1.8$ mm $\times$ $1.8$ mm in-plane resolution. Additionally, each scan has $50$ time frames.

Evaluation. We use $10$ motion-free datasets as our golden standard and we synthetically transform them using a random rigid transformation per slice and we recover the misalignment using the proposed method. In this manner, we conduct an algorithm stress-test by exercising control over the ranges. This enables us to assess the algorithm’s robustness against both small and large misalignments.

We do this at various magnitudes of uniformly sampled perturbations of $\pm 0.0^{\circ},2.5^{\circ},7.5^{\circ},22.5^{\circ}$ rotation and $\pm 0.0,2.5,7.5,22.5$ mm of translation around the motion-free parameters. We allow the algorithm 1000 steps to converge (approx. 30 seconds) before measuring its parameters’ absolute error. In Figure 3, we report the distribution of absolute errors before and after optimization for each misalignment interval. Figure 4 shows the post-optimization distributions of mean and max absolute errors of every run for the recovered rotation and translation parameters.

Discussion. As seen in Figure 3, our algorithm reliably converges to the motion-free parameters even under the largest tested rotation and translation misalignments. Figure 4 appears to show that our approach corrects rotations better than translations. Furthermore, the most challenging scenario is when only translations are present. We hypothesize that rotation motion is easier to correct as the intensities closer to the axes of rotation preserve enough intensity overlap to provide meaningful gradient information. This is however not the case with translation, where the entire intersection will stop displaying similarities in intensities given a large enough displacement, making gradient descent unable to discern an appropriate direction in which to shift the slices. Surprisingly, this phenomenon disappears when some rotation misalignment is also present.

4 Conclusion

This study explores the potential of intra-subject CMR slice alignment. For this reason, the proposed method involves aligning all SA and LA slices simultaneously. Unlike other approaches, our algorithm exclusively utilizes the image intensity information along slice intersections and requires no prior knowledge about the underlying anatomy. We demonstrate both quantitatively and qualitatively that the proposed method is capable of recovering a wide range of rigid transformations. However, our approach also demonstrates some limitations that we would like to address in the future. We believe it would be beneficial to utilize stochastic optimization techniques that would further allow the algorithm to overcome local minima. Moreover, we believe another beneficial addition would be to incorporate a loss function that is more robust against intensity changes across slices such as mutual information.

5 Acknowledgements

This research study was conducted retrospectively using human subject data made available in open access by the UK Biobank Resource under Application Number 87802. Ethical approval was not required as confirmed by the license attached with the open access data.

This work is funded in part by the European Research Council (ERC) project Deep4MI (884622) and the Munich Center for Machine Learning.

References

[1] A. G. Chandler, R. J. Pinder, T. Netsch, J. A. Schnabel, D. John Hawkes, D. L. G. Hill, and R. Razavi, “Correction of misaligned slices in multi-slice cardiovascular magnetic resonance using slice-to-volume registration,” Journal of Cardiovascular Magnetic Resonance, vol. 10, 2008.
[2] G. Tarroni, O. Oktay, M. Sinclair, W. Bai, A. Schuh, H. Suzuki, A. de Marvao, D. P. O’Regan, S. A. Cook, and D. Rueckert, “A comprehensive approach for learning-based fully-automated inter-slice motion correction for short-axis cine cardiac mr image stacks,” ArXiv, vol. abs/1810.02201, 2018.
[3] Yi Su, May-Ling Tan, Chi-Wan Lim, Soo-Kng Teo, Senthil Kumar Selvaraj, Min Wan, Liang Zhong, and Ru-San Tan, “Automatic correction of motion artifacts in 4d left ventricle model reconstructed from mri,” in Computing in Cardiology 2014, 2014.
[4] B. Villard, E. Zacur, E. D. Armellina, and V. Grau, “Correction of slice misalignment in multi-breath-hold cardiac mri scans,” in STACOM@MICCAI, 2016.
[5] J. Lötjönen, M. Pollari, S. Kivistö, and K. Lauerma, “Correction of movement artifacts from 4-d cardiac short and long-axis mr data,” in International Conference on Medical Image Computing and Computer-Assisted Intervention, 2004.
[6] M. Sinclair, W. Bai, E. Puyol-Antón, O. Oktay, D. Rueckert, and A. P. King, “Fully automated segmentation-based respiratory motion correction of multiplanar cardiac magnetic resonance images for large-scale datasets,” in International Conference on Medical Image Computing and Computer-Assisted Intervention, 2017.
[7] D. Yang, P. Wu, C. Tan, K. M. Pohl, L. Axel, and D. N. Metaxas, “3d motion modeling and reconstruction of left ventricle wall in cardiac mri,” Functional imaging and modeling of the heart (FIMH), 2017.