Soft Masked Mamba Diffusion Model for CT to MRI Conversion

Wang, Zhenbin; Zhang, Lei; Wang, Lituan; Zhang, Zhenwei

Computer Science > Computer Vision and Pattern Recognition

arXiv:2406.15910 (cs)

[Submitted on 22 Jun 2024]

Title:Soft Masked Mamba Diffusion Model for CT to MRI Conversion

Authors:Zhenbin Wang, Lei Zhang, Lituan Wang, Zhenwei Zhang

View PDF HTML (experimental)

Abstract:Magnetic Resonance Imaging (MRI) and Computed Tomography (CT) are the predominant modalities utilized in the field of medical imaging. Although MRI capture the complexity of anatomical structures with greater detail than CT, it entails a higher financial costs and requires longer image acquisition times. In this study, we aim to train latent diffusion model for CT to MRI conversion, replacing the commonly-used U-Net or Transformer backbone with a State-Space Model (SSM) called Mamba that operates on latent patches. First, we noted critical oversights in the scan scheme of most Mamba-based vision methods, including inadequate attention to the spatial continuity of patch tokens and the lack of consideration for their varying importance to the target task. Secondly, extending from this insight, we introduce Diffusion Mamba (DiffMa), employing soft masked to integrate Cross-Sequence Attention into Mamba and conducting selective scan in a spiral manner. Lastly, extensive experiments demonstrate impressive performance by DiffMa in medical image generation tasks, with notable advantages in input scaling efficiency over existing benchmark models. The code and models are available at this https URL

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.15910 [cs.CV]
	(or arXiv:2406.15910v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.15910

Submission history

From: Zhenbin Wang [view email]
[v1] Sat, 22 Jun 2024 18:06:50 UTC (2,303 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Soft Masked Mamba Diffusion Model for CT to MRI Conversion

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Soft Masked Mamba Diffusion Model for CT to MRI Conversion

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators