Prompt-Based Tuning of Transformer Models for Multi-Center Medical Image Segmentation of Head and Neck Cancer

Saeed, Numan; Ridzuan, Muhammad; Majzoub, Roba Al; Yaqub, Mohammad

Computer Science > Computer Vision and Pattern Recognition

arXiv:2305.18948 (cs)

[Submitted on 30 May 2023 (v1), last revised 2 Aug 2023 (this version, v2)]

Title:Prompt-Based Tuning of Transformer Models for Multi-Center Medical Image Segmentation of Head and Neck Cancer

Authors:Numan Saeed, Muhammad Ridzuan, Roba Al Majzoub, Mohammad Yaqub

View PDF

Abstract:Medical image segmentation is a vital healthcare endeavor requiring precise and efficient models for appropriate diagnosis and treatment. Vision transformer (ViT)-based segmentation models have shown great performance in accomplishing this task. However, to build a powerful backbone, the self-attention block of ViT requires large-scale pre-training data. The present method of modifying pre-trained models entails updating all or some of the backbone parameters. This paper proposes a novel fine-tuning strategy for adapting a pretrained transformer-based segmentation model on data from a new medical center. This method introduces a small number of learnable parameters, termed prompts, into the input space (less than 1\% of model parameters) while kee** the rest of the model parameters frozen. Extensive studies employing data from new unseen medical centers show that the prompt-based fine-tuning of medical segmentation models provides excellent performance regarding the new-center data with a negligible drop regarding the old centers. Additionally, our strategy delivers great accuracy with minimum re-training on new-center data, significantly decreasing the computational and time costs of fine-tuning pre-trained models.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2305.18948 [cs.CV]
	(or arXiv:2305.18948v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2305.18948

Submission history

From: Numan Saeed [view email]
[v1] Tue, 30 May 2023 11:26:52 UTC (1,934 KB)
[v2] Wed, 2 Aug 2023 07:49:41 UTC (4,202 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Prompt-Based Tuning of Transformer Models for Multi-Center Medical Image Segmentation of Head and Neck Cancer

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Prompt-Based Tuning of Transformer Models for Multi-Center Medical Image Segmentation of Head and Neck Cancer

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators