Skip to main content

Showing 1–1 of 1 results for author: Kakkar, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.20735  [pdf, other

    cs.CV

    Language Augmentation in CLIP for Improved Anatomy Detection on Multi-modal Medical Images

    Authors: Mansi Kakkar, Dattesh Shanbhag, Chandan Aladahalli, Gurunath Reddy M

    Abstract: Vision-language models have emerged as a powerful tool for previously challenging multi-modal classification problem in the medical domain. This development has led to the exploration of automated image description generation for multi-modal clinical scans, particularly for radiology report generation. Existing research has focused on clinical descriptions for specific modalities or body regions,… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: $©$ 2024 IEEE. Accepted in 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) 2024