Skip to main content

Showing 1–12 of 12 results for author: Aghdam, E K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.19882  [pdf, other

    eess.IV cs.CV cs.LG

    Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights

    Authors: Moein Heidari, Reza Azad, Sina Ghorbani Kolahi, René Arimond, Leon Niggemeier, Alaa Sulaiman, Afshin Bozorgpour, Ehsan Khodapanah Aghdam, Amirhossein Kazerouni, Ilker Hacihaliloglu, Dorit Merhof

    Abstract: Intrigued by the inherent ability of the human visual system to identify salient regions in complex scenes, attention mechanisms have been seamlessly integrated into various Computer Vision (CV) tasks. Building upon this paradigm, Vision Transformer (ViT) networks exploit attention mechanisms for improved efficiency. This review navigates the landscape of redesigned attention mechanisms within ViT… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Submitted to Computational Visual Media Journal

  2. arXiv:2309.00121  [pdf, other

    cs.CV

    Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image Segmentation

    Authors: Reza Azad, Leon Niggemeier, Michael Huttemann, Amirhossein Kazerouni, Ehsan Khodapanah Aghdam, Yury Velichko, Ulas Bagci, Dorit Merhof

    Abstract: Medical image segmentation has seen significant improvements with transformer models, which excel in gras** far-reaching contexts and global contextual information. However, the increasing computational demands of these models, proportional to the squared token count, limit their depth and resolution capabilities. Most current methods process D volumetric image data slice-by-slice (called pseudo… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

  3. arXiv:2309.00108  [pdf, other

    cs.CV

    Laplacian-Former: Overcoming the Limitations of Vision Transformers in Local Texture Detection

    Authors: Reza Azad, Amirhossein Kazerouni, Babak Azad, Ehsan Khodapanah Aghdam, Yury Velichko, Ulas Bagci, Dorit Merhof

    Abstract: Vision Transformer (ViT) models have demonstrated a breakthrough in a wide range of computer vision tasks. However, compared to the Convolutional Neural Network (CNN) models, it has been observed that the ViT models struggle to capture high-frequency components of images, which can limit their ability to detect local textures and edge information. As abnormalities in human tissue, such as tumors a… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

    Comments: Accepted in the main conference MICCAI 2023

  4. arXiv:2308.13442  [pdf, other

    cs.CV

    Unlocking Fine-Grained Details with Wavelet-based High-Frequency Enhancement in Transformers

    Authors: Reza Azad, Amirhossein Kazerouni, Alaa Sulaiman, Afshin Bozorgpour, Ehsan Khodapanah Aghdam, Abin Jose, Dorit Merhof

    Abstract: Medical image segmentation is a critical task that plays a vital role in diagnosis, treatment planning, and disease monitoring. Accurate segmentation of anatomical structures and abnormalities from medical images can aid in the early detection and treatment of various diseases. In this paper, we address the local feature deficiency of the Transformer model by carefully re-designing the self-attent… ▽ More

    Submitted 12 September, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: Accepted in MICCAI 2023 workshop MLMI

    Journal ref: MICCAI 2023 workshop

  5. arXiv:2301.10847  [pdf, other

    cs.CV

    Enhancing Medical Image Segmentation with TransCeption: A Multi-Scale Feature Fusion Approach

    Authors: Reza Azad, Yiwei Jia, Ehsan Khodapanah Aghdam, Julien Cohen-Adad, Dorit Merhof

    Abstract: While CNN-based methods have been the cornerstone of medical image segmentation due to their promising performance and robustness, they suffer from limitations in capturing long-range dependencies. Transformer-based approaches are currently prevailing since they enlarge the reception field to model global contextual correlation. To further extract rich representations, some extensions of the U-Net… ▽ More

    Submitted 25 January, 2023; originally announced January 2023.

    Comments: Submitted to IEEE TMI Journal

  6. arXiv:2301.03505  [pdf, other

    cs.CV

    Advances in Medical Image Analysis with Vision Transformers: A Comprehensive Review

    Authors: Reza Azad, Amirhossein Kazerouni, Moein Heidari, Ehsan Khodapanah Aghdam, Amirali Molaei, Yiwei Jia, Abin Jose, Rijo Roy, Dorit Merhof

    Abstract: The remarkable performance of the Transformer architecture in natural language processing has recently also triggered broad interest in Computer Vision. Among other merits, Transformers are witnessed as capable of learning long-range dependencies and spatial correlations, which is a clear advantage over convolutional neural networks (CNNs), which have been the de facto standard in Computer Vision… ▽ More

    Submitted 5 November, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: https://www.sciencedirect.com/science/article/abs/pii/S1361841523002608

  7. arXiv:2212.13504  [pdf, other

    cs.CV

    DAE-Former: Dual Attention-guided Efficient Transformer for Medical Image Segmentation

    Authors: Reza Azad, René Arimond, Ehsan Khodapanah Aghdam, Amirhossein Kazerouni, Dorit Merhof

    Abstract: Transformers have recently gained attention in the computer vision domain due to their ability to model long-range dependencies. However, the self-attention mechanism, which is the core part of the Transformer model, usually suffers from quadratic computational complexity with respect to the number of tokens. Many architectures attempt to reduce model complexity by limiting the self-attention mech… ▽ More

    Submitted 26 July, 2023; v1 submitted 27 December, 2022; originally announced December 2022.

    Comments: MICCAI 2023 PRIME workshop

  8. arXiv:2211.14830  [pdf, other

    eess.IV cs.CV

    Medical Image Segmentation Review: The success of U-Net

    Authors: Reza Azad, Ehsan Khodapanah Aghdam, Amelie Rauland, Yiwei Jia, Atlas Haddadi Avval, Afshin Bozorgpour, Sanaz Karimijafarbigloo, Joseph Paul Cohen, Ehsan Adeli, Dorit Merhof

    Abstract: Automatic medical image segmentation is a crucial topic in the medical domain and successively a critical counterpart in the computer-aided diagnosis paradigm. U-Net is the most widespread image segmentation architecture due to its flexibility, optimized modular design, and success in all medical image modalities. Over the years, the U-Net model achieved tremendous attention from academic and indu… ▽ More

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: Submitted to the IEEE Transactions on Pattern Analysis and Machine Intelligence Journal

  9. arXiv:2211.07804  [pdf, other

    eess.IV cs.CV

    Diffusion Models for Medical Image Analysis: A Comprehensive Survey

    Authors: Amirhossein Kazerouni, Ehsan Khodapanah Aghdam, Moein Heidari, Reza Azad, Mohsen Fayyaz, Ilker Hacihaliloglu, Dorit Merhof

    Abstract: Denoising diffusion models, a class of generative models, have garnered immense interest lately in various deep-learning problems. A diffusion probabilistic model defines a forward diffusion stage where the input data is gradually perturbed over several steps by adding Gaussian noise and then learns to reverse the diffusion process to retrieve the desired noise-free data from noisy data samples. D… ▽ More

    Submitted 3 June, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: Third revision: including more papers and further discussions

  10. arXiv:2210.16898  [pdf, other

    eess.IV cs.CV cs.LG

    Attention Swin U-Net: Cross-Contextual Attention Mechanism for Skin Lesion Segmentation

    Authors: Ehsan Khodapanah Aghdam, Reza Azad, Maral Zarvani, Dorit Merhof

    Abstract: Melanoma is caused by the abnormal growth of melanocytes in human skin. Like other cancers, this life-threatening skin cancer can be treated with early diagnosis. To support a diagnosis by automatic skin lesion segmentation, several Fully Convolutional Network (FCN) approaches, specifically the U-Net architecture, have been proposed. The U-Net model with a symmetrical architecture has exhibited su… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

  11. arXiv:2208.00713  [pdf, other

    eess.IV cs.CV cs.LG

    TransDeepLab: Convolution-Free Transformer-based DeepLab v3+ for Medical Image Segmentation

    Authors: Reza Azad, Moein Heidari, Moein Shariatnia, Ehsan Khodapanah Aghdam, Sanaz Karimijafarbigloo, Ehsan Adeli, Dorit Merhof

    Abstract: Convolutional neural networks (CNNs) have been the de facto standard in a diverse set of computer vision tasks for many years. Especially, deep neural networks based on seminal architectures such as U-shaped models with skip-connections or atrous convolution with pyramid pooling have been tailored to a wide range of medical image analysis tasks. The main advantage of such architectures is that the… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

  12. arXiv:2207.08518  [pdf, other

    cs.CV cs.AI

    HiFormer: Hierarchical Multi-scale Representations Using Transformers for Medical Image Segmentation

    Authors: Moein Heidari, Amirhossein Kazerouni, Milad Soltany, Reza Azad, Ehsan Khodapanah Aghdam, Julien Cohen-Adad, Dorit Merhof

    Abstract: Convolutional neural networks (CNNs) have been the consensus for medical image segmentation tasks. However, they suffer from the limitation in modeling long-range dependencies and spatial correlations due to the nature of convolution operation. Although transformers were first developed to address this issue, they fail to capture low-level features. In contrast, it is demonstrated that both local… ▽ More

    Submitted 9 January, 2023; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: The accepted version of the paper at WACV 2023

    Journal ref: WACV 2023