Skip to main content

Showing 1–2 of 2 results for author: Kolahi, S G

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.03430  [pdf, other

    eess.IV cs.CV

    Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis

    Authors: Moein Heidari, Sina Ghorbani Kolahi, Sanaz Karimijafarbigloo, Bobby Azad, Afshin Bozorgpour, Soheila Hatami, Reza Azad, Ali Diba, Ulas Bagci, Dorit Merhof, Ilker Hacihaliloglu

    Abstract: Sequence modeling plays a vital role across various domains, with recurrent neural networks being historically the predominant method of performing these tasks. However, the emergence of transformers has altered this paradigm due to their superior performance. Built upon these advances, transformers have conjoined CNNs as two leading foundational models for learning visual representations. However… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: This is the first version of our survey, and the paper is currently under review

  2. arXiv:2403.19882  [pdf, other

    eess.IV cs.CV cs.LG

    Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights

    Authors: Moein Heidari, Reza Azad, Sina Ghorbani Kolahi, René Arimond, Leon Niggemeier, Alaa Sulaiman, Afshin Bozorgpour, Ehsan Khodapanah Aghdam, Amirhossein Kazerouni, Ilker Hacihaliloglu, Dorit Merhof

    Abstract: Intrigued by the inherent ability of the human visual system to identify salient regions in complex scenes, attention mechanisms have been seamlessly integrated into various Computer Vision (CV) tasks. Building upon this paradigm, Vision Transformer (ViT) networks exploit attention mechanisms for improved efficiency. This review navigates the landscape of redesigned attention mechanisms within ViT… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Submitted to Computational Visual Media Journal