Showing 1–1 of 1 results for author: Arimond, R

  1. arXiv:2403.19882 [pdf, other]

    eess.IV cs.CV cs.LG

    Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights

    Authors: Moein Heidari, Reza Azad, Sina Ghorbani Kolahi, René Arimond, Leon Niggemeier, Alaa Sulaiman, Afshin Bozorgpour, Ehsan Khodapanah Aghdam, Amirhossein Kazerouni, Ilker Hacihaliloglu, Dorit Merhof

    Abstract: Intrigued by the inherent ability of the human visual system to identify salient regions in complex scenes, attention mechanisms have been seamlessly integrated into various Computer Vision (CV) tasks. Building upon this paradigm, Vision Transformer (ViT) networks exploit attention mechanisms for improved efficiency. This review navigates the landscape of redesigned attention mechanisms within ViT…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Submitted to Computational Visual Media Journal