Skip to main content

Showing 1–1 of 1 results for author: Rahimian, A K

.
  1. arXiv:2406.19391  [pdf, other

    cs.CV

    Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads

    Authors: Ali Khaleghi Rahimian, Manish Kumar Govind, Subhajit Maity, Dominick Reilly, Christian Kümmerle, Srijan Das, Aritra Dutta

    Abstract: Visual perception tasks are predominantly solved by Vision Transformer (ViT) architectures, which, despite their effectiveness, encounter a computational bottleneck due to the quadratic complexity of computing self-attention. This inefficiency is largely due to the self-attention heads capturing redundant token interactions, reflecting inherent redundancy within visual data. Many works have aimed… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: The code is publicly available at https://github.com/Charlotte-CharMLab/Fibottention