Showing 1–1 of 1 result for author: Rakhmangulova, Y

  1. arXiv:2211.05187  [pdf, other]

    cs.CV

    Training a Vision Transformer from scratch in less than 24 hours with 1 GPU

    Authors: Saghar Irandoust, Thibaut Durand, Yunduz Rakhmangulova, Wenjie Zi, Hossein Hajimirsadeghi

    Abstract: Transformers have become central to recent advances in computer vision. However, training a vision Transformer (ViT) model from scratch can be resource intensive and time consuming. In this paper, we aim to explore approaches to reduce the training costs of ViT models. We introduce some algorithmic improvements to enable training a ViT model from scratch with limited hardware (1 GPU) and time (24…

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: 7 pages, 2 figures, 1 table, published in "Has it Trained Yet? Workshop at the Conference on Neural Information Processing Systems (NeurIPS 2022)"

    ACM Class: I.2.10