Skip to main content

Showing 1–1 of 1 results for author: Mozafari, S H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.01169  [pdf, ps, other

    cs.CV cs.AI

    Faster Inference of Integer SWIN Transformer by Removing the GELU Activation

    Authors: Mohammadreza Tayaranian, Seyyed Hasan Mozafari, James J. Clark, Brett Meyer, Warren Gross

    Abstract: SWIN transformer is a prominent vision transformer model that has state-of-the-art accuracy in image classification tasks. Despite this success, its unique architecture causes slower inference compared with similar deep neural networks. Integer quantization of the model is one of the methods used to improve its inference latency. However, state-of-the-art has not been able to fully quantize the mo… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 5 pages, 1 figure. Submitted to Edge Intelligence Workshop III, an AAAI 2024 workshop