Showing 1–1 of 1 results for author: Chitty-Venkata, K T

  1. arXiv:2307.07982  [pdf, other]

    cs.LG cs.AR cs.CL cs.CV

    A Survey of Techniques for Optimizing Transformer Inference

    Authors: Krishna Teja Chitty-Venkata, Sparsh Mittal, Murali Emani, Venkatram Vishwanath, Arun K. Somani

    Abstract: Recent years have seen a phenomenal rise in the performance and applications of transformer neural networks. The family of transformer networks, including Bidirectional Encoder Representations from Transformers (BERT), Generative Pretrained Transformer (GPT), and Vision Transformer (ViT), has shown its effectiveness across Natural Language Processing (NLP) and Computer Vision (CV) domains. Transforme…

    Submitted 16 July, 2023; originally announced July 2023.