Skip to main content

Showing 1–1 of 1 results for author: Copening, H G

.
  1. arXiv:2310.03971  [pdf, other

    cs.CL cs.AR

    Quantized Transformer Language Model Implementations on Edge Devices

    Authors: Mohammad Wali Ur Rahman, Murad Mehrab Abrar, Hunter Gibbons Copening, Salim Hariri, Sicong Shao, Pratik Satam, Soheil Salehi

    Abstract: Large-scale transformer-based models like the Bidirectional Encoder Representations from Transformers (BERT) are widely used for Natural Language Processing (NLP) applications, wherein these models are initially pre-trained with a large corpus with millions of parameters and then fine-tuned for a downstream NLP task. One of the major limitations of these large-scale models is that they cannot be d… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: Accepted for publication on 22nd International Conference of Machine Learning and Applications, ICMLA 2023