Showing 1–1 of 1 results for author: Dura, D

  1. arXiv:2304.03986

    cs.LG

    SwiftTron: An Efficient Hardware Accelerator for Quantized Transformers

    Authors: Alberto Marchisio, Davide Dura, Maurizio Capra, Maurizio Martina, Guido Masera, Muhammad Shafique

    Abstract: Transformers' compute-intensive operations pose enormous challenges for their deployment in resource-constrained EdgeAI / tinyML devices. As an established neural network compression technique, quantization reduces the hardware computational and memory resources. In particular, fixed-point quantization is desirable to ease the computations using lightweight blocks, like adders and multipliers, of…

    Submitted 25 April, 2023; v1 submitted 8 April, 2023; originally announced April 2023.

    Comments: To appear at the 2023 International Joint Conference on Neural Networks (IJCNN), Queensland, Australia, June 2023