Skip to main content

Showing 1–1 of 1 results for author: Yaacov, H B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2112.10769  [pdf, other

    cs.LG

    Accurate Neural Training with 4-bit Matrix Multiplications at Standard Formats

    Authors: Brian Chmiel, Ron Banner, Elad Hoffer, Hilla Ben Yaacov, Daniel Soudry

    Abstract: Quantization of the weights and activations is one of the main methods to reduce the computational footprint of Deep Neural Networks (DNNs) training. Current methods enable 4-bit quantization of the forward phase. However, this constitutes only a third of the training process. Reducing the computational footprint of the entire training process requires the quantization of the neural gradients, i.e… ▽ More

    Submitted 9 June, 2024; v1 submitted 19 December, 2021; originally announced December 2021.