Skip to main content

Showing 1–1 of 1 results for author: Safonov, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.14024  [pdf, other

    cs.CV cs.AI

    Two Heads are Better Than One: Neural Networks Quantization with 2D Hilbert Curve-based Output Representation

    Authors: Mykhailo Uss, Ruslan Yermolenko, Olena Kolodiazhna, Oleksii Shashko, Ivan Safonov, Volodymyr Savin, Yoonjae Yeo, Seowon Ji, Jaeyun Jeong

    Abstract: Quantization is widely used to increase deep neural networks' (DNN) memory, computation, and power efficiency. Various techniques, such as post-training quantization and quantization-aware training, have been proposed to improve quantization quality. We introduce a novel approach for DNN quantization that uses a redundant representation of DNN's output. We represent the target quantity as a point… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 18 pages, 10 figures