Skip to main content

Showing 1–1 of 1 results for author: Terentev, E

Searching in archive cs. Search in all archives.
.
  1. Fast Adjustable Threshold For Uniform Neural Network Quantization (Winning solution of LPIRC-II)

    Authors: Alexander Goncharenko, Andrey Denisov, Sergey Alyamkin, Evgeny Terentev

    Abstract: Neural network quantization procedure is the necessary step for porting of neural networks to mobile devices. Quantization allows accelerating the inference, reducing memory consumption and model size. It can be performed without fine-tuning using calibration procedure (calculation of parameters necessary for quantization), or it is possible to train the network with quantization from scratch. Tra… ▽ More

    Submitted 18 June, 2019; v1 submitted 19 December, 2018; originally announced December 2018.