Showing 1–4 of 4 results for author: Shkolnik, M

  1. arXiv:2306.10598  [pdf, other]

    cs.LG

    DropCompute: simple and more robust distributed synchronous training via compute variance reduction

    Authors: Niv Giladi, Shahar Gottlieb, Moran Shkolnik, Asaf Karnieli, Ron Banner, Elad Hoffer, Kfir Yehuda Levy, Daniel Soudry

    Abstract: Background: Distributed training is essential for large scale training of deep neural networks (DNNs). The dominant methods for large scale DNN training are synchronous (e.g. All-Reduce), but these require waiting for all workers in each step. Thus, these methods are limited by the delays caused by straggling workers. Results: We study a typical scenario in which workers are straggling due to vari…

    Submitted 24 September, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

    Comments: https://github.com/paper-submissions/dropcompute

    Journal ref: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  2. arXiv:2006.08173  [pdf, other]

    cs.CV cs.LG

    Neural gradients are near-lognormal: improved quantized and sparse training

    Authors: Brian Chmiel, Liad Ben-Uri, Moran Shkolnik, Elad Hoffer, Ron Banner, Daniel Soudry

    Abstract: While training can mostly be accelerated by reducing the time needed to propagate neural gradients back throughout the model, most previous works focus on the quantization/pruning of weights and activations. These methods are often not applicable to neural gradients, which have very different statistical properties. Distinguished from weights and activations, we find that the distribution of neura…

    Submitted 12 October, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

  3. arXiv:2002.07686  [pdf, other]

    cs.LG cs.CV stat.ML

    Robust Quantization: One Model to Rule Them All

    Authors: Moran Shkolnik, Brian Chmiel, Ron Banner, Gil Shomron, Yury Nahshan, Alex Bronstein, Uri Weiser

    Abstract: Neural network quantization methods often involve simulating the quantization process during training, making the trained model highly dependent on the target bit-width and precise way quantization is performed. Robust quantization offers an alternative approach with improved tolerance to different classes of data-types and quantization policies. It opens up new exciting applications where the qua…

    Submitted 22 October, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

  4. arXiv:1909.07636  [pdf, other]

    cs.CV

    Thanks for Nothing: Predicting Zero-Valued Activations with Lightweight Convolutional Neural Networks

    Authors: Gil Shomron, Ron Banner, Moran Shkolnik, Uri Weiser

    Abstract: Convolutional neural networks (CNNs) introduce state-of-the-art results for various tasks with the price of high computational demands. Inspired by the observation that spatial correlation exists in CNN output feature maps (ofms), we propose a method to dynamically predict whether ofm activations are zero-valued or not according to their neighboring activation values, thereby avoiding zero-valued…

    Submitted 13 July, 2020; v1 submitted 17 September, 2019; originally announced September 2019.