Skip to main content

Showing 1–1 of 1 results for author: Shlyazhko, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2202.10435  [pdf, ps, other

    cs.LG cs.AI

    Survey on Large Scale Neural Network Training

    Authors: Julia Gusak, Daria Cherniuk, Alena Shilova, Alexander Katrutsa, Daniel Bershatsky, Xunyi Zhao, Lionel Eyraud-Dubois, Oleg Shlyazhko, Denis Dimitrov, Ivan Oseledets, Olivier Beaumont

    Abstract: Modern Deep Neural Networks (DNNs) require significant memory to store weight, activations, and other intermediate tensors during training. Hence, many models do not fit one GPU device or can be trained using only a small per-GPU batch size. This survey provides a systematic overview of the approaches that enable more efficient DNNs training. We analyze techniques that save memory and make good us… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.