
Showing 1–1 of 1 results for author: Torisawa, K

  1. arXiv:2103.16063  [pdf, ps, other]

    cs.LG cs.DC

    Automatic Graph Partitioning for Very Large-scale Deep Learning

    Authors: Masahiro Tanaka, Kenjiro Taura, Toshihiro Hanawa, Kentaro Torisawa

    Abstract: This work proposes RaNNC (Rapid Neural Network Connector) as middleware for automatic hybrid parallelism. In recent deep learning research, as exemplified by T5 and GPT-3, the size of neural network models continues to grow. Since such models do not fit into the memory of accelerator devices, they need to be partitioned by model parallelism techniques. Moreover, to accelerate training for huge tra…

    Submitted 30 March, 2021; originally announced March 2021.

    Comments: Accepted to the 35th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2021), May 2021