Skip to main content

Showing 1–2 of 2 results for author: Noordhuis, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2006.15704  [pdf, other

    cs.DC cs.LG

    PyTorch Distributed: Experiences on Accelerating Data Parallel Training

    Authors: Shen Li, Yanli Zhao, Rohan Varma, Omkar Salpekar, Pieter Noordhuis, Teng Li, Adam Paszke, Jeff Smith, Brian Vaughan, Pritam Damania, Soumith Chintala

    Abstract: This paper presents the design, implementation, and evaluation of the PyTorch distributed data parallel module. PyTorch is a widely-adopted scientific computing package used in deep learning research and applications. Recent advances in deep learning argue for the value of large datasets and large models, which necessitates the ability to scale out model training to more computational resources. D… ▽ More

    Submitted 28 June, 2020; originally announced June 2020.

    Comments: To appear in VLDB 2020

  2. arXiv:1706.02677  [pdf, other

    cs.CV cs.DC cs.LG

    Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

    Authors: Priya Goyal, Piotr Dollár, Ross Girshick, Pieter Noordhuis, Lukasz Wesolowski, Aapo Kyrola, Andrew Tulloch, Yangqing Jia, Kaiming He

    Abstract: Deep learning thrives with large neural networks and large datasets. However, larger networks and larger datasets result in longer training times that impede research and development progress. Distributed synchronous SGD offers a potential solution to this problem by dividing SGD minibatches over a pool of parallel workers. Yet to make this scheme efficient, the per-worker workload must be large,… ▽ More

    Submitted 30 April, 2018; v1 submitted 8 June, 2017; originally announced June 2017.

    Comments: Tech report (v2: correct typos)