Skip to main content

Showing 1–1 of 1 results for author: Perrone, M P

Searching in archive cs. Search in all archives.
.
  1. arXiv:1911.06459  [pdf, other

    cs.LG cs.DC stat.ML

    Optimal Mini-Batch Size Selection for Fast Gradient Descent

    Authors: Michael P. Perrone, Haidar Khan, Changhoan Kim, Anastasios Kyrillidis, Jerry Quinn, Valentina Salapura

    Abstract: This paper presents a methodology for selecting the mini-batch size that minimizes Stochastic Gradient Descent (SGD) learning time for single and multiple learner problems. By decoupling algorithmic analysis issues from hardware and software implementation details, we reveal a robust empirical inverse law between mini-batch size and the average number of SGD updates required to converge to a speci… ▽ More

    Submitted 14 November, 2019; originally announced November 2019.