Skip to main content

Showing 1–1 of 1 results for author: Sebastianelli, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2004.14696  [pdf, ps, other

    cs.DC cs.LG

    Dynamic backup workers for parallel machine learning

    Authors: Chuan Xu, Giovanni Neglia, Nicola Sebastianelli

    Abstract: The most popular framework for distributed training of machine learning models is the (synchronous) parameter server (PS). This paradigm consists of $n$ workers, which iteratively compute updates of the model parameters, and a stateful PS, which waits and aggregates all updates to generate a new estimate of model parameters and sends it back to the workers for a new iteration. Transient computatio… ▽ More

    Submitted 24 January, 2021; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: Journal version