Skip to main content

Showing 1–1 of 1 results for author: Buchlovsky, P

.
  1. arXiv:1902.00465  [pdf, other

    cs.LG cs.AI cs.DC stat.ML

    TF-Replicator: Distributed Machine Learning for Researchers

    Authors: Peter Buchlovsky, David Budden, Dominik Grewe, Chris Jones, John Aslanides, Frederic Besse, Andy Brock, Aidan Clark, Sergio Gómez Colmenarejo, Aedan Pope, Fabio Viola, Dan Belov

    Abstract: We describe TF-Replicator, a framework for distributed machine learning designed for DeepMind researchers and implemented as an abstraction over TensorFlow. TF-Replicator simplifies writing data-parallel and model-parallel research code. The same models can be effortlessly deployed to different cluster architectures (i.e. one or many machines containing CPUs, GPUs or TPU accelerators) using synchr… ▽ More

    Submitted 1 February, 2019; originally announced February 2019.