Showing 1–3 of 3 results for author: Richings, J

Searching in archive cs.
  1. arXiv:2405.18047

    cs.LG cs.AI cs.DC

    2BP: 2-Stage Backpropagation

    Authors: Christopher Rae, Joseph K. L. Lee, James Richings

    Abstract: As Deep Neural Networks (DNNs) grow in size and complexity, they often exceed the memory capacity of a single accelerator, necessitating the sharding of model parameters across multiple accelerators. Pipeline parallelism is a commonly used sharding strategy for training large DNNs. However, current implementations of pipeline parallelism are being unintentionally bottlenecked by the automatic diff…

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2404.10536

    cs.DC

    Benchmarking Machine Learning Applications on Heterogeneous Architecture using Reframe

    Authors: Christopher Rae, Joseph K. L. Lee, James Richings, Michele Weiland

    Abstract: With the rapid increase in machine learning workloads performed on HPC systems, it is beneficial to regularly perform machine learning specific benchmarks to monitor performance and identify issues. Furthermore, as part of the Edinburgh International Data Facility, EPCC currently hosts a wide range of machine learning accelerators including Nvidia GPUs, the Graphcore Bow Pod64 and Cerebras CS-2, w…

    Submitted 25 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: Author accepted version of paper in the PERMAVOST workshop at the 33rd International Symposium on High-Performance Parallel and Distributed Computing (HPDC 24)

  3. arXiv:2308.07402

    cs.PF cs.DC quant-ph

    Energy Efficiency of Quantum Statevector Simulation at Scale

    Authors: Jakub Adamski, James Peter Richings, Oliver Thomson Brown

    Abstract: Classical simulations are essential for the development of quantum computing, and their exponential scaling can easily fill any modern supercomputer. In this paper we consider the performance and energy consumption of large Quantum Fourier Transform (QFT) simulations run on ARCHER2, the UK's National Supercomputing Service, with QuEST toolkit. We take into account CPU clock frequency and node memo…

    Submitted 18 September, 2023; v1 submitted 14 August, 2023; originally announced August 2023.

    Comments: 5 pages, 5 figures. Accepted to Sustainable Supercomputing workshop at SC23