Skip to main content

Showing 1–6 of 6 results for author: Lee, J K L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.18047  [pdf, other

    cs.LG cs.AI cs.DC

    2BP: 2-Stage Backpropagation

    Authors: Christopher Rae, Joseph K. L. Lee, James Richings

    Abstract: As Deep Neural Networks (DNNs) grow in size and complexity, they often exceed the memory capacity of a single accelerator, necessitating the sharding of model parameters across multiple accelerators. Pipeline parallelism is a commonly used sharding strategy for training large DNNs. However, current implementations of pipeline parallelism are being unintentionally bottlenecked by the automatic diff… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2404.10536  [pdf, ps, other

    cs.DC

    Benchmarking Machine Learning Applications on Heterogeneous Architecture using Reframe

    Authors: Christopher Rae, Joseph K. L. Lee, James Richings, Michele Weiland

    Abstract: With the rapid increase in machine learning workloads performed on HPC systems, it is beneficial to regularly perform machine learning specific benchmarks to monitor performance and identify issues. Furthermore, as part of the Edinburgh International Data Facility, EPCC currently hosts a wide range of machine learning accelerators including Nvidia GPUs, the Graphcore Bow Pod64 and Cerebras CS-2, w… ▽ More

    Submitted 25 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: Author accepted version of paper in the PERMAVOST workshop at the 33rd International Symposium on High-Performance Parallel and Distributed Computing (HPDC 24)

  3. arXiv:2311.03210  [pdf, other

    cs.DC

    Quantum Task Offloading with the OpenMP API

    Authors: Joseph K. L. Lee, Oliver T. Brown, Mark Bull, Martin Ruefenacht, Johannes Doerfert, Michael Klemm, Martin Schulz

    Abstract: Most of the widely used quantum programming languages and libraries are not designed for the tightly coupled nature of hybrid quantum-classical algorithms, which run on quantum resources that are integrated on-premise with classical HPC infrastructure. We propose a programming model using the API provided by OpenMP to target quantum devices, which provides an easy-to-use and efficient interface fo… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Poster extended abstract for Supercomputing 2023 (SC23)

  4. arXiv:2305.00512  [pdf, other

    cs.DC

    Experiences of running an HPC RISC-V testbed

    Authors: Nick Brown, Maurice Jamieson, Joseph K. L. Lee

    Abstract: Funded by the UK ExCALIBUR H\&ES exascale programme, in early 2022 a RISC-V testbed for HPC was stood up to provide free access for scientific software developers to experiment with RISC-V for their workloads. Here we report on successes, challenges, and lessons learnt from this activity with a view to better understanding the suitability of RISC-V for HPC and important areas to focus RISC-V HPC c… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

    Comments: Author accepted version of extended abstract in RISC-V Summit Europe

  5. arXiv:2304.10324  [pdf, other

    cs.DC

    Backporting RISC-V Vector assembly

    Authors: Joseph K. L. Lee, Maurice Jamieson, Nick Brown

    Abstract: Leveraging vectorisation, the ability for a CPU to apply operations to multiple elements of data concurrently, is critical for high performance workloads. However, at the time of writing, commercially available physical RISC-V hardware that provides the RISC-V vector extension (RVV) only supports version 0.7.1, which is incompatible with the latest ratified version 1.0. The challenge is that upstr… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: Preprint of paper accepted to First International Workshop on RISC-V for HPC (2023)

  6. arXiv:2304.10319  [pdf, other

    cs.DC

    Test-driving RISC-V Vector hardware for HPC

    Authors: Joseph K. L. Lee, Maurice Jamieson, Nick Brown, Ricardo Jesus

    Abstract: Whilst the RISC-V Vector extension (RVV) has been ratified, at the time of writing both hardware implementations and open source software support are still limited for vectorisation on RISC-V. This is important because vectorisation is crucial to obtaining good performance for High Performance Computing (HPC) workloads and, as of April 2023, the Allwinner D1 SoC, containing the XuanTie C906 proces… ▽ More

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: Preprint of paper accepted to First International Workshop on RISC-V for HPC (2023)