Skip to main content

Showing 1–2 of 2 results for author: Kirchhart, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.17607  [pdf, other

    cs.AI

    Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs

    Authors: Kai Yuan, Christoph Bauinger, Xiangyi Zhang, Pascal Baehr, Matthias Kirchhart, Darius Dabert, Adrien Tousnakhoff, Pierre Boudier, Michael Paulitsch

    Abstract: This paper presents a SYCL implementation of Multi-Layer Perceptrons (MLPs), which targets and is optimized for the Intel Data Center GPU Max 1550. To increase the performance, our implementation minimizes the slow global memory accesses by maximizing the data reuse within the general register file and the shared local memory by fusing the operations in each layer of the MLP. We show with a simple… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  2. arXiv:2202.02847  [pdf, other

    cs.MS

    Solidfmm: A highly optimised library of operations on the solid harmonics for use in fast multipole methods

    Authors: Matthias Kirchhart

    Abstract: We present solidfmm, a highly optimised C++ library for the solid harmonics as they are needed in fast multipole methods. The library provides efficient, vectorised implementations of the translation operations M2M, M2L, and L2L, and is available as free software. While asymptotically of complexity $O(P^3)$, for all practically relevant expansion orders, the translation operators display an empiri… ▽ More

    Submitted 6 February, 2022; originally announced February 2022.

    MSC Class: 35-04