Skip to main content

Showing 1–1 of 1 results for author: Dabert, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.17607  [pdf, other

    cs.AI

    Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs

    Authors: Kai Yuan, Christoph Bauinger, Xiangyi Zhang, Pascal Baehr, Matthias Kirchhart, Darius Dabert, Adrien Tousnakhoff, Pierre Boudier, Michael Paulitsch

    Abstract: This paper presents a SYCL implementation of Multi-Layer Perceptrons (MLPs), which targets and is optimized for the Intel Data Center GPU Max 1550. To increase the performance, our implementation minimizes the slow global memory accesses by maximizing the data reuse within the general register file and the shared local memory by fusing the operations in each layer of the MLP. We show with a simple… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.