Skip to main content

Showing 1–1 of 1 results for author: Barca, G M J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19621  [pdf, other

    cs.DC cs.LG

    Machine-Learning-Driven Runtime Optimization of BLAS Level 3 on Modern Multi-Core Systems

    Authors: Yufan Xia, Giuseppe Maria Junior Barca

    Abstract: BLAS Level 3 operations are essential for scientific computing, but finding the optimal number of threads for multi-threaded implementations on modern multi-core systems is challenging. We present an extension to the Architecture and Data-Structure Aware Linear Algebra (ADSALA) library that uses machine learning to optimize the runtime of all BLAS Level 3 operations. Our method predicts the best n… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Multi-Thread, Matrix Multiplication, Optimization, BLAS, Machine Learning

    Journal ref: 2024 International Parallel and Distributed Processing Symposium (IPDPS)