
Showing 1–6 of 6 results for author: Katagiri, T

  1. arXiv:2407.00019

    cs.DC cs.PF

    An Auto-tuning Method for Run-time Data Transformation for Sparse Matrix-Vector Multiplication

    Authors: Takahiro Katagiri, Masahiko Sato

    Abstract: In this paper, we research the run-time sparse matrix data transformation from Compressed Row Storage (CRS) to Coordinate (COO) storage and an ELL (ELLPACK/ITPACK) format with OpenMP parallelization for sparse matrix-vector multiplication (SpMV). We propose an auto-tuning (AT) method by using the $D_{mat}^i$ - $R_{ell}^i$ graph, which plots the derivation/average for the number of non-zero element…

    Submitted 6 May, 2024; originally announced July 2024.

  2. arXiv:2405.10973

    cs.SE cs.AI cs.LG cs.MS

    Adaptation of XAI to Auto-tuning for Numerical Libraries

    Authors: Shota Aoki, Takahiro Katagiri, Satoshi Ohshima, Masatoshi Kawai, Toru Nagai, Tetsuya Hoshino

    Abstract: Concerns have arisen regarding the unregulated utilization of artificial intelligence (AI) outputs, potentially leading to various societal issues. While humans routinely validate information, manually inspecting the vast volumes of AI-generated results is impractical. Therefore, automation and visualization are imperative. In this context, Explainable AI (XAI) technology is gaining prominence, ai…

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: This article has been submitted to Special Session: Performance Optimization and Auto-Tuning of Software on Multicore/Manycore Systems (POAT), In conjunction with IEEE MCSoC-2024 (Dec 16-19, 2024, Days Hotel & Suites by Wyndham Fraser Business Park, Kuala Lumpur)

  3. arXiv:2405.01599

    cs.MS cs.DC cs.PF

    Xabclib: A Fully Auto-tuned Sparse Iterative Solver

    Authors: Takahiro Katagiri, Takao Sakurai, Mitsuyoshi Igai, Shoji Itoh, Satoshi Ohshima, Hisayasu Kuroda, Ken Naono, Kengo Nakajima

    Abstract: In this paper, we propose a general application programming interface named OpenATLib for auto-tuning (AT). OpenATLib is designed to establish the reusability of AT functions. By using OpenATLib, we develop a fully auto-tuned sparse iterative solver named Xabclib. Xabclib has several novel run-time AT functions. First, the following new implementations of sparse matrix-vector multiplication (SpMV)…

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: This article was submitted to SC11 and was also published as a preprint on ResearchGate in April 2011. Please refer to: https://www.researchgate.net/publication/258223774_Xabclib_A_Fully_Auto-tuned_Sparse_Iterative_Solver

  4. arXiv:2405.00326

    cs.DC cs.MS cs.PF

    A Communication Avoiding and Reducing Algorithm for Symmetric Eigenproblem for Very Small Matrices

    Authors: Takahiro Katagiri, Jun'ichi Iwata, Kazuyuki Uchida

    Abstract: In this paper, a parallel symmetric eigensolver with very small matrices in massively parallel processing is considered. We define very small matrices that fit the sizes of caches per node in a supercomputer. We assume that the sizes also fit the exa-scale computing requirements of current production runs of an application. To minimize communication time, we added several communication avoiding an…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: This article was submitted to Parallel Computing on December 9, 2013. This article was also published in IPSJ SIG Notes, Vol. 2015-HPC-148, Vol.2, pp.1-17 (February 23, 2015). (a non-reviewed technical report)

  5. arXiv:2404.15752

    cs.PF

    Performance Evaluation of CMOS Annealing with Support Vector Machine

    Authors: Ryoga Fukuhara, Makoto Morishita, Takahiro Katagiri, Masatoshi Kawai, Toru Nagai, Tetsuya Hoshino

    Abstract: In this paper, support vector machine (SVM) performance was assessed utilizing a quantum-inspired complementary metal-oxide semiconductor (CMOS) annealer. The primary focus during performance evaluation was the accuracy rate in binary classification problems. A comparative analysis was conducted between SVM running on a CPU (classical computation) and executed on a quantum-inspired annealer. The p…

    Submitted 24 April, 2024; originally announced April 2024.

  6. arXiv:2312.05779

    cs.PF cs.DC

    Autotuning by Changing Directives and Number of Threads in OpenMP using ppOpen-AT

    Authors: Toma Sakurai, Satoshi Ohshima, Takahiro Katagiri, Toru Nagai

    Abstract: Recently, computers have diversified architectures. To achieve high numerical calculation software performance, it is necessary to tune the software according to the target computer architecture. However, code optimization for each environment is difficult unless it is performed by a specialist who knows computer architectures well. By applying autotuning (AT), the tuning effort can be reduced. Op…

    Submitted 10 December, 2023; originally announced December 2023.