Showing 1–3 of 3 results for author: Fukumoto, N

Searching in archive cs.

  1. arXiv:2203.16044  [pdf, other]

    cs.DC

    mpiQulacs: A Distributed Quantum Computer Simulator for A64FX-based Cluster Systems

    Authors: Satoshi Imamura, Masafumi Yamazaki, Takumi Honda, Akihiko Kasagi, Akihiro Tabuchi, Hiroshi Nakao, Naoto Fukumoto, Kohta Nakashima

    Abstract: Quantum computer simulators running on classical computers are essential for developing real quantum computers and emerging quantum applications. In particular, state vector simulators, which store a full state vector in memory and update it in every quantum operation, are available to simulate an arbitrary form of quantum circuits, debug quantum applications, and validate future quantum computers…

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: This preprint is related to the press release of Fujitsu LTD. in https://www.fujitsu.com/global/about/resources/news/press-releases/2022/0330-01.html, 11 pages, 12 figures

  2. arXiv:2110.11466  [pdf, other]

    cs.LG cs.DC

    MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems

    Authors: Steven Farrell, Murali Emani, Jacob Balma, Lukas Drescher, Aleksandr Drozd, Andreas Fink, Geoffrey Fox, David Kanter, Thorsten Kurth, Peter Mattson, Dawei Mu, Amit Ruhela, Kento Sato, Koichi Shirahata, Tsuguchika Tabaru, Aristeidis Tsaris, Jan Balewski, Ben Cumming, Takumi Danjo, Jens Domke, Takaaki Fukai, Naoto Fukumoto, Tatsuya Fukushi, Balazs Gerofi, Takumi Honda, et al. (18 additional authors not shown)

    Abstract: Scientific communities are increasingly adopting machine learning and deep learning models in their applications to accelerate scientific insights. High performance computing systems are pushing the frontiers of performance with a rich diversity of hardware resources and massive scale-out capabilities. There is a critical need to understand fair and effective benchmarking of machine learning appli…

    Submitted 26 October, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

  3. arXiv:1903.12650  [pdf, ps, other]

    cs.LG stat.ML

    Yet Another Accelerated SGD: ResNet-50 Training on ImageNet in 74.7 seconds

    Authors: Masafumi Yamazaki, Akihiko Kasagi, Akihiro Tabuchi, Takumi Honda, Masahiro Miwa, Naoto Fukumoto, Tsuguchika Tabaru, Atsushi Ike, Kohta Nakashima

    Abstract: There has been a strong demand for algorithms that can execute machine learning as fast as possible, and the speed of deep learning has accelerated by 30 times in the past two years alone. Distributed deep learning using the large mini-batch is a key technology to address the demand and is a great challenge as it is difficult to achieve high scalability on large clusters without compromising accur…

    Submitted 29 March, 2019; originally announced March 2019.