-
White Paper from Workshop on Large-scale Parallel Numerical Computing Technology (LSPANC 2020): HPC and Computer Arithmetic toward Minimal-Precision Computing
Authors:
Roman Iakymchuk,
Daichi Mukunoki,
Artur Podobas,
Fabienne Jézéquel,
Toshiyuki Imamura,
Norihisa Fujita,
Jens Huthmann,
Shuhei Kudo,
Yiyu Tan,
Jens Domke,
Kai Torben Ohlhus,
Takeshi Fukaya,
Takeo Hoshi,
Yuki Murakami,
Maho Nakata,
Takeshi Ogita,
Kentaro Sano,
Taisuke Boku
Abstract:
In numerical computations, the precision of floating-point arithmetic is a key factor in determining both performance (speed and energy efficiency) and reliability (accuracy and reproducibility). However, precision generally pulls these two in opposite directions. Therefore, the ultimate concept for maximizing both at the same time is minimal-precision computing through precision-tuning, which adjusts the precision of each operation and each piece of data to its optimal level. Several studies have already addressed this (e.g., Precimonious and Verrou), but their scope is limited to precision-tuning alone. Hence, we aim to propose a broader concept of a minimal-precision computing system with precision-tuning, involving both the hardware and software stack.
In 2019, we started the Minimal-Precision Computing project to pursue this broader concept of a minimal-precision computing system with precision-tuning, spanning both the hardware and software stack. Specifically, our system combines (1) a precision-tuning method based on Discrete Stochastic Arithmetic (DSA), (2) arbitrary-precision arithmetic libraries, (3) fast and accurate numerical libraries, and (4) Field-Programmable Gate Arrays (FPGAs) with High-Level Synthesis (HLS).
In this white paper, we provide an overview of various technologies related to minimal- and mixed-precision computing, outline the future direction of the project, and discuss current challenges together with our project members and guest speakers at the LSPANC 2020 workshop (https://www.r-ccs.riken.jp/labs/lpnctrt/lspanc2020jan/).
Submitted 11 April, 2020; v1 submitted 9 April, 2020;
originally announced April 2020.
-
An a posteriori verification method for generalized real-symmetric eigenvalue problems in large-scale electronic state calculations
Authors:
Takeo Hoshi,
Takeshi Ogita,
Katsuhisa Ozaki,
Takeshi Terao
Abstract:
An a posteriori verification method is proposed for the generalized real-symmetric eigenvalue problem and is applied to densely clustered eigenvalue problems in large-scale electronic state calculations. The proposed method is realized by a two-stage process in which the approximate solution is computed by existing numerical libraries and is then verified in a moderate computational time. The procedure returns intervals, each of which contains one exact eigenvalue. Test calculations were carried out for organic device materials, and the verification method confirms that all exact eigenvalues are well separated in the obtained intervals. This verification method will be integrated into EigenKernel (https://github.com/eigenkernel/), which is middleware for various parallel solvers for the generalized eigenvalue problem. Such an a posteriori verification method will be important in future computational science.
Submitted 26 February, 2020; v1 submitted 12 April, 2019;
originally announced April 2019.
-
Low/Hard State Spectra of GRO J1655-40 Observed with Suzaku
Authors:
Hiromitsu Takahashi,
Yasushi Fukazawa,
Tsunefumi Mizuno,
Ayumi Hirasawa,
Shunji Kitamoto,
Keisuke Sudoh,
Takayuki Ogita,
Aya Kubota,
Kazuo Makishima,
Takeshi Itoh,
Arvind N. Parmar,
Ken Ebisawa,
Sachindra Naik,
Tadayasu Dotani,
Motohide Kokubun,
Kousuke Ohnuki,
Tadayuki Takahashi,
Tahir Yaqoob,
Lorella Angelini,
Yoshihiro Ueda,
Kazutaka Yamaoka,
Taro Kotani,
Nobuyuki Kawai,
Masaaki Namiki,
Takayoshi Kohmura,
et al. (1 additional author not shown)
Abstract:
The Galactic black-hole binary GRO J1655$-$40 was observed with Suzaku on 2005 September 22--23, for a net exposure of 35 ks with the X-ray Imaging Spectrometer (XIS) and 20 ks with the Hard X-ray Detector (HXD). The source was detected over a broad and continuous energy range of 0.7--300 keV, with an intensity of $\sim$50 mCrab at 20 keV. At a distance of 3.2 kpc, the 0.7--300 keV luminosity is $ \sim 5.1 \times 10^{36}$ erg s$^{-1}$ ($\sim 0.7$ % of the Eddington luminosity for a 6 $M_{\odot}$ black hole). The source was in a typical low/hard state, exhibiting a power-law shaped continuum with a photon index of $\sim 1.6$. During the observation, the source intensity gradually decreased by 25% at energies above $\sim 3$ keV, and by 35% below 2 keV. This, together with the soft X-ray spectra taken with the XIS, suggests the presence of an independent soft component that can be represented by emission from a cool ($\sim 0.2$ keV) disk. The hard X-ray spectra obtained with the HXD reveal a high-energy spectral cutoff, with an e-folding energy of $\sim 200$ keV. Since the spectral photon index above 10 keV is harder by $\sim 0.4$ than that observed in the softer energy band, and the e-folding energy is higher than those of typical reflection humps, the entire 0.7--300 keV spectrum cannot be reproduced by a single thermal Comptonization model, even considering reflection effects. Instead, the spectrum (except the soft excess) can be successfully explained by invoking two thermal-Comptonization components with different $y$-parameters. In contrast to the high/soft state spectra of this object in which narrow iron absorption lines are detected with equivalent widths of 60--100 eV, the present XIS spectra bear no such features beyond an upper-limit equivalent width of 25 eV.
Submitted 26 July, 2007;
originally announced July 2007.