Showing 1–2 of 2 results for author: Yoshie, T
-
Multi-block/multi-core SSOR preconditioner for the QCD quark solver for K computer
Authors:
T. Boku,
K. -I. Ishikawa,
Y. Kuramashi,
K. Minami,
Y. Nakamura,
F. Shoji,
D. Takahashi,
M. Terai,
A. Ukawa,
T. Yoshie
Abstract:
We study the algorithmic optimization and performance tuning of the Lattice QCD clover-fermion solver for the K computer. We implement the Lüscher's SAP preconditioner with sub-blocking in which the lattice block in a node is further divided to several sub-blocks to extract enough parallelism for the 8-core CPU SPARC64$^{\mathrm{TM}}$ VIIIfx of the K computer. To achieve a better convergence prope…
▽ More
We study the algorithmic optimization and performance tuning of the Lattice QCD clover-fermion solver for the K computer. We implement the Lüscher's SAP preconditioner with sub-blocking in which the lattice block in a node is further divided to several sub-blocks to extract enough parallelism for the 8-core CPU SPARC64$^{\mathrm{TM}}$ VIIIfx of the K computer. To achieve a better convergence property we use the symmetric successive over-relaxation (SSOR) iteration with {\it locally-lexicographical} ordering for the sub-blocks in obtaining the block inverse. The SAP preconditioner is included in the single precision BiCGStab solver of the nested BiCGStab solver. The single precision part of the computational kernel are solely written with the SIMD oriented intrinsics to achieve the best performance of the \SPARC on the K computer. We benchmark the single precision BiCGStab solver on the three lattice sizes: $12^3\times 24$, $24^3\times 48$ and $48^3\times 96$, with fixing the local lattice size in a node at $6^3\times 12$. We observe an ideal weak-scaling performance from 16 nodes to 4096 nodes. The performance of a computational kernel exceeds 50% efficiency, and the single precision BiCGstab has $\sim26% susutained efficiency.
△ Less
Submitted 28 October, 2012;
originally announced October 2012.
-
Optical surface edge Bloch modes: low-loss subwavelength-scale 2D light localization
Authors:
Shu-Yu Su,
Tomoyuki Yoshie
Abstract:
Edge modes of a finite-size woodpile can appear within a complete bandgap on an <010> edge. The mode area is as small as 0.066 squared half-in-vacuum-wavelengths, and the propagation loss is small. The field maxima occur at a dielectric-vacuum interface, like at a metal-dielectric interface for surface plasmon modes. The edge mode is a subwavelength-scale 2D light localization mode in non-metallic…
▽ More
Edge modes of a finite-size woodpile can appear within a complete bandgap on an <010> edge. The mode area is as small as 0.066 squared half-in-vacuum-wavelengths, and the propagation loss is small. The field maxima occur at a dielectric-vacuum interface, like at a metal-dielectric interface for surface plasmon modes. The edge mode is a subwavelength-scale 2D light localization mode in non-metallic materials. Analysis of two-mode co-directional coupling between identical surface Bloch modes suggests that a large photonic crystal or surface designing would be needed for suppressing the evanescent field coupling in the woodpile.
△ Less
Submitted 27 July, 2012; v1 submitted 19 June, 2012;
originally announced June 2012.