-
Ookami: An A64FX Computing Resource
Authors:
A. C. Calder,
E. Siegmann,
C. Feldman,
S. Chheda,
D. C. Smolarski,
F. D. Swesty,
A. Curtis,
J. Dey,
D. Carlson,
B. Michalowicz,
R. J. Harrison
Abstract:
We present a look at Ookami, a project providing community access to a testbed supercomputer with the ARM-based A64FX processors developed by a collaboration between RIKEN and Fujitsu and deployed in the Japanese supercomputer Fugaku. We describe the project, provide details about the user base and education/training program, and present highlights from performance studies of two astrophysical simulation codes.
Submitted 7 November, 2023;
originally announced November 2023.
-
A Further Study of Linux Kernel Hugepages on A64FX with FLASH, an Astrophysical Simulation Code
Authors:
Catherine Feldman,
Smeet Chheda,
Alan C. Calder,
Eva Siegmann,
John Dey,
Tony Curtis,
Robert J. Harrison
Abstract:
We present an expanded study of the performance of FLASH when using Linux Kernel Hugepages on Ookami, an HPE Apollo 80 A64FX platform. FLASH is a multi-scale, multi-physics simulation code written principally in modern Fortran and makes use of the PARAMESH library to manage a block-structured adaptive mesh. Our initial study used only the Fujitsu compiler to utilize standard hugepages (hp), but further investigation allowed us to utilize hp for multiple compilers by linking to the Fujitsu library libmpg and transparent hugepages (thp) by enabling it at the node level. By comparing the results of hardware counters and in-code timers, we found that hp and thp do not significantly impact the runtime performance of FLASH. Interestingly, we observe a significant reduction in TLB misses, differences in cache and memory access counters, and strange behavior when using thp.
Submitted 8 September, 2023;
originally announced September 2023.
-
On Using Linux Kernel Huge Pages with FLASH, an Astrophysical Simulation Code
Authors:
Alan C. Calder,
Catherine Feldman,
Eva Siegmann,
John Dey,
Anthony Curtis,
Smeet Chheda,
Robert J. Harrison
Abstract:
We present efforts at improving the performance of FLASH, a multi-scale, multi-physics simulation code principally for astrophysical applications, by using huge pages on Ookami, an HPE Apollo 80 A64FX platform. FLASH is written principally in modern Fortran and makes use of the PARAMESH library to manage a block-structured adaptive mesh. We explored options for enabling the use of huge pages with several compilers, but we were only able to successfully use huge pages when compiling with the Fujitsu compiler. The use of huge pages substantially reduced the number of translation lookaside buffer misses, but overall performance gains were marginal.
Submitted 27 July, 2022;
originally announced July 2022.
-
On computing bound states of the Dirac and Schrödinger Equations
Authors:
Gregory Beylkin,
Joel Anderson,
Robert J. Harrison
Abstract:
We cast the quantum chemistry problem of computing bound states as that of solving a set of auxiliary eigenvalue problems for a family of parameterized compact integral operators. The compactness of operators assures that their spectrum is discrete and bounded with the only possible accumulation point at zero. We show that, by changing the parameter, we can always find the bound states, i.e., the eigenfunctions that satisfy the original equations and are normalizable. While for the non-relativistic equations these properties may not be surprising, it is remarkable that the same holds for the relativistic equations where the spectrum of the original relativistic operators does not have a lower bound. We demonstrate that starting from an arbitrary initialization of the iteration leads to the solution, as dictated by the properties of compact operators.
Submitted 5 July, 2021;
originally announced July 2021.
-
Ookami: Deployment and Initial Experiences
Authors:
Andrew Burford,
Alan C. Calder,
David Carlson,
Barbara Chapman,
Firat Coşkun,
Tony Curtis,
Catherine Feldman,
Robert J. Harrison,
Yan Kang,
Benjamin Michalowicz,
Eric Raut,
Eva Siegmann,
Daniel G. Wood,
Robert L. Deleon,
Mathew Jones,
Nikolay A. Simakov,
Joseph P. White,
Dossay Oryspayev
Abstract:
Ookami is a computer technology testbed supported by the United States National Science Foundation. It provides researchers with access to the A64FX processor developed by Fujitsu in collaboration with RIKEN for the Japanese path to exascale computing, as deployed in Fugaku, the fastest computer in the world. By focusing on crucial architectural details, the ARM-based, multi-core, 512-bit SIMD-vector processor with ultrahigh-bandwidth memory promises to retain familiar and successful programming models while achieving very high performance for a wide range of applications. We review relevant technology and system details, and the main body of the paper focuses on initial experiences with the hardware and software ecosystem for micro-benchmarks, mini-apps, and full applications, and starts to answer questions about where such technologies fit into the NSF ecosystem.
Submitted 16 June, 2021;
originally announced June 2021.
-
NWChem: Past, Present, and Future
Authors:
E. Aprà,
E. J. Bylaska,
W. A. de Jong,
N. Govind,
K. Kowalski,
T. P. Straatsma,
M. Valiev,
H. J. J. van Dam,
Y. Alexeev,
J. Anchell,
V. Anisimov,
F. W. Aquino,
R. Atta-Fynn,
J. Autschbach,
N. P. Bauman,
J. C. Becca,
D. E. Bernholdt,
K. Bhaskaran-Nair,
S. Bogatko,
P. Borowski,
J. Boschen,
J. Brabec,
A. Bruner,
E. Cauët,
Y. Chen
, et al. (89 additional authors not shown)
Abstract:
Specialized computational chemistry packages have permanently reshaped the landscape of chemical and materials science by providing tools to support and guide experimental efforts and for the prediction of atomistic and electronic properties. In this regard, electronic structure packages have played a special role by using first-principle-driven methodologies to model complex chemical and materials processes. Over the last few decades, the rapid development of computing technologies and the tremendous increase in computational power have offered a unique chance to study complex transformations using sophisticated and predictive many-body techniques that describe correlated behavior of electrons in molecular and condensed phase systems at different levels of theory. In enabling these simulations, novel parallel algorithms have been able to take advantage of computational resources to address the polynomial scaling of electronic structure methods. In this paper, we briefly review the NWChem computational chemistry suite, including its history, design principles, parallel tools, current capabilities, outreach and outlook.
Submitted 26 May, 2020; v1 submitted 24 April, 2020;
originally announced April 2020.
-
Group theoretical analysis of structural instability, vacancy ordering and magnetic transitions in the system troilite (FeS) - pyrrhotite (Fe$_{1-x}$S)
Authors:
Charles Robert Sebastian Haines,
Christopher J. Howard,
Richard J. Harrison,
Michael A. Carpenter
Abstract:
A group-theoretical framework to describe vacancy ordering and magnetism in the Fe$_{1-x}$S system is developed. This framework is used to determine the sequence of crystal structures consistent with the observed magnetic structures of troilite (FeS), and to determine the crystallographic nature of the low-temperature Besnus transition in Fe$_{0.875}$S. We conclude that the Besnus transition is a magnetically driven transition characterised by the rotation of the moments out of the ac-plane, accompanied by small atomic displacements that lower the symmetry to triclinic at low temperatures. Based on our phase diagram, we predict related magnetically driven phase transitions at low temperatures in all the commensurate superstructures of pyrrhotite. The exact nature of the transition is determined by the symmetry of the vacancy-ordered state. Based on this, we predict spin-flop transitions in 3C and 5C pyrrhotite and a transition akin to the Besnus transition in 6C pyrrhotite. Furthermore, we clarify that 3C and 4C pyrrhotite carry a ferrimagnetic moment whereas 5C and 6C are antiferromagnetic.
Submitted 15 January, 2019;
originally announced January 2019.
-
Evaluating the paleomagnetic potential of single zircon crystals using the Bishop Tuff
Authors:
Roger R. Fu,
Benjamin P. Weiss,
Eduardo A. Lima,
Pauli Kehayias,
Jefferson F. D. F. Araujo,
David R. Glenn,
Jeff Gelb,
Joshua F. Einsle,
Ann M. Bauer,
Richard J. Harrison,
Guleed A. H. Ali,
Ronald L. Walsworth
Abstract:
Zircon crystals offer a unique combination of suitability for high-precision radiometric dating and high resistance to alteration. Paleomagnetic experiments on ancient zircons may potentially constrain the earliest geodynamo, which holds broad implications for the early Earth interior and atmosphere. However, the ability of zircons to record accurately the geomagnetic field has not been fully demonstrated. Here we conduct thermal and room temperature alternating field (AF) paleointensity experiments on 767.1 thousand year old (ka) zircons from the Bishop Tuff, California. The rapid emplacement of these zircons in a well-characterized magnetic field provides a high-fidelity test of the zircons' intrinsic paleomagnetic recording accuracy. Successful dual heating experiments on nine zircons measured using a superconducting quantum interference device (SQUID) microscope yield a mean paleointensity of 46.2 +/- 18.8 microtesla (1sigma), which agrees closely with high-precision results from Bishop Tuff whole rock (43.0 +/- 3.2 microtesla). High-resolution quantum diamond magnetic mapping, electron microscopy, and X-ray tomography indicate that the bulk of the remanent magnetization in Bishop Tuff zircons is carried by Fe oxides associated with apatite inclusions, which would be susceptible to destruction via metamorphism and aqueous alteration in older zircons. As such, while zircons can reliably record the geomagnetic field, robust zircon-derived paleomagnetic results require careful characterization of the ferromagnetic carrier and demonstration of their occurrence in primary inclusions. We further conclude that a combination of quantum diamond magnetometry and high-resolution imaging can provide detailed, direct characterization of the ferromagnetic mineralogy of geological samples.
Submitted 26 May, 2016;
originally announced May 2016.
-
MADNESS: A Multiresolution, Adaptive Numerical Environment for Scientific Simulation
Authors:
Robert J. Harrison,
Gregory Beylkin,
Florian A. Bischoff,
Justus A. Calvin,
George I. Fann,
Jacob Fosso-Tande,
Diego Galindo,
Jeff R. Hammond,
Rebecca Hartman-Baker,
Judith C. Hill,
Jun Jia,
Jakob S. Kottmann,
M-J. Yvonne Ou,
Laura E. Ratcliff,
Matthew G. Reuter,
Adam C. Richie-Halford,
Nichols A. Romero,
Hideo Sekino,
William A. Shelton,
Bryan E. Sundahl,
W. Scott Thornton,
Edward F. Valeev,
Álvaro Vázquez-Mayagoitia,
Nicholas Vence,
Yukina Yokoi
Abstract:
MADNESS (multiresolution adaptive numerical environment for scientific simulation) is a high-level software environment for solving integral and differential equations in many dimensions that uses adaptive and fast harmonic analysis methods with guaranteed precision based on multiresolution analysis and separated representations. Underpinning the numerical capabilities is a powerful petascale parallel programming environment that aims to increase both programmer productivity and code scalability. This paper describes the features and capabilities of MADNESS and briefly discusses some current applications in chemistry and several areas of physics.
Submitted 5 July, 2015;
originally announced July 2015.
-
Coordinate-Space Hartree-Fock-Bogoliubov Solvers for Superfluid Fermi Systems in Large Boxes
Authors:
J. C. Pei,
G. I. Fann,
R. J. Harrison,
W. Nazarewicz,
J. Hill,
D. Galindo,
J. Jia
Abstract:
The self-consistent Hartree-Fock-Bogoliubov problem in large boxes can be solved accurately in the coordinate space with the recently developed solvers HFB-AX (2D) and MADNESS-HFB (3D). This is essential for the description of superfluid Fermi systems with complicated topologies and significant spatial extent, such as fissioning nuclei, weakly-bound nuclei, nuclear matter in the neutron star crust, and ultracold Fermi atoms in elongated traps. The HFB-AX solver, based on B-spline techniques, uses a hybrid MPI and OpenMP programming model for distributed parallel computation; within a node, multi-threaded LAPACK and BLAS libraries are used to further enable parallel calculations of large eigensystems. The MADNESS-HFB solver uses novel adaptive pseudo-spectral techniques based on multi-resolution analysis to enable fully parallel 3D calculations of very large systems. In this work we present benchmark results for HFB-AX and MADNESS-HFB on ultracold trapped fermions.
Submitted 23 April, 2012;
originally announced April 2012.
-
Effect of Chemical Pressure on the Magnetic Transition of Multiferroic Ca-BiFeO3
Authors:
G. Catalan,
K. Sardar,
N. S. Church,
J. F. Scott,
R. J. Harrison,
S. A. T. Redfern
Abstract:
Multiferroic BiFeO3 ceramics have been doped with Ca. The smaller ionic size of Ca compared with Bi means that doping acts as a proxy for hydrostatic pressure, at a rate of 1%Ca=0.3GPa. It is also found that the magnetic Neel temperature (TNeel) increases as Ca concentration increases, at a rate of 0.66K per 1%Ca (molar). Based on the effect of chemical pressure on TNeel, we argue that applying hydrostatic pressure to pure BiFeO3 can be expected to increase its magnetic transition temperature at a rate of ~2.2K/GPa. The results also suggest that pressure (chemical or hydrostatic) could be used to bring the ferroelectric critical temperature, Tc, and the magnetic TNeel closer together, thereby enhancing magnetoelectric coupling, provided that electrical conductivity can be kept sufficiently low.
Submitted 17 March, 2009;
originally announced March 2009.
-
Vortex Ferroelectric Domains
Authors:
A. Gruverman,
D. Wu,
H. -J. Fan,
I. Vrejoiu,
M. Alexe,
R. J. Harrison,
J. F. Scott
Abstract:
We show experimental switching data on microscale capacitors of lead-zirconate-titanate (PZT), which reveal time-resolved domain behavior during switching on a 100-ns scale. For small circular capacitors, an unswitched domain remains in the center while complete switching is observed in square capacitors. The observed effect is attributed to the formation of a vortex domain during polarization switching in circular capacitors. This dynamical behavior is modeled using the Landau-Lifshitz-Gilbert equations and found to be in detailed agreement with experiment. This simulation implies rotational motion of polarization in the xy-plane, a Heisenberg-like result supported by the recent model of Naumov and Fu [Phys. Rev. Lett. 98, 077603 (2007)], although not directly measurable by the present quasi-static measurements.
Submitted 1 February, 2008;
originally announced February 2008.