-
Interactive and Urgent HPC: Challenges and Opportunities
Authors:
Albert Reuther,
Nick Brown,
William Arndt,
Johannes Blaschke,
Christian Boehme,
Antony Chazapis,
Bjoern Enders,
Robert Henschel,
Julian Kunkel,
Maxime Martinasso
Abstract:
As a broader set of applications from simulations to data analysis and machine learning require more parallel computational capability, the demand for interactive and urgent high performance computing (HPC) continues to increase. This paper overviews the progress made so far and elucidates the challenges and opportunities for greater integration of interactive and urgent HPC policies, techniques,…
▽ More
As a broader set of applications from simulations to data analysis and machine learning require more parallel computational capability, the demand for interactive and urgent high performance computing (HPC) continues to increase. This paper overviews the progress made so far and elucidates the challenges and opportunities for greater integration of interactive and urgent HPC policies, techniques, and technologies into HPC ecosystems.
△ Less
Submitted 25 January, 2024;
originally announced January 2024.
-
Oceananigans.jl: A model that achieves breakthrough resolution, memory and energy efficiency in global ocean simulations
Authors:
Simone Silvestri,
Gregory Wagner,
Christopher Hill,
Matin Raayai Ardakani,
Johannes Blaschke,
Jean-Michel Campin,
Valentin Churavy,
Navid Constantinou,
Alan Edelman,
John Marshall,
Ali Ramadhan,
Andre Souza,
Raffaele Ferrari
Abstract:
Climate models must simulate hundreds of future scenarios for hundreds of years at coarse resolutions, and a handful of high-resolution decadal simulations to resolve localized extreme events. Using Oceananigans.jl, written from scratch in Julia, we report several achievements: First, a global ocean simulation with breakthrough horizontal resolution -- 488m -- reaching 15 simulated days per day (0…
▽ More
Climate models must simulate hundreds of future scenarios for hundreds of years at coarse resolutions, and a handful of high-resolution decadal simulations to resolve localized extreme events. Using Oceananigans.jl, written from scratch in Julia, we report several achievements: First, a global ocean simulation with breakthrough horizontal resolution -- 488m -- reaching 15 simulated days per day (0.04 simulated years per day; SYPD). Second, Oceananigans simulates the global ocean at 488m with breakthrough memory efficiency on just 768 Nvidia A100 GPUs, a fraction of the resources available on current and upcoming exascale supercomputers. Third, and arguably most significant for climate modeling, Oceananigans achieves breakthrough energy efficiency reaching 0.95 SYPD at 1.7 km on 576 A100s and 9.9 SYPD at 10 km on 68 A100s -- the latter representing the highest horizontal resolutions employed by current IPCC-class ocean models. Routine climate simulations with 10 km ocean components are within reach.
△ Less
Submitted 12 September, 2023;
originally announced September 2023.
-
Bridging HPC Communities through the Julia Programming Language
Authors:
Valentin Churavy,
William F Godoy,
Carsten Bauer,
Hendrik Ranocha,
Michael Schlottke-Lakemper,
Ludovic Räss,
Johannes Blaschke,
Mosè Giordano,
Erik Schnetter,
Samuel Omlin,
Jeffrey S. Vetter,
Alan Edelman
Abstract:
The Julia programming language has evolved into a modern alternative to fill existing gaps in scientific computing and data science applications. Julia leverages a unified and coordinated single-language and ecosystem paradigm and has a proven track record of achieving high performance without sacrificing user productivity. These aspects make Julia a viable alternative to high-performance computin…
▽ More
The Julia programming language has evolved into a modern alternative to fill existing gaps in scientific computing and data science applications. Julia leverages a unified and coordinated single-language and ecosystem paradigm and has a proven track record of achieving high performance without sacrificing user productivity. These aspects make Julia a viable alternative to high-performance computing's (HPC's) existing and increasingly costly many-body workflow composition strategy in which traditional HPC languages (e.g., Fortran, C, C++) are used for simulations, and higher-level languages (e.g., Python, R, MATLAB) are used for data analysis and interactive computing. Julia's rapid growth in language capabilities, package ecosystem, and community make it a promising universal language for HPC. This paper presents the views of a multidisciplinary group of researchers from academia, government, and industry that advocate for an HPC software development paradigm that emphasizes developer productivity, workflow portability, and low barriers for entry. We believe that the Julia programming language, its ecosystem, and its community provide modern and powerful capabilities that enable this group's objectives. Crucially, we believe that Julia can provide a feasible and less costly approach to programming scientific applications and workflows that target HPC facilities. In this work, we examine the current practice and role of Julia as a common, end-to-end programming model to address major challenges in scientific reproducibility, data-driven AI/machine learning, co-design and workflows, scalability and performance portability in heterogeneous computing, network communication, data management, and community education. As a result, the diversification of current investments to fulfill the needs of the upcoming decade is crucial as more supercomputing centers prepare for the exascale era.
△ Less
Submitted 10 November, 2022; v1 submitted 4 November, 2022;
originally announced November 2022.
-
The LBNL Superfacility Project Report
Authors:
Deborah Bard,
Cory Snavely,
Lisa Gerhardt,
Jason Lee,
Becci Totzke,
Katie Antypas,
William Arndt,
Johannes Blaschke,
Suren Byna,
Ravi Cheema,
Shreyas Cholia,
Mark Day,
Bjoern Enders,
Aditi Gaur,
Annette Greiner,
Taylor Groves,
Mariam Kiran,
Quincey Koziol,
Tom Lehman,
Kelly Rowland,
Chris Samuel,
Ashwin Selvarajan,
Alex Sim,
David Skinner,
Laurie Stephey
, et al. (2 additional authors not shown)
Abstract:
The Superfacility model is designed to leverage HPC for experimental science. It is more than simply a model of connected experiment, network, and HPC facilities; it encompasses the full ecosystem of infrastructure, software, tools, and expertise needed to make connected facilities easy to use. The three-year Lawrence Berkeley National Laboratory (LBNL) Superfacility project was initiated in 2019…
▽ More
The Superfacility model is designed to leverage HPC for experimental science. It is more than simply a model of connected experiment, network, and HPC facilities; it encompasses the full ecosystem of infrastructure, software, tools, and expertise needed to make connected facilities easy to use. The three-year Lawrence Berkeley National Laboratory (LBNL) Superfacility project was initiated in 2019 to coordinate work being performed at LBNL to support this model, and to provide a coherent and comprehensive set of science requirements to drive existing and new work.
A key component of the project was the in-depth engagements with eight science teams that represent challenging use cases across the DOE Office of Science. By the close of the project, we met our project goal by enabling our science application engagements to demonstrate automated pipelines that analyze data from remote facilities at large scale, without routine human intervention. In several cases, we have gone beyond demonstrations and now provide production-level services. To achieve this goal, the Superfacility team developed tools, infrastructure, and policies for near-real-time computing support, dynamic high-performance networking, data management and movement tools, API-driven automation, HPC-scale notebooks via Jupyter, authentication using Federated Identity and container-based edge services supported.
The lessons we learned during this project provide a valuable model for future large, complex, cross-disciplinary collaborations. There is a pressing need for a coherent computing infrastructure across national facilities, and LBNL's Superfacility project is a unique model for success in tackling the challenges that will be faced in hardware, software, policies, and services across multiple science domains.
△ Less
Submitted 27 June, 2022; v1 submitted 23 June, 2022;
originally announced June 2022.
-
Accelerating X-Ray Tracing for Exascale Systems using Kokkos
Authors:
Felix Wittwer,
Nicholas K. Sauter,
Derek Mendez,
Billy K. Poon,
Aaron S. Brewster,
James M. Holton,
Michael E. Wall,
William E. Hart,
Deborah J. Bard,
Johannes P. Blaschke
Abstract:
The upcoming exascale computing systems Frontier and Aurora will draw much of their computing power from GPU accelerators. The hardware for these systems will be provided by AMD and Intel, respectively, each supporting their own GPU programming model. The challenge for applications that harness one of these exascale systems will be to avoid lock-in and to preserve performance portability.
We rep…
▽ More
The upcoming exascale computing systems Frontier and Aurora will draw much of their computing power from GPU accelerators. The hardware for these systems will be provided by AMD and Intel, respectively, each supporting their own GPU programming model. The challenge for applications that harness one of these exascale systems will be to avoid lock-in and to preserve performance portability.
We report here on our results of using Kokkos to accelerate a real-world application on NERSC's Perlmutter Phase 1 (using NVIDIA A100 accelerators) and the testbed system for OLCF's Frontier (using AMD MI250X). By porting to Kokkos, we were able to successfully run the same X-ray tracing code on both systems and achieved speed-ups between 13% and 66% compared to the original CUDA code. These results are a highly encouraging demonstration of using Kokkos to accelerate production science code.
△ Less
Submitted 16 May, 2022;
originally announced May 2022.
-
Real-Time XFEL Data Analysis at SLAC and NERSC: a Trial Run of Nascent Exascale Experimental Data Analysis
Authors:
Johannes P. Blaschke,
Aaron S. Brewster,
Daniel W. Paley,
Derek Mendez,
Asmit Bhowmick,
Nicholas K. Sauter,
Wilko Kröger,
Murali Shankar,
Bjoern Enders,
Deborah Bard
Abstract:
X-ray scattering experiments using Free Electron Lasers (XFELs) are a powerful tool to determine the molecular structure and function of unknown samples (such as COVID-19 viral proteins). XFEL experiments are a challenge to computing in two ways: i) due to the high cost of running XFELs, a fast turnaround time from data acquisition to data analysis is essential to make informed decisions on experi…
▽ More
X-ray scattering experiments using Free Electron Lasers (XFELs) are a powerful tool to determine the molecular structure and function of unknown samples (such as COVID-19 viral proteins). XFEL experiments are a challenge to computing in two ways: i) due to the high cost of running XFELs, a fast turnaround time from data acquisition to data analysis is essential to make informed decisions on experimental protocols; ii) data collection rates are growing exponentially, requiring new scalable algorithms. Here we report our experiences analyzing data from two experiments at the Linac Coherent Light Source (LCLS) during September 2020. Raw data were analyzed on NERSC's Cori XC40 system, using the Superfacility paradigm: our workflow automatically moves raw data between LCLS and NERSC, where it is analyzed using the software package CCTBX. We achieved real time data analysis with a turnaround time from data acquisition to full molecular reconstruction in as little as 10 min -- sufficient time for the experiment's operators to make informed decisions. By hosting the data analysis on Cori, and by automating LCLS-NERSC interoperability, we achieved a data analysis rate which matches the data acquisition rate. Completing data analysis with 10 mins is a first for XFEL experiments and an important milestone if we are to keep up with data collection trends.
△ Less
Submitted 31 December, 2023; v1 submitted 21 June, 2021;
originally announced June 2021.
-
Accelerating GMRES with Deep Learning in Real-Time
Authors:
Kevin Luna,
Katherine Klymko,
Johannes P. Blaschke
Abstract:
GMRES is a powerful numerical solver used to find solutions to extremely large systems of linear equations. These systems of equations appear in many applications in science and engineering. Here we demonstrate a real-time machine learning algorithm that can be used to accelerate the time-to-solution for GMRES. Our framework is novel in that is integrates the deep learning algorithm in an in situ…
▽ More
GMRES is a powerful numerical solver used to find solutions to extremely large systems of linear equations. These systems of equations appear in many applications in science and engineering. Here we demonstrate a real-time machine learning algorithm that can be used to accelerate the time-to-solution for GMRES. Our framework is novel in that is integrates the deep learning algorithm in an in situ fashion: the AI-accelerator gradually learns how to optimizes the time to solution without requiring user input (such as a pre-trained data set). We describe how our algorithm collects data and optimizes GMRES. We demonstrate our algorithm by implementing an accelerated (MLGMRES) solver in Python. We then use MLGMRES to accelerate a solver for the Poisson equation -- a class of linear problems that appears in may applications.
Informed by the properties of formal solutions to the Poisson equation, we test the performance of different neural networks. Our key takeaway is that networks which are capable of learning non-local relationships perform well, without needing to be scaled with the input problem size, making them good candidates for the extremely large problems encountered in high-performance computing. For the inputs studied, our method provides a roughly 2$\times$ acceleration.
△ Less
Submitted 19 March, 2021;
originally announced March 2021.
-
cuFINUFFT: a load-balanced GPU library for general-purpose nonuniform FFTs
Authors:
Yu-hsuan Shih,
Garrett Wright,
Joakim Andén,
Johannes Blaschke,
Alex H. Barnett
Abstract:
Nonuniform fast Fourier transforms dominate the computational cost in many applications including image reconstruction and signal processing. We thus present a general-purpose GPU-based CUDA library for type 1 (nonuniform to uniform) and type 2 (uniform to nonuniform) transforms in dimensions 2 and 3, in single or double precision. It achieves high performance for a given user-requested accuracy,…
▽ More
Nonuniform fast Fourier transforms dominate the computational cost in many applications including image reconstruction and signal processing. We thus present a general-purpose GPU-based CUDA library for type 1 (nonuniform to uniform) and type 2 (uniform to nonuniform) transforms in dimensions 2 and 3, in single or double precision. It achieves high performance for a given user-requested accuracy, regardless of the distribution of nonuniform points, via cache-aware point reordering, and load-balanced blocked spreading in shared memory. At low accuracies, this gives on-GPU throughputs around $10^9$ nonuniform points per second, and (even including host-device transfer) is typically 4-10$\times$ faster than the latest parallel CPU code FINUFFT (at 28 threads). It is competitive with two established GPU codes, being up to 90$\times$ faster at high accuracy and/or type 1 clustered point distributions. Finally we demonstrate a 5-12$\times$ speedup versus CPU in an X-ray diffraction 3D iterative reconstruction task at $10^{-12}$ accuracy, observing excellent multi-GPU weak scaling up to one rank per GPU.
△ Less
Submitted 25 March, 2021; v1 submitted 16 February, 2021;
originally announced February 2021.
-
Benchmarking of a preliminary MFiX-Exa code
Authors:
William D. Fullmer,
Ann S. Almgren,
Michele Rosso,
Johannes Blaschke,
Jordan Musser
Abstract:
MFiX-Exa is a new code being actively developed at Lawrence Berkeley National Laboratory and the National Energy Technology Laboratory as part of the U.S. Department of Energy's Exascale Computing Project. The starting point for the MFiX-Exa code development was the extraction of basic computational fluid dynamic (CFD) and discrete element method (DEM) capabilities from the existing MFiX-DEM code…
▽ More
MFiX-Exa is a new code being actively developed at Lawrence Berkeley National Laboratory and the National Energy Technology Laboratory as part of the U.S. Department of Energy's Exascale Computing Project. The starting point for the MFiX-Exa code development was the extraction of basic computational fluid dynamic (CFD) and discrete element method (DEM) capabilities from the existing MFiX-DEM code which was refactored into an AMReX code architecture, herein referred to as the preliminary MFiX-Exa code. Although drastic changes to the codebase will be required to produce an exascale capable application, benchmarking of the originating code helps to establish a valid start point for future development. In this work, four benchmark cases are considered, each corresponding to experimental data sets with history of CFD-DEM validation. We find that the preliminary MFiX-Exa code compares favorably with classic MFiX-DEM simulation predictions for three slugging/bubbling fluidized beds and one spout-fluid bed. Comparison to experimental data is also acceptable (within accuracy expected from previous CFD-DEM benchmarking and validation exercises) which is comprised of several measurement techniques including particle tracking velocimetry, positron emission particle tracking and magnetic resonance imaging. The work concludes with an overview of planned developmental work and potential benchmark cases to validate new MFiX-Exa capabilities.
△ Less
Submitted 4 September, 2019;
originally announced September 2019.
-
Gravity-induced dynamics of a squirmer microswimmer in wall proximity
Authors:
Felix Rühle,
Johannes Blaschke,
Jan-Timm Kuhr,
Holger Stark
Abstract:
We perform hydrodynamic simulations using the method of multi-particle collision dynamics and a theoretical analysis to study a single squirmer microswimmer at high Péclet number, which moves in a low Reynolds number fluid and under gravity. The relevant parameters are the ratio $α$ of swimming to bulk sedimentation velocity and the squirmer type $β$. The combination of self-propulsion, gravitatio…
▽ More
We perform hydrodynamic simulations using the method of multi-particle collision dynamics and a theoretical analysis to study a single squirmer microswimmer at high Péclet number, which moves in a low Reynolds number fluid and under gravity. The relevant parameters are the ratio $α$ of swimming to bulk sedimentation velocity and the squirmer type $β$. The combination of self-propulsion, gravitational force, hydrodynamic interactions with the wall, and thermal noise leads to a surprisingly diverse behavior. At $α> 1$ we observe cruising states, while for $α<1$ the squirmer resides close to the bottom wall with the motional state determined by stable fixed points in height and orientation. They strongly depend on the squirmer type $β$. While neutral squirmers permanently float above the wall with upright orientation, pullers float for $α$ larger than a threshold value $α_{\mathrm{th}}$ and are pinned to the wall below $α_{\mathrm{th}}$. In contrast, pushers slide along the wall at lower heights, from which thermal orientational fluctuations drive them into a recurrent floating state with upright orientation, where they remain on the timescale of orientational persistence.
△ Less
Submitted 23 November, 2017; v1 submitted 22 August, 2017;
originally announced August 2017.
-
Collective Sedimentation of Squirmers under Gravity
Authors:
Jan-Timm Kuhr,
Johannes Blaschke,
Felix Rühle,
Holger Stark
Abstract:
Active particles, which interact hydrodynamically, display a remarkable variety of emergent collective phenomena. We use squirmers to model spherical microswimmers and explore the collective behavior of thousands of them under the influence of strong gravity using the method of multi-particle collision dynamics for simulating fluid flow. The sedimentation profile depends on the ratio of swimming t…
▽ More
Active particles, which interact hydrodynamically, display a remarkable variety of emergent collective phenomena. We use squirmers to model spherical microswimmers and explore the collective behavior of thousands of them under the influence of strong gravity using the method of multi-particle collision dynamics for simulating fluid flow. The sedimentation profile depends on the ratio of swimming to sedimentation velocity as well as on the squirmer type. It shows close packed squirmer layers at the bottom and a highly dynamic region with exponential density dependence towards the top. The mean vertical orientation of the squirmers strongly depends on height. For swimming velocities larger than the sedimentation velocity, squirmers show strong convection in the exponential region. We quantify the strength of convection and the extent of convection cells by the vertical current density and its current dipole, which are large for neutral squirmers as well as for weak pushers and pullers.
△ Less
Submitted 15 August, 2017; v1 submitted 14 June, 2017;
originally announced June 2017.
-
Phase Separation and Coexistence of Hydrodynamically Interacting Microswimmers
Authors:
Johannes Blaschke,
Maurice Maurer,
Karthik Menon,
Andreas Zöttl,
Holger Stark
Abstract:
A striking feature of the collective behavior of spherical microswimmers is that for sufficiently strong self-propulsion they phase-separate into a dense cluster coexisting with a low-density dis- ordered surrounding. Extending our previous work, we use the squirmer as a model swimmer and the particle-based simulation method of multi-particle collision dynamics to explore the influence of hydrodyn…
▽ More
A striking feature of the collective behavior of spherical microswimmers is that for sufficiently strong self-propulsion they phase-separate into a dense cluster coexisting with a low-density dis- ordered surrounding. Extending our previous work, we use the squirmer as a model swimmer and the particle-based simulation method of multi-particle collision dynamics to explore the influence of hydrodynamics on their phase behavior in a quasi-two-dimensional geometry. The coarsening dynamics towards the phase-separated state is diffusive in an intermediate time regime followed by a final ballistic compactification of the dense cluster. We determine the binodal lines in a phase diagram of Péclet number versus density. Interestingly, the gas binodals are shifted to smaller densities for increasing mean density or dense-cluster size, which we explain using a recently introduced pressure balance [S. C. Takatori et al., Phys. Rev. Lett. 113, 028103 (2014)] extended by a hydrodynamic contribution. Furthermore, we find that for pushers and pullers the binodal line is shifted to larger Péclet numbers compared to neutral squirmers. Finally, when lowering the Péclet number, the dense phase transforms from a hexagonal "solid" to a disordered "fluid" state.
△ Less
Submitted 28 October, 2016; v1 submitted 8 September, 2016;
originally announced September 2016.
-
Granular Brownian Motors: role of gas anisotropy and inelasticity
Authors:
Johannes Blaschke,
Jürgen Vollmer
Abstract:
We investigate the motion of a wedge-shaped object (a granular Brownian motor), which is restricted to move along the x-axis and cannot rotate, as gas particles collide with it. We show that its steady-state drift, resulting from inelastic gas-motor collisions, is dramatically affected by anisotropy in the velocity distribution of the gas. We identify the dimensionless parameter providing the depe…
▽ More
We investigate the motion of a wedge-shaped object (a granular Brownian motor), which is restricted to move along the x-axis and cannot rotate, as gas particles collide with it. We show that its steady-state drift, resulting from inelastic gas-motor collisions, is dramatically affected by anisotropy in the velocity distribution of the gas. We identify the dimensionless parameter providing the dependence of this drift on shape, masses, inelasticity, and anisotropy: the anisotropy leads to a dramatic breaking of equipartition, which should easily be visible in experimental realizations.
△ Less
Submitted 6 April, 2013; v1 submitted 12 February, 2013;
originally announced February 2013.
-
Arrest of three-dimensional gravity-confined shear flow of wet granular matter
Authors:
Klaus Roeller,
Johannes Blaschke,
Stephan Herminghaus,
Jürgen Vollmer
Abstract:
We study the arrest of three-dimensional flow in wet granular matter subject to a sinusoidal external force and a gravitational field confining the flow in the vertical direction. The minimal strength of the external force that is required to keep the system in motion is determined by considering the balance of injected and dissipated power. This provides a prediction whose excellent quality is de…
▽ More
We study the arrest of three-dimensional flow in wet granular matter subject to a sinusoidal external force and a gravitational field confining the flow in the vertical direction. The minimal strength of the external force that is required to keep the system in motion is determined by considering the balance of injected and dissipated power. This provides a prediction whose excellent quality is demonstrated by a data collapse for an extensive set of event-driven molecular dynamics simulations where we varied the system size, particle number, the energy dissipated upon rupturing capillary bridges, and the bridge length where rupture occurs. The three parameters of the theoretical prediction all lie within narrow margins of theoretical estimates.
△ Less
Submitted 11 October, 2012;
originally announced October 2012.
-
Breath Figures: Nucleation, Growth, Coalescence, and the Size Distribution of Droplets
Authors:
Johannes Blaschke,
Tobias Lapp,
Björn Hof,
Jürgen Vollmer
Abstract:
The analysis of the size distribution of droplets condensing on a substrate (breath figures) is a test ground for scaling theories. Here, we show that a faithful description of these distributions must explicitly deal with the growth mechanisms of the droplets. This finding establishes a gateway connecting nucleation and growth of the smallest droplets on surfaces to gross features of the evolutio…
▽ More
The analysis of the size distribution of droplets condensing on a substrate (breath figures) is a test ground for scaling theories. Here, we show that a faithful description of these distributions must explicitly deal with the growth mechanisms of the droplets. This finding establishes a gateway connecting nucleation and growth of the smallest droplets on surfaces to gross features of the evolution of the droplet size distribution.
△ Less
Submitted 28 June, 2012;
originally announced June 2012.
-
Classical orbit bifurcation and quantum interference in mesoscopic magnetoconductance
Authors:
J. Blaschke,
M. Brack
Abstract:
We study the magnetoconductance of electrons through a mesoscopic channel with antidots. Through quantum interference effects, the conductance maxima as functions of the magnetic field strength and the antidot radius (regulated by the applied gate voltage) exhibit characteristic dislocations that have been observed experimentally. Using the semiclassical periodic orbit theory, we relate these di…
▽ More
We study the magnetoconductance of electrons through a mesoscopic channel with antidots. Through quantum interference effects, the conductance maxima as functions of the magnetic field strength and the antidot radius (regulated by the applied gate voltage) exhibit characteristic dislocations that have been observed experimentally. Using the semiclassical periodic orbit theory, we relate these dislocations directly to bifurcations of the leading classes of periodic orbits.
△ Less
Submitted 18 September, 1999; v1 submitted 25 June, 1999;
originally announced June 1999.
-
Periodic Orbit Theory of the circular billiard in homogeneous magnetic fields
Authors:
J. Blaschke,
M. Brack
Abstract:
We present a semiclassical description of the level density of a two-dimensional circular quantum dot in a homogeneous magnetic field. We model the total potential (including electron-electron interaction) of the dot containing many electrons by a circular billiard, i.e., a hard-wall potential. Using the extended approach of the Gutzwiller theory developed by Creagh and Littlejohn, we derive an…
▽ More
We present a semiclassical description of the level density of a two-dimensional circular quantum dot in a homogeneous magnetic field. We model the total potential (including electron-electron interaction) of the dot containing many electrons by a circular billiard, i.e., a hard-wall potential. Using the extended approach of the Gutzwiller theory developed by Creagh and Littlejohn, we derive an analytic semiclassical trace formula. For its numerical evaluation we use a generalization of the common Gaussian smoothing technique. In strong fields orbit bifurcations, boundary effects (grazing orbits) and diffractive effects (cree** orbits) come into play, and the comparison with the exact quantum mechanical result shows major deviations. We show that the dominant corrections stem from grazing orbits, the other effects being much less important. We implement the boundary effects, replacing the Maslov index by a quantum-mechanical reflection phase, and obtain a good agreement between the semiclassical and the quantum result for all field strengths. With this description, we are able to explain the main features of the gross-shell structure in terms of just one or two classical periodic orbits.
△ Less
Submitted 18 August, 1997; v1 submitted 1 April, 1997;
originally announced April 1997.