-
Combined track finding with GNN & CKF
Authors:
Lukas Heinrich,
Benjamin Huth,
Andreas Salzburger,
Tilo Wettig
Abstract:
The application of Graph Neural Networks (GNN) in track reconstruction is a promising approach to cope with the challenges arising at the High-Luminosity upgrade of the Large Hadron Collider (HL-LHC). GNNs show good track-finding performance in high-multiplicity scenarios and are naturally parallelizable on heterogeneous compute architectures.
Typical high-energy-physics detectors have high resolution in the innermost layers to support vertex reconstruction but lower resolution in the outer parts. GNNs mainly rely on 3D space-point information, which can cause reduced track-finding performance in the outer regions.
In this contribution, we present a novel combination of GNN-based track finding with the classical Combinatorial Kalman Filter (CKF) algorithm to circumvent this issue: The GNN resolves the track candidates in the inner pixel region, where 3D space points can represent measurements very well. These candidates are then picked up by the CKF in the outer regions, where the CKF performs well even for 1D measurements.
Using the ACTS infrastructure, we present a proof of concept based on truth tracking in the pixels as well as a dedicated GNN pipeline trained on $t\bar{t}$ events with pile-up 200 in the OpenDataDetector.
Submitted 29 January, 2024;
originally announced January 2024.
-
Taming numerical imprecision by adapting the KL divergence to negative probabilities
Authors:
Simon Pfahler,
Peter Georg,
Rudolf Schill,
Maren Klever,
Lars Grasedyck,
Rainer Spang,
Tilo Wettig
Abstract:
The Kullback-Leibler (KL) divergence is frequently used in data science. For discrete distributions on large state spaces, approximations of probability vectors may result in a few small negative entries, rendering the KL divergence undefined. We address this problem by introducing a parameterized family of substitute divergence measures, the shifted KL (sKL) divergence measures. Our approach is generic and does not increase the computational overhead. We show that the sKL divergence shares important theoretical properties with the KL divergence and discuss how its shift parameters should be chosen. If Gaussian noise is added to a probability vector, we prove that the average sKL divergence converges to the KL divergence for small enough noise. We also show that our method solves the problem of negative entries in an application from computational oncology, the optimization of Mutual Hazard Networks for cancer progression using tensor-train approximations.
Submitted 20 December, 2023;
originally announced December 2023.
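The problem this abstract starts from can be reproduced in a few lines. The following is a minimal sketch, not the authors' code: the function name `kl_divergence` and the example vectors are illustrative assumptions, and the sKL divergence itself is not reproduced here because its definition is not given in the abstract.

```python
import numpy as np


def kl_divergence(p, q):
    """Standard discrete KL divergence sum_i p_i * log(p_i / q_i).

    Entries with p_i == 0 are skipped (convention 0 * log 0 = 0).
    """
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    mask = p != 0
    # A negative ratio p_i / q_i makes the logarithm undefined (NaN).
    with np.errstate(invalid="ignore", divide="ignore"):
        return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))


# Hypothetical example: an exact probability vector and an approximation
# of it that picked up one small negative entry.
p = np.array([0.70, 0.29, 0.01])
q_approx = np.array([0.72, 0.29, -0.01])

kl_exact = kl_divergence(p, p)          # well defined
kl_broken = kl_divergence(p, q_approx)  # NaN: the divergence is undefined
```

A single negative entry is enough to turn the whole sum into NaN, which is why an optimization loop that relies on the KL divergence cannot proceed without some substitute measure.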
-
Gauge-equivariant pooling layers for preconditioners in lattice QCD
Authors:
Christoph Lehner,
Tilo Wettig
Abstract:
We demonstrate that gauge-equivariant pooling and unpooling layers can perform as well as traditional restriction and prolongation layers in multigrid preconditioner models for lattice QCD. These layers introduce a gauge degree of freedom on the coarse grid, allowing for the use of explicitly gauge-equivariant layers on the coarse grid. We investigate the construction of coarse-grid gauge fields and study their efficiency in the preconditioner model. We show that a combined multigrid neural network using a Galerkin construction for the coarse-grid gauge field eliminates critical slowing down.
Submitted 20 April, 2023;
originally announced April 2023.
-
Provenance for Lattice QCD workflows
Authors:
Tanja Auge,
Gunnar Bali,
Meike Klettke,
Bertram Ludäscher,
Wolfgang Söldner,
Simon Weishäupl,
Tilo Wettig
Abstract:
We present a provenance model for the generic workflow of numerical Lattice Quantum Chromodynamics (QCD) calculations, which constitute an important component of particle physics research. These calculations are carried out on the largest supercomputers worldwide with data in the multi-PetaByte range being generated and analyzed. In the Lattice QCD community, a custom metadata standard (QCDml) that includes certain provenance information already exists for one part of the workflow, the so-called generation of configurations.
In this paper, we follow the W3C PROV standard and formulate a provenance model that includes both the generation part and the so-called measurement part of the Lattice QCD workflow. We demonstrate the applicability of this model and show how the model can be used to answer some provenance-related research questions. However, many important provenance questions in the Lattice QCD community require extensions of this provenance model. To this end, we propose a multi-layered provenance approach that combines prospective and retrospective elements.
Submitted 22 March, 2023;
originally announced March 2023.
-
Gauge-equivariant neural networks as preconditioners in lattice QCD
Authors:
Christoph Lehner,
Tilo Wettig
Abstract:
We demonstrate that a state-of-the-art multi-grid preconditioner can be learned efficiently by gauge-equivariant neural networks. We show that the models require minimal re-training on different gauge configurations of the same gauge ensemble and to a large extent remain efficient under modest modifications of ensemble parameters. We also demonstrate that important paradigms such as communication avoidance are straightforward to implement in this framework.
Submitted 10 February, 2023;
originally announced February 2023.
-
MRHS multigrid solver for Wilson-clover fermions
Authors:
Daniel Richtmann,
Nils Meyer,
Tilo Wettig
Abstract:
We describe our implementation of a multigrid solver for Wilson-clover fermions, which increases parallelism by solving for multiple right-hand sides (MRHS) simultaneously. The solver is based on Grid and thus runs on all computing architectures supported by the Grid framework. We present detailed benchmarks of the relevant kernels, such as hopping and clover term on the various multigrid levels, intergrid operators, and reductions. The benchmarks were performed on the JUWELS Booster system at Jülich Supercomputing Centre, which is based on Nvidia A100 GPUs. For example, solving a $24^3\times128$ lattice on 16 GPUs, the overall speedup obtained solely from MRHS is about 10x.
Submitted 24 November, 2022;
originally announced November 2022.
-
Approximation formula for complex spacing ratios in the Ginibre ensemble
Authors:
Ioachim G. Dusa,
Tilo Wettig
Abstract:
Recently, Sá, Ribeiro and Prosen introduced complex spacing ratios to analyze eigenvalue correlations in non-Hermitian systems. At present there are no analytical results for the probability distribution of these ratios in the limit of large system size. We derive an approximation formula for the Ginibre universality class of random matrix theory which converges exponentially fast to the limit of infinite matrix size. We also give results for moments of the distribution in this limit.
Submitted 19 May, 2022; v1 submitted 13 January, 2022;
originally announced January 2022.
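For readers unfamiliar with the quantity studied in this abstract, the sketch below computes complex spacing ratios numerically. It assumes the usual definition $z_k = (\lambda_k^{\rm NN} - \lambda_k)/(\lambda_k^{\rm NNN} - \lambda_k)$, where NN and NNN denote the nearest and next-to-nearest neighbor of $\lambda_k$ in the complex plane; this definition is recalled from the literature and is not stated in the abstract. The helper names, the matrix size, and the normalization of the Ginibre matrix are illustrative assumptions, and the approximation formula derived in the paper is not reproduced.

```python
import numpy as np


def complex_spacing_ratios(eigenvalues):
    """Return z_k = (NN_k - lam_k) / (NNN_k - lam_k) for every eigenvalue."""
    lam = np.asarray(eigenvalues, dtype=complex)
    # Pairwise distances in the complex plane; exclude each point itself.
    dist = np.abs(lam[:, None] - lam[None, :])
    np.fill_diagonal(dist, np.inf)
    order = np.argsort(dist, axis=1)
    nearest = lam[order[:, 0]]
    next_nearest = lam[order[:, 1]]
    return (nearest - lam) / (next_nearest - lam)


def ginibre_matrix(n, rng):
    """Complex matrix with independent Gaussian entries (no symmetry imposed)."""
    return (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))) / np.sqrt(2)


rng = np.random.default_rng(0)
ratios = complex_spacing_ratios(np.linalg.eigvals(ginibre_matrix(200, rng)))
```

By construction every ratio lies in the closed unit disk, and because only ratios of local distances enter, no unfolding of the spectrum is needed before histogramming `ratios`.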
-
Differentiated uniformization: A new method for inferring Markov chains on combinatorial state spaces including stochastic epidemic models
Authors:
Kevin Rupp,
Rudolf Schill,
Jonas Süskind,
Peter Georg,
Maren Klever,
Andreas Lösch,
Lars Grasedyck,
Tilo Wettig,
Rainer Spang
Abstract:
Motivation: We consider continuous-time Markov chains that describe the stochastic evolution of a dynamical system by a transition-rate matrix $Q$ which depends on a parameter $θ$. Computing the probability distribution over states at time $t$ requires the matrix exponential $\exp(tQ)$, and inferring $θ$ from data requires its derivative $\partial\exp\!(tQ)/\partialθ$. Both are challenging to compute when the state space and hence the size of $Q$ is huge. This can happen when the state space consists of all combinations of the values of several interacting discrete variables. Often it is even impossible to store $Q$. However, when $Q$ can be written as a sum of tensor products, computing $\exp(tQ)$ becomes feasible by the uniformization method, which does not require explicit storage of $Q$.
Results: Here we provide an analogous algorithm for computing $\partial\exp\!(tQ)/\partialθ$, the differentiated uniformization method. We demonstrate our algorithm for the stochastic SIR model of epidemic spread, for which we show that $Q$ can be written as a sum of tensor products. We estimate monthly infection and recovery rates during the first wave of the COVID-19 pandemic in Austria and quantify their uncertainty in a full Bayesian analysis.
Availability: Implementation and data are available at https://github.com/spang-lab/TenSIR.
Submitted 20 December, 2021;
originally announced December 2021.
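The plain uniformization method mentioned in this abstract can be sketched as follows. This is a small dense-matrix illustration under stated assumptions, not the TenSIR implementation: it stores $Q$ explicitly (which the paper avoids through tensor products), it assumes the convention that the columns of $Q$ sum to zero so that $p(t) = \exp(tQ)\,p(0)$, and the function name, tolerance, and two-state test chain are hypothetical. The differentiated variant from the paper is not shown.

```python
import math

import numpy as np


def uniformization(Q, p0, t, tol=1e-12, max_terms=10_000):
    """Approximate exp(t*Q) @ p0 via exp(tQ) = sum_k Poisson(k; gamma*t) * P^k."""
    gamma = float(np.max(-np.diag(Q)))  # uniformization rate >= every exit rate
    p0 = np.asarray(p0, dtype=float)
    if gamma == 0.0:
        return p0.copy()
    P = np.eye(Q.shape[0]) + Q / gamma  # column-stochastic matrix
    weight = math.exp(-gamma * t)       # Poisson weight for k = 0
    term = p0.copy()                    # holds P^k @ p0
    result = weight * term
    accumulated = weight
    k = 0
    # The neglected Poisson mass bounds the truncation error in the 1-norm.
    while 1.0 - accumulated > tol and k < max_terms:
        k += 1
        term = P @ term
        weight *= gamma * t / k
        result += weight * term
        accumulated += weight
    return result


# Two-state chain with rate a for 0 -> 1 and rate b for 1 -> 0.
a, b, t = 2.0, 1.0, 0.7
Q = np.array([[-a, b], [a, -b]])
p_t = uniformization(Q, np.array([1.0, 0.0]), t)
p0_exact = b / (a + b) + a / (a + b) * math.exp(-(a + b) * t)
```

Only products of $P$ with a vector are needed, which is the property that lets the method work when $Q$ is available only implicitly. For large $\gamma t$ the initial weight $e^{-\gamma t}$ underflows, so a production version would need a more careful evaluation of the Poisson weights.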
-
Grid on QPACE 4
Authors:
Peter Georg,
Nils Meyer,
Stefan Solbrig,
Tilo Wettig
Abstract:
In 2020 we deployed QPACE 4, which features 64 Fujitsu A64FX model FX700 processors interconnected by InfiniBand EDR. QPACE 4 runs an open-source software stack. For Lattice QCD simulations we ported the Grid LQCD framework to support the ARM Scalable Vector Extension (SVE). In this contribution we discuss our SVE port of Grid, the status of SVE compilers and the performance of Grid. We also present the benefits of an alternative data layout of complex numbers for the Domain Wall operator.
Submitted 3 December, 2021;
originally announced December 2021.
-
Complex spacing ratios of the non-Hermitian Dirac operator in universality classes AI$^\dagger$ and AII$^\dagger$
Authors:
Takuya Kanazawa,
Tilo Wettig
Abstract:
We consider non-Hermitian Dirac operators in QCD-like theories coupled to a chiral U(1) potential or an imaginary chiral chemical potential. We show that in the continuum they fall into the recently discovered universality classes AI$^\dagger$ or AII$^\dagger$ of random matrix theory if the fermions transform in pseudoreal or real representations of the gauge group, respectively. For staggered fermions on the lattice this correspondence is reversed. We verify our predictions by computing spacing ratios of complex eigenvalues, whose distribution is universal without the need for unfolding.
Submitted 8 November, 2021;
originally announced November 2021.
-
Machine learning for surface prediction in ACTS
Authors:
Benjamin Huth,
Andreas Salzburger,
Tilo Wettig
Abstract:
We present an ongoing R&D activity for machine-learning-assisted navigation through detectors to be used for track reconstruction. We investigate different approaches of training neural networks for surface prediction and compare their results. This work is carried out in the context of the ACTS tracking toolkit.
Submitted 6 August, 2021;
originally announced August 2021.
-
New universality classes of the non-Hermitian Dirac operator in QCD-like theories
Authors:
Takuya Kanazawa,
Tilo Wettig
Abstract:
In non-Hermitian random matrix theory there are three universality classes for local spectral correlations: the Ginibre class and the nonstandard classes $\mathrm{AI}^\dagger$ and $\mathrm{AII}^\dagger$. We show that the continuum Dirac operator in two-color QCD coupled to a chiral $\mathrm{U}(1)$ gauge field or an imaginary chiral chemical potential falls in class $\mathrm{AI}^\dagger$ ($\mathrm{AII}^\dagger$) for fermions in pseudoreal (real) representations of $\mathrm{SU}(2)$. We introduce the corresponding chiral random matrix theories and verify our predictions in lattice simulations with staggered fermions, for which the correspondence between representation and universality class is reversed. Specifically, we compute the complex eigenvalue spacing ratios introduced recently. We also derive novel spectral sum rules.
Submitted 12 April, 2021;
originally announced April 2021.
-
ECM modeling and performance tuning of SpMV and Lattice QCD on A64FX
Authors:
Christie Alappat,
Nils Meyer,
Jan Laukemann,
Thomas Gruber,
Georg Hager,
Gerhard Wellein,
Tilo Wettig
Abstract:
The A64FX CPU is arguably the most powerful Arm-based processor design to date. Although it is a traditional cache-based multicore processor, its peak performance and memory bandwidth rival accelerator devices. A good understanding of its performance features is of paramount importance for developers who wish to leverage its full potential. We present an architectural analysis of the A64FX used in the Fujitsu FX1000 supercomputer at a level of detail that allows for the construction of Execution-Cache-Memory (ECM) performance models for steady-state loops. In the process we identify architectural peculiarities that point to viable generic optimization strategies. After validating the model using simple streaming loops we apply the insight gained to sparse matrix-vector multiplication (SpMV) and the domain wall (DW) kernel from quantum chromodynamics (QCD). For SpMV we show why the CRS matrix storage format is not a good practical choice on this architecture and how the SELL-C-sigma format can achieve bandwidth saturation. For the DW kernel we provide a cache-reuse analysis and show how an appropriate choice of data layout for complex arrays can realize memory-bandwidth saturation in this case as well. A comparison with state-of-the-art high-end Intel Cascade Lake AP and Nvidia V100 systems puts the capabilities of the A64FX into perspective. We also explore the potential for power optimizations using the tuning knobs provided by the Fugaku system, achieving energy savings of about 31% for SpMV and 18% for DW.
Submitted 30 July, 2021; v1 submitted 4 March, 2021;
originally announced March 2021.
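As background for the SpMV discussion in this abstract, here is a reference sketch of the CRS (compressed row storage) format and its matrix-vector product. It is written in plain Python for readability and says nothing about performance on the A64FX; the function names and the test matrix are illustrative assumptions, and the SELL-C-sigma format is not shown.

```python
import numpy as np


def dense_to_crs(A):
    """Convert a dense 2D array to CRS arrays (values, col_idx, row_ptr)."""
    values, col_idx, row_ptr = [], [], [0]
    for row in np.asarray(A, dtype=float):
        for j, a in enumerate(row):
            if a != 0.0:
                values.append(a)
                col_idx.append(j)
        row_ptr.append(len(values))  # start of the next row in `values`
    return np.array(values), np.array(col_idx, dtype=int), np.array(row_ptr, dtype=int)


def spmv_crs(values, col_idx, row_ptr, x):
    """Compute y = A @ x with A given in CRS format."""
    y = np.zeros(len(row_ptr) - 1)
    for i in range(len(y)):
        # The inner loop length equals the number of nonzeros in row i.
        for k in range(row_ptr[i], row_ptr[i + 1]):
            y[i] += values[k] * x[col_idx[k]]
    return y


A = np.array([[4.0, 0.0, 1.0],
              [0.0, 3.0, 0.0],
              [2.0, 0.0, 5.0]])
x = np.array([1.0, 2.0, 3.0])
y = spmv_crs(*dense_to_crs(A), x)
```

In CRS the inner loop runs over one row at a time, so its trip count is set by the number of nonzeros in that row, and the accesses to `x` are indirect through `col_idx`.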
-
Performance Modeling of Streaming Kernels and Sparse Matrix-Vector Multiplication on A64FX
Authors:
Christie L. Alappat,
Jan Laukemann,
Thomas Gruber,
Georg Hager,
Gerhard Wellein,
Nils Meyer,
Tilo Wettig
Abstract:
The A64FX CPU powers the current number one supercomputer on the Top500 list. Although it is a traditional cache-based multicore processor, its peak performance and memory bandwidth rival accelerator devices. Generating efficient code for such a new architecture requires a good understanding of its performance features. Using these features, we construct the Execution-Cache-Memory (ECM) performance model for the A64FX processor in the FX700 supercomputer and validate it using streaming loops. We also identify architectural peculiarities and derive optimization hints. Applying the ECM model to sparse matrix-vector multiplication (SpMV), we motivate why the CRS matrix storage format is inappropriate and how the SELL-C-sigma format with suitable code optimizations can achieve bandwidth saturation for SpMV.
Submitted 29 September, 2020;
originally announced September 2020.
-
Low-rank tensor methods for Markov chains with applications to tumor progression models
Authors:
Peter Georg,
Lars Grasedyck,
Maren Klever,
Rudolf Schill,
Rainer Spang,
Tilo Wettig
Abstract:
Continuous-time Markov chains describing interacting processes exhibit a state space that grows exponentially in the number of processes. This state-space explosion renders the computation or storage of the time-marginal distribution, which is defined as the solution of a certain linear system, infeasible using classical methods. We consider Markov chains whose transition rates are separable functions, which allows for an efficient low-rank tensor representation of the operator of this linear system. Typically, the right-hand side also has low-rank structure, and thus we can reduce the cost for computation and storage from exponential to linear. Previously known iterative methods also allow for low-rank approximations of the solution but are unable to guarantee that its entries sum up to one as required for a probability distribution. We derive a convergent iterative method using low-rank formats satisfying this condition. We also perform numerical experiments illustrating that the marginal distribution is well approximated with low rank.
Submitted 15 June, 2020;
originally announced June 2020.
-
Lattice QCD on a novel vector architecture
Authors:
Benjamin Huth,
Nils Meyer,
Tilo Wettig
Abstract:
The SX-Aurora TSUBASA PCIe accelerator card is the newest model of NEC's SX architecture family. Its multi-core vector processor features a vector length of 16 kbits and interfaces with up to 48 GB of HBM2 memory in the current models, available since 2018. The compute performance is up to 2.45 TFlop/s peak in double precision, and the memory throughput is up to 1.2 TB/s peak. New models with improved performance characteristics are announced for the near future. In this contribution we discuss key aspects of the SX-Aurora and describe how we enabled the architecture in the Grid Lattice QCD framework.
Submitted 1 February, 2020; v1 submitted 21 January, 2020;
originally announced January 2020.
-
Multigrid for Wilson Clover Fermions in Grid
Authors:
Daniel Richtmann,
Peter A. Boyle,
Tilo Wettig
Abstract:
With the ever-growing number of computing architectures, performance portability is an important aspect of (Lattice QCD) software. The Grid library provides a good framework for writing such code, as it thoroughly separates hardware-specific code from algorithmic functionality and already supports many modern architectures. We describe the implementation of a multigrid solver for Wilson clover fermions in Grid by the RQCD group. We present the features included in our implementation, discuss initial optimization efforts, and compare the performance with another multigrid implementation.
Submitted 18 April, 2019;
originally announced April 2019.
-
Induced QCD II: Numerical results
Authors:
Bastian B. Brandt,
Robert Lohmayer,
Tilo Wettig
Abstract:
We numerically explore an alternative discretization of continuum $\text{SU}(N_c)$ Yang-Mills theory on a Euclidean spacetime lattice, originally introduced by Budczies and Zirnbauer for gauge group $\text{U}(N_c)$. This discretization can be reformulated such that the self-interactions of the gauge field are induced by a path integral over $N_b$ auxiliary bosonic fields, which couple linearly to the gauge field. In the first paper of the series we have shown that the theory reproduces continuum $\text{SU}(N_c)$ Yang-Mills theory in $d=2$ dimensions if $N_b$ is larger than $N_c-\frac{3}{4}$ and conjectured, following the argument of Budczies and Zirnbauer, that this remains true for $d>2$. In the present paper, we test this conjecture by performing lattice simulations of the simplest nontrivial case, i.e., gauge group $\text{SU}(2)$ in three dimensions. We show that observables computed in the induced theory, such as the static $q\bar q$ potential and the deconfinement transition temperature, agree with the same observables computed from the ordinary plaquette action up to lattice artifacts. We also find that the bound for $N_b$ can be relaxed to $N_c-\frac{5}{4}$ as conjectured in our earlier paper. Studies of how the new discretization can be used to change the order of integration in the path integral to arrive at dual formulations of QCD are left for future work.
Submitted 8 April, 2019;
originally announced April 2019.
-
Lattice QCD on upcoming Arm architectures
Authors:
Nils Meyer,
Dirk Pleiter,
Stefan Solbrig,
Tilo Wettig
Abstract:
Recently Arm introduced a new instruction set called Scalable Vector Extension (SVE), which supports vector lengths up to 2048 bits. While SVE hardware will not be generally available until about 2021, we believe that future SVE-based architectures will have great potential for Lattice QCD. In this contribution we discuss key aspects of SVE and describe how we implemented SVE in the Grid Lattice QCD framework.
Submitted 8 April, 2019;
originally announced April 2019.
-
SVE-enabling Lattice QCD Codes
Authors:
Nils Meyer,
Peter Georg,
Dirk Pleiter,
Stefan Solbrig,
Tilo Wettig
Abstract:
Optimization of applications for supercomputers of the highest performance class requires parallelization at multiple levels using different techniques. In this contribution we focus on parallelization of particle physics simulations through vector instructions. With the advent of the Scalable Vector Extension (SVE) ISA, future ARM-based processors are expected to provide a significant level of parallelism at this level.
Submitted 22 January, 2019;
originally announced January 2019.
-
Dirac spectrum and chiral condensate for QCD at fixed $θ$-angle
Authors:
M. Kieburg,
J. J. M. Verbaarschot,
T. Wettig
Abstract:
We analyze the mass dependence of the chiral condensate for QCD at nonzero $θ$-angle and find that in general the discontinuity of the chiral condensate is not on the support of the Dirac spectrum. To understand this behavior we decompose the spectral density and the chiral condensate into contributions from the zero modes, the quenched part, and a remainder which is sensitive to the fermion determinant and is referred to as the dynamical part. We obtain general formulas for the contributions of the zero modes. Expressions for the quenched part, valid for an arbitrary number of flavors, and for the dynamical part, valid for one and two flavors, are derived in the microscopic domain of QCD. We find that at nonzero $θ$-angle the quenched and dynamical part of the Dirac spectral density are strongly oscillating with an amplitude that increases exponentially with the volume $V$ and a period of order of $1/V$. The quenched part of the chiral condensate becomes exponentially large at $θ\ne0$, but this divergence is canceled by the contribution from the zero modes. The oscillatory behavior of the dynamical part of the density is essential for moving the discontinuity of the chiral condensate away from the support of the Dirac spectrum. As important by-products of this work we obtain analytical expressions for the microscopic spectral density of the Dirac operator at nonzero $θ$-angle for both one- and two-flavor QCD with nonzero quark masses.
Submitted 25 September, 2018;
originally announced September 2018.
-
Loss-function learning for digital tissue deconvolution
Authors:
Franziska Görtler,
Stefan Solbrig,
Tilo Wettig,
Peter J. Oefner,
Rainer Spang,
Michael Altenbuchinger
Abstract:
The gene expression profile of a tissue averages the expression profiles of all cells in this tissue. Digital tissue deconvolution (DTD) addresses the following inverse problem: Given the expression profile $y$ of a tissue, what is the cellular composition $c$ of that tissue? If $X$ is a matrix whose columns are reference profiles of individual cell types, the composition $c$ can be computed by minimizing $\mathcal L(y-Xc)$ for a given loss function $\mathcal L$. Current methods use predefined all-purpose loss functions. They successfully quantify the dominating cells of a tissue, while often falling short in detecting small cell populations.
Here we learn the loss function $\mathcal L$ along with the composition $c$. This allows us to adapt to application-specific requirements such as focusing on small cell populations or distinguishing phenotypically similar cell populations. Our method quantifies large cell fractions as accurately as existing methods and significantly improves the detection of small cell populations and the distinction of similar cell types.
Submitted 25 January, 2018;
originally announced January 2018.
-
DD-$α$AMG on QPACE 3
Authors:
Peter Georg,
Daniel Richtmann,
Tilo Wettig
Abstract:
We describe our experience porting the Regensburg implementation of the DD-$α$AMG solver from QPACE 2 to QPACE 3. We first review how the code was ported from the first generation Intel Xeon Phi processor (Knights Corner) to its successor (Knights Landing). We then describe the modifications in the communication library necessitated by the switch from InfiniBand to Omni-Path. Finally, we present the performance of the code on a single processor as well as the scaling on many nodes, where in both cases the speedup factor is close to the theoretical expectations.
Submitted 19 October, 2017;
originally announced October 2017.
-
Chiral condensate and Dirac spectrum of one- and two-flavor QCD at nonzero $θ$-angle
Authors:
Mario Kieburg,
Jacobus Verbaarschot,
Tilo Wettig
Abstract:
In the $ε$-domain of QCD we have obtained exact analytical expressions for the eigenvalue density of the Dirac operator at fixed $θ\ne 0$ for both one and two flavors. These results made it possible to explain how the different contributions to the spectral density conspire to give a chiral condensate at fixed $θ$ that does not change sign when the quark mass (or one of the quark masses for two flavors) crosses the imaginary axis, while the chiral condensate at fixed topological charge does change sign. From QCD at nonzero density we have learnt that the discontinuity of the chiral condensate may move to a different location when the spectral density increases exponentially with the volume with oscillations on the order of the inverse volume. This is indeed what happens when the product of the quark masses becomes negative, but the situation is more subtle in this case: the contribution of the "quenched" part of the spectral density diverges in the thermodynamic limit at nonzero $θ$, but this divergence is canceled exactly by the contribution from the zero modes. We conclude that the zero modes are essential for the continuity of the chiral condensate and that their contribution has to be perfectly balanced against the contribution from the nonzero modes. Lattice simulations at nonzero $θ$-angle can only be trusted if this is indeed the case.
Submitted 18 October, 2017;
originally announced October 2017.
-
Complete random matrix classification of SYK models with $\mathcal{N}=0$, $1$ and $2$ supersymmetry
Authors:
Takuya Kanazawa,
Tilo Wettig
Abstract:
We present a complete symmetry classification of the Sachdev-Ye-Kitaev (SYK) model with $\mathcal{N}=0$, $1$ and $2$ supersymmetry (SUSY) on the basis of the Altland-Zirnbauer scheme in random matrix theory (RMT). For $\mathcal{N}=0$ and $1$ we consider generic $q$-body interactions in the Hamiltonian and find RMT classes that were not present in earlier classifications of the same model with $q=4$. We numerically establish quantitative agreement between the distributions of the smallest energy levels in the $\mathcal{N}=1$ SYK model and RMT. Furthermore, we delineate the distinctive structure of the $\mathcal{N}=2$ SYK model and provide its complete symmetry classification based on RMT for all eigenspaces of the fermion number operator. We corroborate our classification by detailed numerical comparisons with RMT and thus establish the presence of quantum chaotic dynamics in the $\mathcal{N}=2$ SYK model. We also introduce a new SYK-like model without SUSY that exhibits hybrid properties of the $\mathcal{N}=1$ and $\mathcal{N}=2$ SYK models and uncover its rich structure both analytically and numerically.
Submitted 29 August, 2017; v1 submitted 9 June, 2017;
originally announced June 2017.
-
Scale-invariant biomarker discovery in urine and plasma metabolite fingerprints
Authors:
Helena U. Zacharias,
Thorsten Rehberg,
Sebastian Mehrl,
Daniel Richtmann,
Tilo Wettig,
Peter J. Oefner,
Rainer Spang,
Wolfram Gronwald,
Michael Altenbuchinger
Abstract:
Motivation: Metabolomics data is typically scaled to a common reference like a constant volume of body fluid, a constant creatinine level, or a constant area under the spectrum. Such normalization of the data, however, may affect the selection of biomarkers and the biological interpretation of results in unforeseen ways.
Results: First, we study how the outcome of hypothesis tests for differential metabolite concentration is affected by the choice of scale. Furthermore, we observe this interdependence also for different classification approaches. Second, to overcome this problem and establish a scale-invariant biomarker discovery algorithm, we extend linear zero-sum regression to the logistic regression framework and show in two applications to ${}^1$H NMR-based metabolomics data how this approach overcomes the scaling problem.
Availability: Logistic zero-sum regression is available as an R package as well as a high-performance computing implementation that can be downloaded at https://github.com/rehbergT/zeroSum
Submitted 22 March, 2017;
originally announced March 2017.
-
pMR: A high-performance communication library
Authors:
Peter Georg,
Daniel Richtmann,
Tilo Wettig
Abstract:
On many parallel machines, the time LQCD applications spend in communication is a significant contribution to the total wall-clock time, especially in the strong-scaling limit. We present a novel high-performance communication library that can be used as a de facto drop-in replacement for MPI in existing software. Its lightweight nature, which avoids some of the unnecessary overhead introduced by MPI, allows us to improve the communication performance of applications without any algorithmic or complicated implementation changes. As a first real-world benchmark, we make use of the pMR library in the coarse-grid solve of the Regensburg implementation of the DD-$α$AMG algorithm. On realistic lattices, we see an improvement by a factor of 2 in pure communication time and total execution time savings of up to 20%.
Submitted 30 January, 2017;
originally announced January 2017.
-
Induced QCD I: Theory
Authors:
Bastian B. Brandt,
Robert Lohmayer,
Tilo Wettig
Abstract:
We explore an alternative discretization of continuum SU(N_c) Yang-Mills theory on a Euclidean spacetime lattice, originally introduced by Budczies and Zirnbauer. In this discretization the self-interactions of the gauge field are induced by a path integral over N_b auxiliary boson fields, which are coupled linearly to the gauge field. The main progress compared to earlier approaches is that N_b can be as small as N_c. In the present paper we (i) extend the proof that the continuum limit of the new discretization reproduces Yang-Mills theory in two dimensions from gauge group U(N_c) to SU(N_c), (ii) derive refined bounds on N_b for non-integer values, and (iii) perform a perturbative calculation to match the bare parameter of the induced gauge theory to the standard lattice coupling. In follow-up papers we will present numerical evidence in support of the conjecture that the induced gauge theory reproduces Yang-Mills theory also in three and four dimensions, and explore the possibility to integrate out the gauge fields to arrive at a dual formulation of lattice QCD.
Submitted 21 September, 2016;
originally announced September 2016.
-
Multiple right-hand-side setup for the DD-αAMG
Authors:
Daniel Richtmann,
Simon Heybrock,
Tilo Wettig
Abstract:
The setup cost of a modern solver such as DD-αAMG (Wuppertal Multigrid) is a significant contribution to the total time spent on solving the Dirac equation, and in HMC it can even be dominant. We present an improved implementation of this algorithm with modified computation order in the setup procedure. By processing multiple right-hand sides simultaneously we can alleviate many of the performance issues of the default single right-hand-side setup. The main improvements are as follows: By combining multiple right-hand sides the message size for off-chip communication is larger, which leads to better utilization of the network bandwidth. Many matrix-vector products are replaced by matrix-matrix products, leading to better cache reuse. The synchronization overhead inflicted by on-chip parallelization (threading), which is becoming crucial on many-core architectures such as the Intel Xeon Phi, is effectively reduced. In the parts implemented so far, we observe a speedup of roughly 3x compared to the optimized version of the single right-hand-side setup on realistic lattices.
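The replacement of many matrix-vector products by one matrix-matrix product can be illustrated in a few lines. This is a generic NumPy sketch under simplifying assumptions (a small dense random matrix stands in for the operator, and the function names are hypothetical); it is not the DD-αAMG code and says nothing about its actual data layout or performance.

```python
import numpy as np

rng = np.random.default_rng(1)
n, n_rhs = 64, 8
A = rng.standard_normal((n, n))      # stand-in for the operator (dense here for simplicity)
B = rng.standard_normal((n, n_rhs))  # one right-hand side per column


def apply_one_by_one(A, B):
    """Single right-hand-side style: one matrix-vector product per column."""
    return np.stack([A @ B[:, k] for k in range(B.shape[1])], axis=1)


def apply_blocked(A, B):
    """Multiple right-hand-side style: a single matrix-matrix product."""
    return A @ B


same_result = np.allclose(apply_one_by_one(A, B), apply_blocked(A, B))
```

Both functions compute the same numbers. The blocked form loads each entry of `A` once for all right-hand sides, which is the cache-reuse argument made in the abstract.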
Submitted 13 January, 2016;
originally announced January 2016.
-
Adaptive algebraic multigrid on SIMD architectures
Authors:
Simon Heybrock,
Matthias Rottmann,
Peter Georg,
Tilo Wettig
Abstract:
We present details of our implementation of the Wuppertal adaptive algebraic multigrid code DD-$α$AMG on SIMD architectures, with particular emphasis on the Intel Xeon Phi processor (KNC) used in QPACE 2. As a smoother, the algorithm uses a domain-decomposition-based solver code previously developed for the KNC in Regensburg. We optimized the remaining parts of the multigrid code and conclude that it is a very good target for SIMD architectures. Some of the remaining bottlenecks can be eliminated by vectorizing over multiple test vectors in the setup, which is discussed in the contribution of Daniel Richtmann.
Submitted 14 December, 2015;
originally announced December 2015.
-
Induced YM theory with auxiliary bosons
Authors:
Bastian B. Brandt,
Robert Lohmayer,
Tilo Wettig
Abstract:
We study pure SU(N) lattice gauge theory with a plaquette weight factor given by an inverse determinant which can be written as an integral over auxiliary bosonic fields (modifying a proposal of Budczies and Zirnbauer). We derive conditions for the existence of a continuum limit and its equivalence to Yang-Mills theory. Furthermore, we perturbatively compute the relation between the coupling constants of the "induced" gauge action and the familiar Wilson gauge action using the background-field technique. The perturbative relation agrees well with numerical results for N=2 in three dimensions.
Submitted 26 November, 2015;
originally announced November 2015.
-
QPACE 2 and Domain Decomposition on the Intel Xeon Phi
Authors:
Paul Arts,
Jacques Bloch,
Peter Georg,
Benjamin Glaessle,
Simon Heybrock,
Yu Komatsubara,
Robert Lohmayer,
Simon Mages,
Bernhard Mendl,
Nils Meyer,
Alessio Parcianello,
Dirk Pleiter,
Florian Rappl,
Mauro Rossi,
Stefan Solbrig,
Giampietro Tecchiolli,
Tilo Wettig,
Gianpaolo Zanier
Abstract:
We give an overview of QPACE 2, which is a custom-designed supercomputer based on Intel Xeon Phi processors, developed in a collaboration of Regensburg University and Eurotech. We give some general recommendations for how to write high-performance code for the Xeon Phi and then discuss our implementation of a domain-decomposition-based solver and present a number of benchmarks.
Submitted 13 February, 2015;
originally announced February 2015.
-
The Chiral Condensate of One-Flavor QCD and the Dirac Spectrum at θ=0
Authors:
Jacobus Verbaarschot,
Tilo Wettig
Abstract:
In a sector of fixed topological charge, the chiral condensate has a discontinuity given by the Banks-Casher formula also in the case of one-flavor QCD. However, at fixed θ-angle, the chiral condensate remains constant when the quark mass crosses zero. To reconcile these contradictory observations, we have evaluated the spectral density of one-flavor QCD at θ=0. For negative quark mass, it becomes a strongly oscillating function with a period that scales as the inverse space-time volume and an amplitude that increases exponentially with the space-time volume. As we have learned from QCD at nonzero chemical potential, if this is the case, an alternative to the Banks-Casher formula applies, and as we will demonstrate in this talk, for one-flavor QCD this results in a continuous chiral condensate. A special role is played by the topological zero modes which have to be taken into account exactly in order to get a finite chiral condensate in the thermodynamic limit.
Submitted 17 December, 2014;
originally announced December 2014.
-
Lattice QCD with Domain Decomposition on Intel Xeon Phi Co-Processors
Authors:
Simon Heybrock,
Bálint Joó,
Dhiraj D. Kalamkar,
Mikhail Smelyanskiy,
Karthikeyan Vaidyanathan,
Tilo Wettig,
Pradeep Dubey
Abstract:
The gap between the cost of moving data and the cost of computing continues to grow, making it ever harder to design iterative solvers on extreme-scale architectures. This problem can be alleviated by alternative algorithms that reduce the amount of data movement. We investigate this in the context of Lattice Quantum Chromodynamics and implement such an alternative solver algorithm, based on domain decomposition, on Intel Xeon Phi co-processor (KNC) clusters. We demonstrate close-to-linear on-chip scaling to all 60 cores of the KNC. With a mix of single- and half-precision the domain-decomposition method sustains 400-500 Gflop/s per chip. Compared to an optimized KNC implementation of a standard solver [1], our full multi-node domain-decomposition solver strong-scales to more nodes and reduces the time-to-solution by a factor of 5.
Submitted 8 December, 2014;
originally announced December 2014.
-
Induced QCD with two auxiliary bosonic fields
Authors:
Bastian B. Brandt,
Tilo Wettig
Abstract:
Following a proposal of Budczies and Zirnbauer, we investigate an alternative lattice discretization of continuum ${\rm SU}(N_c)$ Yang-Mills theory in which the self-interactions of the gauge field are induced by a path integral over $N_b\ge N_c-1$ auxiliary bosonic fields which are coupled linearly to the gauge field. In two dimensions there exists an analytic proof that the new discretization reproduces Yang-Mills theory in its non-perturbative continuum limit. We provide numerical evidence that this is also the case in three and four dimensions and that, after a suitable matching of the free parameters, the results of the induced theory agree with results from the ordinary plaquette action up to lattice artifacts. The new discretization is ideally suited to change the order of integration in the QCD path integral to arrive at formulations in which the gauge fields have been integrated out. The resulting theories might be amenable to methods previously used in the infinite-coupling limit, and we briefly discuss possibilities to arrive at dual representations of lattice QCD.
Submitted 12 November, 2014;
originally announced November 2014.
-
Dirac spectrum of one-flavor QCD at θ=0 and continuity of the chiral condensate
Authors:
J. J. M. Verbaarschot,
T. Wettig
Abstract:
We derive exact analytical expressions for the spectral density of the Dirac operator at fixed θ-angle in the microscopic domain of one-flavor QCD. These results are obtained by performing the sum over topological sectors using novel identities involving sums of products of Bessel functions. Because the fermion determinant is not positive definite for negative quark mass, the usual Banks-Casher relation is not valid and has to be replaced by a different mechanism first observed for QCD at nonzero chemical potential. Using the exact results for the spectral density we explain how this mechanism results in a chiral condensate that remains constant when the quark mass changes sign.
Submitted 31 July, 2014;
originally announced July 2014.
-
Stressed Cooper pairing in QCD at high isospin density: effective Lagrangian and random matrix theory
Authors:
Takuya Kanazawa,
Tilo Wettig
Abstract:
We generalize QCD at asymptotically large isospin chemical potential to an arbitrary even number of flavors. We also allow for small quark chemical potentials, which stress the coincident Fermi surfaces of the paired quarks and lead to a sign problem in Monte Carlo simulations. We derive the corresponding low-energy effective theory in both $p$- and $ε$-expansion and quantify the severity of the sign problem. We construct the random matrix theory describing our physical situation and show that it can be mapped to a known random matrix theory at low baryon density so that new insights can be gained without additional calculations. In particular, we explain the Silver Blaze phenomenon at high isospin density. We also introduce stressed singular values of the Dirac operator and relate them to the pionic condensate. Finally we comment on extensions of our work to two-color QCD.
Submitted 28 September, 2014; v1 submitted 23 June, 2014;
originally announced June 2014.
-
Banks-Casher-type relations for complex Dirac spectra
Authors:
Takuya Kanazawa,
Tilo Wettig,
Naoki Yamamoto
Abstract:
For theories with a sign problem there is no analog of the Banks-Casher relation. This is true in particular for QCD at nonzero quark chemical potential. However, for QCD-like theories without a sign problem the Banks-Casher relation can be extended to the case of complex Dirac eigenvalues. We derive such extensions for the zero-temperature, high-density limits of two-color QCD, QCD at nonzero isospin chemical potential, and adjoint QCD. In all three cases the density of the complex Dirac eigenvalues at the origin is proportional to the BCS gap squared.
Submitted 16 March, 2014;
originally announced March 2014.
-
Sign problem and subsets in one-dimensional QCD
Authors:
Jacques Bloch,
Falk Bruckmann,
Tilo Wettig
Abstract:
We present a subset method that solves the sign problem for QCD at nonzero quark chemical potential in 0+1 dimensions. The subsets of gauge configurations are constructed using the center symmetry of the SU(3) group. These subsets completely solve the sign problem for up to five flavors. For a larger number of flavors the sign problem slowly reappears, and we propose an extension of the subsets that also solves the sign problem for these cases. The subset method allows for numerical simulations of the model at nonzero chemical potential. We also present some preliminary results on subsets for QCD in two, three, and four dimensions.
Submitted 14 April, 2014; v1 submitted 24 October, 2013;
originally announced October 2013.
-
iDataCool: HPC with Hot-Water Cooling and Energy Reuse
Authors:
Nils Meyer,
Manfred Ries,
Stefan Solbrig,
Tilo Wettig
Abstract:
iDataCool is an HPC architecture jointly developed by the University of Regensburg and the IBM Research and Development Lab Böblingen. It is based on IBM's iDataPlex platform, whose air-cooling solution was replaced by a custom water-cooling solution that allows for cooling water temperatures of 70 °C (158 °F). The system is coupled to an adsorption chiller by InvenSor that operates efficiently at these temperatures. Thus a significant portion of the energy spent on HPC can be recovered in the form of chilled water, which can then be used to cool other parts of the computing center. We describe the architecture of iDataCool and present benchmarks of the cooling performance and the energy (reuse) efficiency.
Submitted 19 September, 2013;
originally announced September 2013.
-
Subset method for one-dimensional QCD
Authors:
Jacques Bloch,
Falk Bruckmann,
Tilo Wettig
Abstract:
We present a subset method which solves the sign problem for QCD at nonzero quark chemical potential in 0+1 dimensions. The subsets gather gauge configurations based on the center symmetry of the SU(3) group. We show that the sign problem is solved for one to five quark flavors and that it slowly reappears for a larger number of flavors. We formulate an extension of the center subsets that solves the sign problem for a larger number of flavors as well. We also derive some new analytical results for this toy model.
Submitted 24 October, 2013; v1 submitted 4 July, 2013;
originally announced July 2013.
-
Singular values of the Dirac operator at nonzero density
Authors:
Takuya Kanazawa,
Tilo Wettig,
Naoki Yamamoto
Abstract:
At nonzero density the eigenvalues of the Dirac operator move into the complex plane, while its singular values remain real and nonnegative. In QCD-like theories, the singular-value spectrum carries information on the diquark (or pionic) condensate. We have constructed low-energy effective theories in different density regimes and derived a number of exact results for the Dirac singular values, including Banks-Casher-type relations for the diquark (or pionic) condensate, Smilga-Stern-type relations for the slope of the singular-value density, and Leutwyler-Smilga-type sum rules for the inverse singular values. We also present a rigorous index theorem for non-Hermitian Dirac operators.
Submitted 10 December, 2012;
originally announced December 2012.
-
Banks-Casher-type relation for the BCS gap at high density
Authors:
Takuya Kanazawa,
Tilo Wettig,
Naoki Yamamoto
Abstract:
We derive a new Banks-Casher-type relation which relates the density of complex Dirac eigenvalues at the origin to the BCS gap of quarks at high density. Our relation is applicable to QCD and QCD-like theories without a sign problem, such as two-color QCD and adjoint QCD with baryon chemical potential, and QCD with isospin chemical potential. It provides us with a method to measure the BCS gap through the Dirac spectrum on the lattice.
Submitted 23 April, 2013; v1 submitted 22 November, 2012;
originally announced November 2012.
-
Wigner surmise for mixed symmetry classes in random matrix theory
Authors:
Sebastian Schierenberg,
Falk Bruckmann,
Tilo Wettig
Abstract:
We consider the nearest-neighbor spacing distributions of mixed random matrix ensembles interpolating between different symmetry classes, or between integrable and non-integrable systems. We derive analytical formulas for the spacing distributions of 2x2 or 4x4 matrices and show numerically that they provide very good approximations for those of random matrices with large dimension. This generalizes the Wigner surmise, which is valid for pure ensembles that are recovered as limits of the mixed ensembles. We show how the coupling parameters of small and large matrices must be matched depending on the local eigenvalue density.
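The pure-ensemble limit mentioned above can be checked numerically. The sketch below covers only the orthogonal (GOE) case, for which the Wigner surmise is $P(s)=\frac{\pi}{2}\,s\,e^{-\pi s^2/4}$ with mean spacing normalized to one; the mixed ensembles of the paper are not reproduced. The function names, sample size, and variance convention (diagonal variance 1, off-diagonal variance 1/2) are choices made for this illustration.

```python
import numpy as np


def goe_2x2_spacings(n_samples, seed=0):
    """Eigenvalue spacings of random real symmetric 2x2 matrices [[a, b], [b, d]].

    The two eigenvalues differ by sqrt((a - d)**2 + 4 b**2). Spacings are
    rescaled so that their sample mean is one.
    """
    rng = np.random.default_rng(seed)
    a = rng.normal(0.0, 1.0, n_samples)
    d = rng.normal(0.0, 1.0, n_samples)
    b = rng.normal(0.0, np.sqrt(0.5), n_samples)
    s = np.sqrt((a - d) ** 2 + 4.0 * b ** 2)
    return s / s.mean()


def wigner_surmise_goe(s):
    """Wigner surmise for the orthogonal class, normalized to unit mean spacing."""
    return 0.5 * np.pi * s * np.exp(-0.25 * np.pi * s ** 2)


spacings = goe_2x2_spacings(200_000)
hist, edges = np.histogram(spacings, bins=40, range=(0.0, 4.0), density=True)
centers = 0.5 * (edges[:-1] + edges[1:])
# Largest deviation between the sampled histogram and the analytical curve.
max_dev = np.max(np.abs(hist - wigner_surmise_goe(centers)))
```

For 2x2 matrices the surmise is exact, so `max_dev` reflects only statistical and binning error. The variance of the normalized spacings should be close to $4/\pi - 1$.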
Submitted 21 August, 2012; v1 submitted 16 February, 2012;
originally announced February 2012.
-
Singular values of the Dirac operator in dense QCD-like theories
Authors:
Takuya Kanazawa,
Tilo Wettig,
Naoki Yamamoto
Abstract:
We study the singular values of the Dirac operator in dense QCD-like theories at zero temperature. The Dirac singular values are real and nonnegative at any nonzero quark density. The scale of their spectrum is set by the diquark condensate, in contrast to the complex Dirac eigenvalues whose scale is set by the chiral condensate at low density and by the BCS gap at high density. We identify three different low-energy effective theories with diquark sources applicable at low, intermediate, and high density, together with their overlapping domains of validity. We derive a number of exact formulas for the Dirac singular values, including Banks-Casher-type relations for the diquark condensate, Smilga-Stern-type relations for the slope of the singular value density, and Leutwyler-Smilga-type sum rules for the inverse singular values. We construct random matrix theories and determine the form of the microscopic spectral correlation functions of the singular values for all nonzero quark densities. We also derive a rigorous index theorem for non-Hermitian Dirac operators. Our results can in principle be tested in lattice simulations.
Submitted 14 December, 2011; v1 submitted 26 October, 2011;
originally announced October 2011.
-
Lattice QCD Applications on QPACE
Authors:
Y. Nakamura,
A. Nobile,
D. Pleiter,
H. Simma,
T. Streuer,
T. Wettig,
F. Winter
Abstract:
QPACE is a novel massively parallel architecture optimized for lattice QCD simulations. A single QPACE node is based on the IBM PowerXCell 8i processor. The nodes are interconnected by a custom 3-dimensional torus network implemented on an FPGA. The compute power of the processor is provided by 8 Synergistic Processing Units. Making efficient use of these accelerator cores in scientific applications is challenging. In this paper we describe our strategies for porting applications to the QPACE architecture and report on performance numbers.
Submitted 7 March, 2011;
originally announced March 2011.
-
The QCD sign problem and dynamical simulations of random matrices
Authors:
Jacques Bloch,
Tilo Wettig
Abstract:
At nonzero quark chemical potential dynamical lattice simulations of QCD are hindered by the sign problem caused by the complex fermion determinant. The severity of the sign problem can be assessed by the average phase of the fermion determinant. In an earlier paper we derived a formula for the microscopic limit of the average phase for general topology using chiral random matrix theory. In the current paper we present an alternative derivation of the same quantity, leading to a simpler expression which is also calculable for finite-sized matrices, away from the microscopic limit. We explicitly prove the equivalence of the old and new results in the microscopic limit. The results for finite-sized matrices illustrate the convergence towards the microscopic limit. We compare the analytical results with dynamical random matrix simulations, where various reweighting methods are used to circumvent the sign problem. We discuss the pros and cons of these reweighting methods.
Submitted 26 May, 2011; v1 submitted 17 February, 2011;
originally announced February 2011.
-
Geometry dependence of RMT-based methods to extract the low-energy constants Sigma and F
Authors:
Christoph Lehner,
Jacques Bloch,
Shoji Hashimoto,
Tilo Wettig
Abstract:
The lowest-order low-energy constants $Σ$ and $F$ of chiral perturbation theory can be extracted from lattice data using methods based on the equivalence of random matrix theory (RMT) and QCD in the epsilon regime. We discuss how the choice of the lattice geometry affects such methods. In particular, we show how to minimize systematic deviations from RMT by an optimal choice of the lattice geometry in the case of two light quark flavors. We illustrate our findings by determining $Σ$ and $F$ from lattice configurations with two dynamical overlap fermions generated by JLQCD, using two different lattice geometries.
Submitted 21 June, 2011; v1 submitted 28 January, 2011;
originally announced January 2011.
-
Exact results for two-color QCD at low and high density
Authors:
Takuya Kanazawa,
Tilo Wettig,
Naoki Yamamoto
Abstract:
We discuss a random matrix theory that was originally constructed to describe two-color QCD at low density in the phase with a nonzero chiral condensate. With a particular choice of a parameter, the same random matrix theory also describes the high-density phase of two-color QCD. In this phase a BCS superfluid of diquark pairs is formed, and the pattern of chiral symmetry breaking is very different from that at low density. Analytical results for the spectral density obtained from this random matrix theory allow for the extraction of the BCS gap from lattice data.
Submitted 3 January, 2011;
originally announced January 2011.
-
Random matrix theory of unquenched two-colour QCD with nonzero chemical potential
Authors:
G. Akemann,
T. Kanazawa,
M. J. Phillips,
T. Wettig
Abstract:
We solve a random two-matrix model with two real asymmetric matrices whose primary purpose is to describe certain aspects of quantum chromodynamics with two colours and dynamical fermions at nonzero quark chemical potential mu. In this symmetry class the determinant of the Dirac operator is real but not necessarily positive. Despite this sign problem the unquenched matrix model remains completely solvable and provides detailed predictions for the Dirac operator spectrum in two different physical scenarios/limits: (i) the epsilon-regime of chiral perturbation theory at small mu, where mu^2 multiplied by the volume remains fixed in the infinite-volume limit and (ii) the high-density regime where a BCS gap is formed and mu is unscaled. We give explicit examples for the complex, real, and imaginary eigenvalue densities including Nf=2 non-degenerate flavours. Whilst the limit of two degenerate masses has no sign problem and can be tested with standard lattice techniques, we analyse the severity of the sign problem for non-degenerate masses as a function of the mass split and of mu.
On the mathematical side our new results include an analytical formula for the spectral density of real Wishart eigenvalues in the limit (i) of weak non-Hermiticity, thus completing the previous solution of the corresponding quenched model of two real asymmetric Wishart matrices.
Submitted 16 March, 2011; v1 submitted 20 December, 2010;
originally announced December 2010.