Showing 1–2 of 2 results for author: Dick, H

Search v0.5.6 released 2020-02-24

arXiv:2306.01098

hep-lat

doi 10.1016/j.cpc.2024.109164

SIMULATeQCD: A simple multi-GPU lattice code for QCD calculations

Authors: Lukas Mazur, Dennis Bollweg, David A. Clarke, Luis Altenkort, Olaf Kaczmarek, Rasmus Larsen, Hai-Tao Shu, Jishnu Goswami, Philipp Scior, Hauke Sandmeyer, Marius Neumann, Henrik Dick, Sajid Ali, Jangho Kim, Christian Schmidt, Peter Petreczky, Swagato Mukherjee

Abstract: The rise of exascale supercomputers has fueled competition among GPU vendors, driving lattice QCD developers to write code that supports multiple APIs. Moreover, new developments in algorithms and physics research require frequent updates to existing software. These challenges have to be balanced against constantly changing personnel. At the same time, there is a wide range of applications for HISQ fermions in QCD studies. This situation encourages the development of software featuring a HISQ action that is flexible, high-performing, open source, easy to use, and easy to adapt. In this technical paper, we explain the design strategy, provide implementation details, list available algorithms and modules, and show key performance indicators for SIMULATeQCD, a simple multi-GPU lattice code for large-scale QCD calculations, mainly developed and used by the HotQCD collaboration. The code is publicly available on GitHub.

Submitted 3 April, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

Comments: 17 pages, 7 figures

Journal ref: Comp. Phys. Commun. 300 (2024) 109164

arXiv:1903.08066

cs.CV cs.AI cs.LG

Trained Quantization Thresholds for Accurate and Efficient Fixed-Point Inference of Deep Neural Networks

Authors: Sambhav R. Jain, Albert Gural, Michael Wu, Chris H. Dick

Abstract: We propose a method of training quantization thresholds (TQT) for uniform symmetric quantizers using standard backpropagation and gradient descent. Contrary to prior work, we show that a careful analysis of the straight-through estimator for threshold gradients allows for a natural range-precision trade-off leading to better optima. Our quantizers are constrained to use power-of-2 scale-factors and per-tensor scaling of weights and activations to make it amenable for hardware implementations. We present analytical support for the general robustness of our methods and empirically validate them on various CNNs for ImageNet classification. We are able to achieve near-floating-point accuracy on traditionally difficult networks such as MobileNets with less than 5 epochs of quantized (8-bit) retraining. Finally, we present Graffitist, a framework that enables automatic quantization of TensorFlow graphs for TQT (available at https://github.com/Xilinx/graffitist).

Submitted 28 February, 2020; v1 submitted 19 March, 2019; originally announced March 2019.

Comments: Link to Conference (Oral & Poster) Schedule - https://mlsys.org/Conferences/2020/ScheduleMultitrack?event=1431

Journal ref: Proceedings of the 3rd Machine Learning and Systems (MLSys) Conference, Austin, TX, USA, 2020
