Search | arXiv e-print repository

arXiv:1903.10584 [pdf, other]

doi 10.1145/3316279.3316282

Performance-Efficiency Trade-off of Low-Precision Numerical Formats in Deep Neural Networks

Authors: Zachariah Carmichael, Hamed F. Langroudi, Char Khazanov, Jeffrey Lillie, John L. Gustafson, Dhireesha Kudithipudi

Abstract: Deep neural networks (DNNs) have been demonstrated as effective prognostic models across various domains, e.g. natural language processing, computer vision, and genomics. However, modern-day DNNs demand high compute and memory storage for executing any reasonably complex task. To optimize the inference time and alleviate the power consumption of these networks, DNN accelerators with low-precision… ▽ More Deep neural networks (DNNs) have been demonstrated as effective prognostic models across various domains, e.g. natural language processing, computer vision, and genomics. However, modern-day DNNs demand high compute and memory storage for executing any reasonably complex task. To optimize the inference time and alleviate the power consumption of these networks, DNN accelerators with low-precision representations of data and DNN parameters are being actively studied. An interesting research question is in how low-precision networks can be ported to edge-devices with similar performance as high-precision networks. In this work, we employ the fixed-point, floating point, and posit numerical formats at $\leq$8-bit precision within a DNN accelerator, Deep Positron, with exact multiply-and-accumulate (EMAC) units for inference. A unified analysis quantifies the trade-offs between overall network efficiency and performance across five classification tasks. Our results indicate that posits are a natural fit for DNN inference, outperforming at $\leq$8-bit precision, and can be realized with competitive resource requirements relative to those of floating point. △ Less

Submitted 25 March, 2019; originally announced March 2019.

Comments: 9 pages, Proceedings of the ACM Conference for Next Generation Arithmetic (CoNGA) 2019

arXiv:1812.01762 [pdf, other]

Deep Positron: A Deep Neural Network Using the Posit Number System

Authors: Zachariah Carmichael, Hamed F. Langroudi, Char Khazanov, Jeffrey Lillie, John L. Gustafson, Dhireesha Kudithipudi

Abstract: The recent surge of interest in Deep Neural Networks (DNNs) has led to increasingly complex networks that tax computational and memory resources. Many DNNs presently use 16-bit or 32-bit floating point operations. Significant performance and power gains can be obtained when DNN accelerators support low-precision numerical formats. Despite considerable research, there is still a knowledge gap on ho… ▽ More The recent surge of interest in Deep Neural Networks (DNNs) has led to increasingly complex networks that tax computational and memory resources. Many DNNs presently use 16-bit or 32-bit floating point operations. Significant performance and power gains can be obtained when DNN accelerators support low-precision numerical formats. Despite considerable research, there is still a knowledge gap on how low-precision operations can be realized for both DNN training and inference. In this work, we propose a DNN architecture, Deep Positron, with posit numerical format operating successfully at $\leq$8 bits for inference. We propose a precision-adaptable FPGA soft core for exact multiply-and-accumulate for uniform comparison across three numerical formats, fixed, floating-point and posit. Preliminary results demonstrate that 8-bit posit has better accuracy than 8-bit fixed or floating-point for three different low-dimensional datasets. Moreover, the accuracy is comparable to 32-bit floating-point on a Xilinx Virtex-7 FPGA device. The trade-offs between DNN performance and hardware resources, i.e. latency, power, and resource utilization, show that posit outperforms in accuracy and latency at 8-bit and below. △ Less

Submitted 18 January, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

Comments: 6 pages, Design, Automation and Test in Europe 2019

arXiv:1709.09108 [pdf]

Tensors Come of Age: Why the AI Revolution will help HPC

Authors: John L. Gustafson, Lenore M. Mullin

Abstract: This article discusses how the automation of tensor algorithms, based on A Mathematics of Arrays and Psi Calculus, and a new way to represent numbers, Unum Arithmetic, enables mechanically provable, scalable, portable, and more numerically accurate software. This article discusses how the automation of tensor algorithms, based on A Mathematics of Arrays and Psi Calculus, and a new way to represent numbers, Unum Arithmetic, enables mechanically provable, scalable, portable, and more numerically accurate software. △ Less

Submitted 26 September, 2017; originally announced September 2017.

Comments: To be published in this years 30th anniversary edition of HPCwire

arXiv:math/9310223 [pdf, ps, other]

Asymptotic approximations for symmetric elliptic integrals

Authors: Bille C. Carlson, John L. Gustafson

Abstract: Symmetric elliptic integrals, which have been used as replacements for Legendre's integrals in recent integral tables and computer codes, are homogeneous functions of three or four variables. When some of the variables are much larger than the others, asymptotic approximations with error bounds are presented. In most cases they are derived from a uniform approximation to the integrand. As an app… ▽ More Symmetric elliptic integrals, which have been used as replacements for Legendre's integrals in recent integral tables and computer codes, are homogeneous functions of three or four variables. When some of the variables are much larger than the others, asymptotic approximations with error bounds are presented. In most cases they are derived from a uniform approximation to the integrand. As an application the symmetric elliptic integrals of the first, second, and third kinds are proved to be linearly independent with respect to coefficients that are rational functions. △ Less

Submitted 6 October, 1993; originally announced October 1993.

Report number: OP-SF 7 Oct 1993

Showing 1–4 of 4 results for author: Gustafson, J L