Showing 1–3 of 3 results for author: Lindquist, N

  1. arXiv:2011.01850  [pdf, ps, other]

    math.NA cs.MS

    Improving the Performance of the GMRES Method using Mixed-Precision Techniques

    Authors: Neil Lindquist, Piotr Luszczek, Jack Dongarra

    Abstract: The GMRES method is used to solve sparse, non-symmetric systems of linear equations arising from many scientific applications. The solver performance within a single node is memory bound, due to the low arithmetic intensity of its computational kernels. To reduce the amount of data movement, and thus, to improve performance, we investigated the effect of using a mix of single and double precision…

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: 16 pages. In the 17th Smoky Mountains Computational Sciences and Engineering Conference

  2. arXiv:2007.06674  [pdf, other]

    cs.MS math.NA

    A Survey of Numerical Methods Utilizing Mixed Precision Arithmetic

    Authors: Ahmad Abdelfattah, Hartwig Anzt, Erik G. Boman, Erin Carson, Terry Cojean, Jack Dongarra, Mark Gates, Thomas Grützmacher, Nicholas J. Higham, Sherry Li, Neil Lindquist, Yang Liu, Jennifer Loe, Piotr Luszczek, Pratik Nayak, Sri Pranesh, Siva Rajamanickam, Tobias Ribizel, Barry Smith, Kasia Swirydowicz, Stephen Thomas, Stanimire Tomov, Yaohung M. Tsai, Ichitaro Yamazaki, Ulrike Meier Yang

    Abstract: Within the past years, hardware vendors have started designing low precision special function units in response to the demand of the Machine Learning community and their demand for high compute power in low precision formats. Also the server-line products are increasingly featuring low-precision special function units, such as the NVIDIA tensor cores in ORNL's Summit supercomputer providing more t…

    Submitted 13 July, 2020; originally announced July 2020.

    Comments: Technical report as a part of the Exascale computing project (ECP)

    ACM Class: G.1.3; G.4

  3. Replicated Computational Results (RCR) Report for "Code Generation for Generally Mapped Finite Elements"

    Authors: Neil Lindquist

    Abstract: "Code Generation for Generally Mapped Finite Elements" includes performance results for the finite element methods discussed in that manuscript. The authors provided a Zenodo archive with the Firedrake components and dependencies used, as well as the scripts that generated the results. The software was installed on two similar platforms; then, new results were gathered and compared to the original…

    Submitted 1 December, 2019; originally announced December 2019.

    Comments: 7 pages, 7 figures. Submitted to ACM Transactions on Mathematical Software

    MSC Class: 10002944.10011123.10011676; 10002950.10003705.10011686; 10002950.10003705.10003707

    ACM Class: G.1.8; G.4

    Journal ref: ACM Transactions on Mathematical Software (TOMS): Volume 45 Issue 4, December 2019