
Showing 1–2 of 2 results for author: Grisenthwaite, R

  1. arXiv:2209.05433  [pdf, other]

    cs.LG

    FP8 Formats for Deep Learning

    Authors: Paulius Micikevicius, Dusan Stosic, Neil Burgess, Marius Cornea, Pradeep Dubey, Richard Grisenthwaite, Sangwon Ha, Alexander Heinecke, Patrick Judd, John Kamalu, Naveen Mellempudi, Stuart Oberman, Mohammad Shoeybi, Michael Siu, Hao Wu

    Abstract: FP8 is a natural progression for accelerating deep learning training and inference beyond the 16-bit formats common in modern processors. In this paper we propose an 8-bit floating point (FP8) binary interchange format consisting of two encodings: E4M3 (4-bit exponent and 3-bit mantissa) and E5M2 (5-bit exponent and 2-bit mantissa). While E5M2 follows IEEE 754 conventions for representation of special…

    Submitted 29 September, 2022; v1 submitted 12 September, 2022; originally announced September 2022.
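
    The field layout named in the FP8 abstract above can be illustrated with a minimal Python sketch. It assumes the usual sign/exponent/mantissa ordering with the sign in the top bit. The function names (split_fp8, split_e4m3, split_e5m2, decode_e5m2) are hypothetical and do not come from the paper. The E5M2 decoder further assumes plain IEEE 754 conventions with an exponent bias of 15; E4M3 values are not decoded because the truncated abstract does not state how that encoding treats special values.

```python
def split_fp8(byte, exp_bits, man_bits):
    """Split an 8-bit pattern into (sign, exponent, mantissa) bit fields."""
    if not 0 <= byte <= 0xFF:
        raise ValueError("expected an 8-bit value")
    if 1 + exp_bits + man_bits != 8:
        raise ValueError("field widths must total 8 bits")
    sign = byte >> 7
    exponent = (byte >> man_bits) & ((1 << exp_bits) - 1)
    mantissa = byte & ((1 << man_bits) - 1)
    return sign, exponent, mantissa


def split_e4m3(byte):
    """E4M3: 1 sign bit, 4 exponent bits, 3 mantissa bits."""
    return split_fp8(byte, 4, 3)


def split_e5m2(byte):
    """E5M2: 1 sign bit, 5 exponent bits, 2 mantissa bits."""
    return split_fp8(byte, 5, 2)


def decode_e5m2(byte):
    """Decode an E5M2 pattern to a Python float.

    Assumption: IEEE 754 conventions with bias 15, so an all-ones
    exponent encodes infinity or NaN and a zero exponent encodes
    zero or a subnormal.
    """
    sign, exponent, mantissa = split_e5m2(byte)
    s = -1.0 if sign else 1.0
    if exponent == 0b11111:
        return s * float("inf") if mantissa == 0 else float("nan")
    if exponent == 0:
        return s * (mantissa / 4) * 2.0 ** -14
    return s * (1 + mantissa / 4) * 2.0 ** (exponent - 15)
```

    For example, the same byte 0x3C splits differently under the two encodings, which is why an FP8 value is only meaningful together with its encoding.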

  2. arXiv:2203.00642  [pdf, other]

    cs.AR cs.OS cs.PL

    Relaxed virtual memory in Armv8-A (extended version)

    Authors: Ben Simner, Alasdair Armstrong, Jean Pichon-Pharabod, Christopher Pulte, Richard Grisenthwaite, Peter Sewell

    Abstract: Virtual memory is an essential mechanism for enforcing security boundaries, but its relaxed-memory concurrency semantics has not previously been investigated in detail. The concurrent systems code managing virtual memory has been left on an entirely informal basis, and OS and hypervisor verification has had to make major simplifying assumptions. We explore the design space for relaxed virtual me…

    Submitted 1 March, 2022; originally announced March 2022.

    ACM Class: C.1.2; D.3.1; F.3.2