Showing 1–5 of 5 results for author: Smith, T M

Searching in archive cs.
  1. arXiv:2306.05810  [pdf, other]

    cs.LG

    Explaining Reinforcement Learning with Shapley Values

    Authors: Daniel Beechey, Thomas M. S. Smith, Özgür Şimşek

    Abstract: For reinforcement learning systems to be widely adopted, their users must understand and trust them. We present a theoretical analysis of explaining reinforcement learning using Shapley values, following a principled approach from game theory for identifying the contribution of individual players to the outcome of a cooperative game. We call this general framework Shapley Values for Explaining Rei…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 12 pages, 9 figures. Accepted at ICML 2023

  2. arXiv:1904.05717  [pdf, ps, other]

    cs.MS

    The MOMMS Family of Matrix Multiplication Algorithms

    Authors: Tyler M. Smith, Robert A. van de Geijn

    Abstract: As the ratio between the rate of computation and rate with which data can be retrieved from various layers of memory continues to deteriorate, a question arises: Will the current best algorithms for computing matrix-matrix multiplication on future CPUs continue to be (near) optimal? This paper provides compelling analytical and empirical evidence that the answer is "no". The analytical results gui…

    Submitted 11 April, 2019; originally announced April 2019.

  3. arXiv:1702.02017  [pdf, ps, other]

    cs.CC

    A Tight I/O Lower Bound for Matrix Multiplication

    Authors: Tyler Michael Smith, Bradley Lowery, Julien Langou, Robert A. van de Geijn

    Abstract: A tight lower bound for required I/O when computing an ordinary matrix-matrix multiplication on a processor with two layers of memory is established. Prior work obtained weaker lower bounds by reasoning about the number of segments needed to perform $C:=AB$, for distinct matrices $A$, $B$, and $C$, where each segment is a series of operations involving $M$ reads and writes to and from fast memory,…

    Submitted 6 February, 2019; v1 submitted 3 February, 2017; originally announced February 2017.

  4. arXiv:1611.08035  [pdf, other]

    cs.MS

    Automating the Last-Mile for High Performance Dense Linear Algebra

    Authors: Richard Michael Veras, Tze Meng Low, Tyler Michael Smith, Robert van de Geijn, Franz Franchetti

    Abstract: High performance dense linear algebra (DLA) libraries often rely on a general matrix multiply (Gemm) kernel that is implemented using assembly or with vector intrinsics. In particular, the real-valued Gemm kernels provide the overwhelming fraction of performance for the complex-valued Gemm kernels, along with the entire level-3 BLAS and many of the real and complex LAPACK routines. Thus, achieving…

    Submitted 28 April, 2017; v1 submitted 23 November, 2016; originally announced November 2016.

  5. arXiv:1605.01078  [pdf, other]

    cs.MS

    Implementing Strassen's Algorithm with BLIS

    Authors: Jianyu Huang, Tyler M. Smith, Greg M. Henry, Robert A. van de Geijn

    Abstract: We dispel with "street wisdom" regarding the practical implementation of Strassen's algorithm for matrix-matrix multiplication (DGEMM). Conventional wisdom: it is only practical for very large matrices. Our implementation is practical for small matrices. Conventional wisdom: the matrices being multiplied should be relatively square. Our implementation is practical for rank-k updates, where k is re…

    Submitted 3 May, 2016; originally announced May 2016.

    Report number: FLAME Working Note #79, The University of Texas at Austin, Department of Computer Sciences Technical Report TR-16-03