
Showing 1–3 of 3 results for author: Pegolotti, T

Searching in archive cs.
  1. arXiv:2307.03738  [pdf, other]

    cs.LG cs.CL cs.PF

    QIGen: Generating Efficient Kernels for Quantized Inference on Large Language Models

    Authors: Tommaso Pegolotti, Elias Frantar, Dan Alistarh, Markus Püschel

    Abstract: We present ongoing work on a new automatic code generation approach for supporting quantized generative inference on LLMs such as LLaMA or OPT on off-the-shelf CPUs. Our approach is informed by the target architecture and a performance model, including both hardware characteristics and method-specific accuracy constraints. Results on CPU-based inference for LLaMA models show that our approach can…

    Submitted 7 July, 2023; originally announced July 2023.

  2. arXiv:2302.04852  [pdf, other]

    cs.LG

    SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks

    Authors: Mahdi Nikdan, Tommaso Pegolotti, Eugenia Iofinova, Eldar Kurtic, Dan Alistarh

    Abstract: We provide a new efficient version of the backpropagation algorithm, specialized to the case where the weights of the neural network being trained are sparse. Our algorithm is general, as it applies to arbitrary (unstructured) sparsity and common layer types (e.g., convolutional or linear). We provide a fast vectorized implementation on commodity CPUs, and show that it can yield speedups in end-to…

    Submitted 9 February, 2023; originally announced February 2023.

  3. arXiv:2211.13706  [pdf, other]

    cs.DM

    Fast Möbius and Zeta Transforms

    Authors: Tommaso Pegolotti, Bastian Seifert, Markus Püschel

    Abstract: Möbius inversion of functions on partially ordered sets (posets) $\mathcal{P}$ is a classical tool in combinatorics. For finite posets it consists of two, mutually inverse, linear transformations called zeta and Möbius transform, respectively. In this paper we provide novel fast algorithms for both that require $O(nk)$ time and space, where $n = |\mathcal{P}|$ and $k$ is the width (length of longe…

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: 16 pages, 7 figures, submitted for review

    MSC Class: 06A06; 05C50; 15A04; 15B36