Skip to main content

Showing 1–1 of 1 results for author: Tsarova, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.16367  [pdf, other

    cs.LG cs.AI cs.CL

    TQCompressor: improving tensor decomposition methods in neural networks via permutations

    Authors: V. Abronin, A. Naumov, D. Mazur, D. Bystrov, K. Tsarova, Ar. Melnikov, I. Oseledets, S. Dolgov, R. Brasher, M. Perelshtein

    Abstract: We introduce TQCompressor, a novel method for neural network model compression with improved tensor decompositions. We explore the challenges posed by the computational and storage demands of pre-trained language models in NLP tasks and propose a permutation-based enhancement to Kronecker decomposition. This enhancement makes it possible to reduce loss in model expressivity which is usually associ… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.