Skip to main content

Showing 1–5 of 5 results for author: Kapushev, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2304.11062  [pdf, other

    cs.CL cs.AI cs.LG

    Scaling Transformer to 1M tokens and beyond with RMT

    Authors: Aydar Bulatov, Yuri Kuratov, Yermek Kapushev, Mikhail S. Burtsev

    Abstract: A major limitation for the broader scope of problems solvable by transformers is the quadratic scaling of computational complexity with input size. In this study, we investigate the recurrent memory augmentation of pre-trained transformer models to extend input context length while linearly scaling compute. Our approach demonstrates the capability to store information in memory for sequences of up… ▽ More

    Submitted 6 February, 2024; v1 submitted 19 April, 2023; originally announced April 2023.

  2. arXiv:2101.05239  [pdf, other

    cs.LG stat.ML

    Denoising Score Matching with Random Fourier Features

    Authors: Tsimboy Olga, Yermek Kapushev, Evgeny Burnaev, Ivan Oseledets

    Abstract: The density estimation is one of the core problems in statistics. Despite this, existing techniques like maximum likelihood estimation are computationally inefficient due to the intractability of the normalizing constant. For this reason an interest to score matching has increased being independent on the normalizing constant. However, such estimator is consistent only for distributions with the f… ▽ More

    Submitted 13 January, 2021; originally announced January 2021.

  3. arXiv:2011.00594  [pdf, other

    cs.RO

    Random Fourier Features based SLAM

    Authors: Yermek Kapushev, Anastasia Kishkun, Gonzalo Ferrer, Evgeny Burnaev

    Abstract: This work is dedicated to simultaneous continuous-time trajectory estimation and map** based on Gaussian Processes (GP). State-of-the-art GP-based models for Simultaneous Localization and Map** (SLAM) are computationally efficient but can only be used with a restricted class of kernel functions. This paper provides the algorithm based on GP with Random Fourier Features (RFF) approximation for… ▽ More

    Submitted 6 September, 2021; v1 submitted 1 November, 2020; originally announced November 2020.

  4. arXiv:1912.05179  [pdf, other

    math.NA cs.LG stat.ML

    Tensor Completion via Gaussian Process Based Initialization

    Authors: Yermek Kapushev, Ivan Oseledets, Evgeny Burnaev

    Abstract: In this paper, we consider the tensor completion problem representing the solution in the tensor train (TT) format. It is assumed that tensor is high-dimensional, and tensor values are generated by an unknown smooth function. The assumption allows us to develop an efficient initialization scheme based on Gaussian Process Regression and TT-cross approximation technique. The proposed approach can be… ▽ More

    Submitted 26 August, 2020; v1 submitted 11 December, 2019; originally announced December 2019.

    MSC Class: 68W25; 65F99; 60G15

  5. arXiv:1802.03832  [pdf, other

    cs.LG stat.ML

    Quadrature-based features for kernel approximation

    Authors: Marina Munkhoeva, Yermek Kapushev, Evgeny Burnaev, Ivan Oseledets

    Abstract: We consider the problem of improving kernel approximation via randomized feature maps. These maps arise as Monte Carlo approximation to integral representations of kernel functions and scale up kernel methods for larger datasets. Based on an efficient numerical integration technique, we propose a unifying approach that reinterprets the previous random features methods and extends to better estimat… ▽ More

    Submitted 29 October, 2018; v1 submitted 11 February, 2018; originally announced February 2018.

    Comments: Accepted to NIPS 2018; 9 pages, 3 figures, Appendix: 4 pages, 2 figures