Skip to main content

Showing 1–14 of 14 results for author: Katrutsa, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.00154  [pdf, other

    cs.DB

    EEvA: Fast Expert-Based Algorithms for Buffer Page Replacement

    Authors: Alexander Demin, Yuriy Dorn, Aleksandr Katrutsa, Daniil Kazantsev, Ilgam Latypov, Yulia Maximlyuk, Denis Ponomaryov

    Abstract: Optimal page replacement is an important problem in efficient buffer management. The range of replacement strategies known in the literature varies from simple but efficient FIFO-based algorithms to more accurate but potentially costly methods tailored to specific data access patterns. The principal issue in adopting a pattern-specific replacement logic in a DB buffer manager is to guarantee non-d… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

  2. arXiv:2402.07062  [pdf, other

    cs.LG math.OC stat.ML

    Fast UCB-type algorithms for stochastic bandits with heavy and super heavy symmetric noise

    Authors: Yuriy Dorn, Aleksandr Katrutsa, Ilgam Latypov, Andrey Pudovikov

    Abstract: In this study, we propose a new method for constructing UCB-type algorithms for stochastic multi-armed bandits based on general convex optimization methods with an inexact oracle. We derive the regret bounds corresponding to the convergence rates of the optimization methods. We propose a new algorithm Clipped-SGD-UCB and show, both theoretically and empirically, that in the case of symmetric noise… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

  3. arXiv:2310.01595  [pdf, other

    cs.RO cs.AI

    Memory-efficient particle filter recurrent neural network for object localization

    Authors: Roman Korkin, Ivan Oseledets, Aleksandr Katrutsa

    Abstract: This study proposes a novel memory-efficient recurrent neural network (RNN) architecture specified to solve the object localization problem. This problem is to recover the object states along with its movement in a noisy environment. We take the idea of the classical particle filter and combine it with GRU RNN architecture. The key feature of the resulting memory-efficient particle filter RNN mode… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  4. arXiv:2303.07897  [pdf, other

    cs.RO cs.AI

    Multiparticle Kalman filter for object localization in symmetric environments

    Authors: Roman Korkin, Ivan Oseledets, Aleksandr Katrutsa

    Abstract: This study considers the object localization problem and proposes a novel multiparticle Kalman filter to solve it in complex and symmetric environments. Two well-known classes of filtering algorithms to solve the localization problem are Kalman filter-based methods and particle filter-based methods. We consider these classes, demonstrate their complementary properties, and propose a novel filterin… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

  5. arXiv:2303.04744  [pdf, other

    cs.IR cs.CR cs.HC cs.LG

    Federated Privacy-preserving Collaborative Filtering for On-Device Next App Prediction

    Authors: Albert Sayapin, Gleb Balitskiy, Daniel Bershatsky, Aleksandr Katrutsa, Evgeny Frolov, Alexey Frolov, Ivan Oseledets, Vitaliy Kharin

    Abstract: In this study, we propose a novel SeqMF model to solve the problem of predicting the next app launch during mobile device usage. Although this problem can be represented as a classical collaborative filtering problem, it requires proper modification since the data are sequential, the user feedback is distributed among devices and the transmission of users' data to aggregate common patterns must be… ▽ More

    Submitted 5 February, 2023; originally announced March 2023.

  6. arXiv:2209.14937  [pdf, other

    math.OC cs.LG math.NA

    NAG-GS: Semi-Implicit, Accelerated and Robust Stochastic Optimizer

    Authors: Valentin Leplat, Daniil Merkulov, Aleksandr Katrutsa, Daniel Bershatsky, Olga Tsymboi, Ivan Oseledets

    Abstract: Classical machine learning models such as deep neural networks are usually trained by using Stochastic Gradient Descent-based (SGD) algorithms. The classical SGD can be interpreted as a discretization of the stochastic gradient flow. In this paper we propose a novel, robust and accelerated stochastic optimizer that relies on two key elements: (1) an accelerated Nesterov-like Stochastic Differentia… ▽ More

    Submitted 30 September, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: We study Nesterov acceleration for the Stochastic Differential Equation

  7. Extension of Dynamic Mode Decomposition for dynamic systems with incomplete information based on t-model of optimal prediction

    Authors: Aleksandr Katrutsa, Sergey Utyuzhnikov, Ivan Oseledets

    Abstract: The Dynamic Mode Decomposition has proved to be a very efficient technique to study dynamic data. This is entirely a data-driven approach that extracts all necessary information from data snapshots which are commonly supposed to be sampled from measurement. The application of this approach becomes problematic if the available data is incomplete because some dimensions of smaller scale either missi… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

  8. arXiv:2202.10435  [pdf, ps, other

    cs.LG cs.AI

    Survey on Large Scale Neural Network Training

    Authors: Julia Gusak, Daria Cherniuk, Alena Shilova, Alexander Katrutsa, Daniel Bershatsky, Xunyi Zhao, Lionel Eyraud-Dubois, Oleg Shlyazhko, Denis Dimitrov, Ivan Oseledets, Olivier Beaumont

    Abstract: Modern Deep Neural Networks (DNNs) require significant memory to store weight, activations, and other intermediate tensors during training. Hence, many models do not fit one GPU device or can be trained using only a small per-GPU batch size. This survey provides a systematic overview of the approaches that enable more efficient DNNs training. We analyze techniques that save memory and make good us… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.

  9. arXiv:2201.13195  [pdf, other

    cs.LG cs.AI stat.ML

    Memory-Efficient Backpropagation through Large Linear Layers

    Authors: Daniel Bershatsky, Aleksandr Mikhalev, Alexandr Katrutsa, Julia Gusak, Daniil Merkulov, Ivan Oseledets

    Abstract: In modern neural networks like Transformers, linear layers require significant memory to store activations during backward pass. This study proposes a memory reduction approach to perform backpropagation through linear layers. Since the gradients of linear layers are computed by matrix multiplications, we consider methods for randomized matrix multiplications and demonstrate that they require less… ▽ More

    Submitted 2 February, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: Submitted

  10. arXiv:2103.08561  [pdf, other

    cs.LG

    Meta-Solver for Neural Ordinary Differential Equations

    Authors: Julia Gusak, Alexandr Katrutsa, Talgat Daulbaev, Andrzej Cichocki, Ivan Oseledets

    Abstract: A conventional approach to train neural ordinary differential equations (ODEs) is to fix an ODE solver and then learn the neural network's weights to optimize a target loss function. However, such an approach is tailored for a specific discretization method and its properties, which may not be optimal for the selected application and yield the overfitting to the given solver. In our paper, we inve… ▽ More

    Submitted 15 March, 2021; originally announced March 2021.

  11. arXiv:2007.06937  [pdf, other

    math.OC cs.LG

    Follow the bisector: a simple method for multi-objective optimization

    Authors: Alexandr Katrutsa, Daniil Merkulov, Nurislam Tursynbek, Ivan Oseledets

    Abstract: This study presents a novel Equiangular Direction Method (EDM) to solve a multi-objective optimization problem. We consider optimization problems, where multiple differentiable losses have to be minimized. The presented method computes descent direction in every iteration to guarantee equal relative decrease of objective functions. This descent direction is based on the normalized gradients of the… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

  12. arXiv:2004.09222  [pdf, other

    cs.LG stat.ML

    Towards Understanding Normalization in Neural ODEs

    Authors: Julia Gusak, Larisa Markeeva, Talgat Daulbaev, Alexandr Katrutsa, Andrzej Cichocki, Ivan Oseledets

    Abstract: Normalization is an important and vastly investigated technique in deep learning. However, its role for Ordinary Differential Equation based networks (neural ODEs) is still poorly understood. This paper investigates how different normalization techniques affect the performance of neural ODEs. Particularly, we show that it is possible to achieve 93% accuracy in the CIFAR-10 classification task, and… ▽ More

    Submitted 27 April, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

  13. arXiv:2003.05271  [pdf, other

    cs.NE math.NA stat.ML

    Interpolation Technique to Speed Up Gradients Propagation in Neural ODEs

    Authors: Talgat Daulbaev, Alexandr Katrutsa, Larisa Markeeva, Julia Gusak, Andrzej Cichocki, Ivan Oseledets

    Abstract: We propose a simple interpolation-based method for the efficient approximation of gradients in neural ODE models. We compare it with the reverse dynamic method (known in the literature as "adjoint method") to train neural ODEs on classification, density estimation, and inference approximation tasks. We also propose a theoretical justification of our approach using logarithmic norm formalism. As a… ▽ More

    Submitted 30 October, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

  14. arXiv:1502.07167  [pdf, ps, other

    cs.DS math.NA

    Linear complexity SimRank computation based on the iterative diagonal estimation

    Authors: I. V. Oseledets, G. V. Ovchinnikov, A. M. Katrutsa

    Abstract: This paper presents a deterministic linear time complexity IDE-SimRank method to approximately compute SimRank with proved error bound. SimRank is a well-known similarity measure between graph vertices which relies on graph topology only and is built on intuition that "two objects are similar if they are related to similar objects". The fixed point equation for direct SimRank computation is the di… ▽ More

    Submitted 25 February, 2015; originally announced February 2015.